Build With Qwen 3, MCP, and a Free GPU
You can have them all in a single Notebook
I was curious: can we run local LLMs with Ollama, run MCP servers, connect them, and build intelligent apps, all in a single Notebook?
If this is possible, we can get the most out of free Kaggle Notebooks (or Colabs). For quantized models, the free GPU they provide is sufficient.
To run Ollama, you need access to the notebook terminal. Kaggle and Colab do give you a terminal, but they don’t allow background processes, so you can’t keep the Ollama server running.
It’s the same story with MCP servers: most of them need to run in the background.
But it’s not entirely out of reach.
In this post, I’ll show you how I start an Ollama server inside a Kaggle Notebook.
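The core trick is to spawn the server as a child process of the notebook kernel itself rather than from the terminal. Here’s a minimal sketch of that idea, assuming the Ollama binary is already installed in the environment (the log path and the short wait are arbitrary choices, not part of any official recipe):

```python
# Minimal sketch: launch "ollama serve" from a notebook cell without blocking it.
# Assumes the `ollama` binary is already installed and on PATH.
import subprocess
import time

# Send server output to a log file so the cell can return immediately.
log = open("/tmp/ollama.log", "w")
server = subprocess.Popen(
    ["ollama", "serve"],        # Ollama's HTTP API, on port 11434 by default
    stdout=log,
    stderr=subprocess.STDOUT,
)

time.sleep(5)                   # give the server a moment to come up
print("Ollama server running with PID", server.pid)
```

Because the process belongs to the notebook kernel, it keeps running for as long as the session does, which is enough for our purposes.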
Let’s run the Qwen 3 model. I like Qwen 3 for two reasons: first, it’s currently one of the most capable open-source thinking models, and second, it comes in a size for every device. From 0.6B to 235B parameters, we can pick the one that best suits our hardware.
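On the free Kaggle GPU (a T4 or P100 with around 16 GB of VRAM), a small or mid-sized quantized tag is a sensible starting point. The tag names below follow the Ollama model library and may change over time, so treat this as an illustration rather than a fixed recommendation:

```python
# Pull a quantized Qwen 3 model once the server is running.
# Tag names (qwen3:0.6b, qwen3:4b, qwen3:8b, ...) follow the Ollama model
# library; check the library page for the sizes currently published.
import subprocess

subprocess.run(["ollama", "pull", "qwen3:8b"], check=True)

# Quick smoke test: a one-shot prompt through the CLI.
subprocess.run(["ollama", "run", "qwen3:8b", "Say hello in one sentence."], check=True)
```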
Running Ollama in a Kaggle Notebook
I’m a fan of llama.cpp, not Ollama.