Build a Local AI Assistant with LLMs

Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.

Build a Local AI Assistant with LLMs

Instructor: JetBrains Academy team

Ask Coursera

6 modules

Gain insight into a topic and learn the fundamentals.

Intermediate level

Recommended experience

2 weeks to complete

at 10 hours a week

Flexible schedule

Learn at your own pace

6 modules

Gain insight into a topic and learn the fundamentals.

Intermediate level

Recommended experience

2 weeks to complete

at 10 hours a week

Flexible schedule

Learn at your own pace

What you'll learn

Build a local AI assistant using LLMs, vector search, and a browser UI.
Process PDFs into searchable chunks and generate embeddings for semantic retrieval.
Create FastAPI endpoints for indexing, querying, and OpenAI-compatible chat.
Generate grounded answers from your own documents with clear source attribution.

Skills you'll gain

Tools you'll learn

Details to know

Shareable certificate

Add to your LinkedIn profile

See how employees at top companies are mastering in-demand skills

Learn more about Coursera for Business

logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

There are 6 modules in this course

Build your own AI assistant that answers questions from your documents – entirely on your local machine. Assuming a basic acquaintance with Python, this course will teach you how to run a local LLM, turn PDFs into searchable chunks, generate embeddings, store them in a vector database, and connect retrieval and generation into a complete RAG (Retrieval-Augmented Generation) pipeline. You’ll create OpenAI-compatible and RAG endpoints with FastAPI, work with Ollama and Qdrant, and finish by building a browser-based interface for asking questions and reviewing sources.

This course stands out because everything is built locally, from end to end. Instead of relying on black-box cloud services, you will master every step of the system you build – from document processing and vector search to prompt construction and answer generation. You’ll learn by actually building, adding one piece at a time. With each module, you’ll unlock a new feature in your project., By the end, you will have a production-ready AI project you can run, customize, and share.

In this module, you’ll set up your local environment and make your first call to a large language model. You’ll learn what LLMs are, why local inference is crucial for privacy and experimentation, and how Docker helps you run the system reliably across different machines. You’ll explore the core architecture of the course project, understand the API layer and the role of HTTP and JSON, and write your first Python function that talks to the model. By the end of this module, you’ll have a working local LLM request flowing through your own API.

What's included

6 videos3 readings1 programming assignment

6 videosTotal 35 minutes

Introduction to Large Language Models7 minutes
Docker Basics9 minutes
The API Layer4 minutes
HTTP APIs and JSON5 minutes
Your First Local LLM Request7 minutes
Optional: PyCharm IDE Setup Tutorial for the Course Practice 2 minutes

3 readingsTotal 40 minutes

Materials_Week 15 minutes
Run the Stack Task20 minutes
Optional: Download PyCharm IDE for the Course Practice15 minutes

1 programming assignmentTotal 180 minutes

Practical Assignment_Week 1180 minutes

In this module, you’ll build and configure the API layer that powers the project. You’ll learn how FastAPI structures endpoints, request models, and validation, how the server communicates with Ollama, and why the OpenAI chat format has become the industry standard. You’ll also work with async requests, system prompts, multi-turn conversations, and automated testing using pytest and FastAPI’s TestClient. By the end of this module, you’ll understand how the chat API works from request to response and how to verify it with tests.

What's included

3 videos1 reading1 programming assignment

In this module, you’ll move from model calls to document processing. You’ll extract text from PDFs, see why PDF parsing is harder than it looks, and learn how to split long documents into retrieval-friendly chunks. You’ll compare chunk sizes, master overlap strategies, and build a document pipeline that attaches critical metadata like source file names and chunk positions. By the end of this module, you’ll be able to turn any PDF into structured chunks that are ready for indexing.

What's included

3 videos1 reading1 programming assignment

In this module, you’ll make text searchable by meaning. You’ll learn what embeddings are, how cosine similarity measures semantic closeness, and why vector databases differ from traditional databases. Then, you’ll connect these ideas in code by generating vectors with an embedding model, storing them in Qdrant, and implementing the indexing endpoint that ties document chunks to vector storage. By the end of this module, your documents will be embedded, stored, and ready for retrieval.

What's included

2 videos1 reading1 programming assignment

In this module, you’ll close the RAG loop. You’ll explore the two core phases of RAG – retrieval and generation – and see how a user’s question becomes a vector, how relevant chunks are selected, and how prompt structure guides the model to answer strictly from context. Then, you’ll implement the query endpoint, tune parameters like Top-K and score thresholds, and return answers with clear sources. By the end of this module, your system will answer questions accurately, grounded entirely in your indexed documents.

What's included

2 videos1 reading1 programming assignment

In this module, you’ll give your project a browser interface and make it demo-ready. You’ll learn how to add a lightweight web UI with HTMX and FastAPI, render templates and HTML fragments, and connect the form-based interface to the same RAG logic you built earlier. You’ll also see how to test the full user flow in the browser and turn your backend project into something easy to share with others. By the end of this module, you’ll have a complete local AI assistant with a usable web interface.

What's included

1 video2 readings1 assignment1 programming assignment

Instructor

JetBrains Academy team

JetBrains

5 Courses127,634 learners

Offered by

JetBrains

Why people choose Coursera for their career

Felipe M.

Learner since 2018

"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."

Jennifer J.

Learner since 2020

"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

Larry W.

Learner since 2021

"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Frequently asked questions

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

When you purchase a Certificate you get access to all course materials, including graded assignments. Upon completing the course, your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.