Retrieval-Augmented Generation (RAG) and LLMs
Following up on my recent post on Participation in GitHub Projects.
Below is a structured guide for a deep dive into Retrieval-Augmented Generation (RAG) and Large Language Models (LLMs). The focus is on foundational understanding, implementation practices, and hands-on experimentation.
1. Understanding RAG (Retrieval-Augmented Generation)
Key Concept:
RAG combines:
- Retrievers, which fetch relevant data from external sources such as knowledge bases or document collections.
- Generators, which produce coherent, context-aware outputs conditioned on the retrieved information.
This bridges the gap between retrieval-based and generative systems, making RAG suitable for knowledge-intensive tasks.
A. Key Papers to Read
- Lewis et al. (2020): Retrieval-augmented generation for knowledge-intensive NLP tasks.
- Introduces RAG, combining Dense Passage Retrieval (DPR) with BART to generate evidence-based responses. Paper
- Karpukhin et al. (2020): Dense Passage Retrieval for Open-Domain Question Answering.
- Focuses on the retriever component, optimizing retrieval through dense embeddings…