Build LLM applications that connect to your private data using LlamaIndex (formerly GPT Index), a comprehensive data framework for retrieval-augmented generation (RAG). This skill guides you through installing LlamaIndex, ingesting data from various sources, creating vector indices, and querying your data with LLMs.
LlamaIndex helps you augment LLMs with your own data by providing:
- **Data connectors** that ingest documents from files, APIs, and databases
- **Indices** (vector stores, structured indices) that organize your data for retrieval
- **Query engines** that retrieve relevant context and pass it to an LLM
Choose your installation approach based on your needs.
**Option A: Install core + specific integrations (recommended for production)**
```bash
pip install llama-index-core
pip install llama-index-llms-openai
pip install llama-index-embeddings-openai
```
**Option B: Install starter bundle (includes common integrations)**
```bash
pip install llama-index
```
**For OpenAI:**
```python
import os

os.environ["OPENAI_API_KEY"] = "your-api-key-here"

from llama_index.core import VectorStoreIndex, SimpleDirectoryReader

# Load every file in ./data and build an in-memory vector index
documents = SimpleDirectoryReader("./data").load_data()
index = VectorStoreIndex.from_documents(documents)
```
**For Llama 2 via Replicate (or other providers)** (requires the `llama-index-llms-replicate` and `llama-index-embeddings-huggingface` integration packages):
```python
import os

os.environ["REPLICATE_API_TOKEN"] = "your-replicate-token"

from llama_index.core import Settings, VectorStoreIndex, SimpleDirectoryReader
from llama_index.embeddings.huggingface import HuggingFaceEmbedding
from llama_index.llms.replicate import Replicate
from transformers import AutoTokenizer

# Use Llama 2 hosted on Replicate as the LLM
Settings.llm = Replicate(
    model="meta/llama-2-7b-chat:8e6975e5ed6174911a6ff3d60540dfd4844201974602551e10e9e87ab143d81e",
    temperature=0.01,
    additional_kwargs={"top_p": 1, "max_new_tokens": 300},
)

# Match the tokenizer to the model so token counting is accurate
Settings.tokenizer = AutoTokenizer.from_pretrained("NousResearch/Llama-2-7b-chat-hf")

# Embed documents locally instead of calling OpenAI
Settings.embed_model = HuggingFaceEmbedding(model_name="BAAI/bge-small-en-v1.5")

documents = SimpleDirectoryReader("./data").load_data()
index = VectorStoreIndex.from_documents(documents)
```
Place your documents (PDFs, text files, markdown, etc.) in a directory and load them:
```python
from llama_index.core import VectorStoreIndex, SimpleDirectoryReader
documents = SimpleDirectoryReader("./data").load_data()
index = VectorStoreIndex.from_documents(documents)
```
Create a query engine and ask questions about your data:
```python
query_engine = index.as_query_engine()
response = query_engine.query("What are the key findings in the research papers?")
print(response)
```
**Save index to disk:**
```python
index.storage_context.persist(persist_dir="./storage")  # "./storage" is the default
```
**Reload index from disk:**
```python
from llama_index.core import StorageContext, load_index_from_storage
storage_context = StorageContext.from_defaults(persist_dir="./storage")
index = load_index_from_storage(storage_context)
```
**End-to-end example: query company documents**
```python
import os
os.environ["OPENAI_API_KEY"] = "sk-..."
from llama_index.core import VectorStoreIndex, SimpleDirectoryReader
documents = SimpleDirectoryReader("./company_docs").load_data()
index = VectorStoreIndex.from_documents(documents)
query_engine = index.as_query_engine()
response = query_engine.query("What is our company's return policy?")
print(response)
```
**Load a specific PDF** (requires the `llama-index-readers-file` package):
```python
from llama_index.readers.file import PDFReader
from llama_index.core import VectorStoreIndex
loader = PDFReader()
documents = loader.load_data(file="./research_paper.pdf")
index = VectorStoreIndex.from_documents(documents)
```
**Retrieve more context per query:**
```python
from llama_index.core import VectorStoreIndex
index = VectorStoreIndex.from_documents(documents)
query_engine = index.as_query_engine(similarity_top_k=5)  # top 5 chunks instead of the default 2
response = query_engine.query("Explain the methodology")
print(response)
```