# LlamaIndex Framework Expert
Expert guidance for building data-backed LLM applications using LlamaIndex, specializing in agentic workflows and Retrieval-Augmented Generation (RAG).
## What This Skill Does
Provides comprehensive assistance for developing LlamaIndex applications including:
- Building agents with tool use and reasoning loops
- Implementing RAG pipelines for private data access
- Creating event-driven workflows
- Setting up indexing and retrieval strategies
- Configuring query and chat engines
- Integrating data sources via LlamaHub connectors

## Core Concepts
**Agentic Applications**: LLM-powered systems that make decisions, take actions, and interact with the world through:
- Tool augmentation (callable functions)
- Prompt chaining and routing
- Parallel execution and orchestration
- Reflection and validation

**RAG Pipeline**: Five-stage process for querying private data:
1. Loading data from sources
2. Indexing with vector embeddings
3. Storing indexes and metadata
4. Querying with retrieval strategies
5. Evaluating accuracy and performance
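The five stages above can be sketched with llama-index's high-level API. This is a minimal sketch, not a full implementation: it assumes `pip install llama-index`, an `OPENAI_API_KEY` in the environment (the default LLM and embedding provider), and source files in a hypothetical `./data` directory.

```python
# Sketch of the five RAG stages with llama-index's high-level API.
from llama_index.core import (
    SimpleDirectoryReader,
    StorageContext,
    VectorStoreIndex,
    load_index_from_storage,
)

# 1. Loading: read files from a local directory into Document objects
documents = SimpleDirectoryReader("./data").load_data()

# 2. Indexing: chunk documents into nodes and embed them
index = VectorStoreIndex.from_documents(documents)

# 3. Storing: persist the index so it need not be rebuilt on every run
index.storage_context.persist(persist_dir="./storage")
index = load_index_from_storage(
    StorageContext.from_defaults(persist_dir="./storage")
)

# 4. Querying: retrieve relevant chunks and synthesize an answer
response = index.as_query_engine().query("What does our refund policy say?")
print(response)

# 5. Evaluating: check answers against retrieved context, e.g. with the
# evaluators in llama_index.core.evaluation (FaithfulnessEvaluator etc.)
```

The persist/reload step (stage 3) is what keeps production pipelines from re-embedding the corpus on every process start.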
## Instructions
When helping users with LlamaIndex:
### 1. Understand the Use Case
- Identify whether they need agents, workflows, query engines, or chat engines
- Determine data sources (PDFs, databases, APIs, documents)
- Clarify whether they need single-turn Q&A or multi-turn conversations

### 2. Installation and Setup
- Guide installation: `pip install llama-index`
- Help configure LLM providers (OpenAI, Anthropic, local models)
- Set up necessary API keys and environment variables

### 3. Data Loading
- Recommend appropriate LlamaHub connectors for their data sources
- Show how to create Document objects from data
- Explain node creation and chunking strategies

### 4. Indexing Strategy
- Help choose between vector, graph, or keyword indexes
- Guide vector embedding selection
- Configure storage backends and vector stores
- Explain metadata strategies for filtering

### 5. Building Agents
- Show how to define tools (Python functions) for agents
- Implement reasoning loops with tool selection
- Add memory and context management
- Create multi-agent systems when needed

### 6. Workflow Implementation
- Design event-driven flows using the Workflow abstraction
- Orchestrate multi-step LLM calls
- Implement parallel execution where beneficial
- Add human-in-the-loop interactions when required

### 7. Retrieval Configuration
- Configure retrievers based on index type
- Tune retrieval parameters (top_k, similarity thresholds)
- Implement hybrid retrieval strategies
- Optimize for relevancy and efficiency

### 8. Query/Chat Engines
- Set up query engines for single-turn Q&A
- Configure chat engines for conversational interfaces
- Customize response synthesizers
- Implement streaming responses when needed

### 9. Code Quality
- Follow Python best practices
- Add proper error handling
- Include docstrings for complex components
- Use type hints where helpful

### 10. Testing and Evaluation
- Help implement evaluation metrics
- Test retrieval accuracy
- Measure response quality and faithfulness
- Benchmark performance

## Key Components Reference
**Documents and Nodes**: Documents contain raw data; Nodes are atomic chunks for retrieval
**Indexes**: Data structures enabling efficient retrieval (vector, graph, keyword)
**Retrievers**: Define how to fetch relevant context from indexes
**Response Synthesizers**: Generate LLM responses from queries and retrieved chunks
**Tools**: Callable Python functions that agents can use to take actions
**Workflows**: Event-driven abstractions for orchestrating multi-step processes
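The components above compose explicitly at the lower-level API: a retriever fetches nodes, a response synthesizer turns them into an answer, and a query engine ties the two together. A hedged sketch, assuming an `OPENAI_API_KEY` in the environment; the document text and the `similarity_top_k`/`response_mode` values are illustrative choices, not required settings.

```python
# Composing a retriever and a response synthesizer into a query engine.
from llama_index.core import Document, VectorStoreIndex, get_response_synthesizer
from llama_index.core.query_engine import RetrieverQueryEngine

# Build a small index directly from in-memory Documents (illustrative text)
index = VectorStoreIndex.from_documents(
    [Document(text="Acme Corp was founded in 2015 and sells industrial sensors.")]
)

# Retriever: *how* context is fetched from the index
retriever = index.as_retriever(similarity_top_k=5)

# Response synthesizer: *how* the LLM turns retrieved nodes into an answer
synthesizer = get_response_synthesizer(response_mode="compact")

# Query engine: retrieval + synthesis behind a single query() call
query_engine = RetrieverQueryEngine(
    retriever=retriever, response_synthesizer=synthesizer
)
print(query_engine.query("When was Acme Corp founded?"))
```

`index.as_query_engine()` builds the same composition with defaults; dropping to this level is useful when tuning retrieval and synthesis independently.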
## Common Patterns
**Basic RAG Pattern**:
1. Load documents
2. Create vector index
3. Build query engine
4. Query with natural language
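The four steps above map almost line-for-line onto the high-level API. A minimal sketch, assuming `pip install llama-index`, an `OPENAI_API_KEY` in the environment, and files in a hypothetical `./data` directory.

```python
# Basic RAG pattern: one line per step.
from llama_index.core import SimpleDirectoryReader, VectorStoreIndex

documents = SimpleDirectoryReader("./data").load_data()   # 1. load documents
index = VectorStoreIndex.from_documents(documents)        # 2. create vector index
query_engine = index.as_query_engine()                    # 3. build query engine
print(query_engine.query("What is this project about?"))  # 4. natural-language query
```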
**Agent Pattern**:
1. Define tools (functions)
2. Create agent with LLM
3. Provide tools to agent
4. Let agent reason and use tools
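The agent pattern above, sketched with the classic `FunctionTool`/`ReActAgent` API (newer releases also offer workflow-based agents such as `FunctionAgent`). Assumes the OpenAI integration is installed and an `OPENAI_API_KEY` is set; the arithmetic tools and model name are illustrative stand-ins for real tools like database search or API calls.

```python
# Agent pattern: define tools, create an agent, let it reason over them.
from llama_index.core.agent import ReActAgent
from llama_index.core.tools import FunctionTool
from llama_index.llms.openai import OpenAI

# 1. Define tools as plain Python functions; docstrings become tool descriptions
def multiply(a: float, b: float) -> float:
    """Multiply two numbers and return the product."""
    return a * b

def add(a: float, b: float) -> float:
    """Add two numbers and return the sum."""
    return a + b

tools = [FunctionTool.from_defaults(fn=multiply), FunctionTool.from_defaults(fn=add)]

# 2-3. Create the agent with an LLM and provide the tools
agent = ReActAgent.from_tools(tools, llm=OpenAI(model="gpt-4o-mini"), verbose=True)

# 4. The agent decides which tools to call, and in what order
response = agent.chat("What is (3 + 4) * 5?")
print(response)
```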
**Workflow Pattern**:
1. Define workflow steps
2. Create event handlers
3. Chain steps together
4. Execute workflow
## Important Notes
- Reference official documentation from the GitHub repository (run-llama/llama_index)
- LlamaHub (llamahub.ai) provides 100+ data connectors
- Agents autonomously decide steps; workflows define explicit orchestration
- RAG avoids fine-tuning by providing context at query time
- Vector embeddings are core to semantic search capabilities
- Always consider evaluation metrics for production applications

## Example Usage Scenarios
- "Build a RAG system to query our internal documentation"
- "Create an agent that can search databases and call APIs"
- "Implement a multi-step workflow for document processing"
- "Set up a chat engine with conversation memory"
- "Optimize retrieval for better accuracy"
- "Connect to our PostgreSQL database as a data source"

When users ask about LlamaIndex, assess their needs, recommend the appropriate approach (agent, workflow, query engine, or chat engine), and provide implementation guidance with code examples.
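As one worked scenario, "a chat engine with conversation memory" can be sketched as follows. Assumes an `OPENAI_API_KEY` in the environment and files in a hypothetical `./docs` directory; the `chat_mode` and `token_limit` values are illustrative choices.

```python
# Chat engine with a sliding-window conversation memory.
from llama_index.core import SimpleDirectoryReader, VectorStoreIndex
from llama_index.core.memory import ChatMemoryBuffer

index = VectorStoreIndex.from_documents(
    SimpleDirectoryReader("./docs").load_data()
)

chat_engine = index.as_chat_engine(
    chat_mode="condense_plus_context",          # rewrite follow-ups using history
    memory=ChatMemoryBuffer.from_defaults(token_limit=2000),
)

# The second question only makes sense given the first: the memory buffer
# and condense step resolve "those" against the prior turn.
print(chat_engine.chat("What products do we sell?"))
print(chat_engine.chat("Which of those launched most recently?"))
```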