Build production-ready RAG pipelines and AI agents with Haystack, an open-source, end-to-end LLM framework. Haystack orchestrates state-of-the-art embedding models, LLMs, and vector databases into flexible, customizable pipelines for retrieval-augmented generation, semantic search, and question answering.
This skill helps you work with the Haystack AI framework (`haystack-ai` package v2.x) to build RAG applications, AI agents, and semantic search systems.
Use this skill when you need to: index and query a document corpus, answer questions over private data, wire up LLM providers and vector stores, or deploy retrieval pipelines to production.
**Install Haystack:**
```bash
pip install haystack-ai
```
**For latest features from main branch:**
```bash
pip install git+https://github.com/deepset-ai/haystack.git@main
```
**Check for integrations** the user needs (vector stores, LLM providers, file converters). Common integrations:
```bash
pip install haystack-ai           # core; OpenAI components are included
pip install pinecone-haystack     # Pinecone document store
pip install weaviate-haystack     # Weaviate document store
pip install qdrant-haystack       # Qdrant document store
pip install pypdf                 # needed by the PyPDFToDocument converter
pip install python-docx           # needed by the DOCXToDocument converter
```
**Core Concepts:**
- **Components**: typed units of work (embedders, retrievers, generators, converters, writers) with declared inputs and outputs
- **Pipelines**: graphs of connected components; data flows along the connections you declare
- **Document stores**: pluggable backends (in-memory, Pinecone, Weaviate, Qdrant, ...) that hold `Document` objects and their embeddings
- **Documents**: the `haystack.Document` dataclass carrying content, metadata, and optionally an embedding

**Key component types:** converters (files → Documents), preprocessors (cleaners/splitters), embedders (document and text), retrievers, prompt builders, generators, and writers.
**Step-by-step approach:**
1. **Set up document store and indexing pipeline:**
```python
from haystack import Pipeline
from haystack.components.embedders import SentenceTransformersDocumentEmbedder
from haystack.components.writers import DocumentWriter
from haystack.document_stores.in_memory import InMemoryDocumentStore
document_store = InMemoryDocumentStore()
indexing_pipeline = Pipeline()
indexing_pipeline.add_component("embedder", SentenceTransformersDocumentEmbedder())
indexing_pipeline.add_component("writer", DocumentWriter(document_store=document_store))
indexing_pipeline.connect("embedder", "writer")
```
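The indexing pipeline above simply routes each component's output into the next one's input. A toy pure-Python sketch of that dataflow (illustration only, not the Haystack API; the function and variable names here are made up):

```python
def embedder(documents):
    # Stand-in for SentenceTransformersDocumentEmbedder: attach a fake vector.
    return [{"content": d, "embedding": [float(len(d))]} for d in documents]

def writer(store, documents):
    # Stand-in for DocumentWriter: persist the embedded documents.
    store.extend(documents)
    return len(documents)

store = []                                  # stand-in for InMemoryDocumentStore
embedded = embedder(["First doc", "Second doc"])
written = writer(store, embedded)           # the "embedder" -> "writer" connection
print(written)                              # 2
```

The real pipeline does the same routing automatically once you declare `connect("embedder", "writer")`.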
2. **Index documents:**
```python
from haystack import Document
documents = [
Document(content="Your document text here..."),
Document(content="Another document..."),
]
indexing_pipeline.run({"embedder": {"documents": documents}})
```
3. **Build query pipeline:**
```python
from haystack.components.retrievers import InMemoryEmbeddingRetriever
from haystack.components.embedders import SentenceTransformersTextEmbedder
from haystack.components.builders import PromptBuilder
from haystack.components.generators import OpenAIGenerator
query_pipeline = Pipeline()
query_pipeline.add_component("text_embedder", SentenceTransformersTextEmbedder())
query_pipeline.add_component("retriever", InMemoryEmbeddingRetriever(document_store=document_store))
query_pipeline.add_component("prompt_builder", PromptBuilder(template="""
Answer the question based on the context below.
Context: {% for doc in documents %}{{ doc.content }}{% endfor %}
Question: {{question}}
Answer:
"""))
query_pipeline.add_component("llm", OpenAIGenerator())  # reads OPENAI_API_KEY from the environment
query_pipeline.connect("text_embedder.embedding", "retriever.query_embedding")
query_pipeline.connect("retriever", "prompt_builder.documents")
query_pipeline.connect("prompt_builder", "llm")
```
4. **Run queries:**
```python
result = query_pipeline.run({
"text_embedder": {"text": "What is the main topic?"},
"prompt_builder": {"question": "What is the main topic?"}
})
print(result["llm"]["replies"][0])
```
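Under the hood, the embedding retriever ranks stored document vectors by similarity to the query vector. A minimal pure-Python sketch of cosine-similarity ranking (illustration only; Haystack's `InMemoryEmbeddingRetriever` does this for you, and the vectors below are made up):

```python
import math

def cosine(a, b):
    # Cosine similarity: dot product normalized by vector magnitudes.
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

docs = {
    "doc_a": [0.9, 0.1, 0.0],
    "doc_b": [0.1, 0.8, 0.1],
    "doc_c": [0.0, 0.2, 0.9],
}
query = [1.0, 0.0, 0.1]

# Rank document ids from most to least similar to the query vector.
ranked = sorted(docs, key=lambda d: cosine(query, docs[d]), reverse=True)
print(ranked[0])  # doc_a
```

The retriever's `top_k` parameter simply truncates this ranked list.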
**Always check** which provider the user wants. Common patterns:
**OpenAI:**
```python
from haystack.components.generators import OpenAIGenerator
from haystack.utils import Secret

generator = OpenAIGenerator(api_key=Secret.from_env_var("OPENAI_API_KEY"), model="gpt-4")
```
**Cohere:**
```python
# requires: pip install cohere-haystack
from haystack_integrations.components.generators.cohere import CohereGenerator
from haystack.utils import Secret

generator = CohereGenerator(api_key=Secret.from_env_var("COHERE_API_KEY"))
```
**Hugging Face (local or hosted):**
```python
# requires: pip install "transformers[torch]"
from haystack.components.generators import HuggingFaceLocalGenerator

generator = HuggingFaceLocalGenerator(model="meta-llama/Llama-2-7b-hf")
```
**Azure OpenAI:**
```python
import os

from haystack.components.generators import AzureOpenAIGenerator
from haystack.utils import Secret

generator = AzureOpenAIGenerator(
    azure_endpoint=os.getenv("AZURE_ENDPOINT"),
    api_key=Secret.from_env_var("AZURE_API_KEY"),
    azure_deployment="your-deployment-name",  # the model deployment in your Azure resource
)
```
**Determine** which vector store the user needs. Installation and setup examples:
**Pinecone:**
```bash
pip install pinecone-haystack
```
```python
from haystack_integrations.document_stores.pinecone import PineconeDocumentStore
from haystack.utils import Secret

document_store = PineconeDocumentStore(
    api_key=Secret.from_env_var("PINECONE_API_KEY"),
    index="your-index-name",
)
```
**Weaviate:**
```bash
pip install weaviate-haystack
```
```python
from haystack_integrations.document_stores.weaviate import WeaviateDocumentStore
document_store = WeaviateDocumentStore(url="http://localhost:8080")
```
**Qdrant:**
```bash
pip install qdrant-haystack
```
```python
from haystack_integrations.document_stores.qdrant import QdrantDocumentStore
document_store = QdrantDocumentStore(url="http://localhost:6333")
```
**File conversion components:**
```python
from haystack.components.converters import PyPDFToDocument, TextFileToDocument
from haystack.components.preprocessors import DocumentCleaner, DocumentSplitter
preprocessing_pipeline = Pipeline()
preprocessing_pipeline.add_component("converter", PyPDFToDocument())
preprocessing_pipeline.add_component("cleaner", DocumentCleaner())
preprocessing_pipeline.add_component("splitter", DocumentSplitter(split_by="sentence", split_length=10))
preprocessing_pipeline.add_component("embedder", SentenceTransformersDocumentEmbedder())
preprocessing_pipeline.add_component("writer", DocumentWriter(document_store))
preprocessing_pipeline.connect("converter", "cleaner")
preprocessing_pipeline.connect("cleaner", "splitter")
preprocessing_pipeline.connect("splitter", "embedder")
preprocessing_pipeline.connect("embedder", "writer")
```
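The `DocumentSplitter(split_by="sentence", split_length=10)` step groups sentences into fixed-size chunks. A rough pure-Python approximation of that behavior (simplified; the real splitter also supports overlap and other split units):

```python
import re

def split_by_sentence(text, split_length=10):
    # Break on whitespace that follows sentence-ending punctuation,
    # then group every `split_length` sentences into one chunk.
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    return [
        " ".join(sentences[i : i + split_length])
        for i in range(0, len(sentences), split_length)
    ]

text = "One. Two. Three. " * 8          # 24 short sentences
chunks = split_by_sentence(text, split_length=10)
print(len(chunks))                      # 3 chunks: 10 + 10 + 4 sentences
```

Smaller chunks improve retrieval precision but increase the number of stored embeddings; tune `split_length` for your corpus.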
**For complex decision-making systems**, use the `Agent` component (Haystack 2.x does not ship a `ReactAgent`; `Agent` implements the tool-calling loop):
```python
from haystack.components.agents import Agent
from haystack.components.generators.chat import OpenAIChatGenerator
from haystack.dataclasses import ChatMessage
from haystack.tools import tool

@tool
def search(query: str) -> str:
    """Search documents for the given query."""
    # Your search logic
    return "Search results..."

agent = Agent(
    chat_generator=OpenAIChatGenerator(model="gpt-4"),
    tools=[search],
    max_agent_steps=5,
)

result = agent.run(messages=[ChatMessage.from_user("Find information about X and summarize it")])
print(result["messages"][-1].text)
```
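Conceptually, an agent alternates between asking the LLM for a decision and executing the chosen tool until the LLM produces a final answer. A toy sketch of that loop with the LLM's decisions mocked as scripted steps (illustration only, not the Haystack API):

```python
def search(query):
    # Stand-in tool: in a real agent this would query your document store.
    return f"results for {query!r}"

TOOLS = {"search": search}

# Mocked "LLM decisions": first call a tool, then emit a final answer.
scripted_steps = [
    {"tool": "search", "args": {"query": "topic X"}},
    {"answer": "Summary based on the search results."},
]

transcript = []
for step in scripted_steps:
    if "tool" in step:
        # Tool call: execute it and record the observation for the next turn.
        transcript.append(TOOLS[step["tool"]](**step["args"]))
    else:
        # Final answer: the loop terminates here.
        transcript.append(step["answer"])

print(transcript[-1])  # Summary based on the search results.
```

A real agent feeds each observation back to the LLM, which is why `max_agent_steps` matters: it bounds how many tool calls the loop may make.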
**Evaluate RAG performance** (the LLM-based evaluators read `OPENAI_API_KEY` by default):
```python
from haystack.components.evaluators import ContextRelevanceEvaluator, FaithfulnessEvaluator

evaluator = FaithfulnessEvaluator()
result = evaluator.run(
    questions=["What is X?"],
    contexts=[[doc.content for doc in retrieved_docs]],  # one list of context strings per question
    predicted_answers=[generated_answer],
)
print(result["score"])
```
**Optimize retrieval:** tune the retriever's `top_k`, add a ranker (e.g. `TransformersSimilarityRanker`) after the retriever, combine BM25 and embedding retrieval for hybrid search, and use metadata filters to narrow the candidate set.
**For production deployments**, Hayhooks serves Haystack pipelines as REST APIs:
```bash
pip install hayhooks
hayhooks run   # starts the server; deploy serialized pipelines via the Hayhooks CLI
               # (check `hayhooks --help` for the current deploy command)
```
For enterprise needs, deepset (the company behind Haystack) also offers:
- Enterprise-grade templates
- Expert support from the Haystack team
- Deployment guides for cloud/on-prem
**Enable logging when debugging pipelines:**
```python
import logging

logging.basicConfig(level=logging.INFO)
logging.getLogger("haystack").setLevel(logging.DEBUG)  # verbose component-level logs
```
**Component connection errors:** socket names must match on both ends; connect explicitly (e.g. `pipeline.connect("retriever.documents", "prompt_builder.documents")`) and use `print(pipeline)` to inspect the available sockets.
**Embedding dimension mismatches:** use the same embedding model for indexing and querying, and make sure the document store's configured dimension matches the model's output size.
**Memory issues with large documents:** split documents into smaller chunks with `DocumentSplitter` and index in batches rather than all at once.
**LLM rate limits:** throttle or batch requests, add retries with exponential backoff, and reduce the retriever's `top_k` to shrink prompts.
1. **Start simple**: Begin with InMemoryDocumentStore and basic components, then scale
2. **Version control pipelines**: Save pipeline configurations as YAML files
3. **Monitor performance**: Track retrieval quality, latency, and LLM costs
4. **Iterative prompt engineering**: Use PromptBuilder templates and iterate on prompt design
5. **Test with diverse queries**: Ensure RAG system handles edge cases
6. **Document metadata**: Use metadata filtering for better retrieval precision
7. **Keep dependencies updated**: `pip install -U haystack-ai` for latest bug fixes
8. **Read component docs**: Each component has specific parameters - check [docs.haystack.deepset.ai](https://docs.haystack.deepset.ai)
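As an illustration of best practice 6, metadata filtering narrows the candidate set before any ranking happens. A minimal pure-Python sketch of the idea (Haystack document stores and retrievers expose this through a `filters` argument; the documents below are made up):

```python
docs = [
    {"content": "Q3 revenue grew 12%.", "meta": {"dept": "finance", "year": 2024}},
    {"content": "New onboarding policy.", "meta": {"dept": "hr", "year": 2024}},
    {"content": "Q3 2023 revenue report.", "meta": {"dept": "finance", "year": 2023}},
]

def filter_docs(docs, **conditions):
    # Keep only documents whose metadata matches every condition.
    return [d for d in docs if all(d["meta"].get(k) == v for k, v in conditions.items())]

hits = filter_docs(docs, dept="finance", year=2024)
print(len(hits))           # 1
print(hits[0]["content"])  # Q3 revenue grew 12%.
```

Filtering first means the embedding similarity search only has to rank documents that are already plausible candidates.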
**User request**: "Help me build a RAG system that searches my company docs and answers questions using GPT-4"
**Your response**:
1. Confirm requirements (doc formats, vector store preference, scale)
2. Install dependencies: `pip install haystack-ai pinecone-haystack`
3. Set up Pinecone document store
4. Create indexing pipeline (file converter → splitter → embedder → writer)
5. Index documents from specified directory
6. Build query pipeline (text embedder → retriever → prompt builder → OpenAI generator)
7. Test with sample queries
8. Provide deployment options (Hayhooks, containerization)