Haystack

Haystack is an open-source AI orchestration framework by deepset for building customizable, production-ready LLM applications through modular pipelines. With 24.6K GitHub stars and active development since 2019, it is one of the longest-standing AI frameworks, evolving from a question-answering system to a comprehensive pipeline-based orchestration platform.

Tags: framework, python, pipelines, rag, production, orchestration, deepset

Overview

Haystack was created by deepset (Berlin, Germany) in 2019 as an open-source question-answering framework focused on extractive QA over documents. Over six years it has evolved through several paradigm shifts: from neural search with dense retrievers (2020-2021), to RAG pipelines (2022-2023), to the fully redesigned Haystack 2.0 with a component-based architecture and agent workflows (2023-2024). The framework emphasizes production readiness with built-in observability, async execution, and Kubernetes integration.

Key Features

Architecture

Haystack's pipeline-centric architecture:

graph LR
  A[Preprocessor: Document Converter] --> B[Retriever: BM25 / Dense / Hybrid]
  B --> C[Reader: LLM or Extractive]
  C --> D[Postprocessor / Ranker]

Infrastructure Layer:

graph TD
  A[Document Store: Elasticsearch / Pinecone]
  B[Embedding Models: Sentence Transformers / OpenAI]
  C[LLMs: OpenAI / Anthropic / HuggingFace / Local]
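The pipeline flow above (preprocessor, retriever, reader) can be sketched in plain Python. This is a toy illustration of the DAG idea only, not Haystack's actual API; the function names and the keyword-overlap "retrieval" are invented for the sketch.

```python
# Toy sketch of a pipeline DAG: each "component" is a callable that
# transforms its input, and connections define execution order.
# Illustration only -- not Haystack's real API.

def preprocessor(raw_text):
    # Split a raw document into smaller passages
    return [p.strip() for p in raw_text.split(".") if p.strip()]

def retriever(passages, query):
    # Naive keyword "retrieval": keep passages sharing a word with the query
    words = set(query.lower().split())
    return [p for p in passages if words & set(p.lower().split())]

def reader(passages, query):
    # Stand-in for an LLM or extractive reader: return the top passage
    return passages[0] if passages else "No answer found"

raw = "Haystack builds pipelines. Pipelines connect components. Cats sleep a lot."
passages = preprocessor(raw)
hits = retriever(passages, "What are pipelines made of")
answer = reader(hits, "What are pipelines made of")
print(answer)
```

In real Haystack pipelines the same wiring is declared with `add_component` and `connect`, and each component declares typed inputs and outputs that are validated when the pipeline is built.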

Code Example

Building a RAG pipeline with Haystack 2.x:

from haystack import Pipeline
from haystack.components.generators import OpenAIGenerator
from haystack.components.builders.prompt_builder import PromptBuilder
from haystack.components.retrievers.in_memory import InMemoryBM25Retriever
from haystack.document_stores.in_memory import InMemoryDocumentStore
from haystack.dataclasses import Document
 
# Set up document store with sample data
# (the OpenAIGenerator below also requires OPENAI_API_KEY in the environment)
doc_store = InMemoryDocumentStore()
doc_store.write_documents([
    Document(content="Haystack is an AI orchestration framework by deepset."),
    Document(content="It supports modular pipelines for RAG and search."),
    Document(content="Haystack 2.0 introduced component-based architecture."),
])
 
# Build RAG pipeline
template = """
Given these documents, answer the question.
Documents: {% for doc in documents %}{{ doc.content }}{% endfor %}
Question: {{ question }}
"""
 
rag_pipeline = Pipeline()
rag_pipeline.add_component("retriever", InMemoryBM25Retriever(document_store=doc_store))
rag_pipeline.add_component("prompt", PromptBuilder(template=template))
rag_pipeline.add_component("llm", OpenAIGenerator(model="gpt-4o"))
rag_pipeline.connect("retriever", "prompt.documents")
rag_pipeline.connect("prompt", "llm")
 
result = rag_pipeline.run({
    "retriever": {"query": "What is Haystack?"},
    "prompt": {"question": "What is Haystack?"}
})
print(result["llm"]["replies"][0])
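The `InMemoryBM25Retriever` used above ranks documents with the BM25 scoring function. A minimal standalone version, simplified (whitespace tokenization, no stemming) and using commonly assumed parameters k1=1.5 and b=0.75, looks like:

```python
import math
from collections import Counter

def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Score each document against the query with a simplified BM25."""
    tokenized = [d.lower().split() for d in docs]
    avg_len = sum(len(t) for t in tokenized) / len(tokenized)
    n = len(docs)
    scores = []
    for tokens in tokenized:
        tf = Counter(tokens)
        score = 0.0
        for term in query.lower().split():
            df = sum(1 for t in tokenized if term in t)  # document frequency
            if df == 0:
                continue
            idf = math.log(1 + (n - df + 0.5) / (df + 0.5))
            # Term-frequency saturation with length normalization
            denom = tf[term] + k1 * (1 - b + b * len(tokens) / avg_len)
            score += idf * tf[term] * (k1 + 1) / denom
        scores.append(score)
    return scores

docs = [
    "Haystack is an AI orchestration framework by deepset.",
    "It supports modular pipelines for RAG and search.",
    "Haystack 2.0 introduced component-based architecture.",
]
scores = bm25_scores("what is haystack", docs)
best = docs[scores.index(max(scores))]
print(best)
```

The length normalization (the `b` term) penalizes long documents so that term frequency alone does not dominate; production retrievers add proper tokenization and inverted indexes, but the ranking logic is the same.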

Haystack vs LangChain

Aspect        | Haystack                            | LangChain
Core Paradigm | Pipeline DAGs with visual UI        | Chains/Agents with LCEL
Modularity    | Strong typing, 200+ components      | Flexible, vast integrations
Production    | Built-in observability, K8s, async  | Requires LangSmith/LangServe
RAG Focus     | Optimized for search/retrieval      | General-purpose agents
History       | Since 2019 (6+ years)               | Since 2022
Stars         | 24.6K (steady growth)               | 131K (larger community)

Timeline

See Also
