Retrieval - Building Blocks¶

Welcome to the Retrieval Building Blocks documentation. These accelerators enable AI applications to access, query, and interact with data through various interfaces and storage mechanisms.

Overview¶

Retrieval capabilities provide the "data access layer" for AI applications, enabling semantic search, NoSQL storage, and efficient federated data retrieval across multiple sources.

Available Building Blocks¶

RAG (Retrieval-Augmented Generation)¶

Complete RAG pipeline with document ingestion, embedding generation, vector storage, and semantic search capabilities.

Key Features:

Document ingestion from IBM Cloud Object Storage
Embedding generation with IBM Watsonx.ai
Vector storage (Milvus or OpenSearch)
Semantic, keyword, and hybrid search
MCP server integration for AI assistants
Bob modes for RAG development guidance

Components:

RAG Accelerator: Complete pipeline with FastAPI REST API
RAG Ingestion MCP Server: Document ingestion for AI assistants
RAG Retrieval MCP Server: Semantic and keyword search
Bob Modes: AI assistant modes for RAG development

Vector Search ¶

Vector ingestion, embedding, and retrieval for semantic similarity search in GenAI pipelines.

Key Features:

Document parsing and extraction
Multiple embedding strategies (dense, hybrid, dual)
Flexible chunking strategies
REST API with authentication

Supported Databases:

Milvus: High-performance vector database
OpenSearch: Hybrid vector and keyword search
DataStax Astra DB: Cloud-native vector database

No SQL Database ¶

Large-scale NoSQL storage with Cassandra compatibility and optional vector capabilities for AI and application workloads.

Key Features:

Apache Cassandra-based serverless database
Vector collections for AI applications
Data API and CQL support
Scalable and highly available

Zero Copy ¶

Federated analytics without copying data. Query data across distributed sources with open lakehouse architecture.

Key Benefits:

Cost Savings: No redundant storage costs
Faster Insights: Avoids ETL delays
Single Source of Truth: Reduces data inconsistencies
Flexibility: Multiple engines access the same data
Governance: Centralized access control

IBM Products:

IBM watsonx.data
IBM Cloud Object Storage (COS)
IBM Db2 Database
Presto Query Engine

Use Cases¶

Common Retrieval Scenarios

RAG Systems: Build complete Retrieval-Augmented Generation pipelines
Question Answering: Intelligent Q&A over document collections
Semantic Search: Find documents based on meaning, not just keywords
Hybrid Search: Combine semantic understanding with keyword precision
Knowledge Management: Create searchable knowledge bases from unstructured data
AI Assistant Integration: Add RAG capabilities via MCP servers
Multi-Cloud Analytics: Query data across AWS, IBM Cloud, and on-premises
Real-Time Insights: Access live data without ETL delays
NoSQL Storage: Scalable storage for AI application data

Resources¶

Support¶

For issues or questions, please refer to the GitHub repository or contact IBM support.

Retrieval - Building Blocks¶

Overview¶

Available Building Blocks¶

RAG (Retrieval-Augmented Generation)¶

Vector Search¶

No SQL Database¶

Zero Copy¶

Use Cases¶

Resources¶

Support¶

Vector Search ¶

No SQL Database ¶

Zero Copy ¶