Bedrock RAG & Knowledge Bases

Ref: https://www.udemy.com/course/aws-ai-practitioner-certified/learn/lecture/44886389

Retrieval-Augmented Generation (RAG) - Basic Concepts

🔧 Allows an FM to reference data sources outside of its training data
- 💡 Like an open-book exam for an LLM → LLM can reference stuff outside their knowledge
- Bedrock takes care of creating Vector Embeddings in the DB of your choice, i.e. a Knowledge Base, based on your data
- Use where real-time data is needed to be fed into the FM
Architecture diagram
Answers to prompts will carry reference numbers to data in the vector DB

Data source documents are chunked, then processed by an embeddings model (creates vectors), then stored into a vector DB → knowledge base created
‼️ The embedding model doesn't need to be the same FM that uses RAG!
Knowledge Base creation diagram
RAG Vector DB types by AWS to learn for the exam:
- Amazon OpenSearch Service – search & analytics DB
  - Managed cluster, serverless
  - Real-time similarity queries, store millions of vector embeddings
  - Scalable index management
  - Fast nearest-neighbor (kNN) search capability
  - 💡 Default to this DB unless you have reasons to choose another DB
- Amazon DocumentDB [with MongoDB compatibility]
  - NoSQL database
  - Real-time similarity queries, store millions of vector embeddings
- Amazon Aurora with PostgreSQL – relational (SQL) database, proprietary on AWS
- Amazon Neptune Analytics – graph database
  - High performance graph analytics solutions
  - GraphRAG (graph-based RAG) solutions
- Amazon S3 Vectors – vector embeddings stored in S3
  - Cost-effective
  - Durable storage
  - Sub-second query performance
RAG Data Sources
- Amazon S3
- Atlassian Confluence
- Microsoft SharePoint
- Salesforce
- Web pages (your website, your social media feed, etc…)
- …etc (More added over time)

💡 Most typical use case is a chatbot with specific domain-knowledge, which can query external sources and knowledge bases

Customer Service Chatbot
- Knowledge Base – products, features, specifications, troubleshooting guides, and FAQs
- RAG application – chatbot that can answer customer queries
Legal Research and Analysis
- Knowledge Base – laws, regulations, case precedents, legal opinions, and expert analysis
- RAG application – chatbot that can provide relevant information for specific legal queries
Healthcare Question-Answering
- Knowledge base – diseases, treatments, clinical guidelines, research papers, patients…
- RAG application – chatbot that can answer complex medical queries