Architekt

Příspěvky

Zobrazují se příspěvky z říjen, 2024

Retrieval-Augmented Generation (RAG) with Embedding-Based Dense Retrieval

RAG is a technique where a generative AI model (like ChatGPT) doesn’t just rely on its own training data to generate responses. Instead, it retrieves relevant information from external sources (like databases or documents) to provide more accurate and up-to-date answers. 2. Keyword-Based Retrieval Keyword-based retrieval is the traditional method used to find relevant information. Here’s how it works: Keywords Extraction: The system looks for specific words or phrases (keywords) that match the user’s query. Matching: It searches the external documents for those exact keywords. Retrieval: Documents containing those keywords are retrieved and used to generate the response. Example: User Query: "Best restaurants in New York" Keywords Extracted: "best," "restaurants," "New York" Process: The system finds documents that contain these words to provide a list of top restaurants in NYC. Pros: Simple and fast. Easy to implement. ...

Pokračování Zde »

A Deep Dive into Data Flow and Transformation: Hybrid RAG System in Action Using AWS Serverless Architecture

Efficiently managing massive datasets while ensuring fast, accurate, and context-aware insights is critical. One of the most innovative solutions emerging in this space is the Hybrid Retrieval-Augmented Generation (RAG) system, which combines retrieval-based AI with generative AI models, enhanced by a Reinforcement Learning from Human Feedback (RLHF) loop. This system not only retrieves data but also generates human-readable insights, continuously improving as it receives feedback from users. In this article, we will dive into how such a system works, focusing on the data flow and the transformations that occur at each stage. To make this relatable for developers, we’ll show how the process can be set up in an AWS Serverless environment using services like Amazon S3 , AWS SageMaker , and pre-trained models from Cohere or Anthropic . Along the way, we’ll use real-world business examples and demonstrate how these components integrate into a pipeline that you could prototype in envi...

Pokračování Zde »