Adaptive and Self-Reflective Retrieval Augmented Generation (RAG)

Ever wondered how to make your LLMs smarter and more reliable? Imagine a system that not only retrieves information but also corrects itself to provide accurate responses. Welcome to the world of RAG with self-correction!

How it works:

1️⃣ Question Routing Node: Routes questions to either document retrieval or web search based on relevance.
2️⃣ Retriever Node: Transforms questions into embeddings and retrieves relevant documents from the vector store.
3️⃣ Grading Documents Node: An LLM grades the retrieved documents for relevance.
4️⃣ Generating Answers Node: The LLM generates answers if the documents are deemed sufficient.
5️⃣ Self-Correction Mechanism:
🔸 No Hallucinations: Accurate answers proceed to the next check.
🔸 Hallucinations: Inaccurate answers return to the Generation Node for refinement.
6️⃣ Validating the Final Answer:
🔸 Yes: Accurate answers are provided to the user.
🔸 No: The system performs a web search for additional information.
7️⃣ Web Search Integration: Conducts a web search when necessary for additional data or to correct hallucinations.

This framework empowers LLMs to dynamically retrieve diverse data sources, refine responses autonomously, and ensure reliability through self-correction mechanisms.

Credits to the Medium post below: https://lnkd.in/dSucDu9Y

#rag #ai #llmops #genai
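The seven steps above can be sketched as a single control loop. This is a minimal, hedged illustration, not the original implementation: every component function (`route_question`, `retrieve`, `grade_documents`, `generate`, `is_hallucination`, `answers_question`, `web_search`) is a hypothetical stub that a real system would replace with LLM, vector-store, or search-API calls.

```python
# Minimal sketch of the adaptive, self-reflective RAG loop described above.
# All component functions are stubs; in practice each would call an LLM,
# a vector store, or a web search API.

def route_question(question):
    # Step 1 (stub): send index-related questions to the vector store,
    # everything else to web search.
    return "vectorstore" if "agent" in question.lower() else "web_search"

def retrieve(question):
    # Step 2 (stub): embed the question and query the vector store.
    return ["Agents use LLMs to plan and call tools."]

def grade_documents(question, docs):
    # Step 3 (stub): keep documents sharing a keyword with the question.
    words = question.lower().split()
    return [d for d in docs if any(w in d.lower() for w in words)]

def web_search(question):
    # Step 7 (stub): fetch fresh results from the web.
    return [f"Web result for: {question}"]

def generate(question, docs):
    # Step 4 (stub): ask the LLM to answer from the documents.
    return f"Answer based on {len(docs)} document(s)."

def is_hallucination(answer, docs):
    return False  # Step 5 (stub): grounding check, assumed to pass here

def answers_question(answer, question):
    return True  # Step 6 (stub): usefulness check, assumed to pass here

def adaptive_rag(question, max_retries=3):
    # Step 1: route, then Step 2/7: gather candidate documents.
    if route_question(question) == "vectorstore":
        docs = retrieve(question)
    else:
        docs = web_search(question)
    # Step 3: grade; fall back to web search if nothing relevant survives.
    docs = grade_documents(question, docs) or web_search(question)
    for _ in range(max_retries):
        answer = generate(question, docs)       # Step 4
        if is_hallucination(answer, docs):      # Step 5: regenerate
            continue
        if answers_question(answer, question):  # Step 6: done
            return answer
        docs += web_search(question)            # Step 6 "No" -> Step 7
    return answer
```

The `max_retries` cap is one way to keep the self-correction loop from cycling forever, which also touches the latency concern raised in the comments.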
How many round trips to an LLM are required, on average, to get the benefit of this?
Also, does this number of steps make the pipeline less efficient or slower? I read an article yesterday about multi-agent solutions. Would those be a solution? What do you think? For example, Autogen or crewAI.
Great post, thank you. I'm wondering how the model ensures that the data retrieved from web search is reliable enough to correct hallucinations? It might be a rare case, but it's not impossible.
Here's something I asked in perplexity.ai: adoptive self reflecting retrieval augmented generation self reflecting graphs and all other self reflecting options in anything what you recommend and new never tried options and you are a Grandmaster All-knowing Genius https://www.perplexity.ai/search/adoptive-self-reflecting-retri-Pod2xSijQN.MOCbIyFe00g
how does the "self correction" mechanism detect hallucinations?
My question would be how to get the "is this a hallucination?" question answered via self-reflection, in order to continue the workflow without human intervention.
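One common way to answer the "is this a hallucination?" question without a human in the loop is an LLM-as-judge grounding check: the generated answer and the retrieved documents are sent back to a grader model whose yes/no verdict routes the workflow. The sketch below assumes this pattern; `call_llm` is a hypothetical stand-in for a real model call, replaced here by a keyword-containment stub so the example runs.

```python
# Hedged sketch of an LLM-as-judge hallucination check. A real system would
# replace call_llm with an actual model invocation; the prompt and function
# names are illustrative assumptions, not a specific library's API.

GRADER_PROMPT = """You are a grader checking whether an answer is grounded in
the provided documents. Reply with exactly 'yes' if every claim in the answer
is supported by the documents, otherwise 'no'.

Documents:
{docs}

Answer:
{answer}"""

def call_llm(prompt):
    # Stub grader: says 'yes' only if the answer text literally appears
    # in the documents. A real LLM would judge semantic support instead.
    docs, answer = prompt.split("Documents:\n")[1].split("\n\nAnswer:\n")
    return "yes" if answer.strip() in docs else "no"

def is_grounded(answer, docs):
    # Returns True when the grader judges the answer supported by the docs;
    # a False verdict would send the workflow back to the generation node.
    prompt = GRADER_PROMPT.format(docs="\n".join(docs), answer=answer)
    return call_llm(prompt).strip().lower() == "yes"
```

Because the verdict is a single constrained token ("yes"/"no"), it is cheap to parse and can drive the graph's conditional edge directly, with no human intervention.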
How do you optimize latency to ensure user experience over that many processing steps between user intent and final answer generation? And how is hallucination measured quantitatively?
I wonder how GraphRAG fits into this picture
The RAG model enhances LLM reliability by dynamically refining responses with self-correction mechanisms. Experience the power of innovative AI, Eduardo Ordax.
How do you detect hallucinations?