Mervin Praison ✅’s Post

Site Reliability Engineer

1mo

Vector Database

Chief Product Officer @ neptune.ai | Follow me to Learn about LLM and Data Engineering Systems | Author of SwirlAI Newsletter | Public Speaker

1mo

What is a 𝗩𝗲𝗰𝘁𝗼𝗿 𝗗𝗮𝘁𝗮𝗯𝗮𝘀𝗲? With the rise of Foundational Models, Vector Databases skyrocketed in popularity. The truth is that a Vector Database is also useful outside of a Large Language Model context. When it comes to Machine Learning, we often deal with Vector Embeddings. Vector Databases were created to perform specifically well when working with them: ➡️ Storing. ➡️ Updating. ➡️ Retrieving. When we talk about retrieval, we refer to retrieving set of vectors that are most similar to a query in a form of a vector that is embedded in the same Latent space. This retrieval procedure is called Approximate Nearest Neighbour (ANN) search. A query here could be in a form of an object like an image for which we would like to find similar images. Or it could be a question for which we want to retrieve relevant context that could later be transformed into an answer via a LLM. Let’s look into how one would interact with a Vector Database: 𝗪𝗿𝗶𝘁𝗶𝗻𝗴/𝗨𝗽𝗱𝗮𝘁𝗶𝗻𝗴 𝗗𝗮𝘁𝗮. 1. Choose a ML model to be used to generate Vector Embeddings. 2. Embed any type of information: text, images, audio, tabular. Choice of ML model used for embedding will depend on the type of data. 3. Get a Vector representation of your data by running it through the Embedding Model. 4. Store additional metadata together with the Vector Embedding. This data would later be used to pre-filter or post-filter ANN search results. 5. Vector DB indexes Vector Embedding and metadata separately. There are multiple methods that can be used for creating vector indexes, some of them: Random Projection, Product Quantization, Locality-sensitive Hashing. 6. Vector data is stored together with indexes for Vector Embeddings and metadata connected to the Embedded objects. 𝗥𝗲𝗮𝗱𝗶𝗻𝗴 𝗗𝗮𝘁𝗮. 7. A query to be executed against a Vector Database will usually consist of two parts: ➡️ Data that will be used for ANN search. e.g. an image for which you want to find similar ones. ➡️ Metadata query to exclude Vectors that hold specific qualities known beforehand. E.g. given that you are looking for similar images of apartments - exclude apartments in a specific location. 8. You execute Metadata Query against the metadata index. It could be done before or after the ANN search procedure. 9. You embed the data into the Latent space with the same model that was used for writing the data to the Vector DB. 10. ANN search procedure is applied and a set of Vector embeddings are retrieved. Popular similarity measures for ANN search include: Cosine Similarity, Euclidean Distance, Dot Product. Some popular Vector Databases: Qdrant, Pinecone, Weviate, Milvus, Faiss, Vespa. How are you using Vector DBs? Let me know in the comment section! #MachineLearning #GenAI #LLM #AI

To view or add a comment, sign in

More Relevant Posts

Mervin Praison ✅

Site Reliability Engineer
2mo
Report this post
20 coding Patterns

Gaurav Pandey Gaurav Pandey is an Influencer

LinkedIn Top Voice’24 | Software Engineer | 🚀 60K+ @LinkedIn Family | MERN Stack ⚛️ | React NodeJS | Full Stack Developer | Mentor
2mo

Want to crack MAANG Companies ?? Stop Spending Huge Amount of Money on Courses every again👇 🔥 𝐆𝐨𝐨𝐠𝐥𝐞 𝐈𝐬 𝐎𝐟𝐟𝐞𝐫𝐢𝐧𝐠 𝐅𝐑𝐄𝐄 𝐎𝐧𝐥𝐢𝐧𝐞 𝐂𝐨𝐮𝐫𝐬𝐞𝐬 🔥 No payment needed ✅✅ Here're 10 Google courses to get certified: 🔹 7000+ Course Free Access : https://lnkd.in/gEYK8mFG <>. Google Data Analytics: 🔺https://lnkd.in/gzu3RuCm 1. 𝗚𝗼𝗼𝗴𝗹𝗲 𝗔𝗻𝗮𝗹𝘆𝘁𝗶𝗰𝘀 𝟰 𝗖𝗲𝗿𝘁𝗶𝗳𝗶𝗰𝗮𝘁𝗶𝗼𝗻 Learn how to use Google Analytics 4, the best web analytics tool, to grow your online business. → 5 hours to complete → Intermediate level → Data-driven marketing → FREE certificate 🔹 https://lnkd.in/gu8egPdR 2. 𝗚𝗼𝗼𝗴𝗹𝗲 𝗔𝗱𝘃𝗮𝗻𝗰𝗲𝗱 𝗔𝗻𝗮𝗹𝘆𝘁𝗶𝗰𝘀 → 4 hours to complete → Advanced level → FREE certificate 🔺 https://lnkd.in/g97NV59x 3. 𝗚𝗼𝗼𝗴𝗹𝗲 𝗔𝗱𝘀 𝗗𝗶𝘀𝗽𝗹𝗮𝘆 𝗖𝗲𝗿𝘁𝗶𝗳𝗶𝗰𝗮𝘁𝗶𝗼𝗻 Learn how to use Google Display features, optimize your campaigns, and measure your results. → 2 hours to complete → Beginner level → FREE certificate 🔹 https://lnkd.in/gtGaniXg 4. 𝗚𝗼𝗼𝗴𝗹𝗲 𝗣𝗿𝗼𝗷𝗲𝗰𝘁 𝗠𝗮𝗻𝗮𝗴𝗲𝗺𝗲𝗻𝘁 𝗣𝗿𝗼𝗳𝗲𝘀𝘀𝗶𝗼𝗻𝗮𝗹 𝗖𝗲𝗿𝘁𝗶𝗳𝗶𝗰𝗮𝘁𝗲 → 40 hours to complete → Beginner level → FREE certificate 🔺 https://lnkd.in/g9e9HpBd 5. 𝗚𝗲𝘁𝘁𝗶𝗻𝗴 𝘀𝘁𝗮𝗿𝘁𝗲𝗱 𝘄𝗶𝘁𝗵 𝗙𝗹𝘂𝘁𝘁𝗲𝗿 𝗗𝗲𝘃𝗲𝗹𝗼𝗽𝗺𝗲𝗻𝘁 Learn the skills to create visually appealing desktop, mobile, and web applications using Flutter, all built from a single codebase. 🔹 https://lnkd.in/gec-FapE 6. 𝗜𝗻𝘁𝗿𝗼𝗱𝘂𝗰𝘁𝗶𝗼𝗻 𝘁𝗼 𝗦𝗤𝗟 Discover the fundamentals of utilizing SQL for extracting and controlling data within a relational database. 🔺 https://lnkd.in/gWjTj78p 7. 𝗚𝗲𝗻𝗲𝗿𝗮𝘁𝗶𝘃𝗲 𝗔𝗜 [𝗠𝗲𝗴𝗮 𝗖𝗼𝘂𝗿𝘀𝗲] Learn everything you need to know about generative AI products and technologies in this comprehensive course. 🔹 https://lnkd.in/gWZF2FFu
Like Comment
To view or add a comment, sign in
Mervin Praison ✅

Site Reliability Engineer
2mo
Report this post
Thanks for the Mention Graham Walker, MD

Permanente Medicine

5,238 followers
3mo

Technology in health care is accelerating at a breakneck speed, pushing the boundaries of medicine. But how can we ensure that these innovations serve both patients and the physicians using it? Join guest Graham Walker, MD, assistant physician in chief and clinical informatics lead with The Permanente Medical Group, and host Alex McDonald, MD, of the Southern California Permanente Medical Group, as they discuss the role of technology in improving patient outcomes, the importance of collaboration and interdisciplinary approaches, and the need for ongoing education and training to keep up with the rapid pace of change. Date: April 22, 2024 Time: Noon PT/3 p.m. ET Location: Virtual via Zoom Cost: Free Register: https://lnkd.in/d6gxkTS7

PermanenteDocs Chat on pushing the boundaries of medicine with technology a

www.linkedin.com

1 Comment
Like Comment
To view or add a comment, sign in
Mervin Praison ✅

Site Reliability Engineer
3mo
Report this post
System Integrations
Alex Xu
3mo

Top 9 Architectural Patterns for Data and Communication Flow . . 🔹 Peer-to-Peer The Peer-to-Peer pattern involves direct communication between two components without the need for a central coordinator. 🔹 API Gateway An API Gateway acts as a single entry point for all client requests to the backend services of an application. 🔹 Pub-Sub The Pub-Sub pattern decouples the producers of messages (publishers) from the consumers of messages (subscribers) through a message broker. 🔹 Request-Response This is one of the most fundamental integration patterns, where a client sends a request to a server and waits for a response. 🔹 Event Sourcing Event Sourcing involves storing the state changes of an application as a sequence of events. 🔹 ETL ETL is a data integration pattern used to gather data from multiple sources, transform it into a structured format, and load it into a destination database. 🔹 Batching Batching involves accumulating data over a period or until a certain threshold is met before processing it as a single group. 🔹 Streaming Processing Streaming Processing allows for the continuous ingestion, processing, and analysis of data streams in real-time. 🔹 Orchestration Orchestration involves a central coordinator (an orchestrator) managing the interactions between distributed components or services to achieve a workflow or business process. -- Subscribe to our weekly newsletter to get a Free System Design PDF (158 pages): https://bit.ly/3KCnWXq #systemdesign #coding #interviewtips .
1 Comment
Like Comment
To view or add a comment, sign in
Mervin Praison ✅

Site Reliability Engineer
3mo
Report this post
ORPO
Philipp Schmid

Technical Lead & LLMs at Hugging Face 🤗 | AWS ML HERO 🦸🏻♂️
3mo

Can ORPO redefine how we train and align LLMs for RLHF? So far state-of-the-art LLMs followed the process of Base Model → Supervised Fine-tuning → RLHF (PPO/DPO). This is very resource-intensive and complex. 😒 Odds Ratio Preference Optimization (ORPO) proposes a new method to train LLMs by combining SFT and Alignment into a new objective (loss function), achieving state of the art results. 🧐 𝗧𝗿𝗮𝗶𝗻𝗶𝗻𝗴 𝗽𝗿𝗼𝗰𝗲𝘀𝘀: 1️⃣ Create a pairwise preference dataset (chosen/rejected), e.g. Argilla UltraFeedback. 2️⃣ Make sure the dataset doesn’t contain instances where the chosen and rejected responses are the same, or one is empty. 3️⃣ Select a pre-trained LLM (e.g., Llama-2, Mistral) 4️⃣ Train the Base model with ORPO objective on preference dataset ^No extra SFT step, directly applied to base model 🔥 𝗜𝗻𝘀𝗶𝗴𝗵𝘁𝘀: 🧠 Reference model-free → memory friendly 🔄 Replaces SFT+DPO/PPO with 1 single method (ORPO) 🏆 ORPO Outperforms SFT, SFT+DPO on PHI-2, Llama 2, and Mistral 📊 Mistral ORPO achieves 12.20% on AlpacaEval2.0, 66.19% on IFEval, and 7.32 on MT-Bench out Hugging Face Zephyr Beta Paper: https://lnkd.in/ej7jbMyD Github: https://lnkd.in/epmgP4Qy Model: https://lnkd.in/ei9M-d8p We try to integrate ORPO into Hugging Face TRL and validate the results in the coming weeks.
Like Comment
To view or add a comment, sign in

3,166 followers

304 Posts

View Profile Follow

Mervin Praison ✅’s Post

More Relevant Posts

PermanenteDocs Chat on pushing the boundaries of medicine with technology a

www.linkedin.com

Explore topics