Jon McKinney, Director of Research, H2O.ai, and Arno Candel, Chief Technology Officer, H2O.ai. H2O Open Source GenAI World SF 2023
This document provides an overview of Google Cloud's offerings for generative AI. It begins with a primer on large language models and generative AI, explaining what they are and how they have evolved. It then outlines Google's role in pioneering developments in the field like BERT and Transformer models. The rest of the document details Google's portfolio of products and services for generative AI, including foundation models like PaLM, experiences for consumers and enterprises, and tools for developers and AI practitioners. It emphasizes that Google aims to support a wide range of needs through its family of generative AI models and applications.
With LangChain, developers “chain” together different LLM components to create more advanced use cases around LLMs. Agents use LLMs to decide which actions should be taken. This talk introduces LangChain and what you can do with Agents, Tools, and communication APIs. Talk given at PyBay 2023 in San Francisco, CA on Sunday, October 8, 2023.
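The agent idea described above can be sketched in a few lines of plain Python. This is not LangChain's actual API — the function and tool names are illustrative, and the "LLM" is stubbed with a rule-based function — but it shows the loop an agent framework automates: the model chooses a tool and its input, the tool runs, and the observation comes back.

```python
# Minimal sketch of an agent loop (illustrative names, not LangChain's API):
# an "LLM" (stubbed as a rule-based function) picks a tool, the tool runs,
# and the observation is returned.

def calculator(expression: str) -> str:
    """Tool: evaluate a simple arithmetic expression."""
    return str(eval(expression, {"__builtins__": {}}))

def word_count(text: str) -> str:
    """Tool: count the words in a piece of text."""
    return str(len(text.split()))

TOOLS = {"calculator": calculator, "word_count": word_count}

def fake_llm_decide(question: str) -> tuple[str, str]:
    """Stand-in for an LLM choosing an action from the question."""
    if any(ch.isdigit() for ch in question):
        return "calculator", "2 + 3 * 4"
    return "word_count", question

def run_agent(question: str) -> str:
    tool_name, tool_input = fake_llm_decide(question)
    observation = TOOLS[tool_name](tool_input)
    return f"{tool_name} -> {observation}"

print(run_agent("What is 2 + 3 * 4?"))  # calculator -> 14
```

In a real agent the decision step is a prompted LLM call and the loop repeats until the model emits a final answer.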
The document discusses using generative AI to improve learning products by making them better, stronger, and faster. It provides examples of using generative models for game creation, runtime design, and postmortem data analysis. It also addresses ethics and copyright challenges and considers generative AI as both a tool and potential friend. The document explores what models are, how they work, examples of applications, and resources for staying up to date on generative AI advances.
This paper proposes Forward-Looking Active REtrieval augmented generation (FLARE), a method that iteratively retrieves information during text generation based on the predicted upcoming sentence. If the predicted next sentence contains low-confidence tokens, FLARE uses it as a query to retrieve documents, then regenerates the sentence. Experiments show FLARE outperforms baselines on multiple knowledge-intensive tasks. However, FLARE did not significantly improve performance on a short-text dataset, where continual retrieval of disparate information may not be needed.
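The core loop can be sketched as follows. This is a hedged reconstruction from the summary above, not the authors' code: `generate_sentence` and `retrieve` are stubs standing in for an LLM (returning per-token confidences) and a document retriever.

```python
# Sketch of FLARE's retrieve-when-uncertain loop (stubs, not the paper's code):
# generate a tentative sentence; if any token is below a confidence threshold,
# use the sentence as a retrieval query and regenerate with the documents.

THRESHOLD = 0.5

def generate_sentence(context, docs=None):
    # Stub LLM: returns (sentence, per-token confidences).
    if docs:  # grounded regeneration is more confident
        return "Paris is the capital of France.", [0.9] * 6
    return "Paris is the capital of France.", [0.9, 0.9, 0.9, 0.9, 0.3, 0.9]

def retrieve(query):
    # Stub retriever: would query a document index with the tentative sentence.
    return ["France's capital is Paris."]

def flare_step(context):
    sentence, confidences = generate_sentence(context)
    if min(confidences) < THRESHOLD:      # low-confidence token found
        docs = retrieve(sentence)         # tentative sentence as the query
        sentence, _ = generate_sentence(context, docs=docs)
    return sentence
```

When every token clears the threshold, no retrieval is triggered — which matches the paper's observation that continual retrieval is unnecessary for some tasks.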
Generative AI has been rapidly evolving, enabling different and more sophisticated interactions with Large Language Models (LLMs) like those available in IBM watsonx.ai or Meta Llama2. In this session, we will take a use-case-based approach to look at how LLMs can be leveraged together with existing automation technologies like Workflow, Content Management, and Decisions to enable new solutions.
The presentation "ITDays_2023_GeorgeBara" discusses challenges in adopting AI large language models (LLMs) in enterprise settings. The presentation covers:

1. **Challenges in AI LLM adoption**: Highlights the noise in the current AI landscape and questions the practical use of AI in real businesses.
2. **The DNA of an Enterprise**: Defines enterprise sizes and discusses the new-solution adoption process, emphasizing effective integration and minimal disruption.
3. **Enterprise-Grade**: Lists qualities essential for enterprise-grade solutions, such as robustness, reliability, scalability, performance, security, and support.
4. **What are LLMs?**: Describes the pre-ChatGPT era with BERT, a model used for language understanding, and details its enterprise applications.
5. **LLM use cases before ChatGPT**: Focuses on data triage, process automation, knowledge management, and the augmentation of business operations.
6. **EU Digital Decade Report**: Points out that AI adoption in Europe is slow and might not meet the 2030 targets.
7. **Adoption Challenges**: Addresses top challenges such as data security, predictability, performance, control, regulatory compliance, ethics, sustainability, and ROI.
8. **Conclusion**: Reflects on the slow adoption of AI in enterprises, suggesting that a surge might occur once the technology matures and is ready for enterprise use.

The presenter concludes that despite the hype around technologies like ChatGPT, enterprises are cautious and will adopt new technologies at their own pace. He anticipates a gradual, then sudden, adoption pattern once LLMs are proven to be enterprise-ready.
Pascal Pfeiffer, Principal Data Scientist, H2O.ai H2O Open Source GenAI World SF 2023 This talk dives into the expansive ecosystem of Large Language Models (LLMs), offering practitioners an insightful guide to various relevant applications, from natural language understanding to creative content generation. While exploring use cases across different industries, it also honestly addresses the current limitations of LLMs and anticipates future advancements.
This document summarizes a presentation by Professor Pekka Abrahamsson on how ChatGPT and AI-assisted coding are profoundly changing software engineering. The presentation covers several key points:

- ChatGPT and AI tools like Copilot are beginning to be adopted in software engineering to provide code snippets, answers to technical questions, and debugging assistance, but issues around code ownership, reliability, and security need to be addressed.
- Early studies show potential benefits of ChatGPT for tasks like software testing education, code quality improvement, and requirements elicitation, but more research is still needed.
- Prompt engineering techniques can help maximize the usefulness of ChatGPT for software engineering tasks.

Overall, AI
This document provides information about a bootcamp to build applications using Large Language Models (LLMs). The bootcamp consists of 11 modules covering topics such as introduction to generative AI, text analytics techniques, neural network models for natural language processing, transformer models, embedding retrieval, semantic search, prompt engineering, fine-tuning LLMs, orchestration frameworks, the LangChain application platform, and a final project to build a custom LLM application. The bootcamp will be held in various locations and dates between September 2023 and January 2024.
Mihai is the Principal Architect for Platform Engineering and Technology Solutions at IBM, responsible for Cloud Native and AI Solutions. He is a Red Hat Certified Architect, CKA/CKS, a leader in the IBM Open Innovation community, and an advocate for open source development. Mihai is driving the development of Retrieval Augmented Generation platforms and solutions for Generative AI at IBM that leverage WatsonX, vector databases, LangChain, HuggingFace, and open source AI models.

Mihai will share lessons learned building Retrieval Augmented Generation, or “Chat with Documents,” platforms and APIs that scale and deploy on Kubernetes. His talk will cover use cases for Generative AI; limitations of Large Language Models; and the use of RAG, vector databases, and fine tuning to overcome model limitations and build solutions that connect to your data, provide content grounding, limit hallucinations, and form the basis of explainable AI. In terms of technology, he will cover LLAMA2, HuggingFace TGIS, SentenceTransformers embedding models using Python, LangChain, and the Weaviate and ChromaDB vector databases. He'll also share tips on writing code using LLMs, including building an agent for Ansible and containers.

Scaling factors for Large Language Model architectures:
• Vector database: consider sharding and high availability
• Fine tuning: collecting data to be used for fine tuning
• Governance and model benchmarking: how are you testing your model's performance over time, with different prompts, one-shot, and various parameters?
• Chain of reasoning and agents
• Caching embeddings and responses
• Personalization and a conversational memory database
• Streaming responses and optimizing performance. A fine-tuned 13B model may perform better than a poor 70B one!
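The embed-and-retrieve core of such a RAG platform can be illustrated without any of the stacks named above. The sketch below is a toy: SentenceTransformers would produce dense vectors and Weaviate or ChromaDB would handle indexing and nearest-neighbour search, whereas here a bag-of-words vector and brute-force cosine similarity stand in for both.

```python
import math
from collections import Counter

# Toy stand-in for a RAG retrieval step: "embed" documents, then rank them
# by cosine similarity to the query. Real systems use dense embedding models
# and a vector database instead of bag-of-words and brute force.

def embed(text: str) -> Counter:
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

documents = [
    "Kubernetes deploys containers at scale",
    "Fine tuning adapts a base model to a task",
    "Vector databases store embeddings for retrieval",
]
index = [(doc, embed(doc)) for doc in documents]

def retrieve(query: str, k: int = 1) -> list[str]:
    q = embed(query)
    ranked = sorted(index, key=lambda d: cosine(q, d[1]), reverse=True)
    return [doc for doc, _ in ranked[:k]]

# The retrieved passages would then be prepended to the LLM prompt
# to ground the answer in your own data.
print(retrieve("where are embeddings stored"))
```

Caching the `embed` results, as the scaling list suggests, avoids recomputing vectors for unchanged documents.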
• Calling third-party functions or APIs for reasoning or other types of data (e.g., LLMs are terrible at reasoning and prediction; consider calling other models)
• Fallback techniques: fall back to a different model, or to default answers
• API scaling techniques, rate limiting, etc.
• Async, streaming, and parallelization; multiprocessing; GPU acceleration (including embeddings); generating your API using OpenAPI; etc.
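The fallback pattern from the list above can be sketched as a simple chain. The model callables here are hypothetical stubs; in practice they would be API clients with their own timeouts and retries.

```python
# Sketch of a model-fallback chain: try the primary model, fall back to a
# cheaper backup, and finally return a default answer. Stubs, not real clients.

def primary_model(prompt: str) -> str:
    raise TimeoutError("primary model unavailable")  # simulate an outage

def backup_model(prompt: str) -> str:
    return f"backup answer for: {prompt}"

DEFAULT_ANSWER = "Sorry, I can't answer that right now."

def answer(prompt: str) -> str:
    for model in (primary_model, backup_model):
        try:
            return model(prompt)
        except Exception:
            continue  # try the next model in the chain
    return DEFAULT_ANSWER

print(answer("ping"))  # backup answer for: ping
```

The same structure extends naturally to rate-limit errors: catch them, back off, and move down the chain.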
This document provides a technical introduction to large language models (LLMs). It explains that LLMs are based on simple probabilities derived from their massive training corpora, containing trillions of examples. The document then discusses several key aspects of how LLMs work, including that they function as a form of "lossy text compression" by encoding patterns and relationships in their training data. It also outlines some of the key elements in the architecture and training of the most advanced LLMs, such as GPT-4, focusing on their huge scale, transformer architecture, and use of reinforcement learning from human feedback.
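The "simple probabilities" framing can be made concrete with a toy bigram model: estimate P(next word | current word) from counts in a corpus. Real LLMs condition on long contexts with a transformer rather than a single preceding word, but the output is likewise a probability distribution over the next token.

```python
from collections import Counter, defaultdict

# Toy next-token model: bigram counts from a tiny corpus turned into
# conditional probabilities. Illustrative only - LLMs learn far richer
# conditional distributions, but the probabilistic core is the same.

corpus = "the cat sat on the mat the cat ate".split()

counts: defaultdict[str, Counter] = defaultdict(Counter)
for w1, w2 in zip(corpus, corpus[1:]):
    counts[w1][w2] += 1

def next_token_probs(word: str) -> dict[str, float]:
    total = sum(counts[word].values())
    return {w: c / total for w, c in counts[word].items()}

# After "the", "cat" is twice as likely as "mat" in this corpus.
print(next_token_probs("the"))
```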
Prompt engineering is a fundamental concept within the field of artificial intelligence, with particular relevance to natural language processing. It involves the strategic embedding of task descriptions within the input data of an AI system, often in the form of a question or query, as opposed to explicitly providing the task description separately. This approach optimizes the efficiency and effectiveness of AI models by encapsulating the desired outcome within the input context, thereby enabling more streamlined and context-aware responses.
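A minimal example makes the idea concrete: the task description and a worked example are embedded directly in the input handed to the model, rather than supplied as a separate specification. The template below is illustrative, not tied to any particular model API.

```python
# Illustrative prompt template: task description, a few-shot example, and
# the new input are all folded into one prompt string for the model.

def build_prompt(text: str) -> str:
    return (
        "Classify the sentiment of the review as positive or negative.\n"
        "Review: The battery lasts forever. Sentiment: positive\n"
        f"Review: {text} Sentiment:"
    )

print(build_prompt("The screen cracked after a week."))
```

The model completes the string after "Sentiment:", so the prompt's context steers it toward the desired output format without any separate task channel.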
With the recent buzz around Generative AI and Large Language Models, the question is to what extent these technologies can be applied at work or in study, and how easy it is to manage and develop your own models. Hear from our guest speaker from Google as he shares insights into how industries are evolving with these trends and some of Google's offerings, from Duet AI in Google Workspace to the GenAI App Builder on Google Cloud.
The document discusses key players in generative AI and their progress. It provides an overview of generative AI including its evolution since 1950, where the spending is focused, how the technology works, and deployment models. It then profiles several major companies leading advancements in generative AI, including their strategies, growth areas, and risks. These companies are TSMC, Nvidia, Microsoft, Google, Amazon, Tesla, Oracle, Salesforce, SAP, and Palo Alto Networks.
The document discusses Vertex AI pipelines for MLOps workflows. It begins with an introduction of the speaker and their background. It then discusses what MLOps is, defining three levels of automation maturity. Vertex AI is introduced as Google Cloud's managed ML platform. Pipelines are described as orchestrating the entire ML workflow through components. Custom components and conditionals allow flexibility. Pipelines improve reproducibility and sharing. Changes can trigger pipelines through services like Cloud Build, Eventarc, and Cloud Scheduler to continuously adapt models to new data.
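The pipeline-of-components idea can be sketched in plain Python. This is a conceptual sketch, not the Vertex AI or Kubeflow Pipelines SDK: in Vertex AI Pipelines each function below would be a containerized component, and the conditional would be a pipeline conditional rather than an `if`.

```python
# Conceptual sketch of an ML pipeline as composed components with a
# conditional deploy step. All functions are toy stand-ins, not real
# training code or the Vertex AI SDK.

def preprocess(raw: list[float]) -> list[float]:
    return [x / max(raw) for x in raw]          # scale to [0, 1]

def train(data: list[float]) -> dict:
    return {"weights": sum(data) / len(data)}   # stand-in "model"

def evaluate(model: dict) -> float:
    return model["weights"]                     # stand-in metric

def pipeline(raw: list[float], deploy_threshold: float = 0.5):
    data = preprocess(raw)
    model = train(data)
    metric = evaluate(model)
    deployed = metric >= deploy_threshold       # conditional step
    return metric, deployed

metric, deployed = pipeline([1, 2, 3, 4])
```

Because the pipeline is a single callable over its inputs, re-running it on new data (as triggered by Cloud Build, Eventarc, or Cloud Scheduler) reproduces the same workflow end to end, which is the reproducibility benefit the talk highlights.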