Chroma

Chroma

Technology, Information and Internet

San Francisco, CA 4,603 followers

the AI-native open-source embedding database

About us

the AI-native open-source embedding database

Website
https://www.trychroma.com/
Industry
Technology, Information and Internet
Company size
2-10 employees
Headquarters
San Francisco, CA
Type
Privately Held

Locations

Employees at Chroma

Updates

  • View organization page for Chroma, graphic

    4,603 followers

    Embedding Adapters Today, we are pleased to share the first of a series of technical reports with the AI application developer community—our investigation into the use of linear embedding adapters in improving retrieval accuracy in realistic settings. Retrieval accuracy is an important determinant of AI application performance. However, many approaches to improving retrieval accuracy require large labeled corpora, which are often not available to application developers. Additionally, many of these approaches require re-computing the entire set of embeddings. While embedding adapters aren't a new idea, but to our knowledge this is the first time they have been investigated in depth. In this work, we demonstrate that applying a linear transform, trained from relatively few labeled data points, to just the query embedding, produces a significant (up to 70%) improvement in retrieval accuracy across many domains, including across languages. For many applications, this is the difference between working or not. Learn more: https://lnkd.in/gnC_FTFm

    Embedding Adapters

    Embedding Adapters

    research.trychroma.com

  • View organization page for Chroma, graphic

    4,603 followers

    We are hiring for a founding talent / recruiter

    View profile for Jeff Huber, graphic

    founder

    [Hiring for a Founding Talent/Recruiter!] Chroma is nothing without it's people. We take hiring very seriously because our team defines what’s possible for us as a company. Candidates routinely tell us that they are impressed by the depth, speed, and quality of our interview process. A recent candidate: "that was by far the best onsite experience I've ever had" We are looking for the right person to found our talent team and take us to the next level. https://lnkd.in/g24E7Pbf please reach out directly at careers@trychroma.com [please comment/share so others in-market see this]

    Chroma is building the data infrastructure for AI. Join us.

    Chroma is building the data infrastructure for AI. Join us.

    careers.trychroma.com

  • View organization page for Chroma, graphic

    4,603 followers

    By popular demand, Chroma is supporting Backdrop Build V3. We have grants available for excellent projects in the following categories: Autonomous AI systems - Programs backed by large language models (LLMs) which are capable of operating on their own, without direct human prompting, to run a process, gather information, or accomplish a goal. They might use retrieval for their working memory. Systems that learn - Programs which improve and adapt either implicitly through use, or explicitly by being taught and then remembering skills they've learned at the right time. They might use retrieval to store and remember skills and tools. Systems that explore - Programs that are capable of recognizing when knowledge is not yet available, and either ask for help or autonomously seek out relevant new knowledge as needed. They might use retrieval to figure out what they don't yet know, and where to find out. Multi-modal systems - LLMs are now capable of processing not just text but also images, and audio. Chroma supports multi-modal retrieval, and we want to see exciting applications of these capabilities. We are interested in effective demonstrations of these capabilities, far more than full-fledged products. Take Voyager as your inspiration: https://lnkd.in/eKU_f8UF Look into the future of what might be possible, even as a toy, rather than what might be commercially viable today. Apply here: https://backdropbuild.com/

  • Chroma reposted this

    View profile for Anton Troynikov, graphic

    Founder.

    It’s been a year since we launched Chroma's open source embeddings store, making it easy to build retrieval into your AI application. Since then, over 7.3 million individual machines have run Chroma. The ecosystem has evolved, and we’ve learned a lot. Over the last twelve months, our thesis that retrieval would be a fundamental component of AI application development has paid off. Retrieval is also becoming an increasingly important component in AI research. Our investment in developer experience and usability, as well as our integrations with other open-source projects like LlamaIndex and LangChain, have earned our leading position in retrieval in Python, and in open source overall. So, what's next? Everybody already knows Chroma's developer cloud is coming, and soon. At launch, it will have the best product, with the best developer experience, at the best price point. Yes, it's taking longer than expected - we are confident the greater investment is worth it. Almost since launch, we've been saying that vector search alone isn't enough. The last year has shown more of what works and what doesn't in retrieval for AI. It was gratifying to see OpenAIDevs go into detail about what it takes to build retrieval that actually works at developer day.(https://lnkd.in/gBWgYmY3) Automatic embedding model selection, and the dual problem of optimal chunking, result relevancy and ranking, and automatic fine-tuning of the retrieval system aren't optional extras, but critical components which we'll be building into our product. Neural retrieval approaches, like @lateinteraction's CoLBERT, show a lot of promise - it's a natural extension of both retrieval and language models to use much more of the context of both query and document when retrieving. Looking further ahead, the retrieval-AI loop ('RAG') as it's done today is very primitive. Stuffing the LLM's context window with data blindly retrieved from elsewhere, and mixing instruction, data, and other context together is clearly suboptimal. Over the next 12 months, Chroma will be experimenting with architectures and approaches that more directly integrate the retrieval system directly with the execution LLM. There are a lot of promising directions, and open-weights models will help a lot. It's very clear that it's still very early in AI. I often draw an analogy to the early web, or to the early days of aviation - the best thing people can be doing right now is experimenting. Chroma will continue to make it as easy as possible to experiment with AI. LFG.

    • No alternative text description for this image
  • View organization page for Chroma, graphic

    4,603 followers

    Welcome Weili Gu!

    View profile for Anton Troynikov, graphic

    Founder.

    I am pleased to welcome Weili Gu to the Chroma team. Weili joins us from Snowflake, where she contributed core engineering work which has enabled Snowflake's incredible growth over the last seven years. Weili joins us to help us build Chroma's horizontally scalable distributed system, which will power Chroma's cloud, as well as our open-source distributed data plane. Weili joins Ben Eggers, Hammad Bashir, and Liquan Pei in our continued efforts to build scalable, robust, production ready retrieval for AI. There is still much to do. We are hiring experienced senior and staff level engineers across distributed systems, cloud infrastructure, database systems, product engineering, open source, and more. Apply here: https://lnkd.in/gHG-ZsZB

    • No alternative text description for this image
  • View organization page for Chroma, graphic

    4,603 followers

    Our new course with DeepLearning.AI is out just in time for the new year! The course covers advanced retrieval techniques for AI applications. Prof. Andrew Ng and I show you how simple vector search isn't enough on its own, and some simple but powerful approaches to improving retrieval results for your AI application. Since we launched Chroma in Feb '23, we've been saying that vector search isn't enough on its own - building robust applications with AI will take a full-fledged retrieval system. Information retrieval is a long-standing subfield of natural language processing, but its practical applications have until recently been limited to systems like web search, e-commerce recommender systems, and the like. With AI application development exploding, and with every AI application needing retrieval, the number of applications of IR, and the size of the associated problem space are several orders of magnitude larger. As a team, we are proud to contribute to the development of retrieval for AI applications, and I'm very pleased to share what we're learning with the community. look out for more throughout '24!

    View organization page for DeepLearning.AI, graphic

    1,024,531 followers

    Introducing a new course, Advanced Retrieval for AI! Built in partnership with Chroma, this course dives into advanced techniques that use a large language model to get the most relevant context for a Retrieval Augmented Generation system. After taking this course, you will be capable of: ◆ Recognizing when queries are producing poor results. ◆ Using a large language model to expand and improve your queries. ◆ Fine-tuning your embeddings with user feedback. Join today: https://hubs.la/Q02f3MPy0

Similar pages

Funding

Chroma 2 total rounds

Last Round

Seed

US$ 18.0M

See more info on crunchbase