Chroma

Technology, Information and Internet

San Francisco, CA 4,603 followers

the AI-native open-source embedding database

View all 45 employees

About us

the AI-native open-source embedding database

Website: https://www.trychroma.com/
External link for Chroma
Industry: Technology, Information and Internet
Company size: 2-10 employees
Headquarters: San Francisco, CA
Type: Privately Held

Locations

Primary

San Francisco, CA, US

Get directions

Employees at Chroma

See all employees

Updates

Chroma

4,603 followers
2w
Report this post
NEW: Rahul Sonwalkar on building Julius AI https://lnkd.in/g2QJxWJC

Building in AI: Rahul Sonwalkar on Julius

https://www.youtube.com/

Like Comment Share
Chroma

4,603 followers
3w
Report this post
we're hiring a community lead at chroma if you are passionate about open-source and accelerating AI engineering, apply! https://lnkd.in/dBePEexU

1 Comment

Like Comment Share
Chroma

4,603 followers
1mo
Report this post
Embedding Adapters Today, we are pleased to share the first of a series of technical reports with the AI application developer community—our investigation into the use of linear embedding adapters in improving retrieval accuracy in realistic settings. Retrieval accuracy is an important determinant of AI application performance. However, many approaches to improving retrieval accuracy require large labeled corpora, which are often not available to application developers. Additionally, many of these approaches require re-computing the entire set of embeddings. While embedding adapters aren't a new idea, but to our knowledge this is the first time they have been investigated in depth. In this work, we demonstrate that applying a linear transform, trained from relatively few labeled data points, to just the query embedding, produces a significant (up to 70%) improvement in retrieval accuracy across many domains, including across languages. For many applications, this is the difference between working or not. Learn more: https://lnkd.in/gnC_FTFm

Embedding Adapters

research.trychroma.com

Like Comment Share
Chroma

4,603 followers
1mo
Report this post
We are hiring for a founding talent / recruiter

Jeff Huber

founder
1mo Edited

[Hiring for a Founding Talent/Recruiter!] Chroma is nothing without it's people. We take hiring very seriously because our team defines what’s possible for us as a company. Candidates routinely tell us that they are impressed by the depth, speed, and quality of our interview process. A recent candidate: "that was by far the best onsite experience I've ever had" We are looking for the right person to found our talent team and take us to the next level. https://lnkd.in/g24E7Pbf please reach out directly at careers@trychroma.com [please comment/share so others in-market see this]

Chroma is building the data infrastructure for AI. Join us.

careers.trychroma.com

1 Comment

Like Comment Share
Chroma

4,603 followers
4mo
Report this post
By popular demand, Chroma is supporting Backdrop Build V3. We have grants available for excellent projects in the following categories: Autonomous AI systems - Programs backed by large language models (LLMs) which are capable of operating on their own, without direct human prompting, to run a process, gather information, or accomplish a goal. They might use retrieval for their working memory. Systems that learn - Programs which improve and adapt either implicitly through use, or explicitly by being taught and then remembering skills they've learned at the right time. They might use retrieval to store and remember skills and tools. Systems that explore - Programs that are capable of recognizing when knowledge is not yet available, and either ask for help or autonomously seek out relevant new knowledge as needed. They might use retrieval to figure out what they don't yet know, and where to find out. Multi-modal systems - LLMs are now capable of processing not just text but also images, and audio. Chroma supports multi-modal retrieval, and we want to see exciting applications of these capabilities. We are interested in effective demonstrations of these capabilities, far more than full-fledged products. Take Voyager as your inspiration: https://lnkd.in/eKU_f8UF Look into the future of what might be possible, even as a toy, rather than what might be commercially viable today. Apply here: https://backdropbuild.com/

2 Comments

Like Comment Share
Chroma reposted this

Anton Troynikov

Founder.
5mo
Report this post
It’s been a year since we launched Chroma's open source embeddings store, making it easy to build retrieval into your AI application. Since then, over 7.3 million individual machines have run Chroma. The ecosystem has evolved, and we’ve learned a lot. Over the last twelve months, our thesis that retrieval would be a fundamental component of AI application development has paid off. Retrieval is also becoming an increasingly important component in AI research. Our investment in developer experience and usability, as well as our integrations with other open-source projects like LlamaIndex and LangChain, have earned our leading position in retrieval in Python, and in open source overall. So, what's next? Everybody already knows Chroma's developer cloud is coming, and soon. At launch, it will have the best product, with the best developer experience, at the best price point. Yes, it's taking longer than expected - we are confident the greater investment is worth it. Almost since launch, we've been saying that vector search alone isn't enough. The last year has shown more of what works and what doesn't in retrieval for AI. It was gratifying to see OpenAIDevs go into detail about what it takes to build retrieval that actually works at developer day.(https://lnkd.in/gBWgYmY3) Automatic embedding model selection, and the dual problem of optimal chunking, result relevancy and ranking, and automatic fine-tuning of the retrieval system aren't optional extras, but critical components which we'll be building into our product. Neural retrieval approaches, like @lateinteraction's CoLBERT, show a lot of promise - it's a natural extension of both retrieval and language models to use much more of the context of both query and document when retrieving. Looking further ahead, the retrieval-AI loop ('RAG') as it's done today is very primitive. Stuffing the LLM's context window with data blindly retrieved from elsewhere, and mixing instruction, data, and other context together is clearly suboptimal. Over the next 12 months, Chroma will be experimenting with architectures and approaches that more directly integrate the retrieval system directly with the execution LLM. There are a lot of promising directions, and open-weights models will help a lot. It's very clear that it's still very early in AI. I often draw an analogy to the early web, or to the early days of aviation - the best thing people can be doing right now is experimenting. Chroma will continue to make it as easy as possible to experiment with AI. LFG.
8 Comments

Like Comment Share
Chroma

4,603 followers
5mo
Report this post
Outset Capital

2,333 followers
5mo Edited

Photos from last week’s incredible live demos and Q&A with OSS leaders Harrison, Anton, and Brian, moderated by Ali.
- +3
Like Comment Share
Chroma

4,603 followers
6mo
Report this post
Welcome Weili Gu!
Anton Troynikov

Founder.
6mo

I am pleased to welcome Weili Gu to the Chroma team. Weili joins us from Snowflake, where she contributed core engineering work which has enabled Snowflake's incredible growth over the last seven years. Weili joins us to help us build Chroma's horizontally scalable distributed system, which will power Chroma's cloud, as well as our open-source distributed data plane. Weili joins Ben Eggers, Hammad Bashir, and Liquan Pei in our continued efforts to build scalable, robust, production ready retrieval for AI. There is still much to do. We are hiring experienced senior and staff level engineers across distributed systems, cloud infrastructure, database systems, product engineering, open source, and more. Apply here: https://lnkd.in/gHG-ZsZB
Like Comment Share
Chroma

4,603 followers
6mo
Report this post
Our new course with DeepLearning.AI is out just in time for the new year! The course covers advanced retrieval techniques for AI applications. Prof. Andrew Ng and I show you how simple vector search isn't enough on its own, and some simple but powerful approaches to improving retrieval results for your AI application. Since we launched Chroma in Feb '23, we've been saying that vector search isn't enough on its own - building robust applications with AI will take a full-fledged retrieval system. Information retrieval is a long-standing subfield of natural language processing, but its practical applications have until recently been limited to systems like web search, e-commerce recommender systems, and the like. With AI application development exploding, and with every AI application needing retrieval, the number of applications of IR, and the size of the associated problem space are several orders of magnitude larger. As a team, we are proud to contribute to the development of retrieval for AI applications, and I'm very pleased to share what we're learning with the community. look out for more throughout '24!

DeepLearning.AI

1,024,531 followers
6mo

Introducing a new course, Advanced Retrieval for AI! Built in partnership with Chroma, this course dives into advanced techniques that use a large language model to get the most relevant context for a Retrieval Augmented Generation system. After taking this course, you will be capable of: ◆ Recognizing when queries are producing poor results. ◆ Using a large language model to expand and improve your queries. ◆ Fine-tuning your embeddings with user feedback. Join today: https://hubs.la/Q02f3MPy0

1 Comment

Like Comment Share
Chroma reposted this

Turpentine

1,591 followers
8mo Edited
Report this post
🤖 New episode of The Cognitive Revolution is out! Anton Troynikov, cofounder of Chroma, joins Nathan Labenz to discuss: - The categories of organizations adapting to AI: legacy, AI-native, and AI-first - The huge amount of unstructured data that can now be processed by #AI - What it means for Chroma to be in "wartime" mode Links in comments.

2 Comments

Like Comment Share

Funding

Chroma 2 total rounds

Last Round

Seed May 6, 2023

US$ 18.0M

Investors

Quiet Capital + 7 Other investors

See more info on crunchbase

Chroma

Technology, Information and Internet

San Francisco, CA 4,603 followers

the AI-native open-source embedding database

About us

Locations

Employees at Chroma

Jeff Huber

founder

Melissa Megan

Owner at Chroma

Paul Sledd

Talent Acquisition | Engineering | Start-ups | Human Resources

Liquan Pei

Founding Member

Updates

Building in AI: Rahul Sonwalkar on Julius

https://www.youtube.com/

Embedding Adapters

research.trychroma.com

Chroma is building the data infrastructure for AI. Join us.

careers.trychroma.com

Join now to see what you are missing

Similar pages

Pinecone

Weaviate

Qdrant

LangChain

LlamaIndex

Zilliz

Milvus

Cohere

Anthropic

Hugging Face

Funding