In the last few years we've seen an explosion of audio data available online. This, coupled with advances in AI technology, has allowed organizations to unlock the value of voice data in ways that were previously impossible. As a result, we've seen organizations build new products, services, and capabilities that serve millions of people around the world.
Today, we’re announcing Universal-1, our most powerful and accurate model to date, trained on 12.5M hours of multilingual audio data to help power the next generation of Speech AI products and features.
Some key stats on Universal-1:
• Preferred 72% of the time over our most recent model, Conformer-2, in human evals
• 71% better speaker count estimation and 14% better word timestamp estimation compared to our prior models
• Up to 30% fewer hallucinations than seq2seq models like Whisper
• Just 38 seconds to process 1 hour of audio
Learn more about Universal-1 on our blog: https://lnkd.in/e5inQ-x9
AssemblyAI
Software Development
San Francisco, California 27,153 followers
Industry-leading Speech AI models to automatically recognize and understand speech.
About us
AssemblyAI is a Speech AI company focused on building new state-of-the-art AI models that can transcribe and understand human speech. Our customers, such as CallRail, Fireflies, and Spotify, choose AssemblyAI to build incredible new AI-powered experiences and products based on voice data.
AssemblyAI models and frameworks include:
- AI Speech-to-Text
- Audio Intelligence, including Summarization, Sentiment Analysis, Topic Detection, Content Moderation, PII Redaction, and more
- LeMUR, a framework for applying powerful LLMs to transcribed speech, where you can ask sophisticated questions, pull action items and recaps from your transcription, and more
To see AssemblyAI in action, choose your favorite audio or video file and upload it into our no-code playground: https://www.assemblyai.com/playground. Also, check out our customer stories and blog: https://www.assemblyai.com/blog.
- Website
- http://www.assemblyai.com
- Industry
- Software Development
- Company size
- 51-200 employees
- Headquarters
- San Francisco, California
- Type
- Privately Held
- Founded
- 2017
Products
AssemblyAI
Speech Recognition Software
At AssemblyAI, we build AI models and systems that developers and product teams use to ship transformational AI-powered audio products. As an applied AI company, our mission is to empower app builders to build 10x faster, focus on their specific use cases and user needs, and win market share with a true technology partner. We've raised over $63M in funding from leading investors, including Insight Partners, Accel, and Y Combinator. Learn more at AssemblyAI.com.
Locations
-
Primary
320 Judah St
San Francisco, California 94122, US
Updates
-
🎉 Starting today, LeMUR (AssemblyAI's framework for leveraging Large Language Models for speech) is more powerful than ever before because we're integrating four of Anthropic's Claude 3 models:
Claude 3.5 Sonnet: the most intelligent model to date
Claude 3 Opus: good at handling complex analysis, longer tasks with many steps, and higher-order math and coding tasks
Claude 3 Sonnet: strikes a balance between intelligence and speed
Claude 3 Haiku: fastest, most compact model for near-instant responsiveness
Learn more about this latest release on our blog: https://lnkd.in/ek9XjdjB
Claude 3 Models now available with LeMUR
assemblyai.com
-
We've made significant improvements to timestamp accuracy 🚀 For our Best tier, 96% of timestamps are now accurate to within 200ms for English, Spanish, and German. Check out other recent changes we've made on our changelog 👇 https://lnkd.in/ep3XT-zK
AssemblyAI | AI models to transcribe and understand speech
assemblyai.com
-
💡 New Tutorial 💡 In our latest tutorial in collaboration with Stream, learn how to build a video conferencing app with Next.js. The app supports video calls, live transcriptions, and an LLM-powered meeting assistant. Check out the full tutorial on our blog: https://lnkd.in/eujGUhhH
Build an AI-powered video conferencing app with Next.js and Stream
assemblyai.com
-
🗣 Speech AI (also known as Voice AI) is expected to transform human relationships with software and is predicted to unlock $10 billion of new software total addressable market in the next five years, says Bessemer Venture Partners in their latest State of the Cloud 2024 report. ⭐ AssemblyAI is honored to be recognized as one of the biggest players in this breakout area of AI growth. Read the full report here: https://lnkd.in/e_W4TXiE
State of the Cloud 2024
bvp.com
-
🌟 We are thrilled to announce our sponsorship of the Useless Fun AI Build-A-Thon in San Francisco by Haystack and Cloudflare 🌟
Are you a developer tired of the AI hype cycle? It's time to refresh and rejuvenate with a day dedicated to creativity, learning, and fun. Join us for a unique hackathon experience where you can build quirky AI projects, connect with fellow developers, and gain hands-on experience with AI tools.
📅 Date: September 7, 2024
📍 Location: Cloudflare, San Francisco
👩🏻‍💻 Meet like-minded creators and hackers
🔑 Get free credits to build out your quirky ideas
📚 Learn from other developers
🎉 Have fun!
Don't miss out on this exciting opportunity: take a day to create something truly unique!
✍️ Sign up here: https://lnkd.in/dgTSQwws
-
💡 New Tutorial 💡 Are you looking to optimize your Python and Flask applications by dynamically swapping AI models based on user context? Discover how you can switch between AssemblyAI's state-of-the-art Speech-to-Text models based on application contexts like user email domain, device, zip code, and more. Our Universal-1 model introduced a dual-class tier system, allowing developers to choose between the highest-accuracy “Best” tier and the cost-effective “Nano” tier. This tutorial demonstrates how to leverage these tiers with LaunchDarkly for optimal app performance. Take your app development to the next level and check out the tutorial in the LaunchDarkly post below 👇 #AI #MachineLearning #Python #Flask #LaunchDarkly #AssemblyAI #TechTutorial #AppDevelopment
In this tutorial, learn how to use LaunchDarkly to swap between AssemblyAI models based on application contexts such as user email domain, device, zip code, etc. Tailor AI-powered transcription to your needs. View the full tutorial: https://lnkd.in/gSK4Tbpx
How to Switch AssemblyAI Speech-to-Text Model Tiers by User Email With LaunchDarkly Feature Flags | LaunchDarkly
launchdarkly.com
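As a rough illustration of the idea in the post above, the snippet below picks a tier name from a user's email domain. It is only a sketch: `UserContext`, `PREMIUM_DOMAINS`, and `choose_tier` are hypothetical names, and the hard-coded rule table stands in for the feature-flag lookup that the tutorial performs with LaunchDarkly. It does not call the AssemblyAI or LaunchDarkly SDKs.

```python
from dataclasses import dataclass

# Hypothetical rule table; in the tutorial this decision comes from a
# LaunchDarkly feature flag rather than a hard-coded set.
PREMIUM_DOMAINS = {"example.com"}


@dataclass(frozen=True)
class UserContext:
    """Application context used to decide which model tier to request."""
    email: str
    device: str = "desktop"


def choose_tier(ctx: UserContext, default: str = "nano") -> str:
    """Return "best" when the email domain is in PREMIUM_DOMAINS, else the default tier."""
    # Take everything after the last "@" and compare case-insensitively.
    domain = ctx.email.rsplit("@", 1)[-1].lower()
    return "best" if domain in PREMIUM_DOMAINS else default
```

The returned string would then be mapped to whichever model option the transcription client exposes; keeping the decision in one small function makes it easy to swap the rule table for a real flag evaluation later.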
-
🤖 With the latest developments in generative AI, it is trivial to create speech in a language of your choice. You can generate voice in any manner of speaking you choose. The voice can sound happy, sad, angry or excited. Until now, though, it was difficult to generate speech in your own voice. With their new Professional Voice Cloning feature, ElevenLabs makes it possible and easily accessible.
In this tutorial, learn how to build a web-based voice-to-voice cloning app using Gradio. The technologies used in this app are:
1. Gradio - for the interface
2. AssemblyAI - for transcription
3. Python translate module - for translation of text
4. ElevenLabs - for reading translated text in your own voice
👩‍💻 You can find the code for the simple and complex apps in this repo: https://lnkd.in/eM8T3_EH
Watch the full video on YouTube: https://lnkd.in/eXWUHc9C
GitHub - AssemblyAI-Community/Voice-to-Voice-translator: A web app to generate speech in your own voice in any language you want
github.com
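The flow described in the post above (transcribe, translate, then speak) can be sketched as three pluggable stages. This is not the code from the linked repo: `voice_to_voice` and its parameters are hypothetical names, and each stage is passed in as a plain callable so that real AssemblyAI, translate, and ElevenLabs clients could be wired in without this sketch guessing at their APIs.

```python
from typing import Callable


def voice_to_voice(
    audio_path: str,
    transcribe: Callable[[str], str],      # audio file path -> source-language text
    translate: Callable[[str, str], str],  # (text, target language) -> translated text
    synthesize: Callable[[str], bytes],    # translated text -> audio bytes
    target_lang: str,
) -> bytes:
    """Run the three stages in order and return the synthesized audio."""
    text = transcribe(audio_path)
    translated = translate(text, target_lang)
    return synthesize(translated)
```

In a Gradio app, a function like this would sit behind the interface callback, with the recorded audio path as input and the synthesized audio as output.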
-
✨ AI Leaders to Watch in 2024 ✨ Bain & Company recently unveiled their list of top AI Leaders to Watch in 2024, based on factors like funding and interest generated, and AssemblyAI is in great company! See the full list of AI leaders here: https://lnkd.in/eJErF9JF Recognized for being a "must-know" Speech AI leader, we are committed to advancing and democratizing automatic speech recognition and understanding for the world. It's been incredible to see so many organizations leverage Speech AI to launch groundbreaking solutions for real-world problems—including coaching for students and valuable insights from voice data for customer support organizations. We are excited to see more organizations innovate with Speech AI now and in the future.
-
We've upgraded our Streaming (formerly Real-time) Speech-to-Text model to improve the accuracy of timestamps: most timestamps are now accurate to within 100ms 🚀 Learn more about this improvement and others we've recently made on our changelog 👇 https://lnkd.in/ep3XT-zK
AssemblyAI | AI models to transcribe and understand speech
assemblyai.com