Mozilla.ai’s Post

Mozilla and EleutherAI brought together experts to discuss a critical question: How do we create openly licensed and open-access LLM training datasets, and how do we tackle the challenges faced by their builders? Mozilla.ai's Senior Director of Product Innovation, Julie V. Belião, participated in the workshop alongside many outstanding experts. Thank you to Kasia Odrozek, Stella Biderman, and Ayah Bdeir.

Sr. Director of Product Innovation @ Mozilla.ai | Executive Consultant, Product, Strategy, Operations

I recently had the privilege of attending the Open Dataset Convening, organized by EleutherAI and Mozilla in Amsterdam. This event was super insightful as we discussed the creation and sustainability of openly licensed and open-access LLM training datasets 👐 Together, we explored how these datasets could be built, how to make them sustainable, and ways to support their producers. Topics included the potential impact on the industry, ensuring inclusivity and equity, addressing copyright and ethical considerations, and managing data privacy rights. Our discussions also delved into developing legal and ethical frameworks, establishing collaborative structures, and ensuring access via cultural and linguistic diversity. This aligns with many of our goals at Mozilla.ai ✨ where we are committed to transparency, access, agency, and inclusivity. For more insights into these discussions and their implications, check out the great article published by Kasia Odrozek and Stella Biderman 🙏 (special thanks 💙 to Ayah Bdeir for organizing and inviting us, and to Santiago Martorana for his support and coordination 🤗) https://lnkd.in/d_9jjajG

The Dataset Convening: A community workshop on open AI datasets

https://blog.mozilla.org/en/
