Log in

goodpods headphones icon

To access all our features

Open the Goodpods app
Close icon
Practical AI: Machine Learning, Data Science, LLM - Towards high-quality (maybe synthetic) datasets

Towards high-quality (maybe synthetic) datasets

Practical AI: Machine Learning, Data Science, LLM

10/09/24 • 57 min

plus icon
bookmark
Share icon

As Argilla puts it: “Data quality is what makes or breaks AI.” However, what exactly does this mean and how can AI team probably collaborate with domain experts towards improved data quality? David Berenstein & Ben Burtenshaw, who are building Argilla & Distilabel at Hugging Face, join us to dig into these topics along with synthetic data generation & AI-generated labeling / feedback.

Join the discussion

Changelog++ members save 11 minutes on this episode because they made the ads disappear. Join today!

Sponsors:

  • Fly.ioThe home of Changelog.com — Deploy your apps close to your users — global Anycast load-balancing, zero-configuration private networking, hardware isolation, and instant WireGuard VPN connections. Push-button deployments that scale to thousands of instances. Check out the speedrun to get started in minutes.
  • WorkOSA platform that gives developers a set of building blocks for quickly adding enterprise-ready features to their application. Add Single Sign-On (Okta, Azure, Google, Microsoft OAuth), sync users from any SCIM directory, HRIS integration, audit trails (SIEM), free magic link sign-in. WorkOS is designed for developers and offers a single, elegant interface that abstracts dozens of enterprise integrations. Learn more and get started at WorkOS.com
  • Eight SleepTake your sleep and recovery to the next level. Go to eightsleep.com/PRACTICALAI and use the code PRACTICALAI to get $350 off your very own Pod 4 Ultra. You can try it for free for 30 days - but we’re confident you will not want to return it. Once you experience AI-optimized sleep, you’ll wonder how you ever slept without it. Currently shipping to: United States, Canada, United Kingdom, Europe, and Australia.

Featuring:

Show Notes:

Something missing or broken? PRs welcome!

10/09/24 • 57 min

profile image

1 Listener

plus icon
bookmark
Share icon

Practical AI: Machine Learning, Data Science, LLM - Towards high-quality (maybe synthetic) datasets

Transcript

Transcript for Practical AI #290 Daniel Whitenack:

Welcome to another episode of the Practical AI Podcast. This is Daniel Whitenack. I am CEO at Prediction Guard, where we're building a private, secure gen AI platform, and I'm joined as always by Chris Benson, who is a Principal AI Research Engineer at Lockheed Martin. How are you doing, Chris?

Chris Benson:

Great today, Daniel. How are you?

Daniel Whitenack:

It's a beautiful, beautiful fa

Generate a badge

Get a badge for your website that links back to this episode

Select type & size
Open dropdown icon
share badge image

<a href="https://goodpods.com/podcasts/practical-ai-machine-learning-data-science-llm-57254/towards-high-quality-maybe-synthetic-datasets-75918507"> <img src="https://storage.googleapis.com/goodpods-images-bucket/badges/generic-badge-1.svg" alt="listen to towards high-quality (maybe synthetic) datasets on goodpods" style="width: 225px" /> </a>

Copy