DataTalks.Club
1 Creator
Top 10 DataTalks.Club Episodes
Goodpods has curated a list of the 10 best DataTalks.Club episodes, ranked by the number of listens and likes each episode has garnered from our listeners. If you are listening to DataTalks.Club for the first time, there's no better place to start than with one of these standout episodes. If you are a fan of the show, vote for your favorite DataTalks.Club episode by adding your comments to the episode page.
12/27/23 • 57 min
We talked about:
- Atita’s background
- How NLP relates to search
- Atita’s experience with Lucidworks and OpenSource Connections
- Atita’s experience with Qdrant and vector databases
- Utilizing vector search
- Major changes to search Atita has noticed throughout her career
- RAG (Retrieval-Augmented Generation)
- Building a chatbot out of transcripts with LLMs
- Ingesting the data and evaluating the results
- Keeping humans in the loop
- Application of vector databases for machine learning
- Collaborative filtering
- Atita’s resource recommendations
Links:
- LinkedIn: https://www.linkedin.com/in/atitaarora/
- Twitter: https://x.com/atitaarora
- Github: https://github.com/atarora
- Human-in-the-Loop Machine Learning: https://www.manning.com/books/human-in-the-loop-machine-learning
- Relevant Search: https://www.manning.com/books/relevant-search
- Let's learn about Vectors: https://hub.superlinked.com/
- Langchain: https://python.langchain.com/docs/get_started/introduction
- Qdrant blog: https://blog.qdrant.tech/
- OpenSource Connections Blog: https://opensourceconnections.com/blog/
Free ML Engineering course: http://mlzoomcamp.com
Join DataTalks.Club: https://datatalks.club/slack.html
Our events: https://datatalks.club/events.html
07/26/24 • 52 min
In this podcast episode, we talked with Guillaume Lemaître about navigating scikit-learn and imbalanced-learn.

🔗 CONNECT WITH Guillaume Lemaître
LinkedIn - https://www.linkedin.com/in/guillaume-lemaitre-b9404939/
Twitter - https://x.com/glemaitre58
Github - https://github.com/glemaitre
Website - https://glemaitre.github.io/

🔗 CONNECT WITH DataTalksClub
Join the community - https://datatalks-club.slack.com/join/shared_invite/zt-2hu0sjeic-ESN7uHt~aVWc8tD3PefSlA#/shared-invite/email
Subscribe to our Google calendar to have all our events in your calendar - https://calendar.google.com/calendar/u/0/r?cid=ZjhxaWRqbnEwamhzY3A4ODA5azFlZ2hzNjBAZ3JvdXAuY2FsZW5kYXIuZ29vZ2xlLmNvbQ
Check other upcoming events - https://lu.ma/dtc-events
LinkedIn - https://www.linkedin.com/company/datatalks-club/
Twitter - https://twitter.com/DataTalksClub
Website - https://datatalks.club/

🔗 CONNECT WITH ALEXEY
Twitter - https://twitter.com/Al_Grigor
Linkedin - https://www.linkedin.com/in/agrigorev/

🎙 ABOUT THE PODCAST
At DataTalksClub, we organize live podcasts that feature a diverse range of guests from the data field. Each podcast is a free-form conversation guided by a prepared set of questions, designed to learn about the guests’ career trajectories, life experiences, and practical advice. These insightful discussions draw on the expertise of data practitioners from various backgrounds. We stream the podcasts on YouTube, where each session is also recorded and published on our channel, complete with timestamps, a transcript, and important links.
You can access all the podcast episodes here - https://datatalks.club/podcast.html

📚 Check our free online courses
ML Engineering course - http://mlzoomcamp.com
Data Engineering course - https://github.com/DataTalksClub/data-engineering-zoomcamp
MLOps course - https://github.com/DataTalksClub/mlops-zoomcamp
Analytics in Stock Markets - https://github.com/DataTalksClub/stock-markets-analytics-zoomcamp
LLM course - https://github.com/DataTalksClub/llm-zoomcamp
Read about all our courses in one place - https://datatalks.club/blog/guide-to-free-online-courses-at-datatalks-club.html

👋🏼 GET IN TOUCH
If you want to support our community, use this link - https://github.com/sponsors/alexeygrigorev
If you're a company and want to support us, contact at [email protected]
01/22/24 • 54 min
We talked about:
- Rob’s background
- Going from software engineering to Bayesian modeling
- Frequentist vs Bayesian modeling approach
- About integrals
- Probabilistic programming and samplers
- MCMC and Hakaru
- Language vs library
- Encoding dependencies and relationships into a model
- Stan, HMC (Hamiltonian Monte Carlo), and NUTS
- Sources for learning about Bayesian modeling
- Reaching out to Rob
Links:
- Book 1: https://bayesiancomputationbook.com/welcome.html
- Book/Course: https://xcelab.net/rm/statistical-rethinking/
Free ML Engineering course: http://mlzoomcamp.com
Join DataTalks.Club: https://datatalks.club/slack.html
Our events: https://datatalks.club/events.html
08/06/21 • 13 min
We don't have an episode lined up for this week, but we recorded a small chat with Vladimir some time ago. Enjoy it!
We talked about:
- Vladimir's background
- Learning by answering questions
- Don't be afraid of being wrong
- Winning books
- Learning random things
- Approach learning as a machine learning project
Links:
- Vladimir on LinkedIn: https://www.linkedin.com/in/vladimir-finkelshtein/
Join DataTalks.Club: https://datatalks.club/slack.html
Our events: https://datatalks.club/events.html
Big Data Engineer vs Data Scientist - Roksolana Diachuk
07/09/21 • 61 min
Links:
- Twitter: https://twitter.com/dead_flowers22
- LinkedIn: https://www.linkedin.com/in/roksolanadiachuk/
Join DataTalks.Club: https://datatalks.club/slack.html
Our events: https://datatalks.club/events.html
LLMs for Everyone - Meryem Arik
07/28/23 • 55 min
We talked about:
- Meryem's background
- The constant evolution of startups
- How Meryem became interested in LLMs
- What is an LLM (generative vs non-generative models)?
- Why LLMs are important
- Open source models vs API models
- What TitanML does
- How fine-tuning a model helps in LLM use cases
- Fine-tuning generative models
- How generative models change the landscape of human work
- How to adjust models over time
- Vector databases and LLMs
- How to choose an open source LLM or an API
- Measuring input data quality
- Meryem's resource recommendations
Links:
- Website: https://www.titanml.co/
- Beta docs: https://titanml.gitbook.io/iris-documentation/overview/guide-to-titanml...
- Using llama2.0 in TitanML Blog: https://medium.com/@TitanML/the-easiest-way-to-fine-tune-and-inference-llama-2-0-8d8900a57d57
- Discord: https://discord.gg/83RmHTjZgf
- Meryem LinkedIn: https://www.linkedin.com/in/meryemarik/
Free MLOps course: https://github.com/DataTalksClub/mlops-zoomcamp
Join DataTalks.Club: https://datatalks.club/slack.html
Our events: https://datatalks.club/events.html
08/15/24 • 53 min
0:00
Hi everyone, welcome to our event. This event is brought to you by DataTalks.Club, which is a community of people who love data. We have weekly events, and today's is one of such events. I guess we are also a community of people who like to wake up early, if you're from the States, right, Christopher? Or maybe not so much, because this is the time we usually have our events. For our guests and presenters from the States we usually do it in the evening, Berlin time, but unfortunately it kind of slipped my mind.

0:34
But anyway, we have a lot of events. You can check them in the description; there's a link. I don't think there are a lot of them right now on that link, but we will be adding more and more. I think we have five or six interviews scheduled, so keep an eye on that. Do not forget to subscribe to our YouTube channel; this way you will get notified about all our future streams, which will be as awesome as the one today. And of course, very important: do not forget to join our community, where you can hang out with other data enthusiasts.

1:09
During today's interview you can ask any question. There's a pinned link in the live chat, so click on that link, ask your question, and we will be covering these questions during the interview.

1:18
Now I will stop sharing my screen. And there's a message, and Christopher, it is from you. We actually have this on YouTube, so they have not seen what you wrote, but there is a message to anyone who's watching this right now from Christopher, saying hello everyone.

1:46
Can I call you Chris, or...

OK, I should look on YouTube then.

OK, yeah, but anyway, you will need to focus on answering questions, and I'll be keeping an eye on all the questions. So, if you're ready, we can start.

I'm ready.

And you prefer Christopher, not Chris, right?

Chris is fine.

Chris is fine, it's a bit shorter.

2:18
OK, so this week we'll talk about DataOps again. Maybe it's a tradition that we talk about DataOps once per year, but we actually skipped one year, because we haven't had Chris for some time. So today we have a very special guest, Christopher. Christopher is the co-founder, CEO, and head chef, or head cook, at DataKitchen, with 25 years of experience. Maybe this is outdated, because probably now you have more, and maybe you stopped counting, I don't know, but with tons of years of experience in analytics and software engineering. Christopher is known as the co-author of the DataOps Cookbook and the DataOps Manifesto. It's not the first time we have Christopher here on the podcast: we interviewed him two years ago, also about DataOps, and this one will be about DataOps too, so we'll catch up and see what actually changed in these two years. So, welcome to the interview.

3:13
Well, thank you for having me. I'm happy to be here and talking all things related to DataOps and why bother with DataOps, and happy to talk about the company or what's changed. Excited.

3:30
Yeah, so let's dive in. The questions for today's interview are prepared by Johanna Bayer, as always. Thanks, Johanna, for your help. So before we start with our main topic for today, DataOps, let's start with your background. Can you tell us about your career journey so far? For those who have not listened to the previous podcast, maybe you can talk about yourself, and for those who did listen to the previous one, you can also maybe give a summary of what has changed in the last two years.

4:03
Yeah, so my name is Chris. I guess I'm sort of an engineer. I spent about the first 15 years of my career in software, working and building some AI systems, some non-AI systems, at NASA and MIT Lincoln Lab, and then some startups, and then Microsoft. Then about 2005 I got the data bug. I think, you know, my kids were small and I thought, oh, this data thing was easy and I'd be able to go home for dinner at 5 and life would be fine, because I was a big...

You started your own company, right?

And it didn't work out that way, and a...
01/24/24 • 55 min
We talked about:
- Ivan’s background
- How Ivan became interested in investing
- Getting financial data to run simulations
- Open, High, Low, Close, Volume
- Risk management strategy
- Testing your trading strategies
- Sticking to your strategy
- Important metrics and remembering about trading fees
- Important features
- Deployment
- How DataTalks.Club courses helped Ivan
- Ivan’s site and course sign-up
Links:
- Exploring Finance APIs: https://pythoninvest.com/long-read/exploring-finance-apis
- Python Invest Blog Articles: https://pythoninvest.com/blog
Free ML Engineering course: http://mlzoomcamp.com
Join DataTalks.Club: https://datatalks.club/slack.html
Our events: https://datatalks.club/events.html
04/19/24 • 51 min
Links:
- Biodiversity and Artificial Intelligence pdf: https://www.gpai.ai/projects/responsible-ai/environment/biodiversity-and-AI-opportunities-recommendations-for-action.pdf
Free Data Engineering course: https://github.com/DataTalksClub/data-engineering-zoomcamp
Join DataTalks.Club: https://datatalks.club/slack.html
Our events: https://datatalks.club/events.html
FAQ
How many episodes does DataTalks.Club have?
DataTalks.Club currently has 173 episodes available.
What topics does DataTalks.Club cover?
The podcast is about Podcasts and Technology.
What is the most popular episode on DataTalks.Club?
The episode titled 'Standing out as a Data Scientist - Luke Whipps' is the most popular.
What is the average episode length on DataTalks.Club?
The average episode length on DataTalks.Club is 55 minutes.
How often are episodes of DataTalks.Club released?
Episodes of DataTalks.Club are typically released every 7 days.
When was the first episode of DataTalks.Club?
The first episode of DataTalks.Club was released on Nov 21, 2020.