The Real Python Podcast
Real Python
2 Listeners
All episodes
Best episodes
Top 10 The Real Python Podcast Episodes
Goodpods has curated a list of the 10 best The Real Python Podcast episodes, ranked by the number of listens and likes each episode have garnered from our listeners. If you are listening to The Real Python Podcast for the first time, there's no better place to start than with one of these standout episodes. If you are a fan of the show, vote for your favorite The Real Python Podcast episode by adding your comments to the episode page.
Natural Language Processing and How ML Models Understand Text
The Real Python Podcast
07/29/22 • 58 min
How do you process and classify text documents in Python? What are the fundamental techniques and building blocks for Natural Language Processing (NLP)? This week on the show, Jodie Burchell, developer advocate for data science at JetBrains, talks about how machine learning (ML) models understand text.
Jodie explains how ML models require data in a structured format, which involves transforming text documents into columns and rows. She covers the most straightforward approach, called binary vectorization. We discuss the bag-of-words method and the tools of stemming, lemmatization, and count vectorization.
We jump into word embedding models next. Jodie talks about WordNet, Natural Language Toolkit (NLTK), word2vec, and Gensim. Our conversation lays a foundation for starting with text classification, implementing sentiment analysis, and building projects using these tools. Jodie also shares multiple resources to help you continue exploring NLP and modeling.
Course Spotlight: Learn Text Classification With Python and Keras
In this course, you’ll learn about Python text classification with Keras, working your way from a bag-of-words model with logistic regression to more advanced methods, such as convolutional neural networks. You’ll see how you can use pretrained word embeddings, and you’ll squeeze more performance out of your model through hyperparameter optimization.
Topics:
- 00:00:00 – Introduction
- 00:02:47 – Exploring the topic
- 00:06:00 – Perceived sentience of LaMDA
- 00:10:24 – How do we get started?
- 00:11:16 – What are classification and sentiment analysis?
- 00:13:03 – Transforming text in rows and columns
- 00:14:47 – Sponsor: Snyk
- 00:15:27 – Bag-of-words approach
- 00:19:12 – Stemming and lemmatization
- 00:22:05 – Capturing N-grams
- 00:25:34 – Count vectorization
- 00:27:14 – Stop words
- 00:28:46 – Text Frequency / Inverse Document Frequency (TFIDF) vectorization
- 00:32:28 – Potential projects for bag-of-words techniques
- 00:34:07 – Video Course Spotlight
- 00:35:20 – WordNet and NLTK package
- 00:37:27 – Word embeddings and word2vec
- 00:45:30 – Previous training and too many dimensions
- 00:50:07 – How to use word2vec and Gensim?
- 00:51:26 – What types of projects for word2vec and Gensim?
- 00:54:41 – Getting into GPT and BERT in another episode
- 00:56:11 – How to follow Jodie’s work?
- 00:57:36 – Thanks and goodbye
Show Links:
- Why Google’s “sentient” AI LaMDA is nothing like a person.
- On NYT Magazine on AI: Resist the Urge to be Impressed | Emily M. Bender | Medium
- ELIZA - Wikipedia
- eliza.py - Python 2 version by Daniel Connelly
- dabraude/Pyliza: Python3 Implementation of Eliza
- magneticpoetry.com
- Natural Language Processing With Python’s NLTK Package – Real Python
- Practical Text Classification With Python and Keras – Real Python
- Sentiment Analysis: First Steps With Python’s NLTK Library – Real Python
- NLTK: Natural Language Toolkit
- spaCy · Industrial-strength Natural Language Processing in Python
- Natural Language Processing With spaCy in Python - Real Python
- Stemming - Wikipedia
- Lemmatization - Wikipedia
- Binary/Count Vectorization: sklearn.feature_extraction.text.CountVectorizer— scikit-learn
- TFIDF: sklearn.feature_extraction.text.TfidfVectorizer — scikit-learn
- Porter Stemmer: nltk.stem.porter module — NLTK
- Snowbal...
1 Listener
Moving NLP Forward With Transformer Models and Attention
The Real Python Podcast
08/12/22 • 50 min
What’s the big breakthrough for Natural Language Processing (NLP) that has dramatically advanced machine learning into deep learning? What makes these transformer models unique, and what defines “attention?” This week on the show, Jodie Burchell, developer advocate for data science at JetBrains, continues our talk about how machine learning (ML) models understand and generate text.
This episode is a continuation of the conversation in episode #119. Jodie builds on the concepts of bag-of-words, word2vec, and simple embedding models. We talk about the breakthrough mechanism called “attention,” which allows for parallelization in building models.
We also discuss the two major transformer models, BERT and GPT3. Jodie continues to share multiple resources to help you continue exploring modeling and NLP with Python.
Course Spotlight: Building a Neural Network & Making Predictions With Python AI
In this step-by-step course, you’ll build a neural network from scratch as an introduction to the world of artificial intelligence (AI) in Python. You’ll learn how to train your neural network and make predictions based on a given dataset.
Topics:
- 00:00:00 – Introduction
- 00:02:20 – Where we left off with word2vec...
- 00:03:35 – Example of losing context
- 00:06:50 – Working at scale and adding attention
- 00:12:34 – Multiple levels of training for the model
- 00:14:10 – Attention is the basis for transformer models
- 00:15:07 – BERT (Bidirectional Encoder Representations from Transformers)
- 00:16:29 – GPT (Generative Pre-trained Transformer)
- 00:19:08 – Video Course Spotlight
- 00:20:08 – How far have we moved forward?
- 00:20:41 – Access to GPT-2 via Hugging Face
- 00:23:56 – How to access and use these models?
- 00:30:42 – Cost of training GPT-3
- 00:35:01 – Resources to practice and learn with BERT
- 00:38:19 – GPT-3 and GitHub Copilot
- 00:44:35 – DALL-E is a transformer
- 00:46:13 – Help yourself to the show notes!
- 00:49:19 – How can people follow your work?
- 00:50:03 – Thanks and goodbye
Show Links:
- Recurrent neural network - Wikipedia
- Long short-term memory - Wikipedia
- Vanishing gradient problem - Wikipedia
- Vanishing Gradient Problem | What is Vanishing Gradient Problem?
- Attention Is All You Need | Cornell University
- Visualizing A Neural Machine Translation Model (Mechanics of Seq2seq Models With Attention) – Jay Alammar
- Standing on the Shoulders of Giant Frozen Language Models | Cornell University
- #datalift22 Embeddings paradigm shift: Model training to vector similarity search by Nava Levy - YouTube
- Transformer Neural Networks - EXPLAINED! (Attention is all you need) - YouTube
- BERT 101 - State Of The Art NLP Model Explained
- How GPT3 Works - Easily Explained with Animations - YouTube
- Write With Transformer (GPT2 Live Playground Tool) - Hugging Face
- Language Model with Alpa (GPT3 Live Playground Tool) OPT-175B
- Big Data | Music
- OpenAI API
- 🤗 (Hugging Face)Transformers Notebooks
- GitHub Copilot: Fly With Python at the Speed of Thought
- GitHub Copilot learned about the daily struggle of JavaScript developers after being trained on billions of lines of code. | Marek Sotak on Twitter
- Jodie Burchell’s Blog - Standard error
- Jodie Burchell 🇦🇺🇩🇪 (@t_redactyl) / Twitter
- JetBrains: Essential tools for software ...
1 Listener
Computational Thinking & Learning Python During an AI Revolution
The Real Python Podcast
11/17/23 • 54 min
Has the current growth of artificial intelligence (AI) systems made you wonder what the future holds for Python developers? What are the hidden benefits of learning to program in Python and practicing computational thinking? This week on the show, we speak with author Lawrence Gray about his upcoming book “Mastering Python: A Problem Solving Approach.”
Lawrence shares how learning Python helped him through a dark and trying time. He developed lifelong skills that he wants to pass along through teaching and authoring a book.
We discuss what you can do to prepare for a future where coding jobs are automated through AI. He shares ways that Python can help build the higher-order thinking skills required by future careers. We also talk about how Python can help with computational thinking and promote cognitive development.
This week’s episode is brought to you by Site24x7.
Course Spotlight: Looping With Python enumerate()
Once you learn about for loops in Python, you know that using an index to access items in a sequence isn’t very Pythonic. So what do you do when you need that index value? In this course, you’ll learn all about Python’s built-in enumerate(), where it’s used, and how you can emulate its behavior.
Topics:
- 00:00:00 – Introduction
- 00:02:29 – Learning Python and changing how you think
- 00:05:07 – What is the goal of the book?
- 00:05:59 – Discovering Python during a dark period
- 00:10:32 – What is unique to Python that helped you?
- 00:12:43 – How did you start teaching Python?
- 00:14:40 – Teaching Python to art students
- 00:18:52 – Sponsor: Site24x7.com
- 00:19:48 – Why write about computational thinking?
- 00:21:55 – Why learn Python now?
- 00:25:18 – Multiple modalities for Python
- 00:29:32 – Other optimistic thoughts on the future
- 00:34:15 – Reskilling a workforce
- 00:37:16 – What skills can be developed?
- 00:41:13 – Video Course Spotlight
- 00:42:40 – Bloom’s taxonomy
- 00:48:00 – Sharing the positive impact of Python
- 00:50:26 – What are you excited about in the world of Python?
- 00:51:32 – What do you want to learn next?
- 00:52:45 – How can people follow your work online?
- 00:52:47 – Thanks and goodbye
Show Links:
- The Future of Jobs Report 2023 - World Economic Forum
- Mastering Python: Get Access to Chapter 1
- Computational thinking - Wikipedia
- Bloom’s taxonomy - Wikipedia
- Probably Overthinking It – Data science, Bayesian Statistics, and other ideas
- scikit-learn: machine learning in Python - documentation
- Yellowbrick: Machine Learning Visualization - documentation
- PyCon US 2024 - PyCon US 2024
- Education Summit - PyCon US 2023
- PyTorch - Get Started
- Lawrence Gray Personal Site
Level up your Python skills with our expert-led courses:
1 Listener
Practical Python Decorator Uses & Avoiding datetime Pitfalls
The Real Python Podcast
02/16/24 • 57 min
What are real-life examples of using Python decorators? How can you harness their power in your code? Christopher Trudeau is back on the show this week, bringing another batch of PyCoder’s Weekly articles and projects.
We discuss a recent article series that digs into Python decorators. The first two articles discuss the basics of constructing decorators. The third part describes how popular Python libraries use decorators with call interception, function registration, and enriching the behavior of a function.
Christopher shares a piece about the common pitfalls of working with the Pythondatetime library. The article considers how current third-party libraries don’t address most of these quirks and offers a potential solution with a new library.
We also share several other articles and projects from the Python community, including a couple of news items, a discussion about the popularity of the Rust language, handling unset values in FastAPI with Pydantic, working with Python’s mini-language for formatting strings, mocking Django queryset functions, and a modern replacement for the Requests library.
This week’s episode is brought to you by Sentry.
Course Spotlight: Python Decorators 101
In this course on Python decorators, you’ll learn what they are and how to create and use them. Decorators provide a simple syntax for calling higher-order functions in Python. By definition, a decorator is a function that takes another function and extends the behavior of the latter function without explicitly modifying it.
Topics:
- 00:00:00 – Introduction
- 00:02:53 – Django security releases issued: 5.0.2, 4.2.10, and 3.2.24
- 00:03:10 – Python 3.12.2 and 3.11.8 are now available
- 00:03:21 – Introducing PSF Grants Program Office Hours
- 00:04:19 – Python’s Format Mini-Language for Tidy Strings
- 00:12:22 – Ten Python datetime Pitfalls
- 00:18:34 – Sponsor: Sentry
- 00:19:37 – Real Life Use of Decorators
- 00:29:18 – Handling Unset Values in FastAPI With Pydantic
- 00:35:43 – Video Course Spotlight
- 00:37:06 – The Python Rust-Aissance
- 00:50:19 – django-mock-queries: Mock Django Queryset Functions
- 00:53:09 – niquests: Requests but Multiplexed
- 00:55:55 – Thanks and goodbye
News:
- Django security releases issued: 5.0.2, 4.2.10, and 3.2.24
- Python 3.12.2 and 3.11.8 are now available
- Introducing PSF Grants Program Office Hours
Show Links:
- Python’s Format Mini-Language for Tidy Strings – In this tutorial, you’ll learn about Python’s format mini-language. See how to use it for creating working format specifiers and build nicely formatted strings and messages in your code.
- Ten Python datetime Pitfalls – It’s no secret that the Python datetime library has its quirks. Not only are there probably more than you think, but third-party libraries don’t address most of them! Arie created a new library to explore what a better datetime library could look like.
- Real Life Use of Decorators – Part 3 in a series on how Python decorators are used. This part covers real-life use cases including call interception, function registration, and behavioral enrichment.
- Handling Unset Values in FastAPI With Pydantic – When using the HTTP PATCH method only those fields that got changed are updated. Pydantic sets fields not given as arguments as None so there is no way to distinguish between an explicit None value and an unset field. This post explains how you process this scenario.
Discussion:
- The Python Rust-Aissance – Companies like Polars are showing how with Rust, Python developers now have a better, smoother path towards building high-performance libraries.
- Rye: A Python Developer Experience Vision Continued
- PyO3: Rust bindings for the Python interpreter
- Rust in Linux: Where we are...
1 Listener
Our New "Python Basics" Book & Filling the Gaps in Your Learning Path
The Real Python Podcast
10/23/20 • 50 min
Do you have gaps in your Python learning path? If you’re like me, you may have followed a completely random route to learn Python. This week on the show, David Amos is here to talk about the release of the Real Python book, “Python Basics: A Practical Introduction to Python 3”. The book is designed not only to get beginners up to speed but also to help fill in the gaps many intermediate learners may still have.
David has been working on the book for the last two years, and we dive into all the resources that come with it. These include code challenges, quizzes, and multiple projects that are designed to help you cement your learning. We also discuss the people and processes involved in creating, reviewing, and updating the book.
Spotlight: Python Basics: A Practical Introduction to Python 3
Go from beginner to intermediate in Python with this complete curriculum, up-to-date for Python 3.9. Python Basics includes exercises, interactive quizzes, and sample projects, so you’ll always know what to focus on next in order to build a strong Python foundation.
Topics:
- 00:00:00 – Introduction
- 00:01:36 – Python Basics: A Practical Introduction to Python 3
- 00:02:39 – How has feedback helped the process?
- 00:04:17 – Who is the intended audience?
- 00:06:02 – Covering how to get Python installed on different platforms?
- 00:11:05 – What topics does the book cover?
- 00:14:12 – What can be previewed on Real Python?
- 00:15:03 – What format can you get the book in?
- 00:18:30 – Code challenges included!
- 00:21:33 – What other resources are provided to help with cementing your learning?
- 00:22:41 – What versions of Python are covered?
- 00:23:08 – How does the book fit into the Real Python learning eco-system?
- 00:29:35 – Spotlight: How to get a preview of the book!
- 00:30:38 – What has the writing process been like?
- 00:33:21 – What does didactic mean, in terms of reviewing materials?
- 00:39:23 – What were areas you were excited about updating?
- 00:41:29 – Were there important things you felt needed to be added?
- 00:45:55 – Who worked on the book?
- 00:47:13 – What are you excited about in the world of Python?
- 00:48:13 – What do you want to learn next?
- 00:49:40 – Thanks and goodbye
Show Links:
- Python Basics: A Practical Introduction to Python 3
- Python 3 Installation & Setup Guide
- Create and Modify PDF Files in Python
- Python GUI Programming With Tkinter
- Object-Oriented Programming (OOP) in Python 3
- A Practical Introduction to Web Scraping in Python
Level up your Python skills with our expert-led courses:
Options for Packaging Your Python Application: Wheels, Docker, and More
The Real Python Podcast
08/28/20 • 74 min
Have you wondered, how should I package my Python code? You’ve written the application, but now you need to distribute it to the machines it’s intended to run on. It depends on what the code is, the libraries it depends on, and with whom do you want to share it. This week on the show we have Itamar Turner-Trauring, creator of the website pythonspeed.com. We discuss his article “Options for Packaging Your Python Code: Wheels, Conda, Docker, and More,” covering the how of sharing your code.
Itamar also briefly discusses his Python memory profiler named Fil. We talk about his recent PyCon 2020 presentation, “Small Big Data: What to do When Your Data Doesn’t Fit in Memory.” We also cover several of the resources available on his website for data scientists that want to get deeper into Docker.
Course Spotlight: Python Coding Interviews: Tips & Best Practices
In this step-by-step course, you’ll learn how to take your Python coding interview skills to the next level and use Python’s built-in functions and modules to solve problems faster and more easily.
Topics:
- 00:00:00 – Introduction
- 00:01:36 – About the naming of pythonspeed.com
- 00:03:47 – Fil - Python Memory Profiler
- 00:06:44 – Small Big Data: What to do when your data doesn’t fit in memory - PyCon 2020
- 00:12:17 – Options for packaging your Python code: Wheels, Conda, Docker, and more
- 00:15:13 – Python Wheels
- 00:19:22 – pipx: Install and Run Python Applications in Isolated Environments
- 00:20:52 – PEX, and friends
- 00:24:51 – System Package, RPM or DEB
- 00:29:42 – Conda Packaging and conda-forge
- 00:36:09 – Video Course Spotlight
- 00:37:23 – Self-contained executable: PyInstaller, PyOxidizer, Briefcase
- 00:43:45 – Container image (Docker, Singularity)
- 00:54:55 – Why alpine may not be the best choice
- 01:05:28 – Singularity
- 01:07:50 – What are you excited about in the world of Python?
- 01:10:40 – What do you want to learn next?
- 01:13:54 – Thanks and Goodbye
Show Links:
- Python => Speed: Ship Better Python Software, Faster
- Code Without Rules: Helping You Become a Productive Programmer and Get Work/Life Balance
- Talk Python to Me – Episode #274: Profiling Data Science Code with FIL
- Fil: A New Python Memory Profiler for Data Scientists and Scientists
- Small Big Data: What to do When Your Data Doesn’t Fit in Memory - PyCon 2020
- Episode 16: Thinking in Pandas: Python Data Analysis the Right Way
- Options for Packaging Your Python Code: Wheels, Conda, Docker, and More
- What Are Python Wheels and Why Should You Care?: Real Python article
- pipx — Install and Run Python Applications in Isolated Environments
- pex: A Library and Tool for Generating .pex (Python EXecutable) Files
- WTF is PEX?: Twitter Lightning Talk - YouTube
- RPM (Red Hat Package Manager): Wikipedia article
- DEB (Debian Package - file format): Wikipedia article
- Conda: Package, Dependency and Environment Management for Any Language
- conda-forge: A community-led collection of recipes, build infrastructure and distributions for the conda package manager
- PyInstaller: Freezes (packages) Python applications into stand-alone executables
- PyOxidizer: A utility for producing binaries that embed Python
- Briefcase: Convert a Python project into a standalone native application
- Docker: Get Started with Docker
- Just Enough Docker Packaging: Book
- Ep...
Securing Your Python Software Supply Chain With Dustin Ingram
The Real Python Podcast
07/02/21 • 71 min
How well do you know your software supply chain? When you PIP install a package, what steps can you take to minimize the risk of installing something malicious? This week on the show, we have Dustin Ingram, a director of the Python Software Foundation (PSF) and a maintainer of the Python Package Index (PyPI).
We talk about Dustin’s PyCon 2021 talk titled “Secure Software Supply Chains for Python”. Dustin shares the types of attacks you should be aware of and how you can make your supply chain more trustworthy. We cover tools, techniques, and best practices.
Dustin also discusses what it takes to keep the Python Package Index running and the players working to keep it going into the future.
Course Spotlight: A Beginner’s Guide to Pip
This course is a great introduction to pip for those who are getting started Python, and for those who want to understand more about what is happening when you install new packages into your environment. It’s a worthy investment of your time to understand the fundamentals of pip.
Topics:
- 00:00:00 – Introduction
- 00:01:51 – Developer Advocate at Google
- 00:04:34 – A director of the PSF
- 00:06:27 – A maintainer of PyPI
- 00:12:29 – Secure Software Supply Chains for Python - PyCon 2021
- 00:15:53 – Do I need to be a security expert as a Python developer?
- 00:17:23 – Typo-squatting of package names
- 00:19:46 – Sponsor: Scout APM
- 00:20:52 – Dependency confusion and private repos
- 00:26:00 – What are some best practices?
- 00:31:55 – How to lessen the scale of “I don’t know what I don’t know”?
- 00:36:33 – Tools and techniques that can help
- 00:44:11 – Video Course Spotlight
- 00:45:30 – Namespaces on PyPI
- 00:53:03 – What does it take to power the Python Package Index?
- 01:01:57 – What are you excited about in the world of Python?
- 01:03:55 – What do you want to learn next?
- 01:05:52 – What is something you thought you knew about Python, but were wrong about it?
- 01:08:46 – Shout outs and social information
- 01:10:16 – Thanks and goodbye
Show Links:
- Dustin Ingram: Personal Website
- Python on Google Cloud
- Cloud Run: Develop and deploy highly scalable containerized applications on a fully managed serverless platform
- Python Software Foundation
- PSF Membership FAQ
- PyPI: The Python Package Index
- Secure Software Supply Chains for Python: PyCon 2021 - YouTube
- pip Documentation: Requirements Files
- pip Documentation: Hash-Checking Mode
- PEP-0440: Direct references for pip
- pip-tools: pip-tools keeps your pinned dependencies fresh
- PyPA: Python Packaging User Guide
- The Update Framework (TUF)
- tuf: A secure updater framework for Python
- pipx: Install and Run Python Applications in Isolated Environments
- How to Publish an Open-Source Python Package to PyPI - Real Python Article
- Poetry: Python packaging and dependency management made easy
- PyUp: Python Dependency Security
- Dependabot: Automated dependency updates
- Why Package Signing is not the Holy Grail: Donald Stufft
- Dependency Confusion: How I Hacked Into Apple, Microsoft and Dozens of Other Companies
- What Is Pip? A Guide for New Pythonistas - Real Python Article
- A ‘Worst Nightmare’ Cyberattack: The Untold Story Of The SolarWinds Hack
Building a Content Aggregator and Working With RSS in Python
The Real Python Podcast
11/19/21 • 57 min
Have you wanted to work with RSS feeds in Python? Maybe you’re looking for a new project to build for your portfolio that uses Django, unit tests, and custom commands. This week on the show, we have Real Python author Ricky White to talk about his recent step-by-step project titled, “Build a Content Aggregator in Python.”
Ricky has been authoring the Real Python interview series for several years and was formerly our Community Manager. He talks about what inspired him to create this project and the Python technology and libraries to build it. He also shares advice about adding tests to personal portfolio projects.
We start the show by discussing Python’s GIL (Global Interpreter Lock) and the efforts to potentially remove it in future versions of Python. This change could make a significant impact on Python code running on multi-core processors. We talk about two recent articles covering the developments.
Course Spotlight: Get Started With Django: Build a Portfolio App
In this course, you’ll learn the basics of creating powerful web applications with Django, a Python web framework. You’ll build a portfolio website to showcase your web development projects, complete with a fully functioning blog.
Topics:
- 00:00:00 – Introduction
- 00:02:07 – Citizenship
- 00:03:52 – Ricky’s Real Python interviews
- 00:05:55 – Upcoming interview with Eric Wastl about Advent of Code
- 00:08:05 – Notes From the Meeting On Python GIL Removal
- 00:18:41 – Sponsor: Cloudsmith
- 00:19:26 – Build a Content Aggregator in Python
- 00:20:28 – Django background
- 00:23:37 – What web technologies were you using before Python?
- 00:25:07 – What motivated the project?
- 00:26:46 – Technical hurdles
- 00:30:52 – Including tests in a portfolio project
- 00:32:56 – Django custom commands
- 00:37:02 – Video Course Spotlight
- 00:38:16 – RSS Feeds - Really Simple Syndication and Podcasts
- 00:42:16 – Working with django-apscheduler
- 00:47:06 – Taking the project further and CSS frameworks
- 00:51:04 – What are you excited about in the world of Python?
- 00:53:58 – What do you want to learn next?
- 00:55:52 – Shoutouts and social connections
- 00:56:57 – Thanks and goodbye
Show Links:
- About Ricky White – Real Python
- Build a Content Aggregator in Python: Real Python project-based tutorial
- About - Advent of Code 2021
- A viable solution for Python concurrency: LWN.net
- Notes From the Meeting On Python GIL Removal Between Python Core and Sam Gross
- What Is the Python Global Interpreter Lock (GIL)? – Real Python
- nogil: Python Multithreading without GIL
- Walk AS One
- What is Ankylosing Spondylitis?
- Spondy News
- Writing custom django-admin commands | Django documentation
- django-apscheduler - A Django app that adds a lightweight wrapper around APScheduler
- RabbitMQ - Messaging that just works
- RSS: Really Simple Syndication- Wikipedia
- Flipboard - Stories from 28,875 topics personalized for you
- NetNewsWire: Free and Open Source RSS Reader for Mac and iOS
- Welcome to Feedly
- Bootstrap - The most popular HTML, CSS, and JS library in the world
- Tailwind CSS - Rapidly build modern websites without ever leaving your HTML
- Python Software Foundation News: 2021 End of the year fundraiser!
- Structural Pattern Matching Python 3.10: Cool New Features for You to Try
Expanding the International Python Community With the PSF
The Real Python Podcast
06/18/21 • 60 min
The popularity of Python is continuing to grow Developers across the globe are embracing the language. How is Python being used in all of these different countries? How does an organization like the Python Software Foundation (PSF) work toward the goals in its mission statement for supporting and growing this international community? This week on the show, we have Marlene Mhangami, a PSF board member and part of the Diversity and Inclusion Work Group.
Marlene lives in Zimbabwe on the continent of Africa. She has been organizing events not only locally in Zimbabwe but across all of Africa. She is the chair of Pycon Africa and has given talks at Pycon US, Pycon UK, and Pycon India.
She has been working locally as an organizer and educator. We talk about the challenges of teaching technology and programming to a population of young people. Some of these students don’t have access to computers.
She is also currently pursuing a computer science degree with the University of London. Along with her studies, she is also interning with NVidia. She is working with them on the RAPIDS project with a focus on the cuDF library.
Spotlight: Introduction to Sorting Algorithms in Python
In this course, you’ll learn all about five different sorting algorithms in Python from both a theoretical and a practical standpoint. You’ll also learn several related and important concepts, including Big O notation and recursion.
Topics:
- 00:00:00 – Introduction
- 00:02:05 – Connecting during PyCon 2021
- 00:03:08 – Roles with the Python Software Foundation (PSF)
- 00:05:39 – Python in Africa
- 00:10:31 – School overseas and return to Zimbabwe to build a Python community
- 00:13:47 – Teaching technology and Python to students who don’t have computers
- 00:22:14 – Sponsor: Digital Ocean’s App Platform
- 00:22:50 – Work with the PSF and building geographic diversity
- 00:27:49 – PSF work groups
- 00:32:10 – Organizing PyCon Africa 2019 and bringing a continent of communities together
- 00:35:51 – How is Python being used in Africa?
- 00:38:05 – Video Course Spotlight
- 00:39:24 – Working with NVidia RAPIDS and cuDF
- 00:43:57 – What are you excited about in the world of Python?
- 00:46:12 – What do you want to learn next?
- 00:52:45 – What is something you thought you knew about Python but were wrong about it?
- 00:59:41 – Thanks and goodbye
Show Links:
- Marlene Mhangami: Personal Website
- Python Software Foundation
- PSF Membership
- PyCon US 2021: YouTube Playlist
- Real Python Interview With Marlene Mhangami
- PyCon Africa Conference 2019 Highlights & Interviews: YouTube
- Django Girls
- RAPIDS: Open GPU Data Science
- 10 Minutes to Data Science: Transitioning Between RAPIDS cuDF and CuPy Libraries
Level up your Python skills with our expert-led courses:
Docker + Python for Data Science and Machine Learning
The Real Python Podcast
05/08/20 • 55 min
Docker is a common tool for Python developers creating and deploying applications, but what do you need to know if you want to use Docker for data science and machine learning? What are the best practices if you want to start using containers for your scientific projects? This week we have Tania Allard on the show. She is a Sr. Developer Advocate at Microsoft focusing on Machine Learning, scientific computing, research and open source.
Tania has created a talk for the PyCon US 2020 which is now online. The talk is titled “Docker and Python: Making them Play Nicely and Securely for Data Science and ML.” Her talk draws on her expertise in the improvement of processes, reproducibility and transparency in research and data science. We discuss a variety of tools for making your containers more secure and results reproducible.
Tania is passionate about mentoring, open-source, and its community. She is an organizer for Mentored Sprints for Diverse Beginners, and she talks about the upcoming online sprints for PyCon US 2020. We also discuss her plans to start a podcast.
Topics:
- 00:00:00 – Introduction
- 00:01:43 – Microsoft Senior Developer Advocate Role
- 00:04:07 – PyCon 2020 Talk - Docker and Python: making them play nicely
- 00:05:34 – What is Docker?
- 00:10:08 – Reproducibility of project results
- 00:12:03 – What are the challenges of using Docker for machine learning?
- 00:15:06 – Getting started suggestions
- 00:16:26 – What metadata should be included?
- 00:17:48 – Creating images through stages
- 00:21:16 – What about your data?
- 00:22:40 – Kubernetes: Orchestrating containers
- 00:24:37 – Continuing stages into testing
- 00:25:37 – What are tools for testing security?
- 00:27:07 – Challenges in using containers for ML
- 00:28:52 – What types of databases?
- 00:29:39 – Are you doing initial research on a local machine?
- 00:30:59 – An example of a recent ML project
- 00:32:16 – Papermill: parameterizing and executing notebooks
- 00:33:16 – NLP: Natural Language Processing
- 00:33:58 – Kaggle: Help us better understand COVID-19
- 00:34:42 – What are other best practices for data intensive projects?
- 00:39:13 – Resources to get started in machine learning?
- 00:40:30 – Mentored Sprints for Diverse Beginners
- 00:45:34 – Tania’s upcoming podcast
- 00:48:38 – A visiting fellow at the Alan Turing Institute
- 00:49:08 – Weight lifting
- 00:50:16 – Craft beer
- 00:52:09 – What is something you thought you knew in Python but were wrong about?
- 00:53:50 – What are excited about in the world of Python?
- 00:54:42 – Thank you and Goodbye
Show links:
- Tania Allard: Personal site
- Docker and Python: making them play nicely and securely for Data Science and ML - Tania Allard
- Slides for Docker and Python Talk
- Docker
- XKCD: Python Superfund Site
- Best practices for writing Dockerfiles
- Run Python Versions in Docker: How to Try the Latest Python Release
- Kubernetes: Production-Grade Container Orchestration
- Snyk: Securing open source and containers
- papermill: A tool for parameterizing and executing Jupyter Notebooks
- Natural Language Processing: Wikipedia article
- Natural Language Processing With spaCy in Python: Real Python article
- Kaggle: Help us better understand COVID-19
- datree.io: Scale Engineering organization
- repo2docker: Build, Run, and Push Docker Images from Source Code Repositories
- Jupyter Docker Stacks: A set of ready-to-run Docker images
- binder: Turn a Git Repo into a Collection of Interactive Notebooks
- Hands...
Show more best episodes
Show more best episodes
Featured in these lists
FAQ
How many episodes does The Real Python Podcast have?
The Real Python Podcast currently has 233 episodes available.
What topics does The Real Python Podcast cover?
The podcast is about Open Source, Coding, Code, Development, Software, Podcasts, Dev, Technology, Developer, Data Science, Python and Programming.
What is the most popular episode on The Real Python Podcast?
The episode title 'Moving NLP Forward With Transformer Models and Attention' is the most popular.
What is the average episode length on The Real Python Podcast?
The average episode length on The Real Python Podcast is 59 minutes.
How often are episodes of The Real Python Podcast released?
Episodes of The Real Python Podcast are typically released every 7 days.
When was the first episode of The Real Python Podcast?
The first episode of The Real Python Podcast was released on Mar 6, 2020.
Show more FAQ
Show more FAQ