The Data Engineering Show

The Firebolt Data Bros

The Data Engineering Show is a podcast for data engineering and BI practitioners to go beyond theory. Learn from the biggest influencers in tech about their practical day-to-day data challenges and solutions in a casual and fun setting.

SEASON 1 DATA BROS
Eldad and Boaz Farkash shared the same stuffed toys growing up as well as a big passion for data. After founding Sisense and building it to become a high-growth analytics unicorn, they moved on to their next venture, Firebolt, a leading high-performance cloud data warehouse.

SEASON 2 DATA BROS
In season 2 Eldad adopted a brilliant new little brother, and with their shared love for query processing, the connection was immediate. After excelling in his MS in Computer Science, Benjamin Wagner joined Firebolt to lead its query processing team and is a rising star in the data space.

For inquiries contact [email protected]

Top 10 The Data Engineering Show Episodes

Goodpods has curated a list of the 10 best The Data Engineering Show episodes, ranked by the number of listens and likes each episode has garnered from our listeners. If you are listening to The Data Engineering Show for the first time, there's no better place to start than with one of these standout episodes. If you are a fan of the show, vote for your favorite The Data Engineering Show episode by adding your comments to the episode page.

In this episode of The Data Engineering Show, the bros sit down with Daniel Pálma, Head of Marketing at Estuary. Join them as they:
  • Talk about Daniel’s career transition from data engineering to marketing and how his data engineering background has strengthened his marketing work.
  • Discuss the role of AI in the evolution of data movement, making it faster and easier to create data pipelines.
  • Shine a light on the challenges of vector databases and structured data in AI applications.
  • Delve into the future of Apache Iceberg and data lakehouses, highlighting their current challenges.
  • Share insights on the golden age of data and the need for more data engineers, data analysts and data practitioners in the data space.
If you enjoyed this episode, make sure to subscribe, rate, and review it on Apple Podcasts, Spotify, and YouTube Podcasts; instructions on how to do this are [insert link].

Daniel Pálma serves as Head of Marketing at Estuary, bringing a unique blend of technical expertise and marketing acumen to the data integration space. With nearly a decade of experience as a data engineer across startups, enterprises, and consulting roles, Daniel made a strategic pivot to marketing to help bridge the gap between complex technical solutions and their practical applications for data practitioners. His background in data engineering enables him to deeply understand customers' challenges and create authentic, education-focused marketing content that resonates with technical audiences. Daniel’s thought leadership and content creation in the data engineering space, combined with his hands-on technical experience, position him as a valuable voice in conversations about the evolution of data infrastructure and integration technologies.

The Data Engineering Show is handcrafted by our friends over at: fame.so
Previous guests include: Joseph Machado of LinkedIn, Matthew Weingarten of Disney, Joe Reis and Matt Housley, authors of Fundamentals of Data Engineering, Zach Wilson of Eczachly Inc, Megan Lieu of Deepnote, Erik Heintare of Bolt, Lior Solomon of Vimeo, Krishna Naidu of Canva, Mike Cohen of Substack, Jens Larsson of Ark, Gunnar Tangring of Klarna, Yoav Shmaria of Similarweb and Xiaoxu Gao of Adyen.
Check out our three most downloaded episodes:
The Data Engineering Show - How Eventbrite is Modernizing its Data Stack
05/23/22 • 23 min

Archana shares Eventbrite’s data stack modernization process, and how you get engineers to adopt new technologies like dbt which may be outside their comfort zone.

The Data Engineering Show - How Amplitude Engineers Process 5 Trillion Real-time Events
01/05/23 • 27 min

Weichen Wang, Senior Engineering Manager at Amplitude, came to meet the bros to talk about Amplitude's cutting-edge data stack and how it processes 5 Trillion real-time events while dealing with mutable data and massive scale.

The Data Engineering Show - Making Observability a Key Business Driver
11/29/22 • 48 min

80% of the code that you write doesn’t work on the first try. And that’s fine. But knowing which 80% is not working and which 20% is working is the actual challenge. After 10 years at Facebook, managing and scaling the Seattle site to over 6,000 engineers(!), Vijaye Raji founded Statsig to make observability automated and real-time. How is the semantic layer managed? How was the Statsig team able to build an observability product that handles real-time, ever-changing metadata? What are Vijaye’s main takeaways from engineering at Facebook? Tune in.
The Data Engineering Show - A ClickHouse Review from a Practitioner’s Point of View
09/01/22 • 34 min

Sudeep Kumar, Principal Engineer at Salesforce, is a ClickHouse fan. He considers the shift to ClickHouse one of his biggest accomplishments during his eBay days and walks Boaz through his experience with the platform: how on one hand it handled 2B events per minute, but on the other it required rollups, which compromised granularity when extending time windows.

Besides a ClickHouse review from a practitioner’s point of view, Sudeep tells us about interesting use-cases he’s working on at Salesforce.

According to Maxime Beauchemin, CEO & Founder at Preset and Creator of Apache Superset and Apache Airflow, it's not so straightforward to understand what you're really getting into, or how vast a range of skills is required to build a thriving company.

Picking the right system and services is key for a successful start, and can help you avoid the chaos of having too many tools spread across multiple teams.

Plus, Max walks the bros through the genesis of Airflow, Superset & Presto, and Airflow's old-school marketing approach that won the hearts of developers across the world. And just like the Terminator, once the machine takes over, you can't stop.

The Data Engineering Show - How Similarweb Delivers Customer Facing Analytics Over 100s of TBs
07/14/22 • 37 min

According to Yoav Shmaria, VP R&D Platform at Similarweb, the best way to manage data warehouse costs is to tag every table, database or ETL running to have good granularity over every feature.

Besides handy cost management tips, Yoav walks the bros through the tech stack he implemented to analyze 100s of TBs of web data to serve fast customer-facing analytics.

Full disclosure, Similarweb is a Firebolt customer, but the bros kept it objective, and there’s no Firebolt talk in this episode.

The Data Engineering Show - How Klarna Designed a New Data Platform in the Cloud
06/09/22 • 40 min

Klarna is one of the leading fintech companies in the world, valued at $45B.

While many corporations are “stuck” on-prem, Klarna made the move and today is a cloud-only company. Gunnar Tangring, Klarna’s Lead Data Engineer, tells Boaz what this new modernized stack looks like.

The Data Engineering Show - A Deep Dive into Slack's Data Architecture
05/11/22 • 34 min

Growing from a startup to a public company and then an acquired one meant that Slack’s sales org was scaling rapidly.
Apun Hiran, Slack’s Director of Software Engineering, explains how the data stack and architecture evolved to support this growth with more reliable and timely metrics.

Speaker: Apun Hiran, Director of Software Engineering (Data), Slack
Hosts: Eldad and Boaz Farkash, CEO and CPO, Firebolt

The Data Engineering Show - Vin Vashishta explains why we should stop using dashboards
10/04/23 • 35 min

Vin Vashishta, the guy we all love to follow, has never seen a dashboard with positive ROI. This time on The Data Engineering Show, he met the bros to talk about the difference between BI dashboards and analytics that actually introduce knowledge. It’s no longer just about data volume; it’s about quality and relevance.

FAQ

How many episodes does The Data Engineering Show have?

The Data Engineering Show currently has 81 episodes available.

What topics does The Data Engineering Show cover?

The podcast is about Computer Science, Analytics, Management, Data, Podcasts, Technology and Business.

What is the most popular episode on The Data Engineering Show?

The episode titled 'How Amplitude Engineers Process 5 Trillion Real-time Events' is the most popular.

What is the average episode length on The Data Engineering Show?

The average episode length on The Data Engineering Show is 35 minutes.

How often are episodes of The Data Engineering Show released?

Episodes of The Data Engineering Show are typically released every 14 days and 16 hours.

When was the first episode of The Data Engineering Show?

The first episode of The Data Engineering Show was released on Apr 5, 2021.
