
Building the Backend: Data Solutions that Power Leading Organizations
Travis Lawrence
All episodes
Best episodes
Seasons
Top 10 Building the Backend: Data Solutions that Power Leading Organizations Episodes
Goodpods has curated a list of the 10 best Building the Backend: Data Solutions that Power Leading Organizations episodes, ranked by the number of listens and likes each episode have garnered from our listeners. If you are listening to Building the Backend: Data Solutions that Power Leading Organizations for the first time, there's no better place to start than with one of these standout episodes. If you are a fan of the show, vote for your favorite Building the Backend: Data Solutions that Power Leading Organizations episode by adding your comments to the episode page.

Reverse ETL with Hightouch
Building the Backend: Data Solutions that Power Leading Organizations
06/15/21 • 32 min
Top 3 Value Bombs:
- Organizations should be sending more holistic customer data back into their marketing solutions.
- Reverse ETL is the process of creating pipelines to extract data from the warehouse/lake and move back into operational components.
- Utilize CDC when extracting data to minimize the impact to your source system.

DataOps Is Not Just DevOps for Data with DataKitchen
Building the Backend: Data Solutions that Power Leading Organizations
03/30/21 • 28 min
In todayâs episode, we will speak with Chris Bergh, a pioneer in the DataOps landscape and the CEO at DataKitchen, a DataOps Platform that Simplifies Complex Data Toolchains and Environments
Top 3 Value Bombs:
- DataOps is not just DevOps for data
- Any organization can get started today and start implementing DataOps practices. Start small and prioritize quick wins.
- The people/process is just as important as the tools used if not more so when implementing DataOps.
If you enjoy this episode and want to learn more, please head on over to DataKitchen.io to download your free copy of the DataOps Cookbook.

Edge Computing and Continuous Intelligence with Swim
Building the Backend: Data Solutions that Power Leading Organizations
10/26/21 • 34 min
In this episode of Building The Backend we hear from Simon Crosby â CTO @ Swim an open source edge computing operating system, where we talk all about edge computing, event streaming and much more.
Below are top 3 value bombs:
- Edge means more than just being physically located somewhere it could also mean in the cloud. It really is the closest point of where your source data is being generated.
- Continuous intelligence is a design pattern where streaming data is directly tied into business operations.
- Kafka is continuing to hold itâs strong position in the event streaming space.

The Data Warehouse for Distributed Clouds - Yellowbrick
Building the Backend: Data Solutions that Power Leading Organizations
06/29/21 • 37 min
In this episode, we speak with Mark Cusack, CTO at Yellowbrick. Yellowbrick is a data warehouse platform that was built from the ground up for performance and cost that can be deployed across clouds and on-prem.
Top 3 Value Bombs:
- Yellowbrick DW was recently named a contender in Cloud Data Warehouses by Forrester Research and they are able to achieve 100X performance at 1/5th the priceâ against many competitors.
- As data production is exponentially increasing at the âedgeâ the need to pre-process and keep the data where it is is becoming critical. The distributed cloud model helps solve this increasing problem.
- Yellowbrick was created from the ground up with a focus on performance and cost, a few of its technical features include a custom Linux-based OS kernel, data is read directly from primary storage into the CPU cache, and custom network drivers.

Enable Faster Data Processing and Access with Apache Arrow with Matt Topol @ Factset
Building the Backend: Data Solutions that Power Leading Organizations
02/01/22 • 49 min
In this episode we speak with Matt Topol, Vice President, Principal Software Architect @ FactSet and dive deep into how they are taking advantage of Apache Arrow for faster processing and data access.
Below are the top 3 value bombs:
- Apache Arrow is an open-source in-memory columnar format that creates a standard way to share and process data structures.
- Apache Arrow Flight eliminates serialization and deserialization which enables faster access to query results compared to traditional JDBC and ODBC interfaces.
- Donât put all your eggs in one basket, whether you're using commercial products or open source, make sure you design a modular architecture that does not tie you down to any one piece of technology.

The Importance of Treating Your Data Initiatives as Products with Murali Bhogavalli
Building the Backend: Data Solutions that Power Leading Organizations
01/18/22 • 26 min
Your data team should not just be keeping the lights on, but should be building and creating data products to support the business. In this episode we speak with Murali Bhogavalli a data product manager and explore what is a data product manager and how they differ from a traditional product manager.
Below are the top 3 value bombs:
- Data should be looked at as a product and treated as such within the organization (i.e. agile methodologies, continuous improvementâ¦)
- Organizations need to be more than just data driven but also data informed. For that to happen, you need to build data literacy into your ecosystem by helping everybody understand what the data means and where is it coming from and the quality of it..
- Product managers typically use data to deliver the outcomes. But for a data PM, data is the deliverable and it also the outcome.

Open-Source Data Catalog Amundsen with Mark Grover @ Stemma
Building the Backend: Data Solutions that Power Leading Organizations
01/11/22 • 41 min
In this episode of Building The Backend we hear from Mark Grover founder @ Stemma, co-creator of Amundsen. Stemma is a fully managed data catalog, powered by the leading open-source data catalog, Amundsen.
Below are top 3 value bombs:
- Automated data catalogs are critical to help wrangle the growing data across organizations. (i.e. Being able to identify out of 150 columns on this table only 10 are being used downstream)
- Tribal knowledge and context cannot be automated - data catalogs cannot be 100% automated.
- Amundsen is an open-source data catalog originally created at Lyft. Stemma has created a managed version of Amundsen.
Help me improve the podcast by completing this 60 second survey: https://buildingthebackend.com/survey

Increase the Quality and Reliability of Your Data
Building the Backend: Data Solutions that Power Leading Organizations
07/27/21 • 31 min
In this episode, we speak with Lior Gavish, the co-founder of Monte Carlo to explore all things data quality. Monte Carlo is a data lineage and observability tool that lowers your data downtime.
Top 3 Value Bombs:
- Data products should be thought of in itâs entirely from the source to the consumer.
- No one data stakeholder can solve data quality issues, itâs a collaboration of the data engineers, business, data consumer and even software to help automate certain aspects of cataloging and capturing meaningful metadata.
- Good data quality processes should alert you to anomalies in your metrics before your data consumers do.

Why You Should Be Using (CDC) Change Data Capture for Ingestion with Datacoral
Building the Backend: Data Solutions that Power Leading Organizations
05/18/21 • 40 min
In this episode, we speak with Raghu Murthy. He is the founder of Datacoral, which provides serverless architectures that support data pipelines and orchestration to facilitate ELT into your Data Warehouse. Prior to founding Datacoral he was at Yahoo, Facebook and was part of the initial team that developed Hive. In this episode we will explore the best patterns for ingesting operational data into your data warehouse, creating metadata first architectures and the role Datacoral serves.
Top 3 Value Bombs:
- If you're migrating relational data that supports CDC, you should be using CDC to migrate it for the majority of use cases.
- ELT/ETL pipelines should be orchestrated by a metadata first style architecture.
- Consumers of the DW, should be notified if data is incomplete.

TRAILER: Welcome to Building the Backend - EP0
Building the Backend: Data Solutions that Power Leading Organizations
02/03/21 • 2 min
Welcome to the Building the Backend Podcast! Weâre a data podcast focused on uncovering the data technologies, processes, and patterns that are driving todayâs most successful companies. In this trailer episode, you will get a glimpse of what to expect with episodes on the show.
Show more best episodes

Show more best episodes
FAQ
How many episodes does Building the Backend: Data Solutions that Power Leading Organizations have?
Building the Backend: Data Solutions that Power Leading Organizations currently has 43 episodes available.
What topics does Building the Backend: Data Solutions that Power Leading Organizations cover?
The podcast is about How To, Podcasts, Technology and Education.
What is the most popular episode on Building the Backend: Data Solutions that Power Leading Organizations?
The episode title 'Transform Your Object Storage Into a Git-like Repository With Paul Singman @ LakeFS' is the most popular.
What is the average episode length on Building the Backend: Data Solutions that Power Leading Organizations?
The average episode length on Building the Backend: Data Solutions that Power Leading Organizations is 34 minutes.
How often are episodes of Building the Backend: Data Solutions that Power Leading Organizations released?
Episodes of Building the Backend: Data Solutions that Power Leading Organizations are typically released every 7 days.
When was the first episode of Building the Backend: Data Solutions that Power Leading Organizations?
The first episode of Building the Backend: Data Solutions that Power Leading Organizations was released on Feb 3, 2021.
Show more FAQ

Show more FAQ