Building the Backend: Data Solutions that Power Leading Organizations

Travis Lawrence

Welcome to the Building the Backend Podcast! We’re a data podcast focused on uncovering the data technologies, processes, and patterns that are driving today’s most successful companies. You will hear from data leaders sharing their knowledge and insights on what’s working and what’s not working for them. Our goal is to bring you valuable insights that will save you and your team time when building a modern data architecture in the cloud. Topics span big data, AI, ML, governance, visualizations, and best practices for enabling your organization to be data-driven. If you are a chief data officer, data architect, data engineer, data analyst, or anyone else building backend data solutions, then HIT SUBSCRIBE!

Top 10 Building the Backend: Data Solutions that Power Leading Organizations Episodes

Goodpods has curated a list of the 10 best Building the Backend: Data Solutions that Power Leading Organizations episodes, ranked by the number of listens and likes each episode has garnered from our listeners. If you are listening to Building the Backend: Data Solutions that Power Leading Organizations for the first time, there's no better place to start than with one of these standout episodes. If you are a fan of the show, vote for your favorite Building the Backend: Data Solutions that Power Leading Organizations episode by adding your comments to the episode page.

Reverse ETL with Hightouch

06/15/21 • 32 min

In this episode, we speak with Tejas Manohar, Co-Founder of Hightouch, a leading Reverse ETL platform that syncs data from your warehouse or lake back into the tools your business teams rely on.
Top 3 Value Bombs:
  1. Organizations should be sending more holistic customer data back into their marketing solutions.
  2. Reverse ETL is the process of creating pipelines to extract data from the warehouse/lake and move it back into operational components.
  3. Utilize CDC when extracting data to minimize the impact on your source system.
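
The reverse ETL idea in point 2 can be sketched in a few lines. This is only an illustration, not how Hightouch works: an in-memory SQLite database stands in for the warehouse, a plain dictionary stands in for a CRM, and the table name `customer_metrics`, its columns, and the function names are all hypothetical.

```python
import sqlite3


def build_warehouse() -> sqlite3.Connection:
    """Create an in-memory SQLite database standing in for a warehouse (assumption)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE customer_metrics ("
        "email TEXT PRIMARY KEY, lifetime_value REAL, orders INTEGER)"
    )
    conn.executemany(
        "INSERT INTO customer_metrics VALUES (?, ?, ?)",
        [("ana@example.com", 420.0, 7), ("bo@example.com", 35.5, 1)],
    )
    conn.commit()
    return conn


def reverse_etl(conn: sqlite3.Connection, crm: dict) -> int:
    """Copy modeled warehouse rows into the operational tool, keyed by email.

    Returns the number of records actually written; unchanged records are skipped.
    """
    rows = conn.execute(
        "SELECT email, lifetime_value, orders FROM customer_metrics"
    ).fetchall()
    written = 0
    for email, lifetime_value, orders in rows:
        record = {"lifetime_value": lifetime_value, "orders": orders}
        if crm.get(email) != record:  # only push records that differ
            crm[email] = record
            written += 1
    return written


# The hypothetical CRM already holds one up-to-date record.
crm = {"bo@example.com": {"lifetime_value": 35.5, "orders": 1}}
conn = build_warehouse()
written = reverse_etl(conn, crm)
print(written)  # 1
```

Skipping unchanged records keeps the number of writes to the operational tool small, which matters when that tool rate-limits its API.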

DataOps Is Not Just DevOps for Data with DataKitchen

03/30/21 • 28 min

In today’s episode, we speak with Chris Bergh, a pioneer in the DataOps landscape and the CEO of DataKitchen, a DataOps platform that simplifies complex data toolchains and environments.
Top 3 Value Bombs:

  1. DataOps is not just DevOps for data.
  2. Any organization can get started today and start implementing DataOps practices. Start small and prioritize quick wins.
  3. People and process are just as important as the tools used, if not more so, when implementing DataOps.

If you enjoy this episode and want to learn more, please head on over to DataKitchen.io to download your free copy of the DataOps Cookbook.

Edge Computing and Continuous Intelligence with Swim

10/26/21 • 34 min

In this episode of Building the Backend, we hear from Simon Crosby, CTO @ Swim, an open-source edge computing operating system, and talk all about edge computing, event streaming, and much more.

Below are the top 3 value bombs:

  • Edge means more than a physical location; it could also be in the cloud. It really is the point closest to where your source data is being generated.
  • Continuous intelligence is a design pattern where streaming data is directly tied into business operations.
  • Kafka is continuing to hold its strong position in the event streaming space.

The Data Warehouse for Distributed Clouds - Yellowbrick

06/29/21 • 37 min

In this episode, we speak with Mark Cusack, CTO at Yellowbrick. Yellowbrick is a data warehouse platform that was built from the ground up for performance and cost, and it can be deployed across clouds and on-prem.

Top 3 Value Bombs:

  1. Yellowbrick DW was recently named a contender in Cloud Data Warehouses by Forrester Research, and they are able to achieve 100X performance at 1/5th the price against many competitors.
  2. As data production increases exponentially at the “edge,” the need to pre-process the data and keep it where it is has become critical. The distributed cloud model helps solve this growing problem.
  3. Yellowbrick was created from the ground up with a focus on performance and cost. A few of its technical features include a custom Linux-based OS kernel, reading data directly from primary storage into the CPU cache, and custom network drivers.

Enable Faster Data Processing and Access with Apache Arrow with Matt Topol @ Factset

02/01/22 • 49 min

In this episode, we speak with Matt Topol, Vice President, Principal Software Architect @ FactSet, and dive deep into how they are taking advantage of Apache Arrow for faster processing and data access.

Below are the top 3 value bombs:

  • Apache Arrow is an open-source in-memory columnar format that creates a standard way to share and process data structures.
  • Apache Arrow Flight eliminates serialization and deserialization, which enables faster access to query results compared to traditional JDBC and ODBC interfaces.
  • Don’t put all your eggs in one basket. Whether you're using commercial products or open source, make sure you design a modular architecture that does not tie you down to any one piece of technology.

The Importance of Treating Your Data Initiatives as Products with Murali Bhogavalli

01/18/22 • 26 min

Your data team should not just be keeping the lights on, but should be building and creating data products to support the business. In this episode, we speak with Murali Bhogavalli, a data product manager, and explore what a data product manager is and how the role differs from a traditional product manager.

Below are the top 3 value bombs:

  1. Data should be looked at as a product and treated as such within the organization (e.g., agile methodologies, continuous improvement…).
  2. Organizations need to be not just data driven but also data informed. For that to happen, you need to build data literacy into your ecosystem by helping everybody understand what the data means, where it comes from, and what its quality is.
  3. Product managers typically use data to deliver the outcomes. But for a data PM, data is the deliverable and it is also the outcome.

Open-Source Data Catalog Amundsen with Mark Grover @ Stemma

01/11/22 • 41 min

In this episode of Building the Backend, we hear from Mark Grover, founder @ Stemma and co-creator of Amundsen. Stemma is a fully managed data catalog, powered by the leading open-source data catalog, Amundsen.

Below are the top 3 value bombs:

  • Automated data catalogs are critical to help wrangle the growing data across organizations (e.g., being able to identify that only 10 of a table's 150 columns are being used downstream).
  • Tribal knowledge and context cannot be automated - data catalogs cannot be 100% automated.
  • Amundsen is an open-source data catalog originally created at Lyft. Stemma has created a managed version of Amundsen.

Help me improve the podcast by completing this 60-second survey: https://buildingthebackend.com/survey

Increase the Quality and Reliability of Your Data

07/27/21 • 31 min

In this episode, we speak with Lior Gavish, the co-founder of Monte Carlo, to explore all things data quality. Monte Carlo is a data lineage and observability tool that lowers your data downtime.
Top 3 Value Bombs:

  1. Data products should be thought of in their entirety, from the source to the consumer.
  2. No single data stakeholder can solve data quality issues; it takes collaboration among data engineers, the business, data consumers, and even software to help automate certain aspects of cataloging and capturing meaningful metadata.
  3. Good data quality processes should alert you to anomalies in your metrics before your data consumers do.

Why You Should Be Using (CDC) Change Data Capture for Ingestion with Datacoral

05/18/21 • 40 min

In this episode, we speak with Raghu Murthy. He is the founder of Datacoral, which provides serverless architectures that support data pipelines and orchestration to facilitate ELT into your data warehouse. Prior to founding Datacoral, he was at Yahoo and Facebook and was part of the initial team that developed Hive. In this episode, we explore the best patterns for ingesting operational data into your data warehouse, creating metadata-first architectures, and the role Datacoral serves.

Top 3 Value Bombs:

  1. If you're migrating relational data that supports CDC, you should be using CDC to migrate it for the majority of use cases.
  2. ELT/ETL pipelines should be orchestrated by a metadata-first architecture.
  3. Consumers of the DW should be notified if data is incomplete.
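
To make the CDC point concrete, here is a minimal sketch of applying a change log to a warehouse copy of a table. It is not Datacoral's implementation: the event shape (`lsn`, `op`, `id`, `row`), the `apply_changes` function, and the checkpoint handling are assumptions made for illustration. The idea is that the pipeline reads only the changes recorded since its last checkpoint instead of re-scanning the source table.

```python
from typing import Any, Optional

# Hypothetical event shape: "lsn" is a monotonically increasing log position.
ChangeEvent = dict[str, Any]


def apply_changes(
    target: dict[int, Optional[dict[str, Any]]],
    events: list[ChangeEvent],
    after_lsn: int = 0,
) -> int:
    """Apply change events newer than after_lsn to target, in log order.

    Returns the highest log position applied, to be stored as the next checkpoint.
    """
    high_water = after_lsn
    for event in sorted(events, key=lambda e: e["lsn"]):
        if event["lsn"] <= after_lsn:
            continue  # already applied in an earlier run
        if event["op"] == "delete":
            target.pop(event["id"], None)
        else:  # inserts and updates are both treated as upserts
            target[event["id"]] = event["row"]
        high_water = event["lsn"]
    return high_water


change_log = [
    {"lsn": 1, "op": "insert", "id": 1, "row": {"name": "Ana", "plan": "free"}},
    {"lsn": 2, "op": "insert", "id": 2, "row": {"name": "Bo", "plan": "free"}},
    {"lsn": 3, "op": "update", "id": 1, "row": {"name": "Ana", "plan": "pro"}},
    {"lsn": 4, "op": "delete", "id": 2, "row": None},
]

warehouse_table: dict[int, Optional[dict[str, Any]]] = {}
# First run sees only the first two events; the second run resumes from the checkpoint.
checkpoint = apply_changes(warehouse_table, change_log[:2])
checkpoint = apply_changes(warehouse_table, change_log, after_lsn=checkpoint)
print(warehouse_table)  # {1: {'name': 'Ana', 'plan': 'pro'}}
print(checkpoint)  # 4
```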

TRAILER: Welcome to Building the Backend - EP0

02/03/21 • 2 min

Welcome to the Building the Backend Podcast! We’re a data podcast focused on uncovering the data technologies, processes, and patterns that are driving today’s most successful companies. In this trailer episode, you will get a glimpse of what to expect from episodes of the show.

FAQ

How many episodes does Building the Backend: Data Solutions that Power Leading Organizations have?

Building the Backend: Data Solutions that Power Leading Organizations currently has 43 episodes available.

What topics does Building the Backend: Data Solutions that Power Leading Organizations cover?

The podcast is about How To, Podcasts, Technology and Education.

What is the most popular episode on Building the Backend: Data Solutions that Power Leading Organizations?

The episode titled 'Transform Your Object Storage Into a Git-like Repository With Paul Singman @ LakeFS' is the most popular.

What is the average episode length on Building the Backend: Data Solutions that Power Leading Organizations?

The average episode length on Building the Backend: Data Solutions that Power Leading Organizations is 34 minutes.

How often are episodes of Building the Backend: Data Solutions that Power Leading Organizations released?

Episodes of Building the Backend: Data Solutions that Power Leading Organizations are typically released every 7 days.

When was the first episode of Building the Backend: Data Solutions that Power Leading Organizations?

The first episode of Building the Backend: Data Solutions that Power Leading Organizations was released on Feb 3, 2021.
