
Data Engineering Design Patterns
Recipes for Solving the Most Common Data Engineering Problems
Failed to add items
Add to Cart failed.
Add to Wish List failed.
Remove from wishlist failed.
Adding to library failed
Follow podcast failed
Unfollow podcast failed
3 months free
Pre-order for $17.49
No default payment method selected.
We are sorry. We are not allowed to sell this product with the selected payment method
-
Narrated by:
-
Charles Constant
About this listen
Data projects are an intrinsic part of an organization's technical ecosystem, but data engineers in many companies continue to work on problems that others have already solved. This hands-on guide shows you how to provide valuable data by focusing on various aspects of data engineering, including data ingestion, data quality, idempotency, and more.
Author Bartosz Konieczny guides you through the process of building reliable end-to-end data engineering projects, from data ingestion to data observability, focusing on data engineering design patterns that solve common business problems in a secure and storage-optimized manner. Each pattern includes a user-facing description of the problem, solutions, and consequences that place the pattern into the context of real-life scenarios.
Throughout this journey, you'll use open source data tools and public cloud services to apply each pattern. You'll learn about challenges data engineers face and their impact on data systems; how these challenges relate to data system components; useful applications of data engineering patterns; how to identify and fix issues with your current data components; and technology-agnostic solutions to new and existing data projects, with open source implementation examples.
People who viewed this also viewed...
-
Fundamentals of Data Engineering
- Plan and Build Robust Data Systems
- By: Joe Reis, Matt Housley
- Narrated by: Adam Verner
- Length: 17 hrs and 31 mins
- Unabridged
-
Overall
-
Performance
-
Story
Data engineering has grown rapidly in the past decade, leaving many software engineers, data scientists, and analysts looking for a comprehensive view of this practice. With this practical book, you'll learn how to plan and build systems to serve the needs of your organization and customers by evaluating the best technologies available through the framework of the data engineering lifecycle.
-
-
Great for Hands on Business owners
- By LmL on 08-14-24
By: Joe Reis, and others
-
Deciphering Data Architectures
- Choosing Between a Modern Data Warehouse, Data Fabric, Data Lakehouse, and Data Mesh
- By: James Serra
- Narrated by: Tom Beyer
- Length: 12 hrs
- Unabridged
-
Overall
-
Performance
-
Story
Data fabric, data lakehouse, and data mesh have recently appeared as viable alternatives to the modern data warehouse. These new architectures have solid benefits, but they're also surrounded by a lot of hyperbole and confusion. This practical book provides a guided tour of these architectures to help data professionals understand the pros and cons of each.
By: James Serra
-
Designing Data-Intensive Applications
- The Big Ideas Behind Reliable, Scalable, and Maintainable Systems
- By: Martin Kleppmann
- Narrated by: Benjamin Lange
- Length: 20 hrs and 56 mins
- Unabridged
-
Overall
-
Performance
-
Story
Author Martin Kleppmann helps you navigate the diverse data landscape by examining the pros and cons of various technologies for processing and storing data. Software keeps changing, but the fundamental principles remain the same. With this book, software engineers and architects will learn how to apply those ideas in practice, and how to make full use of data in modern applications.
-
-
Must read for senior+ software engineers
- By Navid v on 05-29-21
By: Martin Kleppmann
-
Database Internals
- A Deep Dive into How Distributed Data Systems Work, 1st Edition
- By: Alex Petrov
- Narrated by: Mike Chamberlain
- Length: 12 hrs and 51 mins
- Unabridged
-
Overall
-
Performance
-
Story
When it comes to choosing, using, and maintaining a database, understanding its internals is essential. But with so many distributed databases and tools available today, it's often difficult to understand what each one offers. With this practical guide, Alex Petrov guides developers through the concepts behind modern database and storage engine internals.
-
-
I can't believe this exists as an audiobook.
- By Michael Marcin on 12-18-23
By: Alex Petrov
-
Designing Distributed Systems (2nd Edition)
- Patterns and Paradigms for Scalable, Reliable Systems Using Kubernetes
- By: Brendan Burns
- Narrated by: Tom Beyer
- Length: 8 hrs and 33 mins
- Unabridged
-
Overall
-
Performance
-
Story
Author Brendan Burns demonstrates how you can adapt existing software design patterns for designing and building reliable distributed applications. Systems engineers and application developers will learn how these long-established patterns provide a common language and framework for dramatically increasing the quality of your system. This fully updated second edition includes new chapters on AI inference, AI training, and building robust systems for the real world.
By: Brendan Burns
-
Data Pipelines with Apache Airflow
- By: Julian de Ruiter, Bas Harenslak
- Narrated by: Julie Brierley
- Length: 10 hrs and 18 mins
- Unabridged
-
Overall
-
Performance
-
Story
A successful pipeline moves data efficiently, minimizing pauses and blockages between tasks, and keeping processes along the way operational. Apache Airflow provides a single customizable environment for building and managing data pipelines, eliminating the need for a hodgepodge collection of tools, snowflake code, and homegrown processes. Using real-world scenarios and examples, this book teaches you how to simplify and automate data pipelines, reduce operational overhead, and smoothly integrate all the technologies in your stack.
By: Julian de Ruiter, and others
-
Fundamentals of Data Engineering
- Plan and Build Robust Data Systems
- By: Joe Reis, Matt Housley
- Narrated by: Adam Verner
- Length: 17 hrs and 31 mins
- Unabridged
-
Overall
-
Performance
-
Story
Data engineering has grown rapidly in the past decade, leaving many software engineers, data scientists, and analysts looking for a comprehensive view of this practice. With this practical book, you'll learn how to plan and build systems to serve the needs of your organization and customers by evaluating the best technologies available through the framework of the data engineering lifecycle.
-
-
Great for Hands on Business owners
- By LmL on 08-14-24
By: Joe Reis, and others
-
Deciphering Data Architectures
- Choosing Between a Modern Data Warehouse, Data Fabric, Data Lakehouse, and Data Mesh
- By: James Serra
- Narrated by: Tom Beyer
- Length: 12 hrs
- Unabridged
-
Overall
-
Performance
-
Story
Data fabric, data lakehouse, and data mesh have recently appeared as viable alternatives to the modern data warehouse. These new architectures have solid benefits, but they're also surrounded by a lot of hyperbole and confusion. This practical book provides a guided tour of these architectures to help data professionals understand the pros and cons of each.
By: James Serra
-
Designing Data-Intensive Applications
- The Big Ideas Behind Reliable, Scalable, and Maintainable Systems
- By: Martin Kleppmann
- Narrated by: Benjamin Lange
- Length: 20 hrs and 56 mins
- Unabridged
-
Overall
-
Performance
-
Story
Author Martin Kleppmann helps you navigate the diverse data landscape by examining the pros and cons of various technologies for processing and storing data. Software keeps changing, but the fundamental principles remain the same. With this book, software engineers and architects will learn how to apply those ideas in practice, and how to make full use of data in modern applications.
-
-
Must read for senior+ software engineers
- By Navid v on 05-29-21
By: Martin Kleppmann
-
Database Internals
- A Deep Dive into How Distributed Data Systems Work, 1st Edition
- By: Alex Petrov
- Narrated by: Mike Chamberlain
- Length: 12 hrs and 51 mins
- Unabridged
-
Overall
-
Performance
-
Story
When it comes to choosing, using, and maintaining a database, understanding its internals is essential. But with so many distributed databases and tools available today, it's often difficult to understand what each one offers. With this practical guide, Alex Petrov guides developers through the concepts behind modern database and storage engine internals.
-
-
I can't believe this exists as an audiobook.
- By Michael Marcin on 12-18-23
By: Alex Petrov
-
Designing Distributed Systems (2nd Edition)
- Patterns and Paradigms for Scalable, Reliable Systems Using Kubernetes
- By: Brendan Burns
- Narrated by: Tom Beyer
- Length: 8 hrs and 33 mins
- Unabridged
-
Overall
-
Performance
-
Story
Author Brendan Burns demonstrates how you can adapt existing software design patterns for designing and building reliable distributed applications. Systems engineers and application developers will learn how these long-established patterns provide a common language and framework for dramatically increasing the quality of your system. This fully updated second edition includes new chapters on AI inference, AI training, and building robust systems for the real world.
By: Brendan Burns
-
Data Pipelines with Apache Airflow
- By: Julian de Ruiter, Bas Harenslak
- Narrated by: Julie Brierley
- Length: 10 hrs and 18 mins
- Unabridged
-
Overall
-
Performance
-
Story
A successful pipeline moves data efficiently, minimizing pauses and blockages between tasks, and keeping processes along the way operational. Apache Airflow provides a single customizable environment for building and managing data pipelines, eliminating the need for a hodgepodge collection of tools, snowflake code, and homegrown processes. Using real-world scenarios and examples, this book teaches you how to simplify and automate data pipelines, reduce operational overhead, and smoothly integrate all the technologies in your stack.
By: Julian de Ruiter, and others