Delta Lake: The Definitive Guide | Agenda Bookshop

Delta Lake: The Definitive Guide

Ready to simplify the process of building data lakehouses and data pipelines at scale? In this practical guide, learn how Delta Lake is helping data engineers, data scientists, and data analysts overcome key data reliability challenges with modern data engineering and management techniques.

Authors Denny Lee, Tristen Wentling, Scott Haines, and Prashanth Babu (with contributions from Delta Lake maintainer R. Tyler Croy) share expert insights on all things Delta Lake, including how to run batch and streaming jobs concurrently and accelerate the usability of your data. You'll also uncover how ACID transactions bring reliability to data lakehouses at scale.

This book helps you:

  • Understand key data reliability challenges and how Delta Lake solves them
  • Explain the critical role of Delta transaction logs as a single source of truth
  • Learn the Delta Lake ecosystem with technologies like Apache Flink, Kafka, and Trino
  • Architect data lakehouses with the medallion architecture
  • Optimize Delta Lake performance with features like deletion vectors and liquid clustering
€76.99

Will deliver when available. Publication date 29 Nov 2024

Product Details
  • Dimensions: 178 x 233mm
  • Publication Date: 12 Nov 2024
  • Publisher: O'Reilly Media
  • Publication City/Country: US
  • Language: English
  • ISBN13: 9781098151942

About Denny Lee, Prashanth Babu, Scott Haines, and Tristen Wentling

Denny Lee is a Staff Developer Advocate at Databricks. He is a hands-on distributed systems and data sciences engineer with extensive experience developing internet-scale infrastructure, data platforms, and predictive analytics systems for both on-premises and cloud environments. He also has a Masters of Biomedical Informatics from Oregon Health & Science University and has architected and implemented powerful data solutions for enterprise healthcare customers. His current technical focuses include distributed systems, Apache Spark, deep learning, machine learning, and genomics.

Tristen Wentling works in machine learning, data engineering, and statistical analysis using Python, Apache Spark, and Scala. He is a machine learning advocate who loves the flexibility of neural networks. Tristen holds an M.S. in Mathematics and a B.S. in Applied Mathematics.

Scott Haines is a Databricks Beacon and has been working with data systems and distributed systems and architectures for over 15 years. He recently wrote a book encapsulating his journey called Modern Data Engineering with Apache Spark: A Hands-on guide for building mission-critical streaming applications. He enjoys teaching people how to simplify data systems and data-intensive services, and in winter he takes to the snow to pursue his love of snowboarding.

Prashanth Babu is a Databricks Certified Developer who helps guide the design and implementation of customer use cases by building out reference architectures, best practices, frameworks, MVPs, and prototypes, which enables customers to succeed in turning their data into value.
