Overcome challenges in building transactional guarantees on rapidly changing data by using Apache Hudi. With this practical guide, data engineers, data architects, and software architects will discover how to seamlessly build an interoperable lakehouse from disparate data sources and deliver faster insights using your query engine of choice.
Authors Shiyan Xu, Prashant Wason, Bhavani Sudha Saktheeswaran, and Rebecca Bilbro provide practical examples and insights to help you unlock the full potential of data lakehouses for different levels of analytics, from batch to interactive to streaming. You’ll also learn how to evaluate storage choices and leverage built-in automated table optimizations to build, maintain, and operate production data applications.
This book helps you:
Understand the need for transactional data lakehouses and the challenges associated with building them
Explore data ecosystem support provided by Apache Hudi for popular data sources and query engines
Perform different write and read operations on Apache Hudi tables and effectively use them for various use cases, including batch and stream applications
Apply different storage techniques and considerations such as indexing and clustering to maximize your lakehouse performance
Build end-to-end incremental data pipelines using Apache Hudi for faster ingestion and fresher analytics
From the brand
Databases, data science & more
Data Science
Data Visualization
Databases
Streaming
Sharing the knowledge of experts
O’Reilly’s mission is to change the world by sharing the knowledge of innovators. For over 40 years, we’ve inspired companies and individuals to do new things (and do them better) by providing the skills and understanding that are necessary for success.
Our customers are hungry to build the innovations that propel the world forward. And we help them do just that.
ASIN : B0FXYDXSSF
Publisher : O’Reilly Media
Accessibility : Learn more
Publication date : October 24, 2025
Edition : 1st
Language : English
File size : 7.6 MB
Simultaneous device usage : Unlimited
Enhanced typesetting : Enabled
X-Ray : Not Enabled
Word Wise : Not Enabled
Print length : 489 pages
ISBN-13 : 978-1098173791
Page Flip : Enabled
Best Sellers Rank: #62 in Data Warehousing (Books) #97 in Business Intelligence Tools #111 in Software Testing

