❓❓❓HOW A LAKEHOUSE TABLE FORMAT WORKS❓❓❓

1. Engine reads table format metadata
2. Builds list of files with relevant data based on metadata
3. Scans those files and executes query
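
The three steps above can be sketched in a few lines of Python. This is only a toy illustration, not how Iceberg, Hudi, or Delta Lake actually store or read metadata: the `DataFile` class, the `min_ts`/`max_ts` statistics, the file names, and the row values are all made up for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DataFile:
    """One entry in the hypothetical table metadata: a file plus column stats."""
    path: str
    min_ts: int  # smallest value of the filter column in this file
    max_ts: int  # largest value of the filter column in this file


def read_metadata() -> list[DataFile]:
    # Step 1: a real engine would read this from the table format's metadata.
    # Here it is hard-coded, invented data.
    return [
        DataFile("part-0.parquet", 0, 99),
        DataFile("part-1.parquet", 100, 199),
        DataFile("part-2.parquet", 200, 299),
    ]


def plan_files(metadata: list[DataFile], lo: int, hi: int) -> list[DataFile]:
    # Step 2: keep only files whose [min_ts, max_ts] range overlaps [lo, hi].
    return [f for f in metadata if f.max_ts >= lo and f.min_ts <= hi]


def scan(files: list[DataFile], rows_by_path: dict, lo: int, hi: int) -> list[int]:
    # Step 3: read only the planned files and apply the filter row by row.
    return [r for f in files for r in rows_by_path[f.path] if lo <= r <= hi]


# Stand-in for file contents (one integer column), keyed by file path.
ROWS = {
    "part-0.parquet": [5, 50, 95],
    "part-1.parquet": [100, 150, 199],
    "part-2.parquet": [210, 250],
}

planned = plan_files(read_metadata(), 120, 220)
result = scan(planned, ROWS, 120, 220)
print([f.path for f in planned], result)
```

For the range 120 to 220, `part-0.parquet` is skipped without being opened, which is the whole point of consulting the metadata first.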

#dataengineering #dataanalytics #bigdata #datalakehouse #apacheiceberg #apachehudi #deltalake

Last updated 1 year ago

Nicolas Fränkel · @frankel
762 followers · 698 posts · Server mastodon.top

Get a detailed overview of Delta Lake, Apache Hudi, and Apache Iceberg as we discuss their data storage, processing capabilities, and deployment options dzone.com/articles/delta-hudi-

#deltalake #apachehudi #apacheiceberg #analytics #spark

Last updated 1 year ago

This blog from Onehouse about Apache Hudi is interesting.

The chart showing which organisations and companies contribute to the projects caught my eye. We all know that Databricks (DB) dominates Delta Lake (DL). I wonder whether the balance in the other two will hold over time, or whether Onehouse and Tabular (circled) will start to grow.

onehouse.ai/blog/apache-hudi-v

#apachehudi #opensource

Last updated 2 years ago

Wojtek · @WojtekWalczak
0 followers · 1 posts · Server awscommunity.social

My Medium adventure enters a new phase: the first post for a Medium-held publication, Plumbers of Data Science, just got published :)

It's also more technical than my previous writings. The point is to introduce Apache Hudi in a softer way than the official documentation does at the moment. So, if you're interested in starting with Hudi, look no further :)

medium.com/plumbersofdatascien

#apachehudi #ApacheSpark #dataengineering

Last updated 2 years ago