My Medium adventure enters a new phase: the first post for a Medium-held publication, Plumbers of Data Science, just got published :)
It's also more technical than my previous writings. The point is to introduce Apache Hudi in a softer way than the official documentation does at the moment. So, if you're interested in starting with Hudi, look no further :)
#apachehudi #apachespark #dataengineering
https://medium.com/plumbersofdatascience/apache-hudi-copy-on-write-explained-563f1d23d34f
#ApacheHudi #apachespark #dataengineering
4. Function for readability of big numbers - formatReadableDecimalSize(size); function displayName() for better work with long unreadable host names.
5. Adding retries on INSERT with new property "SET insert_keepert_max_retries = 10" - fixing "Session expired. Table is in readonly mode" (the setting is not yet default).
6. Added support for #ApacheHudi and #DeltaLake formats for SELECT.
➡️
RT @094459
This Friday, 9am UK time, on http://twitch.tv.aws, we have the 7th episode of Build on Open Source, where Derek and myself take a look at the latest open source projects. We also have special guest Vinoth Chandar, showcasing some of #apachehudi's superpowers. Can't wait!