FedSearch - Federated network search engine

Tabular · @tabular

79 followers · 193 posts · Server data-folks.masto.host

MinIO put out a great new blog last week that walks you through the details of how to build a #datalakehouse using #ApacheIceberg and MinIO. It even includes Project Jupyter, #PyIceberg, and some ETL.

https://blog.min.io/building-a-data-lakehouse-using-apache-iceberg-and-minio/

#datalakehouse #apacheiceberg #pyiceberg

Last updated 2 years ago

Tabular · @tabular

79 followers · 192 posts · Server data-folks.masto.host

Rounding out a trifecta of AWS engines that now work with Tabular-managed #ApacheIceberg tables is our latest episode of Tabular Solutions highlighting AWS EMR. Tabular co-founder Jason Reid and Tabular Sr. Developer Advocate Shawn Gordon show how quick and easy it is to do. Previously we illustrated Amazon Athena and Amazon Redshift.

#iceberg #datalake #apacheiceberg #datalakehouse #emr #tabular #dataengineering
https://youtu.be/aBiL3pFtf1A

#apacheiceberg #iceberg #DataLake #datalakehouse #emr #tabular #dataengineering

Last updated 2 years ago

Tabular · @tabular

79 followers · 191 posts · Server data-folks.masto.host

With the last day of August comes the Iceberg Community News. There have been important updates as work proceeds toward the next version of #ApacheIceberg. #PyIceberg 0.5.0 is coming along quickly and will be available soon, and the Rust and Go implementations have made significant advances. Lots of great blogs from the community and industry news are included. Many thanks to the contributors for all their hard work.

#dataengineering #datalake #datalakehouse

https://tabular.io/blog/iceberg-202308/

#apacheiceberg #pyiceberg #dataengineering #DataLake #datalakehouse

Last updated 2 years ago

Tabular · @tabular

79 followers · 187 posts · Server data-folks.masto.host

Our new Tabular Solutions video has Fokko Driesprong showing Shawn Gordon how to use #PyIceberg in Outerbounds to set up a #MachineLearning operation on a Tabular-managed #ApacheIceberg table. This powerful combination is now very simple to integrate and use.

Special thanks to Hugo Bowne-Anderson and Eddie Mattia.

#dataengineering #datalake #datalakehouse

https://youtu.be/IJclwoKEOeM

#pyiceberg #machinelearning #apacheiceberg #dataengineering #DataLake #datalakehouse

Last updated 2 years ago

Tabular · @tabular

78 followers · 186 posts · Server data-folks.masto.host

Take a walk through our new Cascading Privileges feature in Tabular. Managing privileges against your data in #ApacheIceberg can become a difficult and dangerous task, but Tabular makes it easy. Our new Storylane-powered interactive demo will show you how.
#datalake #datalakehouse #dataengineering #iceberg
https://app.storylane.io/share/xjtgjg0gwxpn

#apacheiceberg #DataLake #datalakehouse #dataengineering #iceberg

Last updated 2 years ago

Tabular · @tabular

78 followers · 184 posts · Server data-folks.masto.host

Mike Taveirne at Snowflake has a new blog that does a great job of explaining when to use #ApacheIceberg in Snowflake. He gives a brief overview of #Iceberg, the types of integrations available, and when to use them. The feature is available in private preview, so if this is of interest, talk to your account team to get access.

#dataengineering #datalakehouse #datawarehouse

https://medium.com/snowflake/when-to-use-iceberg-tables-in-snowflake-c087240759cb

#apacheiceberg #iceberg #dataengineering #datalakehouse #datawarehouse

Last updated 2 years ago

Tabular · @tabular

78 followers · 183 posts · Server data-folks.masto.host

Akshay Jain has written a very nice blog on #ApacheIceberg with tips on optimizing streaming and batch updates for enhanced performance results. It's worth a read.

#dataengineering #datalake #datalakehouse

https://medium.com/@akshayjain.developer/mastering-apache-iceberg-optimizing-streaming-and-batch-updates-for-stellar-data-performance-5eae2fbbb2f9

#apacheiceberg #dataengineering #DataLake #datalakehouse

Last updated 2 years ago

Tabular · @tabular

78 followers · 180 posts · Server data-folks.masto.host

Vino Duraisamy at Snowflake has a great new detailed blog on the design changes in Snowflake's architecture to support #ApacheIceberg tables—a must-read for those interested in this space.

#dataengineering #datalakehouse #datawarehouse

https://medium.com/snowflake/iceberg-tables-on-snowflake-design-considerations-and-life-of-an-insert-query-5026ea10bbea

#apacheiceberg #dataengineering #datalakehouse #datawarehouse

Last updated 2 years ago

Tabular · @tabular

78 followers · 179 posts · Server data-folks.masto.host

A new Tabular Solutions episode is now available with Tabular co-founder Jason Reid. He shows Shawn Gordon how easy it is to set up a Google Colab notebook and use #ApacheSpark to read/write data from Tabular-managed #ApacheIceberg tables.

#iceberg #datalake #datalakehouse #dataengineering #googlecolab #apachespark

https://youtu.be/VI5dtq-pCN8

#ApacheSpark #apacheiceberg #iceberg #DataLake #datalakehouse #dataengineering #googlecolab

Last updated 2 years ago

Tabular · @tabular

78 followers · 178 posts · Server data-folks.masto.host

This new Tabular Bits illustrates our new "File Loader" feature. Within the Tabular UI, you can browse your AWS S3 object store and locate a directory containing files you want to load into a particular Tabular-managed #ApacheIceberg table. The files can be in JSON, CSV, TSV, or Parquet format. The file loader will load them, and Tabular will then automatically optimize everything for you.

#datalake #datalakehouse #dataengineering #iceberg

https://youtu.be/CShzVZ0f_9Y

#apacheiceberg #DataLake #datalakehouse #dataengineering #iceberg

Last updated 2 years ago

Tabular · @tabular

78 followers · 177 posts · Server data-folks.masto.host

Tabular has a convenient file loader tool where you can easily and visually define a path to an S3 directory where you are going to store data that you want to load into #ApacheIceberg. This data can be in JSON, CSV, TSV or Parquet formats. This interactive demo shows you how to do it.
#dataengineering #streamingdata #datalake #datalakehouse
https://app.storylane.io/share/hrubjytflspb

#apacheiceberg #dataengineering #StreamingData #DataLake #datalakehouse

Last updated 2 years ago

Tabular · @tabular

78 followers · 173 posts · Server data-folks.masto.host

YouTube - Tabular Bits: Creating Tables

This new Tabular Bits walks you through our new "Create table" feature, which is a non-programmatic way to create an #ApacheIceberg table and select all the options for partitioning and ordering.

#datalake #datalakehouse #dataengineering #iceberg

https://youtu.be/vk10pPKX74U

#apacheiceberg #DataLake #datalakehouse #dataengineering #iceberg

Last updated 2 years ago

Tabular · @tabular

78 followers · 173 posts · Server data-folks.masto.host

Sometimes you just want a non-programmatic way to create an #ApacheIceberg table and select all the options for partitioning and ordering. Well, Tabular has you covered with our cleverly named "Create a table" feature 😁

This Storylane-powered interactive demo quickly walks you through the process. Please give it a look and see how easy it is.

#dataengineering #datalake #datalakehouse

https://app.storylane.io/share/rqwcdjcnkjc4

#apacheiceberg #dataengineering #DataLake #datalakehouse

Last updated 2 years ago

Tabular · @tabular

78 followers · 173 posts · Server data-folks.masto.host

YouTube - Tabular Solutions: CelerData

Tabular Solutions is back with a new episode. This time we meet with Albert Wong, Developer Advocate at CelerData, a managed solution for #StarRocks. Albert shows Shawn Gordon the work CelerData has been doing to integrate their managed StarRocks with Tabular-managed #ApacheIceberg tables for full read/write support.

#iceberg #datalake #datalakehouse #dataengineering

https://youtu.be/bAmcTrX7hCI

#starrocks #apacheiceberg #iceberg #DataLake #datalakehouse #dataengineering

Last updated 2 years ago

Tabular · @tabular

78 followers · 169 posts · Server data-folks.masto.host

Are you subscribed to our YouTube channel yet? We have a variety of playlists to make finding answers simple. Tabular Bits covers various features of the Tabular product in 2-3 minute episodes. Tabular Solutions shows Tabular working with other products/projects. Ask the #Iceberg Experts talks to various experts in the #ApacheIceberg community on Iceberg topics, and more. Make sure to subscribe so you don't miss an episode.

#dataengineering #datalake #datalakehouse

https://www.youtube.com/@tabularIO

#iceberg #apacheiceberg #dataengineering #DataLake #datalakehouse

Last updated 2 years ago

Tabular · @tabular

79 followers · 168 posts · Server data-folks.masto.host

As this hot July comes to a close, we bring you all the cool #Iceberg news from this month. #ApacheIceberg 1.3.1 was released with important updates. PyIceberg 0.4.0 was released. #Rust and #Golang support was formalized with active GitHub repositories. Lots of exciting news in the industry and blogs from the community.
#dataengineering #datalake #datalakehouse
https://tabular.io/blog/iceberg-202307/

#iceberg #apacheiceberg #rust #golang #dataengineering #DataLake #datalakehouse

Last updated 2 years ago

Tabular · @tabular

79 followers · 167 posts · Server data-folks.masto.host

The new episode of Tabular Bits is now available and showcases our newest compute engine integration, AWS Athena SQL. You'll learn how to connect #AWS Athena with your Tabular-managed #ApacheIceberg tables in just a few minutes. Perform any function with Athena SQL on #Iceberg that you would on any other data source.

#datalake #datalakehouse #dataengineering #iceberg #awsathena

https://youtu.be/IeRzxZhOcwc

#aws #apacheiceberg #iceberg #DataLake #datalakehouse #dataengineering #awsathena

Last updated 2 years ago

Tabular · @tabular

79 followers · 166 posts · Server data-folks.masto.host

Michael DeRoy of IBM has a great new blog out that goes into some detail about Netezza support of #ApacheIceberg. He provides examples and details that are highly informative.

#dataengineering #datalake #datalakehouse

https://medium.com/@MikeDeRoy/netezzas-evolution-from-warehouse-to-lakehouse-with-watsonx-data-edd0b932f66a

#apacheiceberg #dataengineering #DataLake #datalakehouse

Last updated 2 years ago

Tabular · @tabular

78 followers · 165 posts · Server data-folks.masto.host

Our new interactive demo (using Storylane) illustrates how to work with our new #AWSAthena compute engine integration. This feature will be live in the Tabular product in a couple of days. Get a head start and see how you can easily work with Athena on your Tabular-managed #ApacheIceberg tables. Come give it a try!

#dataengineering #datalake #datalakehouse

https://app.storylane.io/share/avrhx4femm9z

#awsathena #apacheiceberg #dataengineering #DataLake #datalakehouse

Last updated 2 years ago

Tabular · @tabular

76 followers · 163 posts · Server data-folks.masto.host

Fokko Driesprong has written a blog highlighting some of the great new features in the recently released #PyIceberg 0.4.0. #Python and #ApacheIceberg are a powerful combination.

#dataengineering #datalake #datalakehouse
https://tabular.io/blog/pyiceberg-0-4-0/

#pyiceberg #python #apacheiceberg #dataengineering #DataLake #datalakehouse

Last updated 2 years ago
