Rounding out a trifecta of AWS engines that now work with Tabular-managed #ApacheIceberg tables is our latest episode of Tabular Solutions highlighting AWS EMR. Tabular co-founder Jason Reid and Tabular Sr. Developer Advocate Shawn Gordon show how quick and easy it is to do. Previously we illustrated Amazon Athena and Amazon Redshift.
#iceberg #datalake #apacheiceberg #datalakehouse #emr #tabular #dataengineering
https://youtu.be/aBiL3pFtf1A
#apacheiceberg #iceberg #DataLake #datalakehouse #emr #tabular #dataengineering
🐢 At Giskard, we're creating a robust #ML framework for #testing ML #models effectively. We help identify #biases and #errors in AI models, from #tabular to #LLMs. Participating in DEFCON allows us to collaborate with leading experts and share our commitment to #AISafety [3/4]
#ml #testing #models #biases #errors #tabular #llms #aisafety
🙌 Join us for our next webinar on how to scan and #test #AI models to detect risks of #biases, #performance issues, and #errors across various types of models, from #tabular to #LLMs.
👉 Join https://gisk.ar/43xyCy7
[1/5]
#test #ai #biases #performance #errors #tabular #llms
PyIceberg: Python Development Setup
This video will walk you through the steps required to set up the Python development environment for PyIceberg. We will set up a local instance of Spark, Rest catalog, and MinIO for querying an actual table. This makes it easy to do interactive development and test everything end to end.
#iceberg #python #pyiceberg #tabular #minio #spark #datalake #datalakehouse #pyarrow
https://youtu.be/D0HJuB0uSio
#iceberg #python #pyiceberg #tabular #minio #spark #DataLake #datalakehouse #pyarrow
Ask the Iceberg Experts talks to Deniz Parmaksiz, a Sr. Machine Learning Engineer at Insider about how the company moved from #Hive to #Iceberg. Deniz talks about some of the techniques they used and the performance benefits they saw as well as space and cost savings on #AWS.
#iceberg #datalake #insider #hive #tabular
https://youtu.be/Gi24ls94Qrc
#hive #iceberg #aws #DataLake #insider #tabular
This episode of "Ask the Iceberg Experts" is pleased to have Jack Ye, a Senior Software Engineer at Amazon for Athena. We talked to Jack about all of the integrations with Apache Iceberg announced in 2022, which is now part of AWS.
#iceberg #DataLake #aws #tabular #jackye
Our latest "Ask the Iceberg Experts" episode asks how to migrate or convert from Hive to Iceberg. As Iceberg co-creator, co-founder, and Head of Engineering at Tabular, Daniel Weeks was the perfect person to ask.
https://youtu.be/E0tDKSdZ_lI
#iceberg #datalake #datalakehouse #tabular #apacheiceberg #hive #danielweeks
#iceberg #DataLake #datalakehouse #tabular #apacheiceberg #hive #danielweeks
One of our co-founders, Jason Reid, was recently on the "Some Engineering" podcast to talk about all things #ApacheIceberg.
#iceberg #datalake #datalakehouse #tabular
https://some.engineering/podcasts/2023/01/12/what-is-apache-iceberg
#apacheiceberg #iceberg #DataLake #datalakehouse #tabular
In the first episode of "Ask the Iceberg Experts" for 2023, we talk about the very exciting REST catalog for Iceberg, with Iceberg co-creator, co-founder, and Head of Engineering at Tabular, Daniel Weeks.
https://youtu.be/0o7IDERLD8c
#iceberg #datalake #datalakehouse #tabular #apacheiceberg
#iceberg #DataLake #datalakehouse #tabular #apacheiceberg
This time on "Ask the Iceberg Experts", we talk about how to choose the right catalog for your data lake files with Iceberg co-creator, co-founder, and Head of Engineering at Tabular, Daniel Weeks.
https://youtu.be/G2YMCPdQfgM
#iceberg #datalake #datalakehouse #tabular
#iceberg #DataLake #datalakehouse #tabular
"From tabular data to knowledge graphs: A survey of semantic table interpretation tasks and methods" ... the largest up-to-date survey on #semantic #tabular #data #interpretation is now published, https://doi.org/10.1016/j.websem.2022.100761 #science
#semantic #tabular #data #interpretation #science
Perhaps useful if you need to deal with #tabular / #spreadsheet data from the #terminal / #CLI
#tabular #spreadsheet #terminal #cli
#DZone #BigDataZone "RION - A Fast, Compact, Versatile Data Format " #BigData #RION #Tabular #Versatile #Compact #Fast #Format #Data ... https://dzone.com/articles/rion-a-fast-compact-versatile-data-format
#dzone #bigdatazone #bigdata #rion #tabular #versatile #compact #fast #format #data