Chris Wensel · @cwensel
161 followers · 1145 posts · Server fosstodon.org

So Tessellate inherits lots of support for various data formats from Cascading
github.com/cwensel/cascading

Even though dropped Cascading support, we were able to port it over.

Now that Parquet is native to Cascading, it should be easier to add support.

This would allow to convert data as it arrives into Iceberg continuously for use in Athena or other data front-ends.

Anyone interested in a challenge?

#apacheparquet #ApacheIceberg #clusterless #aws #java

Last updated 1 year ago

Chris Wensel · @cwensel
150 followers · 1066 posts · Server fosstodon.org

A little more color on this announcement..
fosstodon.org/@cwensel/1105490

First, removed support, so I had to splice the original source into Cascading. But the ParquetScheme didn't honor type information fully. So there is a new TypedParquetScheme that has native support for JSON and Timestamps.

Second, Parquet requires the FileSystem, which means we get the wonderful S3A implementation. But we also get a 331MB jar dependency with the aws bundle.

#apacheparquet #cascading #apachehadoop

Last updated 1 year ago

· @nlamirault
0 followers · 9 posts · Server mastodon.cloud

Great ! With ... Let's go to try ...
---
RT @grafana
✨ Grafana Tempo 2.0 is finally here! ✨

Among other updates, Tempo 2.0 comes with two important new features; a new Apache Parquet backend storage format, and , a new language designed for discovering traces.
grafana.com/blog/2023/02/01/ne
twitter.com/grafana/status/162

#apacheparquet #traceql

Last updated 2 years ago

· @emauviere
57 followers · 9 posts · Server mapstodon.space

Le format devient mainstream, il a pourtant presque 10 ans. En quoi est-il devenu un successeur crédible à ?
Quels sont ses rapports avec , ou ? Comment l'utiliser dans ou ?
Je vous éclaire ici 👇 :
icem7.fr/outils/parquet-devrai

#apacheparquet #csv #ApacheArrow #duckdb #RStats #qgis

Last updated 2 years ago