Tabular · @tabular
69 followers · 139 posts · Server data-folks.masto.host

Rui Li of Bilibili Group has written a very informative blog on how Bilibili built an OLAP with . They have over 1,000 tables that comprise over 10PB of data, and a daily increment of 75TB. is serving over 200,000 queries a day in their system with an average response time of 5 seconds. It's a pretty impressive setup.

medium.com/@lirui.fudan/how-bi

#datalakehouse #apacheiceberg #iceberg #trino

Last updated 1 year ago

Tabular · @tabular
66 followers · 115 posts · Server data-folks.masto.host

Have you read this tutorial from Ryan Blue yet that shows you how to use Trino Software Foundation with for data warehousing?

.

tabular.io/tutorials/using-tri

#apacheiceberg #trino #DataLake #datawarehouse #dataengineering

Last updated 1 year ago

Delta Lake · @deltalakeoss
40 followers · 45 posts · Server social.lfx.dev

Explore the latest advances in leading projects and industry technologies includes , , , dbt, Presto/Trino, DuckDB & much more at - June 26-29!

⭐ Use code is ETLINUX400 (expires June 2) to save $400 off the regular price of the full conference pass.

Register here ➡️ dbricks.co/3lvO1hz

#opensource #deltalake #mlflow #pytorch #dataaisummit #trino #oss #presto #duckdb #sanfrancisco

Last updated 1 year ago

Maxim Syomochkin · @msemochkin
60 followers · 310 posts · Server pkm.social

God bless the authors (and the companies that sponsor them) who put such useful books into the public domain!
It may seem like there is nothing new to read, but the information collected in a coherent and structured form does a great job of organizing knowledge in the mind!

amazon.com/Trino-Definitive-Gu

#trino #book #reading

Last updated 1 year ago

Tabular · @tabular
61 followers · 81 posts · Server data-folks.masto.host

Do you use and wonder how to try it out with Tabular? Wonder no longer. This 2-minute video walks you through our Trino wizard to get you quickly connected.
.

youtu.be/g_MHTFTO75I

#trino #dataengineering #DataLake #datalakehouse #iceberg #apacheiceberg

Last updated 1 year ago

Data Mesh is an architecture for decentralized data storage that enables domain teams to utilize the storage technology of their choice. Microsoft provides Trino, a highly parallel and scalable query engine, as a managed service on Azure HDInsight for Data Mesh architectures. techcommunity.microsoft.com/t5

#datamesh #trino #azurehdinsight

Last updated 2 years ago

Tabular · @tabular
49 followers · 39 posts · Server data-folks.masto.host

The folks at have a clever video showing a solution with and .
youtu.be/yaxPEWRpEzc

#trino #minio #iceberg

Last updated 2 years ago

Tabular · @tabular
48 followers · 34 posts · Server data-folks.masto.host

Great talk from SK Telecom from the recent summit, and their journey to from .

trino.io/blog/2022/12/19/trino

#trino #iceberg #hive

Last updated 2 years ago

Matt "msw" Wilson · @msw
1282 followers · 260 posts · Server mstdn.social

"On November 21, 2022, announced its upstream contributions to
, which improves query performance when accessing CSV and JSON data formats."
aws.amazon.com/blogs/storage/r

#trino #OpenSource #AWS

Last updated 2 years ago

Pat Patterson · @metadaddy
154 followers · 126 posts · Server fosstodon.org

We've been using at both to experiment with as storage, and to query our data set. I wrote up our experience in a blog post: backblaze.com/blog/querying-a-

#trino #backblaze #BackblazeB2 #datalake #DriveStats

Last updated 2 years ago

Monica Miller · @monimiller
355 followers · 193 posts · Server data-folks.masto.host

now that is becoming mainstream.... telling everyone about the coolest federated query engine

#data #trino #federated

Last updated 2 years ago

Monica Miller · @monimiller
355 followers · 193 posts · Server data-folks.masto.host

Hi fellow tooters👋 I'm Monica. As I participate in one of the most interesting social experiments ever and migrate from one social platform to another, I thought I'd introduce myself.

I'm a former data engineer who this year turned developer advocate at . Yay .

Dog mom. Reality TV aficionado. Office mate to @emiller

#trino #starburst

Last updated 2 years ago

Parliamo di news! · @parliamodinews
16 followers · 87657 posts · Server masthead.social
Kirill Zh · @kirill
204 followers · 957 posts · Server s.zholnay.name

Два подхода к Data Warehouse на 2-3 и 120 IT-ков:
- habr.com/ru/post/593809/
- habr.com/ru/company/mediascope

Мой стек позволяет обрабатывать тот же объем данных (11млрд/мес), что и компании 2, хоть и не так глубоко, но на серваке за $200/мес и $0 за ПО

- Витрина вместо BI: Zeppelin + R/Python
- Lake: file.gz + S3
- ETL: dataiku dss
- Процессинг: NiFi
- DB: clickhouse
- Doc/беcсхемное: ArangoDB
- Агрегация разных баз: Trino?

#dwh #bigdata #datalake #prestodb #trino #clickhouse #disworks

Last updated 3 years ago