R Tyler Croy 🦀 · @rtyler
512 followers · 1117 posts · Server hacky.town

I will be joining the #DeltaLake community at Data and AI Summit this year, and I hope you will too!

Here are some of my recommended sessions:

buoyantdata.com/blog/2023-05-1

#DeltaLake

Last updated 2 years ago

R Tyler Croy 🦀 · @rtyler
497 followers · 1002 posts · Server hacky.town

We are preparing a new release of #DeltaLake for #rustlang which includes a number of fixes but, most importantly, an upgrade of Arrow and DataFusion.

There are so many exciting things going on in the rust data processing ecosystem!

#DeltaLake #rustlang

Last updated 2 years ago

R Tyler Croy 🦀 · @rtyler
459 followers · 965 posts · Server hacky.town

I've been taking more meetings with some startups and open source orgs lately to offer guidance on the open source data or infra landscape.

In exchange I have been asking for support in my fundraising (giving.aidslifecycle.org/parti)

If I can help you navigate the #aws, #rustlang, or #DeltaLake ecosystems, let's chat! 👇

cal.com/rtylercroy

#AIDSLifeCycle #aws #rustlang #DeltaLake

Last updated 2 years ago

R Tyler Croy 🦀 · @rtyler
457 followers · 918 posts · Server hacky.town

Doing that Tiger Woods style fist pump alone in my office.

After weeks of off-and-on hacking, and some help from another community member, I have a small and fast Lambda to stream data through SQS into a Delta table.

#DeltaLake

Last updated 2 years ago

· @twitter
1 followers · 42424 posts · Server mstdn.skullb0x.io

Referenced link: hubs.la/Q01HxY400
Originally posted by The Linux Foundation / @linuxfoundation@twitter.com: twitter.com/linuxfoundation/st

🗓️ Thursday, March 23rd
🕝 10:00AM PST
👥 Robert Thompson and Geoff Freeman, hosted by @dennylee

🦀 D3L2: Implementing a Data Lakehouse for Improved #datascience and #analytics

RSVP here ➡️ hubs.la/Q01HxY400

#datascience #analytics #opensource #DeltaLake #lakehouse #tmobile

Last updated 2 years ago

· @twitter
1 followers · 42411 posts · Server mstdn.skullb0x.io

Referenced link: hubs.la/Q01HvKZS0
Originally posted by The Linux Foundation / @linuxfoundation@twitter.com: twitter.com/linuxfoundation/st

The #DeltaLake Python 0.8.0 release is here! 😁🦀 In the write_deltalake function you can use mode='overwrite' in combination with partition_filters to overwrite part of a Delta Lake table.

View release notes ➡️ hubs.la/Q01HvKZS0

#DeltaLake

Last updated 2 years ago

R Tyler Croy 🦀 · @rtyler
452 followers · 900 posts · Server hacky.town

You can now follow the #DeltaLake project in the fediverse! 🥳

social.lfx.dev/@deltalakeoss

#DeltaLake

Last updated 3 years ago

R Tyler Croy 🦀 · @rtyler
452 followers · 900 posts · Server hacky.town

I wrote a blog post sharing some example code on how to write #DeltaLake with #rustlang.

Let's all start building data pipelines in Rust!

buoyantdata.com/blog/2023-02-0

#DeltaLake #rustlang

Last updated 3 years ago

· @twitter
1 followers · 39842 posts · Server mstdn.skullb0x.io

Referenced link: hubs.la/Q01BT4JW0
Originally posted by The Linux Foundation / @linuxfoundation@twitter.com: twitter.com/linuxfoundation/st

We are excited to share that Delta Lake users can now use Conda natively to manage their delta-spark dependency!

Try it out today: hubs.la/Q01BT4JW0

@DeltaLakeOSS

#Conda #DeltaLake #opensource

Last updated 3 years ago

conda · @conda
46 followers · 38 posts · Server fosstodon.org

RT @DeltaLakeOSS
We are excited to share #DeltaLake users can now use Conda natively to manage their delta-spark dependency! 🦀 Try it out today ➡️ anaconda.org/conda-forge/delta

#DeltaLake #opensource #dataengineering #conda #linuxfoundation #spark

Last updated 3 years ago

R Tyler Croy 🦀 · @rtyler
452 followers · 900 posts · Server hacky.town

I stepped into a podcast a few weeks ago to discuss the creation of delta-rs, which brings native #DeltaLake support to #rustlang and #python.

youtube.com/watch?v=2jgfpJD5D6

#DeltaLake #rustlang #python

Last updated 3 years ago

· @twitter
1 followers · 35962 posts · Server mstdn.skullb0x.io

Referenced link: hubs.la/Q01yFtPP0
Originally posted by The Linux Foundation / @linuxfoundation@twitter.com: twitter.com/linuxfoundation/st

We are excited to announce the release of #DeltaLake 2.0.2 on Apache Spark 3.2! 🎊 This release contains important bug fixes and a few high-demand usability improvements over 2.0.1.

View the release notes: hubs.la/Q01yFtPP0

#DeltaLake

Last updated 3 years ago

· @twitter
1 followers · 34802 posts · Server mstdn.skullb0x.io

Referenced link: hubs.la/Q01xVX6S0
Originally posted by The Linux Foundation / @linuxfoundation@twitter.com: twitter.com/linuxfoundation/st

AWS Glue crawlers now have enhanced support for #DeltaLake tables, increasing operational efficiency to extract meaningful insights from analytics services such as Amazon #Athena, Amazon EMR, and Glue.

Learn more ➡️ hubs.la/Q01xVX6S0

@DeltaLakeOSS

#DeltaLake #Athena #aws #opensource

Last updated 3 years ago

R Tyler Croy 🦀 · @rtyler
452 followers · 900 posts · Server hacky.town

I did some streaming tonight, and while I didn't get very far in building some lambda ingestion code, I do believe I found and documented a bug in a piece of software that is handling millions of messages each day 🎉

#rustlang #DeltaLake

Last updated 3 years ago

Jacek Laskowski · @jaceklaskowski
98 followers · 18 posts · Server fosstodon.org

@oleg Mostly source code (so I can learn even more at the same time). Worked great with #apachespark #DeltaLake #apachekafka (as they are all written in #scala). Thinking of #dask as it's close to Spark but written in #python, which I'd like to know better. HTH

#apachespark #DeltaLake #apachekafka #scala #dask #python

Last updated 3 years ago

R Tyler Croy 🦀 · @rtyler
452 followers · 900 posts · Server hacky.town

Ended up cleaning up some RecordBatchWriter code and posting this example, which demonstrates simply writing data to #DeltaLake in #rustlang.

github.com/buoyant-data/demo-r

#DeltaLake #rustlang

Last updated 3 years ago

R Tyler Croy 🦀 · @rtyler
452 followers · 900 posts · Server hacky.town

I have some time this evening, and I'm debating whether I should live stream some #DeltaLake and #rustlang coding, so long as the power holds out.

#DeltaLake #rustlang

Last updated 3 years ago

R Tyler Croy 🦀 · @rtyler
452 followers · 900 posts · Server hacky.town

@bflipp there are no protocol or transaction file changes with 2.x, so you can safely run a job using the newer version just to run OPTIMIZE if your existing architecture supports that.

What kind of write workloads do ya have floating about?

#DeltaLake

Last updated 3 years ago