Maybe it just because I've worked with a lot of data that's not customer-centric, but I'm struggling to see a lot of value in Activity Schema. Has anyone on #Datadon used it?
Part 2 of my Simplifying Data Orchestration seires is out on Shipyard's blog!
A no-nonsesne guide on *WHAT* is #DataOrchestration?
Let's simplify together! ππ€ΈββοΈ
Special thanks to: Bill Inmon, Monica Miller, Elena Dyachkova, Rez Khan, Zach Hendlin, & Steven J. Pope for the community contributions in this post.
https://www.shipyardapp.com/blog/what-is-data-orchestration/
#dataOrchestration #dataengineering #datadon
I'm leading a live product demo tomorrow (Thursday, I mean). Please join me! Come see what I do at work!
I'll help you get started with data-diff, and I'll sneak preview Datafold's VS Code extension.
data-diff helps you catch data changes that other testing frameworks will miss, by comparing every single value between dev and prod, with one simple command: data-diff --dbt.
Register here: https://www.datafold.com/virtual-hands-on-lab
Probably want some # for that.
#Fedijobs #DataScience #DataScientist #PublicCommunications #Statistics #DataVisualisations #SocSci #Humanities #USJobs #Datadon
Let's see if there are boost groups:
OK that's all I can find for you.
Good luck!
#fedijobs #datascience #datascientist #publiccommunications #statistics #datavisualisations #socsci #humanities #usjobs #datadon
@al_merose Saw your toot because of the data hashtag π... so +1 for hashtags as a method of curation.
I find the data hashtag itself is a bit 'messy', but at the same time not well used, at least on the Fosstodon federated timeline.
Personally I find the #datadon and #dataengineering hashtags more engineering-relevant π
βπ» Blogged: Using @lakeFS with #ApacheIceberg https://lakefs.io/blog/using-lakefs-with-apache-iceberg/
#datadon #opensource #dataengineering @apacheicebergdevs @tabular
#apacheiceberg #datadon #opensource #dataengineering
I'm super excited to announce I have joined the BNG sports media team to provide analytical content on a weekly basis!
You can continue to follow me on Substack for my broader hockey musings, too.
#HNOM #Hockey #HockeyDon #NHLBruins #Analytics #datadon @bostonbruinsgameday @hockey @hnom
https://blackngoldhockey.com/2023/06/welcome-to-the-bruins-stats-corner/
#hnom #hockey #hockeydon #nhlbruins #analytics #datadon
Booked flights to London for #dbtCoalesce. Last year was such a great kickoff for my data career, and Iβm excited to go back with some real experience under my belt this time #datadon
There comes a time when every open core project will disappoint you, when they will put an exciting feature behind their paid wall.
Sad to see dbt labs get there, and switching in real time: https://github.com/dbt-labs/dbt-core/discussions/6725#discussioncomment-6099860
While sin is a made-up concept, the thing Christians say about sin is 100% true when applied to working with dates in #datascience:
"Sin will take you farther than you want to go, keep you longer than you want to stay, and cost you more than you want to pay."
#datascience #rstats #data #datadon #killingmesoftlywithdates
'My normative lesson is, βHeed Marginalized People.β Fundamentally and foundationally. And, like, donβt include them necessarily in your training data, but include them in the questions that you ask at the outset, and who you think to ask about what you ought to do.
[β¦]
So to ask that question, βWho have we not thought about; whose harms, whose needs, whose voice has been, perhaps speaking, but unheeded, for a very long time? And how do we ensure that the things that they have called out as potential sites of failure, donβt go unremarked, donβt go unaddressed.'
βΈΊ
Dr. Damien P. Williams, @Wolven https://afutureworththinkingabout.com/?p=5442 #ableism #algorithms #bias #biotech #ethics #disability #facialRecognition #feminisms #Google #homophobia #machineLearning #misogyny #neurodiversity #phenomenology #racism #science #neutrality #sexism #cognition #surveillance #dataDon @ethics @dataGovernance @data #historyOfScience
#ableism #algorithms #bias #biotech #ethics #disability #facialrecognition #feminisms #google #homophobia #machinelearning #misogyny #neurodiversity #phenomenology #racism #science #neutrality #sexism #cognition #surveillance #datadon #historyofscience
I'll be writing about team topologies and data work this week. #Datadon, are you familiar with the Team Topologies book/concepts?
#Streamlit's experimental_data_editor is dead! Long live #data_editor!
Also, now you can hide the annoying index column and define sort criteria. #datadon
#streamlit #data_editor #datadon
Hey, #datadon... any thoughts on Predibase? From the folks who made Ludwig, the "declarative" approach to predictive model specification.
Best part of my job is hosting a seminar where engineers share cool things theyβre working
* CI checks for data ingestion
* MetaDiff: Database Metadata version control
* duckdb, dbplyr, and synthea for synthetic clinical data
* instant EKS cluster deployments for changes to our Airflow pipeline
βπ» Final part of my blog series on Write-Audit-Publish (WAP), in which I show in detail how to implement it using #apacheSpark, @deltalakeoss, #Minio, and @lakeFS
---
πpart 1: ππ»What is WAP? https://lakefs.io/blog/data-engineering-patterns-write-audit-publish/?utm_campaign=Social%20media%20activity&utm_source=Mastodon&utm_medium=social&utm_content=blog_rm-wap1
πpart 2: π οΈ Comparing how different tools implement WAP https://lakefs.io/blog/how-to-implement-write-audit-publish/?utm_campaign=Social%20media%20activity&utm_source=Mastodon&utm_medium=social&utm_content=blog_rm-wap2
#ApacheSpark #minio #writeauditpublish #dataengineering #datadon #opensource
π§π¨ How to Implement Write-Audit-Publish (WAP) - an exploration of how WAP can be done on @apacheicebergdevs, #apacheHudi, @deltalakeoss, @lakeFS , or #projectNessie
ππ» What is WAP?
ππ» Check out yesterday's blog: https://lakefs.io/blog/data-engineering-patterns-write-audit-publish/?utm_campaign=Social%20media%20activity&utm_source=Mastodon&utm_medium=social&utm_content=blog_rm-wap1
#apachehudi #projectnessie #dataengineering #data #writeauditpublish #datadon
Data engineers, how many hours per week do you typically spend in meetings?
#DataEngineering #Datadon