Good to see the #drillToDetail podcast with Mark Rittman has returned.
This episode with Chris Tabb was a useful chat about current & future tooling including the #modernDataStack, along with a look back at what's been done before and what we can learn from that.
#drilltodetail #moderndatastack #datadon #podcast #data #dataengineering
🗣️"The #ModernDataStack [is] probably just marketing buzz…"
Of course, it's more nuanced than that, but you can rely on @benn for a nice 🌶️soundbite ;) @benn explains his views in an approachable, clear yet detailed way. This was a really good Analytics Engineering podcast episode from late 2021 from @jthandy and @schottj.
🎧 https://overcast.fm/+w94VRRIio/
✍🏻 https://roundup.getdbt.com/p/benn-stancil-friday-night-data-fights
Apache Pinot is a column-oriented, distributed OLAP data store that supports realtime analytics with low latency. Check out this blog to see how you can spin up an Apache Pinot cluster using Docker.
Link to blog: https://lnkd.in/em4fDk7b
#apachepinot #moderndatastack #realtimeanalytics #analytics #realtime #olap #docker #devops #distributedsystems
Jason M. Lemkin shared an interesting analysis of HubSpot CRM's market share gains with tech startups at the expense of Salesforce. Across all verticals, there is a broader shift away from having CRM as THE system of record. https://buff.ly/3iBtSoX #crm #cdp #moderndatastack
I've worked with data for more than 10 years, starting as a Data Analyst and growing into an Analytics Engineer.
My experience includes working with both the #moderndatastack (Snowflake, #dbt, Fivetran) and the "older" on-prem Hadoop stack: #spark, Hive, Scala.
I speak #sql, #python and JavaScript.
Peace! ❤️
#Introduction #moderndatastack #dbt #spark #sql #python
I'm going to pin this down: it will be interesting to search all the MDS vendors' blogs to see how many times they mention data contracts.
#datacontract #datadon #moderndatastack
This week I am positively drunk on the power of the #moderndatastack #mds - mostly #meltano and #dbt.
#moderndatastack #mds #meltano #dbt
Ok who's going to "Modern Data Stack Conference" in SF in April? Is this a thing that would be worth attending?
#datadon #conferences #ModernDataStack
Two observations from my #Twexodus of the past three days:
- the #infosec and #cybersec communities are well represented and well organized here in the #fediverse
- I am having a devil of a time finding folks involved with #DataHusbandry (my own term that covers the care & feeding of the dataset and metadataset)
Where is the conversation on #DataEngineering ? #ModernDataStack ?
#DataMesh ?
I'm leavin' these hashtags here in hopes of finding kindred spirits.
#Twexodus #infosec #cybersec #fediverse #DataHusbandry #DataEngineering #moderndatastack #DataMesh
I've interviewed numerous data leaders in the past few weeks trying to learn how you build out scalable data infrastructure.
LESSON 3:
A great way to get buy-in is to start measuring the various steps in your data pipeline to understand what's not working. Leading with observability highlights the risk and urgency of data infrastructure and helps leadership tie it to business outcomes.
#data #datascience #dataengineering #dataops #sql #python #moderndatastack #cloud
I've interviewed numerous data leaders in the past few weeks trying to learn how you build out scalable data infrastructure.
LESSON 2:
It's not enough to know the best practices in data infrastructure; you also have to actively 1) find a champion to sponsor infrastructure, and 2) sell the solution within your org, whether you build or buy.
#data #datascience #dataengineering #dataops #sql #python #moderndatastack #cloud
I've interviewed numerous data leaders in the past few weeks trying to learn how you build out scalable data infrastructure.
LESSON 1:
The data industry saw a huge shift in workflows with the transition to the cloud, which made it easier than ever before to store and process data. This rush to data without guardrails is leading to intense reactivity among data professionals and thus burnout.
#data #datascience #dataengineering #dataops #sql #python #moderndatastack #cloud