What do software engineers think about data contracts? 🤔
When talking about data contracts, we naturally focus on the benefits for our data teams and improving data quality.
But at GoCardless there is more to it than that! The way we have implemented data contracts has made them a valuable and much-relied-upon tool for our software engineers.
Here are 3 things our software engineers 𝒍𝒐𝒗𝒆 about data contracts! ♥️
#datacontracts #softwareengineering
Key takeaways from #chadsanderson's Data Quality talk #datacontracts #dataengineering #dataops
In a couple of weeks I'll be speaking at the Data & AI Meetup in London about how we're driving data culture change with #DataContracts. Do come say hi if you're around!
I'm also really looking forward to the other talk, which is about #ExplainableAI.
More details and signup here 👉 https://www.meetup.com/data-ai-london/events/289808126/
I'm not convinced data contracts are for every data team. For small startups especially, they are probably over-engineering in many cases, although I can see their uses at scale. #data #dataengineering #datacontracts
What happens when data producers are responsible for fixing the data issues?
#datamesh #data #datacontracts
Hot Take: Data Quality is collaborative. Quality for thee isn't quality for me.
Contracts and schema validation should be decided jointly by producer and consumer as data pipelines are built.
The trick is to de-dup the checks and only check for the parameters that the upstream source hasn't ensured. Or modify the checks if your (downstream) source has different requirements.
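Here's a rough sketch of what that de-dup could look like in Python. Everything in it is made up for illustration: the check names, the record fields, and the idea that the upstream contract publishes its guarantees as a set of names are all assumptions, not any particular tool's API.

```python
from typing import Any, Callable, Dict, List, Set

Record = Dict[str, Any]
Check = Callable[[Record], bool]

# Hypothetical: check names the upstream producer's contract already enforces.
UPSTREAM_GUARANTEES: Set[str] = {"id_not_null", "amount_is_number"}

# Hypothetical: everything this downstream consumer needs to hold true.
# "amount_positive" is a stricter, consumer-specific requirement.
DOWNSTREAM_CHECKS: Dict[str, Check] = {
    "id_not_null": lambda r: r.get("id") is not None,
    "amount_is_number": lambda r: isinstance(r.get("amount"), (int, float)),
    "amount_positive": lambda r: isinstance(r.get("amount"), (int, float))
    and r["amount"] > 0,
    "currency_known": lambda r: r.get("currency") in {"GBP", "EUR", "USD"},
}


def remaining_checks(required: Dict[str, Check], guaranteed: Set[str]) -> Dict[str, Check]:
    """Keep only the checks the upstream source has not already ensured."""
    return {name: fn for name, fn in required.items() if name not in guaranteed}


def failed_checks(record: Record, checks: Dict[str, Check]) -> List[str]:
    """Return the sorted names of the checks that this record fails."""
    return sorted(name for name, fn in checks.items() if not fn(record))


checks_to_run = remaining_checks(DOWNSTREAM_CHECKS, UPSTREAM_GUARANTEES)
print(sorted(checks_to_run))
print(failed_checks({"id": 1, "amount": -5, "currency": "JPY"}, checks_to_run))
```

The downstream pipeline only runs the two checks upstream doesn't cover, and anything consumer-specific stays under the consumer's own name so it never gets silently dropped.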
#schema #datacontracts #contracts #metadata #data
Also
#datacontracts
#TrustworthyAI
#ethicalai
#monitoring
#logging
#spark
#ApacheSpark
#apachecassandra
#ApacheKafka
#DataLake
#datalakes
#datawarehousing
#datawarehouses