📚🚨 Thesis alert: Lars Reckhaus explored #OSM data, focusing on #POI research.
🔍 He compared OSM data to a self-made reference dataset for infrastructures.
📊 Findings: OSM promising, additional reviews needed for specific cases. 🌐🗺️ #OSMAnalysis #dataquality
Read his blogpost here: https://heigit.org/bachelors-thesis-using-osm-for-location-analyses-of-residential-real-estate-projects-an-extrinsic-analysis-of-data-quality/
#osm #poi #osmanalysis #DataQuality
#Ireland scored 100% on the #impact dimension of #OpenDataMaturity in 2022! They offer tools to support and incentivize public bodies in measuring #reuse such as a #KPI tool to compare and monitor #DataQuality.
Read more in the report 👉 http://europa.eu/!VthKbk
🐦🔗: https://n.respublicae.eu/EU_opendata/status/1661341403845390337
#Ireland #impact #OpenDataMaturity #reuse #kpi #DataQuality #EUopendata
Do you have a proper code review process for pushing changes to data projects?
With ELT and dbt, small changes to code can have a big impact on downstream models. The last thing you want is to push a breaking change to prod 😱
Why you need code review for you data projects
by Balu Rama Chandra
https://medium.com/inthepipeline/why-you-need-code-review-for-you-data-projects-95e039083df0
#dataengineering #DataQuality #dataops #codereview #dbt #analyticsengineering
At #SETACDublin @sivanibaskaran presented an amazing approach to best account for #dataquality when conducting #mobility assessments in relation to the #PMT and #vPvM criteria. This approach can be automated!
@ZeroPM_H2020 @InfoNGI
#setacdublin #DataQuality #mobility #PMT #vpvm
Data Quality Is Getting Worse, Monte Carlo Says
https://www.datanami.com/2023/05/02/data-quality-is-getting-worse-monte-carlo-says/
#data #DataQuality
dbt users - Do you consider data warehouse costs when building and maintaining projects?
If so, what steps are you taking to keep costs down, or at least keep an eye on them?
#dbt #DataQuality #databuildtool #dataengineering #analyticsengineering #dataops
https://ideas.exlibrisgroup.com/forums/395697-leganto/suggestions/37588642-reject-input-if-doi-or-citation-source-url-is-inco #Leganto an old idea worth promoting - would like to see DOIs automatically reformatted to enable automated citation enrichment #dataQuality #LibrarySystems
#leganto #DataQuality #librarysystems
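A rough sketch of what "automatically reformatted" could mean for DOIs: pull the bare identifier out of whatever was pasted in (a resolver URL, a `doi:` prefix, stray punctuation) and lowercase it, since DOIs are case-insensitive. `normalize_doi` and the regex are hypothetical illustrations, not anything Leganto or Ex Libris actually provides.

```python
import re
from typing import Optional

# Hypothetical pattern: "10.", a 4-9 digit registrant code, "/", then a suffix.
DOI_PATTERN = re.compile(r"10\.\d{4,9}/\S+")


def normalize_doi(raw: str) -> Optional[str]:
    """Return a bare, lowercase DOI, or None if the input contains no DOI."""
    match = DOI_PATTERN.search(raw.strip())
    if match is None:
        return None
    # Drop trailing punctuation that often gets pasted along with the DOI.
    return match.group(0).rstrip(".,;").lower()


# Example inputs are made up for illustration.
print(normalize_doi("https://doi.org/10.1000/XYZ123"))
print(normalize_doi("doi:10.1000/xyz123."))
print(normalize_doi("not a doi"))
```

Returning `None` instead of raising leaves it to the caller to decide whether to reject the input or just skip citation enrichment.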
Comparing #SodaCore with #GreatExpectations, I still can't get my head around GE's spaghetti vocabulary and confusing constructs.
You want to run a data test: why do you need to make it more complicated than
1. where's your data?
2. what tests do you want to run?
#sodacore #greatexpectations #DataQuality #datadon
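The two questions above map onto very little code. The sketch below is plain Python and deliberately uses neither SodaCore's nor Great Expectations' API; `run_checks`, the sample rows, and the check names are all made up for illustration.

```python
from typing import Callable, Dict, Iterable, List

Row = Dict[str, object]
Check = Callable[[List[Row]], bool]


def run_checks(rows: Iterable[Row], checks: Dict[str, Check]) -> Dict[str, bool]:
    """Run every named check against the data and report pass/fail per check."""
    data = list(rows)
    return {name: bool(check(data)) for name, check in checks.items()}


# 1. Where's your data? (hypothetical sample rows)
orders = [
    {"id": 1, "amount": 10.0},
    {"id": 2, "amount": None},
]

# 2. What tests do you want to run?
checks = {
    "row_count > 0": lambda rows: len(rows) > 0,
    "no missing amount": lambda rows: all(r["amount"] is not None for r in rows),
    "id is unique": lambda rows: len({r["id"] for r in rows}) == len(rows),
}

results = run_checks(orders, checks)
for name, passed in results.items():
    print(f"{'PASS' if passed else 'FAIL'}  {name}")
```

A real tool adds connections, scheduling, and reporting on top, but the core contract can stay this small: a data source plus a list of named checks.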
Do you want to get inspired by stories of high-growth countries on the #ODM #dataquality dimension? Join our webinar ‘Open Data Maturity 2022: Diving deeper into the quality dimension’ on Tuesday 18 April 14-16 CET.
More info 👉 https://europa.eu/!3tGh6w
🐦🔗: https://n.respublicae.eu/EU_opendata/status/1646090842028015616
📰Latest paper in ISPRS International Journal of Geo-Information showcasing a new project type that allows the assessment of the completeness of OSM buildings per tile:
🏗️"Assessing Completeness of #OpenStreetMap Building Footprints Using MapSwipe"
Read more:
https://heigit.org/assessing-completeness-of-openstreetmap-building-footprints-using-mapswipe/
#OpenStreetMap #bmbf #research #loki #osm #DataQuality #DisasterRiskReduction #DisasterManagement #disasters #geography #data #DataAnalytics
📆 31st of March, ongoing physical meeting at @EFSA_EU of the Hotspot analysis Project for plant health pests (HoPPi) 🌱 #DataQuality #Surveillance #SpeciesSpecific #QPRA @DAFNAE_UniPD @Unicatt @GVAivia
🐦🔗: https://n.respublicae.eu/Plants_EFSA/status/1641726876774289408
#DataQuality #surveillance #SpeciesSpecific #qpra
High-quality #statistics support more transparent and harmonized reporting on #dataquality. To promote this, the European Commission adopted a new recommendation.
Read about it in @EU_Eurostat article 👉 https://europa.eu/!GFFtFk
🐦🔗: https://n.respublicae.eu/EU_opendata/status/1639930227471310855
#statistics #DataQuality #EUopendata
Great post by Balu about how to detect common DQ issues before merging code changes to production:
RT @CKANproject: 📢 #CKANMonthlyLive happening next Wed!
Join us to hear about the City of Warsaw's #OpenDataPortal
🏢 how #CKAN is improving #DataQuality
🏢 the #City as a Platform concept
🏢 how CKAN makes #DataPublishing more efficient and accessible
🐦🔗: https://n.respublicae.eu/EU_opendata/status/1635192275339993088
#CKANMonthlyLive #OpenDataPortal #ckan #DataQuality #city #DataPublishing
Help! We’re doing an RFI for data quality vendors and I want to know who’s who out there? Ideally open core
Vendor content welcome!
I know about Great Expectations, Soda, Elementary, and Re_data
Another good episode of Monday Morning Data Chat from @joereis and Matt Housley: **Honest No-BS Data Modeling w/ Juan Sequeda**: https://anchor.fm/ternary-data/episodes/101---Honest-No-BS-Data-Modeling-w-Juan-Sequeda-e1qcrmu
Three particular bits caught my ear:
* "Everything is a graph" :)
* The pendulum between rigour and speed (https://overcast.fm/+wcMpREzek/27:41)
* A company that saw the importance of #dataquality and made 20% of everyone's bonus dependent on it (https://overcast.fm/+wcMpREzek/33:50)
#DataQuality #datamodeling #dataengineering #datadon #noxp
It was Groundhog Day today, so it seems the perfect time to share this article with a gif I made from one of my fav movies:
How to detect schema changes in Snowflake:
https://medium.com/infuseai/how-to-detect-schema-change-in-snowflake-6ffcd28c3f15
So you won't get caught off guard by the same issues again and again :)
#GroundHogDay #Snowflake #database #DataQuality #datareliability
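One generic way to catch schema changes, not necessarily the approach the linked article takes, is to snapshot column names and types on each run and diff the snapshot against the previous one. In Snowflake the snapshot could plausibly be built from `INFORMATION_SCHEMA.COLUMNS`; here it is just a dict, and `diff_schema` plus the sample columns are hypothetical.

```python
from typing import Dict, List, Tuple

Schema = Dict[str, str]  # column name -> data type


def diff_schema(old: Schema, new: Schema) -> Dict[str, list]:
    """Compare two schema snapshots and list added, removed, and retyped columns."""
    added: List[str] = sorted(set(new) - set(old))
    removed: List[str] = sorted(set(old) - set(new))
    retyped: List[Tuple[str, str, str]] = sorted(
        (col, old[col], new[col])
        for col in set(old) & set(new)
        if old[col] != new[col]
    )
    return {"added": added, "removed": removed, "retyped": retyped}


# Hypothetical snapshots from yesterday's and today's runs.
yesterday = {"ID": "NUMBER", "EMAIL": "TEXT", "CREATED_AT": "DATE"}
today = {"ID": "NUMBER", "EMAIL": "TEXT", "CREATED_AT": "TIMESTAMP_NTZ", "PLAN": "TEXT"}

changes = diff_schema(yesterday, today)
print(changes)
```

Persisting each snapshot (for example as JSON next to the pipeline run) is what turns this from a one-off comparison into an alert that fires before downstream models break.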
All the handwringing over #AI is like early #Wikipedia.
In the early 2000s, Wikipedia was considered a completely #unreliable, #unorthodox, and #unacceptable source of information.
Now I find things in Wikipedia that are far beyond anything ever in #EncyclopediaBritannica. The #dataquality and #utility are high. I read it as a starting point, and then I read #sourcematerial when I need more detail.
What AI doesn’t provide YET is #references to its content. AI simply needs a good #editor.
#editor #references #sourcematerial #utility #DataQuality #encyclopediabritannica #unacceptable #Unorthodox #unreliable #wikipedia #ai
If you build taxonomies to be published as #pdf files, you are not doing them right, regardless of the content. #machinereadability #semantics #dataquality
#pdf #machinereadability #semantics #DataQuality