PipeRider · @piperider
160 followers · 42 posts · Server fosstodon.org

Q. What's the best thing about open-source software?

A. You can modify it and add your own features❀️

That's exactly what Alex, a senior data engineer at domain.com.au, did with PipeRider.

Alex added features specific to his data migration use-case of comparing two datasets on the row-level, with tolerance

youtube.com/watch?v=SSoFyWTAYG

#opensource #dataquality #datareliability #DataLineage #dataops #datatools #dataviz

Last updated 2 years ago

PipeRider · @piperider
157 followers · 39 posts · Server fosstodon.org

The lineage graph in dbt docs is great, but it’s static and represents only one state of your data.

Lineage Diff in PipeRider shows you exactly which nodes have changed in your branch code compared to main.

Lineage Diff shows:

πŸ†• New nodes
♻️ Changed nodes
πŸ‘Š Impacted nodes
πŸ“Š Full lineage graph and change-only views
πŸ“‹ Extra stats: row count and execution time

Read more about how to get Lineage Diff:

medium.com/inthepipeline/dbt-d

#dataquality #dataviz #datareliability #DataLineage #dataengineering

Last updated 2 years ago

PipeRider · @piperider
156 followers · 38 posts · Server fosstodon.org

We've rebooted our Discord community ♻️

If you're looking for:

πŸ›Ÿ Help using PipeRider
πŸ“° Info on new releases
πŸ’¬ General chat about data

Join the community here:

piperider.io/discord

#datacommunity #discord #dataquality #datareliability #dbt #codereviewfordata

Last updated 2 years ago

PipeRider · @piperider
155 followers · 36 posts · Server fosstodon.org

The latest issue of 'In the Pipeline', the PipeRider newsletter, is coming very soon.

Make sure you don't miss out by signing up!

app.us1.list-manage.com/subscr

Featuring info about:

- New PipeRider features
- Data best practices
- Guest interviews
- Interesting data-related content

.

#datatools #dataops #dataquality #datareliability #datanewsletter #dbt #piperider

Last updated 2 years ago

PipeRider · @piperider
155 followers · 36 posts · Server fosstodon.org

The PipeRider Community Office Hours for June 20th is now online:

youtube.com/watch?v=6scbaNMfXS

- New features in PipeRider 0.27

- Lineage Diff in PipeRider Cloud!

- Special guest Spencer Ellinor from Sudo Labs

Timestamps and links in the video description

#dataquality #datareliability #piperider #dataviz #dbt #dataops #lineagediff #codereviewfordata

Last updated 2 years ago

PipeRider · @piperider
155 followers · 34 posts · Server fosstodon.org

PipeRider now has a dedicated tools channel on the dbt Slack!

Join the dbt Slack here:
getdbt.com/community/join-the-

Then, either search for the -piperider channel, or follow this link:
getdbt.slack.com/archives/C05C

See you there πŸ’ͺ

#tools #dbt #dataquality #dataops #opensource #datareliability #datatools #codereviewfordata #piperider

Last updated 2 years ago

PipeRider · @piperider
154 followers · 33 posts · Server fosstodon.org

How do you track down and explore data changes in your dbt project?

One way is to explore rows that fall outside previous boundaries

Check out this upcoming PipeRider feature that highlights a change, and shows the SQL you need to find the affected rows

#dbt #dataquality #eda #datatviz #sql #dataops #datareliability #dataengineering #analyticsengineering

Last updated 2 years ago

PipeRider · @piperider
154 followers · 32 posts · Server fosstodon.org

You already use dbt to empower your data modelling workflow, now you need a way to understand how your code changes affect data

That's where "code review for data" comes in

Zero-config dbt integration and merge with confidence πŸ’―

Get started today at
piperider.io

#dbt #dataquality #datareliability #opensource #dataops #dataviz #dataprofile #datatesting #piperider

Last updated 2 years ago

PipeRider · @piperider
154 followers · 31 posts · Server fosstodon.org

PipeRider 0.26.0 has just been released πŸ“’

'PipeRider compare' is a powerful command, so we've included two options to guide you through using it:

--dry-run
will show each command that will be run

--interactive
will guide you step by step

More info:
github.com/InfuseAI/piperider/

To update just run:
pip install -U piperider

#dataquality #piperider #datareliability #dataops #datadiff #dbt

Last updated 2 years ago

PipeRider · @piperider
153 followers · 30 posts · Server fosstodon.org

Hi all!

PipeRider 0.25.0 has just been release with an increased focus on dbt integration.

You no longer need to initialize PipeRider in dbt projects (piperider init).

To get started in a new project, all you need to do is:

1. Install PipeRider πŸ‘©β€πŸ’»
2. Tag your models 🏷️
3. Run PipeRider πŸƒβ€β™‚οΈ
4. Enjoy rich data profiling reports and improve your code review process πŸ“Š

Find out more:
medium.com/inthepipeline/zero-

#dataquality #dataengineering #dataops #datareliability #analyticsengineering

Last updated 2 years ago

PipeRider · @piperider
148 followers · 23 posts · Server fosstodon.org

PipeRider 0.21.0 it out now with the following main updates:

- The compare command now uses 'three-dot' compare (to compare against with main at the point at which your branch was made)

- PipeRider Cloud supports multiple workspaces

Get the latest version:

github.com/InfuseAI/piperider

#dbt #dataquality #datareliability #dataprofile #dataviz #dataops #dataengineer #dataengineering #analyticsengineering #AnalyticsEngineer

Last updated 2 years ago

PipeRider · @piperider
146 followers · 22 posts · Server fosstodon.org

Are you using dbt in production and managing deployment with CI?

We ran a workshop last week as part of the Data Engineering Zoomcamp:

Understand the impact of data model changes in dbt with PipeRider

youtube.com/watch?v=O-tyUOQccS

#dataengineering #analyticsengineering #dbt #dataquality #datareliability

Last updated 3 years ago

PipeRider · @piperider
140 followers · 21 posts · Server fosstodon.org

πŸ“’ PipeRider 0.18.0 is out now and our support is even better!

- dbt defined metrics in HTML reports

- Visualize metric differences between data profiles

- Metric comparison summary in Markdown to paste into your pull request comment

Start your "code review for data projects" now:

github.com/InfuseAI/piperider

#dbt #dataquality #datareliability #dataops #dataobservability #opensource #snowflake #datawarehouse #dataengineering

Last updated 3 years ago

Data Dave · @datadave
212 followers · 37 posts · Server data-folks.masto.host

It was Groundhog day today, so it seems the perfect time to share this article with a gif I made from one of my fav movies:

How to detect schema changes in Snowflake:

medium.com/infuseai/how-to-det

So you won't get caught off guard by the same issues again and again :)

#GroundHogDay #Snowflake #database #DataQuality #datareliability

Last updated 3 years ago

PipeRider · @piperider
128 followers · 20 posts · Server fosstodon.org

PipeRider 0.16.0 is out now with the following improvements and new features:

- BigQuery repeated fields are now supported!

- PipeRider will now profile database 'views' (easily enable in your project's config.yml)

- Automatically open reports in your default browser when running PipeRider CLI

Easily upgrade with:

pip install -U piperider

Read more:

github.com/InfuseAI/piperider/

#dataquality #datareliability #dataprofiler #dataviz #datavisualization #datareport #database #dataops #eda

Last updated 3 years ago

PipeRider · @piperider
92 followers · 19 posts · Server fosstodon.org

dbt state is now supported from PipeRider 0.14

This means you can profile and run data assertions on only modified models

Read more, or check out the video below

Article:
blog.piperider.io/data-reliabi

Video demo:
youtube.com/watch?v=2J2Cu84Hon

#opensource #dataquality #datareliability #dataengineering #dataengineer #dataobservability #dbt

Last updated 3 years ago

PipeRider · @piperider
90 followers · 18 posts · Server fosstodon.org

Watch out for that schema change, it's a doozy!

You probably don't control upstream tables, so having some way to alert you when a table schema changes can save you time and effort.

Using Snowflake for an example, I made some major changes to a table and showed how they can be detected with :

blog.infuseai.io/how-to-detect

#piperider #snowflake #dataengineering #dataquality #datareliability #dataops #dataobservability #datawarehouse #elt

Last updated 3 years ago

PipeRider · @piperider
90 followers · 18 posts · Server fosstodon.org

I've been playing with schema change detection in PipeRider the last few days making an article for users

If you're dealing with tables updating and want to keep track of changes and other issues then check it out!

πŸ”“ It's open-source and ready to go

⏩ Quick Start:
docs.piperider.io/cli/quick-st

⭐ Star us on GitHub if you love Data Quality :ablobcatheart:
github.com/infuseai/piperider

Supports

#snowflake #dataquality #datareliability #bigquery #redshift #duckdb #csv #sqlite #parquet

Last updated 3 years ago

PipeRider · @piperider
87 followers · 16 posts · Server fosstodon.org
cgebert · @cgebert
14 followers · 5 posts · Server jawns.club

Reading through PipeRider documentation. docs.piperider.io

These profiling reports based on your test results are very cool. πŸ§ͺ

Plus there’s an interactive sample included too: piperider-github-readme.s3.ap-

#data #dbt #dataprofiling #datareliability

Last updated 3 years ago