Jan :rust: :ferris: · @janriemer
598 followers · 4677 posts · Server floss.social

I can't reproduce the bug. ¯\_(ツ)_/¯

Neither in ...
gitlab.com/janriemer/csv-diff/

...nor in
github.com/jqnatividad/qsv/pul

My assumption is that they forgot to specify the option --right-delimiter (or --left-delimiter, respectively) when executing `qsv diff`:
github.com/jqnatividad/qsv/iss

Anyway, we now have additional tests in csv-diff and qsv, so definitely a win, regardless of the outcome! 🎉

@floriann FYI

#CSVDiff #QSV #Rust #rustlang #OpenSource #notreproducible

Last updated 1 year ago

Jan :rust: :ferris: · @janriemer
476 followers · 2049 posts · Server floss.social

If you want to know how to provide a large resource (such as an owned String) to a criterion benchmark, you can use the `iter_batched` method:

docs.rs/criterion/latest/crite

See an example of this in

gitlab.com/janriemer/csv-diff/
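
To make the idea behind `iter_batched` concrete, here is a minimal, std-only sketch. It is not criterion's implementation and not the code from the csv-diff repository. `time_batched`, `make_csv` and `count_records` are hypothetical names made up for illustration. The point it shows: the setup closure builds a fresh owned input for every iteration outside the timed region, and only the routine that consumes the input is measured. In criterion itself the corresponding call has the shape `b.iter_batched(setup, routine, BatchSize::SmallInput)`.

```rust
use std::time::{Duration, Instant};

/// Hand-rolled stand-in for the idea behind criterion's `iter_batched`
/// (an illustration only, not criterion's actual code):
/// `setup` creates a fresh input per iteration and is NOT timed,
/// `routine` takes the input by value and IS timed.
fn time_batched<I, O>(
    iterations: u32,
    mut setup: impl FnMut() -> I,
    mut routine: impl FnMut(I) -> O,
) -> (Duration, Vec<O>) {
    let mut total = Duration::ZERO;
    let mut outputs = Vec::new();
    for _ in 0..iterations {
        let input = setup(); // untimed: build the large resource
        let start = Instant::now();
        let output = routine(input); // timed: consumes the owned input
        total += start.elapsed();
        outputs.push(output); // kept so results are dropped outside the timed region
    }
    (total, outputs)
}

/// Hypothetical "large resource": an owned CSV string with `rows` data rows.
fn make_csv(rows: usize) -> String {
    let mut csv = String::from("id,name\n");
    for i in 0..rows {
        csv.push_str(&format!("{},row{}\n", i, i));
    }
    csv
}

/// Hypothetical routine under test that needs ownership of its input.
fn count_records(csv: String) -> usize {
    csv.lines().skip(1).count() // skip the header line
}
```

Calling `time_batched(3, || make_csv(1_000), count_records)` measures only the three `count_records` calls. The cost of building the three strings never enters the total, which is the reason to reach for a batched setup when the routine needs an owned value.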

3/3

#CSVDiff #Rust #rustlang #performance #benchmark #aiisnotintelligent

Last updated 2 years ago

Jan :rust: :ferris: · @janriemer
460 followers · 1764 posts · Server floss.social

Not sure where this will lead, but it sounds fun and exciting, so let's try! :awesome: :rust: :ferris:

#CSVDiff

Last updated 2 years ago

Jan :rust: :ferris: · @janriemer
460 followers · 1737 posts · Server floss.social

Yay! Sorting the diff result by columns has just been merged into qsv! 🥳

github.com/jqnatividad/qsv/pul

#data #CLI #dataScience #CSVDiff #QSV #csv

Last updated 2 years ago

Jan :rust: :ferris: · @janriemer
453 followers · 1625 posts · Server floss.social

A new version of csv-diff is out (v0.1.0-beta.2) 🎉

lib.rs/crates/csv-diff

This version adds a method that allows you to sort your diff result by columns (sorting by lines was already possible).

See the changelog for an example:
gitlab.com/janriemer/csv-diff/
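
For readers who just want the gist of "sorting by columns", here is a rough std-only sketch. It does not use the csv-diff API. `sort_rows_by_columns` and `row` are hypothetical helpers, and the byte-wise string comparison is an assumption rather than a statement about how csv-diff orders fields. Rows are ordered by the first listed column, with ties broken by the next one, and so on.

```rust
use std::cmp::Ordering;

/// Hypothetical helper: sorts rows by the given column indices, in priority order.
/// Fields are compared as plain strings (an assumption for this sketch);
/// a row that lacks a column is treated as having an empty field there.
fn sort_rows_by_columns(rows: &mut [Vec<String>], columns: &[usize]) {
    rows.sort_by(|a, b| {
        columns
            .iter()
            .map(|&c| {
                let field_a = a.get(c).map(String::as_str).unwrap_or("");
                let field_b = b.get(c).map(String::as_str).unwrap_or("");
                field_a.cmp(field_b)
            })
            // The first column that differs decides the order.
            .find(|ord| *ord != Ordering::Equal)
            .unwrap_or(Ordering::Equal)
    });
}

/// Hypothetical helper to build a row from string literals.
fn row(fields: &[&str]) -> Vec<String> {
    fields.iter().map(|s| s.to_string()).collect()
}
```

With rows `["2","b","x"]`, `["1","b","y"]` and `["3","a","z"]`, sorting by columns `[1, 0]` puts the `a` row first and then orders the two `b` rows by their first field.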

Sorting by columns will soon be integrated into qsv, the CSV toolkit:
github.com/jqnatividad/qsv/iss

Thank you @jqnatividad for the idea of this feature! 💚

#CLI #QSV #dataScience #CSVDiff #rustlang #Rust #csv

Last updated 2 years ago

Jan :rust: :ferris: · @janriemer
422 followers · 1027 posts · Server floss.social

@hyde Also check out `qsv`. 🙂

It's an actively maintained fork of xsv (xsv is not maintained anymore).

qsv is _very active_ in development.

And a shameless plug at the end 😁
Just a few days ago, `csv-diff` got merged:
github.com/jqnatividad/qsv/pul

csv-diff is a crate for comparing CSVs with ludicrous speed:
gitlab.com/janriemer/csv-diff

So the new command `qsv diff` is now the fastest differ in the world! 🚀

#rustlang #Rust #CLI #data #CSVDiff #BlazinglyFast #csv

Last updated 2 years ago

Jan :rust: :ferris: · @janriemer
412 followers · 915 posts · Server floss.social

Announcement 🎉 🥳

csv-diff will be integrated into qsv, the CSV toolkit soon! 🎉 :ferris:

PR:
github.com/jqnatividad/qsv/pul

Comparing the Majestic Million dataset with 1,000,000 rows x 12 columns takes less than 800 ms and only about 150 MB of RAM!
With this, it is the fastest differ in the world! 🚀

See the following svg recording for a demo:

gist.githubusercontent.com/jan

csv-diff:
gitlab.com/janriemer/csv-diff

#oxidization #dataScience #performance #difference #CSVDiff #diff #data #rustlang #Rust #csv

Last updated 2 years ago

Jan :rust: :ferris: · @janriemer
398 followers · 573 posts · Server floss.social

🥳 A new version of csv-diff has just been released! 🚀

docs.rs/csv-diff/latest/csv_di

csv-diff is the fastest CSV-diffing library in the world - written in Rust

It can compare two CSVs of 1,000,000 rows x 9 columns each in under 600 ms!

Note that this is still a beta release and the library itself is still very young.

#crate #OpenSource #difference #diff #data #dataScience #performance #CSVDiff #csv #release #rustlang #Rust

Last updated 2 years ago