I can't reproduce the bug. ¯\_(ツ)_/¯
Neither in #CsvDiff ...
https://gitlab.com/janriemer/csv-diff/-/commit/0ac6ba724656dbcc620eefda3f8ec5d8064134d6
...nor in #qsv
https://github.com/jqnatividad/qsv/pull/1297
My assumption is that they forgot to specify the option --right-delimiter (or --left-delimiter, respectively) when executing `qsv diff`:
https://github.com/jqnatividad/qsv/issues/1258#issuecomment-1712842883
Anyway, we now have additional tests in csv-diff and qsv, so definitely a win, regardless of the outcome! 🎉
@floriann FYI
#CSVDiff #QSV #Rust #rustlang #OpenSource #notreproducible
If you want to know how to provide a large resource (such as an owned String) to a criterion benchmark, you can use the `iter_batched` method:
https://docs.rs/criterion/latest/criterion/struct.Bencher.html#method.iter_batched
See an example of this in #CsvDiff
https://gitlab.com/janriemer/csv-diff/-/blob/main/benches/bench_csv_diff.rs#L67
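To illustrate the idea behind `iter_batched`, here is a minimal std-only sketch. It is not criterion's implementation and not the code from the csv-diff benchmark. The names `time_with_setup`, `make_csv` and `count_lines` are hypothetical and only exist for this example. The point it shows: a setup closure builds a fresh owned input outside the timed region, and only the routine that consumes the input is measured.

```rust
use std::time::{Duration, Instant};

/// Hypothetical stand-in for the pattern behind criterion's `iter_batched`:
/// `setup` creates a fresh owned input per iteration (not timed),
/// `routine` takes ownership of it (timed).
fn time_with_setup<I, O>(
    iterations: u32,
    mut setup: impl FnMut() -> I,
    mut routine: impl FnMut(I) -> O,
) -> Duration {
    let mut total = Duration::ZERO;
    for _ in 0..iterations {
        let input = setup(); // building the large resource is not measured
        let start = Instant::now();
        let output = routine(input); // only this call is measured
        total += start.elapsed();
        drop(output); // dropping the result is not measured either
    }
    total
}

/// Hypothetical large resource: an owned, CSV-like String.
fn make_csv(rows: usize) -> String {
    let mut csv = String::from("id,name\n");
    for i in 0..rows {
        csv.push_str(&format!("{i},row{i}\n"));
    }
    csv
}

/// Hypothetical routine that consumes the owned String.
fn count_lines(csv: String) -> usize {
    csv.lines().count()
}
```

With criterion itself, the same shape would roughly be `b.iter_batched(|| make_csv(1_000), count_lines, BatchSize::LargeInput)` inside a `bench_function` closure, where `b` is the `Bencher`. Calling the sketch looks like `time_with_setup(10, || make_csv(1_000), count_lines)`.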
3/3
#CSVDiff #Rust #rustlang #performance #benchmark #aiisnotintelligent
Not sure where this will lead, but it sounds fun and exciting, so let's try! :awesome: :rust: :ferris:
Yay! Sorting the #csv diff result by columns has just been merged into #qsv! 🥳
#data #CLI #dataScience #CSVDiff #QSV #csv
A new version of csv-diff is out (v0.1.0-beta.2) 🎉
https://lib.rs/crates/csv-diff
This version adds a method that allows you to sort your diff result by columns (it was already possible to sort by lines).
See the changelog for an example:
https://gitlab.com/janriemer/csv-diff/-/blob/8642a8a7ba14e22d076cee8c3f690c17f41d7528/CHANGELOG.md#010-beta2-19-february-2023
Sorting by columns will soon be integrated into qsv, the #CSV toolkit:
https://github.com/jqnatividad/qsv/issues/714
Thank you @jqnatividad for the idea of this feature! 💚
#CLI #QSV #dataScience #CSVDiff #rustlang #Rust #csv
@hyde Also check out `qsv`. 🙂
It's an actively maintained fork of xsv (xsv is not maintained anymore).
qsv is _very active_ in development.
And a shameless plug at the end 😁
Just a few days ago, `csv-diff` got merged:
https://github.com/jqnatividad/qsv/pull/711
csv-diff is a crate for comparing CSVs with ludicrous speed:
https://gitlab.com/janriemer/csv-diff
So the new command `qsv diff` is now the fastest #CSV differ in the world! 🚀
#rustlang #Rust #CLI #data #CSVDiff #BlazinglyFast #csv
Announcement 🎉 🥳
csv-diff will soon be integrated into qsv, the CSV toolkit! 🎉 :ferris:
PR:
https://github.com/jqnatividad/qsv/pull/711
Comparing the Majestic Million dataset with 1,000,000 rows x 12 columns takes less than 800 ms and only about 150 MB of RAM!
With this, it is the fastest #CSV differ in the world!🚀
See the following svg recording for a demo:
csv-diff:
https://gitlab.com/janriemer/csv-diff
#Rust #RustLang #Data #Diff #CsvDiff #Difference #Performance #DataScience #Oxidization
🥳 A new version of csv-diff has just been released! 🚀
https://docs.rs/csv-diff/latest/csv_diff/
csv-diff is the fastest CSV-diffing library in the world - written in #Rust
It can compare two CSVs with 1,000,000 rows x 9 columns each in < 600 ms!
Note that this is still a beta release and the library itself is still very young.
#RustLang #Release #CSV #CSVDiff #Performance #DataScience #Data #Diff #Difference #OpenSource #Crate