home.social

#csvdiff — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #csvdiff, aggregated by home.social.

  1. RE: floss.social/@janriemer/114760

    New version of #CSVDiff is out! :awesome:

    crates.io/crates/csv-diff

    Thanks to it now using `extract_if` instead of a hacky drain-then-filter impl, you can now e.g. diff your x-mas wishlist against your "actual-gifts-received list" _25% faster_ (and be happy, if no diff is reported)! 🎅 🚀

    ⚠️ The new version has an MSRV of 1.88 (in order to use `extract_if`)!

    See the complete changelog for details:
    gitlab.com/janriemer/csv-diff/

    Happy X-Mas y'all! 🎄 🎁

    #Rust #RustLang #CSV #Crate #Release

  2. #Fuzzing along in #CSVDiff :awesome:

    In the second screenshot I've highlighted some interesting parts:

    Key field indices are 2 and 3, so when diffing the records, where key fields are highlighted, they'll be compared as `Modify`, because:
    - key fields are equal between left and right record
    - other fields are unequal between left and right record

    The other two records on the right have no corresponding left record - so those are `Add`ed records

    #Rust #FuzzTesting #RustLang #PropertyTesting

  3. #Fuzzing along in #CSVDiff :awesome:

    In the second screenshot I've highlighted some interesting parts:

    Key field indices are 2 and 3, so when diffing the records, where key fields are highlighted, they'll be compared as `Modify`, because:
    - key fields are equal between left and right record
    - other fields are unequal between left and right record

    The other two records on the right have no corresponding left record - so those are `Add`ed records

    #Rust #FuzzTesting #RustLang #PropertyTesting

  4. #Fuzzing along in #CSVDiff :awesome:

    In the second screenshot I've highlighted some interesting parts:

    Key field indices are 2 and 3, so when diffing the records, where key fields are highlighted, they'll be compared as `Modify`, because:
    - key fields are equal between left and right record
    - other fields are unequal between left and right record

    The other two records on the right have no corresponding left record - so those are `Add`ed records

    #Rust #FuzzTesting #RustLang #PropertyTesting

  5. #Fuzzing along in #CSVDiff :awesome:

    In the second screenshot I've highlighted some interesting parts:

    Key field indices are 2 and 3, so when diffing the records, where key fields are highlighted, they'll be compared as `Modify`, because:
    - key fields are equal between left and right record
    - other fields are unequal between left and right record

    The other two records on the right have no corresponding left record - so those are `Add`ed records

    #Rust #FuzzTesting #RustLang #PropertyTesting

  6. #Fuzzing along in #CSVDiff :awesome:

    In the second screenshot I've highlighted some interesting parts:

    Key field indices are 2 and 3, so when diffing the records, where key fields are highlighted, they'll be compared as `Modify`, because:
    - key fields are equal between left and right record
    - other fields are unequal between left and right record

    The other two records on the right have no corresponding left record - so those are `Add`ed records

    #Rust #FuzzTesting #RustLang #PropertyTesting

  7. Just published a new version of csv-diff (v0.1.1) 🚀

    lib.rs/crates/csv-diff

    This fixes a nasty bug regarding sort order of modified csv records. 😖

    Details in the MR/PR:
    gitlab.com/janriemer/csv-diff/

    Also, two new incoming PRs for #qsv, the #CSV toolkit:

    The first updates to the latest csv-diff, fixing aforementioned bug:
    github.com/dathere/qsv/pull/24

    The second fixes a bug regarding conversion from column names to indices:
    github.com/dathere/qsv/pull/24

    #Rust #RustLang #OpenSource #CSVDiff

  8. Ouch, there is another bug and this time it is actually _in #CSVDiff itself_!

    It happens with sorting the results of modified rows (urgh, I'm also not happy with the sorting code).😨

    Thankfully, datatraveller1 already has found a reproducible example - thank you so much! ❤️

    Bug:
    github.com/dathere/qsv/issues/

    I think I already found a solution, but needs rigorous testing first!

    Potential solution:
    github.com/dathere/qsv/issues/

    #qsv #Bug #csv

  9. Nice, I think I found the bug! 🐛

    See all the explanation and possible solution here:

    => github.com/dathere/qsv/issues/

    Workaround is also present and explained, so should be no blocker for people.

    Will prob provide a fix on the weekend. 🤞

    #CSVDiff #qsv #Bug #Fix #Bugfix

  10. Uh ohhhh, someone reported a bug in qsv's `diff` command.😮 🙈

    github.com/dathere/qsv/issues/

    Hopefully, we can resolve this soon! 🤞🥺

    I have a strong suspicion, but let's see... I need more info first from the OP.

    #Bug #Issue #CSVDiff #Diff #CLI #qsv

  11. #CsvDiff has finally reached v0.1.0, it's first ever non-alpha/-beta release! 🎉

    New features like getting at the headers from the diffresult have been needed for the following PR in qsv (which is in final review):
    github.com/jqnatividad/qsv/pul

    When merged, you'll be able to decide, whether the diffresult should output headers or not (see examples in the PR). :awesome:

    Check out csv-diff's Changelog for the full details:
    gitlab.com/janriemer/csv-diff/

    #CSV #qsv #CLI #DataScience #DataEngineering

  12. I can't reproduce the bug. ¯\_(ツ)_/¯

    Neither in #CsvDiff ...
    gitlab.com/janriemer/csv-diff/

    ...nor in #qsv
    github.com/jqnatividad/qsv/pul

    My assumption is that they have forgotten to specify the option --right-delimiter (or --left-delimiter, respectively), when executing `qsv diff`:
    github.com/jqnatividad/qsv/iss

    Anyway, we now have additional tests in csv-diff and qsv, so definitely a win, regardless of the outcome! 🎉

    @floriann FYI

    #Rust #RustLang #OpenSource #NotReproducible

  13. A new version of csv-diff is out (v0.1.0-beta.2) 🎉

    lib.rs/crates/csv-diff

    This version adds a method, which allows you to sort your diff result by columns (it was already possible to sort by lines).

    See the changelog for an example:
    gitlab.com/janriemer/csv-diff/

    Sorting by columns will soon be integrated into qsv, the #CSV toolkit:
    github.com/jqnatividad/qsv/iss

    Thank you @jqnatividad for the idea of this feature! 💚

    #Rust #RustLang #CSVDiff #DataScience #qsv #CLI