home.social

Search

5 results for “coderdotcam”

  1. I've been wanting to try out for awhile now, and with some of the tinkering I've done at work, I finally had an excellent use case for it.
    Its an opinionated implementation of splitting documents as well as some post processors. For cleaning and splitting, I've clocked it at between 40 and 75x faster than the python implementation, and on my machine it can clean and split 25,000 documents in a second.

    Check it out at github.com/cam-barts/rs_docume

  2. You can do everything correctly and still not be successful