home.social

#pydataamsterdam2023 — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #pydataamsterdam2023, aggregated by home.social.

  1. @pydataamsterdam #PyDataAmsterdam2023 coming to an end. I'm logging off from here already - had an *amazing* time, and will definitely try to attend again in coming years. See you soon #PyData family!

    #PyDataAmsterdam #python

  2. @pydataamsterdam @koaning This is weird but I'm crying. @koaning's keynote resonated with me so damn much for many reasons. I've often reflected about programming as a form of creative expression and how we coders have an underused superpower, and Vincent eloquently articulated so many good examples of this. Don't miss the recording when it's out.

    #PyDataAmsterdam2023 #PyDataAmsterdam #PyData #python

  3. @pydataamsterdam Thomas kind of dodged my question on the enforceability of OpenRAIL 😇 so happy that they exist anyway, it's a conversation we need to have.

    (RAIL = Responsible AI Licenses)

    #PyDataAmsterdam2023 #PyDataAmsterdam #PyData #python

  4. @pydataamsterdam

    Impressive results from Hugging Face: proper filtering of web data can match or exceed performance of commercial models trained on highly curated datasets.

    Dataset: huggingface.co/datasets/tiiuae
    Paper: doi.org/10.48550/arXiv.2306.01

    #PyDataAmsterdam2023 #PyDataAmsterdam #PyData #python

  5. @pydataamsterdam "Choice of training data is the most important [part] of an LLM!"

    Data quality improvements "can be equivalent to a 2x-3x increase in size"

    #PyDataAmsterdam2023 #PyDataAmsterdam #PyData #python

  6. @pydataamsterdam So excited to see the Thomas Wolf and more from the Hugging Face 🤗 giving a promising closing keynote! Just this Monday I was working with some colleagues on a HF + @kedro integration that hopefully will go open source soon.

    "2000+ models in the Hub are private"

    #PyDataAmsterdam2023 #PyDataAmsterdam #PyData #python

  7. @pydataamsterdam

    Spotting what I call the "Jim Downling classification of data pipelines" in this promising talk by Hopsworks

    #PyDataAmsterdam2023 #PyDataAmsterdam #PyData #python

  8. @pydataamsterdam

    Ana Chaloska “To One-Hot or Not: A guide to feature encoding and when to use what” is completely full!

    #PyDataAmsterdam2023 #PyDataAmsterdam #PyData #python