home.social

Search

698 results for “pydata_helsinki”

  1. The #QuartoLive extension lets you plug interactive (editable) #RStats code directly into your Quarto documents. And you can easily build exercises into your docs, too! #PositConf2024 #RStats #PyData 🧵 12/17

  2. The #QuartoLive extension lets you plug interactive (editable) #RStats code directly into your Quarto documents. And you can easily build exercises into your docs, too! #PositConf2024 #RStats #PyData 🧵 12/17

  3. The TargetEncoder PR has been merged into the scikit-learn main branch!

    github.com/scikit-learn/scikit

    It's a very efficient way to deal with high cardinality categorical variables for supervised machine learning tasks. See the following quick tutorial to compare its performance with one-hot encoding, ordinal encoding and native support of categorical variables in Gradient Boosted Trees:

    scikit-learn.org/dev/auto_exam

    It will be part of scikit-learn 1.3.

    #sklearn #PyData #SciPy #MachineLearnig #Python

  4. Noticias sobre Python y Datos de la semana, episodio 72 🐍⚙️🐼

    En resumen: ¡pandas 2.0! Versiones nuevas de Polars y Great Expectations, anotando cantos de pájaros, y despidiéndome para unas breves vacaciones

    astrojuanlu.substack.com/p/epi

    Apoya el noticiero suscribiéndote por correo 📬

    ¡Y sigue a @pandas_dev!

    #pandas #polars #python #pydata #pycamp #noticieropythonydatos

  5. At PyData NYC tutorial, by Jacob Tomlinson, I learned that now it is possible to access the same array on the GPU from pytorch and cupy. I'm loving how it will let you use the strengths of different libraries without dealing with extra memory copies.

    nyc2024.pydata.org/cfp/talk/VA
    #PyDataNYC2024
    #pytorch
    #cupy

  6. @melissawm @jni @simon_brooke @hynek @napari

    I build them for #PyQtGraph. You use the html builder with a little bit of (emphasis on little) custom css and the sphinx pydata theme looks amazing as a docset. I also disabled sidebars which makes for better viewing in dash.

    The longest part was going through all the docs to identify areas that were problematic. I would occasionally identify oddities.

  7. Many of the keynote and session videos for PyData Global 2022 went up online today, and here's my talk:
    youtu.be/IKFGFFtxgow?t=5463

    #graphdatascience

  8. I'll present at PyData Global, Thu Dec 01 13:30 US Pacific:
    "Data Prep for Graphs"
    global2022.pydata.org/cfp/talk

    TL;DR: data prep phase in #graphdatascience work involves tools/techniques vastly different than data science in general. This stage of work is computationally expensive, and ironically much must be performed *prior* to loading into a graph DB.

    Here's a sampler.

    Also, we'll cover the github.com/DerwenAI/pynock proposal for Parquet serialization of graph data.

    #graphthinking

  9. OSSci will be in beautiful Prague this Thursday, May 16. Thanks to the PyData Prague team for putting together a great agenda. 50+ people registered. Please share with your networks.Thanks!

    medium.com/open-source-science

  10. Can you name that algorithm based on this dataflow representation?

    It's Linear Discriminant Analysis as implemented by Scikit Learn!

    I finished up a notebook showing how you can build an Array API compatible library with the egglog e-graph library in Python and use that to optimize a #scikit-learn algorithm with #numba

    egg-smol-python.readthedocs.io

    For more context, I gave a talk on the broader goals this summer:

    egg-smol-python.readthedocs.io

    youtube.com/watch?v=Pbi2uV9vWP

    #pydata #egraph #python

  11. @alesegura @mdwaldman22

    In an open source project called `kglab` (since 2020) we've worked to build integration paths between these different camps, making them more compatible with PyData approaches, and providing tutorials with examples.
    github.com/DerwenAI/kglab
    derwen.ai/docs/kgl/tutorial/

    #graphthinking #graphdatascience

  12. Video is now available from our talk at Ray Summit 2022 "Graphs at scale with Ray, for AI in Manufacturing"
    anyscale.com/ray-summit-2022/a

    Lots of details discussed!

    (free, requires registration details)

    #graphthinking #graphdatascience #ai #manufacturing #ray #pydata

  13. Really excited to attend #JupyterCon in Paris next month!

    @vincent_m and I will give a full day tutorial on predictive Survival Analysis and Competing Risks modeling with a Gradient Boosting model assembled from generic scikit-learn building blocks. We will also introduce many concepts and model evaluation methodology using specialized libraries such as lifelines and scikit-survival.

    Here is the full agenda for this session:

    cfp.jupytercon.com/2023/talk/A

    #PyData #SurvivalAnalysis #sklearn

  14. 🧠 Masterclass spotlight: Decoupled Data (April 17)

    Build a production-grade Python API with clean, reliable database connections.
    In this full-day, hands-on masterclass, Dr. Kristian Rother explores the Repository Pattern and compares SQL, ORM, and NoSQL approaches in real systems.

    🎟️ Space is limited.
    👉 2026.pycon.de/masterclasses/de

  15. 🧠 Masterclasses are published!

    Our new Masterclass Day (April 17) features hands-on, group sessions — from AI and testing to data, security, and Python internals.

    More masterclasses coming soon 👀
    👉 2026.pycon.de/masterclasses/

  16. I finally got plone-sphinx-theme to build using Sphinx Theme Builder, while it inherits Sphinx Book Theme, which in turn inherits from PyData Sphinx Theme. Next steps include tidy it up, write some docs, and package it for its first release. Then all docs will have a single theme for all its projects. @plone
    Thanks to @choldgraf, @pydata, and @pradyunsg for being the giants upon whose shoulders I can stand.

  17. Asked to help debug an online maths question about #BoxPlots, and learnt there are conflicting interpretations of the 1st and 3rd #quartiles which matter with small datasets like teaching examples!

    en.wikipedia.org/wiki/Quartile

    It is also not easy to see from the documentation which any plotting library actually uses, e.g. seaborn seaborn.pydata.org/generated/s

  18. Really looking forward to PyData Global 2024 (online) !!

    I'll be presenting
    "Catching Bad Guys using open data and open models for graphs"
    Thu Dec 5, 14:30-15:00 BST
    global2024.pydata.org/cfp/talk

    #PyDataGlobal #Senzing #ERKG #knowledgegraphs #AI #darkmoney #AML #entityresolution #opendata

  19. @py data science/engineering recently had #NormConf with the same spirit, discussing the everyday, boring, normal work that you need to do for successful data projects etc, maybe you could do a NormConfMobile ?

  20. is hosting a live Q&A tomorrow with the @pycon and @ThePSF team about how the community can get involved with in Pittsburgh!

    What questions should we ask? Let us know in the thread!

    meetup.com/pydata-pittsburgh/e

  21. Ya está abierto el registro para nuestra reunión de octubre: "🗄️ SQL generado con lenguaje natural y MLFlow para productivización de modelos" este mes en las oficinas de Cepsa

    meetup.com/pydata-madrid/event

    ¡Nos vemos el jueves 19 a las 19:00! Y después, al bar a hacer networking 🍻

    #PyDataMadrid #PyData #Python #datascience #machinelearning #text2sql #llms #mlflow

  22. Ya está abierto el registro para nuestra reunión de octubre: "🗄️ SQL generado con lenguaje natural y MLFlow para productivización de modelos" este mes en las oficinas de Cepsa

    meetup.com/pydata-madrid/event

    ¡Nos vemos el jueves 19 a las 19:00! Y después, al bar a hacer networking 🍻

    #PyDataMadrid #PyData #Python #datascience #machinelearning #text2sql #llms #mlflow