home.social

#pyarrow — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #pyarrow, aggregated by home.social.

  1. Munquet 0.2.1 just landed on Flathub 🚀

    Fixed a small race condition when canceling a conversion — turns out the process could finish right before you clicked “Yes” 😅

    Two lines later… all good.

    flathub.org/en/apps/io.gitlab.

    #Flatpak #GTK4 #OpenSource #Parquet #DataScience #Linux #Python #PyArrow

  2. Munquet 0.2.1 just landed on Flathub 🚀

    Fixed a small race condition when canceling a conversion — turns out the process could finish right before you clicked “Yes” 😅

    Two lines later… all good.

    flathub.org/en/apps/io.gitlab.

    #Flatpak #GTK4 #OpenSource #Parquet #DataScience #Linux #Python #PyArrow

  3. Munquet 0.2.1 just landed on Flathub 🚀

    Fixed a small race condition when canceling a conversion — turns out the process could finish right before you clicked “Yes” 😅

    Two lines later… all good.

    flathub.org/en/apps/io.gitlab.

    #Flatpak #GTK4 #OpenSource #Parquet #DataScience #Linux #Python #PyArrow

  4. Munquet 0.2.1 just landed on Flathub 🚀

    Fixed a small race condition when canceling a conversion — turns out the process could finish right before you clicked “Yes” 😅

    Two lines later… all good.

    flathub.org/en/apps/io.gitlab.

    #Flatpak #GTK4 #OpenSource #Parquet #DataScience #Linux #Python #PyArrow

  5. Munquet is now officially on Flathub 🎉

    A native Linux app to convert datasets into Apache Parquet using PyArrow backend. Perfect for data science workflows, analytics, and anyone needing fast local conversions.

    Get it here: flathub.org/en/apps/io.gitlab.

    @gnome @xfce @kde @GTK @linux @flathub

    #apache #pyarrow #datascience #parquet #csv #OpenSource #Python #GNOME #GTK4 #Adwaita

  6. Munquet is now officially on Flathub 🎉

    A native Linux app to convert datasets into Apache Parquet using PyArrow backend. Perfect for data science workflows, analytics, and anyone needing fast local conversions.

    Get it here: flathub.org/en/apps/io.gitlab.

    @gnome @xfce @kde @GTK @linux @flathub

    #apache #pyarrow #datascience #parquet #csv #OpenSource #Python #GNOME #GTK4 #Adwaita

  7. Munquet is now officially on Flathub 🎉

    A native Linux app to convert datasets into Apache Parquet using PyArrow backend. Perfect for data science workflows, analytics, and anyone needing fast local conversions.

    Get it here: flathub.org/en/apps/io.gitlab.

    @gnome @xfce @kde @GTK @linux @flathub

    #apache #pyarrow #datascience #parquet #csv #OpenSource #Python #GNOME #GTK4 #Adwaita

  8. Munquet is now officially on Flathub 🎉

    A native Linux app to convert datasets into Apache Parquet using PyArrow backend. Perfect for data science workflows, analytics, and anyone needing fast local conversions.

    Get it here: flathub.org/en/apps/io.gitlab.

    @gnome @xfce @kde @GTK @linux @flathub

    #apache #pyarrow #datascience #parquet #csv #OpenSource #Python #GNOME #GTK4 #Adwaita

  9. 🚀 Munquet — Convert, merge, rename & validate tabular data into Parquet, fully offline & batch-ready.

    GitLab: gitlab.com/zulfian1732/munquet

    Featured in: @severo 's Awesome Parquet: github.com/severo/awesome-parq 🙏

    #Parquet #OpenSource #Python #GNOME #GTK4 #Adwaita #PyArrow

  10. 🚀 Sneak peek Munquet!
    Convert, merge, rename, and validate tabular data safely into Parquet. Works offline, with batch processing and progress feedback.

    GitLab repo:

    gitlab.com/zulfian1732/munquet

    Flathub release coming soon!

    #Python #GTK4 #GNOME #PyArrow #Parquet #DataScience #Libadwaita

  11. Released scrapy-contrib-bigexporter 1.0.0 (codeberg.org/ZuInnoTe/scrapy-c) - additional export formats for the webscraping framework Scrapy.

    Migrated parquet export from fastparquet to pyarrow as fastparquet is deprecated (docs.dask.org/en/stable/change)

    Migrated orc export from pyorc to pyarrow to reduce the number of dependencies

    #scrapy #crawling #python #parquet #orc #pyarrow #webcrawling #scraping

  12. Easily obtain OSM and OMF data: #Python and CLI tools #QuackOSM and #OvertureMaestro offer easier access to data from #OpenStreetMap (#OSM) and the Overture Maps Foundation (#OMF) through #PyArrow, #GeoParquet, or #DuckDB. These tools can simplify large-scale geospatial data...
    spatialists.ch/posts/2025/05/2 #GIS #GISchat #geospatial #SwissGIS

  13. Easily obtain OSM and OMF data: #Python and CLI tools #QuackOSM and #OvertureMaestro offer easier access to data from #OpenStreetMap (#OSM) and the Overture Maps Foundation (#OMF) through #PyArrow, #GeoParquet, or #DuckDB. These tools can simplify large-scale geospatial data...
    spatialists.ch/posts/2025/05-2 #GIS #GISchat #geospatial #SwissGIS

  14. If the purpose of a library is to "process and transport large data sets" but the code base contains an error message like "array cannot contain more than 2147483646 bytes" then there must be a big misunderstanding somewhere. #pyarrow

  15. Ironia losu: kiedy właśnie dodałeś blokera na #PyArrow w ebuildzie #Gentoo dla #pandas, bo powoduje, że #CPython się wykrzacza, a potem czytasz, że zależność od PyArrow będzie obowiązkowa w przyszłości. Wzdych.

    github.com/pandas-dev/pandas/i

  16. Irony: when you've just added a blocker on #PyArrow in the #Gentoo ebuild for #pandas because it causes #CPython to crash, and then read that pandas are planning on making PyArrow obligatory. Sigh.

    github.com/pandas-dev/pandas/i

  17. Good news - Python's CSV reader supports unicode characters like 🤘 as CSV field delimiters.

    Bad news is that #PyArrow doesn't support it yet :(

    Make PyArrow great again!

    #Python #developer #unicode #CSV #bigdata

  18. Hey #dataNerds 🤓, good news:

    #DuckDB v0.6.0 brings reading #CSV data on par with #PyArrow & #Polars and loads 1.66 GB of #ChicagoCrimes data in 1.9s with 12 cores/24 threads when experimental parallel CSV reader & unordered insertion are enabled.

    🧐 github.com/RandomFractals/chic

    #dataTools 🔬 ...

  19. Hey #dataNerds 🤓, good news:

    #DuckDB v0.6.0 brings reading #CSV data on par with #PyArrow & #Polars and loads 1.66 GB of #ChicagoCrimes data in 1.9s with 12 cores/24 threads when experimental parallel CSV reader & unordered insertion are enabled.

    🧐 github.com/RandomFractals/chic

    #dataTools 🔬 ...