#parquet — Public Fediverse posts
Live and recent posts from across the Fediverse tagged #parquet, aggregated by home.social.
-
https://www.europesays.com/fr/931933/ EgyptAir crash: the Paris public prosecutor's office requests that the case be dismissed ten years after the tragedy #2026 #Actualités #Crash #dix #EgyptAir #EU #europe #FaitsDiversJustice #FR #France #News #NonLieu #Paris #parquet #RépubliqueFrançaise #requiert
-
When does #Iceberg beat #Parquet+projection on #AWSGlue, and when doesn't it?
An end-to-end #ETL PoC on #AWS to find out: producer, #Kinesis, two #Firehose paths, two #Glue jobs, #Athena.
🔮 Spoiler: how the data is read is the key to the choice.
In the article: every choice with its why, plus a few gems from some Glue experience 😄
-
Saint-Étienne: a man dies after being beaten in the middle of the street
A 38-year-old man died during the night of Wednesday 22 to Thursday 23 April after…
#SaintEtienne #FR #France #Actu #News #Europe #EU #Saint-Étienne #actu #Actualités #Auvergne-Rhône-Alpes #BAC #beaubrun #décès #europe #Loire #lynchage #parquet #Police #Pompiers #Républiquefrançaise #Rixe #saint-étienne
https://www.europesays.com/fr/888891/
-
https://www.europesays.com/fr/888891/ Saint-Étienne: a man dies after being beaten in the middle of the street #actu #Actualités #AuvergneRhôneAlpes #BAC #beaubrun #décès #EU #europe #FR #France #Loire #lynchage #News #parquet #Police #Pompiers #RépubliqueFrançaise #Rixe #Saintétienne #SaintÉtienne
-
Guide: How to work with the PARQUET format
Last year we started publishing data in the «Если быть точным» catalog in Parquet format. It was created by engineers at Twitter and Cloudera in 2013, and today it has become the standard for storing analytical data: it is used by Google, Amazon, Netflix, and most modern data platforms. In this guide we explain how to work efficiently with Parquet data using Python.
-
🐒 Ah, yes, the holy grail of nerd bragging rights: a 47M+ item #archive of Hacker News, now in the culinary delight format of #Parquet for all your "data chef" needs. 🍽️ Updated every 5 minutes, because clearly, what's more riveting than a play-by-play of techie's daily musings? Oh wait, I forgot—🥱 anything else.
https://huggingface.co/datasets/open-index/hacker-news #HackerNews #DataChef #TechieBraggingRights #DailyUpdates #HackerNews #ngated
-
Hacker News archive (47M+ items, 11.6GB) as Parquet, updated every 5m
https://huggingface.co/datasets/open-index/hacker-news
#HackerNews #HackerNews #Archive #Parquet #Data #47MItems #UpdatedEvery5m
-
They ran a drug trafficking operation from prison: a network dismantled between the Loire and Puy-de-Dôme
Fourteen suspected traffickers were taken into custody following their arrest between this Tuesday 3 …
#SaintEtienne #FR #France #Actu #News #Europe #EU #Saint-Étienne #actu #Actualités #Auvergne-Rhône-Alpes #détention #Drogue #europe #LaTalaudière #maisond'arrêt #parquet #Prison #puy-de-dôme #Républiquefrançaise #Roanne #stupéfiant #trafic
https://www.europesays.com/fr/780817/
-
https://www.europesays.com/fr/780817/ They ran a drug trafficking operation from prison: a network dismantled between the Loire and Puy-de-Dôme #actu #Actualités #AuvergneRhôneAlpes #détention #Drogue #EU #europe #FR #France #LaTalaudière #MaisonD'arrêt #News #parquet #Prison #PuyDeDôme #RépubliqueFrançaise #Roanne #SaintÉtienne #stupéfiant #trafic
-
They ran a drug trafficking operation from prison: a network dismantled between the Loire and Puy-de-Dôme
Fourteen suspected traffickers were taken into custody following their arrest between this Tuesday 3 …
#SaintEtienne #FR #France #Actu #News #Europe #EU #Saint-Étienne #actu #Actualités #Auvergne-Rhône-Alpes #détention #Drogue #europe #LaTalaudière #maisond'arrêt #parquet #Prison #puy-de-dôme #Républiquefrançaise #Roanne #stupéfiant #trafic
https://www.europesays.com/fr/777442/
-
https://www.europesays.com/fr/777442/ They ran a drug trafficking operation from prison: a network dismantled between the Loire and Puy-de-Dôme #actu #Actualités #AuvergneRhôneAlpes #détention #Drogue #EU #europe #FR #France #LaTalaudière #MaisonD'arrêt #News #parquet #Prison #PuyDeDôme #RépubliqueFrançaise #Roanne #SaintÉtienne #stupéfiant #trafic
-
I tried loading a #Wikidata dump into #Parquet and querying it with #DuckDB: it takes less than an hour to extract all 19,939,182 entities that represent people, including the subclasses of wdt:Q5.
Definitely better than my deserializer implemented in Go, which takes almost 8 hours to do the same thing.
-
Munquet 0.2.1 just landed on Flathub 🚀
Fixed a small race condition when canceling a conversion — turns out the process could finish right before you clicked “Yes” 😅
Two lines later… all good.
https://flathub.org/en/apps/io.gitlab.zulfian1732.munquet
#Flatpak #GTK4 #OpenSource #Parquet #DataScience #Linux #Python #PyArrow
-
Munquet 0.2.0 is now available on Flathub 🎉
✨ Display real host paths via XDG Portal
🛠 Introduced a .Devel Flatpak manifest for development builds
Continuing to improve the Linux desktop data workflow 🚀
https://flathub.org/en/apps/io.gitlab.zulfian1732.munquet
#Flathub #Flatpak #XDGPortal #GTK4 #OpenSource #Parquet #Python #DataScience
-
Munquet is now officially on Flathub 🎉
A native Linux app to convert datasets into Apache Parquet using PyArrow backend. Perfect for data science workflows, analytics, and anyone needing fast local conversions.
Get it here: https://flathub.org/en/apps/io.gitlab.zulfian1732.munquet
@gnome @xfce @kde @GTK @linux @flathub
#apache #pyarrow #datascience #parquet #csv #OpenSource #Python #GNOME #GTK4 #Adwaita
-
New entries in Awesome #Parquet
- Munquet: A desktop tool to convert CSV files to Parquet
- nail: A CLI tool for analyzing, transforming, and exploring data files
- odbc2parquet: query an ODBC data source and write the result to parquet.
- DataStudio (screenshot): a webapp to explore and visualize data, entirely in the browser.
- a new "Parquet engineering" section that groups best practices for writing Parquet files
-
Should the @geofabrik Download Server offer #GeoParquet files?
Please leave your feedback on this forum thread: https://community.openstreetmap.org/t/osm-data-in-geoparquet-format/141690
#OpenStreetMap #OSM #Geofabrik #Parquet
-
🚀 Munquet — Convert, merge, rename & validate tabular data into Parquet, fully offline & batch-ready.
GitLab: https://gitlab.com/zulfian1732/munquet
Featured in: @severo 's Awesome Parquet: https://github.com/severo/awesome-parquet 🙏
-
🚀 Sneak peek Munquet!
Convert, merge, rename, and validate tabular data safely into Parquet. Works offline, with batch processing and progress feedback.
GitLab repo:
https://gitlab.com/zulfian1732/munquet
Flathub release coming soon!
#Python #GTK4 #GNOME #PyArrow #Parquet #DataScience #Libadwaita
-
"A criminal investigation takes time." The Lyon public prosecutor, clearly under pressure, remains extremely cautious about the circumstances of the death of Quentin D., in contrast to the thunderous statements made all the way to the top of the State, notably by the Ministers of Justice and of the Interior.
#Politique #Justice #Proces #Parquet #Lyon #Quentin #Macron #LFI
-
New in Awesome Parquet: the best practices for writing Parquet files (Parquet engineering 🪛 ).
https://github.com/severo/awesome-parquet?tab=readme-ov-file#parquet-engineering
It might become the most useful section.
It's often hard to choose the best parameters: row group size, compression algorithm, whether to include statistics, whether to include indexes, whether to include bloom filters...
Please send me other references (or open a PR), I'm eager to read more about optimizing Parquet files for specific (or general) use.
-
So, we had to find a novel approach. It's the story I'm telling in this blog post. I hope you'll enjoy it.
-
#nextflow #parquet plugin version v0.2.2 is out!
This update introduces powerful new splitting capabilities, including the by, file, and count options, bringing it in line with standard Nextflow splitters such as splitFasta.
Specifically, the file option allows you to partition large datasets into smaller chunks, enabling seamless parallel processing.
Additionally, this version includes an experimental feature for reading files directly from S3.
Read more at https://nextflow-io.github.io/nf-parquet/
-
The Paris public prosecutor's office announces its departure from X (formerly Twitter) 💣 💥
#Politique #Twitter #X #ElonMusk #EtatsUnis #SocialMedia #Tech #Justice #Parquet
-
Yesterday I published #nextflow #parquet plugin version 0.2.2-edge2,
a big refactor of the plugin to align it with the other splitters.
Now you can chunk a Parquet file into smaller files using the `file` option, specify a batch size using the `by` option, and so on.
Happy to see this plugin gaining popularity.
-
https://www.europesays.com/it/323824/ Basketball, Serie A2: Avellino leaves the Brindisi court defeated #avellino #Basket #Basketball #Brindisi #esce #IT #Italia #Italy #parquet #sconfitto #serie #Sport #Sports
-
Wrote an article
A list of the types available in Parquet (PhysicalType, LogicalType, ConvertedType) #Parquet - Qiita https://qiita.com/kotet/items/ef0faaf8eb0162f3c574
-
📣 R Consortium webinar: Scaling up data analysis in R with Arrow
If “scaling R” has meant databases/clusters or rewriting everything, this session is for you. Dr Nic Crane (Arrow R maintainer; Apache Arrow PMC) will walk through practical, memory-efficient ways to work with larger datasets in R—plus why Parquet is a workflow upgrade and where DuckDB fits.
Register:
https://r-consortium.org/webinars/scaling-up-data-analysis-in-r-with-arrow.html
-
Using Apache Parquet? Found a TUI for you 👀📦
🔍 **parqeye** — A TUI for inspecting Parquet data, schemas and metadata.
💯 Browse tables, explore schemas, inspect row groups & view file stats.
🦀 Written in Rust & built with @ratatui_rs
⭐ GitHub: https://github.com/kaushiksrini/parqeye
#rustlang #tui #terminal #parquet #analytics #devtools #opensource
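For context on where that metadata lives: a Parquet file starts and ends with the 4-byte magic `PAR1`, and the 4 bytes before the trailing magic give the footer length as a little-endian integer. The sketch below shows that layout only; it is not how parqeye is implemented, and `footer_length` and the synthetic bytes are hypothetical stand-ins for a real file.

```python
import struct

MAGIC = b"PAR1"


def footer_length(data: bytes) -> int:
    """Return the footer metadata length recorded at the end of a Parquet file."""
    if len(data) < 12 or data[:4] != MAGIC or data[-4:] != MAGIC:
        raise ValueError("not a Parquet file (missing PAR1 magic)")
    # The 4 bytes before the trailing magic hold the footer length (little-endian).
    (length,) = struct.unpack("<I", data[-8:-4])
    return length


# Synthetic bytes that only mimic the outer layout; this is NOT a valid Parquet file.
fake_footer = b"\x00" * 10
fake_file = (
    MAGIC
    + b"column-chunks"
    + fake_footer
    + struct.pack("<I", len(fake_footer))
    + MAGIC
)
length = footer_length(fake_file)
```

A real inspector would then decode the footer, which is Thrift-encoded, to list schemas, row groups, and statistics.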
-
Hello #Proteomics !
Thinking about a better mzML to store proteomics data, but not convinced by the #parquet approach, I've converted it into #CBOR :
* Smaller data files (only 66% of the mzML original file) for the exact same data
* Faster to read (25s for a big mzML vs 18s in mzcbor on the same computer)
* Very quick random access to spectra (24.6577 ms for mzML vs 786.731 μs for mzcbor for the same operation using index)
I'd like to share it if you are interested at #eubIC #eubic2026
-
The reason I made a sample dataset was that I thought it was a bit sluggish querying the GeoPackage file from DuckDB. The query in the image took 2.56 s on the GeoPackage file. I now tried to save the entire dataset into a Parquet file (sorted on county and municipality) and compressed with ZSTD. The same query takes 0.0140s.
Also the Parquet file is 141 MiB compared to 1.18 GiB for the GeoPackage file. The Parquet file is smaller than the original zip file with the GeoPackage file.
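One likely contributor to the speedup (the post does not say so itself) is that sorted data lets a reader skip row groups whose min/max statistics exclude the filter value. The pure-Python sketch below only simulates that idea; the county names, group size, and helper functions are hypothetical and no Parquet file is involved.

```python
def make_row_groups(values, size):
    """Split a column into consecutive row groups of at most `size` rows."""
    return [values[i:i + size] for i in range(0, len(values), size)]


def groups_to_scan(groups, wanted):
    """Count row groups whose [min, max] range could contain `wanted`."""
    return sum(1 for g in groups if min(g) <= wanted <= max(g))


# Hypothetical county column with interleaved values, as in unsorted input.
counties = ["delta", "alpha", "charlie", "bravo"] * 3

unsorted_groups = make_row_groups(counties, 3)
sorted_groups = make_row_groups(sorted(counties), 3)

scanned_unsorted = groups_to_scan(unsorted_groups, "bravo")
scanned_sorted = groups_to_scan(sorted_groups, "bravo")
```

In this toy case every unsorted group has to be read, while the sorted layout narrows the scan to a single group.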
-
House with land - Guaira Paraguay 💓
#Itati is a beautiful spot with a bathing #lake and natural swimming #pool. The area around the #Piramides 🔺 Naturales is considered one of the most #beautiful areas of Paraguay.
open #living/dining area
#guest toilet/shower
#HWR
#parquet flooring
#Melgarejo 20 min
#Planta Urbana 30 min
199.000 €
Rooms: 5
Living space: 155m²
Plot: 2.000m²
#Guaira #Paraguay 🇵🇾
https://www.bluehomes.com/PPY0057/en/House-with-land/expose.html
-
We've created a way to display interactive maps in the browser, completely client-side! #gis #gischat
Drop your data in as #csv or #apache #parquet file, and your vector shapefile as a #geojson, and your map is ready to go!
It's hosted on #GitHub pages (so it's free!) but can be embedded anywhere
Tutorial:
https://odissei-soda.nl/tutorials/map-explorer/
Example:
https://sodascience.github.io/map-explorer/
(we tried out @penpot in the design process!)
-
How we built a 70 PB data store and don't plan to stop
Hi, today I'll describe how our team built a data processing and storage platform for training GenAI models at Sber, and how we grew to 70 PB of raw data. My name is Alexander, I work at Sber, and I spent two years developing this platform.
-
Your parquet floor is laid, but THE finishing touch is missing? SKIRTING BOARDS! 😱 Does cutting the corners terrify you?
Don't panic! We've created THE beginner's guide to pro-quality finishes.
#poserdesplinthes #bricolage #diy #renovation #travauxmaison #finition #parquet #astucebricolage #decorationinterieur #outillage #menuiserie
https://lemagdesastuces.fr/comment-poser-des-plinthes-guide-facile/
-
Laying the parquet was going fine... until you reached the first wall! 😱 Is cutting your nightmare?
We've created THE guide to turn this stressful step into a game of precision.
#parquet #bricolage #diy #renovation #travauxmaison #parquetflottant #sciesauteuse #astucebricolage #outillage #decorationinterieur #tutoriel
https://lemagdesastuces.fr/comment-decouper-un-parquet-flottant-proprement/
-
Parqeye – A CLI tool to visualize and inspect Parquet files
https://github.com/kaushiksrini/parqeye
#HackerNews #Parqeye #Parquet #CLI #DataVisualization #DataTools #OpenSource
-
Online GeoParquet Visualizer: For day 7 of the #30DayMapChallenge on the topic of #accessibility, @DomeGIS released the #GeoParquet Visualizer. The GeoParquet Visualizer is a free and open-source web tool built with #MapLibre and #parquet-#wasm that lets users view, style, and share GeoParquet and Parquet datasets directly in the browser. https://spatialists.ch/posts/2025/11/11-online-geoparquet-visualizer/ #GIS #GISchat #geospatial #SwissGIS
-
Query performance optimization: the powerful tandem of StarRocks and Apache Iceberg
Apache Iceberg is a table format for data lakes with support for ACID, Schema Evolution, Hidden Partition, and versioning, but with large metadata and when working over S3, query planning and latency suffer. Paired with StarRocks, we show how a distributed Job Plan, Manifest Cache, a CBO with histograms, Data Cache, and materialized views bring lakehouse analytics up to DWH level: they reduce metadata overhead and speed up planning and execution, while writing back to Iceberg preserves a single source of truth. We cover Iceberg's architecture, typical bottlenecks, and optimization practices on StarRocks 3.2–3.3, including the WeChat/Tencent case.
https://habr.com/ru/articles/963410/
#apache_iceberg #starrocks #lakehouse #data_analysis #data_lake #parquet #manifest #materialized_views
-
Released scrapy-contrib-bigexporter 1.0.0 (https://codeberg.org/ZuInnoTe/scrapy-contrib-bigexporters) - additional export formats for the webscraping framework Scrapy.
Migrated parquet export from fastparquet to pyarrow as fastparquet is deprecated (https://docs.dask.org/en/stable/changelog.html#fastparquet-engine-deprecated)
Migrated orc export from pyorc to pyarrow to reduce the number of dependencies
#scrapy #crawling #python #parquet #orc #pyarrow #webcrawling #scraping
-
https://www.europesays.com/it/185407/ “Music is a warm room with parquet flooring and a piano” #anima #calda #Entertainment #gavino #Intrattenimento #IT #Italia #Italy #lea #Music #Musica #parquet #pianoforte #stanza
-
GEOMETRY has been a #Parquet logical type for 6 months now. The data is encoded as WKB.
Hyparquet, a pure JavaScript Parquet library, now supports it as of version 1.19.0 by decoding geometry columns to GeoJSON geometries.
You can try the hyparquet demo:
or use the hyperparam CLI tool:
```
hyp https://raw.githubusercontent.com/apache/parquet-testing/master/data/geospatial/geospatial.parquet
```
(install locally with `npm i -g hyperparam`).
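To give an idea of what decoding WKB to GeoJSON involves, here is a minimal Python sketch that handles only a 2D Point. It is not hyparquet's code (hyparquet is JavaScript and covers more geometry types); the function name and the sample coordinates are hypothetical.

```python
import struct


def wkb_point_to_geojson(wkb: bytes) -> dict:
    """Decode a 2D WKB Point into a GeoJSON-style geometry dict."""
    # Byte 0 is the byte order flag: 1 = little-endian, 0 = big-endian.
    endian = "<" if wkb[0] == 1 else ">"
    # Bytes 1-4 hold the geometry type code; 1 means Point.
    (geom_type,) = struct.unpack(endian + "I", wkb[1:5])
    if geom_type != 1:
        raise ValueError("this sketch only handles 2D Point (type 1)")
    # Bytes 5-20 hold the X and Y coordinates as two float64 values.
    x, y = struct.unpack(endian + "dd", wkb[5:21])
    return {"type": "Point", "coordinates": [x, y]}


# Hypothetical sample: little-endian WKB for POINT (30 10).
sample = struct.pack("<BIdd", 1, 1, 30.0, 10.0)
geometry = wkb_point_to_geojson(sample)
```

Other geometry types follow the same pattern, with a count prefix ahead of the nested coordinates.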
-
🗺️ Parquet with GEOMETRY type is not GeoParquet.
In a new blog post, structured as an FAQ, I detail the differences between GeoParquet and the latest version of Parquet, which supports geospatial data via the GEOMETRY logical type.
TL;DR: the two standards are orthogonal, compatible, and can be combined, with the only caveat that the columns must be encoded as WKB.
👀 Ready for a deep dive?
➡️ ➡️ ➡️ https://rednegra.net/blog/20250925-parquet-with-geometry-type-is-not-geoparquet/
-
New nf-parquet version 0.2.1 deployed using new plugin repository
Interesting the new way to publish plugins, once I use it a little more I'll write a post about it