home.social

#duckdb — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #duckdb, aggregated by home.social.

  1. It's great to be able to find your own post to help you with a new setup! This time #DuckDB #Ruby gem installation on #Windows! You need to do a couple of things so it works.

    notepad.onghu.com/2025/duck-db

    #Programming #Database

  2. It's great to be able to find your own post to help you with a new setup! This time #DuckDB #Ruby gem installation on #Windows! You need to do a couple of things so it works.

    notepad.onghu.com/2025/duck-db

    #Programming #Database

  3. I built a thing: skaldmaps.com

    SkaldMaps is a research, analysis, and rating platform for real estate investments, including primary home buyers (or renters). It lets you analyze, filter, and rank all US ZIP codes by a model you define yourself.

    #RealEstate #GIS #DuckDB #Analytics

  4. PostgreSQL и аналитика: что меняется, когда хранилище становится общим

    HTAP — одна из главных тем в мире СУБД. Вокруг PostgreSQL массово появляются конструкции с внешними аналитическими движками со своими моделями хранения данных и ограничениями совместимости, однако бизнесу не совсем комфортно жить в архитектуре, где транзакционные данные находятся в одной системе, аналитика - в другой, а между ними - разного рода ETL, CDC и прочие parquet-файлы. В Tantor мы движемся по иному пути, развивая HTAP внутри PostgreSQL, а не рядом с ним. Вокруг этой идеи строятся СУБД Tantor Polar и машина баз данных Tantor XData Gen3, в которой OLTP и аналитика, не теряя совместимости с Postgres, работают поверх общего хранилища данных и общей видимости транзакций. В этой статье хочется поговорить не столько о самом термине HTAP, сколько о том, как меняется архитектура PostgreSQL, когда OLTP и аналитика начинают работать поверх общего хранилища данных.

    habr.com/ru/companies/tantor/a

    #tantor #tantor_postgres #xdata #tantor_xdata #oracle_exadata #duckdb #greenplum #kafka #clickhouse

  5. PostgreSQL и аналитика: что меняется, когда хранилище становится общим

    HTAP — одна из главных тем в мире СУБД. Вокруг PostgreSQL массово появляются конструкции с внешними аналитическими движками со своими моделями хранения данных и ограничениями совместимости, однако бизнесу не совсем комфортно жить в архитектуре, где транзакционные данные находятся в одной системе, аналитика - в другой, а между ними - разного рода ETL, CDC и прочие parquet-файлы. В Tantor мы движемся по иному пути, развивая HTAP внутри PostgreSQL, а не рядом с ним. Вокруг этой идеи строятся СУБД Tantor Polar и машина баз данных Tantor XData Gen3, в которой OLTP и аналитика, не теряя совместимости с Postgres, работают поверх общего хранилища данных и общей видимости транзакций. В этой статье хочется поговорить не столько о самом термине HTAP, сколько о том, как меняется архитектура PostgreSQL, когда OLTP и аналитика начинают работать поверх общего хранилища данных.

    habr.com/ru/companies/tantor/a

    #tantor #tantor_postgres #xdata #tantor_xdata #oracle_exadata #duckdb #greenplum #kafka #clickhouse

  6. PostgreSQL и аналитика: что меняется, когда хранилище становится общим

    HTAP — одна из главных тем в мире СУБД. Вокруг PostgreSQL массово появляются конструкции с внешними аналитическими движками со своими моделями хранения данных и ограничениями совместимости, однако бизнесу не совсем комфортно жить в архитектуре, где транзакционные данные находятся в одной системе, аналитика - в другой, а между ними - разного рода ETL, CDC и прочие parquet-файлы. В Tantor мы движемся по иному пути, развивая HTAP внутри PostgreSQL, а не рядом с ним. Вокруг этой идеи строятся СУБД Tantor Polar и машина баз данных Tantor XData Gen3, в которой OLTP и аналитика, не теряя совместимости с Postgres, работают поверх общего хранилища данных и общей видимости транзакций. В этой статье хочется поговорить не столько о самом термине HTAP, сколько о том, как меняется архитектура PostgreSQL, когда OLTP и аналитика начинают работать поверх общего хранилища данных.

    habr.com/ru/companies/tantor/a

    #tantor #tantor_postgres #xdata #tantor_xdata #oracle_exadata #duckdb #greenplum #kafka #clickhouse

  7. PostgreSQL и аналитика: что меняется, когда хранилище становится общим

    HTAP — одна из главных тем в мире СУБД. Вокруг PostgreSQL массово появляются конструкции с внешними аналитическими движками со своими моделями хранения данных и ограничениями совместимости, однако бизнесу не совсем комфортно жить в архитектуре, где транзакционные данные находятся в одной системе, аналитика - в другой, а между ними - разного рода ETL, CDC и прочие parquet-файлы. В Tantor мы движемся по иному пути, развивая HTAP внутри PostgreSQL, а не рядом с ним. Вокруг этой идеи строятся СУБД Tantor Polar и машина баз данных Tantor XData Gen3, в которой OLTP и аналитика, не теряя совместимости с Postgres, работают поверх общего хранилища данных и общей видимости транзакций. В этой статье хочется поговорить не столько о самом термине HTAP, сколько о том, как меняется архитектура PostgreSQL, когда OLTP и аналитика начинают работать поверх общего хранилища данных.

    habr.com/ru/companies/tantor/a

    #tantor #tantor_postgres #xdata #tantor_xdata #oracle_exadata #duckdb #greenplum #kafka #clickhouse

  8. Optimising DuckDB performance on large EC2 instances

    "I have been recommending for some time that DuckDB is often a faster and simpler choice than Spark for most data engineering workloads.

    (...)

    However, there are some performance footguns with this approach: the default configuration can result in surprisingly poor performance, and a few simple changes can make workloads run multiple times faster."

    #duckdb

    robinlinacre.com/optimising_du

  9. #DuckDB's Quack server thing has been out for like 30 seconds and daft folks are putting them on the internet already.

    h/t @silas

  10. I'm quite curious what are the plans for on 2.0. They have an local renaissance in their hands. 😎

    duckdb.org/2026/05/12/quack-re

  11. I'm quite curious what are the plans for #DuckDB on 2.0. They have an #OLAP local renaissance in their hands. 😎

    duckdb.org/2026/05/12/quack-re

  12. I'm quite curious what are the plans for #DuckDB on 2.0. They have an #OLAP local renaissance in their hands. 😎

    duckdb.org/2026/05/12/quack-re

  13. I'm quite curious what are the plans for #DuckDB on 2.0. They have an #OLAP local renaissance in their hands. 😎

    duckdb.org/2026/05/12/quack-re

  14. I'm quite curious what are the plans for #DuckDB on 2.0. They have an #OLAP local renaissance in their hands. 😎

    duckdb.org/2026/05/12/quack-re

  15. Missing RustWeek and want to hear me ramble about data processing in Rust? Head on over here ;) youtu.be/uvfiz7-blyU

  16. Some performance tuning (row batching size, schema sharing, streaming) get the ducknng arrow IPC to 30% slower than quack_query at 10 Million rows, our own adaptation of the DuckDB internal application/vnd.duckdb serializer is 20% faster than quack_query ! Even faster over tcp #nng #DuckDB #quack

    RE: https://bsky.app/profile/did:plc:hmk4f5yf3ohtsvw32sbtwvk3/post/3mlqpm2xpc22t

  17. has client-server now. game changer.

    duckdb.org/quack/

  18. I always tell people that #DuckDB is great for processing millions of rows in analytics, but that it's not for client - server workloads with many small transactions, where PostgreSQL would be a better fit. With quack I'll need to change that advice.

  19. Quack! 📢🦆

    A new extension has been announced for #DuckDB which turns any DuckDB instance into a client or a server. Optimisations have also been made to deliver great performance for simultaneous client connections, transactions and small inserts.

    quack.duckdb.org

  20. duckdb.org/quack turns DuckDB into a client-server database! The Quack protocol unlocks multi-client access for #DuckDB, turning it into a general-purpose database system.

  21. What's the current state of remote parquet-querying tools (duckdb, arrow, maybe polars) and WebR? That is, if I want a WebR/Shinylive app to query a remote parquet file and only transfer the query results? Or is the way to go, say, loading duckdb's WASM build and have a shiny call it via eval_js? Any examples?

    @hrbrmstr @jeroenooms @gws @kirill

    #RStats #WebR #DuckDB

  22. 🚨 Breaking news! 🚨 In a groundbreaking development nobody asked for, we now have "Quack," the #DuckDB client-server #protocol, because clearly what the world needs is more DB protocols named after waterfowl. 🦆🌊 Thank you, tech industry, for yet again overcomplicating the simple task of fetching data! 🎉🙄
    duckdb.org/2026/05/12/quack-re #QuackDB #DatabaseTech #Overcomplication #DataFetching #HackerNews #ngated

  23. 🚨 Breaking news! 🚨 In a groundbreaking development nobody asked for, we now have "Quack," the #DuckDB client-server #protocol, because clearly what the world needs is more DB protocols named after waterfowl. 🦆🌊 Thank you, tech industry, for yet again overcomplicating the simple task of fetching data! 🎉🙄
    duckdb.org/2026/05/12/quack-re #QuackDB #DatabaseTech #Overcomplication #DataFetching #HackerNews #ngated

  24. 🚨 Breaking news! 🚨 In a groundbreaking development nobody asked for, we now have "Quack," the #DuckDB client-server #protocol, because clearly what the world needs is more DB protocols named after waterfowl. 🦆🌊 Thank you, tech industry, for yet again overcomplicating the simple task of fetching data! 🎉🙄
    duckdb.org/2026/05/12/quack-re #QuackDB #DatabaseTech #Overcomplication #DataFetching #HackerNews #ngated

  25. 🚨 Breaking news! 🚨 In a groundbreaking development nobody asked for, we now have "Quack," the #DuckDB client-server #protocol, because clearly what the world needs is more DB protocols named after waterfowl. 🦆🌊 Thank you, tech industry, for yet again overcomplicating the simple task of fetching data! 🎉🙄
    duckdb.org/2026/05/12/quack-re #QuackDB #DatabaseTech #Overcomplication #DataFetching #HackerNews #ngated

  26. 🚨 Breaking news! 🚨 In a groundbreaking development nobody asked for, we now have "Quack," the #DuckDB client-server #protocol, because clearly what the world needs is more DB protocols named after waterfowl. 🦆🌊 Thank you, tech industry, for yet again overcomplicating the simple task of fetching data! 🎉🙄
    duckdb.org/2026/05/12/quack-re #QuackDB #DatabaseTech #Overcomplication #DataFetching #HackerNews #ngated

  27. Joining 130 million points: Dewey Dunnington revisits his 2024 benchmark of #spatialjoin operations on 130 million points, now testing #SedonaDB and #DuckDB #Spatial in comparison to the original tools. The results are remarkable: 6 seconds on a laptop replacing 3-5 minutes...
    spatialists.ch/posts/2026/05/0 #GIS #GISchat #geospatial #SwissGIS

  28. Joining 130 million points: Dewey Dunnington revisits his 2024 benchmark of #spatialjoin operations on 130 million points, now testing #SedonaDB and #DuckDB #Spatial in comparison to the original tools. The results are remarkable: 6 seconds on a laptop replacing 3-5 minutes...
    spatialists.ch/posts/2026/05/0 #GIS #GISchat #geospatial #SwissGIS

  29. Joining 130 million points: Dewey Dunnington revisits his 2024 benchmark of #spatialjoin operations on 130 million points, now testing #SedonaDB and #DuckDB #Spatial in comparison to the original tools. The results are remarkable: 6 seconds on a laptop replacing 3-5 minutes...
    spatialists.ch/posts/2026/05/0 #GIS #GISchat #geospatial #SwissGIS

  30. Joining 130 million points: Dewey Dunnington revisits his 2024 benchmark of #spatialjoin operations on 130 million points, now testing #SedonaDB and #DuckDB #Spatial in comparison to the original tools. The results are remarkable: 6 seconds on a laptop replacing 3-5 minutes...
    spatialists.ch/posts/2026/05/0 #GIS #GISchat #geospatial #SwissGIS

  31. Joining 130 million points: Dewey Dunnington revisits his 2024 benchmark of #spatialjoin operations on 130 million points, now testing #SedonaDB and #DuckDB #Spatial in comparison to the original tools. The results are remarkable: 6 seconds on a laptop replacing 3-5 minutes...
    spatialists.ch/posts/2026/05/0 #GIS #GISchat #geospatial #SwissGIS

  32. DuckDB Labs released #DuckLake 1.0 - a data lake format that stores table metadata in a SQL database, rather than spreading it across object storage files.

    Key features:
    • catalog-stored small updates
    • improved sorting and partitioning
    • compatibility with Iceberg-style data features

    Learn more ⇨ bit.ly/48PsPIS

    #InfoQ #DuckDB #ApacheIceberg #AI #DataLake #DataStorage

  33. DuckDB Labs released #DuckLake 1.0 - a data lake format that stores table metadata in a SQL database, rather than spreading it across object storage files.

    Key features:
    • catalog-stored small updates
    • improved sorting and partitioning
    • compatibility with Iceberg-style data features

    Learn more ⇨ bit.ly/48PsPIS

    #InfoQ #DuckDB #ApacheIceberg #AI #DataLake #DataStorage

  34. DuckDB Labs released #DuckLake 1.0 - a data lake format that stores table metadata in a SQL database, rather than spreading it across object storage files.

    Key features:
    • catalog-stored small updates
    • improved sorting and partitioning
    • compatibility with Iceberg-style data features

    Learn more ⇨ bit.ly/48PsPIS

    #InfoQ #DuckDB #ApacheIceberg #AI #DataLake #DataStorage

  35. DuckDB Labs released 1.0 - a data lake format that stores table metadata in a SQL database, rather than spreading it across object storage files.

    Key features:
    • catalog-stored small updates
    • improved sorting and partitioning
    • compatibility with Iceberg-style data features

    Learn more ⇨ bit.ly/48PsPIS

  36. DuckDB Labs released #DuckLake 1.0 - a data lake format that stores table metadata in a SQL database, rather than spreading it across object storage files.

    Key features:
    • catalog-stored small updates
    • improved sorting and partitioning
    • compatibility with Iceberg-style data features

    Learn more ⇨ bit.ly/48PsPIS

    #InfoQ #DuckDB #ApacheIceberg #AI #DataLake #DataStorage

  37. Off the back of yesterday and looking at #DuckDB and the UI extension, I did a quite write-up.

    Other than the issue I had around doing a lot of schema changes, it's a really nice UI which doesn't make you feel like you've ended up with the "entry level" interface.

    dazfuller.uk/posts/2026-05-04-

  38. Also been playing with #DuckDB UI this afternoon. It’s a great place to start for exploration, and if you don’t have any UI tools available to you.

    Did find out that if you’re making a lot of schema changes then it gets itself into a mess and won’t open again. So had to drop and recreate the db file a few times.

    Probably do a write up on it tomorrow. Along with a Microsoft Fabric RTI demo for later in the week

  39. Over the last couple of months I had to do a lot of number crunching in order to write some #designdocs. While writing such docs is always useful, I recently added a lot to the fun-dimension with #Jupyter and #DuckDB 😅 This might be useful to others, so here's a quick summary of my setup with a simple example: zerokspot.com/weblog/2026/05/0 #blogged

  40. Oh look, another "groundbreaking" #blog post about #DuckDB from a self-proclaimed data wizard. 🙄 Apparently, the limitations of basic text queries are just too much for our hero, who bravely delves into the wild world of Full-Text Search. 🌟 Spoiler alert: it's as thrilling as watching paint dry on a data frame. 🥱
    peterdohertys.website/blog-pos #DataWizard #FullTextSearch #DataFrame #HackerNews #ngated

  41. Oh look, another "groundbreaking" #blog post about #DuckDB from a self-proclaimed data wizard. 🙄 Apparently, the limitations of basic text queries are just too much for our hero, who bravely delves into the wild world of Full-Text Search. 🌟 Spoiler alert: it's as thrilling as watching paint dry on a data frame. 🥱
    peterdohertys.website/blog-pos #DataWizard #FullTextSearch #DataFrame #HackerNews #ngated

  42. Oh look, another "groundbreaking" #blog post about #DuckDB from a self-proclaimed data wizard. 🙄 Apparently, the limitations of basic text queries are just too much for our hero, who bravely delves into the wild world of Full-Text Search. 🌟 Spoiler alert: it's as thrilling as watching paint dry on a data frame. 🥱
    peterdohertys.website/blog-pos #DataWizard #FullTextSearch #DataFrame #HackerNews #ngated