home.social

#vectordb — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #vectordb, aggregated by home.social.

  1. Have pushed the 0.9.5-dev branch of foxing to Codeberg ( codeberg.org/aenertia/foxing/s ) in preparation for release tagging. A LOT of features and a couple of bug fixes now that the packet/file processing engine has stabilized, including Semantic Routing to Parsers for Metadata Extraction and in-path Binary analysis using local ORT/BERT models, letting you get semantic search powers for free when you copy something with foxingd/fxcp #linux #filesystem #bert #vectordb #postgres #xfs #stratis #blake3 #localllm

  6. @OpenSearchProj was named a Leader and Fast Mover in the 2025 GigaOm Radar for Vector Databases 🏆

    My report highlights:
    ✅ Platform play
    ✅ Search variety
    ✅ Business criteria
    ✅ Security
    And I'd add - it's OPEN SOURCE @linuxfoundation !!
    opensearch.org/gigaom-radar-ve

  7. Stoked to see the OpenSearch Project featured by Jensen Huang in his keynote! 😍

    One of the innovations in V3 is GPU acceleration based on NVIDIA's cuVS. Our benchmarks, using the CAGRA algorithm integrated through Facebook's Faiss library, showed:
    ✅ 9.3x faster index builds
    ✅ 3.75x lower cost
    ✅ 2x higher throughput
    ✅ 2.5x lower CPU usage

    linkedin.com/feed/update/urn:l

  8. Chunking: an essential concept to understand for Retrieval-Augmented Generation (#RAG). It is the process of dividing large documents into smaller, manageable segments called “chunks.” Effective chunking preserves semantic meaning while ensuring content fits within model context limits.

    Proper chunking is essential, as it directly affects retrieval quality. Well-structured chunks improve precision and support more accurate responses.

    

    #OpenSource #devops #vectordb #programming #vector #search
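
A minimal sketch of the chunking idea described in the post above, assuming a simple word-count limit with overlapping windows. The function name `chunk_text` and the `max_words` and `overlap` parameters are illustrative assumptions, not taken from the post; real pipelines often split on sentences, headings, or tokens instead.

```python
def chunk_text(text: str, max_words: int = 50, overlap: int = 10) -> list[str]:
    """Split text into word-based chunks that share `overlap` words.

    Hypothetical helper for illustration: the overlap keeps some context
    from the end of one chunk at the start of the next.
    """
    if max_words <= 0:
        raise ValueError("max_words must be positive")
    if not 0 <= overlap < max_words:
        raise ValueError("overlap must satisfy 0 <= overlap < max_words")

    words = text.split()
    step = max_words - overlap  # how far the window advances each time
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + max_words]))
        # Stop once the current window has reached the end of the text.
        if start + max_words >= len(words):
            break
    return chunks
```

Smaller chunks tend to make retrieval more precise but carry less context each, so the size and overlap are usually tuned per corpus rather than fixed.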

  9. Can't wait to have a great ride at JavaLand! 🇩🇪

    No longer searching for words, but for meanings. 🔍

    If you're there and want to learn about search and about the @OpenSearchProject, check out my talk 🙂

    See you on 11th March at @JavaLandConf 🎡

    🔸Agenda: meine.doag.org/events/javaland

    🔸Tickets: javaland.eu/

  10. I joined the InstaBlinks podcast to talk about vector search, how it differs from lexical search, and how the @OpenSearchProject facilitates both in a hybrid model.
    Thanks NetApp Instaclustr for having me!
    youtube.com/watch?v=buKXHi6kFw
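
One generic way to combine a lexical ranking and a vector ranking into a single hybrid result list is reciprocal rank fusion. The sketch below is an illustration of that general technique only; it is not presented as how OpenSearch implements hybrid search, and the function name, the document IDs, and the constant `k=60` are assumptions.

```python
def reciprocal_rank_fusion(rankings: list[list[str]], k: int = 60) -> list[str]:
    """Fuse several ranked lists of document IDs into one ranking.

    Each document scores 1 / (k + rank) for every list it appears in,
    so documents ranked well by both retrievers rise to the top.
    """
    scores: dict[str, float] = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    # Highest fused score first; ties are broken by document ID.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical result lists from a keyword query and a vector query.
lexical_hits = ["d1", "d2", "d3"]
vector_hits = ["d3", "d1", "d4"]
fused = reciprocal_rank_fusion([lexical_hits, vector_hits])
```

Rank-based fusion sidesteps the problem that lexical scores and vector similarities live on different scales.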

  11. pgedge-vectorizer: #Postgres extension that automatically vectorizes document contents and keeps vector embeddings current when the underlying content changes.

    Unlike other solutions, no external services or third party pipelines are required. It's also 100% open source under the #PostgreSQL license. ✨

    Check it out on GitHub: 👉 github.com/pgEdge/pgedge-vecto

    #programming #vector #vectordatabase #vectorsearch #vectordb #ai #llm #aiengineering #aidev #dba

  12. NDC London is taking place in 2 weeks! 🇬🇧

    I'll give a talk about Vector Search Made Simple and how the OpenSearch Project can help you, à la open source!

    See you at the end of January in London 💂

    ndclondon.com/agenda/vector-se

    @OpenSearchProject

  13. I just built a vector database written in C++, with an API in Go that supports the basic operations. It currently uses brute-force search, which I plan to improve by switching to HNSW soon. Feedback, test runs, and messages about the repo are all welcome! #VectorDB #C++ #LậpTrìnhGo #PhátTriểnMở #VectorSearch #EarlyAdopters #VectorDatabase #HNSW #DevCommunity #NhàLậpTrình

    reddit.com/r/opensource/commen
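
For readers unfamiliar with the term, brute-force search compares the query against every stored vector. The sketch below illustrates that idea in plain Python under assumed names (`cosine_similarity`, `brute_force_search`) and toy data; it is not code from the linked project, which is written in C++ and Go.

```python
import math


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat zero vectors as dissimilar to everything
    return dot / (norm_a * norm_b)


def brute_force_search(query, vectors, k=3):
    """Return the k most similar (key, score) pairs by scanning every vector."""
    scored = [(cosine_similarity(query, vec), key) for key, vec in vectors.items()]
    scored.sort(reverse=True)  # highest similarity first
    return [(key, score) for score, key in scored[:k]]


# Toy two-dimensional "embeddings", made up for this example.
store = {"a": [1.0, 0.0], "b": [0.0, 1.0], "c": [1.0, 1.0]}
top = brute_force_search([1.0, 0.1], store, k=2)
```

The scan is exact but costs time linear in the number of stored vectors, which is why approximate indexes such as HNSW are attractive as a collection grows.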

  14. One month to NDC London! 🇬🇧
    I'll give a talk about Vector Search Made Simple and how the OpenSearch Project can help you, à la open source!
    See you in the last week of January in London 💂‍♂️
    ndclondon.com/agenda/vector-se

    @OpenSearchProject

  15. 🎯 My first Substack is live: "Understanding Vector Databases"

    Traditional databases match keywords.
    Vector databases understand meaning.
    Just published a beginner-friendly guide where I:

    - Explain vectors using simple analogies
    - Show how they power Netflix, Spotify, and ChatGPT
    - Build a semantic search app from scratch
    - Explore applications beyond ML

    The best part? We code together. No prerequisites except curiosity.

    What would you build if your database could understand context, not just keywords?

    Full article with code: open.substack.com/pub/devsimse
    #VectorDB #MachineLearning #Tutorial #Python #TechCommunity

  16. The new Wikidata Embedding Project launches in just over one week - on 1 October. Transforming structured knowledge into a vector format AI can understand, it will offer verifiable AI grounded in real data for global use

    #TrustworthyAI #VectorDB #DevCommunity

  17. When a support rep pasted the company policy into chat and still got a wrong answer, I stopped and ran tests. 🧩🤖 The result: Vectorize 2.0’s real-time sync, PDF extractor and retrieval traces change where errors start, and how fast you find them. Read a practical 72-hour POC, evaluation checks and the exact queries to run. #RAG #VectorDB #AIOps
    medium.com/@rogt.x1997/what-if

  18. 🎉 Huge thanks to the LanceDB CEO / cofounder Chang She for delivering an incredible talk on "Search, Retrieval, Training, and Analytics with Modern AI Data Lake" at #DataAndAIEngineering #SanFrancisco #meetup !

    📹 Great news - the recording is now available! Check it out if you missed it or want to revisit the key concepts. 👇

    https://watch.softinio.com/w/mVkLgtcQw8Qv5vA4v8SDHB

    #DataEngineering #AIEngineering #SanFrancisco #LanceDB #DataLake #MachineLearning #VectorDB #Database #AI #ArtificialIntelligence

  19. Vector search inside PostgreSQL: what pgvector can do and where it can come in handy

    So, your project has grown and you need new functionality, be it a recommendation engine, a knowledge base, or an automated first line of tech support. For all of this you can use vector and/or semantic search, and also integrate an LLM into the project. Congratulations: now you also need to store embedding vectors and search them for the nearest objects. There are two options: an external vector database, or integrating all of this into your existing stack. The second path is simpler at the start, a little faster, and usually cheaper, provided, of course, that you already use PostgreSQL. Hello, Habr! My name is Alexander Grishin, and I am responsible for the development of data storage products at Selectel:

    habr.com/ru/companies/selectel

    #selectel #postgresql #cloud #dbaas #embeddings #vector #vectordb #pgvector

  20. Devoxx Poland is just a couple of days away!
    Join my talk on Wednesday in the Data & AI track to learn about the project, and how it can provide you with search, analytics, observability and vector database capabilities, all @linuxfoundation
    👉 devoxx.pl/talk-details/?id=8605

  21. CockroachDB 25.2 tackles enterprise AI's biggest data challenges. Achieve 41% efficiency, a new distributed vector index for billions, and boosted security. Crucial for agentic AI workloads & operational big data at scale.
    #DistributedSQL #VectorDB #CockroachDB

  22. OpenSearch 3.0 is out! 🍾 🥳
    After 3 years of 2.x, it's time for the next leap, which brings major upgrades to performance, data management, functionality, and much more.
    📈 Upgrade to Apache Lucene 10 and Java 21+
    📈 Pull-based ingestion for streaming data, with support for Apache Kafka and Amazon Kinesis
    📈 Power agentic AI with native MCP support
    📈 Investigate logs with expanded PPL query tools, backed by Apache Calcite

    Check out @OpenSearchProject blog:
    opensearch.org/blog/unveiling-

  23. Guest @FranckPachot from #yugabyte joins our very own @noctarius2k in this episode of the weekly, 20 min #CloudCommute #podcast, talking about #distributedsql, #postgresql, #vectordatabases, and more. Tune in!

    The 🎙️ is available on Spotify, iTunes, Pandora, Amazon Music, and more.

    🎥👉 youtu.be/1EAKqwcP2SY

    #vectordatabase #vectorsearch #vectordb #database #databases #postgres #postgressql

  24. 🎉 CrateDB v5.5 is out on the stable channels and it's packed with exciting updates!

    Discover the power of vector storage and similarity search features, revolutionizing complex data analytics, pattern recognition, and AI 🚀 Try out the new drop column feature and much more!

    Find out more about what's new👇
    hubs.ly/Q0283pLg0

    Download CrateDB hubs.ly/Q0283pbF0 🐐