home.social

#semanticsearch — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #semanticsearch, aggregated by home.social.

  1. I spent some time trying to make search behavior visible in one small Quarkus app.

    Full-text is good at exact terms. Vector search helps when user language and catalog language drift apart. Hybrid is usually the one I’d trust first in a real product search.

    This article walks through all three with Quarkus, PostgreSQL, Elasticsearch, Hibernate Search, and local embeddings.

    the-main-thread.com/p/full-tex

    #Java #Quarkus #PostgreSQL #Elasticsearch #SemanticSearch #HibernateSearch #VectorSearch

  2. I'll be speaking at PHP Tek in May — two talks I've been building toward for a while.

    **Kubernetes for PHP Developers**: The translation guide from Docker Compose to production K8s. No 40-hour course required.

    **Semantic Search in Laravel**: Building search that understands meaning using pgvector and embeddings. Based on what I built for DailyMedToday.

    Both talks from production experience, not theory.

    Full details: eric.mann.blog/speaking-at-php

    #PHP #Kubernetes #Laravel #PHPTek #SemanticSearch

  3. RAG systems: what they are, how they work, architecture, and limitations

    Retrieval-Augmented Generation (RAG) comes up more and more often in the context of LLMs and increasingly appears in developer job requirements, but the term usually hides a rather vague idea of how such systems actually work. In this article I examine RAG as an architectural approach: why it emerged in the first place, which problems it solves, what the basic pipeline from data to model answer looks like, and where problems most often arise in practice.

    habr.com/ru/articles/989000/

    #rag #llm #retrieval #nlp #embeddings #semanticsearch #informationretrieval

  4. ARBITER: what it is / what it isn’t

    IS

    semantic scoring
    geometric fit
    negative answers
    offline 26MB

    ISN’T

    LLM
    vector DB
    embeddings
    retrieval

    getarbiter.dev
    #AI #NLP #RAG #AIInfra #SemanticSearch

  5. 🔍 Can AI transform how we discover biological datasets?

    🔗 Public Omics Explorer (POE): Enabling integrative semantic search across GEO omics datasets based on PubMed publications. Computational and Structural Biotechnology Journal, DOI: doi.org/10.1016/j.csbj.2025.11

    📚 CSBJ: csbj.org/

    #Bioinformatics #Genomics #SemanticSearch #ArtificialIntelligence #BiomedicalResearch #FAIRData #OpenScience #ComputationalBiology #DataDiscovery #MachineLearning

  6. 🤖✨ See AI beyond the hype! Steve Eardley demos LLM-powered semantic search in academic repositories, with live comparisons & insights on AI’s promise & pitfalls. 🌐📚

    📄 Abstract: doi.org/10.7557/5.8363

    #AI #OpenScience #Munin2025 #SemanticSearch #UiT

  7. 🚀 NEW on We ❤️ Open Source 🚀

    Jessica Garson shares how vector databases go beyond keywords to power semantic search, embeddings & smarter AI workflows. A practical intro to RAG & context-aware apps.

    Read the article: allthingsopen.org/articles/vec

    #WeLoveOpenSource #VectorDatabases #AI #SemanticSearch #MachineLearning #OpenSource

  8. The Case of the Vanishing #Hit Count: Rethinking Query Craftsmanship in a Post-Boolean World - Reflections from Day 2 of my FSCI 2025 workshop on AI‑powered search

    👉 Understanding the Shift from Exact #Boolean Hits to the "Top-k" Results of #SemanticSearch and the Evaluated Hits of #DeepSearch.

    By Aaron Tay

    open.substack.com/pub/aarontay

    #SearchEvolution #AcademicSearchChallenges #Discovery #InformationDiscovery #InformationLiteracy #infolit #TopK #SearchStrategy #booleanoperations #HitCount

  13. Aaron Tay: A Deep Dive into EBSCOhost’s Natural Language Search and Web of Science Smart Search – Two bundled “AI-powered” search (I). “This post will examine EBSCOhost’s Natural Language Search (NLS) and, in the next post, Web of Science’s Smart Search (not to be confused with Web of Science Research Assistant). Both are interesting because they introduce this ‘semantic’ query translation […]

    https://rbfirehose.com/2025/07/23/aaron-tay-a-deep-dive-into-ebscohosts-natural-language-search-and-web-of-science-smart-search-two-bundled-ai-poweredsearch-i/

  15. 👨‍💻 Ah, the classic tale of a dev who bravely attempted to revolutionize #GitHub with semantic search but ended up just building another Notion clone. 🚀 Maybe next time try solving a problem that isn't already solved by 12 other apps on the App Store. 📚

    tzx.notion.site/What-I-Learned

    #devstory #semanticsearch #Notionclone #appdevelopment #innovationfail #HackerNews #ngated

  16. Are you attending #DHd2024 and interested in art history, particularly the Renaissance and Vasari's Lives of the Artists? My colleague @sourisnumerique and I are creating a #knowledgegraph to enable new ways to explore and discover Vasari's Renaissance. Come and talk to us; we are interested in your thoughts!

    #renaissance #arthistory #semanticsearch #exploratorysearch #visualization #serendipity @fizise @nfdi4culture @NFDI4Memory

  17. ✨ Open source RAG (Retrieval Augmented Generation) right in your browser! ✨

    SemanticFinder now offers an 𝐚𝐝𝐯𝐚𝐧𝐜𝐞𝐝 𝐜𝐡𝐚𝐭 & 𝐬𝐮𝐦𝐦𝐚𝐫𝐲 𝐟𝐞𝐚𝐭𝐮𝐫𝐞 for your search results - all in your browser.

    💡There are very few capable small LLMs that offer high-quality results. Quantized LaMini-Flan-T5-783M offers good performance with 3-4s load time and >6 tokens/s after model download on an old i7.

    do-me.github.io/SemanticFinder/

  18. Hi everybody #introduction, this is the FIZ ISE (Information Service Engineering) research group at #FIZKarlsruhe and #AIFB/KIT, switching from the birdcage to this lovely new environment. We will be tooting about our latest research in #semanticweb #knowledgegraph #deeplearning #knowledgeextraction #researchdatamanagement #representationlearning #semanticsearch #exploratorysearch and many more.

    Application areas: #culturalheritage #digitalhumanities #materialsscience
    #datascience #mathematics #ai

  19. @janicenovakowski Hey Janice ... welcome!

    A hint to get started ... free text search doesn't work, so it's worth using HashTags so people can find you and your posts. It's a kind of #SemanticSearch.

    #TeachingMath
    #Education
    #Vancouver
    #Canada
    #K12
    ... that sort of thing.
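
A note on the first post in this feed, which contrasts full-text, vector, and hybrid search: the following is a minimal Python sketch of how the three can differ on a toy catalog. It is not the Quarkus/Elasticsearch code from the linked article. The catalog, the hand-written three-dimensional vectors (standing in for real embeddings), and all function names are hypothetical, and reciprocal rank fusion is only one common way to build a hybrid ranking; the article may combine results differently.

```python
import math

# Hypothetical toy catalog; IDs and texts are made up for illustration.
CATALOG = {
    "p1": "red running shoes",
    "p2": "crimson sneakers for jogging",
    "p3": "blue office chair",
}

# Hand-written vectors standing in for real embeddings (assumption).
# Dimensions are loosely: footwear, reddish colour, furniture.
EMBEDDINGS = {
    "p1": [1.0, 1.0, 0.0],
    "p2": [1.0, 0.8, 0.0],
    "p3": [0.0, 0.0, 1.0],
}


def keyword_rank(query, docs):
    """Rank documents by the number of exact query-term matches."""
    terms = query.lower().split()
    scores = {}
    for doc_id, text in docs.items():
        words = text.lower().split()
        score = sum(words.count(term) for term in terms)
        if score > 0:  # documents with no matching term are not hits
            scores[doc_id] = score
    return sorted(scores, key=lambda d: (-scores[d], d))


def cosine(a, b):
    """Cosine similarity of two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def vector_rank(query_vec, embeddings, k):
    """Return the k document IDs most similar to the query vector."""
    ranked = sorted(embeddings, key=lambda d: (-cosine(query_vec, embeddings[d]), d))
    return ranked[:k]


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse ranked lists: each list adds 1 / (k + rank) to a document's score."""
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=lambda d: (-scores[d], d))


keyword_hits = keyword_rank("red shoes", CATALOG)
vector_hits = vector_rank([1.0, 1.0, 0.0], EMBEDDINGS, k=2)
hybrid_hits = reciprocal_rank_fusion([keyword_hits, vector_hits])

print(keyword_hits)  # ['p1']
print(vector_hits)   # ['p1', 'p2']
print(hybrid_hits)   # ['p1', 'p2']
```

In this toy run the keyword ranking misses the "crimson sneakers" item because none of the query terms appear in it, the vector ranking picks it up, and the fused list keeps the exact match first while still including the semantic match.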