home.social
  1. meets
    @ollama !
    You can now add superpowers to your favorite in-browser semantic search tool🥳
    Chat with your document or an entire book and fuel with your relevant paragraphs!
    App: do-me.github.io/SemanticFinder/
    Announcement: reddit.com/r/ollama/comments/1

  2. now offers built-in dimensionality reduction with wasm-powered BH-tSNE. Try it now: do-me.github.io/SemanticFinder/

  3. Private semantic & hybrid search for large documents like books in 2 seconds in your browser with !🚀Examples:
    - The Bible (en) do-me.github.io/SemanticFinder
    - Les Misérables (fr) do-me.github.io/SemanticFinder
    - Das Kapital (de) do-me.github.io/SemanticFinder
    Full catalogue: huggingface.co/datasets/do-me/

    You can save your own index and keep it private or share it publicly. Proposals for books or docs of public interest always welcome - just share a source URL and I'll add it to the collection :)

  4. Just added the current leader /UAE-Large-V1 to and it's performing great! Would love a base or small version to have it slightly faster for in-browser semantic search.
    Test here: do-me.github.io/SemanticFinder/

  5. ✨ Open source RAG (Retrieval Augmented Generation) right in your browser! ✨

    now offers an 𝐚𝐝𝐯𝐚𝐧𝐜𝐞𝐝 𝐜𝐡𝐚𝐭 & 𝐬𝐮𝐦𝐦𝐚𝐫𝐲 𝐟𝐞𝐚𝐭𝐮𝐫𝐞 for your search results - all in your browser.

    💡There are very few capable small LLMs that offer high-quality results. Quantized LaMini-Flan-T5-783M offers good performance with 3-4s load time and >6 tokens/s after model download on an old i7.

    do-me.github.io/SemanticFinder/

  6. Just indexed the whole bible in my browser with Jina AI's new 8k embeddings and (do-me.github.io/SemanticFinder/). 742 pages, 80.500 lines or 4.641.000 chars and my browser doesn't even crash. 30-40 mins indexing time but less than 60s for any consecutive search! 🚀

  7. 𝐉𝐢𝐧𝐚 𝐀𝐈 𝟖𝐤 𝐭𝐞𝐱𝐭 𝐞𝐦𝐛𝐞𝐝𝐝𝐢𝐧𝐠𝐬 📄
    I just quantized both available versions for 𝟒𝐱 𝐫𝐞𝐝𝐮𝐜𝐞𝐝 𝐟𝐢𝐥𝐞 𝐬𝐢𝐳𝐞 and usage in .js and (do-me.github.io/SemanticFinder/) when dealing with a large corpus:

    - 𝟐𝟖.𝟓 𝐌𝐁 jina-embeddings-v2-small-en (huggingface.co/do-me/jina-embe)
    - 𝟏𝟎𝟗 𝐌𝐁 jina-embeddings-v2-base-en (huggingface.co/do-me/jina-embe)

    ⚠️ I noted however, that the base version seems to perform a little poor on smaller text chunks. Test in SemanticFinder.

    Jina AI announcement: jina.ai/news/jina-ai-launches-

  8. 𝐆𝐮𝐞𝐫𝐢𝐥𝐥𝐚 𝐒𝐞𝐦𝐚𝐧𝐭𝐢𝐜 𝐒𝐞𝐚𝐫𝐜𝐡 𝐓𝐮𝐭𝐨𝐫𝐢𝐚𝐥 🔎

    Create an open source semantic search web app for the Services Catalogue:

    📦 Mine data
    📝 Chunk and index the documents
    🖋️ Write a static web app
    🚀 Host for free on GitHub pages

    ⭐ Lots of practical tricks. Built with , , , .js, .js.

    𝐓𝐮𝐭𝐨𝐫𝐢𝐚𝐥: geo.rocks/post/semantic-search
    𝐆𝐢𝐭𝐇𝐮𝐛 repo with Jupyter Notebook: github.com/do-me/copernicus-se
    𝐖𝐞𝐛 𝐚𝐩𝐩: do-me.github.io/copernicus-ser

  9. 🔍 𝗦𝗲𝗺𝗮𝗻𝘁𝗶𝗰𝗙𝗶𝗻𝗱𝗲𝗿 𝗕𝗿𝗼𝘄𝘀𝗲𝗿 𝗘𝘅𝘁𝗲𝗻𝘀𝗶𝗼𝗻 🔍
    Varun and me are super excited to present you the open source Chrome extension for - in-browser, privacy-preserving ! 🥳
    GitHub: github.com/do-me/SemanticFinder
    Web: do-me.github.io/SemanticFinder
    Contributors wanted!

  10. just received a major feature contribution from @IamVarunSri! Interactively explore all the results for your semantic search query right in your browser for any text up to hundreds of pages: github.com/do-me/SemanticFinder

    Made with github.com/xenova/transformers.js

  11. You can now pre-index documents with and allow for blazingly fast semantic search for very large documents of e.g. 100 pages right in your browser!
    See the example:
    𝗜𝗣𝗖𝗖 𝗥𝗲𝗽𝗼𝗿𝘁: geo.rocks/semanticfinder/ipcc
    𝗚𝗶𝘁𝗛𝘂𝗯: github.com/do-me/SemanticFinder

  12. Just updated the UI of

    geo.rocks/semanticfinder/

    You can scroll through the results now with one click, enabling you to quickly find what you're looking for! Also working on mobile!

  13. 𝗦𝗲𝗺𝗮𝗻𝘁𝗶𝗰𝗙𝗶𝗻𝗱𝗲𝗿 - A browser-based semantic search engine you can use to query your own texts!

    Demo: geo.rocks/semanticfinder/
    Blog Post: geo.rocks/post/semanticfinder-
    GitHub: github.com/do-me/SemanticFinde

    Built with amazing open-source software: (all-MiniLM-L6-v2), .js, and .

  14. Create a semantic search engine with only a vector database and a light-weight frontend - keep the inference server client-side!

    Tutorial with demo: geo.rocks/post/qdrant-transfor

    Powered by amazing open-source software from , .js and !