home.social

#tokenizers — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #tokenizers, aggregated by home.social.

  1. 🔧 #code2prompt: A command-line tool for converting codebases to #LLM prompts

    Key features:
    • 📁 Generates well-formatted #Markdown prompts with source tree structure
    • 🛠️ Customizable #Handlebars templates for versatile prompt generation
    • 🔍 Respects .gitignore and supports file filtering with glob patterns
    • 🔢 Displays token count using various #tokenizers (cl100k, p50k, r50k_base)
    • 📊 #Git diff integration for commit messages and #PullRequest descriptions
    • 📋 Automatic clipboard copy and option to save output to file

    Additional capabilities:
    • 🔢 Line numbering for source code blocks
    • 🔀 JSON output option for structured data
    • 🚫 Exclusion of files/folders from source tree
    • 📝 Support for user-defined variables in templates

    #opensource project written in #Rust, available on #crates_io and #AUR

    Useful for:
    • Quick #LLM prompt generation from codebases
    • Code documentation and analysis
    • Bug finding and security vulnerability assessment
    • Performance optimization suggestions

    github.com/mufeedvh/code2promp