home.social

#conformalprediction — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #conformalprediction, aggregated by home.social.

  1. How do we make LLM output more trustworthy? A short survey note on three lines of recent work covering five papers: conformal-prediction coverage guarantees, behavioral calibration of the model's prose, and sample-disagreement detection. All three pay the same multi-sample inference tax; the choice is about what you want back.

    benjaminhan.net/posts/20260505

    #Hallucination #LLMs #Calibration #ConformalPrediction #AI

  2. Conformal Factuality casts LM correctness as uncertainty quantification. Decompose the answer into sub-claims, score each, drop the low-confidence ones until the retained set is ~1-α factual. The sub-claim decomposition is doing most of the work, and the conformal machinery rides on top. Atomic-claim splitters have known failure modes, and the guarantee inherits them.

    benjaminhan.net/posts/20260505

    #ConformalPrediction #Calibration #Hallucination #LLMs #ICML #AI

  3. Conformal Language Modeling (CLM) adapts conformal prediction to generative LMs: sample candidates, stop when a calibrated rule fires, return a set guaranteed to contain an acceptable answer. The more interesting half is the component-level filter — per-phrase coverage, not just set-level. That's the primitive for hallucination flagging: highlight the vetted phrases, leave the rest for review.

    benjaminhan.net/posts/20260505

    #ConformalPrediction #LLMs #Hallucination #ICLR #AI

  4. A primer on conformal prediction: the recipe for distribution-free coverage guarantees that doesn't require your model to be calibrated. Rank-based non-conformity scores plus a calibration quantile give you valid prediction sets. Easy inputs get one-class sets; hard ones get many alternatives. Set size is where the uncertainty shows up.

    benjaminhan.net/posts/20260505

    #ConformalPrediction #Calibration #UncertaintyQuantification #AI

  5. 🚨 New blog post 🚨

    "**Optimal prediction sets for plant identification: an interactive guide**"

    josephsalmon.eu/blog/long-tail/

    Joint work with Tiffany Ding and Jean-Baptiste Fermanian.

    #longtail
    #PlantNet
    #AppliedConformalPrediction
    #ConformalPrediction

  6. TIL of #conformalprediction, a way to assess the uncertainty of a prediction (from any algorithm, including from #machineleaning). It is used in research to make #autonomousdriving safer by predicting other agent's movements: youtube.com/watch?v=QvIJH4cZy3

    It does not require an expert model, but in turn it needs a statistically representative dataset.