home.social

#recursivepollution — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #recursivepollution, aggregated by home.social.

  1. NEW BIML Bibliography entry

    papers.ssrn.com/sol3/papers.cf

    AI Agent Traps

    Matija Franklin, Nenad Tomašev, Julian Jacobs, Joel Z. Leibo, Simon Osindero

    This paper is a honey pot analog. Very "airy" with uncanny valley writing...we bet this one was written by an LLM. This is about agents working on a polluted environment and is related to poison, pollution, and recursion.

    #MLsec #Agents #Attacks #RecursivePollution

    berryvilleiml.com/bibliography/

  2. NEW BIML Bibliography entry
    \
    economics.mit.edu/sites/defaul

    AI, Human Cognition and Knowledge Collapse

    Daron Acemoglu, Dingwen Kong, Asuman Ozdaglar

    This paper is very think tanky, and more sociology than anything else. The model is very sparse. Not as relevant to our work on recursive pollution as we were hoping. It does mention the degradation of the information environment.

    #AI-Philosophy #Policy #RecursivePollution

    berryvilleiml.com/bibliography/

  3. NEW BIML Bibliography entry

    arxiv.org/abs/2503.03150

    Position: Model Collapse Does Not Mean What You Think

    Rylan Schaeffer, Joshua Kazdan, Alvan Caleb Arulandu, Sanmi Koyejo

    We think recursive pollution is a better term than model collapse. Weak terminology leads to misunderstanding of impact. See figure 4. This is a very good paper.

    #TOPPAPER #MLsec #RecursivePollution #DataPoisoning

    berryvilleiml.com/references/

  4. NEW BIML Bibliography entry

    arxiv.org/abs/2503.03150

    Position: Model Collapse Does Not Mean What You Think

    Rylan Schaeffer, Joshua Kazdan, Alvan Caleb Arulandu, Sanmi Koyejo

    We think recursive pollution is a better term than model collapse. Weak terminology leads to misunderstanding of impact. See figure 4. This is a very good paper.

    #TOPPAPER #MLsec #RecursivePollution #DataPoisoning

    berryvilleiml.com/references/

  5. NEW BIML Bibliography entry

    arxiv.org/abs/2503.03150

    Position: Model Collapse Does Not Mean What You Think

    Rylan Schaeffer, Joshua Kazdan, Alvan Caleb Arulandu, Sanmi Koyejo

    We think recursive pollution is a better term than model collapse. Weak terminology leads to misunderstanding of impact. See figure 4. This is a very good paper.

    #TOPPAPER #MLsec #RecursivePollution #DataPoisoning

    berryvilleiml.com/references/

  6. NEW BIML Bibliography entry

    arxiv.org/abs/2503.03150

    Position: Model Collapse Does Not Mean What You Think

    Rylan Schaeffer, Joshua Kazdan, Alvan Caleb Arulandu, Sanmi Koyejo

    We think recursive pollution is a better term than model collapse. Weak terminology leads to misunderstanding of impact. See figure 4. This is a very good paper.

    #TOPPAPER #MLsec #RecursivePollution #DataPoisoning

    berryvilleiml.com/references/

  7. NEW BIML Bibliography entry

    arxiv.org/abs/2503.03150

    Position: Model Collapse Does Not Mean What You Think

    Rylan Schaeffer, Joshua Kazdan, Alvan Caleb Arulandu, Sanmi Koyejo

    We think recursive pollution is a better term than model collapse. Weak terminology leads to misunderstanding of impact. See figure 4. This is a very good paper.

    #TOPPAPER #MLsec #RecursivePollution #DataPoisoning

    berryvilleiml.com/references/

  8. NEW BIML Bibliography entry

    arxiv.org/abs/2404.05090

    How Bad is Training on Synthetic Data? A Statistical Analysis of Language Model Collapse

    Mohamed El Amine Seddik, et al

    This treatment fails because the models being studied are TOY models too simple to be interesting.

    #MLsec #RecursivePollution #SyntheticData

    berryvilleiml.com/references/

  9. NEW BIML Bibliography entry

    arxiv.org/abs/2404.05090

    How Bad is Training on Synthetic Data? A Statistical Analysis of Language Model Collapse

    Mohamed El Amine Seddik, et al

    This treatment fails because the models being studied are TOY models too simple to be interesting.

    #MLsec #RecursivePollution #SyntheticData

    berryvilleiml.com/references/

  10. NEW BIML Bibliography entry

    arxiv.org/abs/2404.05090

    How Bad is Training on Synthetic Data? A Statistical Analysis of Language Model Collapse

    Mohamed El Amine Seddik, et al

    This treatment fails because the models being studied are TOY models too simple to be interesting.

    #MLsec #RecursivePollution #SyntheticData

    berryvilleiml.com/references/

  11. NEW BIML Bibliography entry

    arxiv.org/abs/2404.05090

    How Bad is Training on Synthetic Data? A Statistical Analysis of Language Model Collapse

    Mohamed El Amine Seddik, et al

    This treatment fails because the models being studied are TOY models too simple to be interesting.

    #MLsec #RecursivePollution #SyntheticData

    berryvilleiml.com/references/

  12. NEW BIML Bibliography entry

    arxiv.org/abs/2404.05090

    How Bad is Training on Synthetic Data? A Statistical Analysis of Language Model Collapse

    Mohamed El Amine Seddik, et al

    This treatment fails because the models being studied are TOY models too simple to be interesting.

    #MLsec #RecursivePollution #SyntheticData

    berryvilleiml.com/references/

  13. NEW BIML Bibliography entry

    arxiv.org/abs/2502.18865

    A Theoretical Perspective: How to Prevent Model Collapse in Self-consuming Training Loops

    Shi Fu, Yingjie Wang, Yuzhu Chen, Xinmei Tian, Dacheng Tao

    Published at ICLR 2025. A bit overfocused on the real vs synthetic data problem, this paper covers the depletion of real data available for training ML. STLs are getting very close indeed to recursive pollution, so the math here is relevant.

    #MLsec #RecursivePollution

    berryvilleiml.com/references/

  14. NEW BIML Bibliography entry

    arxiv.org/abs/2410.04840

    Strong Model Collapse

    Elvis Dohmatob, Yunzhen Feng, Arjun Subramonian, Julia Kempe
    (NYU and META)

    Recursive pollution leads to model collapse. This view of strong model collapse describes what happens in the case of recursive data poison.
    #TOPPAPER #MLsec #Data #RecursivePollution

    berryvilleiml.com/references/

  15. NEW BIML Bibliography entry

    arxiv.org/abs/2410.04840

    Strong Model Collapse

    Elvis Dohmatob, Yunzhen Feng, Arjun Subramonian, Julia Kempe
    (NYU and META)

    Recursive pollution leads to model collapse. This view of strong model collapse describes what happens in the case of recursive data poison.
    #TOPPAPER #MLsec #Data #RecursivePollution

    berryvilleiml.com/references/

  16. NEW BIML Bibliography entry

    arxiv.org/abs/2410.04840

    Strong Model Collapse

    Elvis Dohmatob, Yunzhen Feng, Arjun Subramonian, Julia Kempe
    (NYU and META)

    Recursive pollution leads to model collapse. This view of strong model collapse describes what happens in the case of recursive data poison.
    #TOPPAPER #MLsec #Data #RecursivePollution

    berryvilleiml.com/references/

  17. NEW BIML Bibliography entry

    arxiv.org/abs/2410.04840

    Strong Model Collapse

    Elvis Dohmatob, Yunzhen Feng, Arjun Subramonian, Julia Kempe
    (NYU and META)

    Recursive pollution leads to model collapse. This view of strong model collapse describes what happens in the case of recursive data poison.
    #TOPPAPER #MLsec #Data #RecursivePollution

    berryvilleiml.com/references/

  18. NEW BIML Bibliography entry

    arxiv.org/abs/2410.04840

    Strong Model Collapse

    Elvis Dohmatob, Yunzhen Feng, Arjun Subramonian, Julia Kempe
    (NYU and META)

    Recursive pollution leads to model collapse. This view of strong model collapse describes what happens in the case of recursive data poison.
    #TOPPAPER #MLsec #Data #RecursivePollution

    berryvilleiml.com/references/

  19. NEW BIML Bibliography entry

    arxiv.org/abs/2509.16499

    A Closer Look at Model Collapse: From a Generalization-to-Memorization Perspective

    Lianghe Shi, et al

    A very nice set of references to work in model collapse. Collapsed model == lookup table (that is, no generalization). Discussion of recursive pollution as causing variance shrinkage or distribution shift.

    #TOPPAPER #MLsec #Data #RecursivePollution

    berryvilleiml.com/references/

  20. NEW BIML Bibliography entry

    arxiv.org/abs/2509.16499

    A Closer Look at Model Collapse: From a Generalization-to-Memorization Perspective

    Lianghe Shi, et al

    A very nice set of references to work in model collapse. Collapsed model == lookup table (that is, no generalization). Discussion of recursive pollution as causing variance shrinkage or distribution shift.

    #TOPPAPER #MLsec #Data #RecursivePollution

    berryvilleiml.com/references/

  21. NEW BIML Bibliography entry

    arxiv.org/abs/2509.16499

    A Closer Look at Model Collapse: From a Generalization-to-Memorization Perspective

    Lianghe Shi, et al

    A very nice set of references to work in model collapse. Collapsed model == lookup table (that is, no generalization). Discussion of recursive pollution as causing variance shrinkage or distribution shift.

    #TOPPAPER #MLsec #Data #RecursivePollution

    berryvilleiml.com/references/

  22. NEW BIML Bibliography entry

    arxiv.org/abs/2509.16499

    A Closer Look at Model Collapse: From a Generalization-to-Memorization Perspective

    Lianghe Shi, et al

    A very nice set of references to work in model collapse. Collapsed model == lookup table (that is, no generalization). Discussion of recursive pollution as causing variance shrinkage or distribution shift.

    #TOPPAPER #MLsec #Data #RecursivePollution

    berryvilleiml.com/references/

  23. NEW BIML Bibliography entry

    arxiv.org/abs/2509.16499

    A Closer Look at Model Collapse: From a Generalization-to-Memorization Perspective

    Lianghe Shi, et al

    A very nice set of references to work in model collapse. Collapsed model == lookup table (that is, no generalization). Discussion of recursive pollution as causing variance shrinkage or distribution shift.

    #TOPPAPER #MLsec #Data #RecursivePollution

    berryvilleiml.com/references/