home.social

#mechanisticinterpretability — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #mechanisticinterpretability, aggregated by home.social.

  1. Questions? Discussion? Reach out to us:

    Andreas Waldis (UKP Lab/Technische Universität Darmstadt and HSLU Hochschule Luzern), Vagrant Gautam (Universität des Saarlandes), Anne Lauscher (Universität Hamburg), Dietrich Klakow (Universität des Saarlandes), and Iryna Gurevych (UKP Lab/Technische Universität Darmstadt)

    #NLProc #Interpretability #LLMs #ExplainableAI #MechanisticInterpretability #AlignedProbing #ModelInternals

  2. Questions? Discussion? Reach out to us:

    Andreas Waldis (UKP Lab/Technische Universität Darmstadt and HSLU Hochschule Luzern), Vagrant Gautam (Universität des Saarlandes), Anne Lauscher (Universität Hamburg), Dietrich Klakow (Universität des Saarlandes), and Iryna Gurevych (UKP Lab/Technische Universität Darmstadt)

    #NLProc #Interpretability #LLMs #ExplainableAI #MechanisticInterpretability #AlignedProbing #ModelInternals

  3. Questions? Discussion? Reach out to us:

    Andreas Waldis (UKP Lab/Technische Universität Darmstadt and HSLU Hochschule Luzern), Vagrant Gautam (Universität des Saarlandes), Anne Lauscher (Universität Hamburg), Dietrich Klakow (Universität des Saarlandes), and Iryna Gurevych (UKP Lab/Technische Universität Darmstadt)

    #NLProc #Interpretability #LLMs #ExplainableAI #MechanisticInterpretability #AlignedProbing #ModelInternals

  4. Questions? Discussion? Reach out to us:

    Andreas Waldis (UKP Lab/Technische Universität Darmstadt and HSLU Hochschule Luzern), Vagrant Gautam (Universität des Saarlandes), Anne Lauscher (Universität Hamburg), Dietrich Klakow (Universität des Saarlandes), and Iryna Gurevych (UKP Lab/Technische Universität Darmstadt)

    #NLProc #Interpretability #LLMs #ExplainableAI #MechanisticInterpretability #AlignedProbing #ModelInternals

  5. Questions? Discussion? Reach out to us:

    Andreas Waldis (UKP Lab/Technische Universität Darmstadt and HSLU Hochschule Luzern), Vagrant Gautam (Universität des Saarlandes), Anne Lauscher (Universität Hamburg), Dietrich Klakow (Universität des Saarlandes), and Iryna Gurevych (UKP Lab/Technische Universität Darmstadt)

    #NLProc #Interpretability #LLMs #ExplainableAI #MechanisticInterpretability #AlignedProbing #ModelInternals

  6. [Перевод] Как сделать нейросети понятнее: эксперимент OpenAI с разряженными моделями

    Команда AI for Devs подготовила перевод исследования OpenAI о том, как обучение разреженных моделей может сделать ИИ более прозрачным. Авторы показывают: если заставить модель использовать меньше связей, внутри неё появляются понятные цепочки вычислений, которые можно изучать и проверять. Это может стать шагом к созданию мощных, но интерпретируемых систем.

    habr.com/ru/articles/966448/

    #интерпретируемость #разреженныемодели #mechanisticinterpretability #sparsetransformer #цепочкивычислений #circuits #OpenAI #безопасностьИИ #attention #архитектурамоделей

  7. [Перевод] Как сделать нейросети понятнее: эксперимент OpenAI с разряженными моделями

    Команда AI for Devs подготовила перевод исследования OpenAI о том, как обучение разреженных моделей может сделать ИИ более прозрачным. Авторы показывают: если заставить модель использовать меньше связей, внутри неё появляются понятные цепочки вычислений, которые можно изучать и проверять. Это может стать шагом к созданию мощных, но интерпретируемых систем.

    habr.com/ru/articles/966448/

    #интерпретируемость #разреженныемодели #mechanisticinterpretability #sparsetransformer #цепочкивычислений #circuits #OpenAI #безопасностьИИ #attention #архитектурамоделей

  8. [Перевод] Как сделать нейросети понятнее: эксперимент OpenAI с разряженными моделями

    Команда AI for Devs подготовила перевод исследования OpenAI о том, как обучение разреженных моделей может сделать ИИ более прозрачным. Авторы показывают: если заставить модель использовать меньше связей, внутри неё появляются понятные цепочки вычислений, которые можно изучать и проверять. Это может стать шагом к созданию мощных, но интерпретируемых систем.

    habr.com/ru/articles/966448/

    #интерпретируемость #разреженныемодели #mechanisticinterpretability #sparsetransformer #цепочкивычислений #circuits #OpenAI #безопасностьИИ #attention #архитектурамоделей

  9. [Перевод] Как сделать нейросети понятнее: эксперимент OpenAI с разряженными моделями

    Команда AI for Devs подготовила перевод исследования OpenAI о том, как обучение разреженных моделей может сделать ИИ более прозрачным. Авторы показывают: если заставить модель использовать меньше связей, внутри неё появляются понятные цепочки вычислений, которые можно изучать и проверять. Это может стать шагом к созданию мощных, но интерпретируемых систем.

    habr.com/ru/articles/966448/

    #интерпретируемость #разреженныемодели #mechanisticinterpretability #sparsetransformer #цепочкивычислений #circuits #OpenAI #безопасностьИИ #attention #архитектурамоделей

  10. "But every once in a while, Claude breaks bad. It lies. It deceives. It develops weird obsessions. It makes threats and then carries them out. And the frustrating part—true of all LLMs—is that no one knows exactly why." @stevenlevy for Wired

    wired.com/story/ai-black-box-i

    #AI #LLMs #MechanisticInterpretability

  11. Can someone find me a job doable from Amsterdam in mechanistic interpretability? #AI #MI #MechanisticInterpretability

  12. @AAKL @NGIZero @Reuters @EC_NGI

    Trying to regulate AI can be like regulating math in that suddenly certain calculations are illegal.

    Trying to regulate AI can be like regulating the printing press in that suddenly only people with enough lawyers are able to make a printing press.

    Trying to regulate AI can be like trying to regulate free speech. From now on only certain forms of speech are allowed (a bit like the book 1984).

    Microsoft, Facebook, Google, Nvidia and others are investing billions (trillions?). Big Tech and Governments want to continue with data mining, government surveillance and surveillance capitalism. This will go as long as long as we let them. The incentives are there. To change that we need social awareness, a consciousness shift or a libre ethical digital revolution.

    If you truly want to stop AI then think about how to organize:

    Otherwise focusing on research, libre AI, libre riscv, libre silicon and research (ethics,safety,mechanistic interpretability) is currently our best hope.

    #betterwithoutai #antitechrevolution #libreai #libreriscv #libresilicon #aiethics #ethicalai #aisafety #mechanisticinterpretability #riscv #ai #datamining #governmentsurveillance

  13. @AAKL @NGIZero @Reuters @EC_NGI

    Trying to regulate AI can be like regulating math in that suddenly certain calculations are illegal.

    Trying to regulate AI can be like regulating the printing press in that suddenly only people with enough lawyers are able to make a printing press.

    Trying to regulate AI can be like trying to regulate free speech. From now on only certain forms of speech are allowed (a bit like the book 1984).

    Microsoft, Facebook, Google, Nvidia and others are investing billions (trillions?). Big Tech and Governments want to continue with data mining, government surveillance and surveillance capitalism. This will go as long as long as we let them. The incentives are there. To change that we need social awareness, a consciousness shift or a libre ethical digital revolution.

    If you truly want to stop AI then think about how to organize:

    Otherwise focusing on research, libre AI, libre riscv, libre silicon and research (ethics,safety,mechanistic interpretability) is currently our best hope.

    #betterwithoutai #antitechrevolution #libreai #libreriscv #libresilicon #aiethics #ethicalai #aisafety #mechanisticinterpretability #riscv #ai #datamining #governmentsurveillance

  14. @AAKL @NGIZero @Reuters @EC_NGI

    Trying to regulate AI can be like regulating math in that suddenly certain calculations are illegal.

    Trying to regulate AI can be like regulating the printing press in that suddenly only people with enough lawyers are able to make a printing press.

    Trying to regulate AI can be like trying to regulate free speech. From now on only certain forms of speech are allowed (a bit like the book 1984).

    Microsoft, Facebook, Google, Nvidia and others are investing billions (trillions?). Big Tech and Governments want to continue with data mining, government surveillance and surveillance capitalism. This will go as long as long as we let them. The incentives are there. To change that we need social awareness, a consciousness shift or a libre ethical digital revolution.

    If you truly want to stop AI then think about how to organize:

    Otherwise focusing on research, libre AI, libre riscv, libre silicon and research (ethics,safety,mechanistic interpretability) is currently our best hope.

    #betterwithoutai #antitechrevolution #libreai #libreriscv #libresilicon #aiethics #ethicalai #aisafety #mechanisticinterpretability #riscv #ai #datamining #governmentsurveillance

  15. @AAKL @NGIZero @Reuters @EC_NGI

    Trying to regulate AI can be like regulating math in that suddenly certain calculations are illegal.

    Trying to regulate AI can be like regulating the printing press in that suddenly only people with enough lawyers are able to make a printing press.

    Trying to regulate AI can be like trying to regulate free speech. From now on only certain forms of speech are allowed (a bit like the book 1984).

    Microsoft, Facebook, Google, Nvidia and others are investing billions (trillions?). Big Tech and Governments want to continue with data mining, government surveillance and surveillance capitalism. This will go as long as long as we let them. The incentives are there. To change that we need social awareness, a consciousness shift or a libre ethical digital revolution.

    If you truly want to stop AI then think about how to organize:

    Otherwise focusing on research, libre AI, libre riscv, libre silicon and research (ethics,safety,mechanistic interpretability) is currently our best hope.

    #betterwithoutai #antitechrevolution #libreai #libreriscv #libresilicon #aiethics #ethicalai #aisafety #mechanisticinterpretability #riscv #ai #datamining #governmentsurveillance

  16. @AAKL @NGIZero @Reuters @EC_NGI

    Trying to regulate AI can be like regulating math in that suddenly certain calculations are illegal.

    Trying to regulate AI can be like regulating the printing press in that suddenly only people with enough lawyers are able to make a printing press.

    Trying to regulate AI can be like trying to regulate free speech. From now on only certain forms of speech are allowed (a bit like the book 1984).

    Microsoft, Facebook, Google, Nvidia and others are investing billions (trillions?). Big Tech and Governments want to continue with data mining, government surveillance and surveillance capitalism. This will go as long as long as we let them. The incentives are there. To change that we need social awareness, a consciousness shift or a libre ethical digital revolution.

    If you truly want to stop AI then think about how to organize:

    Otherwise focusing on research, libre AI, libre riscv, libre silicon and research (ethics,safety,mechanistic interpretability) is currently our best hope.

    #betterwithoutai #antitechrevolution #libreai #libreriscv #libresilicon #aiethics #ethicalai #aisafety #mechanisticinterpretability #riscv #ai #datamining #governmentsurveillance

  17. @AAKL @Reuters

    Instead of making laws focus on funding:

    • libre AI
    • libre risc v
    • libre silicon
    • research on AI ethics
    • research on AI safety
    • research on mechanistic interpretability
    • NLnet
    • projects like openchatkit and open asisstant

    Laws and regulations (including license restrictions) usually benefit corporations and governments who have lawyers, lobyists, perverse incentives and a drive for data mining and surveillance. Even if you do your best to make a good law it could make it harder for libre efforts to comply. Put your trust in research, decentralization, libre software, libre hardware and research.

    Billions of dollars are being invested by companies like nvidia,google and microsoft.

    #libreai #nlnet #libreai #libreriscv #riscv #libresilicon #aiethics #ethicalai #aisafety #mechanisticinterpretability #freelibresoftware #libresoftware

    @NGIZero @EC_NGI

  18. @gregorni

    Related efforts that I know of.

    The ability to run bloom on your own hardware: https://github.com/NouamaneTazi/bloomz.cpp https://github.com/NouamaneTazi/bloomz.cpp/issues/4

    Vipergpt. Not sure whether it will be libre. (Hopefully) https://viper.cs.columbia.edu/

    King Algorithm Manifesto https://github.com/keskival/king-algorithm-manifesto/blob/main/README.md

    Better Without AI which advocates for mechanistic interpretability https://betterwithout.ai

    There is also a recommended reading list for stochastic parrots but I am not linking it here since I do not want to promote google docs.

    Also does anyone know of any efforts to provide libre riscv AI accelerator that can be run on FPGA?

    #bloom #vipergpt #kingalgorithmmanifesto #stochasticparrots #betterwithoutai #mechanisticinterpretability #riscv

  19. @gregorni

    Related efforts that I know of.

    The ability to run bloom on your own hardware: https://github.com/NouamaneTazi/bloomz.cpp https://github.com/NouamaneTazi/bloomz.cpp/issues/4

    Vipergpt. Not sure whether it will be libre. (Hopefully) https://viper.cs.columbia.edu/

    King Algorithm Manifesto https://github.com/keskival/king-algorithm-manifesto/blob/main/README.md

    Better Without AI which advocates for mechanistic interpretability https://betterwithout.ai

    There is also a recommended reading list for stochastic parrots but I am not linking it here since I do not want to promote google docs.

    Also does anyone know of any efforts to provide libre riscv AI accelerator that can be run on FPGA?

    #bloom #vipergpt #kingalgorithmmanifesto #stochasticparrots #betterwithoutai #mechanisticinterpretability #riscv

  20. @gregorni

    Related efforts that I know of.

    The ability to run bloom on your own hardware: https://github.com/NouamaneTazi/bloomz.cpp https://github.com/NouamaneTazi/bloomz.cpp/issues/4

    Vipergpt. Not sure whether it will be libre. (Hopefully) https://viper.cs.columbia.edu/

    King Algorithm Manifesto https://github.com/keskival/king-algorithm-manifesto/blob/main/README.md

    Better Without AI which advocates for mechanistic interpretability https://betterwithout.ai

    There is also a recommended reading list for stochastic parrots but I am not linking it here since I do not want to promote google docs.

    Also does anyone know of any efforts to provide libre riscv AI accelerator that can be run on FPGA?

    #bloom #vipergpt #kingalgorithmmanifesto #stochasticparrots #betterwithoutai #mechanisticinterpretability #riscv

  21. @gregorni

    Related efforts that I know of.

    The ability to run bloom on your own hardware: https://github.com/NouamaneTazi/bloomz.cpp https://github.com/NouamaneTazi/bloomz.cpp/issues/4

    Vipergpt. Not sure whether it will be libre. (Hopefully) https://viper.cs.columbia.edu/

    King Algorithm Manifesto https://github.com/keskival/king-algorithm-manifesto/blob/main/README.md

    Better Without AI which advocates for mechanistic interpretability https://betterwithout.ai

    There is also a recommended reading list for stochastic parrots but I am not linking it here since I do not want to promote google docs.

    Also does anyone know of any efforts to provide libre riscv AI accelerator that can be run on FPGA?

    #bloom #vipergpt #kingalgorithmmanifesto #stochasticparrots #betterwithoutai #mechanisticinterpretability #riscv

  22. @tero @simon @estranho @emilymbender

    Thanks for the interesting discussion.

    IMHO fully open language models (including the weights) under a libre license (without restrictions) in combination with more research on mechanistic interpretability might eventually lead to useful libre models running on our own hardware.

    https://duckduckgo.com/?q=mechanistic+interpretability+site%3Ahttps%3A%2F%2Fbetterwithout.ai&ia=web

    Question to all: is there anything special about chatgpt4 compared to chatgpt (3.5) ?

    #chatgpt4 #gpt4 #openai #mechanisticinterpretability