home.social

#richardsutton — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #richardsutton, aggregated by home.social.

  1. Richard S. Sutton, einer der Mitbegründer des Reinforcement Learning und seit Jahrzehnten eine zentrale Figur der KI-Forschung, stellt in seinem YouTube-Vortrag „The Future of AI“ eine unbequeme These auf: So beeindruckend heutige KI-Systeme auch wirken – wissenschaftlich stehen wir seiner Ansicht nach noch am Anfang. #KünstlicheIntelligenz #LernenausErfahrung #ReinforcementLearning #RichardSutton #Sprachmodelle

    wahnsinnwissen.de/?p=1124

  2. So many gems in this interview, just little spoiler:

    #DwarkeshPatel Next token prediction!

    #RichardSutton That’s not a goal. It doesn’t change the world…

    youtu.be/21EYKqUsPfg

    ps: My goal now is aging with such clarity thinking, and relaxed dialectical teaching!!!

    #TheBitterLesson #RL #ML #LLMs #ImitationLearning #GoalDrivenExperience

  3. So many gems in this interview, just little spoiler:

    #DwarkeshPatel Next token prediction!

    #RichardSutton That’s not a goal. It doesn’t change the world…

    youtu.be/21EYKqUsPfg

    ps: My goal now is aging with such clarity thinking, and relaxed dialectical teaching!!!

    #TheBitterLesson #RL #ML #LLMs #ImitationLearning #GoalDrivenExperience

  4. So many gems in this interview, just little spoiler:

    #DwarkeshPatel Next token prediction!

    #RichardSutton That’s not a goal. It doesn’t change the world…

    youtu.be/21EYKqUsPfg

    ps: My goal now is aging with such clarity thinking, and relaxed dialectical teaching!!!

    #TheBitterLesson #RL #ML #LLMs #ImitationLearning #GoalDrivenExperience

  5. So many gems in this interview, just little spoiler:

    #DwarkeshPatel Next token prediction!

    #RichardSutton That’s not a goal. It doesn’t change the world…

    youtu.be/21EYKqUsPfg

    ps: My goal now is aging with such clarity thinking, and relaxed dialectical teaching!!!

    #TheBitterLesson #RL #ML #LLMs #ImitationLearning #GoalDrivenExperience

  6. So many gems in this interview, just little spoiler:

    #DwarkeshPatel Next token prediction!

    #RichardSutton That’s not a goal. It doesn’t change the world…

    youtu.be/21EYKqUsPfg

    ps: My goal now is aging with such clarity thinking, and relaxed dialectical teaching!!!

    #TheBitterLesson #RL #ML #LLMs #ImitationLearning #GoalDrivenExperience

  7. @ekmiller

    » The point of the bitter lesson is that the right learning algorithms (those that scale efficiently with massive computation) are exactly what we need.
    Massive computation does not alleviate the need for data efficiency «

    24/11/2023 #RichardSutton

    Nowadays neuroscience forever expansing body of literature, spreading across different subfields, disperse schools, training practices, and multiple sets of technologies. Instead attempts for building a comprehensive knowledge consensus.

  8. #ACMPrize
    #2024ACMPrize
    #ACMTuringAward

    #AndrewBarto
    #RichardSutton

    » #ReinforcementLearning
    An Introduction
    1998
    standard reference...cited over 75,000
    ...
    prominent example of #RL
    #AlphaGo victory
    over best human #Go players
    2016 2017
    ....
    recently has been the development of the chatbot #ChatGPT
    ...
    large language model #LLM trained in two phases ...employs a technique called
    reinforcement learning from human feedback #RLHF «

    aka cheap labor unnamed in papers

    awards.acm.org/about/2024-turi

    2/2

  9. #ACMPrize
    #2024ACMPrize
    #ACMTuringAward

    #AndrewBarto
    #RichardSutton

    » #ReinforcementLearning
    An Introduction
    1998
    standard reference...cited over 75,000
    ...
    prominent example of #RL
    #AlphaGo victory
    over best human #Go players
    2016 2017
    ....
    recently has been the development of the chatbot #ChatGPT
    ...
    large language model #LLM trained in two phases ...employs a technique called
    reinforcement learning from human feedback #RLHF «

    aka cheap labor unnamed in papers

    awards.acm.org/about/2024-turi

    2/2

  10. #ACMPrize
    #2024ACMPrize
    #ACMTuringAward

    #AndrewBarto
    #RichardSutton

    » #ReinforcementLearning
    An Introduction
    1998
    standard reference...cited over 75,000
    ...
    prominent example of #RL
    #AlphaGo victory
    over best human #Go players
    2016 2017
    ....
    recently has been the development of the chatbot #ChatGPT
    ...
    large language model #LLM trained in two phases ...employs a technique called
    reinforcement learning from human feedback #RLHF «

    aka cheap labor unnamed in papers

    awards.acm.org/about/2024-turi

    2/2

  11. #ACMPrize
    #2024ACMPrize
    #ACMTuringAward

    #AndrewBarto
    #RichardSutton

    » #ReinforcementLearning
    An Introduction
    1998
    standard reference...cited over 75,000
    ...
    prominent example of #RL
    #AlphaGo victory
    over best human #Go players
    2016 2017
    ....
    recently has been the development of the chatbot #ChatGPT
    ...
    large language model #LLM trained in two phases ...employs a technique called
    reinforcement learning from human feedback #RLHF «

    aka cheap labor unnamed in papers

    awards.acm.org/about/2024-turi

    2/2

  12. #ACMPrize
    #2024ACMPrize
    #ACMTuringAward

    #AndrewBarto
    #RichardSutton

    » #ReinforcementLearning
    An Introduction
    1998
    standard reference...cited over 75,000
    ...
    prominent example of #RL
    #AlphaGo victory
    over best human #Go players
    2016 2017
    ....
    recently has been the development of the chatbot #ChatGPT
    ...
    large language model #LLM trained in two phases ...employs a technique called
    reinforcement learning from human feedback #RLHF «

    aka cheap labor unnamed in papers

    awards.acm.org/about/2024-turi

    2/2

  13. @emilygorcenski @kordinglab

    » Not at all.

    The point of the bitter lesson is that the right learning algorithms

    (those that scale efficiently with massive computation)

    are exactly what we need.

    Massive computation does not alleviate the need for data efficiency «

    #RichardSutton 24/11/2023

    nitter.cz/RichardSSutton/statu

    #TheBitterLessonInML

    incompleteideas.net/IncIdeas/B