#richardsutton — Public Fediverse posts on home.social

Bytes Europe @[email protected] · 2026-03-14 · 06:34 UTC

QbitAI Exclusive: Interview with Terence Tao https://www.byteseu.com/1869574/ #AIPopularization #AIXScience #BasicScientificResearch #DataQuality #hallucinations #interpretability #mathematics #ProteinFolding #RichardSutton #SAIRFoundation #ScalingTheScienceOfAI #Science #StandardizedCitations #SyntheticData #TerenceTao #traceability

#traceability #terencetao #syntheticdata #standardizedcitations #science #scalingthescienceofai

Wahnsinnwissen @[email protected] · 2026-02-27 · 08:06 UTC

Richard S. Sutton, einer der Mitbegründer des Reinforcement Learning und seit Jahrzehnten eine zentrale Figur der KI-Forschung, stellt in seinem YouTube-Vortrag „The Future of AI“ eine unbequeme These auf: So beeindruckend heutige KI-Systeme auch wirken – wissenschaftlich stehen wir seiner Ansicht nach noch am Anfang. #KünstlicheIntelligenz #LernenausErfahrung #ReinforcementLearning #RichardSutton #Sprachmodelle

https://wahnsinnwissen.de/?p=1124

#kunstlicheintelligenz #lernenauserfahrung #reinforcementlearning #richardsutton #sprachmodelle

Canada News Beep @[email protected] · 2025-11-25 · 09:40 UTC

U of A’s AI expertise stands out in latest Global Ranking of Academic Subjects

The University of Alberta is a glo…
#NewsBeep #News #Canada #agriculturalsciences #AI #Amii #Artificialintelligence #ARWU #Automation #BillFlanagan #BiologicalSciences #CA #CollegeofNaturalandAppliedSciences #earthsciences #GlobalRankingofAcademicSubjects #GRAS #instrumentsscience #MatinaKalcounis-Rueppell #nursing #publichealth #Rankings #RichardSutton #ShanghaiRanking #Technology
https://www.newsbeep.com/ca/305629/

#technology #shanghairanking #richardsutton #rankings #publichealth #nursing

Teixi @[email protected] · 2025-10-24 · 21:02 UTC

So many gems in this interview, just little spoiler:

#DwarkeshPatel Next token prediction!

#RichardSutton That’s not a goal. It doesn’t change the world…

https://youtu.be/21EYKqUsPfg

ps: My goal now is aging with such clarity thinking, and relaxed dialectical teaching!!!

#TheBitterLesson #RL #ML #LLMs #ImitationLearning #GoalDrivenExperience

#dwarkeshpatel #richardsutton #thebitterlesson #rl #ml #llms

Teixi @[email protected] · 2025-10-24 · 21:02 UTC

So many gems in this interview, just little spoiler:

#DwarkeshPatel Next token prediction!

#RichardSutton That’s not a goal. It doesn’t change the world…

https://youtu.be/21EYKqUsPfg

ps: My goal now is aging with such clarity thinking, and relaxed dialectical teaching!!!

#TheBitterLesson #RL #ML #LLMs #ImitationLearning #GoalDrivenExperience

#dwarkeshpatel #richardsutton #thebitterlesson #rl #ml #llms

Teixi @[email protected] · 2025-10-24 · 21:02 UTC

So many gems in this interview, just little spoiler:

#DwarkeshPatel Next token prediction!

#RichardSutton That’s not a goal. It doesn’t change the world…

https://youtu.be/21EYKqUsPfg

ps: My goal now is aging with such clarity thinking, and relaxed dialectical teaching!!!

#TheBitterLesson #RL #ML #LLMs #ImitationLearning #GoalDrivenExperience

#dwarkeshpatel #richardsutton #thebitterlesson #rl #ml #llms

Teixi @[email protected] · 2025-10-24 · 21:02 UTC

So many gems in this interview, just little spoiler:

#DwarkeshPatel Next token prediction!

#RichardSutton That’s not a goal. It doesn’t change the world…

https://youtu.be/21EYKqUsPfg

ps: My goal now is aging with such clarity thinking, and relaxed dialectical teaching!!!

#TheBitterLesson #RL #ML #LLMs #ImitationLearning #GoalDrivenExperience

#goaldrivenexperience #imitationlearning #llms #ml #rl #thebitterlesson

Teixi @[email protected] · 2025-10-24 · 21:02 UTC

So many gems in this interview, just little spoiler:

#DwarkeshPatel Next token prediction!

#RichardSutton That’s not a goal. It doesn’t change the world…

https://youtu.be/21EYKqUsPfg

ps: My goal now is aging with such clarity thinking, and relaxed dialectical teaching!!!

#TheBitterLesson #RL #ML #LLMs #ImitationLearning #GoalDrivenExperience

#dwarkeshpatel #richardsutton #thebitterlesson #rl #ml #llms

Teixi @[email protected] · 2025-03-30 · 03:01 UTC

@ekmiller

» The point of the bitter lesson is that the right learning algorithms (those that scale efficiently with massive computation) are exactly what we need.
Massive computation does not alleviate the need for data efficiency «

24/11/2023 #RichardSutton

Nowadays neuroscience forever expansing body of literature, spreading across different subfields, disperse schools, training practices, and multiple sets of technologies. Instead attempts for building a comprehensive knowledge consensus.

#richardsutton

Teixi @[email protected] · 2025-03-09 · 00:58 UTC

#ACMPrize
#2024ACMPrize
#ACMTuringAward

#AndrewBarto
#RichardSutton

» #ReinforcementLearning
An Introduction
1998
standard reference...cited over 75,000
...
prominent example of #RL
#AlphaGo victory
over best human #Go players
2016 2017
....
recently has been the development of the chatbot #ChatGPT
...
large language model #LLM trained in two phases ...employs a technique called
reinforcement learning from human feedback #RLHF «

aka cheap labor unnamed in papers

https://awards.acm.org/about/2024-turing

2/2

#acmprize #2024acmprize #acmturingaward #andrewbarto #richardsutton #reinforcementlearning

Teixi @[email protected] · 2025-03-09 · 00:58 UTC

#ACMPrize
#2024ACMPrize
#ACMTuringAward

#AndrewBarto
#RichardSutton

» #ReinforcementLearning
An Introduction
1998
standard reference...cited over 75,000
...
prominent example of #RL
#AlphaGo victory
over best human #Go players
2016 2017
....
recently has been the development of the chatbot #ChatGPT
...
large language model #LLM trained in two phases ...employs a technique called
reinforcement learning from human feedback #RLHF «

aka cheap labor unnamed in papers

https://awards.acm.org/about/2024-turing

2/2

#acmprize #2024acmprize #acmturingaward #andrewbarto #richardsutton #reinforcementlearning

Teixi @[email protected] · 2025-03-09 · 00:58 UTC

#ACMPrize
#2024ACMPrize
#ACMTuringAward

#AndrewBarto
#RichardSutton

» #ReinforcementLearning
An Introduction
1998
standard reference...cited over 75,000
...
prominent example of #RL
#AlphaGo victory
over best human #Go players
2016 2017
....
recently has been the development of the chatbot #ChatGPT
...
large language model #LLM trained in two phases ...employs a technique called
reinforcement learning from human feedback #RLHF «

aka cheap labor unnamed in papers

https://awards.acm.org/about/2024-turing

2/2

#acmprize #2024acmprize #acmturingaward #andrewbarto #richardsutton #reinforcementlearning

Teixi @[email protected] · 2025-03-09 · 00:58 UTC

#ACMPrize
#2024ACMPrize
#ACMTuringAward

#AndrewBarto
#RichardSutton

» #ReinforcementLearning
An Introduction
1998
standard reference...cited over 75,000
...
prominent example of #RL
#AlphaGo victory
over best human #Go players
2016 2017
....
recently has been the development of the chatbot #ChatGPT
...
large language model #LLM trained in two phases ...employs a technique called
reinforcement learning from human feedback #RLHF «

aka cheap labor unnamed in papers

https://awards.acm.org/about/2024-turing

2/2

#rlhf #llm #chatgpt #go #alphago #rl

Teixi @[email protected] · 2025-03-09 · 00:58 UTC

#ACMPrize
#2024ACMPrize
#ACMTuringAward

#AndrewBarto
#RichardSutton

» #ReinforcementLearning
An Introduction
1998
standard reference...cited over 75,000
...
prominent example of #RL
#AlphaGo victory
over best human #Go players
2016 2017
....
recently has been the development of the chatbot #ChatGPT
...
large language model #LLM trained in two phases ...employs a technique called
reinforcement learning from human feedback #RLHF «