#deepreinforcementlearning — Public Fediverse posts
Live and recent posts from across the Fediverse tagged #deepreinforcementlearning, aggregated by home.social.
-
Autonomy Talks - Georgia Chalvatzaki: Shaping #Robotic Assistance through Structured #Robot #Learning: https://www.youtube.com/watch?v=e0aQC3C8P7w #robotics #machinelearning
Around 12:30 they present the training of a model-free #MDP #deepreinforcementlearning using a model-based #ai #planner #aiplanner. Indeed it drastically boosts the training.
The general idea is to guide an implicit model using a model-based approximation, and it works also for assembly tasks, computer vision, pick and place…
-
📬 KI-Roboter: Meister des Labyrinth-Geschicklichkeitsspiels
#KünstlicheIntelligenz #Cheat #Cheater #Cheats #CyberRunner #DeepReinforcementLearning #github #KIRoboter #Labyrinth #ProfDrRaffaelloD’Andrea #Robotik https://tarnkappe.info/artikel/kuenstliche-intelligenz/ki-roboter-meister-des-labyrinth-geschicklichkeitsspiels-285483.html -
AI helps 3D printers “write” with coiling fluid ropes like Jackson Pollock - Enlarge / Jackson Pollock working in his Long Island studio adjacent to... - https://arstechnica.com/?p=1980704 #deepreinforcementlearning #machinelearning #jacksonpollock #fluiddynamics #scienceandart #3dprinting #science #physics
-
El lado del mal - AlphaDev: La IA que optimiza la implementación de los algoritmos mejor que los humanos https://www.elladodelmal.com/2023/06/alphadev-la-ia-que-optimiza-la.html #AlphaDev #IA #AI #Programación #Algorítmica #Algorithms #InteligenciaArtificial #DeepReinforcementLearning #Intel #ensamblador #DeepLearning #LLM
-
Faster sorting #algorithms discovered using #DeepReinforcementLearning | #Nature
"Here we show how #ArtificialIntelligence can go beyond the current state of the art by discovering hitherto unknown routines. To realize this, we formulated the task of finding a better sorting routine as a single-player game. We then trained a new deep #ReinforcementLearning agent, #AlphaDev, to play this game. AlphaDev discovered small sorting algorithms from scratch that outperformed previously known human benchmarks. These algorithms have been integrated into the #LLVM standard C++ sort library3. This change to this part of the sort library represents the replacement of a component with an algorithm that has been automatically discovered using reinforcement learning."