home.social

#deepreinforcementlearning — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #deepreinforcementlearning, aggregated by home.social.

  1. IA2 uses Deep Reinforcement Learning to slash database runtimes by 40%, while new Hyperbolic SVM techniques utilize semidefinite relaxation. hackernoon.com/ai-driven-datab #deepreinforcementlearning

  2. IA2 revolutionizes index selection with rapid training, reducing SQL runtime by 61% via adaptive action pruning and workload modeling. hackernoon.com/adaptive-action #deepreinforcementlearning

  3. IA2 uses a two-phase framework to generate states and action pools from workloads, enabling RL agents to make sequential index selection decisions. hackernoon.com/unseen-workload #deepreinforcementlearning

  4. The TD3-TD-SWAR model advances database optimization by framing index selection as a DRL problem with adaptive action masking for faster training. hackernoon.com/adaptive-action #deepreinforcementlearning

  5. This research validates a weekly re-trained DRL agent, showing it outperforms static models & Black-Scholes for practical American option hedging. hackernoon.com/validating-hype #deepreinforcementlearning

  6. This methodology details how to train and test DRL agents for American option hedging, introducing a novel weekly re-training strategy using Chebyshev pricing. hackernoon.com/dont-just-train #deepreinforcementlearning

  7. Faster sorting #algorithms discovered using #DeepReinforcementLearning | #Nature

    "Here we show how #ArtificialIntelligence can go beyond the current state of the art by discovering hitherto unknown routines. To realize this, we formulated the task of finding a better sorting routine as a single-player game. We then trained a new deep #ReinforcementLearning agent, #AlphaDev, to play this game. AlphaDev discovered small sorting algorithms from scratch that outperformed previously known human benchmarks. These algorithms have been integrated into the #LLVM standard C++ sort library. This change to this part of the sort library represents the replacement of a component with an algorithm that has been automatically discovered using reinforcement learning."

    nature.com/articles/s41586-023

  8. Autonomy Talks - Georgia Chalvatzaki: Shaping #Robotic Assistance through Structured #Robot #Learning: youtube.com/watch?v=e0aQC3C8P7 #robotics #machinelearning

    Around 12:30 they present how a model-free #MDP #deepreinforcementlearning agent is trained with the help of a model-based #ai #planner #aiplanner. This drastically boosts the training.

    The general idea is to guide an implicit model using a model-based approximation, and it also works for assembly tasks, computer vision, pick and place…

  9. AI helps 3D printers “write” with coiling fluid ropes like Jackson Pollock - Jackson Pollock working in his Long Island studio adjacent to... - arstechnica.com/?p=1980704 #deepreinforcementlearning #machinelearning #jacksonpollock #fluiddynamics #scienceandart #3dprinting #science #physics

  10. This review of DRL hedging literature highlights the need for hyperparameter analysis, especially for real-world American option applications. hackernoon.com/avoiding-the-pi #deepreinforcementlearning

  11. This paper makes Deep Reinforcement Learning practical for hedging American options by optimizing hyperparameters and using a weekly re-training strategy. hackernoon.com/how-weekly-ai-t #deepreinforcementlearning

  12. Will the next generation of #LLM come from #DeepMind?
    wired.com/story/google-deepmin
    They may have a shot at it given their expertise in #DeepReinforcementLearning. If their #AI can plan tasks with solid logical grounds, can't they also produce solid explanations?

  13. #introduction
    Here to talk #neuroscience, mostly about human #decisionmaking and #learning in #ReinforcementLearning environments.
    I'm doing my #PhD at LMU #Munich, working with #EyeTracking and #EEG data, using #Python and #MATLAB, and am interested in #DeepReinforcementLearning, #senseofagency and #socialagency!
    On the side, I play #chess and hike #Alpine!
    Hello #Fediverse :)

  14. Moving through crowds is a challenge for robots. But a robot can analyze people's behavior to get through without collisions.
    Navigating crowded spaces: robots use people as "sensors"
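
Several posts above describe index selection as a sequence of decisions in which some actions are masked or pruned at each step. The sketch below illustrates only that general idea and is not the IA2 or TD3-TD-SWAR method from the linked articles. The candidate names, the scores, and the "one index per table" rule are all hypothetical, and the fixed scores stand in for the value estimates a trained agent would produce.

```python
def greedy_masked_selection(scores, is_valid, budget):
    """Pick up to `budget` actions, one per step, skipping masked actions."""
    chosen = []
    for _ in range(budget):
        # Mask: keep only actions not yet chosen that pass the validity rule.
        allowed = [a for a in scores if a not in chosen and is_valid(a, chosen)]
        if not allowed:
            break
        # Greedy step: take the highest-scoring action that survived the mask.
        chosen.append(max(allowed, key=lambda a: scores[a]))
    return chosen


# Hypothetical candidate indexes with made-up scores (stand-ins for agent outputs).
scores = {
    "orders(customer_id)": 0.9,
    "orders(customer_id, date)": 0.8,
    "items(sku)": 0.5,
}


def is_valid(action, chosen):
    # Hypothetical masking rule: allow at most one index per table.
    table = action.split("(")[0]
    return all(c.split("(")[0] != table for c in chosen)


selected = greedy_masked_selection(scores, is_valid, budget=2)
print(selected)
```

In this toy run the second-best candidate is masked out because an index on the same table was already chosen, so the selection falls through to the next table. A learning agent would presumably recompute its scores from the current state at each step instead of reading them from a fixed dictionary.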