home.social

#mdps — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #mdps, aggregated by home.social.

  1. Teaching ChatGPT-4o is a great way to learn.

    It's always nice to notice you know something ChatGPT doesn't know, as it typically means you know something most specialists in the field don't know:
    chatgpt.com/share/67ac8053-bcf

    #LLM #mathematics #MarkovChains #MDPs

  2. Teaching ChatGPT-4o is a great way to learn.

    It's always nice to notice you know something ChatGPT doesn't know, as it typically means you know something most specialists in the field don't know:
    chatgpt.com/share/67ac8053-bcf

    #LLM #mathematics #MarkovChains #MDPs

  3. Teaching ChatGPT-4o is a great way to learn.

    It's always nice to notice you know something ChatGPT doesn't know, as it typically means you know something most specialists in the field don't know:
    chatgpt.com/share/67ac8053-bcf

    #LLM #mathematics #MarkovChains #MDPs

  4. Teaching ChatGPT-4o is a great way to learn.

    It's always nice to notice you know something ChatGPT doesn't know, as it typically means you know something most specialists in the field don't know:
    chatgpt.com/share/67ac8053-bcf

    #LLM #mathematics #MarkovChains #MDPs

  5. Teaching ChatGPT-4o is a great way to learn.

    It's always nice to notice you know something ChatGPT doesn't know, as it typically means you know something most specialists in the field don't know:
    chatgpt.com/share/67ac8053-bcf

    #LLM #mathematics #MarkovChains #MDPs

  6. 'Model-Free Representation Learning and Exploration in Low-Rank MDPs', by Aditya Modi, Jinglin Chen, Akshay Krishnamurthy, Nan Jiang, Alekh Agarwal.

    jmlr.org/papers/v25/22-0687.ht

    #reinforcement #exploration #mdps

  7. 'Q-Learning for MDPs with General Spaces: Convergence and Near Optimality via Quantization under Weak Continuity', by Ali Kara, Naci Saldi, Serdar Yüksel.

    jmlr.org/papers/v24/21-1457.ht

    #quantization #quantized #mdps

  8. 'Provably Sample-Efficient Model-Free Algorithm for MDPs with Peak Constraints', by Qinbo Bai, Vaneet Aggarwal, Ather Gattami.

    jmlr.org/papers/v24/21-0117.ht

    #mdps #markov #pcmdp

  9. 'Provably Sample-Efficient Model-Free Algorithm for MDPs with Peak Constraints', by Qinbo Bai, Vaneet Aggarwal, Ather Gattami.

    jmlr.org/papers/v24/21-0117.ht

    #mdps #markov #pcmdp

  10. 'Provably Sample-Efficient Model-Free Algorithm for MDPs with Peak Constraints', by Qinbo Bai, Vaneet Aggarwal, Ather Gattami.

    jmlr.org/papers/v24/21-0117.ht

    #mdps #markov #pcmdp

  11. 'Provably Sample-Efficient Model-Free Algorithm for MDPs with Peak Constraints', by Qinbo Bai, Vaneet Aggarwal, Ather Gattami.

    jmlr.org/papers/v24/21-0117.ht

    #mdps #markov #pcmdp

  12. 'Provably Sample-Efficient Model-Free Algorithm for MDPs with Peak Constraints', by Qinbo Bai, Vaneet Aggarwal, Ather Gattami.

    jmlr.org/papers/v24/21-0117.ht

    #mdps #markov #pcmdp