Showing 1–2 of 2 results for author: Kharlapenko, D

  1. arXiv:2406.06309  [pdf, other]

    cs.LG cs.AI

    Is Value Functions Estimation with Classification Plug-and-play for Offline Reinforcement Learning?

    Authors: Denis Tarasov, Kirill Brilliantov, Dmitrii Kharlapenko

    Abstract: In deep Reinforcement Learning (RL), value functions are typically approximated using deep neural networks and trained via mean squared error regression objectives to fit the true value functions. Recent research has proposed an alternative approach, utilizing the cross-entropy classification objective, which has demonstrated improved performance and scalability of RL algorithms. However, existing…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: https://github.com/DT6A/ClORL

  2. arXiv:2405.20318  [pdf, other]

    cs.CL cs.AI cs.LG stat.ML

    CausalQuest: Collecting Natural Causal Questions for AI Agents

    Authors: Roberto Ceraolo, Dmitrii Kharlapenko, Amélie Reymond, Rada Mihalcea, Mrinmaya Sachan, Bernhard Schölkopf, Zhijing Jin

    Abstract: Humans have an innate drive to seek out causality. Whether fuelled by curiosity or specific goals, we constantly question why things happen, how they are interconnected, and many other related phenomena. To develop AI agents capable of addressing this natural human quest for causality, we urgently need a comprehensive dataset of natural causal questions. Unfortunately, existing datasets either con…

    Submitted 30 May, 2024; originally announced May 2024.