Reinforcement Learning Journal, Volume 2

Volume 2, 2024

- Qingfeng Lan, A. Rupam Mahmood, Shuicheng Yan, Zhongwen Xu:
  Learning to Optimize for Reinforcement Learning. RLJ 2: 481-497 (2024)
  https://dblp.org/rec/conf/rlc/LanMYX24
- Mhairi Dunion, Stefano V. Albrecht:
  Multi-view Disentanglement for Reinforcement Learning with Multiple Cameras. RLJ 2: 498-515 (2024)
  https://dblp.org/rec/conf/rlc/DunionA24
- Trevor McInroe, Adam Jelley, Stefano V. Albrecht, Amos J. Storkey:
  Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning. RLJ 2: 516-546 (2024)
  https://dblp.org/rec/conf/rlc/McInroeJAS24
- Adriana Hugessen, Roger Creus Castanyer, Faisal Mohamed, Glen Berseth:
  Surprise-Adaptive Intrinsic Motivation for Unsupervised Reinforcement Learning. RLJ 2: 547-562 (2024)
  https://dblp.org/rec/conf/rlc/HugessenCMB24
- Alex Ayoub, David Szepesvari, Francesco Zanini, Bryan Chan, Dhawal Gupta, Bruno Castro da Silva, Dale Schuurmans:
  Mitigating the Curse of Horizon in Monte-Carlo Returns. RLJ 2: 563-572 (2024)
  https://dblp.org/rec/conf/rlc/AyoubSZCGSS24
- Yudong Luo, Yangchen Pan, Han Wang, Philip Torr, Pascal Poupart:
  A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization. RLJ 2: 573-592 (2024)
  https://dblp.org/rec/conf/rlc/LuoPW0P24
- Gersi Doko, Guang Yang, Daniel S. Brown, Marek Petrik:
  ROIL: Robust Offline Imitation Learning without Trajectories. RLJ 2: 593-605 (2024)
  https://dblp.org/rec/conf/rlc/DokoYBP24
- Edan Meyer, Adam White, Marlos C. Machado:
  Harnessing Discrete Representations for Continual Reinforcement Learning. RLJ 2: 606-628 (2024)
  https://dblp.org/rec/conf/rlc/Meyer0M24
- David Abel, Mark K. Ho, Anna Harutyunyan:
  Three Dogmas of Reinforcement Learning. RLJ 2: 629-644 (2024)
  https://dblp.org/rec/conf/rlc/AbelHH24
- Matteo Papini, Giorgio Manganini, Alberto Maria Metelli, Marcello Restelli:
  Policy Gradient with Active Importance Sampling. RLJ 2: 645-675 (2024)
  https://dblp.org/rec/conf/rlc/PapiniMMR24
- Riccardo Zamboni, Duilio Cirino, Marcello Restelli, Mirco Mutti:
  The Limits of Pure Exploration in POMDPs: When the Observation Entropy is Enough. RLJ 2: 676-692 (2024)
  https://dblp.org/rec/conf/rlc/ZamboniCRM24
- Zakariae El Asri, Olivier Sigaud, Nicolas Thome:
  Physics-Informed Model and Hybrid Planning for Efficient Dyna-Style Reinforcement Learning. RLJ 2: 693-713 (2024)
  https://dblp.org/rec/conf/rlc/AsriST24
- Ho Long Fung, Victor-Alexandru Darvariu, Stephen Hailes, Mirco Musolesi:
  Trust-based Consensus in Multi-Agent Reinforcement Learning Systems. RLJ 2: 714-732 (2024)
  https://dblp.org/rec/conf/rlc/FungDHM24
- Yu Luo, Fuchun Sun, Tianying Ji, Xianyuan Zhan:
  Bidirectional-Reachable Hierarchical Reinforcement Learning with Mutually Responsive Policies. RLJ 2: 733-762 (2024)
  https://dblp.org/rec/conf/rlc/Luo0JZ24
- Gaspard Lambrechts, Adrien Bolland, Damien Ernst:
  Informed POMDP: Leveraging Additional Information in Model-Based RL. RLJ 2: 763-784 (2024)
  https://dblp.org/rec/conf/rlc/LambrechtsBE24
- Sam Lobel, Ronald Parr:
  An Optimal Tightness Bound for the Simulation Lemma. RLJ 2: 785-797 (2024)
  https://dblp.org/rec/conf/rlc/LobelP24
- Milad Aghajohari, Tim Cooijmans, Juan Agustin Duque, Shunichi Akatsuka, Aaron C. Courville:
  Best Response Shaping. RLJ 2: 798-818 (2024)
  https://dblp.org/rec/conf/rlc/AghajohariCDAC24
- Gianluca Drappo, Alberto Maria Metelli, Marcello Restelli:
  A Provably Efficient Option-Based Algorithm for both High-Level and Low-Level Learning. RLJ 2: 819-839 (2024)
  https://dblp.org/rec/conf/rlc/DrappoMR24
- Khurram Javed, Arsalan Sharifnassab, Richard S. Sutton:
  SwiftTD: A Fast and Robust Algorithm for Temporal Difference Learning. RLJ 2: 840-863 (2024)
  https://dblp.org/rec/conf/rlc/JavedSS24
- Scott M. Jordan, Samuel Neumann, James E. Kostas, Adam White, Philip S. Thomas:
  The Cliff of Overcommitment with Policy Gradient Step Sizes. RLJ 2: 864-883 (2024)
  https://dblp.org/rec/conf/rlc/JordanNK0T24
- Alexander Levine, Peter Stone, Amy Zhang:
  Multistep Inverse Is Not All You Need. RLJ 2: 884-925 (2024)
  https://dblp.org/rec/conf/rlc/0001S024
- Emma Cramer, Bernd Frauenknecht, Ramil Sabirov, Sebastian Trimpe:
  Contextualized Hybrid Ensemble Q-learning: Learning Fast with Control Priors. RLJ 2: 926-945 (2024)
  https://dblp.org/rec/conf/rlc/CramerFST24
- Rohan Chitnis, Shentao Yang, Alborz Geramifard:
  Sequential Decision-Making for Inline Text Autocomplete. RLJ 2: 946-960 (2024)
  https://dblp.org/rec/conf/rlc/ChitnisYG24
- Georgy Antonov, Peter Dayan:
  Exploring Uncertainty in Distributional Reinforcement Learning. RLJ 2: 961-978 (2024)
  https://dblp.org/rec/conf/rlc/AntonovD24
- Marcel Hussing, Jorge A. Mendez, Anisha Singrodia, Cassandra Kent, Eric Eaton:
  Robotic Manipulation Datasets for Offline Compositional Reinforcement Learning. RLJ 2: 979-994 (2024)
  https://dblp.org/rec/conf/rlc/HussingMSKE24
- Marcel Hussing, Claas Voelcker, Igor Gilitschenski, Amir-massoud Farahmand, Eric Eaton:
  Dissecting Deep RL with High Update Ratios: Combatting Value Divergence. RLJ 2: 995-1018 (2024)
  https://dblp.org/rec/conf/rlc/HussingVGFE24
