technical paper

AAAI 2024

February 25, 2024

Vancouver , Canada

Unveiling the Significance of Toddler-Inspired Reward Transition in Goal-Oriented Reinforcement Learning | VIDEO


ml: bio-inspired learning

concepts and methods


hai: understanding people

cms: simulating human behavior

Toddlers evolve from free exploration with sparse feedback to exploiting prior experiences for goal-directed learning with denser rewards. Drawing inspiration from this Toddler-Inspired Reward Transition, we set out to explore the implications of varying reward transitions when incorporated into Reinforcement Learning (RL) tasks. Central to our inquiry is the transition from sparse to potential-based dense rewards, which share optimal strategies regardless of reward changes. Through various experiments, including those in egocentric navigation and robotic arm manipulation tasks, we found that proper reward transitions significantly influence sample efficiency and success rates. Of particular note is the efficacy of the toddler-inspired Sparse-to-Dense (S2D) transition. Beyond these performance metrics, using Cross-Density Visualizer technique, we observed that transitions, especially the S2D, smooth the policy loss landscape, promoting wide minima that enhance generalization in RL models.


PaperTranscript English (automatic)

Next from AAAI 2024

PM-INR: Prior-Rich Multi-Modal Implicit Large-Scale Scene Neural Representation
technical paper

PM-INR: Prior-Rich Multi-Modal Implicit Large-Scale Scene Neural Representation

AAAI 2024

+4Tao Chen
Yiying Yang and 6 other authors

25 February 2024

Stay up to date with the latest Underline news!

Select topic of interest (you can select more than one)


  • All Lectures
  • For Librarians
  • Resource Center
  • Free Trial
Underline Science, Inc.
1216 Broadway, 2nd Floor, New York, NY 10001, USA

© 2023 Underline - All rights reserved