
Vaneet Aggarwal
policy gradient
transfer learning
classification
parallel reinforcement learning
distributed reinforcement learning
regret analysis
robotic surgery
teleoperation
surgemes
multi-objective reinforcement learning
ml: online learning & bandits
ml: optimization
ml: learning theory
cmdp
zero violation
9
presentations
89
number of views
Presentations

Align-Pro: A Principled Approach to Prompt Optimization for LLM Alignment
Prashant Trivedi and 5 other authors

Combinatorial Stochastic-Greedy Bandit
Fares Fourati and 3 other authors

Regret Analysis of Policy Gradient Algorithm for Infinite Horizon Average Reward Markov Decision Processes
Qinbo Bai and 2 other authors

Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Conservative Natural Policy Gradient Primal-Dual Algorithm
Vaneet Aggarwal and 2 other authors

Multi-Objective Reinforcement Learning with Non-Linear Scalarization
Mridul Agarwal and 2 other authors

Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach
Qinbo Bai and 4 other authors

Dexterous Skill Transfer between Surgical Procedures for Teleoperated Robotic Surgery
Glebys Gonzalez and 7 other authors

Communication Efficient Parallel Reinforcement Learning
Mridul Agarwal and 2 other authors

DART: Adaptive Accept Reject Algorithm for Non-Linear Combinatorial Bandits
Mridul Agarwal and 3 other authors