profile picture

Vaneet Aggarwal

policy gradient

transfer learning

classification

parallel reinforcement learning

distributed reinforcement learning

regret analysis

robotic surgery

teleoperation

surgemes

multi-objective reinforcement learning

ml: online learning & bandits

ml: optimization

ml: learning theory

cmdp

zero violation

9

presentations

89

number of views

Presentations

Align-Pro: A Principled Approach to Prompt Optimization for LLM Alignment

Prashant Trivedi and 5 other authors

Combinatorial Stochastic-Greedy Bandit

Fares Fourati and 3 other authors

Regret Analysis of Policy Gradient Algorithm for Infinite Horizon Average Reward Markov Decision Processes

Qinbo Bai and 2 other authors

Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Conservative Natural Policy Gradient Primal-Dual Algorithm

Vaneet Aggarwal and 2 other authors

Multi-Objective Reinforcement Learning with Non-Linear Scalarization

Mridul Agarwal and 2 other authors

Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach

Qinbo Bai and 4 other authors

Dexterous Skill Transfer between Surgical Procedures for Teleoperated Robotic Surgery

Glebys Gonzalez and 7 other authors

Communication Efficient Parallel Reinforcement Learning

Mridul Agarwal and 2 other authors

DART: Adaptive Accept Reject Algorithm for Non-Linear Combinatorial Bandits

Mridul Agarwal and 3 other authors

Stay up to date with the latest Underline news!

Select topic of interest (you can select more than one)

PRESENTATIONS

  • All Presentations
  • For Librarians
  • Resource Center
  • Free Trial
Underline Science, Inc.
1216 Broadway, 2nd Floor, New York, NY 10001, USA

© 2025 Underline - All rights reserved