
SHORT BIO
I am a PhD student at Imperial College London with the Safe and Trusted AI CDT. I am interested in long-term AI safety. My PhD currently focuses on scenarios involving multi-agent interactions between humans and artificial agents. Technically, my interests lie at the intersection of reinforcement and reward learning, game theory, and symbolic approaches to AI.
Presentations

On Agent Incentives to Manipulate Human Feedback in Multi-Agent Reward Learning Scenarios
Francis Rhys Ward and 2 other authors