Lecture image placeholder

Premium content

Access to this content requires a subscription. You must be a premium user to view this content.

Monthly subscription - $9.99Pay per view - $4.99Access through your institutionLogin with Underline account
Need help?
Contact us
Lecture placeholder background

AAAI 2026

July 20, 2026

Singapore, Singapore

Would you like to see your presentation here, made available to a global audience of researchers?
Add your own presentation or have us affordably record your next conference.

As LLM-based agents grow more autonomous and multi- modal, ensuring they remain controllable, auditable, and faithful to deployer intent becomes critical. Prior benchmarks measured propensity for misaligned behavior and showed that agent personalities and tool access significantly influ- ence misalignment. Building on those insights, we propose a Verifiability-First architecture that (1) integrates run-time attestations of agent actions (cryptographic & symbolic), (2) embeds lightweight Audit Agents that continuously verify in- tent vs. behavior using constrained reasoning, and (3) en- forces challenge–response attestation protocols for high-risk operations. We introduce OPERA (Observability, Provable Execution, Red-team, Attestation), a benchmark suite and evaluation protocol designed to measure (i) detectability of misalignment, (ii) time-to-detection under stealthy strategies, and (iii) resilience of verifiability mechanisms to adversar- ial prompt/persona injection. Our approach aims to shift the evaluation focus from ”how likely misalignment is” to ”how quickly and reliably misalignment can be detected and reme- diated.”

Next from AAAI 2026

Collective Recourse for Generative Urban Visualizations
workshop paper

Collective Recourse for Generative Urban Visualizations

AAAI 2026

20 July 2026

Stay up to date with the latest Underline news!

Select topic of interest (you can select more than one)

PRESENTATIONS

  • All Presentations
  • For Librarians
  • Resource Center
  • Free Trial
Underline Science, Inc.
1216 Broadway, 2nd Floor, New York, NY 10001, USA

© 2026 Underline - All rights reserved