Singapore

Scaling long-context and agentic LLMs is increasingly
limited by memory capacity and bandwidth rather than FLOPs.
I propose an algorithmic framework for context engineering
that models placement, compression, and scheduling as
coupled optimization problems with explicit
accuracy-efficiency trade-offs. Concretely, I will develop
(1) salience-aware retention/eviction policies with
provable approximation guarantees relative to an ideal
oracle; (2) tier-dependent compression schemes that bound
error propagation across memory levels; and (3)
probabilistic prefetch/scheduling that controls tail
latency. I will evaluate on long-context language modeling
and reasoning benchmarks, isolating each component via
ablations and comparing against heuristic baselines under
controlled bandwidth/capacity regimes. Results target
improved throughput and energy metrics at near-baseline
quality, advancing principled, hardware-aware inference
without requiring custom hardware.

AAAI 2026

Algorithms for Context Engineering in LLM Inference: Optimization of Placement, Compression, and Scheduling

contex engineering

optimization

poster

We are pleased to announce the Fortieth AAAI Conference on Artificial Intelligence (AAAI-26), which will be held in Singapore EXPO from January 20 to January 27, 2026.

The purpose of the AAAI conference series is to promote research in Artificial Intelligence (AI) and foster scientific exchange between researchers, practitioners, scientists, students, and engineers across the entirety of AI and its affiliated disciplines. AAAI-26 will feature technical paper presentations, special tracks, invited speakers, workshops, tutorials, poster sessions, senior member presentations, competitions, and exhibit programs, and a range of other activities to be announced.<br><br>

To access this event page, you need to log in with the **email address you registered with**. <br>Access credentials will be sent to your email from Underline -  subject line "Welcome to AAAI 2026". Please be sure to check your spam email folder if you do not see an email confirmation right away.

Please log in

To access this event page, you are required to register.
Please complete your registration to continue.

We recommend reading [**the registration information**](https://aaai.org/conference/aaai/aaai-26/registration/) first.

**Online Registration Form**: https://aaai.getregistered.net/conference-2026 

Registration Required

We are pleased to announce the Fortieth AAAI Conference on Artificial Intelligence (AAAI-26), which will be held in Singapore EXPO from January 20 to January 27, 2026.

This paper proposes an AI-driven framework for real-time
acoustic modelling that enhances audio perception in
dynamic environments. The system combines feedback
microphones, deep learning models, and adaptive acoustic
panels to monitor and optimize room acoustics continuously.
Convolutional and recurrent neural networks estimate
reverberation and clarity metrics, while a reinforcement
learning controller adjusts panel states for optimal
intelligibility. Unlike static treatments, this closed-loop
approach adapts to changing occupancy, noise, and source
locations. The expected outcome is a robust, intelligent
acoustic system with significant applications in education,
healthcare, and immersive audio experiences.

AI-Driven Real-Time Acoustic Modelling for Better Audio Perception in Dynamic Environments

Direct Preference Optimization (DPO) typically relies on a
fixed inverse-temperature β that controls divergence from a
reference model. Fixed β is brittle: too small causes
underregularization
(verbosity, safety drift); too large causes
overregularization
(underfitting). I propose an adaptive per-token
KL controller using EMA smoothing, deadband filtering, and
clipping to dynamically adjust β throughout training.
Initial
results on a 7B model show 72% win rate vs. base SFT and
60% vs. fixed-β DPO. The goal is a practical recipe for
stable,
compute-efficient DPO with reduced manual tuning.

Adaptive KL Control for Direct Preference Optimization in Instruction-Following LLMs

Generative AI shows strong capabilities in language,
reasoning, and code but remains prone to
hallucinations—outputs that are fluent yet incorrect. In
cybersecurity, such errors pose serious risks, from
misleading analysts to potential adversarial exploitation.
This project investigates hallucinations in three
directions: (1) creating benchmarks and interpretability
tools to characterize them in security contexts; (2)
developing mitigation strategies such as
retrieval-augmented generation, symbolic-neural hybrids,
and uncertainty-aware decoding; and (3) integrating these
methods into real-world workflows like vulnerability
assessment, malware analysis, and penetration testing,
while exploring how attackers might exploit hallucinations.
Evaluation will combine accuracy metrics, human-in-the-loop
studies, and red-team simulations. By bridging theory and
applied system design, the work aims to advance
understanding of hallucinations and improve the reliability
of AI in cybersecurity, with broader implications for other
high-stakes areas such as healthcare and law.

Hallucinations at the Firewall

Large language models (LLMs) have rapidly advanced, but
their growing compute and memory demands make them
unsustainable and limit accessibility, especially in
under-resourced regions such as Southeast Asia (SEA). While
recent hybrid architectures combining attention and
state-space models (SSMs) have shown promise, most adopt
sequentially interleaving attention and Mamba layers,
leaving parallel-head mixing of attention and Mamba heads
largely unexplored. Hence, I propose investigating
Hymba-style parallel-head hybrid architecture as a
foundation for efficient, multilingual LLMs in SEA. My
short-term goal is to perform continual pre-training (CPT)
of the released Hymba-1.5B weights on SEA corpora to
evaluate their adaptability across diverse languages. In
the longer term, I plan to study scaling strategies beyond
1.5B parameters, assessing whether parallel-head hybrids
maintain efficiency and performance at larger scales.
Evaluation will combine standard perplexity and benchmark
tasks with SEA-specific benchmarks, alongside profiling for
inference throughput and deployability on
resource-constrained devices. The expected outcomes are
twofold: (1) demonstrating that parallel-head hybrids can
be effectively adapted to SEA multilingual contexts, and
(2) providing evidence that this underexplored architecture
scales efficiently. Success would broaden the design space
of efficient LLMs while advancing equitable access to AI by
enabling practical, low-cost, and locally relevant models
for SEA communities.

Adapting Hybrid Parallel-Head Large Language Models for Southeast Asia

Modern generators often violate basic physics—e.g.,
inconsistent shadows, geometry, and measurement models
limiting trust for video synthesis and computational
imaging. We propose finite-time Schrödinger-Bridge (SB)
world models that cast generation as entropy-regularized
optimal transport from a simple prior to a physics- and
data-consistent distribution. Unlike post-hoc corrections,
our approach injects structure along the transport path:
multi-view geometry (reprojection/epipolar constraints,
homographies, depth-aware warps) for video, and
differentiable optical operators (PSF-based defocus,
lightweight Fourier propagation for coherent/partially
coherent settings) for imaging. With known poses, we
penalize reprojection and warp-aligned photometric/feature
errors; with unknown poses, a compact head estimates
motion/flow with cycle-consistency. Compact UNet/ViT
backbones and short SB horizons target efficiency.
Evaluation spans 3D consistency metrics, physics fidelity
via forward-simulation error, and generative
quality/efficiency (FID/KID, FVD) against strong diffusion
baselines, plug-and-play data-consistency, and
unconstrained SB. By constraining the path rather than only
the endpoint, the method aims to shorten sampling while
improving cross-view coherence and physical plausibility
across sensors (cameras, microscopes, medical scanners).

Physics Consistent World Models via Schrödinger-Bridge Optimal Transport for Computational Imaging and 3D-Consistent Video Generations

I present a compact, testable architecture that endows learning agents with continuous proto-emotional dynamics and interpretable modulators (Persona, Ego, Shadow, Self). The design grounds these modulators in a computational interpretation of Jung’s Map of the Soul, mapping each archetype to a differentiable control that modulates policy selection via a bounded, low-dimensional affect vector. I describe concrete modular implementations, a staged experimental program (toy domains → multi-agent/social tasks → nonstationary transfer), baselines, ablations, and reproducible evaluation metrics.

Persona, Ego, Shadow, and Self: A Map of the Soul Framework for Proto-Emotional Homeostasis in AI

Sexual trauma leaves wounds that science cannot see, yet
survivors live with them every day. Traditional tools rely
on words or self-reports, often forcing survivors to “bleed
in silence” when their pain is doubted or dismissed.
Trauma, however, is not one-dimensional. It disrupts
multiple brain networks and produces states fear,
vigilance, detachment that cannot be captured by words
alone. This creates the need for approaches that reveal
trauma’s complexity in ways that are both objective and
interpretable.
We propose a framework that combines fMRI, EEG, and
interpretable self-supervised AI (DINO) to uncover hidden
patterns of trauma in the brain. Instead of producing
abstract scans or opaque predictions, the system will
generate exploratory measures of trauma response that
support therapists’ understanding while guiding future
research. These measures will be presented through a simple
dashboard that summarizes three indices (TPI, DI, RBS)
alongside heatmaps and plain language notes. By turning
complex data into clear, anonymized session snapshots, the
dashboard provides researchers with a practical output that
can be compared across participants and refined in future
work

Understanding the Management of Rape Trauma with AI and Neuroimaging

The widespread adoption of artificial intelligence(AI) in
cybersecurity has led to the emerging of intelligent
threats, such as Advanced Persistent Threats (APTs),
challenging the conventional deception defense mechanisms.
My work aims to fill this critical gap by developing a game
theoretic defense agent capable of confronting these
intelligent threats. In this proposal, we formalize the
attacker-defender interactions as a Bayesian game model
between AI agents so as to derive equilibrium defense
strategies. Simulation based experiments and real-world
implementations would be conducted to evaluate the proposed
framework. This study is potential to revolutionize cyber
defense methodologies by shifting from bilateral
decision-making to game theoretic strategy evolution.

When AI Meets AI: A Game-Theoretic Defense Framework Against AI Empowered Cyber Threats

RNA 3D structure prediction remains a fundamental challenge
due to limited experimental data, conformational
heterogeneity, and complex folding landscapes. Inspired by
breakthroughs in protein modeling, a data-efficient deep
learning framework that integrates RNA language embeddings,
geometric constraints, and physics-informed priors is
proposed. This approach leverages self-supervised
pretraining, contrastive learning, and SE(3)-equivariant
architectures to capture higher-order structural
relationships from scarce data. Predicted structures are
evaluated using benchmark datasets, thermodynamic
plausibility, and docking-based functional assessments,
ensuring both structural accuracy and biophysical
relevance. By advancing RNA structure prediction and
design, this work aims to accelerate the development of
RNA-based therapeutics, catalytic ribozymes, and precision
medicine applications, while providing an open-source
framework for the broader scientific community.

Towards Data-Efficient Deep Learning for RNA 3D Structure Prediction and Design

This research proposes an extension to the Program Lattice
Transformer (PLT) , a neuro-symbolic framework for program
induction that embeds programs into a structured latent
space. The current PLT model, which uses a flat lattice, is
computationally inefficient when modeling invariant
programs—operations that return to an initial state after a
set number of applications (e.g., a 360° rotation). To
address this, we propose embedding the program space onto a
cylindrical manifold instead of a plane. This approach is
grounded in the principle that only isometric
transformations preserve the lattice's compositional
structure, limiting valid manifolds to developable surfaces
like cylinders . A cylindrical geometry naturally
represents invariant programs as closed loops, enhancing
efficiency. The proposed method will be evaluated on
synthetic tasks like Rubik's Cube and the Abstraction and
Reasoning Corpus (ARC) to demonstrate improved performance
and efficiency. This work serves as a step toward models
that can autonomously configure their own geometric latent
spaces, connecting to future research in geometric deep
learning and meta-learning.

Downloads

Next from AAAI 2026

AI-Driven Real-Time Acoustic Modelling for Better Audio Perception in Dynamic Environments

Stay up to date with the latest Underline news!

PRESENTATIONS

CONFERENCES

COMPANY

RESOURCES

.css-70qvj9{display:-webkit-box;display:-webkit-flex;display:-ms-flexbox;display:flex;-webkit-align-items:center;-webkit-box-align:center;-ms-flex-align:center;align-items:center;}Downloads

Next from AAAI 2026

AI-Driven Real-Time Acoustic Modelling for Better Audio Perception in Dynamic Environments

Stay up to date with the latest Underline news!

PRESENTATIONS

CONFERENCES

COMPANY

RESOURCES

Downloads