Singapore

Automated classification of complex social survey questionnaires is crucial for large-scale social science research but faces significant reliability challenges due to intricate hierarchical label structures, severe class imbalance, semantic ambiguity, and incomplete data coverage. Conventional classification methods often struggle with these combined complexities, yielding results that lack trustworthiness. We introduce HOCM, a framework designed for trustworthy classification in complex, real-world taxonomies. It features two synergistic components: (1) memory-enhanced contrastive learning, tailored to learn robust representations from noisy, imbalanced data by leveraging quality-aware category memory banks; and (2) hierarchical uncertainty calibration, which enforces taxonomic consistency while providing reliable confidence estimates and identifying inputs falling outside well-represented known categories. Our evaluation on a large-scale, real-world social survey dataset—a challenging exemplar of our target problem class—demonstrates that HOCM maintains strong accuracy on known classes while effectively identifying uncertain cases, significantly boosting accuracy on confident predictions. Furthermore, it adeptly detects low-resource/unknown categories. HOCM provides a more reliable automated classification tool, enabling efficient expert review and enhancing the trustworthiness of analysis in domains with complex, hierarchical data.

AAAI 2026

Trustworthy Classification for Complex Social Surveys: A Memory-Enhanced Hierarchical Framework with Calibrated Uncertainty

nlp: text classification & sentiment analysis

app: humanities & computational social science

ml: calibration & uncertainty quantification

poster

We are pleased to announce the Fortieth AAAI Conference on Artificial Intelligence (AAAI-26), which will be held at Singapore EXPO from January 20 to January 27, 2026.

The purpose of the AAAI conference series is to promote research in Artificial Intelligence (AI) and foster scientific exchange between researchers, practitioners, scientists, students, and engineers across the entirety of AI and its affiliated disciplines. AAAI-26 will feature technical paper presentations, special tracks, invited speakers, workshops, tutorials, poster sessions, senior member presentations, competitions, exhibit programs, and a range of other activities to be announced.



Photovoltaic (PV) power forecasting is critical for the operation of solar power plants and the coordination of energy within power grids. This work aims to predict future PV power time series by leveraging multimodal data. While recent studies have incorporated numerical modalities such as satellite image sequences and numerical weather prediction (NWP) time series, they often overlook textual modalities, such as the spatio-temporal context of PV plants, and the potential of pretrained large language models (LLMs). In this paper, we build upon existing numerical inputs and further explore the use of spatio-temporal text prompts, generated from plant coordinates and forecast start time, to enhance the forecasting process. We propose PV-LLM, a satellite-text-prompted framework that integrates a pretrained LLM to improve PV power forecasting. The framework consists of three key components: Text Prompt Construction, Modality-Specific Encoding, and Adaptive Prompt Tuning. First, the Text Prompt Construction module generates spatio-temporal prompts that offer high-level semantic guidance. Next, the Modality-Specific Encoding module encodes each modality according to its unique characteristics, capturing modality-specific patterns while managing varying context lengths. Finally, the Adaptive Prompt Tuning module fine-tunes the LLM to integrate multimodal embeddings, while an adaptive gating mechanism retains its pretrained knowledge. We validate the effectiveness of our proposed framework on a real-world dataset containing multiple PV plants. Experimental results demonstrate that our approach outperforms existing state-of-the-art methods.

Satellite-Text-Prompted Large Language Model for Photovoltaic Power Forecasting

Scaling long-context and agentic LLMs is increasingly
limited by memory capacity and bandwidth rather than FLOPs.
I propose an algorithmic framework for context engineering
that models placement, compression, and scheduling as
coupled optimization problems with explicit
accuracy-efficiency trade-offs. Concretely, I will develop
(1) salience-aware retention/eviction policies with
provable approximation guarantees relative to an ideal
oracle; (2) tier-dependent compression schemes that bound
error propagation across memory levels; and (3)
probabilistic prefetch/scheduling that controls tail
latency. I will evaluate on long-context language modeling
and reasoning benchmarks, isolating each component via
ablations and comparing against heuristic baselines under
controlled bandwidth/capacity regimes. Results target
improved throughput and energy metrics at near-baseline
quality, advancing principled, hardware-aware inference
without requiring custom hardware.
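As a rough illustration of what a salience-aware retention/eviction policy could look like, the sketch below keeps the context blocks with the highest salience per unit of memory until a capacity budget is exhausted. This is a plain greedy heuristic written for illustration, not the proposed algorithm: the `Block` type, the `salience` scores, and the function name `evict_to_fit` are all hypothetical, and no approximation guarantee is claimed for it.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Block:
    """A cached context block (hypothetical structure for illustration)."""
    key: str
    size: int        # memory footprint in arbitrary units, must be > 0
    salience: float  # assumed importance score supplied by some estimator


def evict_to_fit(blocks: List[Block], capacity: int) -> Tuple[List[Block], List[Block]]:
    """Greedily retain blocks by salience density until capacity is used up.

    Returns (kept, evicted). Blocks that do not fit in the remaining
    budget are evicted, even if a later, smaller block still fits.
    """
    ranked = sorted(blocks, key=lambda b: b.salience / b.size, reverse=True)
    kept: List[Block] = []
    evicted: List[Block] = []
    used = 0
    for block in ranked:
        if used + block.size <= capacity:
            kept.append(block)
            used += block.size
        else:
            evicted.append(block)
    return kept, evicted


if __name__ == "__main__":
    cache = [Block("a", 4, 8.0), Block("b", 4, 2.0), Block("c", 2, 3.0)]
    kept, evicted = evict_to_fit(cache, capacity=6)
    print([b.key for b in kept], [b.key for b in evicted])
```

A density-based ranking is only one possible choice; a policy with formal guarantees would need to specify the objective and the oracle it is compared against.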

Algorithms for Context Engineering in LLM Inference: Optimization of Placement, Compression, and Scheduling

This paper proposes an AI-driven framework for real-time
acoustic modelling that enhances audio perception in
dynamic environments. The system combines feedback
microphones, deep learning models, and adaptive acoustic
panels to monitor and optimize room acoustics continuously.
Convolutional and recurrent neural networks estimate
reverberation and clarity metrics, while a reinforcement
learning controller adjusts panel states for optimal
intelligibility. Unlike static treatments, this closed-loop
approach adapts to changing occupancy, noise, and source
locations. The expected outcome is a robust, intelligent
acoustic system with significant applications in education,
healthcare, and immersive audio experiences.

AI-Driven Real-Time Acoustic Modelling for Better Audio Perception in Dynamic Environments

Direct Preference Optimization (DPO) typically relies on a
fixed inverse-temperature β that controls divergence from a
reference model. Fixed β is brittle: too small causes
underregularization (verbosity, safety drift); too large
causes overregularization (underfitting). I propose an
adaptive per-token KL controller using EMA smoothing,
deadband filtering, and clipping to dynamically adjust β
throughout training. Initial results on a 7B model show a
72% win rate vs. base SFT and 60% vs. fixed-β DPO. The goal
is a practical recipe for stable, compute-efficient DPO
with reduced manual tuning.
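The three ingredients named above (EMA smoothing, a deadband, and clipping) can be combined into a small controller. The sketch below is one plausible arrangement, not the author's implementation: the class name, the multiplicative update rule, and every default value (`target_kl`, `gain`, `beta_min`, `beta_max`, and so on) are assumptions chosen for illustration.

```python
import math
from dataclasses import dataclass
from typing import Optional


@dataclass
class AdaptiveBetaController:
    """Hypothetical β controller: EMA smoothing + deadband + clipping."""
    target_kl: float = 0.05   # assumed desired per-token KL to the reference
    beta: float = 0.1         # current inverse temperature
    ema_decay: float = 0.9    # smoothing factor for the KL estimate
    deadband: float = 0.01    # ignore errors smaller than this
    gain: float = 0.5         # step size of the multiplicative update
    beta_min: float = 0.01    # clipping bounds for β
    beta_max: float = 1.0
    ema_kl: Optional[float] = None

    def update(self, observed_kl: float) -> float:
        """Feed one per-token KL measurement and return the new β."""
        # 1) EMA smoothing of the noisy KL signal.
        if self.ema_kl is None:
            self.ema_kl = observed_kl
        else:
            self.ema_kl = (self.ema_decay * self.ema_kl
                           + (1.0 - self.ema_decay) * observed_kl)
        error = self.ema_kl - self.target_kl
        # 2) Deadband: leave β untouched for small deviations.
        if abs(error) > self.deadband:
            # KL above target -> raise β (regularize more); below -> lower β.
            self.beta *= math.exp(self.gain * error / self.target_kl)
            # 3) Clipping keeps β inside a safe range.
            self.beta = min(self.beta_max, max(self.beta_min, self.beta))
        return self.beta


if __name__ == "__main__":
    controller = AdaptiveBetaController()
    for kl in [0.05, 0.08, 0.12, 0.20, 0.06]:
        print(round(controller.update(kl), 4))
```

In a training loop, `update` would be called with the measured KL after each step and the returned value used as β in the next DPO loss computation; the deadband keeps β from oscillating when the KL is already close to the target.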

Adaptive KL Control for Direct Preference Optimization in Instruction-Following LLMs

Generative AI shows strong capabilities in language,
reasoning, and code but remains prone to
hallucinations—outputs that are fluent yet incorrect. In
cybersecurity, such errors pose serious risks, from
misleading analysts to potential adversarial exploitation.
This project investigates hallucinations in three
directions: (1) creating benchmarks and interpretability
tools to characterize them in security contexts; (2)
developing mitigation strategies such as
retrieval-augmented generation, symbolic-neural hybrids,
and uncertainty-aware decoding; and (3) integrating these
methods into real-world workflows like vulnerability
assessment, malware analysis, and penetration testing,
while exploring how attackers might exploit hallucinations.
Evaluation will combine accuracy metrics, human-in-the-loop
studies, and red-team simulations. By bridging theory and
applied system design, the work aims to advance
understanding of hallucinations and improve the reliability
of AI in cybersecurity, with broader implications for other
high-stakes areas such as healthcare and law.

Hallucinations at the Firewall

Large language models (LLMs) have rapidly advanced, but
their growing compute and memory demands make them
unsustainable and limit accessibility, especially in
under-resourced regions such as Southeast Asia (SEA). While
recent hybrid architectures combining attention and
state-space models (SSMs) have shown promise, most
interleave attention and Mamba layers sequentially,
leaving parallel-head mixing of attention and Mamba heads
largely unexplored. Hence, I propose investigating a
Hymba-style parallel-head hybrid architecture as a
foundation for efficient, multilingual LLMs in SEA. My
short-term goal is to perform continual pre-training (CPT)
of the released Hymba-1.5B weights on SEA corpora to
evaluate their adaptability across diverse languages. In
the longer term, I plan to study scaling strategies beyond
1.5B parameters, assessing whether parallel-head hybrids
maintain efficiency and performance at larger scales.
Evaluation will combine standard perplexity and benchmark
tasks with SEA-specific benchmarks, alongside profiling for
inference throughput and deployability on
resource-constrained devices. The expected outcomes are
twofold: (1) demonstrating that parallel-head hybrids can
be effectively adapted to SEA multilingual contexts, and
(2) providing evidence that this underexplored architecture
scales efficiently. Success would broaden the design space
of efficient LLMs while advancing equitable access to AI by
enabling practical, low-cost, and locally relevant models
for SEA communities.

Adapting Hybrid Parallel-Head Large Language Models for Southeast Asia

Modern generators often violate basic physics (e.g.,
inconsistent shadows, geometry, and measurement models),
limiting trust in video synthesis and computational
imaging. We propose finite-time Schrödinger-Bridge (SB)
world models that cast generation as entropy-regularized
optimal transport from a simple prior to a physics- and
data-consistent distribution. Unlike post-hoc corrections,
our approach injects structure along the transport path:
multi-view geometry (reprojection/epipolar constraints,
homographies, depth-aware warps) for video, and
differentiable optical operators (PSF-based defocus,
lightweight Fourier propagation for coherent/partially
coherent settings) for imaging. With known poses, we
penalize reprojection and warp-aligned photometric/feature
errors; with unknown poses, a compact head estimates
motion/flow with cycle-consistency. Compact UNet/ViT
backbones and short SB horizons target efficiency.
Evaluation spans 3D consistency metrics, physics fidelity
via forward-simulation error, and generative
quality/efficiency (FID/KID, FVD) against strong diffusion
baselines, plug-and-play data-consistency, and
unconstrained SB. By constraining the path rather than only
the endpoint, the method aims to shorten sampling while
improving cross-view coherence and physical plausibility
across sensors (cameras, microscopes, medical scanners).

Physics Consistent World Models via Schrödinger-Bridge Optimal Transport for Computational Imaging and 3D-Consistent Video Generations

I present a compact, testable architecture that endows learning agents with continuous proto-emotional dynamics and interpretable modulators (Persona, Ego, Shadow, Self). The design grounds these modulators in a computational interpretation of Jung’s Map of the Soul, mapping each archetype to a differentiable control that modulates policy selection via a bounded, low-dimensional affect vector. I describe concrete modular implementations, a staged experimental program (toy domains → multi-agent/social tasks → nonstationary transfer), baselines, ablations, and reproducible evaluation metrics.

Persona, Ego, Shadow, and Self: A Map of the Soul Framework for Proto-Emotional Homeostasis in AI

Sexual trauma leaves wounds that science cannot see, yet
survivors live with them every day. Traditional tools rely
on words or self-reports, often forcing survivors to “bleed
in silence” when their pain is doubted or dismissed.
Trauma, however, is not one-dimensional. It disrupts
multiple brain networks and produces states (fear,
vigilance, detachment) that cannot be captured by words
alone. This creates the need for approaches that reveal
trauma’s complexity in ways that are both objective and
interpretable.
We propose a framework that combines fMRI, EEG, and
interpretable self-supervised AI (DINO) to uncover hidden
patterns of trauma in the brain. Instead of producing
abstract scans or opaque predictions, the system will
generate exploratory measures of trauma response that
support therapists’ understanding while guiding future
research. These measures will be presented through a simple
dashboard that summarizes three indices (TPI, DI, RBS)
alongside heatmaps and plain language notes. By turning
complex data into clear, anonymized session snapshots, the
dashboard provides researchers with a practical output that
can be compared across participants and refined in future
work.

Understanding the Management of Rape Trauma with AI and Neuroimaging

The widespread adoption of artificial intelligence (AI) in
cybersecurity has led to the emergence of intelligent
threats, such as Advanced Persistent Threats (APTs),
that challenge conventional deception defense mechanisms.
My work aims to fill this critical gap by developing a
game-theoretic defense agent capable of confronting these
intelligent threats. In this proposal, I formalize the
attacker-defender interactions as a Bayesian game
between AI agents so as to derive equilibrium defense
strategies. Simulation-based experiments and real-world
implementations will be conducted to evaluate the proposed
framework. This study has the potential to revolutionize
cyber defense methodologies by shifting from bilateral
decision-making to game-theoretic strategy evolution.
