Below are all poster session assignments for MSLD 2026, organized by session. Each session covers a thematic area of speech and language research.
Poster Session 1 S01 – S06 · Wednesday April 15, 11:30–13:30
| Poster Panel ID | Title | OpenReview # | First Author |
|---|---|---|---|
| S01 — ASR, Speech Translation & Multi-Speaker Speech | |||
| 1 | Variation Outweighs Syntax: An Empirical Analysis of Data Augmentation for Low-Resource ASR | 12 | Katsumi Ibaraki |
| 2 | Leveraging SpeechLLMs for Second-Language Speech Recognition | 161 | Zhu Zhu |
| 3 | Something from Nothing: Data Augmentation for Robust Severity Level Estimation of Dysarthric Speech | 113 | Jaesung Bae |
| 4 | Diarization-Conditioning: A Unified Framework for Multi-Speaker ASR, Code-Switching, and Target Speaker Reasoning with Spoken LLMs | 102 | Alexander Polok |
| 5 | Exploration of Serialization Orders in Multi-Talker ASR | 90 | Chien-yu Huang |
| 6 | Zero-Shot Speech-to-Speech Translation without Parallel Speech | 40 | Zhisheng Zheng |
| S02 — Spoken LMs, Speech Interaction & Dialogue | |||
| 7 | Dialect Matters: Cross-Lingual ASR Transfer for Low-Resource Indic Language Varieties | 84 | Akriti Dhasmana |
| 8 | DuplexGen: Adaptive Synthesis of Human-AI Turn-Taking Dialogues | 106 | Takyoung Kim |
| 9 | K2Profile: A Benchmark for Conversational Student Profiling under Child Speech and ASR Noise | 143 | Xiaocheng Yang |
| 10 | PRiSM: Benchmarking Phone Realization in Speech Models | 142 | Shikhar Bharadwaj |
| 11 | Temporal Scope Stability in Conversational AI: Measuring Present Bias in Multi-Turn Question Answering | 122 | Yash Kumar Atri |
| 12 | Afrispeech Semantics: Evaluating Audio-Semantic Reasoning in Spoken Language Models Across Domains and Accents | 23 | Chibuzor Okocha |
| 13 | Advancing Speech In-Context Learning with Semantic and Acoustic Retrieval | 54 | Haolong Zheng |
| 14 | Can Speech LLMs Think while Listening? | 7 | Yi-Jen Shih |
| 15 | Eliciting interactional competence: Comparing AI and human-elicited roleplays | 39 | Yunwen Su |
| 16 | MoshiRAG: Asynchronous Knowledge Retrieval for Full-Duplex Speech Language Models | 3 | Chung-Ming Chien |
| S03 — Speech Representations, Phonology & Probing | |||
| 17 | Layer-wise Minimal Pair Probing Reveals Contextual Grammatical-Conceptual Hierarchy in Speech Representations | 183 | Linyang He |
| 18 | Self-Supervised Speech Models Encode Phonetic Context via Position-dependent Orthogonal Subspaces | 19 | Kwanghee Choi |
| 19 | [b]=[d]-[t]+[p]: Self-supervised Speech Models Discover Phonological Vector Arithmetic | 17 | Kwanghee Choi |
| 20 | Distinguishing Speech from Writing with Multi-Topic Clustering and Visual Explainable AI | 151 | Mina Rajaei Moghadam |
| 21 | Know Thyself? On the Incapability and Implications of AI Self-Recognition | 28 | Xiaoyan Bai |
| 22 | Toward a Universal Local Speech Feature Extractor through Distillation | 125 | Jessica Yang |
| 23 | Training Language Models and Embeddings for Hybrid Classical/Quantum Computing | 64 | Damir Cavar |
| S04 — Speech Generation, Audio Systems & Codecs | |||
| 24 | Federated in-context learning: Iterative refinement for improved answer quality | 155 | Ruhan Wang |
| 25 | FastMSS: A Scalable Synthetic Conversations Generator | 157 | Alexander Polok |
| 26 | From Hallucination to Articulation: Language Model-Driven Losses for Ultra Low-Bitrate Neural Speech Coding | 121 | Jayeon Yi |
| 27 | Few-Shot Synthetic-Only Accent Adaptation for ASR via LLM-Guided Phoneme Editing | 33 | Yurii Halychanskyi |
| 28 | Quantifying the Modality Gap Between Speech and Text with Generative Perplexity Under Controlled Data and Objective Settings | 65 | Ju-Chieh Chou |
| 29 | When Fine-Tuning Fails and when it Generalises: Role of Data Diversity and Mixed Training in LLM-based TTS | 117 | Anupam Purwar |
| S05 — Multilinguality, Translation & Code-Switching I | |||
| 30 | AudioChat: Unified Audio Storytelling, Editing, and Understanding with Transfusion Forcing | 150 | William Chen |
| 31 | Joint Enhancement and Classification using Coupled Diffusion Models of Signals and Logits | 9 | Joseph Keshet |
| 32 | Robust Semantic Reasoning in Audio Language Models via In-Context Learning | 83 | Chibuzor Okocha |
| 33 | ESPnet3: Infrastructure for Scalable Speech and Audio Research in the Foundation Model Era | 101 | Masao Someki |
| 34 | SPARCLE: SPeaker-aware Aligned Representations via Contrastive Language Embeddings | 82 | Priyam Mazumdar |
| S06 — Multilinguality, Translation & Code-Switching II | |||
| 35 | MicroBERT-MT: Machine Translation as an Auxiliary Pretraining Task for Low-resource Monolingual Encoders | 58 | Phakphum Artkaew |
| 36 | Evaluating the Impact of Verbal Multiword Expressions on Machine Translation | 21 | Linfeng Liu |
| 37 | Translation-Induced Label Drift across Nine Languages in Natural Language Inference | 44 | Muhammad S. Abdo |
| 38 | An Empirical Recipe for Universal Phone Recognition | 144 | Shikhar Bharadwaj |
| 39 | Cross-Lingual Prompt Steerability: Towards Accurate and Robust LLM Behavior across Languages | 145 | Lechen Zhang |
| 40 | Embracing the Noise: Improving Zero-Shot Dialect Transfer in LLMs via Perturbation-based Continued Pre-training | 124 | Aarohi Srivastava |
| 41 | How Do Multilingual Language Models Handle Multiple Languages? | 61 | Satya Subrahmanya Gautama Shastry Bulusu Venkata |
Poster Session 2 S07 – S11 · Wednesday April 15, 16:00–18:00
| Panel | Title | Paper # | First Author |
|---|---|---|---|
| S07 — Tokenization, Pretraining & Efficient Training | |||
| 1 | Beyond Explicit Tokenization: Investigating Transformer Limitations with Subword Granularity | 172 | Kenneth Sible |
| 2 | Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning | 149 | Dylan Zhang |
| 3 | IBERT: Idiom Cloze-style reading comprehension with Attention | 88 | Haozheng Luo |
| 4 | Internalizing World Models via Self-Play Finetuning for Agentic RL | 112 | Shiqi Chen |
| 5 | Is Sorting Hard for Transformers? | 66 | Nathan W. Henry |
| 6 | Loop the Middle: Adaptive Depth Transformers via Selective Middle-Layer Recurrence | 184 | Lechen Zhang |
| 7 | Midtraining Bridges Pretraining and Posttraining Distributions | 8 | Emmy Liu |
| 8 | Semantics-Aware Generative Latent Data Augmentation for Learning in Low-Resource Domains | 109 | Jaesung Bae |
| 9 | Surprisingly Strong and Sparse RL with SGD in LLMs | 38 | Sagnik Mukherjee |
| 10 | Code-Mixed Telugu-English Hate Speech Detection | 70 | Satya Subrahmanya Gautama Shastry Bulusu Venkata |
| 11 | FLEXITOKENS: Flexible Tokenization for Evolving Language Models | 80 | Abraham Toluwase Owodunni |
| 12 | How do language models perceive creative language: A deep dive into generation and multilingual adaptation of novel slang | 43 | Zhewei Sun |
| S08 — Prompting, Steering & Test-Time Adaptation | |||
| 13 | ATLAS: Adaptive Test-Time Latent Steering with External Verifiers for Enhancing LLMs' Reasoning | 16 | Tuc Nguyen |
| 14 | BiasGRPO: Stabilizing Bias Mitigation in High-Variance Reward Landscapes via Group-Relative Policy Optimization | 163 | Saket Reddy |
| 15 | Can you steer Whisper with steering vectors from GPT2-xl? | 74 | Ayush Jain |
| 16 | Rescorla-Wagner Steering of LLMs for Undesired Behaviors over Disproportionate Inappropriate Context | 181 | Rushi Wang |
| 17 | Steer2Adapt: Dynamically Composing Steering Vectors Elicits Efficient Adaptation of LLMs | 156 | Pengrui Han |
| 18 | Critical tokens and inference-time scaling | 32 | Jinu Lee |
| 19 | On the Limits of Test-Time Compute: Sequential Reward Filtering for Better Inference | 158 | Yue Yu |
| S09 — Reasoning & Inference-Time Scaling | |||
| 20 | Causal Graph Ensembles of Reasoning Traces to Improve LLM Reasoning and Mitigate Hallucinations | 146 | Amruta Parulekar |
| 21 | FROST: Filtering Reasoning Outliers with Attention for Efficient Reasoning | 87 | Haozheng Luo |
| 22 | How Hard is Math? Using Quantitative Metrics to Measure LLM Alignment to Human Intuitions of Difficulty | 76 | Micah Helzerman |
| 23 | Long Listening Thoughts: Eliciting Open Auditory Reasoning with Deliberative Perception and Cognitive Refinement | 136 | Jaeyeon Kim |
| 24 | MMGR: Multi-Modal Generative Reasoning Benchmarking World Models for Language-Grounded Agents | 132 | Zefan Cai |
| 25 | Towards A Universally Causal Agent | 41 | Qirun Dai |
| 26 | Understanding Reasoning Collapse in LLM Agent Reinforcement Learning | 67 | Zihan Wang |
| 27 | Not All Tokens Need to Be Said: Selective Latent Execution for Efficient Chain-of-Thought Reasoning | 170 | Jiarui Liu |
| 28 | OSExpert: Computer-Use Agents Learning Professional Skills via Exploration | 180 | Jiateng Liu |
| S10 — Causal, Legal & Structured Reasoning | |||
| 29 | Agentic Causal Discovery Of Unseen Worlds | 141 | Dylan Zhang |
| 30 | Executable Counterfactuals: Improving LLMs' Causal Reasoning Through Code | 165 | Aniket Vashishtha |
| 31 | Beyond Pattern Matching: Teaching Language Models to Plan, Sequence, and Execute Reasoning Primitives | 177 | Jiarui Liu |
| 32 | Exploring Chemical Space with LLM Reasoning | 13 | Yihan Zhu |
| 33 | LiveMathematicianBench: A Benchmark for Research-Level Mathematics with Proof Sketches | 185 | Linyang He |
| 34 | Perception-Aware Policy Optimization for Multimodal Reasoning | 105 | Zhenhailong Wang |
| 35 | R-KV: Redundancy-aware KV Cache Compression for Reasoning Models | 131 | Zefan Cai |
| 36 | Spatial Reasoning Through Modality Switching Across Language, Vision, and Symbols | 135 | Shreya Rajpal |
| 37 | Think Multilingual, Not Harder: A Data-Efficient Framework for Teaching Reasoning Models to Code-Switch | 119 | Eleanor M Lin |
| 38 | VAGEN: Reinforcing World Model Reasoning for Multi-Turn VLM Agents | 108 | Kangrui Wang |
| S11 — Agents, Tool Use & Exploration | |||
| 39 | From Payer Policy to Structured Requirement Checklists: Leveraging Large Language Model Agents for Insurance Document Understanding | 164 | Yi-Jyun Sun |
| 40 | How Do AI Agents Spend Your Money? Analyzing and Predicting Token Consumption in Agentic Coding Tasks | 171 | Longju Bai |
| 41 | LimAgents: A Multi-Agent RAG Framework and Large-Scale Corpus for Research Limitation Generation | 179 | Ibrahim Al Azher |
| 42 | Environment Exploration as a Scaling Paradigm for Interactive Agents | 178 | Jiateng Liu |
| 43 | A Student-Role Web-Agent Architecture for Testing Educational Interfaces | 53 | Siyang Liu |
| 44 | AgentDebug: Where LLM Agents Fail and How They can Learn From Failures | 98 | Kunlun Zhu |
| 45 | Can LLM Agents Use Tools Economically? Benchmarking Cost-Optimal Planning and Adaptive Replanning in Dynamic Environments | 130 | Jiayu Liu |
| 46 | Debating AI in Education: Public Conversations on Reddit | 116 | Asma Arrak |
| 47 | MAP-AgMO: Multi-Agent framework for Personalized Agricultural Multi-Objective decision-making | 92 | Josué Kpodo |
| 48 | Navigating Worlds and Minds: Dynamic Evaluation of LLM Agent Robustness under Progressively disclosing Dual-Constraints | 175 | Jiayu Liu |
| 49 | ReplicatorBench: Benchmarking LLM Agents on Replicability Studies in Social and Behavioral Sciences | 10 | Bang Nguyen |
| 50 | SkillCraft: Can LLM Agents Learn to Use Tools Skillfully? | 115 | Shiqi Chen |
| 51 | Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data | 118 | Emre Can Acikgoz |
Poster Session 3 S12 – S15 · Thursday April 16, 11:30–13:30
| Panel | Title | Paper # | First Author |
|---|---|---|---|
| S12 — Memory, Personalization & User-Centric Agents | |||
| 1 | Can we trust LLMs to manage time? Training Self-Evolving Assistant with Reinforcement Learning | 104 | Bingxuan Li |
| 2 | Jointly Optimizing Clinical Reasoning and Memory Management of Self-Learning Agent for Disease Diagnosis | 103 | Bingxuan Li |
| 3 | Learning Where to Remember: Dynamic Memory Routing for Reliable LLM Agents | 123 | Hyeonjeong Ha |
| 4 | MultiSessionCollab: Learning User Preferences with Memory to Improve Long-Term Collaboration | 77 | Shuhaib Mehri |
| 5 | PlugMem: A Task-Agnostic Plugin Memory Module for LLM Agents | 62 | Ke Yang |
| 6 | User Preference Modeling for Conversational LLM Agents: Weak Rewards from Retrieval-Augmented Interaction | 86 | Yuren Hao |
| 7 | Benchmarking and Improving LLM Robustness for Personalized Generation | 55 | Chimaobi Okite |
| 8 | MM-tau-p²: Persona-Adaptive Prompting for Robust Multi-Modal Agent Evaluation in Dual-Control Settings | 188 | Anupam Purwar |
| 9 | PersonalBench: A Human-Grounded Benchmark for LLM Personalization | 174 | Lechen Zhang |
| 10 | Towards User-Centric Agent Training and Benchmarking | 4 | Cheng Qian |
| S13 — Retrieval, Knowledge Graphs & Grounded QA | |||
| 11 | Training-Free Voter-Adjudicator Framework for Schema-Guided Biomedical Annotation | 160 | Gibong Hong |
| 12 | A Clinical SKOS Ontology and Evaluation Benchmark for LLM Query Generation over ICU Knowledge Graphs | 26 | Khurrum Ali |
| 13 | AgriNarratives: Extraction and Analysis of Genotype – Environment – Management Interactions Using Natural Language Processing | 111 | Siraj Osman Omer |
| 14 | Condition-Gated Reasoning for Context-Dependent Biomedical Question Answering | 159 | Jash Rajesh Parekh |
| 15 | LLM OntologyRAG - Extending a Food-Agent with a Description Logic Knowledge Representation | 36 | Damir Cavar |
| 16 | NutriSync AI: Graph-Augmented Retrieval for Personalized Food-Safety Guidance Grounding LLM Recommendations in Biomedical Knowledge-Graph Evidence and Quantitative RAG Evaluation | 60 | Kumar Koushik Telaprolu |
| 17 | Ontology-Grounded Knowledge Graph Construction for Alzheimer's Disease Literature Using Multi-Model Ensemble Embeddings | 46 | Muhammad S. Abdo |
| 18 | Multi-Agent LLMs for Style-Controlled and Faithful Scientific Conclusion Generation | 153 | Mosab Rezaei |
| 19 | Not In Our Time: Transport-Based Content Selection as an Inspectable, Manipulable Alternative to RAG | 147 | Chris Brew |
| 20 | PatientAdvocateLM: A Clinically Grounded Retrieval-Augmented Conversational Agent for Medical Visit Preparation | 52 | Siyang Liu |
| 21 | Pixel-Grounded Retrieval for Knowledgeable Large Multimodal Models | 45 | Jeonghwan Kim |
| 22 | Tackling Distractor Documents in Multi-Hop QA with Reinforcement and Curriculum Learning | 79 | Jerry Huang |
| S14 — Beliefs, Persuasion, Benchmarks & Domain Applications | |||
| 23 | Persuasion-R1: Reinforcement Learning for Training and Analyzing Persuasive LLM Agents | 120 | Nimet Beyza Bozdag |
| 24 | "Natural language as an Action Language" | 37 | Samuel Corey |
| 25 | MineCEraft: Evaluating Language Models as Construction Engineers in the World of Minecraft | 75 | Sewoong Lee |
| 26 | Evaluating Legal Reasoning Traces with Legal Issue Tree Rubrics | 30 | Jinu Lee |
| 27 | From Pagina to Webpage: On Developing and Documenting a Digitized Latin Collection | 94 | Stephen Bothwell |
| 28 | L1 Acquisition in Telicity: Connecting Linguistic Cues and LLM-Based Surprisal | 85 | Ellie Xia |
| 29 | MORA: AI-Mediated Story-Based practice for Speech Sound Disorder from Clinic to Home | 47 | Sumin Hong |
| 30 | Benchmarking LLMs for Pediatric Gastroenterology Knowledge Tasks | 128 | Dalia Khaizaran |
| 31 | Evaluating LLM Creativity as Long-Tail Performance | 182 | Yichen Wang |
| 32 | Learning to Predict Future-Aligned Research Proposals with Language Models | 167 | Heng Wang |
| 33 | Scientific Olfactory Information Extraction: Toward a Unified NLP Framework for Chemosensory Knowledge Discovery | 162 | Evan Guerra |
| 34 | Sparking Scientific Creativity via LLM-Driven Interdisciplinary Inspiration | 166 | Priyanka Kargupta |
| 35 | The Story is Not the Science: Execution-Grounded Evaluation of Mechanistic Interpretability Research | 27 | Xiaoyan Bai |
| 36 | Why AI Scientists Are Not Yet Ready for Open-Ended and Fully Autonomous Scientific Discovery | 114 | Kunlun Zhu |
| S15 — Conversation Quality, Hallucination & Trust | |||
| 37 | Improving Hallucination Detection in Dialog via Social Framing Analysis | 186 | Parisa Rabbani |
| 38 | Drift No More? Context Equilibria in Multi-Turn LLM Interactions | 6 | Vardhan Dongre |
| 39 | PSI-Bench: Towards Interpretable and Clinically Grounded Evaluation of Depressive Patient Simulators | 99 | Nguyen Khoi Hoang |
| 40 | Towards a Benchmark for Epistemic Modality in Large Language Models | 169 | Tianjia Dong |
| 41 | Veridicality Beyond Factuality: Turkish Evidentiality as a Test of Human and LLM Reasoning | 93 | Sercan Karakas |
| 42 | Hallucination in the Wild: A Field Guide for LLM Users | 5 | Ashley Lewis |
| 43 | Quantifying Contextual Hallucinations in NLP Research Papers Before and After the LLM Era | 95 | Adiba Ibnat Hossain |
| 44 | The Law of Knowledge Overshadowing: Towards Understanding, Predicting, and Preventing LLM Hallucination | 126 | Yuji Zhang |
| 45 | TRUST Agents 2.0: Comparing Two Agentic Fact-Checking Pipelines for Explainable and Uncertainty-Aware Verification | 72 | Satya Subrahmanya Gautama Shastry Bulusu Venkata |
Poster Session 4 S16 – S20 · Thursday April 16, 15:45–17:45
| Panel | Title | Paper # | First Author |
|---|---|---|---|
| S16 — Bias, Fairness & Social Values | |||
| 1 | FAC-FACodec: Controllable Zero-Shot Foreign Accent Conversion with Factorized Speech Codec | 35 | Yurii Halychanskyi |
| 2 | Fairness Failure Modes of Multimodal LLMs | 73 | Canyu Chen |
| 3 | From Preferences to Prejudice: Alignment Tuning Amplifies Social Bias in Language-Conditioned Video Generation | 133 | Zefan Cai |
| 4 | SoNoLiSi: Simulating the Social Norm Lifecycle with Generative Agents | 134 | Rasika Muralidharan |
| 5 | The Curious Case of Curiosity across Human Cultures and LLMs | 18 | Angana Borah |
| 6 | Wait, am I Being Fair? Characterizing Deductive Stereotyping and Mitigating It with Fair-GCG | 1 | Naihao Deng |
| 7 | Who Gets Which Message? Auditing Demographic Bias in LLM-Generated Targeted Text | 11 | Tunazzina Islam |
| 8 | Do Emotions Influence Moral Judgment in Large Language Models? | 24 | Mohammad Saim |
| 9 | Modeling Annotator Disagreement with Demographic-Aware Experts and Synthetic Perspectives | 51 | Yinuo Xu |
| 10 | Who Plays Which Role When? Communication Role Dynamics for Peer Recognition and Team Performance Prediction | 139 | Yifan Song |
| S17 — Multimodal Reasoning, Vision-Language & Spatial | |||
| 12 | ENACT: Evaluating Embodied Cognition with World Modeling of Egocentric Interaction | 71 | Qineng Wang |
| 12 | SATURN: Symbolic Spatial Reasoning from 3D Scene Structure for Vision-Language | 176 | Danial Kamali |
| 13 | Theory of Space: Can Foundation Models Construct Spatial Beliefs through Active Exploration? | 97 | Pingyue Zhang |
| 14 | World Model as Tool: An Empirical Study on Agent Foresight Governance | 129 | Cheng Qian |
| 15 | NePTune: A Neuro-Pythonic Framework for Tunable Compositional Reasoning on Vision-Language | 81 | Danial Kamali |
| S18 — Multimodal Generation, Memes & Visual Communication | |||
| 16 | AVMeme Exam: A Multimodal Multilingual Multicultural Benchmark for LLMs' Contextual and Cultural Knowledge and Thinking | 89 | Xilin Jiang |
| 17 | Beyond Multi-modality: Evaluating AI's Meme Literacy in Conversational Contexts on Bluesky | 57 | Juyoung An |
| 18 | A Computational Approach to Visual Metonymy | 20 | Saptarshi Ghosh |
| 19 | CLAMP: Constrained Language-Action Multimodal Planning | 48 | Tianyi Ma |
| 20 | FoR-SALE: Frame of Reference-guided Spatial Adjustment in LLM-based Diffusion Editing | 42 | Tanawan Premsri |
| 21 | MENTOR: Efficient Multimodal-Conditioned Tuning for Language-Guided Visual Generation Agents | 140 | Haozhe Zhao |
| 22 | Structured Multimodal World Models for Knowledge Localization, Safe Editing, and Predictive Situation Understanding | 100 | Aditi Tiwari |
| S19 — Social Media, Slang & Online Discourse | |||
| 23 | Email in the Era of LLMs | 34 | Dang Nguyen |
| 24 | Modeling LLM Persuasion via Interactive 2T1L Game | 56 | Shivani Kumar |
| 25 | Towards stable belief LLMs | 148 | Sumit Kumar |
| 26 | Vulnerability of LLMs' Stated Beliefs? LLMs Belief Resistance Check Through Strategic Persuasive Conversation Interventions | 68 | Fan Huang |
| 27 | A Computational Analysis of Social Media Reframing of News Events | 96 | Achyutarama R Ganti |
| 28 | Audience Evaluations of TV Characters in Reddit Communities | 137 | Souha Ben Hassine |
| 29 | Detecting Emerging Drug Slang and Code Language in Social Media Posts | 59 | Damir Cavar |
| 30 | From Utterances to Networks: Modelling Slang Adoption and Diffusion Across Subreddits | 29 | Xiaoning Wang |
| 31 | Iterative Topic Taxonomy Induction with LLMs: A Case Study of Electoral Advertising | 31 | Tunazzina Islam |
| 32 | Measuring Student's Perception of LLM Capabilities and Usage | 22 | Oluchi Obadoni |
| 33 | The Non-Traditional News Ecosystems of Podcasts | 154 | Michela Marchini |
| S20 — Safety, Alignment & Unlearning | |||
| 34 | Alignment Faking in Language Models is Frequent: Value-focused Diagnosis and Efficient Mitigation | 25 | Inderjeet Jayakumar Nair |
| 35 | Anatomy of an Unsafe Thought: Behavior Profiling and Safety Scoring in Reasoning Chains | 69 | Ishita Kakkar |
| 36 | From Rubrics to Rewards: Aligning Question Generation for Early Literacy | 168 | Yi-Jyun Sun |
| 37 | Prior Beliefs Prejudice LLM-as-Judge: Evidence from Persuasion Evaluation | 110 | Pardis Sadat Zahraei |
| 38 | I Am Aligned, But With Whom? Diagnosing Structural Alignment Failures in Multilingual LLMs | 107 | Pardis Sadat Zahraei |
| 39 | Learning to Optimize Multi-Objective Alignment Through Dynamic Reward Weighting | 2 | Yining Lu |
| 40 | Optimizing Diversity and Quality through Base-Aligned Model Collaboration | 91 | Yichen Wang |
| 41 | Community-Driven Model Development is Unreliable and Unsafe | 15 | Jack Sanderson |
| 42 | Data Synthesis with Influence Rewarded Models | 49 | Ishika Agarwal |
| 43 | How Catastrophic is Your LLM? Certifying Risks in Conversation | 173 | Chengxiao Wang |
| 44 | Let there be Frontier Model System Certification | 187 | Isha Chaudhary |
| 45 | Geometric-disentanglement Unlearning | 127 | Duo Zhou |
| 46 | Knowledge Control for Trustworthy and Responsible AI | 14 | Zheyuan Liu |
| 47 | Unlearning-induced Collateral Corruption in Machine Unlearning for LLMs | 63 | Bo Su |