Poster Sessions

Below are all poster session assignments for MSLD 2026, organized by session. Each session covers a thematic area of speech and language research.

Poster Session 1 S01 – S06  ·  Wednesday April 15, 11:30–13:30

Poster Panel ID Title OpenReview # First Author
S01 — ASR, Speech Translation & Multi-Speaker Speech
1 Variation Outweighs Syntax: An Empirical Analysis of Data Augmentation for Low-Resource ASR 12Katsumi Ibaraki
2 Leveraging SpeechLLMs for Second-Language Speech Recognition 161Zhu Zhu
3 Something from Nothing: Data Augmentation for Robust Severity Level Estimation of Dysarthric Speech 113Jaesung Bae
4 Diarization-Conditioning: A Unified Framework for Multi-Speaker ASR, Code-Switching, and Target Speaker Reasoning with Spoken LLMs 102Alexander Polok
5 Exploration of Serialization Orders in Multi-Talker ASR 90Chien-yu Huang
6 Zero-Shot Speech-to-Speech Translation without Parallel Speech 40Zhisheng Zheng
S02 — Spoken LMs, Speech Interaction & Dialogue
7 Dialect Matters: Cross-Lingual ASR Transfer for Low-Resource Indic Language Varieties 84Akriti Dhasmana
8 DuplexGen: Adaptive Synthesis of Human-AI Turn-Taking Dialogues 106Takyoung Kim
9 K2Profile: A Benchmark for Conversational Student Profiling under Child Speech and ASR Noise 143Xiaocheng Yang
10 PRiSM: Benchmarking Phone Realization in Speech Models 142Shikhar Bharadwaj
11 Temporal Scope Stability in Conversational AI: Measuring Present Bias in Multi-Turn Question Answering 122Yash Kumar Atri
12 Afrispeech Semantics: Evaluating Audio-Semantic Reasoning in Spoken Language Models Across Domains and Accents 23Chibuzor Okocha
13 Advancing Speech In-Context Learning with Semantic and Acoustic Retrieval 54Haolong Zheng
14 Can Speech LLMs Think while Listening? 7Yi-Jen Shih
15 Eliciting interactional competence: Comparing AI and human-elicited roleplays 39Yunwen Su
16 MoshiRAG: Asynchronous Knowledge Retrieval for Full-Duplex Speech Language Models 3Chung-Ming Chien
S03 — Speech Representations, Phonology & Probing
17 Layer-wise Minimal Pair Probing Reveals Contextual Grammatical-Conceptual Hierarchy in Speech Representations 183Linyang He
18 Self-Supervised Speech Models Encode Phonetic Context via Position-dependent Orthogonal Subspaces 19Kwanghee Choi
19 [b]=[d]-[t]+[p]: Self-supervised Speech Models Discover Phonological Vector Arithmetic 17Kwanghee Choi
20 Distinguishing Speech from Writing with Multi-Topic Clustering and Visual Explainable AI 151Mina Rajaei Moghadam
21 Know Thyself? On the Incapability and Implications of AI Self-Recognition 28Xiaoyan Bai
22 Toward a Universal Local Speech Feature Extractor through Distillation 125Jessica Yang
23 Training Language Models and Embeddings for Hybrid Classical/Quantum Computing 64Damir Cavar
S04 — Speech Generation, Audio Systems & Codecs
24 Federated in-context learning: Iterative refinement for improved answer quality 155Ruhan Wang
25 FastMSS: A Scalable Synthetic Conversations Generator 157Alexander Polok
26 From Hallucination to Articulation: Language Model-Driven Losses for Ultra Low-Bitrate Neural Speech Coding 121Jayeon Yi
27 Few-Shot Synthetic-Only Accent Adaptation for ASR via LLM-Guided Phoneme Editing 33Yurii Halychanskyi
28 Quantifying the Modality Gap Between Speech and Text with Generative Perplexity Under Controlled Data and Objective Settings 65Ju-Chieh Chou
29 When Fine-Tuning Fails and when it Generalises: Role of Data Diversity and Mixed Training in LLM-based TTS 117Anupam Purwar
S05 — Multilinguality, Translation & Code-Switching I
30 AudioChat: Unified Audio Storytelling, Editing, and Understanding with Transfusion Forcing 150William Chen
31 Joint Enhancement and Classification using Coupled Diffusion Models of Signals and Logits 9Joseph Keshet
32 Robust Semantic Reasoning in Audio Language Models via In-Context Learning 83Chibuzor Okocha
33 ESPnet3: Infrastructure for Scalable Speech and Audio Research in the Foundation Model Era 101Masao Someki
34 SPARCLE: SPeaker-aware Aligned Representations via Contrastive Language Embeddings 82Priyam Mazumdar
S06 — Multilinguality, Translation & Code-Switching II
35 MicroBERT-MT: Machine Translation as an Auxiliary Pretraining Task for Low-resource Monolingual Encoders 58Phakphum Artkaew
36 Evaluating the Impact of Verbal Multiword Expressions on Machine Translation 21Linfeng Liu
37 Translation-Induced Label Drift across Nine Languages in Natural Language Inference 44Muhammad S. Abdo
38 An Empirical Recipe for Universal Phone Recognition 144Shikhar Bharadwaj
39 Cross-Lingual Prompt Steerability: Towards Accurate and Robust LLM Behavior across Languages 145Lechen Zhang
40 Embracing the Noise: Improving Zero-Shot Dialect Transfer in LLMs via Perturbation-based Continued Pre-training 124Aarohi Srivastava
41 How Do Multilingual Language Models Handle Multiple Languages? 61Satya Subrahmanya Gautama Shastry Bulusu Venkata

Poster Session 2 S07 – S11  ·  Wednesday April 15, 16:00–18:00

Panel Title Paper # First Author
S07 — Tokenization, Pretraining & Efficient Training
1 Beyond Explicit Tokenization: Investigating Transformer Limitations with Subword Granularity 172Kenneth Sible
2 Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning 149Dylan Zhang
3 IBERT: Idiom Cloze-style reading comprehension with Attention 88Haozheng Luo
4 Internalizing World Models via Self-Play Finetuning for Agentic RL 112Shiqi Chen
5 Is Sorting Hard for Transformers? 66Nathan W. Henry
6 Loop the Middle: Adaptive Depth Transformers via Selective Middle-Layer Recurrence 184Lechen Zhang
7 Midtraining Bridges Pretraining and Posttraining Distributions 8Emmy Liu
8 Semantics-Aware Generative Latent Data Augmentation for Learning in Low-Resource Domains 109Jaesung Bae
9 Surprisingly Strong and Sparse RL with SGD in LLMs 38Sagnik Mukherjee
10 Code-Mixed Telugu-English Hate Speech Detection 70Satya Subrahmanya Gautama Shastry Bulusu Venkata
11 FLEXITOKENS: Flexible Tokenization for Evolving Language Models 80Abraham Toluwase Owodunni
12 How do language models perceive creative language: A deep dive into generation and multilingual adaptation of novel slang 43Zhewei Sun
S08 — Prompting, Steering & Test-Time Adaptation
13 ATLAS: Adaptive Test-Time Latent Steering with External Verifiers for Enhancing LLMs' Reasoning 16Tuc Nguyen
14 BiasGRPO: Stabilizing Bias Mitigation in High-Variance Reward Landscapes via Group-Relative Policy Optimization 163Saket Reddy
15 Can you steer Whisper with steering vectors from GPT2-xl? 74Ayush Jain
16 Rescorla-Wagner Steering of LLMs for Undesired Behaviors over Disproportionate Inappropriate Context 181Rushi Wang
17 Steer2Adapt: Dynamically Composing Steering Vectors Elicits Efficient Adaptation of LLMs 156Pengrui Han
18 Critical tokens and inference-time scaling 32Jinu Lee
19 On the Limits of Test-Time Compute: Sequential Reward Filtering for Better Inference 158Yue Yu
S09 — Reasoning & Inference-Time Scaling
20 Causal Graph Ensembles of Reasoning Traces to Improve LLM Reasoning and Mitigate Hallucinations 146Amruta Parulekar
21 FROST: Filtering Reasoning Outliers with Attention for Efficient Reasoning 87Haozheng Luo
22 How Hard is Math? Using Quantitative Metrics to Measure LLM Alignment to Human Intuitions of Difficulty 76Micah Helzerman
23 Long Listening Thoughts: Eliciting Open Auditory Reasoning with Deliberative Perception and Cognitive Refinement 136Jaeyeon Kim
24 MMGR: Multi-Modal Generative Reasoning Benchmarking World Models for Language-Grounded Agents 132Zefan Cai
25 Towards A Universally Causal Agent 41Qirun Dai
26 Understanding Reasoning Collapse in LLM Agent Reinforcement Learning 67Zihan Wang
27 Not All Tokens Need to Be Said: Selective Latent Execution for Efficient Chain-of-Thought Reasoning 170Jiarui Liu
28 OSExpert: Computer-Use Agents Learning Professional Skills via Exploration 180Jiateng Liu
S10 — Causal, Legal & Structured Reasoning
29 Agentic Causal Discovery Of Unseen Worlds 141Dylan Zhang
30 Executable Counterfactuals: Improving LLMs' Causal Reasoning Through Code 165Aniket Vashishtha
31 Beyond Pattern Matching: Teaching Language Models to Plan, Sequence, and Execute Reasoning Primitives 177Jiarui Liu
32 Exploring Chemical Space with LLM Reasoning 13Yihan Zhu
33 LiveMathematicianBench: A Benchmark for Research-Level Mathematics with Proof Sketches 185Linyang He
34 Perception-Aware Policy Optimization for Multimodal Reasoning 105Zhenhailong Wang
35 R-KV: Redundancy-aware KV Cache Compression for Reasoning Models 131Zefan Cai
36 Spatial Reasoning Through Modality Switching Across Language, Vision, and Symbols 135Shreya Rajpal
37 Think Multilingual, Not Harder: A Data-Efficient Framework for Teaching Reasoning Models to Code-Switch 119Eleanor M Lin
38 VAGEN: Reinforcing World Model Reasoning for Multi-Turn VLM Agents 108Kangrui Wang
S11 — Agents, Tool Use & Exploration
39 From Payer Policy to Structured Requirement Checklists: Leveraging Large Language Model Agents for Insurance Document Understanding 164Yi-Jyun Sun
40 How Do AI Agents Spend Your Money? Analyzing and Predicting Token Consumption in Agentic Coding Tasks 171Longju Bai
41 LimAgents: A Multi-Agent RAG Framework and Large-Scale Corpus for Research Limitation Generation 179Ibrahim Al Azher
42 Environment Exploration as a Scaling Paradigm for Interactive Agents 178Jiateng Liu
43 A Student-Role Web-Agent Architecture for Testing Educational Interfaces 53Siyang Liu
44 AgentDebug: Where LLM Agents Fail and How They can Learn From Failures 98Kunlun Zhu
45 Can LLM Agents Use Tools Economically? Benchmarking Cost-Optimal Planning and Adaptive Replanning in Dynamic Environments 130Jiayu Liu
46 Debating AI in Education: Public Conversations on Reddit 116Asma Arrak
47 MAP-AgMO: Multi-Agent framework for Personalized Agricultural Multi-Objective decision-making 92Josué Kpodo
48 Navigating Worlds and Minds: Dynamic Evaluation of LLM Agent Robustness under Progressively disclosing Dual-Constraints 175Jiayu Liu
49 ReplicatorBench: Benchmarking LLM Agents on Replicability Studies in Social and Behavioral Sciences 10Bang Nguyen
50 SkillCraft: Can LLM Agents Learn to Use Tools Skillfully? 115Shiqi Chen
51 Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data 118Emre Can Acikgoz

Poster Session 3 S12 – S15  ·  Thursday April 16, 11:30–13:30

Panel Title Paper # First Author
S12 — Memory, Personalization & User-Centric Agents
1 Can we trust LLMs to manage time? Training Self-Evolving Assistant with Reinforcement Learning 104Bingxuan Li
2 Jointly Optimizing Clinical Reasoning and Memory Management of Self-Learning Agent for Disease Diagnosis 103Bingxuan Li
3 Learning Where to Remember: Dynamic Memory Routing for Reliable LLM Agents 123Hyeonjeong Ha
4 MultiSessionCollab: Learning User Preferences with Memory to Improve Long-Term Collaboration 77Shuhaib Mehri
5 PlugMem: A Task-Agnostic Plugin Memory Module for LLM Agents 62Ke Yang
6 User Preference Modeling for Conversational LLM Agents: Weak Rewards from Retrieval-Augmented Interaction 86Yuren Hao
7 Benchmarking and Improving LLM Robustness for Personalized Generation 55Chimaobi Okite
8 MM-tau-p²: Persona-Adaptive Prompting for Robust Multi-Modal Agent Evaluation in Dual-Control Settings 188Anupam Purwar
9 PersonalBench: A Human-Grounded Benchmark for LLM Personalization 174Lechen Zhang
10 Towards User-Centric Agent Training and Benchmarking 4Cheng Qian
S13 — Retrieval, Knowledge Graphs & Grounded QA
11 Training-Free Voter-Adjudicator Framework for Schema-Guided Biomedical Annotation 160Gibong Hong
12 A Clinical SKOS Ontology and Evaluation Benchmark for LLM Query Generation over ICU Knowledge Graphs 26Khurrum Ali
13 AgriNarratives: Extraction and Analysis of Genotype – Environment – Management Interactions Using Natural Language Processing 111Siraj Osman Omer
14 Condition-Gated Reasoning for Context-Dependent Biomedical Question Answering 159Jash Rajesh Parekh
15 LLM OntologyRAG - Extending a Food-Agent with a Description Logic Knowledge Representation 36Damir Cavar
16 NutriSync AI: Graph-Augmented Retrieval for Personalized Food-Safety Guidance Grounding LLM Recommendations in Biomedical Knowledge-Graph Evidence and Quantitative RAG Evaluation 60Kumar Koushik Telaprolu
17 Ontology-Grounded Knowledge Graph Construction for Alzheimer's Disease Literature Using Multi-Model Ensemble Embeddings 46Muhammad S. Abdo
18 Multi-Agent LLMs for Style-Controlled and Faithful Scientific Conclusion Generation 153Mosab Rezaei
19 Not In Our Time: Transport-Based Content Selection as an Inspectable, Manipulable Alternative to RAG 147Chris Brew
20 PatientAdvocateLM: A Clinically Grounded Retrieval-Augmented Conversational Agent for Medical Visit Preparation 52Siyang Liu
21 Pixel-Grounded Retrieval for Knowledgeable Large Multimodal Models 45Jeonghwan Kim
22 Tackling Distractor Documents in Multi-Hop QA with Reinforcement and Curriculum Learning 79Jerry Huang
S14 — Beliefs, Persuasion, Benchmarks & Domain Applications
23 Persuasion-R1: Reinforcement Learning for Training and Analyzing Persuasive LLM Agents 120Nimet Beyza Bozdag
24 "Natural language as an Action Language" 37Samuel Corey
25 MineCEraft: Evaluating Language Models as Construction Engineers in the World of Minecraft 75Sewoong Lee
26 Evaluating Legal Reasoning Traces with Legal Issue Tree Rubrics 30Jinu Lee
27 From Pagina to Webpage: On Developing and Documenting a Digitized Latin Collection 94Stephen Bothwell
28 L1 Acquisition in Telicity: Connecting Linguistic Cues and LLM-Based Surprisal 85Ellie Xia
29 MORA: AI-Mediated Story-Based practice for Speech Sound Disorder from Clinic to Home 47Sumin Hong
30 Benchmarking LLMs for Pediatric Gastroenterology Knowledge Tasks 128Dalia Khaizaran
31 Evaluating LLM Creativity as Long-Tail Performance 182Yichen Wang
32 Learning to Predict Future-Aligned Research Proposals with Language Models 167Heng Wang
33 Scientific Olfactory Information Extraction: Toward a Unified NLP Framework for Chemosensory Knowledge Discovery 162Evan Guerra
34 Sparking Scientific Creativity via LLM-Driven Interdisciplinary Inspiration 166Priyanka Kargupta
35 The Story is Not the Science: Execution-Grounded Evaluation of Mechanistic Interpretability Research 27Xiaoyan Bai
36 Why AI Scientists Are Not Yet Ready for Open-Ended and Fully Autonomous Scientific Discovery 114Kunlun Zhu
S15 — Conversation Quality, Hallucination & Trust
37 Improving Hallucination Detection in Dialog via Social Framing Analysis 186Parisa Rabbani
38 Drift No More? Context Equilibria in Multi-Turn LLM Interactions 6Vardhan Dongre
39 PSI-Bench: Towards Interpretable and Clinically Grounded Evaluation of Depressive Patient Simulators 99Nguyen Khoi Hoang
40 Towards a Benchmark for Epistemic Modality in Large Language Models 169Tianjia Dong
41 Veridicality Beyond Factuality: Turkish Evidentiality as a Test of Human and LLM Reasoning 93Sercan Karakas
42 Hallucination in the Wild: A Field Guide for LLM Users 5Ashley Lewis
43 Quantifying Contextual Hallucinations in NLP Research Papers Before and After the LLM Era 95Adiba Ibnat Hossain
44 The Law of Knowledge Overshadowing: Towards Understanding, Predicting, and Preventing LLM Hallucination 126Yuji Zhang
45 TRUST Agents 2.0: Comparing Two Agentic Fact-Checking Pipelines for Explainable and Uncertainty-Aware Verification 72Satya Subrahmanya Gautama Shastry Bulusu Venkata

Poster Session 4 S16 – S20  ·  Thursday April 16, 15:45–17:45

Panel Title Paper # First Author
S16 — Bias, Fairness & Social Values
1 FAC-FACodec: Controllable Zero-Shot Foreign Accent Conversion with Factorized Speech Codec 35Yurii Halychanskyi
2 Fairness Failure Modes of Multimodal LLMs 73Canyu Chen
3 From Preferences to Prejudice: Alignment Tuning Amplifies Social Bias in Language-Conditioned Video Generation 133Zefan Cai
4 SoNoLiSi: Simulating the Social Norm Lifecycle with Generative Agents 134Rasika Muralidharan
5 The Curious Case of Curiosity across Human Cultures and LLMs 18Angana Borah
6 Wait, am I Being Fair? Characterizing Deductive Stereotyping and Mitigating It with Fair-GCG 1Naihao Deng
7 Who Gets Which Message? Auditing Demographic Bias in LLM-Generated Targeted Text 11Tunazzina Islam
8 Do Emotions Influence Moral Judgment in Large Language Models? 24Mohammad Saim
9 Modeling Annotator Disagreement with Demographic-Aware Experts and Synthetic Perspectives 51Yinuo Xu
10 Who Plays Which Role When? Communication Role Dynamics for Peer Recognition and Team Performance Prediction 139Yifan Song
S17 — Multimodal Reasoning, Vision-Language & Spatial
12 ENACT: Evaluating Embodied Cognition with World Modeling of Egocentric Interaction 71Qineng Wang
12 SATURN: Symbolic Spatial Reasoning from 3D Scene Structure for Vision-Language 176Danial Kamali
13 Theory of Space: Can Foundation Models Construct Spatial Beliefs through Active Exploration? 97Pingyue Zhang
14 World Model as Tool: An Empirical Study on Agent Foresight Governance 129Cheng Qian
15 NePTune: A Neuro-Pythonic Framework for Tunable Compositional Reasoning on Vision-Language 81Danial Kamali
S18 — Multimodal Generation, Memes & Visual Communication
16 AVMeme Exam: A Multimodal Multilingual Multicultural Benchmark for LLMs' Contextual and Cultural Knowledge and Thinking 89Xilin Jiang
17 Beyond Multi-modality: Evaluating AI's Meme Literacy in Conversational Contexts on Bluesky 57Juyoung An
18 A Computational Approach to Visual Metonymy 20Saptarshi Ghosh
19 CLAMP: Constrained Language-Action Multimodal Planning 48Tianyi Ma
20 FoR-SALE: Frame of Reference-guided Spatial Adjustment in LLM-based Diffusion Editing 42Tanawan Premsri
21 MENTOR: Efficient Multimodal-Conditioned Tuning for Language-Guided Visual Generation Agents 140Haozhe Zhao
22 Structured Multimodal World Models for Knowledge Localization, Safe Editing, and Predictive Situation Understanding 100Aditi Tiwari
S19 — Social Media, Slang & Online Discourse
23 Email in the Era of LLMs 34Dang Nguyen
24 Modeling LLM Persuasion via Interactive 2T1L Game 56Shivani Kumar
25 Towards stable belief LLMs 148Sumit Kumar
26 Vulnerability of LLMs' Stated Beliefs? LLMs Belief Resistance Check Through Strategic Persuasive Conversation Interventions 68Fan Huang
27 A Computational Analysis of Social Media Reframing of News Events 96Achyutarama R Ganti
28 Audience Evaluations of TV Characters in Reddit Communities 137Souha Ben Hassine
29 Detecting Emerging Drug Slang and Code Language in Social Media Posts 59Damir Cavar
30 From Utterances to Networks: Modelling Slang Adoption and Diffusion Across Subreddits 29Xiaoning Wang
31 Iterative Topic Taxonomy Induction with LLMs: A Case Study of Electoral Advertising 31Tunazzina Islam
32 Measuring Student's Perception of LLM Capabilities and Usage 22Oluchi Obadoni
33 The Non-Traditional News Ecosystems of Podcasts 154Michela Marchini
S20 — Safety, Alignment & Unlearning
34 Alignment Faking in Language Models is Frequent: Value-focused Diagnosis and Efficient Mitigation 25Inderjeet Jayakumar Nair
35 Anatomy of an Unsafe Thought: Behavior Profiling and Safety Scoring in Reasoning Chains 69Ishita Kakkar
36 From Rubrics to Rewards: Aligning Question Generation for Early Literacy 168Yi-Jyun Sun
37 Prior Beliefs Prejudice LLM-as-Judge: Evidence from Persuasion Evaluation 110Pardis Sadat Zahraei
38 I Am Aligned, But With Whom? Diagnosing Structural Alignment Failures in Multilingual LLMs 107Pardis Sadat Zahraei
39 Learning to Optimize Multi-Objective Alignment Through Dynamic Reward Weighting 2Yining Lu
40 Optimizing Diversity and Quality through Base-Aligned Model Collaboration 91Yichen Wang
41 Community-Driven Model Development is Unreliable and Unsafe 15Jack Sanderson
42 Data Synthesis with Influence Rewarded Models 49Ishika Agarwal
43 How Catastrophic is Your LLM? Certifying Risks in Conversation 173Chengxiao Wang
44 Let there be Frontier Model System Certification 187Isha Chaudhary
45 Geometric-disentanglement Unlearning 127Duo Zhou
46 Knowledge Control for Trustworthy and Responsible AI 14Zheyuan Liu
47 Unlearning-induced Collateral Corruption in Machine Unlearning for LLMs 63Bo Su