| Multi-LLM and LLM-as-a-Judge Scenarios |
| Room – Cherno more hall; Chair – Salima Lamsiya |
| 15.00-15.20 | Multi-LLM Debiasing Framework (Deonna M. Owens, Ryan Rossi, Sungchul Kim, Tong Yu, Franck Dernoncourt, Xiang Chen, Ruiyi Zhang, Jiuxiang Gu, Hanieh Deilamsalehy and Nedim Lipka) |
| 15.20-15.40 | Multi-LLM Verification for Question Answering under Conflicting Contexts (Geetanjali Rakshit and Jeffrey Flanigan) |
| 15.40-16.00 | The Illusion of a Perfect Metric: Why Evaluating AI´S Words Is Harder than It Looks (Maria Paz Oliva, Adriana D. Correia, Ivan Vankov and Viktor Botev) |