Accepted papers

Selection rates for the RANLP 2025 conference

Submitted Papers: 234

Accepted papers total: 174

  • Accepted as Regular papers (30 min oral presentation): 35
  • Accepted as Short papers (20 min oral presentation): 51
  • Accepted for Poster Presentation: 83
  • Accepted as Demo papers (to be presented during poster sessions): 5

Selection Rates:

  • Regular papers: 14.9%
  • Short papers: 36.7%
  • Poster and demo papers: 74.3%
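
The selection rates above appear to be cumulative rather than per-category: each figure matches the number of papers accepted at that level or higher, divided by the 234 submissions and truncated to one decimal place. This reading is inferred from the published counts and is not stated on the page. A minimal Python sketch of that arithmetic, with illustrative names:

```python
# Counts as published on this page.
SUBMITTED = 234
ACCEPTED = {"regular": 35, "short": 51, "poster": 83, "demo": 5}


def truncated_percent(count: int, total: int) -> str:
    """Return count/total as a percentage truncated (not rounded) to one decimal."""
    tenths = (count * 1000) // total  # integer arithmetic avoids float rounding
    return f"{tenths // 10}.{tenths % 10}%"


# Assumption: each rate counts papers accepted at that level or higher.
regular = ACCEPTED["regular"]
regular_or_short = regular + ACCEPTED["short"]
all_accepted = regular_or_short + ACCEPTED["poster"] + ACCEPTED["demo"]

rates = {
    "Regular papers": truncated_percent(regular, SUBMITTED),
    "Short papers": truncated_percent(regular_or_short, SUBMITTED),
    "Poster and demo papers": truncated_percent(all_accepted, SUBMITTED),
}

for label, rate in rates.items():
    print(f"{label}: {rate}")
```

Under this assumption the script reproduces the three published rates, and the four category counts sum to the stated total of 174 accepted papers.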

Regular papers

  • Where and How as Key Factors for Knowledge-Enhanced Constrained Commonsense Generation – Ivan Martinez-Murillo, Paloma Moreda Pozo and Elena Lloret
  • Exposing Pink Slime Journalism: Linguistic Signatures and Robust Detection Against LLM-Generated Threats – Sadat Shahriar, Navid Ayoobi, Arjun Mukherjee, Mostafa Musharrat and Sai Vishnu Vamsi Senagasetty
  • Aspect-Based Sentiment Analysis for Investigating Polarization in YouTube Comments – Daniel Miehling, Daniel Dakota and Sandra Kübler
  • Revisiting the Photobook Task for LLM Grounding Benchmarking – Saki Imai, Mert Inan, Anthony B. Sicilia and Malihe Alikhani
  • Towards CEFR-targeted Text Simplification for Question Adaptation – Luca Benedetto and Paula Buttery
  • Enhancing Transformer-Based Rerankers with Synthetic Data and LLM-Based Supervision – Dimitar Peshevski, Kiril Blazhevski, Martin Popovski and Gjorgji Madjarov
  • Evaluating Transliteration Ambiguity in Adhoc Romanized Sinhala: A Dataset for Transliteration Disambiguation – Sandun Sameera Perera and Deshan Koshala Sumanathilaka
  • When Does Language Transfer Help? Sequential Fine-Tuning for Cross-Lingual Euphemism Detection – Julia Sammartino, Libby Barak, Jing Peng and Anna Feldman
  • An Annotation Scheme for Factuality and its Application to Parliamentary Proceedings – Gili Goldin, Shira Wigderson, Ella Rabinovich and Shuly Wintner
  • The Impact of Named Entity Recognition on Transformer-Based Multi-Label Dietary Recipe Classification – Kemalcan Bora and Horacio Saggion
  • Q&A-LF: A French Question-Answering Benchmark for Measuring Fine-Grained Lexical Knowledge – Alexander Petrov, Alessandra Thais Mancas, Viviane Binet, Antoine Venant, Francois Lareau, Yves Lepage and Phillippe Langlais
  • Can we Predict Innovation? Narrow Experts Versus Competent Generalists – Amir Hazem and Motohashi Kazuyuki
  • Simplifications are Absolutists: How Simplified Language Reduces Word Sense Awareness in LLM-Generated Definitions – Lukas Ellinger, Miriam Anschütz and Georg Groh
  • Decoding Emotion in Ancient Poetry: Leveraging Generative Models for Classical Chinese Sentiment Analysis – Quanqi Du, Loic De Langhe, Els Lefever and Veronique Hoste
  • Aspect–Sentiment Quad Prediction with Distilled Large Language Models – Filippos Karolos Ventirozos, Peter Appleby and Matthew Shardlow
  • Am I Blue or Is My Hobby Counting the Teardrops? Expression Leakage in Large Language Models as a Symptom of Irrelevancy Disruption – Berkay Kopru, Mehrzad Mashal, Yigit Gurses, Akos Kadar, Maximilian Schmitt, Ditty Mathew, Felix Burkhardt, Florian Eyben and Björn W. Schuller
  • Deep Language Geometry: Constructing a Metric Space from LLM Weights – Maksym Shamrai and Vladyslav Hamolia
  • TinyMentalLLMs Enable Depression Detection in Chinese Social Media Texts – Jinyuan Xu, Tian Lan, Mathieu Valette, Pierre Magistry and Lei Li
  • MariATE: Automatic Term Extraction Using Large Language Models in the Maritime Domain – Shijie Liu, Els Lefever and Veronique Hoste
  • Prompt Engineering for Nepali NER: Leveraging Hindi-Capable LLMs for Low-Resource Languages – Dipendra Yadav, Sumaiya Suravee, Stefan Kemnitz, Tobias Strauss and Kristina Yordanova
  • Seeing, Signing, and Saying: A Vision-Language Model-Assisted Pipeline for Sign Language Data Acquisition and Curation from Social Media – Shakib Yazdani, Yasser Hamidullah, Cristina España-Bonet and Josef van Genabith
  • Advancing Clinical Translation in Nepali with Fine-Tuned Multilingual Models – Benyamin Ahmadnia, Sumaiya Shaikh, Shazan Mohammed, Bibek Poudel and Sahar Hooshmand
  • BiGCAT: A Graph-Based Representation Learning Model with LLM Embeddings for Named Entity Recognition – Md. Akram Hossain, Abdul Aziz, Muhammad Anwarul Azim, Abu Nowshed Chy, Md Zia Ullah and Mohammad Khairul Islam
  • Benchmarking Korean Idiom Understanding: A Comparative Analysis of Local and Global Models – Xiaonan Wang, Seoyoon Park and Hansaem Kim
  • Improving Estonian Text Simplification through Pretrained Language Models and Custom Datasets – Eduard Barbu, Meeri-Ly Muru and Sten Marcus Malva
  • FigCaps-HF: A Figure-to-Caption Generative Framework and Benchmark with Human Feedback – Ashish Singh, Ashutosh Singh, Prateek Agarwal, Zixuan Huang, Arpita Singh, Tong Yu, Sungchul Kim, Victor Soares Bursztyn, Nesreen K. Ahmed, Puneet Mathur, Erik Learned-Miller, Franck Dernoncourt and Ryan Rossi
  • Enabling On-Premises Large Language Models for Secure Space Traffic Management – Enrique De Alba
  • Isolating LLM Performance Gains in Pre-training versus Instruction-tuning for Mid-resource Languages: The Ukrainian Benchmark Study – Yurii Paniv
  • SiLVERScore: Why Aren’t Semantically-Aware Embeddings Used for Sign Language Generation Evaluation? – Saki Imai, Mert Inan, Anthony B. Sicilia and Malihe Alikhani
  • A Culturally-Rich Romanian NLP Dataset from “Who Wants to Be a Millionaire?” Videos – Alexandru Ganea, Antonia-Adelina Popovici and Marius Dumitran
  • Dutch CrowS-Pairs: Adapting a Challenge Dataset for Measuring Social Biases in Language Models for Dutch – Elza Strazda and Gerasimos Spanakis
  • GRILE: A Benchmark for Grammar Reasoning and Explanation in Romanian LLMs – Marius Dumitran, Angela Dumitran and Alexandra Mihaela Danila
  • Multi-LLM Text Summarization – Jiangnan Fang, Cheng-Tse Liu, Jieun Kim, Yash Bhedaru, Ethan Liu, Nikhil Singh, Nedim Lipka, Puneet Mathur, Nesreen K. Ahmed, Franck Dernoncourt, Ryan Rossi and Hanieh Deilamsalehy
  • A Survey on Small Language Models – Chien Van Nguyen, Xuan Shen, Ryan Aponte, Yu Xia, Samyadeep Basu, Zhengmian Hu, Jian Chen, Mihir Parmar, Sasidhar Kunapuli, Joe Barrow, Junda Wu, Ashish Singh, Yu Wang, Jiuxiang Gu, Nesreen K. Ahmed, Nedim Lipka, Ruiyi Zhang, Xiang Chen, Tong Yu, Sungchul Kim, Hanieh Deilamsalehy, Namyong Park, Michael Rimer, Zhehao Zhang, Huanrui Yang, Puneet Mathur, Gang Wu, Franck Dernoncourt, Ryan Rossi and Thien Huu Nguyen
  • Quantifying the Overlap: Attribution Maps and Linguistic Heuristics in Encoder-Decoder Machine Translation Models – Aria Nourbakhsh, Salima Lamsiyah and Christoph Schommer

Short papers

  • Evaluating Large Language Models on Multiword Expressions in Multilingual and Code-Switched Contexts – Frances Adriana Laureano De Leon, Harish Tayyar Madabushi and Mark Lee
  • A Deep Dive into Multi-Head Attention and Multi-Aspect Embedding – Filip Ginter, Jenna Kanerva and Maryam Teimouri
  • Analysis of Vocabulary and Subword Tokenization Settings for Optimal Fine-tuning of MT – Javad Pourmostafa Roshan Sharami, Dimitar Shterionov and Pieter Spronck
  • Instruction-Tuning LLaMA for Synthetic Medical Note Generation in Swedish and English – Lotta Kiefer, Jesujoba Alabi, Thomas Vakili, Hercules Dalianis and Dietrich Klakow
  • Detecting Deception in Fake News Across Languages: The Role of Linguistic Markers – Alba Perez-Montero, Silvia Gargova, Elena Lloret and Paloma Moreda Pozo
  • Generating and Analyzing Disfluency in a Code-Mixed Setting – Aryan Paul, Tapabrata Mondal, Dipankar Das and Sivaji Bandyopadhyay
  • PerSpaCor: Correcting Space and ZWNJ Errors in Persian Text with Transformer Models – Matin Ebrahimkhani and Ebrahim Ansari
  • Evaluating Large Language Models on Arabic Dialect Sentiment Analysis – Maram I. Alharbi, Saad Ezzini, Hansi Hettiarachchi, Tharindu Ranasinghe and Ruslan Mitkov
  • Advancing Active Learning with Ensemble Strategies – Naif Alatrush, Sultan Alsarra, Afraa Alshammari, Luay Abdeljaber, Niamat Zawad, Javier Osorio, Latifur Khan, Patrick T. Brandt and Vito D’Orazio
  • Top Ten From Lakhs: A Transformer-based Retrieval System for Identifying Previously Fact-Checked Claims Across Multiple Languages – Srijani Debnath, Pritam Pal and Dipankar Das
  • Towards Intention-aligned Reviews Summarization: Enhancing LLM Outputs with Pragmatic Cues – María Miró Maestre, Robiert Sepúlveda-Torres, Ernesto Luis Estevanell-Valladares, Armando Suárez Cueto and Elena Lloret
  • ExPe: Exact Positional Encodings for Generative Transformer Models with Extrapolating Capabilities – Aleksis Ioannis Datseris, Sylvia Vassileva, Ivan K. Koychev and Svetla Boytcheva
  • The Hidden Cost of Structure: How Constrained Decoding Affects Language Model Performance – Maximilian Schall and Gerard de Melo
  • Revealing Gender Bias in Language Models through Fashion Image Captioning – Maria Villalba-Oses, Victoria Muñoz-Garcia and Juan Pablo Consuegra-Ayala
  • Investigating Large Language Models (LLMs) Capabilities for Sexism Detection on a Low-Resource Language – Lutfiye Seda Mut Altin and Horacio Saggion
  • Domain Knowledge Distillation for Multilingual Sentence Encoders in Cross-lingual Sentence Similarity Estimation – Risa Kondo, Hiroki Yamauchi, Tomoyuki Kajiwara, Marie Katsurai and Takashi Ninomiya
  • Contextual Cues in Machine Translation: Investigating the Potential of Multi-Source Input Strategies in LLMs and NMT Systems – Lia Shahnazaryan, Patrick Simianer and Joern Wuebker
  • Event Extraction for Bulgarian with LLMs – Kiril Simov, Nikolay Paev, Petya Osenova and Stefan Marinov
  • GPT-Based Lexical Simplification for Multi-Word Expressions Using Prompt Engineering – Sardar Khan Khayamkhani and Matthew Shardlow
  • An Annotated Dataset of Emotions and Hope Speech in English and Arabic – Wajdi Zaghouani and Md Rafiul Biswas
  • An Annotated Corpus of Arabic Tweets for Hate Speech Analysis – Wajdi Zaghouani and Md Rafiul Biswas
  • The Evaluation of Medical Terms Complexity Using Lexical Features and Large Language Models – Liliya Makhmutova, Giancarlo Dondoni Salton, Fernando Perez-Tellez and Robert J. Ross
  • Branching Out: Exploration of Chinese Dependency Parsing with Fine-tuned Large Language Models – He Zhou, Emmanuele Chersoni and Yu-Yin Hsu
  • Subtle Shifts, Significant Threats: Leveraging XAI methods and LLMs to undermine Language Models Robustness – Adrián Moreno Muñoz, L. Alfonso Ureña-López and Eugenio Martínez Cámara
  • Multi-LLM Debiasing Framework – Deonna M. Owens, Ryan Rossi, Sungchul Kim, Tong Yu, Franck Dernoncourt, Xiang Chen, Ruiyi Zhang, Jiuxiang Gu, Hanieh Deilamsalehy and Nedim Lipka
  • Personalized Author Obfuscation with Large Language Models – Mohammad Shokri
  • AntiSemRO: Studying the Romanian expression of Antisemitism – Anca Dinu, Andreea C. Moldovan and Adina Marincea
  • Visual Priming Effect on Large-scale Vision Language Models – Daiki Yoshida, Haruki Sakajo, Kazuki Hayashi, Yusuke Sakai, Hidetaka Kamigaito, Katsuhiko Hayashi and Taro Watanabe
  • Zero-shot OCR Accuracy of Low-Resourced Languages: A Comparative Analysis on Sinhala and Tamil – Nevidu Jayatilleke and Nisansa de Silva
  • Evaluation of Pretrained and Instruction-Based Pretrained Models for Emotion Detection in Arabic Social Media Text – Md Rafiul Biswas, Shimaa Ibrahim, Mabrouka Bessghaier and Wajdi Zaghouani
  • Evaluating LLMs on Deceptive Text Across Cultures – Katerina Papantoniou, Panagiotis Papadakos and Dimitris Plexousakis
  • Performance Gaps in Acted and Naturalistic Speech: Insights from Speech Emotion Recognition Strategies on Customer Service Calls – Lily Kawaoto, Hita Gupta, Ning Yu and Daniel Dakota
  • KoWit-24: A Richly Annotated Dataset of Wordplay in News Headlines – Alexander Baranov, Anna Palatkina, Yulia Makovka and Pavel Braslavski
  • Alankaar: A Dataset for Figurativeness Understanding in Bangla – Geetanjali Rakshit
  • ASQ: Automatically Generating Question-Answer Pairs using AMRs – Geetanjali Rakshit and Jeffrey Flanigan
  • Reverse Prompting: A Novel Computational Paradigm in Schizophrenia based on Large Language Models – Ivan Nenchev, Christiane Montag and Sandra Anna Just
  • Toward True Neutrality: Evaluating Inference-Time Debiasing Strategies for Gender Coreference Resolution in LLMs – Arati Mohapatra and S Jaya Nirmala
  • LLM-based Embedders for Prior Case Retrieval – Damith Premasiri, Tharindu Ranasinghe and Ruslan Mitkov
  • Fusion of Object-Centric and Linguistic Features for Domain-Adapted Multimodal Learning – Jordan Konstantinov Kralev
  • Fast Thinking with Structured Prompts: Enabling LLM Reasoning Without Chain-of-Thought Generation – Kirill Morozov, Liubov Chubarova and Irina Piontkovskaya
  • Efficient Financial Fraud Detection on Mobile Devices Using Lightweight Large Language Models – Lakpriya Senevirathna and Deshan Koshala Sumanathilaka
  • Alignment of Historical Manuscript Transcriptions and Translations – Maarten Janssen, Piroska Lendvai and Anna Jouravel
  • Automatic Correction of Writing Anomalies in Hausa Texts – Ahmad Mustapha Wali and Sergiu Nisioi
  • Multi-LLM Verification for Question Answering under Conflicting Contexts – Geetanjali Rakshit and Jeffrey Flanigan
  • Toponym Resolution: Will prompt engineering change expectations? – Isuri Anuradha, Deshan Koshala Sumanathilaka, Ruslan Mitkov and Paul Rayson
  • AIDEN: Automatic Speaker Notes Creation and Navigation for Enhancing Online Learning Experience – Stalin Varanasi, Umer Butt, Guenter Neumann and Josef van Genabith
  • Cross-Lingual Fact Verification: Analyzing LLMs Performance Patterns Across Languages – Hanna Shcharbakova, Tatiana Anikina, Natalia Skachkova and Josef van Genabith
  • Beyond Methods and Datasets: Introducing SH-NER for Hardware and Software Entity Recognition in Scientific Text – Aftab Anjum, Nimra Maqbool and Ralf Krestel
  • Trust but Verify: A Comprehensive Survey of Faithfulness Evaluation Methods in Abstractive Text Summarization – Salima Lamsiyah, Aria Nourbakhsh and Christoph Schommer
  • The Illusion of a Perfect Metric: Why Evaluating AI's Words Is Harder Than It Looks – Maria Paz Oliva, Adriana D. Correia, Ivan Vankov and Viktor Botev

Poster papers

  • On the Limitations of Large Language Models (LLMs): False Attribution – Tosin Adewumi, Nudrat Habib, Lama Alkhaled and Elisa Barney
  • Forecasting Online Negativity Spikes with Multilingual Transformers for Strategic Decision-Making – Rowan Martnishn, Varun Kadari, Shravan Athikinasetti, Vishal Green, Zach Miller, Julia Brady, Viraj Chawda and Nikhil Badlani
  • Building a Clean Bartangi Language Corpus and Training Word Embeddings for Low-Resource Language Modeling – Warda Tariq
  • Recognizing the Structure and Content of Hungarian Civil Registers – Kata Ágnes Szűcs and Noémi Vadász
  • Output trend analysis in semantic classification of katakana words using a large language model – Kazuki Kodaki and Minoru Sasaki
  • Exploiting Primacy Effect To Improve Large Language Models – Bianca Raimondi and Maurizio Gabbrielli
  • Comparative Analysis of Human and Large Language Model Performance in Pharmacology Multiple-Choice Questions – Ricardo Rodriguez, Stéphane Huet, Benoit Favre and Mickael Rouvier
  • Evaluating Bilingual Lexicon Induction without Lexical Data – Michaela Denisová and Pavel Rychly
  • Candidate Profile Evaluation – A RAG Approach with Synthetic Data Generation for Tech Jobs – Anum Afzal, Ishwor Subedi and Florian Matthes
  • Reversing Causal Assumptions: Explainability in Online Sports Dialogues – Asteria Kaeberlein and Malihe Alikhani
  • How LLMs Influence Perceived Bias in Journalism – Asteria Kaeberlein and Malihe Alikhani
  • Task-Oriented Dialogue Systems through Function Calling – Tiziano Labruna, Giovanni Bonetta and Bernardo Magnini
  • When to Retrieve: Teaching LLMs to Utilize Information Retrieval Effectively – Tiziano Labruna, Jon Ander Campos and Gorka Azkune
  • Integrating Large Language Models for Comprehensive Study and Sentiment Analysis of Student Feedback – Jana Kuzmanova, Katerina Zdravkova and Ivan Chorbev
  • A Question-Answering Based Framework/Metric for Evaluation of Newspaper Article Summarization – Vasanth Seemakurthy, Shashank Sundar, Siddharth Arvind, Siddhant Jagdish and Ashwini M. Joshi
  • A Low-Resource Speech-Driven NLP Pipeline for Sinhala Dyslexia Assistance – Peshala Sandali Perera and Deshan Koshala Sumanathilaka
  • Strategies for Efficient Retrieval-augmented Generation in Clinical Domains with RAPTOR: A Benchmarking Study – Xumou Zhang, Qixuan Hu, Jinman Kim and Adam Dunn
  • Legal Terminology Extraction in Spanish: Gold-standard Generation and LLM Evaluation – Lucia Palacios Palacios, Beatriz Guerrero García, Patricia Martín Chozas and Elena Montiel Ponsoda
  • PoliStance-TR: A Dataset for Turkish Stance Detection in Political Domain – Muhammed Cihat Unal, Yasemin Sarkın, Alper Karamanlioglu and Berkan Demirel
  • Integrating Archaic and Regional Lexicons into a Preliminary Readability Formula for Romanian – Madalina Chitez, Mihai Dascalu, Roxana Rogobete, Cristina Aura Udrea, Karla Csürös and Ana-Maria Bucur
  • Using LLMs for Multilingual Clinical Entity Linking to ICD-10 – Sylvia Vassileva, Ivan K. Koychev and Svetla Boytcheva
  • Towards Safer Hebrew Communication: A Dataset for Offensive Language Detoxification – Natalia Vanetik, Lior Liberov, Marina Litvak and Chaya Liebeskind
  • Classifying Emotions in Tweets from the Financial Market: A BERT-based approach – Wesley Pompeu Carvalho and Norton Trevisan Roman
  • Annotating Hate Speech Towards Identity Groups – Donnie Parent, Nina Georgiades, Charvi Mishra, Khaled Mohammed and Sandra Kübler
  • On the Interaction of Identity Hate Classification and Data Bias – Donnie Parent, Nina Georgiades, Charvi Mishra, Khaled Mohammed and Sandra Kübler
  • From Posts to Predictions: A User-Aware Framework for Faithful and Transparent Detection of Mental Health Risks on Social Media – Hessam Amini and Leila Kosseim
  • LLM-Based Product Recommendation with Prospect Theoretic Self Alignment Strategy – Manying Zhang, Zehua Cheng and Damien Nouvel
  • Multilingual Pre-training Meets Supervised Neural Machine Translation: A Reproducible Evaluation on English–French and Finnish Translation – Benyamin Ahmadnia, Yeswanth Soma and Hossein Sarrafzadeh
  • Exploring the Performance of Large Language Models for Event Detection and Extraction in the Domain of Health – Hristo Tanev, Nicolas Stefanovitch, Tomáš Harmatha and Diana Francisco De Sousa
  • Enhancing Textual Understanding: Automated Claim Span Identification in English, Hindi, Bengali and CodeMix – Rudra Roy, Pritam Pal, Dipankar Das, Saptarshi Ghosh and Biswajit Paul Paul
  • Toward Quantum-Enhanced Natural Language Understanding: Sarcasm and Claim Detection with QLSTM – Pritam Pal and Dipankar Das
  • Linguistic Complexity and Socio-cultural Patterns in Hip-Hop Lyrics – Aayam Bansal, Raghav Agarwal and Kaashvi Jain
  • Leveraging LLaMa for Malayalam Text Summarisation: An Experimental Study – Hristo Tanev, Anitha S. Pillai and Revathy V. R.
  • EDAudio: Easy Data Augmentation for Dialectal Audio – Lea Fischbach, Akbar Karimi, Alfred Lameli and Lucie Flek
  • Detecting Gender Stereotypical Language using Model-agnostic and Model-specific Explanations – Manuela Nayantara Jeyaraj and Sarah Jane Delany
  • QuARK: LLM-Based Domain-Specific Question Answering using Retrieval Augmented Generation and Knowledge Graphs – Edward Burgin, Sourav Dutta and Mingxue Wang
  • From Courtroom to Corpora: Building a Name Entity Corpus for Urdu Legal Texts – Adeel Zafar, Sohail Ashraf and Slawomir Nowaczyk
  • Financial News as a Proxy of European Central Bank Interest Rate Adjustments – Davide Paris, Martina Menzio and Elisabetta Fersini
  • LLM Compression: How Far Can We Go in Balancing Size and Performance? – Sahil Sk, Debashish Dhal, Sonal Khosla, Akash Dhaka, Shantipriya Parida, Sk Shahid, Sambit Shekhar, Dilip Prasad and Ondrej Bojar
  • chakoshi: A Customizable Guardrail for LLMs with a Focus on Japanese-Language Moderation – Ryota Matsui, Kazuhiro Arai, Kenji Miyama, Yudai Yamamoto, Kaito Sugimoto and Yoshimasa Iwase
  • Instruction Finetuning to Attribute Language Stage, Dialect and Provenance Region to Historical Church Slavic Texts – Piroska Lendvai, Uwe Reichel, Anna Jouravel, Achim Rabus and Elena Renje
  • C-SHAP: Collocation-Aware Explanations for Financial NLP – Martina Menzio, Elisabetta Fersini and Davide Paris
  • Named Entity Recognition and Relation Extraction for better Gut-Brain Interplay Understanding – Aleksis Ioannis Datseris, Mario Kuzmanov, Ivelina Nikolova-Koleva and Svetla Boytcheva
  • Balancing the Scales: Addressing Gender Bias in Social Media Toxicity Detection – Beatriz Botella-Gil, Juan Pablo Consuegra-Ayala, Alba Bonet-Jover and Paloma Moreda-Pozo
  • Authorship Verification Using Cloze Test with Large Language Models – Tomáš Foltýnek, Tomáš Kancko and Pavel Rychly
  • The challenge of performing named entity recognition in real-world unstructured textual data from the domain of dementia – Sumaiya Suravee and Kristina Yordanova
  • Detecting Fake News in the Era of Language Models – Muhammad Irfan Fikri Sabri, Hansi Hettiarachchi and Tharindu Ranasinghe
  • Unsupervised Mixed-language Multi-document Summarisation – Anushiya Thevapalan and Nisansa de Silva
  • Reddit-V: A Virality Prediction Dataset and Zero-Shot Evaluation with Large Language Models – Samir El-amrany, Matthias R. Brust, Salima Lamsiyah and Pascal Bouvry
  • Utilizing Large Language Models for Focused Conversational Assistants – Shruti Dhavalikar and Karthika Vijayan
  • APIO: Automatic Prompt Induction and Optimization for Grammatical Error Correction and Text Simplification – Artem Chernodub, Aman Saini, Yejin Huh, Vivek Kulkarni and Vipul Raheja
  • Pushing the (Generative) Envelope: Measuring the Effect of Prompt Technique and Temperature on the Generation of Model-based Systems Engineering Artifacts – Erin Smith Crabb, Cedric Bernard, Matthew Jones and Daniel Dakota
  • Demographic Features for Annotation-Aware Classification – Narjes Tahaei and Sabine Bergler
  • The Erosion of LLM Signatures: Can We Still Distinguish Human and LLM-Generated Scientific Ideas After Iterative Paraphrasing? – Sadat Shahriar, Navid Ayoobi and Arjun Mukherjee
  • Multi-Agent Reinforcement Learning for Interactive Code Debugging with Human Feedback and Memory – Anjana Krishnamoorthy, Kartik Ivatury and Benyamin Ahmadnia
  • Differential Robustness in Transformer Language Models: Empirical Evaluation Under Adversarial Text Attacks – Taniya Gidatkar, Oluwaseun Ajao and Matthew Shardlow
  • MLDataForge: Accelerating Large-Scale Dataset Preprocessing and Access for Multimodal Foundation Model Training – Andrea Blasi Núñez, Lukas Paul Achatius Galke and Peter Schneider-Kamp
  • PolyHope-M at RANLP2025 Subtask-1 Binary Hope Speech Detection: Spanish Language Classification Approach with Comprehensive Learning using Transformer, and Traditional ML, and DL – Md. Julkar Naeen, Sourav Kumar Das and Sharun Akter Khushbu
  • Detecting the Change of Mental Health Status via Reddit Posts in Response to Global Negative Events – Zenan Chen, Judita Preiss and Peter A. Bath
  • Modelling the Relative Contributions of Stylistic Features in Forensic Authorship Attribution – G. Çağatay Sat, John Blake and Evgeny Pyshkin
  • Benchmarking Item Difficulty Classification in German Vocational Education and Training – Alonso Palomino and Benjamin Paassen
  • Exploring the Usage of Knowledge Graphs in Identifying Human and LLM-Generated Fake Reviews – Ming Liu and Massimo Poesio
  • Mitigating Bias in Text Classification via Prompt-Based Text Transformation – Charmaine Barker and Dimitar Kazakov
  • Synthetic vs. Gold: The Role of LLM-Generated Labels and Data in Cyberbullying Detection – Arefeh Kazemi, Sri Balaaji Natarajan Kalaivendan, Joachim Wagner, Hamza Qadeer, Kanishk Verma and Brian Davis
  • From the Tractatus Logico-Philosophicus to Later Wittgenstein: A NLP-Based Comparative Analysis – Andreiana Mihail, Silviu-Florin Gheorghe, Andrei Fotea and Liviu P. Dinu
  • Improving LLM Generalisation for Cyberbullying Detection via Aggression-Enhanced Prompt Engineering – Aisha Saeid, Anu Sabu, Girish Koushik, Ferrante Neri and Diptesh Kanojia
  • Harnessing Open-Source LLMs for Tender Named Entity Recognition – Asim Abbas, Venelin Kovatchev, Mark Lee, Niloofer Shanavas and Mubashir Ali
  • Cross-Lingual Learning for Fake News Detection: A Multilingual Study in the COVID-19 Health Domain – Ezequias de Oliveira Rocha, Francisco Igor de Lima Mendes, Cláudio de Souza Baptista and André Luiz Firmino Alves
  • Semantic clustering of conditions in Brazilian environmental operation licenses – Lívia Aniely de Oliveira Almeida, Cláudio de Souza Baptista and André Luiz Firmino Alves
  • Graph-based RAG for Low-Resource Aromanian–Romanian Translation – Laurentiu G. Ghetoiu and Sergiu Nisioi
  • A linguistically-informed comparison between multilingual BERT and language-specific BERT models: The case of differential object marking in Romanian – Maria Tepei and Jelke Bloem
  • Can LLMs Disambiguate Grounded Language? The Case of PP Attachment – John Blackmore and Matthew Stone
  • Optimism, Pessimism, and the Language Between: Model Interpretability And Psycholinguistic Profiling – Stefana Arina Tabusca and Liviu P. Dinu
  • F-LoRA-QA: Finetuning LLaMA Models with Low-Rank Adaptation for French Botanical Question Generation and Answering – Ayoub Nainia, Régine Vignes-Lebbe, Hajar Mousannif and Jihad Zahir
  • ILID: Native Script Language Identification for Indian Languages – Yash Ingle and Pruthwik Mishra
  • ESAQueryRank: Ranking Query Interpretations for Document Retrieval Using Explicit Semantic Analysis – Avijeet Shil and Wei Jin
  • Prompting Techniques for Reducing Social Bias in LLMs through System 1 and System 2 Cognitive Processes – Mahammed Kamruzzaman and Gene Louis Kim
  • HoloBERT: Enhancing Transformer Models for Oral and Transcribed Data through Advanced Pre-training and Fine-tuning Techniques – Isuri Anuradha, Le An Ha and Ruslan Mitkov
  • Lingdex.org: Leveraging LLMs to Structure and Explore Linguistic Olympiad Puzzles for Learning and Teaching Linguistics – Jonathan Sakunkoo and Annabella Sakunkoo
  • Arabic to Romanian Machine Translation: A Case Study on Distant Language Pairs – Ioan Alexandru Hirica, Stefana Arina Tabusca and Sergiu Nisioi
  • Towards a Map of Related Words in Romance Languages – Liviu P. Dinu, Ana Sabina Uban, Ioan-Bogdan Iordache, Claudia Vlad, Simona Georgescu, Laurentiu Zoicas and Anca Dinu
  • A Framework for Fine-Tuning LLMs using Heterogeneous Feedback – Ryan Aponte, Ryan A. Rossi, Shunan Guo, Franck Dernoncourt, Tong Yu, Xiang Chen, Subrata Mitra and Nedim Lipka
  • A Novel Context Intensive Graph-based Multimodal Fusion Technique – Raj Ratn Pranesh and Sumit Kumar
  • PersianSciQA: A new Dataset for Bridging the Language Gap in Scientific Question Answering – Safoura Aghadavoud Jolfaei, Azadeh Mohebi and Zahra Hemmat

Demo papers

  • T2Know: Analysis and Trend Platform using the Knowledge Extracted from Scientific Texts – Rafael Muñoz Guillena, Manuel Palomar, Yoan Gutiérrez and Mar Bonora
  • FreeTxt: Analyse and Visualise Multilingual Qualitative Survey Data for Cultural Heritage sites – Nouran Khallaf, Ignatius Ezeani, Dawn Knight, Paul Rayson, Mo El-Haj, John Vidler, James Davies and Fernando Alva-Manchego
  • SENTimental – A Simple Multilingual Sentiment Collection Tool – John Vidler, Paul Rayson and Dawn Knight
  • Anonymise: A Tool for Multilingual Document Pseudonymisation – Rinalds Vīksna and Inguna Skadiņa
  • “Simple-Tool”: A Tool for the Automatic Transformation of Spanish Texts into Easy-to-Read – Beatriz Botella-Gil, Isabel Espinosa-Zaragoza, Paloma Moreda Pozo and Manuel Palomar