• Resource papers

● T^2Ranking: A large-scale Chinese Benchmark for Passage Ranking
Xiaohui Xie, Qian Dong, Bingning Wang, Feiyang Lv, Ting Yao, Weinan Gan, Zhijing Wu, Xiangsheng Li, Haitao Li, Yiqun Liu, Jin Ma

● BizGraphQA: A Dataset for Image-based Inference over Graph-structured Diagrams from Business Domains
Petr Babkin, William Watson, Zhiqiang Ma, Lucas Cecchi, Natraj Raman, Armineh Nourbakhsh, Sameena Shah

● Towards Building Voice-based Conversational Recommender Systems: Datasets, Potential Solutions and Prospects
Xinghua Qu, Hongyang Liu, Zhu Sun, Xiang Yin, Yew Soon Ong, Lu Lu, Zejun Ma

● SocialDial: A Benchmark for Socially-Aware Dialogue Systems
Haolan Zhan, Zhuang Li, Yufei Wang, Linhao Luo, Tao Feng, Xiaoxi Kang, Yuncheng Hua, Lizhen Qu, Lay Ki Soon, Suraj Sharma, Ingrid Zukerman, Zhaleh Semnani-Azad, Gholamreza Haffari

● U-NEED: A Fine-grained Dataset for User Needs-Centric E-commerce Conversational Recommendation
Yuanxing Liu, Weinan Zhang, Baohua Dong, Yan Fan, Hang Wang, Fan Feng, Yifan Chen, Ziyu Zhuang, Hengbin Cui, Yongbin Li, Wanxiang Che

● End-to-End Multimodal Fact-Checking and Explanation Generation: A Challenging Dataset and Models
Barry Menglong Yao, Aditya Shah, Lichao Sun, Jin-Hee Cho, Lifu Huang

● Recipe-MPR: A Test Collection for Evaluating Multi-aspect Preference-based Natural Language Retrieval
Haochen Zhang, Anton Korikov, Parsa Farinneya, Mohammad Mahdi Abdollah Pour, Manasa Bharadwaj, Ali Pesaranghader, Xi Yu Huang, Yi Xin Lok, Zhaoqi Wang, Nathan Jones, Scott Sanner

● Beyond Single Items: Exploring User Preferences in Item Sets with the Conversational Playlist Curation Dataset
Arun Tejasvi Chaganty, Megan Leszczynski, Shu Zhang, Ravi Ganti, Krisztian Balog, Filip Radlinski

● Introducing MBIB – the first Media Bias Identification Benchmark Task and Dataset Collection
Martin Wessel, Tomáš Horych, Terry Ruas, Akiko Aizawa, Bela Gipp, Timo Spinde

● MG-ShopDial: A Multi-Goal Conversational Dataset for e-Commerce
Nolwenn Bernard, Krisztian Balog

● Towards Explainable Conversational Recommender Systems
Shuyu Guo, Shuo Zhang, Weiwei Sun, Pengjie Ren, Zhumin Chen, Zhaochun Ren

● The JOKER Corpus: English–French Parallel Data for Multilingual Wordplay Recognition
Liana Ermakova, Anne-Gwenn Bosser, Adam Jatowt, Tristan Miller

● Form-NLU: Dataset for the Form Natural Language Understanding
Yihao Ding, Siqu Long, Jiabin Huang, Kaixuan Ren, Xingxiang Luo, Hyunsuk Chung, Soyeon Caren Han

● MMEAD: MS MARCO Entity Annotations and Disambiguations
Chris Kamphuis, Aileen Lin, Siwen Yang, Jimmy Lin, Arjen P. de Vries, Faegheh Hasibi

● The Information Retrieval Experiment Platform
Maik Fröbe, Jan Heinrich Reimer, Sean MacAvaney, Niklas Deckers, Simon Reich, Janek Bevendorff, Benno Stein, Matthias Hagen, Martin Potthast

● Towards a More User-Friendly and Easy-to-Use Benchmark Library for Recommender Systems
Lanling Xu, Zhen Tian, Gaowei Zhang, Junjie Zhang, Lei Wang, Bowen Zheng, Yifan Li, Jiakai Tang, Zeyu Zhang, Yupeng Hou, Xingyu Pan, Wayne Xin Zhao, Xu Chen, Ji-Rong Wen

● The Archive Query Log: Mining Millions of Search Result Pages of Hundreds of Search Engines from 25 Years of Web Archives
Jan Heinrich Reimer, Sebastian Schmidt, Maik Fröbe, Lukas Gienapp, Harrisen Scells, Benno Stein, Matthias Hagen, Martin Potthast

● GammaGL: A Multi-Backend Library for Graph Neural Networks
Yaoqi Liu, Cheng Yang, Tianyu Zhao, Hui Han, Siyuan Zhang, Jing Wu, Guangyu Zhou, Hai Huang, Hui Wang, Chuan Shi

● tieval: An Evaluation Framework for Temporal Information Extraction Systems
Hugo Sousa, Ricardo Campos, Alípio Mário Jorge

● HC3: A Suite of Test Collections for CLIR Evaluation over Informal Text
Dawn Lawrie, James Mayfield, Douglas W. Oard, Eugene Yang, Suraj Nair, Petra Galuščáková

● RecStudio: Towards a Highly-Modularized Recommender System
Defu Lian, Xu Huang, Xiaolong Chen, Jin Chen, Yankai Wang, Haoran Jin, Rui Fan, Xingmei Wang, Zheng Liu, Le Wu, Enhong Chen

● MR2: A Benchmark for Multimodal Retrieval-Augmented Rumor Detection in Social Media
Xuming Hu, Zhijiang Guo, Junzhe Chen, Lijie Wen, Philip Yu

● BioSift: A Dataset for Filtering Biomedical Abstracts for Drug Repurposing and Clinical Meta-Analysis
David Kartchner, Irfan Al-Hussaini, Haydn Turner, Jennifer Deng, Shubham Lohiya, Prasanth Bathala, Cassie Mitchell

● MoocRadar: A Fine-grained and Multi-aspect Knowledge Repository for Improving Cognitive Student Modeling in MOOCs
Jifan Yu, Mengying Lu, Qingyang Zhong, Zijun Yao, Shangqing Tu, Zhengshan Liao, Xiaoya Li, Manli Li, Lei Hou, Hai-Tao Zheng, Juanzi Li, Jie Tang

● RL4RS: A Real-World Dataset for Reinforcement Learning based Recommender System
Kai Wang, Zhene Zou, Minghao Zhao, Qilin Deng, Yue Shang, Yile Liang, Runze Wu, Xudong Shen, Tangjie Lyu, Changjie Fan

● JDsearch: A Personalized Product Search Dataset with Real Queries and Full Interactions
Jiongnan Liu, Zhicheng Dou, Guoyu Tang, Sulong Xu

● iQPP: A Benchmark for Image Query Performance Prediction
Eduard Poesina, Radu Tudor Ionescu, Josiane Mothe

● SPRINT: A Unified Toolkit for Evaluating and Demystifying Zero-shot Neural Sparse Retrieval
Nandan Thakur, Kexin Wang, Iryna Gurevych, Jimmy Lin

● AToMiC: An Image/Text Retrieval Test Collection to Support Multimedia Content Creation
Jheng-Hong Yang, Carlos Lassance, Stéphane Clinchant, Rafael Sampaio De Rezende, Miriam Redi, Krishna Srinivasan, Jimmy Lin

● DICE: a Dataset of Italian Crime Event news
Giovanni Bonisoli, Maria Pia Di Buono, Laura Po, Federica Rollo

● BDI-Sen: A Sentence Dataset for Clinical Symptoms of Depression
Anxo Pérez, Javier Parapar, Álvaro Barreiro, Silvia López-Larrosa

● MobileRec: A Large Scale Dataset for Mobile Apps Recommendation
M.H. Maqbool, Umar Farooq, Adib Mosharrof, A.B. Siddique, Hassan Foroosh

● MythQA: Query-Based Large-Scale Check-Worthy Claim Detection through Multi-Answer Open-Domain Question Answering
Yang Bai, Anthony Colas, Daisy Zhe Wang

● RiverText: A Python Library for Training and Evaluating Incremental Word Embeddings from Text Data Streams
Gabriel Iturra-Bocaz, Felipe Bravo-Marquez

● FedAds: A Benchmark for Privacy-Preserving CVR Estimation with Vertical Federated Learning
Penghui Wei, Hongjian Dou, Shaoguo Liu, Rongjun Tang, Li Liu, Liang Wang, Bo Zheng

● The BETTER Cross-Language Datasets
Ian Soboroff

● REFinD: Relation Extraction Financial Dataset
Simerjot Kaur, Charese Smiley, Akshat Gupta, Joy Sain, Dongsheng Wang, Suchetha Siddagangappa, Toyin Aguda, Sameena Shah

● Linked-DocRED – Enhancing DocRED with Entity-Linking to Evaluate End-To-End Document-Level Information Extraction Pipelines
Pierre-Yves Genest, Pierre-Edouard Portier, Elöd Egyed-Zsigmond, Martino Lovisetto

● DECAF: A Modular and Extensible Conversational Search Framework
Marco Alessio, Guglielmo Faggioli, Nicola Ferro

● LongEval-Retrieval: French-English Dynamic Test Collection for Continuous Web Search Evaluation
Petra Galuščáková, Romain Deveaud, Gabriela González Sáez, Philippe Mulhem, Lorraine Goeuriot, Martin Popel, Florina Piroi