Daily Papers
Qwen2.5-VL Technical Report
Shuai Bai, Keqin Chen, Xuejing Liu, Jialin Wang, Wenbin Ge, Sibo Song, Kai Dang, Peng Wang, Shijie Wang, Jun Tang, Humen Zhong, Yuanzhi Zhu, Mingkun Yang, Zhaohai Li, Jianqiang Wan, Pengfei Wang, Wei Ding, Zheren Fu, Yiheng Xu, Jiabo Ye, Xi Zhang, Tianbao Xie, Zesen Cheng, Hang Zhang, Zhibo Yang, Haiyang Xu, Junyang Lin•Feb 19, 2025•823
RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based
Reinforcement Learning
Hao Gao, Shaoyu Chen, Bo Jiang, Bencheng Liao, Yiang Shi, Xiaoyang Guo, Yuechuan Pu, Haoran Yin, Xiangyu Li, Xinbang Zhang, Ying Zhang, Wenyu Liu, Qian Zhang, Xinggang Wang•Feb 18, 2025•291
SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song
Generation
Zihan Liu, Shuangrui Ding, Zhixiong Zhang, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Dahua Lin, Jiaqi Wang•Feb 18, 2025•251
MoM: Linear Sequence Modeling with Mixture-of-Memories
Jusen Du, Weigao Sun, Disen Lan, Jiaxi Hu, Yu Cheng•Feb 19, 2025•201
Craw4LLM: Efficient Web Crawling for LLM Pretraining
Shi Yu, Zhiyuan Liu, Chenyan Xiong•Feb 19, 2025•191
LongPO: Long Context Self-Evolution of Large Language Models through
Short-to-Long Preference Optimization
Guanzheng Chen, Xin Li, Michael Qizhe Shieh, Lidong Bing•Feb 19, 2025•181
Small Models Struggle to Learn from Strong Reasoners
Yuetai Li, Xiang Yue, Zhangchen Xu, Fengqing Jiang, Luyao Niu, Bill Yuchen Lin, Bhaskar Ramasubramanian, Radha Poovendran•Feb 17, 2025•152
Autellix: An Efficient Serving Engine for LLM Agents as General Programs
Michael Luo, Xiaoxiang Shi, Colin Cai, Tianjun Zhang, Justin Wong, Yichuan Wang, Chi Wang, Yanping Huang, Zhifeng Chen, Joseph E. Gonzalez, Ion Stoica•Feb 19, 2025•141
SearchRAG: Can Search Engines Be Helpful for LLM-based Medical Question
Answering?
Yucheng Shi, Tianze Yang, Canyu Chen, Quanzheng Li, Tianming Liu, Xiang Li, Ninghao Liu•Feb 18, 2025•91
Presumed Cultural Identity: How Names Shape LLM Responses
Siddhesh Pawar, Arnav Arora, Lucie-Aimée Kaffee, Isabelle Augenstein•Feb 17, 2025•81
Why Safeguarded Ships Run Aground? Aligned Large Language Models' Safety
Mechanisms Tend to Be Anchored in The Template Region
Chak Tou Leong, Qingyu Yin, Jian Wang, Wenjie Li•Feb 19, 2025•81
Thinking Preference Optimization
Wang Yang, Hongye Jin, Jingfeng Yang, Vipin Chaudhary, Xiaotian Han•Feb 17, 2025•82
AdaptiveStep: Automatically Dividing Reasoning Step through Model
Confidence
Yuliang Liu, Junjie Lu, Zhaoling Chen, Chaofeng Qu, Jason Klein Liu, Chonghan Liu, Zefan Cai, Yunhui Xia, Li Zhao, Jiang Bian, Chuheng Zhang, Wei Shen, Zhouhan Lin•Feb 19, 2025•61
MMTEB: Massive Multilingual Text Embedding Benchmark
Kenneth Enevoldsen, Isaac Chung, Imene Kerboua, Márton Kardos, Ashwin Mathur, David Stap, Jay Gala, Wissam Siblini, Dominik Krzemiński, Genta Indra Winata, Saba Sturua, Saiteja Utpala, Mathieu Ciancone, Marion Schaeffer, Gabriel Sequeira, Diganta Misra, Shreeya Dhakal, Jonathan Rystrøm, Roman Solomatin, Ömer Çağatan, Akash Kundu, Martin Bernstorff, Shitao Xiao, Akshita Sukhlecha, Bhavish Pahwa, Rafał Poświata, Kranthi Kiran GV, Shawon Ashraf, Daniel Auras, Björn Plüster, Jan Philipp Harries, Loïc Magne, Isabelle Mohr, Mariya Hendriksen, Dawei Zhu, Hippolyte Gisserot-Boukhlef, Tom Aarsen, Jan Kostkan, Konrad Wojtasik, Taemin Lee, Marek Šuppa, Crystina Zhang, Roberta Rocca, Mohammed Hamdy, Andrianos Michail, John Yang, Manuel Faysse, Aleksei Vatolin, Nandan Thakur, Manan Dey, Dipam Vasani, Pranjal Chitale, Simone Tedeschi, Nguyen Tai, Artem Snegirev, Michael Günther, Mengzhou Xia, Weijia Shi, Xing Han Lù, Jordan Clive, Gayatri Krishnakumar, Anna Maksimova, Silvan Wehrli, Maria Tikhonova, Henil Panchal, Aleksandr Abramov, Malte Ostendorff, Zheng Liu, Simon Clematide, Lester James Miranda, Alena Fenogenova, Guangyu Song, Ruqiya Bin Safi, Wen-Ding Li, Alessia Borghini, Federico Cassano, Hongjin Su, Jimmy Lin, Howard Yen, Lasse Hansen, Sara Hooker, Chenghao Xiao, Vaibhav Adlakha, Orion Weller, Siva Reddy, Niklas Muennighoff•Feb 19, 2025•31
REFIND: Retrieval-Augmented Factuality Hallucination Detection in Large
Language Models
DongGeon Lee, Hwanjo Yu•Feb 19, 2025•31
NExT-Mol: 3D Diffusion Meets 1D Language Modeling for 3D Molecule
Generation
Zhiyuan Liu, Yanchen Luo, Han Huang, Enzhi Zhang, Sihang Li, Junfeng Fang, Yaorui Shi, Xiang Wang, Kenji Kawaguchi, Tat-Seng Chua•Feb 18, 2025•31
MVL-SIB: A Massively Multilingual Vision-Language Benchmark for
Cross-Modal Topical Matching
Fabian David Schmidt, Florian Schneider, Chris Biemann, Goran Glavaš•Feb 18, 2025•21
Train Small, Infer Large: Memory-Efficient LoRA Training for Large
Language Models
Jun Zhang, Jue Wang, Huan Li, Lidan Shou, Ke Chen, Yang You, Guiming Xie, Xuejian Gong, Kunlong Zhou•Feb 19, 2025•21
GIMMICK -- Globally Inclusive Multimodal Multitask Cultural Knowledge
Benchmarking
Florian Schneider, Carolin Holtermann, Chris Biemann, Anne Lauscher•Feb 19, 2025•21
ActionPiece: Contextually Tokenizing Action Sequences for Generative
Recommendation
Yupeng Hou, Jianmo Ni, Zhankui He, Noveen Sachdeva, Wang-Cheng Kang, Ed H. Chi, Julian McAuley, Derek Zhiyuan Cheng•Feb 19, 2025•21
Judging the Judges: A Collection of LLM-Generated Relevance Judgements
Hossein A. Rahmani, Clemencia Siro, Mohammad Aliannejadi, Nick Craswell, Charles L. A. Clarke, Guglielmo Faggioli, Bhaskar Mitra, Paul Thomas, Emine Yilmaz•Feb 19, 2025•11
AIDE: AI-Driven Exploration in the Space of Code
Zhengyao Jiang, Dominik Schmidt, Dhruv Srikanth, Dixing Xu, Ian Kaplan, Deniss Jacenko, Yuxiang Wu•Feb 18, 2025•11
Reducing Hallucinations in Language Model-based SPARQL Query Generation
Using Post-Generation Memory Retrieval
Aditya Sharma, Luis Lara, Amal Zouaq, Christopher J. Pal•Feb 19, 2025•11
High-Fidelity Novel View Synthesis via Splatting-Guided Diffusion
Xiang Zhang, Yang Zhang, Lukas Mehl, Markus Gross, Christopher Schroers•Feb 18, 2025•11
TESS 2: A Large-Scale Generalist Diffusion Language Model
Jaesung Tae, Hamish Ivison, Sachin Kumar, Arman Cohan•Feb 19, 2025•12
InfiR : Crafting Effective Small Language Models and Multimodal Small
Language Models in Reasoning
Congkai Xie, Shuo Cai, Wenjun Wang, Pengxiang Li, Zhijie Sang, Kejing Yang, Yiming Zhang, Zhen Li, Guanghao Zhu, Zeyu Liu, Yang Yu, Yuhang Liu, Su Lu, Baoyi He, Qi Zhou, Xiaotian Han, Jianbo Yuan, Shengyu Zhang, Fei Wu, Hongxia Yang•Feb 17, 2025•11
Noise May Contain Transferable Knowledge: Understanding Semi-supervised
Heterogeneous Domain Adaptation from an Empirical Perspective
Yuan Yao, Xiaopu Zhang, Yu Zhang, Jian Jin, Qiang Yang•Feb 19, 2025•01