每日論文
PRIMA.CPP:在低資源日常家用集群上加速700億規模大語言模型推理PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday
Home Clusters
PRIMA.CPP:在低資源日常家用集群上加速700億規模大語言模型推理
PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday
Home Clusters
Zonghang Li, Tao Li, Wenjiao Feng, Mohsen Guizani, Hongfang Yu•Apr 7, 2025•382
FUSION:視覺-語言表徵的完全整合,實現深度跨模態理解FUSION: Fully Integration of Vision-Language Representations for Deep
Cross-Modal Understanding
FUSION:視覺-語言表徵的完全整合,實現深度跨模態理解
FUSION: Fully Integration of Vision-Language Representations for Deep
Cross-Modal Understanding
Zheng Liu, Mengjie Liu, Jingzhou Chen, Jingwei Xu, Bin Cui, Conghui He, Wentao Zhang•Apr 14, 2025•252
Mavors:面向多模态大型語言模型的多粒度視頻表徵Mavors: Multi-granularity Video Representation for Multimodal Large
Language Model
Mavors:面向多模态大型語言模型的多粒度視頻表徵
Mavors: Multi-granularity Video Representation for Multimodal Large
Language Model
Yang Shi, Jiaheng Liu, Yushuo Guan, Zhenhua Wu, Yuanxing Zhang, Zihao Wang, Weihong Lin, Jingyun Hua, Zekun Wang, Xinlong Chen, Bohan Zeng, Wentao Zhang, Fuzheng Zhang, Wenjing Yang, Di Zhang•Apr 14, 2025•181
InternVL3:探索開源多模態模型的高階訓練與測試時優化方案InternVL3: Exploring Advanced Training and Test-Time Recipes for
Open-Source Multimodal Models
InternVL3:探索開源多模態模型的高階訓練與測試時優化方案
InternVL3: Exploring Advanced Training and Test-Time Recipes for
Open-Source Multimodal Models
Jinguo Zhu, Weiyun Wang, Zhe Chen, Zhaoyang Liu, Shenglong Ye, Lixin Gu, Yuchen Duan, Hao Tian, Weijie Su, Jie Shao, Zhangwei Gao, Erfei Cui, Yue Cao, Yangzhou Liu, Weiye Xu, Hao Li, Jiahao Wang, Han Lv, Dengnian Chen, Songze Li, Yinan He, Tan Jiang, Jiapeng Luo, Yi Wang, Conghui He, Botian Shi, Xingcheng Zhang, Wenqi Shao, Junjun He, Yingtong Xiong, Wenwen Qu, Peng Sun, Penglong Jiao, Lijun Wu, Kaipeng Zhang, Huipeng Deng, Jiaye Ge, Kai Chen, Limin Wang, Min Dou, Lewei Lu, Xizhou Zhu, Tong Lu, Dahua Lin, Yu Qiao, Jifeng Dai, Wenhai Wang•Apr 14, 2025•161
VL-Rethinker:利用強化學習激勵視覺語言模型的自我反思VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models
with Reinforcement Learning
VL-Rethinker:利用強化學習激勵視覺語言模型的自我反思
VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models
with Reinforcement Learning
Haozhe Wang, Chao Qu, Zuming Huang, Wei Chu, Fangzhen Lin, Wenhu Chen•Apr 10, 2025•161
AgentRewardBench:評估網路代理軌跡的自動評量系統AgentRewardBench: Evaluating Automatic Evaluations of Web Agent
Trajectories
AgentRewardBench:評估網路代理軌跡的自動評量系統
AgentRewardBench: Evaluating Automatic Evaluations of Web Agent
Trajectories
Xing Han Lù, Amirhossein Kazemnejad, Nicholas Meade, Arkil Patel, Dongchan Shin, Alejandra Zambrano, Karolina Stańczak, Peter Shaw, Christopher J. Pal, Siva Reddy•Apr 11, 2025•131
DUMP:基於強化學習的大型語言模型分佈級自動化課程學習後訓練DUMP: Automated Distribution-Level Curriculum Learning for RL-based LLM
Post-training
DUMP:基於強化學習的大型語言模型分佈級自動化課程學習後訓練
DUMP: Automated Distribution-Level Curriculum Learning for RL-based LLM
Post-training
Zhenting Wang, Guofeng Cui, Kun Wan, Wentian Zhao•Apr 13, 2025•101
我們是否已經統一了圖像生成與理解?對GPT-4o圖像生成能力的實證研究Have we unified image generation and understanding yet? An empirical
study of GPT-4o's image generation ability
我們是否已經統一了圖像生成與理解?對GPT-4o圖像生成能力的實證研究
Have we unified image generation and understanding yet? An empirical
study of GPT-4o's image generation ability
Ning Li, Jingran Zhang, Justin Cui•Apr 9, 2025•101
S1-Bench:評估大型推理模型系統一思維能力的簡易基準S1-Bench: A Simple Benchmark for Evaluating System 1 Thinking Capability
of Large Reasoning Models
S1-Bench:評估大型推理模型系統一思維能力的簡易基準
S1-Bench: A Simple Benchmark for Evaluating System 1 Thinking Capability
of Large Reasoning Models
Wenyuan Zhang, Shuaiyi Nie, Xinghua Zhang, Zefeng Zhang, Tingwen Liu•Apr 14, 2025•92
LLM-SRBench:基於大型語言模型的科學方程式發現新基準LLM-SRBench: A New Benchmark for Scientific Equation Discovery with
Large Language Models
LLM-SRBench:基於大型語言模型的科學方程式發現新基準
LLM-SRBench: A New Benchmark for Scientific Equation Discovery with
Large Language Models
Parshin Shojaee, Ngoc-Hieu Nguyen, Kazem Meidani, Amir Barati Farimani, Khoa D Doan, Chandan K Reddy•Apr 14, 2025•51
SocioVerse:一個由LLM代理驅動的社會模擬世界模型,擁有千萬真實用戶池SocioVerse: A World Model for Social Simulation Powered by LLM Agents
and A Pool of 10 Million Real-World Users
SocioVerse:一個由LLM代理驅動的社會模擬世界模型,擁有千萬真實用戶池
SocioVerse: A World Model for Social Simulation Powered by LLM Agents
and A Pool of 10 Million Real-World Users
Xinnong Zhang, Jiayu Lin, Xinyi Mou, Shiyue Yang, Xiawei Liu, Libo Sun, Hanjia Lyu, Yihang Yang, Weihong Qi, Yue Chen, Guanying Li, Ling Yan, Yao Hu, Siming Chen, Yu Wang, Jingxuan Huang, Jiebo Luo, Shiping Tang, Libo Wu, Baohua Zhou, Zhongyu Wei•Apr 14, 2025•41
TinyLLaVA-Video-R1:邁向更小型化的視訊推理多模態大模型TinyLLaVA-Video-R1: Towards Smaller LMMs for Video Reasoning
TinyLLaVA-Video-R1:邁向更小型化的視訊推理多模態大模型
TinyLLaVA-Video-R1: Towards Smaller LMMs for Video Reasoning
Xingjian Zhang, Siwei Wen, Wenjun Wu, Lei Huang•Apr 13, 2025•41
EmoAgent:評估與保障人機互動的心理健康安全EmoAgent: Assessing and Safeguarding Human-AI Interaction for Mental
Health Safety
EmoAgent:評估與保障人機互動的心理健康安全
EmoAgent: Assessing and Safeguarding Human-AI Interaction for Mental
Health Safety
Jiahao Qiu, Yinghui He, Xinzhe Juan, Yiming Wang, Yuhan Liu, Zixin Yao, Yue Wu, Xun Jiang, Ling Yang, Mengdi Wang•Apr 13, 2025•22
可執行的功能抽象:推斷高階數學問題的生成式程式Executable Functional Abstractions: Inferring Generative Programs for
Advanced Math Problems
可執行的功能抽象:推斷高階數學問題的生成式程式
Executable Functional Abstractions: Inferring Generative Programs for
Advanced Math Problems
Zaid Khan, Elias Stengel-Eskin, Archiki Prasad, Jaemin Cho, Mohit Bansal•Apr 14, 2025•11