ChatPaper.ai
メニューを開く
ホーム
今日の論文
料金プラン
アカウント
ワークスペース
🇯🇵
日本語
Loading...
•
•
•
•
•
•
•
•
•
•
AI研究論文デイリー
翻訳付きの日次キュレーションされたAI研究論文
January 15th, 2025
MiniMax-01: ライトニングアテンションを用いたファウンデーションモデルのスケーリング
MiniMax-01: Scaling Foundation Models with Lightning Attention
MiniMax, Aonian Li, Bangwei Gong, Bo Yang, Boji Shan, Chang Liu, Cheng Zhu, Chunhao Zhang, Congchao Guo, Da Chen, Dong Li, Enwei Jiao, Gengxin Li, Guojun Zhang, Haohai Sun, Houze Dong, Jiadai Zhu, Jiaqi Zhuang, Jiayuan Song, Jin Zhu, Jingtao Han, Jingyang Li, Junbin Xie, Junhao Xu, Junjie Yan, Kaishun Zhang, Kecheng Xiao, Kexi Kang, Le Han, Leyang Wang, Lianfei Yu, Liheng Feng, Lin Zheng, Linbo Chai, Long Xing, Meizhi Ju, Mingyuan Chi, Mozhi Zhang, Peikai Huang, Pengcheng Niu, Pengfei Li, Pengyu Zhao, Qi Yang, Qidi Xu, Qiexiang Wang, Qin Wang, Qiuhui Li, Ruitao Leng, Shengmin Shi, Shuqi Yu, Sichen Li, Songquan Zhu, Tao Huang, Tianrun Liang, Weigao Sun, Weixuan Sun, Weiyu Cheng, Wenkai Li, Xiangjun Song, Xiao Su, Xiaodong Han, Xinjie Zhang, Xinzhu Hou, Xu Min, Xun Zou, Xuyang Shen, Yan Gong, Yingjie Zhu, Yipeng Zhou, Yiran Zhong, Yongyi Hu, Yuanxiang Fan, Yue Yu, Yufeng Yang, Yuhao Li, Yunan Huang, Yunji Li, Yunpeng Huang, Yunzhi Xu, Yuxin Mao, Zehan Li, Zekang Li, Zewei Tao, Zewen Ying, Zhaoyang Cong, Zhen Qin, Zhenhua Fan, Zhihang Yu, Zhuo Jiang, Zijia Wu
•
Jan 14, 2025
•
258
5
MangaNinja: 正確な参照に従った線画の着色
MangaNinja: Line Art Colorization with Precise Reference Following
Zhiheng Liu, Ka Leong Cheng, Xi Chen, Jie Xiao, Hao Ouyang, Kai Zhu, Yu Liu, Yujun Shen, Qifeng Chen, Ping Luo
•
Jan 14, 2025
•
48
3
3DIS-FLUX: DiT レンダリングを用いたシンプルかつ効率的なマルチインスタンス生成
3DIS-FLUX: simple and efficient multi-instance generation with DiT rendering
Dewei Zhou, Ji Xie, Zongxin Yang, Yi Yang
•
Jan 9, 2025
•
32
2
パディングトーン:T2Iモデルにおけるパディングトークンの機構的分析
Padding Tone: A Mechanistic Analysis of Padding Tokens in T2I Models
Michael Toker, Ido Galil, Hadas Orgad, Rinon Gal, Yoad Tewel, Gal Chechik, Yonatan Belinkov
•
Jan 12, 2025
•
31
2
Omni-RGPT: トークンマークを介した画像とビデオの領域レベル理解の統合
Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks
Miran Heo, Min-Hung Chen, De-An Huang, Sifei Liu, Subhashree Radhakrishnan, Seon Joo Kim, Yu-Chiang Frank Wang, Ryo Hachiuma
•
Jan 14, 2025
•
30
2
1 ステップのビデオ生成のための拡散敵対的事後トレーニング
Diffusion Adversarial Post-Training for One-Step Video Generation
Shanchuan Lin, Xin Xia, Yuxi Ren, Ceyuan Yang, Xuefeng Xiao, Lu Jiang
•
Jan 14, 2025
•
29
4
指示に従ったシングルセル解析のためのマルチモーダルAIコパイロット
A Multi-Modal AI Copilot for Single-Cell Analysis with Instruction Following
Yin Fang, Xinle Deng, Kangwei Liu, Ningyu Zhang, Jingyang Qian, Penghui Yang, Xiaohui Fan, Huajun Chen
•
Jan 14, 2025
•
24
2
FramePainter: ビデオ拡散を用いたインタラクティブ画像編集への付与事前情報
FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors
Yabo Zhang, Xinpeng Zhou, Yihan Zeng, Hang Xu, Hui Li, Wangmeng Zuo
•
Jan 14, 2025
•
17
2
コンパクトなテキスト感知型一次元トークンを用いたテキストから画像へのマスク生成モデルの民主化
Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens
Dongwon Kim, Ju He, Qihang Yu, Chenglin Yang, Xiaohui Shen, Suha Kwak, Liang-Chieh Chen
•
Jan 13, 2025
•
16
2
HALoGEN: 素晴らしいLLM幻覚とその発生源
HALoGEN: Fantastic LLM Hallucinations and Where to Find Them
Abhilasha Ravichander, Shrusti Ghela, David Wadden, Yejin Choi
•
Jan 14, 2025
•
16
2
PokerBench: 大規模言語モデルをプロのポーカープレイヤーに育成する
PokerBench: Training Large Language Models to become Professional Poker Players
Richard Zhuang, Akshat Gupta, Richard Yang, Aniket Rahane, Zhengyu Li, Gopala Anumanchipalli
•
Jan 14, 2025
•
13
2
Tarsier2: 詳細なビデオ説明から包括的なビデオ理解への大規模ビジョン言語モデルの進化
Tarsier2: Advancing Large Vision-Language Models from Detailed Video Description to Comprehensive Video Understanding
Liping Yuan, Jiawei Wang, Haomiao Sun, Yuchen Zhang, Yuan Lin
•
Jan 14, 2025
•
12
2
出力中心の特徴記述による自動解釈性の向上
Enhancing Automated Interpretability with Output-Centric Feature Descriptions
Yoav Gur-Arieh, Roy Mayan, Chen Agassy, Atticus Geiger, Mor Geva
•
Jan 14, 2025
•
10
2
OpenCSG中国語コーパス:LLMトレーニングのための一連の高品質な中国語データセット
OpenCSG Chinese Corpus: A Series of High-quality Chinese Datasets for LLM Training
Yijiong Yu, Ziyun Dai, Zekun Wang, Wei Wang, Ran Chen, Ji Pei
•
Jan 14, 2025
•
7
2
大規模言語モデルが非構造化テキストデータの判断者としての可能性と危険性
Potential and Perils of Large Language Models as Judges of Unstructured Textual Data
Rewina Bedemariam, Natalie Perez, Sreyoshi Bhaduri, Satya Kapoor, Alex Gil, Elizabeth Conjar, Ikkei Itoku, David Theil, Aman Chadha, Naumaan Nayyar
•
Jan 14, 2025
•
6
2
MatchAnything:大規模事前学習を用いた汎用クロスモダリティ画像マッチング
MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training
Xingyi He, Hao Yu, Sida Peng, Dongli Tan, Zehong Shen, Hujun Bao, Xiaowei Zhou
•
Jan 13, 2025
•
5
3
AfriHate: アフリカ言語向けのヘイトスピーチと虐待的な言語の多言語コレクションデータセット
AfriHate: A Multilingual Collection of Hate Speech and Abusive Language Datasets for African Languages
Shamsuddeen Hassan Muhammad, Idris Abdulmumin, Abinew Ali Ayele, David Ifeoluwa Adelani, Ibrahim Said Ahmad, Saminu Mohammad Aliyu, Nelson Odhiambo Onyango, Lilian D. A. Wanzare, Samuel Rutunda, Lukman Jibril Aliyu, Esubalew Alemneh, Oumaima Hourrane, Hagos Tesfahun Gebremichael, Elyas Abdi Ismail, Meriem Beloucif, Ebrahim Chekol Jibril, Andiswa Bukula, Rooweither Mabuya, Salomey Osei, Abigail Oppong, Tadesse Destaw Belay, Tadesse Kebede Guge, Tesfa Tegegne Asfaw, Chiamaka Ijeoma Chukwuneke, Paul Röttger, Seid Muhie Yimam, Nedjma Ousidhoum
•
Jan 14, 2025
•
5
2
Graph-PReFLexORを使用したイン・シチュー・グラフ推論と知識拡張
In-situ graph reasoning and knowledge expansion using Graph-PReFLexOR
Markus J. Buehler
•
Jan 14, 2025
•
3
2