ChatPaper.aiChatPaper

探索与导航Hugging Face模型图谱

Charting and Navigating Hugging Face's Model Atlas

March 13, 2025
作者: Eliahu Horwitz, Nitzan Kurer, Jonathan Kahana, Liel Amar, Yedid Hoshen
cs.AI

摘要

随着数百万个公开可用的神经网络模型的出现,搜索和分析大规模模型库变得愈发重要。要驾驭如此众多的模型,需要一份“地图”,但由于大多数模型缺乏完善的文档,绘制这样一份地图颇具挑战。为了挖掘模型库的潜在价值,我们绘制了一份初步的地图,代表了Hugging Face平台上已文档化的模型部分。这份地图提供了模型景观及其演变的惊艳可视化展示。我们展示了该地图的多种应用,包括预测模型属性(如准确率)以及分析计算机视觉模型的趋势。然而,鉴于当前地图仍不完整,我们提出了一种方法来绘制未文档化的区域。具体而言,我们基于现实世界中主导的模型训练实践,识别出高置信度的结构先验。利用这些先验知识,我们的方法能够精确映射地图中先前未文档化的区域。我们公开了我们的数据集、代码及交互式地图。
English
As there are now millions of publicly available neural networks, searching and analyzing large model repositories becomes increasingly important. Navigating so many models requires an atlas, but as most models are poorly documented charting such an atlas is challenging. To explore the hidden potential of model repositories, we chart a preliminary atlas representing the documented fraction of Hugging Face. It provides stunning visualizations of the model landscape and evolution. We demonstrate several applications of this atlas including predicting model attributes (e.g., accuracy), and analyzing trends in computer vision models. However, as the current atlas remains incomplete, we propose a method for charting undocumented regions. Specifically, we identify high-confidence structural priors based on dominant real-world model training practices. Leveraging these priors, our approach enables accurate mapping of previously undocumented areas of the atlas. We publicly release our datasets, code, and interactive atlas.

Summary

AI-Generated Summary

PDF271March 14, 2025