ChatPaper.aiChatPaper

SkillWeaver:網絡代理能通過發現與精進技能實現自我提升

SkillWeaver: Web Agents can Self-Improve by Discovering and Honing Skills

April 9, 2025
作者: Boyuan Zheng, Michael Y. Fatemi, Xiaolong Jin, Zora Zhiruo Wang, Apurva Gandhi, Yueqi Song, Yu Gu, Jayanth Srinivasa, Gaowen Liu, Graham Neubig, Yu Su
cs.AI

摘要

為了在複雜環境中生存並茁壯成長,人類已演化出精密的自我提升機制,這些機制包括環境探索、將經驗分層抽象化為可重複使用的技能,以及協作構建不斷增長的技能庫。儘管近期取得了進展,自主網絡代理仍缺乏關鍵的自我提升能力,在程序性知識的抽象、技能的精煉及技能組合方面面臨挑戰。在本研究中,我們引入了SkillWeaver,這是一個以技能為中心的框架,使代理能夠通過自主合成可重複使用的API來實現自我提升。面對新網站時,代理自主發現技能,執行這些技能進行練習,並將練習經驗提煉成穩健的API。通過迭代探索,不斷擴展輕量級、即插即用的API庫,顯著增強了代理的能力。在WebArena和真實世界網站上的實驗證明了SkillWeaver的有效性,分別實現了31.8%和39.8%的相對成功率提升。此外,由強大代理合成的API通過可轉移技能大幅提升了較弱代理的能力,在WebArena上最高提升了54.3%。這些結果表明,將多樣化的網站互動精煉成API並在不同網絡代理間無縫共享,具有顯著的成效。
English
To survive and thrive in complex environments, humans have evolved sophisticated self-improvement mechanisms through environment exploration, hierarchical abstraction of experiences into reuseable skills, and collaborative construction of an ever-growing skill repertoire. Despite recent advancements, autonomous web agents still lack crucial self-improvement capabilities, struggling with procedural knowledge abstraction, refining skills, and skill composition. In this work, we introduce SkillWeaver, a skill-centric framework enabling agents to self-improve by autonomously synthesizing reusable skills as APIs. Given a new website, the agent autonomously discovers skills, executes them for practice, and distills practice experiences into robust APIs. Iterative exploration continually expands a library of lightweight, plug-and-play APIs, significantly enhancing the agent's capabilities. Experiments on WebArena and real-world websites demonstrate the efficacy of SkillWeaver, achieving relative success rate improvements of 31.8% and 39.8%, respectively. Additionally, APIs synthesized by strong agents substantially enhance weaker agents through transferable skills, yielding improvements of up to 54.3% on WebArena. These results demonstrate the effectiveness of honing diverse website interactions into APIs, which can be seamlessly shared among various web agents.

Summary

AI-Generated Summary

PDF01April 10, 2025