Update index.html
WadeYin9712 authored Nov 8, 2023
1 parent 1ca8b88 commit 44db073
Showing 1 changed file with 6 additions and 4 deletions.
10 changes: 6 additions & 4 deletions docs/index.html
@@ -198,18 +198,20 @@ <h1 class="title is-2 publication-title">🪄 Lumos: Learning Agents with Unifie
 <ul>
 <li><strong>🧩 A General Agent Modular Framework</strong>
 <ul>
-<li><strong>Lumos</strong> consists of planning, grounding, and execution modules built based on LLAMA-2-7B and off-the-shelf APIs. <strong>Lumos</strong> utilizes a unified data format that encompasses multiple task types, thereby enabling the developed agent framework to conveniently support a range of interactive tasks.</li>
+<li>🧩 <strong>Lumos</strong> consists of planning, grounding, and execution modules built based on LLAMA-2-7B and off-the-shelf APIs.</li>
+<li>🤗 <strong>Lumos</strong> utilizes a unified data format that encompasses multiple task types, thereby enabling the developed agent framework to conveniently support a range of interactive tasks.</li>
 </ul>
 </li>
 <li><strong>🌍 Diverse Training Data</strong>
 <ul>
-<li><strong>Lumos</strong> is trained with ~40K diverse high-quality subgoal/action annotations from ground-truth reasoning steps in existing benchmarks with GPT-4. Our gathered data can be instrumental for future research in developing open-source agents for complex interactive tasks.</li>
+<li>🌍 <strong>Lumos</strong> is trained with ~40K diverse high-quality subgoal/action annotations from ground-truth reasoning steps in existing benchmarks with GPT-4.</li>
+<li>⚒️ <strong>Lumos</strong> data can be instrumental for future research in developing open-source agents for complex interactive tasks.</li>
 </ul>
 </li>
 <li><strong>🚀 Competitive Performance</strong>
 <ul>
-<li>🚀 <strong>Lumos</strong> beats GPT-4/3.5-based agents on web/complex QA tasks such as Mind2Web and HotpotQA, and larger open agents on math tasks.</li>
-<li>🚀 <strong>Lumos</strong> exceeds it even outperforms contemporaneous agents that have been fine-tuned with in-domain HotpotQA and Mind2Web annotations, with examples including FiReAct and AgentLM.</li>
+<li>🚀 <strong>Lumos</strong> is comparable or even beats GPT-4/3.5-based agents on web/complex QA tasks such as Mind2Web and HotpotQA, and larger open agents on math tasks.</li>
+<li>🚀 <strong>Lumos</strong> exceeds contemporaneous agents that have been fine-tuned with in-domain HotpotQA and Mind2Web annotations, such as FiReAct and AgentLM.</li>
 <li>🚀 <strong>Lumos</strong> outperforms open agent baseline formulations like chain-of-thoughts and integrated training.</li>
 <li>🚀 <strong>Lumos</strong> surpasses larger open LLM agents and domain-specific agents by a large margin on an unseen task, WebShop.</li>
 </ul>