Upload content.py (#7)
- Upload content.py (57676c44a8bdd9fe6c12a705108020f4abf7a6a2)
Co-authored-by: xuetianci <xuetianci99@users.noreply.huggingface.co>
- content.py +11 -8
content.py
CHANGED
@@ -1,8 +1,8 @@
 TITLE = """<h1 align="center" id="space-title">π Online-Mind2Web Leaderboard</h1>"""
 LINKS = """
 <div align="center">
+<a href="https://arxiv.org/abs/2504.01382">Paper</a> |
 <a href="https://tiancixue.notion.site/An-Illusion-of-Progress-Assessing-the-Current-State-of-Web-Agents-1ac6cd2b9aac80719cd6f68374aaf4b4?pvs=4">Blog</a> |
-<a href="#">Paper</a> |
 <a href="https://github.com/OSU-NLP-Group/Online-Mind2Web">Code</a> |
 <a href="https://huggingface.co/datasets/osunlp/Online-Mind2Web">Data</a>
 </div>
@@ -28,13 +28,14 @@ Submissions must include a sequence of images (i.e., screenshots in the trajecto
 
 CITATION_BUTTON_LABEL = "Copy the following snippet to cite these results. Note: Online-Mind2Web is derived from the original Mind2Web dataset. We kindly ask that you cite both the original and this work when using or referencing the data."
 CITATION_BUTTON_TEXT = r"""
-@article{
-
-
-
-
-
-
+@article{xue2025illusionprogressassessingcurrent,
+title={An Illusion of Progress? Assessing the Current State of Web Agents},
+author={Tianci Xue and Weijian Qi and Tianneng Shi and Chan Hee Song and Boyu Gou and Dawn Song and Huan Sun and Yu Su},
+year={2025},
+eprint={2504.01382},
+archivePrefix={arXiv},
+primaryClass={cs.AI},
+url={https://arxiv.org/abs/2504.01382},
 }
 @inproceedings{deng2023mind2web,
 author = {Deng, Xiang and Gu, Yu and Zheng, Boyuan and Chen, Shijie and Stevens, Sam and Wang, Boshi and Sun, Huan and Su, Yu},
@@ -58,7 +59,9 @@ If you have conducted your own human evaluation, please also attach your human-e
 The result of each task is stored in a folder named as its `task_id`, containing:
 - `trajectory/`: Stores screenshots of each step.
 - `result.json`: Task metadata and action history.
+
 Here is an [example](https://github.com/OSU-NLP-Group/Online-Mind2Web/tree/main/data/example/fb7b4f784cfde003e2548fdf4e8d6b4f) of the format.
+
 **Structure:**
 ```
 main_directory/