Spaces:
AIR-Bench
/
Running on CPU Upgrade

Commit History

feat: use -1 instead of 0 for presenting missing evals
7419959

nan commited on

fix a bug in read_evals.py
439a031

hanhainebula commited on

fix a import bug in envs.py
7d7a455

hanhainebula commited on

chore: update the version
394f64e

nan commited on

feat-add-reranker-tab-0607 (#21)
64a83a1
verified

nan commited on

feat-add-meta-info-after-uploading-0607 (#20)
9d64883
verified

nan commited on

feat-switch-to-ndcg-for-qa-0607 (#19)
973bd2a
verified

nan commited on

feat-use-recall-as-default-metric-0605 (#18)
bbfe4c1
verified

nan commited on

feat-add-tabs-for-noreranker-0605 (#17)
b80bda9
verified

nan commited on

Merge branch 'main' of https://huggingface.co/spaces/AIR-Bench/leaderboard
c1df819

nan commited on

fix: fix the missing file
0646823

nan commited on

refactor-leaderboard-0605 (#16)
a0387d8
verified

nan commited on

fix-show-no-reranker-bug-0531 (#14)
dccb8fe
verified

nan commited on

fix a bug for anonymous retrieval model submisson
933b575
verified

hanhainebula commited on

fix-bug-in-selecting-domain-0522 (#13)
eb0f9c5
verified

nan commited on

fix a bug in METRIC_LIST
443f557

hanhainebula commited on

modify the default value of reranking selector
2a65b26

hanhainebula commited on

modify the default value of reranking selector
658d5a4

hanhainebula commited on

feat-convert-reranker-selection-to-dropdown-0521 (#12)
1f30550
verified

nan commited on

feat-rename-to-retrieval-methods-0520 (#11)
f000c74
verified

nan commited on

Fix bug in dataset_dict: "gpt-3" -> "gpt3"
8102fce
verified

hanhainebula commited on

Fix bug in dataset_dict: "health" -> "healthcare"
4a44211
verified

hanhainebula commited on

feat-improve-submission-page-0517 (#10)
e93da78
verified

nan commited on

fix-bug-in-show-details-0517 (#9)
7ca7624
verified

nan commited on

feat-add-benchmark-version-selector-0515 (#7)
2c23c13
verified

nan commited on

feat-add-no-reranker-button-0515 (#5)
9002757
verified

nan commited on

feat-add-reranker-url-validator-0515 (#4)
240d9ce
verified

nan commited on

Modify the evaluation steps
606d718

hanhainebula commited on

feat-add-toggle-button-for-revision-col-0514 (#3)
77ded94
verified

nan commited on

fix-reranking-0514 (#2)
3bab3e9
verified

nan commited on

Merge branch 'main' of https://huggingface.co/spaces/AIR-Bench/leaderboard
7cbfcef

nan commited on

fix: fix the bug in selecting reranking models
6925231

nan commited on

Modify the submission tips
2dca7ce
verified

hanhainebula commited on

fix: fix the mean calculation for NAN values
08fea1e

nan commited on

Modify the submission tips
51dbe36
verified

hanhainebula commited on

Modify the commands of evaluating
ca2a141
verified

hanhainebula commited on

Merge branch 'main' of https://huggingface.co/spaces/AIR-Bench/leaderboard
d27648d

nan commited on

fix: fix the bug in the annoymous checkbox
f03a7b5

nan commited on

Add msmarco for qa task
43fbed5
verified

hanhainebula commited on

Fix "is_anonymous" parameter
7dac66f
verified

hanhainebula commited on

Add indent for metadata json file
9e747ff
verified

hanhainebula commited on

Update src/utils.py
8fcdb2a
verified

hanhainebula commited on

Calculate md5 in the frontend
d306dfd
verified

hanhainebula commited on

Fix check when loading results file
cf2d912
verified

hanhainebula commited on

Loading all results files
05cd94e
verified

hanhainebula commited on

Modify the save path in search_results hf repo
158e42c
verified

hanhainebula commited on

feat: add preview tag
982af90

nan commited on