Christopher
TNTOutburst
TNTOutburst's activity
Qwen1.5 add to leaderboard
7
#597 opened 8 months ago by TNTOutburst
Version for Qwen1.5-72B
#9 opened 8 months ago by TNTOutburst
Fine-tune for Qwen1.5
2
#14 opened 8 months ago by TNTOutburst
Any updates on redesigning the leaderboard?
2
#595 opened 8 months ago by TNTOutburst
152334H/miqu-1-70b-sf marked as private or deleted
3
#587 opened 8 months ago by TNTOutburst
meta-llama/Llama-2-70b-hf is set as "Private or deleted"
5
#580 opened 8 months ago by TNTOutburst
Improvement: "Metrics over time" has private/deleted models
2
#571 opened 9 months ago by TNTOutburst
Brainstorming: Call for a Time-Sensitive, Rolling-Update Benchmark Crowdsourced by the Community
24
#481 opened 10 months ago by JosephusCheung
Brainstorming: Suggestions for improving the leaderboard
25
#477 opened 10 months ago by xxyyy123
[FLAG] fblgit/una-xaberius-34b-v1beta
125
#444 opened 10 months ago by XXXGGGNEt
Black Box Benchmarks over Contamination Scanning
6
#470 opened 10 months ago by TNTOutburst
High ARC benchmark score
1
#1 opened 10 months ago by TNTOutburst
100 on HellaSwag benchmark
1
#1 opened 10 months ago by TNTOutburst
[FLAG] TigerResearch/tigerbot-70b-chat-v4-4k
23
#438 opened 10 months ago by fblgit
Feature request: Run 100B + models automatically
15
#434 opened 10 months ago by ChuckMcSneed
model was not found on hub!
3
#433 opened 10 months ago by liuda1
[FLAG?] Tigerbot-70b-chat-v2 scores are too high.
9
#414 opened 11 months ago by TNTOutburst
High ARC and TruthfulQA scores
3
#4 opened 10 months ago by TNTOutburst
Add Orca-2 7b and 13b to queue
2
#397 opened 11 months ago by TNTOutburst
Can't sort certain columns
1
#386 opened 11 months ago by TNTOutburst
Improve speed leaderboard front end
7
#249 opened about 1 year ago by Ostixe360
Two airoboros-l2-70b-2.1 models on leaderboard. One with far larger TruthfulQA
1
#238 opened about 1 year ago by TNTOutburst
[FLAG] Voicelab/trurl-2-13b: training data surely includes the test data, right?
6
#202 opened about 1 year ago by TNTOutburst
Why are there no OpenAI models here? we need GPT-3.5 and GPT4 to compare!
2
#169 opened about 1 year ago by FarisHijazi
FreeWilly2 by Stability AI is about to beat GPT3.5
3
#120 opened about 1 year ago by gsaivinay
Add a column: average score per billion parameters
2
#88 opened over 1 year ago by rfernand
How long does it take to run these tests?
7
#90 opened over 1 year ago by Goldenblood56
why isn't truthfulQA shown in the leaderboards?
1
#81 opened over 1 year ago by wfzimmerman
Models for Human/GPT4 Eval
25
#65 opened over 1 year ago by natolambert
[feature request] prioritize the queue (by user voting?)
3
#46 opened over 1 year ago by zed9h