Long Context Evaluation

#430
by mrfakename - opened

Hi,
Is there any evaluation to test ability to perform well on longer prompts?

I would like this one, too

Open LLM Leaderboard org

Hi! If one of you wants to set up the above dataset as a leaderboard, I can give you a hand. (We won't add it to the Open LLM Leaderboard however)

clefourrier changed discussion status to closed
This comment has been hidden

Sign up or log in to comment