Commit History

Remove modebar
496735b

rzanoli committed on

Remove graph visualization for prompt
fc7c800

rzanoli committed on

Place charts on the main page immediately before the leaderboard table
9835969

rzanoli committed on

Add Size field to the leaderboard
cb9f237

rzanoli committed on

Remove author prefix from model names
831dff0

rzanoli committed on

Rename prompts for LS, SU, NER, and REL
d0105c8

rzanoli committed on

Add theoretical performance of a model that scores the highest on every individual task
6b09246

rzanoli committed on

Add award icons for 5-shot and 0-shot models; shorten some table column names for clarity
56e849d

rzanoli committed on

Add model positions in the ranking
13fe545

rzanoli committed on

Added computation and display of the standard deviation across individual prompt accuracy values for each task
67324c2

rzanoli committed on

Small changes
d4cf66e

rzanoli committed on

Small changes
5888550

rzanoli committed on

Small Changes
338193d

rzanoli committed on

Small Changes
cf654bf

rzanoli committed on

Small Changes
4456936

rzanoli committed on

Small Changes
602e1b0

rzanoli committed on

Small Changes
7aacef3

rzanoli committed on

Small Changes
b5e0623

rzanoli committed on

Small Changes
3b91660

rzanoli committed on

Small Changes
c03f591

rzanoli committed on

Small Changes
7a90675

rzanoli committed on

Small changes
ea6af72

rzanoli committed on

Small changes
5a8f6c4

rzanoli committed on

Small changes
5b04d4e

rzanoli committed on

Small changes
dbd3b18

rzanoli committed on

Minor changes
12c62aa

rzanoli committed on

Small changes
c996d40

rzanoli committed on

Small changes
8886020

rzanoli committed on

Small changes
cae4d0f

rzanoli committed on

Merge branch 'main' of https://huggingface.co/spaces/evalitahf/evalita_llm_leaderboard
d04734c

rzanoli committed on

Add new scripts for model processing and tasks management
d1c3cb5

rzanoli committed on

Add new scripts for model processing and tasks management
ad489d5

rzanoli committed on

Update src/about.py
2e1205f
verified

evalitahf committed on

Duplicate from demo-leaderboard-backend/leaderboard
6b0f21c
verified

evalitahf and clefourrier (HF Staff) committed on