1.19.5 (2024-11-14)
Fix
- fix: update task metadata to allow for null (#1448) (04ac3f2)
- fix: Count unique texts, data leaks in calculate metrics (#1438) (dd5d226)
  - add more stat
  - add more stat
  - update statistics
Unknown
- Update tasks table (f6a49fe)
- Leaderboard: Fixed code benchmarks (#1441) (3a1a470)
  - fixed code benchmarks
  - fix: Made n_parameters formatting smarter and more robust
  - fix: changed jina-embeddings-v3 number of parameters from 572K to 572M
  - fix: Fixed use_instuctions typo in model overview
  - fix: Fixed sentence-transformer compatibility switch
  - Ran linting
  - Added all languages, tasks, types and domains to options
  - Removed resetting options when a new benchmark is selected
  - All results now get displayed, but models that haven't been run on everything get NaN values in the table
- Leaderboard 2.0: added performance x n_parameters plot + more benchmark info (#1437) (76c2112)
  - Added elementary speed/performance plot
  - Refactored table formatting code
  - Bumped Gradio version
  - Added more general info to benchmark description markdown block
  - Adjusted margin and range on plot
  - Made hover information easier to read on plot
  - Made range scaling dynamic in plot
  - Moved citation next to benchmark description
  - Made titles in benchmark info bold
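
As an illustration of the n_parameters formatting fix listed above (and of why 572K versus 572M matters), here is a minimal sketch of how a raw parameter count might be turned into a short label. The function name `format_n_parameters`, the unit thresholds, and the empty-string placeholder are assumptions made for this example; this is not the leaderboard's actual code.

```python
import math
from typing import Optional


def format_n_parameters(n_parameters: Optional[float]) -> str:
    """Render a raw parameter count as a short label (hypothetical helper)."""
    # Missing or non-numeric counts become an empty string instead of raising.
    if n_parameters is None or isinstance(n_parameters, bool):
        return ""
    try:
        n = float(n_parameters)
    except (TypeError, ValueError):
        return ""
    if math.isnan(n) or n <= 0:
        return ""
    # Use the largest unit for which the scaled value is at least 1.
    for threshold, suffix in ((1e12, "T"), (1e9, "B"), (1e6, "M"), (1e3, "K")):
        if n >= threshold:
            value = n / threshold
            rounded = round(value, 1)
            # One decimal for small values (7.1B), whole numbers otherwise (572M).
            if rounded < 10:
                return f"{rounded:.1f}{suffix}"
            return f"{value:.0f}{suffix}"
    return f"{n:.0f}"
```

With this sketch, a count of 572,000,000 is rendered with the `M` suffix and 572,000 with the `K` suffix, so a mistaken unit in the model metadata shows up directly in the table.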