fix: remove * imports #1569

Merged 78 commits on Dec 9, 2024
Commits
dd5d226
fix: Count unique texts, data leaks in calculate metrics (#1438)
Samoed Nov 14, 2024
04ac3f2
fix: update task metadata to allow for null (#1448)
KennethEnevoldsen Nov 14, 2024
f6a49fe
Update tasks table
github-actions[bot] Nov 14, 2024
78c0e4e
1.19.5
invalid-email-address Nov 14, 2024
4e86cea
Fix: Made data parsing in the leaderboard figure more robust (#1450)
x-tabdeveloping Nov 14, 2024
039d010
Fixed task loading (#1451)
x-tabdeveloping Nov 14, 2024
feb1ab7
fix: publish (#1452)
x-tabdeveloping Nov 14, 2024
3397633
1.19.6
invalid-email-address Nov 14, 2024
14d7523
fix: Fix load external results with `None` mteb_version (#1453)
Samoed Nov 14, 2024
68eb498
1.19.7
invalid-email-address Nov 14, 2024
58c459b
WIP: Polishing up leaderboard UI (#1461)
x-tabdeveloping Nov 15, 2024
1b920ac
fix: loading pre 1.11.0 (#1460)
Samoed Nov 15, 2024
a988fef
1.19.8
invalid-email-address Nov 15, 2024
9b2aece
fix: swap touche2020 to maintain compatibility (#1469)
isaac-chung Nov 17, 2024
8bb4a29
1.19.9
invalid-email-address Nov 17, 2024
2fb6fe7
docs: Add sum per language for task counts (#1468)
isaac-chung Nov 18, 2024
fde124a
fix: pinned datasets to <3.0.0 (#1470)
Napuh Nov 19, 2024
7186e04
1.19.10
invalid-email-address Nov 19, 2024
1cc6c9e
feat: add CUREv1 retrieval dataset (#1459)
dbuades Nov 21, 2024
4408717
Update tasks table
github-actions[bot] Nov 21, 2024
3ff38ec
1.20.0
invalid-email-address Nov 21, 2024
917ad7f
fix: check if `model` attr of model exists (#1499)
Samoed Nov 26, 2024
cde720e
1.20.1
invalid-email-address Nov 26, 2024
0affa31
fix: Leaderboard demo data loading (#1507)
x-tabdeveloping Nov 27, 2024
594f643
1.20.2
invalid-email-address Nov 27, 2024
35245d3
fix: leaderboard only shows models that have ModelMeta (#1508)
x-tabdeveloping Nov 27, 2024
9282796
1.20.3
invalid-email-address Nov 27, 2024
942f212
fix: align readme with current mteb (#1493)
Samoed Nov 27, 2024
09f004c
1.20.4
invalid-email-address Nov 27, 2024
cfd43ac
docs: Add lang family mapping and map to task table (#1486)
isaac-chung Nov 28, 2024
377a63d
Update tasks table
github-actions[bot] Nov 28, 2024
e3d2b54
fix: Ensure that models match the names on embedding-benchmarks/resul…
KennethEnevoldsen Nov 29, 2024
9980c60
1.20.5
invalid-email-address Nov 29, 2024
b02ae82
fix: Adding missing metadata on models and mathcing names up with the…
x-tabdeveloping Nov 29, 2024
ba09b11
1.20.6
invalid-email-address Nov 29, 2024
8e12250
feat: Evaluate missing splits (#1525)
isaac-chung Nov 29, 2024
ee1edac
1.21.0
invalid-email-address Nov 29, 2024
343b6e0
fix: Correct typos superseeded -> superseded (#1532)
isaac-chung Nov 30, 2024
e949d2a
1.21.1
invalid-email-address Nov 30, 2024
5b6f20f
fix: Task load data error for SICK-BR-STS and XStance (#1534)
isaac-chung Dec 1, 2024
ec9413a
1.21.2
invalid-email-address Dec 1, 2024
39349ff
fix: Proprietary models now get correctly shown in leaderboard (#1530)
x-tabdeveloping Dec 2, 2024
d07c29b
1.21.3
invalid-email-address Dec 2, 2024
5fa7b7b
docs: Add Model Meta parameters and metadata (#1536)
isaac-chung Dec 2, 2024
36bab4d
fix: add more model meta (jina, e5) (#1537)
isaac-chung Dec 4, 2024
ac4a706
1.21.4
invalid-email-address Dec 4, 2024
c2f4c26
Add cohere models (#1538)
KennethEnevoldsen Dec 4, 2024
5013df8
fix: add nomic models (#1543)
KennethEnevoldsen Dec 4, 2024
97ab272
fix: Added all-minilm-l12-v2 (#1542)
KennethEnevoldsen Dec 4, 2024
df11c38
fix: Added arctic models (#1541)
KennethEnevoldsen Dec 4, 2024
37fdfa1
fix: add sentence trimming to OpenAIWrapper (#1526)
yjoonjang Dec 4, 2024
1e62184
1.21.5
invalid-email-address Dec 4, 2024
a44a46c
fix: Fixed metadata errors (#1547)
x-tabdeveloping Dec 4, 2024
d713525
1.21.6
invalid-email-address Dec 4, 2024
279a4ee
fix: remove curev1 from multlingual (#1552)
KennethEnevoldsen Dec 5, 2024
e339735
1.21.7
invalid-email-address Dec 5, 2024
2ee8d44
fix: Add Model2vec (#1546)
x-tabdeveloping Dec 6, 2024
2905813
Made result loading more permissive, changed eval splits for HotPotQA…
x-tabdeveloping Dec 6, 2024
a6ce6f9
1.21.8
invalid-email-address Dec 6, 2024
fc64791
docs: Correction of SICK-R metadata (#1558)
rafalposwiata Dec 7, 2024
611b6a1
feat(google_models): fix issues and add support for `text-embedding-0…
dbuades Dec 7, 2024
5e7e033
1.22.0
invalid-email-address Dec 7, 2024
ac44e58
fix(bm25s): search implementation (#1566)
dbuades Dec 7, 2024
346179f
Merge branch 'refs/heads/main' into update_cli
Samoed Dec 7, 2024
b8ff89c
1.22.1
invalid-email-address Dec 7, 2024
03347eb
docs: Fix dependency library name for bm25s (#1568)
isaac-chung Dec 7, 2024
6489fca
fix: Add training dataset to model meta (#1561)
KennethEnevoldsen Dec 8, 2024
1d21818
feat: (cohere_models) cohere_task_type issue, batch requests and tqdm…
dbuades Dec 8, 2024
68bd8ac
fix(publichealth-qa): ignore rows with `None` values in `question` o…
dbuades Dec 8, 2024
2550a27
1.23.0
invalid-email-address Dec 8, 2024
d474451
fix wongnai
Samoed Dec 8, 2024
2015ee5
update inits
Samoed Dec 8, 2024
23fb642
fix tests
Samoed Dec 8, 2024
54a7f5c
lint
Samoed Dec 8, 2024
07f1391
Merge branch 'refs/heads/main' into update_imports
Samoed Dec 8, 2024
d67225b
update imports
Samoed Dec 9, 2024
8653c27
fix tests
Samoed Dec 9, 2024
4ba6ff5
lint
Samoed Dec 9, 2024
19 changes: 12 additions & 7 deletions docs/create_tasks_table.py
@@ -68,7 +68,7 @@ def create_tasks_table(tasks: list[mteb.AbsTask]) -> str:
     return table
 
 
-def create_task_lang_table(tasks: list[mteb.AbsTask]) -> str:
+def create_task_lang_table(tasks: list[mteb.AbsTask], sort_by_sum=False) -> str:
     table_dict = {}
     ## Group by language. If it is a multilingual dataset, 1 is added to all languages present.
     for task in tasks:
@@ -82,22 +82,27 @@ def create_task_lang_table(tasks: list[mteb.AbsTask]) -> str:
     ## Wrangle for polars
     pl_table_dict = []
     for lang, d in table_dict.items():
-        d.update({"lang": lang})
+        d.update({"0-lang": lang})  # for sorting columns
         pl_table_dict.append(d)
 
-    df = pl.DataFrame(pl_table_dict).sort(by="lang")
+    df = pl.DataFrame(pl_table_dict).sort(by="0-lang")
+    df = df.with_columns(sum=pl.sum_horizontal(get_args(TASK_TYPE)))
+    df = df.select(sorted(df.columns))
+    if sort_by_sum:
+        df = df.sort(by="sum", descending=True)
 
     total = df.sum()
 
     task_names_md = " | ".join(sorted(get_args(TASK_TYPE)))
-    horizontal_line_md = "---|---" * len(sorted(get_args(TASK_TYPE)))
+    horizontal_line_md = "---|---" * (len(sorted(get_args(TASK_TYPE))) + 1)
     table = f"""
-| Language | {task_names_md} |
+| Language | {task_names_md} | Sum |
 |{horizontal_line_md}|
 """
 
     for row in df.iter_rows():
-        table += f"| {row[-1]} "
-        for num in row[:-1]:
+        table += f"| {row[0]} "
+        for num in row[1:]:
             table += f"| {num} "
         table += "|\n"
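
The column-ordering idea in this change can be illustrated without polars. The sketch below is not the project's code: `render_lang_table`, the sample task types, and the counts are hypothetical, and the separator row is simplified. It shows why the language key is renamed to `"0-lang"`. When column names are sorted as strings, a leading digit sorts before the capitalized task-type names, and the lowercase `"sum"` sorts after them, so the language lands at index 0 and the sum lands last.

```python
def render_lang_table(
    rows: list[dict], task_types: list[str], sort_by_sum: bool = False
) -> str:
    """Render per-language task counts as a Markdown table with a trailing Sum column."""
    records = []
    for row in rows:
        counts = {t: row.get(t, 0) for t in task_types}
        # "0-lang" is a sort-friendly key: digits sort before letters.
        records.append({"0-lang": row["lang"], **counts, "sum": sum(counts.values())})

    # Plain string sort: "0-lang" first, capitalized task types next, "sum" last.
    columns = sorted(["0-lang", "sum", *task_types])

    records.sort(key=lambda r: r["0-lang"])
    if sort_by_sum:
        # Python's sort is stable, so ties keep their alphabetical order.
        records.sort(key=lambda r: r["sum"], reverse=True)

    header = "| Language | " + " | ".join(sorted(task_types)) + " | Sum |"
    separator = "|" + "---|" * len(columns)
    body = ["| " + " | ".join(str(r[c]) for c in columns) + " |" for r in records]
    return "\n".join([header, separator, *body])


# Hypothetical sample data, for illustration only.
sample = [
    {"lang": "eng", "Classification": 3, "STS": 2},
    {"lang": "dan", "Classification": 1, "STS": 0},
]
print(render_lang_table(sample, ["STS", "Classification"]))
```

Passing `sort_by_sum=True` reorders the rows so that the language with the most tasks comes first, which mirrors the optional sort added in the diff.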

2,108 changes: 1,054 additions & 1,054 deletions docs/tasks.md

Large diffs are not rendered by default.