Added model results to repo and updated CLI to create consistent folder structure. #254
Conversation
Interesting - is the intention to have one result file for every dataset? Could be a good idea, so it's easy to get an idea of what kind of performance to expect
Yep exactly. This also makes it easier for us to review dataset submissions, as they then also include at least 1-3 models that have been run on the data. We want it to be both:

The assumption is to have the results at least for newly submitted datasets.
Makes sense. Should this be specified somewhere, e.g. in how to contribute? Also, maybe it makes sense to explicitly tell people to always run model X, so it's a bit easier to compare if it's always the same model? If it's just one model, it should probably be a multilingual one 🤔
Yep, I plan to make a PR on how to add datasets that includes some standard models (small and multilingual). I was thinking e5-multilingual-small and sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2.
Great idea! Both models are multilingual, so yes, why not! Just keep in mind that for some languages they may not perform very well (under-represented languages in the training datasets).
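For reference, here is a minimal sketch of what such a standard-model run could look like with the mteb Python API; the task name is only an illustrative placeholder, and the exact API surface may differ between versions:

```python
from mteb import MTEB
from sentence_transformers import SentenceTransformer

# One of the multilingual models discussed above; the task is just an example
# and would be replaced by the newly submitted dataset's task name.
model = SentenceTransformer("sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2")

evaluation = MTEB(tasks=["STS22"])
evaluation.run(model, output_folder="results/paraphrase-multilingual-MiniLM-L12-v2")
```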
@@ -0,0 +1 @@
{"model_name": "sentence-transformers/all-MiniLM-L6-v2", "time_of_run": "2024-03-18 11:22:22.739054", "versions": {"sentence_transformers": "2.0.0", "transformers": "4.6.1", "pytorch": "1.8.1"}} |
Maybe add a revision number to make sure it's the same model version that is used? Wdyt?
I would actually love to, but couldn't figure out how to do it. I don't believe it is recorded in the model object. You can naturally fetch the latest from the repo, but then hitting the cache causes discrepancies.
I was thinking about doing the same as for datasets in mteb (revision_id). Just specifying the commit id from the HF repo that stores the model.
Yes, I would love to do that. However, I am not sure the commit id is available in the model object (you would have to know it beforehand). I would love to add that, but it seems like that is outside the scope of this PR.
@imenelydiaker does anyone from your team have the time to give it a go?
Okay I see what you mean. I can check this and open another PR.
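For whoever picks this up: one possible route is to look up the commit id on the Hub via huggingface_hub rather than from the model object itself. A minimal sketch, with the model name purely as an example and the metadata key as an assumption:

```python
from huggingface_hub import HfApi

# Ask the Hub for the model repo's current commit sha (illustrative only).
info = HfApi().model_info("sentence-transformers/all-MiniLM-L6-v2")
revision = info.sha  # e.g. stored in the results metadata as "model_revision"
print(revision)
```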
I suppose this closes #248 - I would add it to the PR description, but can't edit 👍
@MartinBernstorff this is #254
Ah yeah, I have updated my comment 👍
Added model results to repo and updated CLI to create consistent folder structure. (embeddings-benchmark#254)

* Added model results to repo and updated CLI to create consistent folder structure.
* ci: updated ci to use make install
* Added missing pytest dependencies
* Update README.md

Co-authored-by: Niklas Muennighoff <n.muennighoff@gmail.com>
This is a suggested change. The goal is to make it easier to add model evaluations along with a dataset (as a kind of test).
This includes a few changes:
* A consistent results folder structure: results/{model_name}/{task_results}
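A minimal sketch of how a result file path could be composed under this structure; the name normalisation (replacing "/" in the model name) is an assumption for illustration, not necessarily what the CLI does:

```python
from pathlib import Path

# Illustrative only: build results/{model_name}/{task_results}.
model_name = "sentence-transformers/all-MiniLM-L6-v2"
task_name = "STS22"
result_path = Path("results") / model_name.replace("/", "__") / f"{task_name}.json"
print(result_path)  # results/sentence-transformers__all-MiniLM-L6-v2/STS22.json
```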