fix: Added sizes to the metadata #276

Merged
merged 18 commits into from
Mar 24, 2024

Contributor

@KennethEnevoldsen KennethEnevoldsen commented Mar 21, 2024

This allows for automatic generation of metadata tables.

Contributor

@MartinBernstorff MartinBernstorff left a comment

Excellent work, Kenneth! We end up with a lot of empty fields, but at least we document it clearly, and can handle it if we need the data 👍

@@ -144,3 +146,6 @@ class TaskMetadata(BaseModel):

text_creation: TEXT_CREATION_METHOD | None
bibtex_citation: str | None

n_samples: dict[str, int]
Contributor

@MartinBernstorff MartinBernstorff Mar 22, 2024

Maybe we should specify these as dict[SPLIT_TYPE, int]? This can be something as simple as a type-alias, but makes it semantically clearer.
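A minimal sketch of what such a type alias could look like. The alias name SPLIT_TYPE comes from the comment above; the concrete split names and the sample count are assumptions made for illustration, not values from this PR.

```python
from typing import Literal, get_args

# Hypothetical alias: the split names below are illustrative assumptions,
# not a list taken from this PR.
SPLIT_TYPE = Literal["train", "validation", "test"]

# The annotation now states what the keys mean: split name -> sample count.
n_samples: dict[SPLIT_TYPE, int] = {"test": 1000}

# get_args exposes the allowed names, e.g. for a simple runtime check.
allowed_splits = get_args(SPLIT_TYPE)
unknown = [split for split in n_samples if split not in allowed_splits]
```

A plain alias such as SPLIT_TYPE = str would give the same readability benefit without restricting the keys; the Literal form additionally lets a type checker flag misspelled split names.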

Contributor

Might also be nice to provide a little utility that gets this info, and document the utility here.
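One possible shape for such a utility, as a sketch only. The function name count_samples_per_split is hypothetical, and it assumes the dataset behaves like a mapping from split name to a sized collection of examples; a plain dict stands in for a real dataset here.

```python
from collections.abc import Iterable, Mapping, Sized


def count_samples_per_split(
    dataset: Mapping[str, Sized], splits: Iterable[str]
) -> dict[str, int]:
    """Return {split name: number of examples} for the requested splits.

    Hypothetical helper: assumes dataset[split] supports len().
    """
    return {split: len(dataset[split]) for split in splits}


# Toy stand-in for a loaded dataset with two splits.
toy_dataset = {"train": ["a", "b", "c"], "test": ["d", "e"]}
n_samples = count_samples_per_split(toy_dataset, ["test"])
```

The result can be pasted into the n_samples field of a task's metadata, which keeps the documented sizes in step with the data actually loaded.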

Contributor Author

It would be possible to check it using the eval_splits attribute.
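A sketch of what such a check might look like, assuming eval_splits is a list of split names and n_samples maps split names to counts. The function name splits_missing_sizes is hypothetical, and whether missing entries should count as an error is left open, since many tasks may not have sizes filled in yet.

```python
def splits_missing_sizes(eval_splits: list[str], n_samples: dict[str, int]) -> list[str]:
    """Return the evaluation splits that have no entry in n_samples.

    Hypothetical helper: an empty result means every evaluated split
    has a documented size.
    """
    return [split for split in eval_splits if split not in n_samples]


# Illustrative values only; these are not taken from any real task.
missing = splits_missing_sizes(["validation", "test"], {"test": 1000})
```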

mteb/abstasks/TaskMetadata.py Outdated
Base automatically changed from add-mmteb to main March 24, 2024 11:41
@KennethEnevoldsen
Contributor Author

@MartinBernstorff it seems like we generally agree on this PR. I have left one comment unresolved as I am not entirely sure what you mean, but we can add it in a future PR.

@KennethEnevoldsen KennethEnevoldsen merged commit cd4a012 into main Mar 24, 2024
3 checks passed
@KennethEnevoldsen KennethEnevoldsen deleted the add-size-to-meta branch March 24, 2024 12:35
2 participants