Ensure tasks are neither trivially easy nor impossible #248

MartinBernstorff · 2024-03-15T14:06:33Z

When writing #247, the dataset contained two possible "label" columns:

Rating: A rating of the cohesiveness of the comment
Domain: Whether the text is from Wikipedia or Reddit

I did not make a task with the domain column as label, because I imagined it would be trivially easy. Perhaps ideally, the PR submission process should test this? E.g. also submit the run-information as .json, to see that the task is non-trivial?

On the other end of the spectrum, it's probabyl also important to ensure that the task is doable. We could do this by running the task with e.g. both

intfloat/multilingual-e5-small and
sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2,

and ensuring there is a differential between them.

To facilitate making this easy to do, I suggest:

Writing a wrapper function that users can call, which runs a task with the two models
Modifying the evaluation script to:
- Suffix the task result name with the model, e.g. amazon_counterfactual_classification_e5_small.json
- Log the location of the result.json to terminal after running evaluation, to make it apparent for the user where they are located

Been a pleasure so far! What do you guys think? 😊

The text was updated successfully, but these errors were encountered:

KennethEnevoldsen · 2024-03-17T16:49:16Z

@imenelydiaker would like your thoughts on this. I generally agree with @MartinBernstorff of at least providing the results, though I would probably just use the CLI for it.

imenelydiaker · 2024-03-18T15:58:34Z

I answered this here: #254. It's a really great idea! 🤩

KennethEnevoldsen · 2024-03-24T12:43:13Z

This has been added in #275

MartinBernstorff mentioned this issue Mar 19, 2024

Added model results to repo and updated CLI to create consistent folder structure. #254

Merged

KennethEnevoldsen closed this as completed Mar 24, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ensure tasks are neither trivially easy nor impossible #248

Ensure tasks are neither trivially easy nor impossible #248

MartinBernstorff commented Mar 15, 2024 •

edited

Loading

KennethEnevoldsen commented Mar 17, 2024

imenelydiaker commented Mar 18, 2024

KennethEnevoldsen commented Mar 24, 2024

Ensure tasks are neither trivially easy nor impossible #248

Ensure tasks are neither trivially easy nor impossible #248

Comments

MartinBernstorff commented Mar 15, 2024 • edited Loading

KennethEnevoldsen commented Mar 17, 2024

imenelydiaker commented Mar 18, 2024

KennethEnevoldsen commented Mar 24, 2024

MartinBernstorff commented Mar 15, 2024 •

edited

Loading