Added integration test for four model types #89

x-tabdeveloping · 2024-01-23T11:00:07Z

Integration test for:

SBERT
E5
Translate -> E5
FastText

x-tabdeveloping · 2024-01-23T11:00:41Z

Addresses #84

x-tabdeveloping · 2024-01-23T11:11:19Z

Ah crap I forgot DKHate was gated

KennethEnevoldsen

One change, otherwise looks good.

tests/cli/test_cli.py

x-tabdeveloping · 2024-01-24T08:34:18Z

So, problem: it simply takes too long to download all the models and then run them, even if we only take the four that are currently in the test. The FastText model is especially heavyweight, which is a bummer because I just figured out that it had a bug. The translate E5 also has a very hefty translation model, so I'm frankly not sure how we could pull this off, especially on GitHub. I will try to think of a smart solution. Any thoughts? @KennethEnevoldsen

x-tabdeveloping · 2024-01-24T08:37:15Z

I think, for one, we could drop all-MiniLM-L6-v2 and the regular E5, since translate-e5 depends on the same functionality, so we technically hit three birds with one stone (though if a pull request changes these dependencies, this would no longer be the case).

x-tabdeveloping · 2024-01-24T08:40:44Z

With FastText I'm at a bit of a loss. Maybe we could add the wiki models to the test, as they have a smaller vocabulary?

KennethEnevoldsen · 2024-01-24T09:02:23Z

I would exclude the translation model and potentially remove the fasttext (though ideally download a smaller one). One thing you can also do is have it reuse the cache dir of the models (but that is mostly to speed things up).

x-tabdeveloping · 2024-01-24T09:53:01Z

I believe it should run in a reasonable timeframe now.

x-tabdeveloping · 2024-01-24T10:12:25Z

Hmm very weird behaviour. Pybind is clearly installed and the fasttext install still doesn't recognize it, and I can't reproduce this on my own machine, something fishy going on.

x-tabdeveloping · 2024-01-24T10:34:14Z

Maybe adding an actual task (not just dummy) with all-MiniLM could be a good idea, but I'm not entirely sure cause right now it tests most types of models and runs relatively fast, what do you think?

KennethEnevoldsen · 2024-01-24T10:43:07Z

> Maybe adding an actual task (not just dummy) with all-MiniLM could be a good idea, but I'm not entirely sure cause right now it tests most types of models and runs relatively fast, what do you think?

For the integration test it would be nice if it is an MTEB task.
However, one option is to mark the test as "slow" and only run it in the GitHub test suite (so you don't have to run it locally). I don't think one task is too bad though, especially if we pick LCC.
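A minimal sketch of the "slow" idea, assuming pytest is the test runner. The helper `running_in_ci`, the `slow` decorator, and the test name are hypothetical and not taken from the repository; the only outside fact relied on is that GitHub Actions sets the `CI` environment variable to `true`.

```python
import os

import pytest


def running_in_ci(env=None) -> bool:
    """Return True when the CI variable is "true" (GitHub Actions sets it by default)."""
    env = os.environ if env is None else env
    return str(env.get("CI", "")).lower() == "true"


# Skip locally, run on CI. The decorator name and reason text are illustrative.
slow = pytest.mark.skipif(
    not running_in_ci(),
    reason="slow integration test, runs only in CI",
)


@slow
def test_minilm_on_lcc():
    # Placeholder body; the real test would load the model and run the task.
    assert True
```

An alternative with the same effect would be a custom `slow` marker that is deselected locally with `pytest -m "not slow"`.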

x-tabdeveloping · 2024-01-24T10:45:00Z

Okay, how about we keep the dummy task for the heftier models (so that we can test whether they run at all). And then add LCC for a lighter model (like MiniLM)?
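The split proposed here could look roughly like the following, assuming pytest. This is only a sketch: `run_benchmark` is a stand-in for whatever the project actually calls, and the two placeholder model names are invented for illustration. Only `all-MiniLM-L6-v2`, LCC, and the dummy task come from this thread.

```python
import pytest

# (model, task) pairs: a real task for the light model, the dummy task for heavy ones.
# The two "*-placeholder" names are hypothetical.
CASES = [
    ("all-MiniLM-L6-v2", "LCC"),
    ("translate-e5-placeholder", "dummy"),
    ("fasttext-placeholder", "dummy"),
]


def run_benchmark(model_name: str, task_name: str) -> dict:
    """Stand-in for the project's real runner, which is not shown in this thread."""
    return {"model": model_name, "task": task_name, "ok": True}


@pytest.mark.parametrize("model_name, task_name", CASES)
def test_model_runs(model_name, task_name):
    # For the dummy task this only checks that the model runs at all.
    result = run_benchmark(model_name, task_name)
    assert result["ok"]
```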

KennethEnevoldsen · 2024-01-24T10:46:42Z

Add a comment for it.

> Hmm very weird behaviour. Pybind is clearly installed and the fasttext install still doesn't recognize it, and I can't reproduce this on my own machine, something fishy going on.

If this is still a problem I would remove fasttext from the integration test for now and then get this merged in and create an issue.

x-tabdeveloping · 2024-01-24T10:48:05Z

It does work now fortunately, I fixed it by using the fasttext-wheel package from PyPI instead, which uses the old ways to install the thingy. It's a bit of a nasty workaround though.

KennethEnevoldsen · 2024-01-24T11:40:03Z

> It does work now fortunately, I fixed it by using the fasttext-wheel package from PyPI instead, which uses the old ways to install the thingy. It's a bit of a nasty workaround though.

As long as it is an optional dependency, I don't have too many issues.
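For reference, one common way to keep a dependency optional is to check for the module before importing it. The sketch below uses only the standard library; `has_module` and `load_fasttext` are hypothetical names, and it assumes the installed package is importable as `fasttext`.

```python
import importlib.util


def has_module(name: str) -> bool:
    """Return True if a top-level module with this name can be found."""
    return importlib.util.find_spec(name) is not None


# Assumption: the optional package exposes a module named `fasttext`.
FASTTEXT_AVAILABLE = has_module("fasttext")


def load_fasttext():
    """Import fasttext lazily and fail with a readable message if it is missing."""
    if not FASTTEXT_AVAILABLE:
        raise ImportError(
            "fastText support is optional; install the extra that provides "
            "the `fasttext` module first."
        )
    import fasttext

    return fasttext
```

With this pattern the rest of the package imports cleanly even when the optional extra is not installed, and tests for those models can be skipped based on `FASTTEXT_AVAILABLE`.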

x-tabdeveloping · 2024-01-24T12:21:57Z

@KennethEnevoldsen is this fine with you as it stands or do you want me to change something?

KennethEnevoldsen

Only a few minor things, but otherwise feel free to merge.

makefile

tests/test_integration.py

x-tabdeveloping · 2024-01-26T11:29:14Z

@KennethEnevoldsen tests are failing because of an assertion. Do you have an explanation?

KennethEnevoldsen · 2024-01-26T11:34:28Z

It is because of the score not being correct:

is_approximately_equal(0.40919975266183484, 0.423)

It is due to an incorrect merge, I believe. I will mark the sections.
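To make the failure concrete, here is a small standard-library sketch. `roughly_equal` is a hypothetical stand-in, not the project's `is_approximately_equal`, and the tolerance of 0.001 is an assumption; the two scores are the ones from the failing assertion.

```python
import math


def roughly_equal(actual: float, expected: float, tolerance: float = 0.001) -> bool:
    """Compare two scores using an absolute tolerance only (assumed value)."""
    return math.isclose(actual, expected, rel_tol=0.0, abs_tol=tolerance)


actual, expected = 0.40919975266183484, 0.423
gap = abs(actual - expected)  # about 0.0138, well above the assumed tolerance

print(roughly_equal(actual, expected))  # False under the assumed tolerance
```

A gap of this size points to a changed expected value or changed test data rather than floating-point noise, which fits the incorrect-merge explanation.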

tests/cli/test_cli.py

Added integration test for four model types

61ff3bb

x-tabdeveloping requested a review from KennethEnevoldsen January 23, 2024 11:00

x-tabdeveloping linked an issue Jan 23, 2024 that may be closed by this pull request

Integration tests for all model types. #84

Closed

Changed DKHate to LCC

8c004af

KennethEnevoldsen approved these changes Jan 23, 2024

tests/cli/test_cli.py (outdated, resolved)

x-tabdeveloping added 4 commits January 23, 2024 15:20

Moved integration test to new file

6d3ba6d

Renamed test function

3c50f4f

Added fasttext to testing dependencies

045de9f

Added pybind install to makefile

31d2a35

x-tabdeveloping enabled auto-merge January 23, 2024 15:14

Fixed issue with fasttext models

0a015ba

x-tabdeveloping added 2 commits January 24, 2024 10:40

Added Dummy task to integration test, remove all-MiniLM-L6

2f75712

Added fasttext as dependency in the makefile

012a689

Added pybind as an optional dependency for fasttext

77cdd4b

Switched out fasttext package with fasttext-wheel

2cd74d4

x-tabdeveloping disabled auto-merge January 24, 2024 10:33

x-tabdeveloping force-pushed the stuff_runs_tests branch from dd8bc70 to 2cd74d4 Compare January 24, 2024 10:40

Added return type to appease linter

828e556

Added integration test for MiniLM + LCC

4560a16

x-tabdeveloping requested a review from KennethEnevoldsen January 24, 2024 12:22

KennethEnevoldsen approved these changes Jan 24, 2024

makefile (outdated, resolved)

tests/test_integration.py (outdated, resolved)

tests/test_integration.py (outdated, resolved)

x-tabdeveloping added 2 commits January 26, 2024 09:56

Moved models to @parametrize

fafcb39

Merge branch 'main' into stuff_runs_tests

01e8979

KennethEnevoldsen reviewed Jan 26, 2024

tests/cli/test_cli.py (resolved)

Reset test cases incorrectly overwritten by a merge conflict resolution

ccdd886

x-tabdeveloping merged commit f427a6d into main Jan 26, 2024
4 of 6 checks passed

x-tabdeveloping deleted the stuff_runs_tests branch January 26, 2024 12:10
