Enable using hybrid retrieval at deploy. #107

vkehfdl1 · 2024-02-03T19:36:40Z

The original hybrid_rrf and hybrid_cc functions takes ids and scores as their input. So, it was hard to use these hybrid modules at deploy.
To understand this difference, you must understand the way optimization and deploy run is working.
As you know, the module functions (including its decorator of course) is treat as independent functions both on optimization and deployment process.
In optimization process, we rely run.py functions for executing each module functions. Most of the case, run.py do not run something special for running modules, but hybrid was special. To running hybrid functions, retrieval run.py must select best modules among user's target_modules input, and extracting its ids and scores. Finally, we could use hybrid modules properly.
However, in deployment Runner.run, do not use run.py functions. Because run.py functions merely contains optimization process, and it is super inefficient to use run.py at deployment feature.
Instead using run.py, at Runner extracts the best module name and module parameters from summary.csv, which made at run.py, and construct new config dictionary. With that dictionary, Runner can run whole modules one by one with selected parameters.

And here was the problem. As you know, hybrid modules must pass ids and scores, and that parameters made at run.py. But, at summary.csv, there was no ids and scores parameters, but target_modules, because summary.csv save user's input parameters as default. Hence, it was impossible to run hybrid modules at deployment Runner class.

So, what was the solution?
I swapped module parameters of hybrid modules at summary.csv in retrieval run.py. It means, I delete idsand scores at module params, and add target_modules and target_module_params as new hybrid module params.
And at the retrieval decorator, if there are no ids and scores parameters, I run other retrieval module with input target_modules and target_module_params.
In this way, you can run another retrieval module at retrieval node decorator, and obtain ids and scores, which is input of hybrid modules.
Since summary.csv module params saved with target_modules and target_module_params, we now can use hybrid modules at deployment!!

close #91

p.s. I thought it was great challenge for me, but it resolved pretty simple. I think the isolation of three parts (optimization, deployment runner, and modules) is really great and flexible. Maybe we can find a way to resolve some other weird methods, thanks to this isolation structure.

…s and target_modules

…arget_module_params for deploy

Eastsidegunn

LGTM

bwook00

LGTM

jeffrey added 8 commits February 3, 2024 11:46

add full trail result at resources folder for testing Runner

2fde5a7

test by full trial_folder and run whole pipeline.

0db8cc5

Merge branch 'main' into Feature/#83

64963c6

Fix pytest fixture that contains config.yaml

93309ad

Using hybrid retrieval without using run.py, with target_module_param…

293b8f7

…s and target_modules

delete ids and scores module at summary, and add target_modules and t…

25978e8

…arget_module_params for deploy

Merge branch 'main' into Feature/#91

1fe86bd

Merge branch 'main' into Feature/#91

60f6dcc

vkehfdl1 requested review from Eastsidegunn and bwook00 February 3, 2024 19:36

vkehfdl1 enabled auto-merge (squash) February 3, 2024 19:36

Eastsidegunn approved these changes Feb 3, 2024

View reviewed changes

bwook00 approved these changes Feb 3, 2024

View reviewed changes

vkehfdl1 merged commit 4c3c356 into main Feb 3, 2024
2 checks passed

vkehfdl1 deleted the Feature/#91 branch February 3, 2024 20:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable using hybrid retrieval at deploy. #107

Enable using hybrid retrieval at deploy. #107

vkehfdl1 commented Feb 3, 2024

Eastsidegunn left a comment

bwook00 left a comment

Enable using hybrid retrieval at deploy. #107

Enable using hybrid retrieval at deploy. #107

Conversation

vkehfdl1 commented Feb 3, 2024

Eastsidegunn left a comment

Choose a reason for hiding this comment

bwook00 left a comment

Choose a reason for hiding this comment