Refinement: setting batch_size for different models #212
Conversation
Will format the code shortly.
Please reformat your code to resolve the build failure: https://github.com/CambioML/uniflow-llm-based-pdf-extraction-text-cleaning-data-clustering/actions/runs/8164445774/job/22319716558?pr=212
Could you please also check all the notebooks to see whether they need corresponding changes, as well as the cookbook repo? Get @jojortz to review the cookbook repo.
Noted.
if not batch_size:
    batch_size = self._config.model_config.get(
        "num_thread", 1
    )  # pylint: disable=no-member
Please add a comment explaining why we are handling it this way, especially the logic we discussed regarding locally hosted models versus proprietary models.
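For illustration, here is a hedged sketch of the fallback the reviewer wants documented, pulled out of the class so it runs on its own. The function name resolve_batch_size and the plain dict standing in for self._config.model_config are assumptions, and the comment wording is a suggestion rather than the author's final text.

def resolve_batch_size(batch_size, model_config: dict) -> int:
    # Sketch only, not uniflow's actual code.
    if not batch_size:
        # Proprietary API models (OpenAI, Azure OpenAI, Google) do not support
        # real batch inference here, so their configs omit batch_size; fall
        # back to num_thread, which only groups inputs for the thread pool
        # executor ("fake batching"). Locally hosted models pass batch_size in.
        batch_size = model_config.get("num_thread", 1)
    return batch_size

print(resolve_batch_size(None, {"num_thread": 4}))  # proprietary model -> 4
print(resolve_batch_size(8, {"num_thread": 4}))     # explicit batch_size -> 8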
uniflow/op/model/model_config.py (outdated diff)
@@ -26,7 +26,7 @@ class GoogleModelConfig(ModelConfig):
     candidate_count: int = 1
     num_thread: int = 1
     # this is not real batch inference, but size to group for thread pool executor.
-    batch_size: int = 1
+    # batch_size: int = 1
You should just remove them instead of commenting them out, and do the same for all the ones below. Then, check again to make sure all batch_size fields are removed for the locally hosted model.
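For reference, a minimal sketch of what GoogleModelConfig could look like once batch_size is dropped entirely instead of commented out. ModelConfig lives in the same module in the real repo; the placeholder base class below (and its model_name default) is an assumption so the snippet stands alone.

from dataclasses import dataclass


@dataclass
class ModelConfig:
    # Placeholder for the real base class in uniflow/op/model/model_config.py.
    model_name: str = "gemini-pro"


@dataclass
class GoogleModelConfig(ModelConfig):
    candidate_count: int = 1
    # Not real batch inference: num_thread is only the group size handed to
    # the thread pool executor ("fake batching") for this API-based model.
    num_thread: int = 1


print(GoogleModelConfig())  # GoogleModelConfig(model_name='gemini-pro', candidate_count=1, num_thread=1)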
…y 2. If batch_size is not found, use num_thread instead
Since Google's, OpenAI's, and Azure OpenAI's models don't support batching locally, we will instead use num_thread to "fake batch" the data.