Changed transformers_client design so it can support most (all?) HuggingFace models #208

adjahossoualexandre · 2024-09-11T16:19:29Z

New design

Split TransformerClient into 3 client classes:

TransformerEmbeddingModelClient

Merger of TransformerEmbedder and TransformerClient

TransformerRerankerModelClient

Merger of TransformerReranker and TransformerClient

TransformerLLMModelClient

Merger of TransformerLLM and TransformerClient
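As a rough illustration of what one merged client might look like, here is a minimal sketch of an embedding client that owns both the tokenizer/model setup and the inference call. The class name `EmbeddingClientSketch`, the `load_tokenizer`/`load_model` parameters, and the fake loaders are hypothetical and exist only so the example runs without downloading weights; the real classes in this PR presumably wrap the HuggingFace auto classes instead.

```python
from typing import Any, Callable, Dict, List, Optional


class EmbeddingClientSketch:
    """Hypothetical stand-in for TransformerEmbeddingModelClient."""

    def __init__(
        self,
        model_name: str,
        load_tokenizer: Callable[[str], Any],  # assumption: injected loader
        load_model: Callable[[str], Any],      # assumption: injected loader
        tokenizer_kwargs: Optional[Dict[str, Any]] = None,
    ) -> None:
        self.model_name = model_name
        # Use None as the default and copy, so instances never share one dict.
        self.tokenizer_kwargs = dict(tokenizer_kwargs or {})
        # Default to PyTorch tensors unless the caller chose otherwise.
        self.tokenizer_kwargs.setdefault("return_tensors", "pt")
        # Initialisation lives in the client itself.
        self.tokenizer = load_tokenizer(model_name)
        self.model = load_model(model_name)

    def call(self, texts: List[str]) -> List[List[float]]:
        # Inference lives in the same class as initialisation.
        encoded = self.tokenizer(texts, **self.tokenizer_kwargs)
        return self.model(encoded)


def fake_load_tokenizer(name: str) -> Callable[..., List[List[str]]]:
    # Toy tokenizer: splits on whitespace and ignores the kwargs.
    def tokenize(texts: List[str], **kwargs: Any) -> List[List[str]]:
        return [text.split() for text in texts]
    return tokenize


def fake_load_model(name: str) -> Callable[[List[List[str]]], List[List[float]]]:
    # Toy model: a one-dimensional "embedding" equal to the token count.
    def embed(batch: List[List[str]]) -> List[List[float]]:
        return [[float(len(tokens))] for tokens in batch]
    return embed


client = EmbeddingClientSketch("any/model-id", fake_load_tokenizer, fake_load_model)
vectors = client.call(["hello world", "hi"])
```

With this shape, switching models would mostly mean changing the arguments passed to the constructor.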

Pros:

Abstracts model/tokenizer initialisation as well as inference code.
Hence, most of the time, the only customization needed is to change some parameters when instantiating the client.
Works with many different model architectures (example here).
Makes the code easier to read and more maintainable, since each class takes care of only one client.
Easier to customize for users as it removes the need for model SDKs
Removes the need to check whether a model is supported or not

Cons:

The tradeoff is that inference code becomes coupled with response and API kwargs handling.

Added 'tokenizer_kwargs' to the TransformerLLMModelClient constructor for more flexibility.
Added a chat template argument to the constructor for more flexibility.
Added a pad token check.
Added the tokenizer in '_infer_from_pipeline()' when chat_template is used (required).
Fixed _handle_input() for 'apply_chat_template'==True.
Not sure: fixed message in convert_inputs_to_api_kwargs().
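For readers unfamiliar with why a pad token check matters: some tokenizers ship without a pad token, which breaks batched padding. The sketch below shows one common way to handle this, falling back to the EOS token. It is an assumption about what such a check could look like, not the code in this PR; `ensure_pad_token` is a hypothetical helper, and a `SimpleNamespace` stands in for a real tokenizer.

```python
from types import SimpleNamespace
from typing import Any


def ensure_pad_token(tokenizer: Any) -> Any:
    """Fall back to the EOS token when the tokenizer defines no pad token."""
    if getattr(tokenizer, "pad_token", None) is None:
        eos = getattr(tokenizer, "eos_token", None)
        if eos is None:
            raise ValueError("tokenizer has neither pad_token nor eos_token")
        tokenizer.pad_token = eos
    return tokenizer


# Stand-in object exposing the two attributes the helper reads.
tok = SimpleNamespace(pad_token=None, eos_token="</s>")
ensure_pad_token(tok)
```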

Moved get_device and clean_device_cache to the top of the file. Allow the user to specify auto classes for Reranker models.

adjahossoualexandre added 30 commits August 28, 2024 18:21

DRAFT: merge TransformerClient & TransformerEmbedder into 1 class.

e8a690b

Fixed typo.

123396c

Added type hints to signatures + removed now useless model_type.

2121774

Removed now useless model_types.

e2023b2

Added test for TransformerEmbeddingModelClient execution.

a93bd6a

Changed my mind.

424cbfb

Changed my mind. removing model type might introduce issues.

ef2783d

Removed now useless argument.

3601445

Removed now useless arguments.

3cdab7b

DRAFT: merge TransformerClient and TransformerLLM in 1 class.

2d1152f

Added tests for TransformerLLMModelClient.

7daf8ae

Removed temporary log.

a4eb3bb

Changed my mind: added model_type back into call().

6a9c657

Changed my mind about model type. See prev commit.

cd1823b

Ensured tokenizer_kwargs has 'return_tensors' set to 'pt' by default.

2bf711a

DRAFT: merge TransformerClient and TransformerReranker in 1 class.

570c8b1

Commented out old classes.

881cfb6

Added tests for TransformerRerankerModelClient.

bd610d2

Add test for llm response + remove test for old class.

25fe834

Fixed test class name.

e3b3b25

Multiline message:

2c6ee8c

Moved get_device and clean_device_cache to the top of the file. Allow the user to specify auto classes for Reranker models.

Deleted code for the old TransformerClientClass.

1bd8545

Added __doc__ for the client classes.

0059bbf

Formatting.

53d5384

Added example for transformers_client module + fixed import.

be8041c

Restored original file.

f4abfeb

Fixed test class name.

39f2ae2

Added kwargs for model and tokenizer init.

5481ee7

Fixed typo.

eab39cd

adjahossoualexandre added 6 commits September 11, 2024 10:53

Fixed missing tokenizer_kwargs in TransformerLLMModelClient.

dcd3a7a

Added local_files_only to TransformerEmbeddingModelClient.

8e39e01

Fixed mutable default arguments.

92faa91

Removed dict args that were conflicting with @lru_cache.

6e5f109

Added tests to check transformer_client compatibility with different models.

508079d

Formatting.

c8fe73a
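The commit "Removed dict args that were conflicting with @lru_cache." reflects a general Python constraint: `functools.lru_cache` hashes its arguments, so passing a dict raises `TypeError`. The sketch below demonstrates the failure and a cache-friendly signature that takes only hashable arguments. The function names are hypothetical and the "loading" is faked with a string.

```python
from functools import lru_cache


@lru_cache(maxsize=None)
def load_model_cached(model_name: str, local_files_only: bool = False) -> str:
    # Only hashable arguments (str, bool), so the cache can key on them.
    return f"loaded:{model_name}:{local_files_only}"


@lru_cache(maxsize=None)
def load_model_with_dict(model_name: str, model_kwargs: dict) -> str:
    return f"loaded:{model_name}"


# Passing a dict fails before the function body runs, because dicts
# are unhashable and cannot be used as part of the cache key.
try:
    load_model_with_dict("some/model-id", {"trust_remote_code": False})
    dict_arg_failed = False
except TypeError:
    dict_arg_failed = True

first = load_model_cached("some/model-id")
second = load_model_cached("some/model-id")  # served from the cache
```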