[FEATURE] Separate Local Model Registration to support Custom, Pretrained and Sparse Encoding Models
Is your feature request related to a problem?
Flaky integration test failures have been traced to the memory limitations of the GitHub Actions runners that execute integration tests (Reference). Mitigating this requires using a smaller model to test local model registration within an integration test cluster.
The amazon/neural-sparse/opensearch-neural-sparse-tokenizer-v1 model has been determined to be the smallest model we can use for testing (Documentation); however, the required fields for registering a sparse encoding model differ from the required fields for registering a custom text embedding model.
In order to replace the model used in integration testing with this sparse encoding model, the RegisterLocalModelStep must be separated into a RegisterCustomLocalModelStep and a RegisterSparseEncodingLocalModelStep. Additionally, we require support for registering an OpenSearch-provided pretrained model, which does not require a URL.
What solution would you like?
The RegisterCustomLocalModelStep relates to the following documentation and will have the following required and optional fields (see the example sketch after this list):
Required keys:
name
version
model_format
function_name
model_content_hash_value
url
model_type
embedding_dimension
framework_type
Optional keys:
description
model_group_id
all_config
deploy
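For illustration, the inputs carried by this step might look like the sketch below. This is only a sketch: the model name, URL, hash, dimension, and config values are placeholders, the optional keys are shown with dummy values, and the exact shape of the step's user_inputs is an assumption rather than a confirmed interface.

```json
{
  "name": "sentence-transformers/msmarco-distilbert-base-tas-b",
  "version": "1.0.0",
  "model_format": "TORCH_SCRIPT",
  "function_name": "TEXT_EMBEDDING",
  "model_content_hash_value": "<sha256 checksum of the model zip>",
  "url": "https://example.com/models/custom-text-embedding.zip",
  "model_type": "bert",
  "embedding_dimension": 768,
  "framework_type": "sentence_transformers",
  "description": "Optional: custom text embedding model for integration testing",
  "model_group_id": "<optional model group id>",
  "all_config": "<optional serialized model config>",
  "deploy": true
}
```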
The RegisterSparseEncodingModelStep relates to the following documentation and will have the following required and optional fields (see the example sketch after this list):
Required keys:
name
version
model_format
function_name
model_content_hash_value
url
Optional keys:
description
model_group_id
deploy
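Similarly, a sketch of the inputs for registering the sparse model mentioned above might look as follows. The version, function_name value, and URL are assumptions for illustration only; the keys mirror the list above.

```json
{
  "name": "amazon/neural-sparse/opensearch-neural-sparse-tokenizer-v1",
  "version": "1.0.1",
  "model_format": "TORCH_SCRIPT",
  "function_name": "SPARSE_TOKENIZE",
  "model_content_hash_value": "<sha256 checksum of the model zip>",
  "url": "https://example.com/models/neural-sparse-tokenizer-v1.zip",
  "description": "Optional: sparse encoding tokenizer for integration testing",
  "deploy": true
}
```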
The RegisterLocalPretrainedModelStep relates to the following documentation and will have the following required and optional fields (see the example sketch after this list):
Required keys:
name
version
model_format
Optional keys:
description
model_group_id
deploy
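For the pretrained case, only the model identity is needed, since no URL is required. A minimal sketch, assuming the tokenizer model referenced above and a placeholder version string:

```json
{
  "name": "amazon/neural-sparse/opensearch-neural-sparse-tokenizer-v1",
  "version": "1.0.1",
  "model_format": "TORCH_SCRIPT",
  "deploy": true
}
```

Because no url or model_content_hash_value is supplied, this form would only apply to models that OpenSearch provides and can resolve itself.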