new-release? #494

michaelfeil · 2024-12-10T07:51:43Z

Related Issue

Checklist

I have read the CONTRIBUTING guidelines.
I have added tests to cover my changes.
I have updated the documentation (docs folder) accordingly.

Additional Notes

Add any other context about the PR here.

greptile-apps

PR Summary

This PR introduces support for matryoshka embeddings and the nomic-ai/nomic-embed-text-v1.5 model through several key changes:

Added matryoshka dimension support with new dimensions field across text, audio and image embedding models, including validation and slicing logic
Added einops dependency and updated version to 0.0.73 to support nomic-ai/nomic-embed-text-v1.5 model requirements
Introduced MatryoshkaDimError exception class for proper dimension validation handling
Simplified error handling in audio/vision utils by removing redundant try-except blocks
Standardized error message format across endpoints using {ex.__class__} -> {ex} pattern

The changes appear well-structured and maintain backward compatibility while adding new capabilities.

_{12 file(s) reviewed, 10 comment(s)}
_{Edit PR Review Bot Settings | Greptile}

greptile-apps · 2024-12-10T07:52:23Z

libs/client_infinity/infinity_client/infinity_client/models/open_ai_embedding_input_text.py

        modality (Union[Unset, OpenAIEmbeddingInputTextModality]):  Default: OpenAIEmbeddingInputTextModality.TEXT.
    """

    input_: Union[List[str], str]
    model: Union[Unset, str] = "default/not-specified"
    encoding_format: Union[Unset, EmbeddingEncodingFormat] = UNSET
    user: Union[None, Unset, str] = UNSET
+    dimensions: Union[Unset, int] = 0


style: dimensions default value of 0 should be documented in docstring to match other attributes

libs/client_infinity/infinity_client/infinity_client/models/open_ai_embedding_input_text.py

libs/client_infinity/infinity_client/infinity_client/models/open_ai_embedding_input_audio.py

libs/infinity_emb/infinity_emb/fastapi_schemas/pymodels.py

greptile-apps · 2024-12-10T07:52:59Z

libs/infinity_emb/infinity_emb/fastapi_schemas/pymodels.py

@@ -24,6 +24,7 @@
    RootModel,
    Tag,
    conlist,
+    conint,


style: conint is imported but not used in the changes shown

greptile-apps · 2024-12-10T07:52:59Z

libs/infinity_emb/infinity_emb/fastapi_schemas/pymodels.py

@@ -54,7 +55,7 @@ class _OpenAIEmbeddingInput(BaseModel):
    model: str = "default/not-specified"
    encoding_format: EmbeddingEncodingFormat = EmbeddingEncodingFormat.float
    user: Optional[str] = None
-    dimensions: Optional[Annotated[int, Field(strict=True, gt=0, lt=8193)]] = None
+    dimensions: int = 0


logic: changing from Optional[int] to int with default=0 is a breaking change for API clients expecting null values

greptile-apps · 2024-12-10T07:53:04Z

libs/infinity_emb/infinity_emb/inference/batch_handler.py

+def matryososka_slice(
+    embeddings: list[np.ndarray], matryoshka_dim: Optional[int]
+) -> list[np.ndarray]:


syntax: function name 'matryososka_slice' is misspelled, should be 'matryoshka_slice'

greptile-apps · 2024-12-10T07:53:05Z

libs/infinity_emb/infinity_emb/inference/batch_handler.py

            getattr(self.model_worker[0]._model, "sampling_rate", -42),
        )


style: sampling rate fallback of -42 should be documented or use a more meaningful default

greptile-apps · 2024-12-10T07:53:09Z

libs/infinity_emb/infinity_emb/primitives.py

+class MatryoshkaDimError(Exception):
+    pass


style: Empty exception class needs docstring explaining when this error is raised and what it means

greptile-apps · 2024-12-10T07:53:28Z

libs/infinity_emb/infinity_emb/transformer/audio/utils.py

+        resolved_audios = await asyncio.gather(
+            *[resolve_audio(audio, allowed_sampling_rate, session) for audio in audio_urls]
+        )


style: consider using gather with return_exceptions=True to handle partial failures more gracefully

codecov-commenter · 2024-12-10T08:04:48Z

⚠️ Please install the to ensure uploads and comments are reliably processed by Codecov.

Codecov Report

Attention: Patch coverage is 95.00000% with 1 line in your changes missing coverage. Please review.

Project coverage is 79.92%. Comparing base (edd9107) to head (b1eae04).
Report is 2 commits behind head on main.

Files with missing lines	Patch %	Lines
...finity_emb/infinity_emb/inference/batch_handler.py	88.88%	1 Missing ⚠️

❗ Your organization needs to install the Codecov GitHub app to enable full functionality.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #494      +/-   ##
==========================================
+ Coverage   79.68%   79.92%   +0.23%     
==========================================
  Files          42       42              
  Lines        3476     3467       -9     
==========================================
+ Hits         2770     2771       +1     
+ Misses        706      696      -10

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

new-release?

9e400f4

greptile-apps bot reviewed Dec 10, 2024

View reviewed changes

rm conint

932b2bc

michaelfeil added 2 commits December 10, 2024 00:18

fix: optimum classifier

103cb6f

lint

b1eae04

michaelfeil merged commit d614094 into main Dec 10, 2024
36 checks passed

michaelfeil deleted the 0.0.73-release branch December 10, 2024 08:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

new-release? #494

new-release? #494

michaelfeil commented Dec 10, 2024

greptile-apps bot left a comment

greptile-apps bot Dec 10, 2024

greptile-apps bot Dec 10, 2024

greptile-apps bot Dec 10, 2024

greptile-apps bot Dec 10, 2024

greptile-apps bot Dec 10, 2024

greptile-apps bot Dec 10, 2024

greptile-apps bot Dec 10, 2024

codecov-commenter commented Dec 10, 2024 •

edited

Loading

		getattr(self.model_worker[0]._model, "sampling_rate", -42),
		)

new-release? #494

new-release? #494

Conversation

michaelfeil commented Dec 10, 2024

Related Issue

Checklist

Additional Notes

greptile-apps bot left a comment

Choose a reason for hiding this comment

PR Summary

greptile-apps bot Dec 10, 2024

Choose a reason for hiding this comment

greptile-apps bot Dec 10, 2024

Choose a reason for hiding this comment

greptile-apps bot Dec 10, 2024

Choose a reason for hiding this comment

greptile-apps bot Dec 10, 2024

Choose a reason for hiding this comment

greptile-apps bot Dec 10, 2024

Choose a reason for hiding this comment

greptile-apps bot Dec 10, 2024

Choose a reason for hiding this comment

greptile-apps bot Dec 10, 2024

Choose a reason for hiding this comment

codecov-commenter commented Dec 10, 2024 • edited Loading

Codecov Report

codecov-commenter commented Dec 10, 2024 •

edited

Loading