[ML] Include the chunk text offsets in chunked inference response #118659

Merged
merged 7 commits into from
Dec 16, 2024

davidkyle
Member

@davidkyle davidkyle commented Dec 13, 2024

Creates a new interface, ChunkedInference, for carrying chunked inference results to Semantic Text. The response object contains the chunked text and the text offsets in the original source string. In future, the text will be removed and only the offsets returned.

It was noted that the InferenceService::chunkedInfer method is always called on the local node and the result classes are never serialised across nodes; hence these classes do not need to implement ToXContent or NamedWriteable.

There is a lot of churn in this PR, but almost all of it is due to changing the signature of InferenceService::chunkedInfer.
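
For readers unfamiliar with the idea, here is a minimal sketch of a chunk that carries both its text and its offsets into the original source string. It is not the code from this PR: the class `ChunkOffsetsSketch`, the records `TextOffset` and `Chunk`, the methods `chunk` and `textAt`, and the fixed-size splitting are all hypothetical names and assumptions chosen for illustration. It also assumes half-open character offsets, which the description does not specify.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch: names and behaviour are illustrative only,
// not the types or chunking strategy introduced by the PR.
final class ChunkOffsetsSketch {

    /** Assumed convention: half-open character range [start, end) into the source string. */
    record TextOffset(int start, int end) {
        TextOffset {
            if (start < 0 || end < start) {
                throw new IllegalArgumentException("invalid offsets [" + start + ", " + end + ")");
            }
        }
    }

    /** One chunk: the matched text plus its position in the source. */
    record Chunk(String matchedText, TextOffset offset) {}

    /** Splits the input into fixed-size chunks (a stand-in for a real chunking strategy). */
    static List<Chunk> chunk(String input, int chunkSize) {
        if (chunkSize <= 0) {
            throw new IllegalArgumentException("chunkSize must be positive");
        }
        List<Chunk> chunks = new ArrayList<>();
        for (int start = 0; start < input.length(); start += chunkSize) {
            int end = Math.min(start + chunkSize, input.length());
            chunks.add(new Chunk(input.substring(start, end), new TextOffset(start, end)));
        }
        return chunks;
    }

    /** Recovers a chunk's text from the source and the offsets alone. */
    static String textAt(String source, TextOffset offset) {
        return source.substring(offset.start(), offset.end());
    }

    public static void main(String[] args) {
        String source = "the quick brown fox";
        for (Chunk c : chunk(source, 8)) {
            System.out.println(c.offset() + " -> " + textAt(source, c.offset()));
        }
    }
}
```

Because `textAt` can rebuild each chunk from the source string and the offsets, a consumer that still holds the source does not need the chunk text itself, which is consistent with the stated plan to return only offsets in future.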

@davidkyle davidkyle added >refactoring :ml Machine learning auto-backport Automatically create backport pull requests when merged v9.0.0 v8.18.0 labels Dec 13, 2024
@davidkyle davidkyle marked this pull request as ready for review December 13, 2024 13:02
@davidkyle davidkyle requested a review from Mikep86 December 13, 2024 13:02
@elasticsearchmachine
Collaborator

Pinging @elastic/ml-core (Team:ML)

@elasticsearchmachine elasticsearchmachine added the Team:ML Meta label for the ML team label Dec 13, 2024
Contributor

@Mikep86 Mikep86 left a comment

LGTM pending green CI!

@davidkyle davidkyle enabled auto-merge (squash) December 16, 2024 09:37
@davidkyle davidkyle merged commit c4e964e into elastic:main Dec 16, 2024
16 checks passed
@elasticsearchmachine
Collaborator

💔 Backport failed

Branch: 8.x
Result: Commit could not be cherrypicked due to conflicts

You can use sqren/backport to manually backport by running backport --upstream elastic/elasticsearch --pr 118659

davidkyle added a commit to davidkyle/elasticsearch that referenced this pull request Dec 16, 2024
…astic#118659)

# Conflicts:
#	x-pack/plugin/inference/src/main/java/org/elasticsearch/xpack/inference/services/elastic/ElasticInferenceService.java
#	x-pack/plugin/inference/src/main/java/org/elasticsearch/xpack/inference/services/huggingface/elser/HuggingFaceElserService.java
#	x-pack/plugin/inference/src/test/java/org/elasticsearch/xpack/inference/services/googleaistudio/GoogleAiStudioServiceTests.java
elasticsearchmachine pushed a commit that referenced this pull request Dec 16, 2024
…18659) (#118775)

# Conflicts:
#	x-pack/plugin/inference/src/main/java/org/elasticsearch/xpack/inference/services/elastic/ElasticInferenceService.java
#	x-pack/plugin/inference/src/main/java/org/elasticsearch/xpack/inference/services/huggingface/elser/HuggingFaceElserService.java
#	x-pack/plugin/inference/src/test/java/org/elasticsearch/xpack/inference/services/googleaistudio/GoogleAiStudioServiceTests.java
Labels
auto-backport Automatically create backport pull requests when merged backport pending :ml Machine learning >refactoring Team:ML Meta label for the ML team v8.18.0 v9.0.0