Default semantic_text fields to ELSER on EIS when available #134708

mridula-s109 · 2025-09-15T09:43:22Z

🚨 Please check with search inference team on when to merge this - we're working out some rate limiting issues ahead of GA of ELSER on EIS

Summary

Implements dynamic default selection for semantic_text fields to automatically use ELSER on EIS
(.elser-2-elastic) when available, with graceful fallback to ML nodes (.elser-2-elasticsearch).

Problem

Currently, semantic_text fields are hardcoded to default to .elser-2-elasticsearch (ML nodes). This misses
opportunities for better performance and cost efficiency when EIS is available.

Solution

Dynamic Detection: Uses ModelRegistry.containsDefaultConfigId() to detect EIS availability
Smart Fallback: Automatically selects .elser-2-elastic when available, falls back to
.elser-2-elasticsearch
Zero Configuration: Works transparently without requiring user changes
Full Compatibility: On-prem users continue using ML nodes until they enable cloud-connected mode

Changes

Added getPreferredElserInferenceId() method for dynamic selection logic
Updated semantic_text field mapper to use dynamic default instead of hardcoded value
Added comprehensive tests for both EIS available/unavailable scenarios

Testing

✅ All existing tests pass (backward compatibility verified)
✅ New test covers dynamic selection logic
✅ Manual testing confirms proper fallback behavior

Impact

Cloud users: Automatically get better performance with EIS
On-prem users: No changes, continue using ML nodes seamlessly
Existing fields: Completely unaffected, no migration needed

mridula-s109 · 2025-09-15T09:59:47Z

Hey @ioanatia! 👋

Just implemented the dynamic ELSER default selection for semantic_text fields.

The approach is intentionally minimal -leverages existing ModelRegistry.containsDefaultConfigId() for
EIS detection rather than building new infrastructure. New semantic_text fields automatically default
to .elser-2-elastic when available, gracefully fall back to .elser-2-elasticsearch when not.

A few things I'd especially appreciate feedback on:

Does the overall approach make sense?
Is the error handling sufficient (graceful fallback when ModelRegistry fails)?
Any concerns about the scope or missing edge cases?

Still draft mode, but wanted to get your input before finalizing. Thanks!

...nference/src/main/java/org/elasticsearch/xpack/inference/mapper/SemanticTextFieldMapper.java

ioanatia · 2025-09-15T15:12:44Z

relying on modelRegistry.containsDefaultConfigId(EIS_ELSER_INFERENCE_ID) seems to be right.

but I think we need more tests.
Maybe take a look at some of the tests from https://github.com/elastic/elasticsearch/tree/main/x-pack/plugin/inference/src/test/java/org/elasticsearch/xpack/inference/services/elastic and see if we can mock the presence of EIS in tests.
Then we can properly test that we pick the right inference ID - by first creating an index with a semantic_text field (and unspecified inference ID) and then requesting the mapping of this new index. We should see the default EIS endpoint.

ioanatia · 2025-09-15T15:49:26Z

In terms of tests, what we can also do is to add another mock service in https://github.com/elastic/elasticsearch/tree/main/x-pack/plugin/inference/qa/test-service-plugin/src/main/java/org/elasticsearch/xpack/inference/mock

and then use it in yaml tests.

the InferenceService allows to register default endpoints:

elasticsearch/server/src/main/java/org/elasticsearch/inference/InferenceService.java

Line 229 in 66582a3

default List<DefaultConfigId> defaultConfigIds() {

I think we can mock the default endpoints we have in EIS and use the same name in the mock inference service.
Let me know if you need any help!

...rc/main/java/org/elasticsearch/xpack/inference/services/elastic/ElasticInferenceService.java

elasticsearchmachine · 2025-10-07T19:48:12Z

Hi @mridula-s109, I've created a changelog YAML for you.

elasticsearchmachine · 2025-10-07T19:48:12Z

Pinging @elastic/search-relevance (Team:Search - Relevance)

seanhandley · 2025-10-10T10:23:25Z

Thanks @mridula-s109 !

Can we merge this around Oct 22-24? This would mean it goes live on serverless Monday 27, ECH in 9.3, and fits our rollout plan ☺️

cc @maxjakob for visibility

Mikep86

Looking better! I left some comments about how to clean up the tests. IMO the biggest thing left is the default inference ID assertion in InferenceSemanticTextIT. If that is truly returning variable values, it could be an indicator of something we need to address. Or it could be a race condition in a flaky test 😁 . Best to characterize it and address it early.

docs/changelog/134708.yaml

...e-tests/src/javaRestTest/java/org/elasticsearch/xpack/inference/InferenceSemanticTextIT.java

...nference/src/main/java/org/elasticsearch/xpack/inference/mapper/SemanticTextFieldMapper.java

...nce/src/test/java/org/elasticsearch/xpack/inference/mapper/SemanticTextFieldMapperTests.java

seanhandley · 2025-10-13T13:38:36Z

FYI folks we're working on some rate limiting issues with ELSER and we may want to defer this.

Please check with @maxjakob or myself before merging.

mridula-s109 · 2025-10-13T14:57:57Z

FYI folks we're working on some rate limiting issues with ELSER and we may want to defer this.

Please check with @maxjakob or myself before merging.

Thanks for letting me know @seanhandley , will do the same!

mridula-s109 · 2025-10-14T19:49:23Z

FYI folks we're working on some rate limiting issues with ELSER and we may want to defer this.

Please check with @maxjakob or myself before merging.

Thanks @Mikep86! All comments addressed. Have verified about the default assertion in the SemanticTextEISDefaultIT, there is no flakiness or underlying issue at the moment. Since we're holding off on merging due to the ELSER rate limiting work, no rush on further review. Take your time! 😊

...nce/src/test/java/org/elasticsearch/xpack/inference/mapper/SemanticTextFieldMapperTests.java

...e-tests/src/javaRestTest/java/org/elasticsearch/xpack/inference/InferenceSemanticTextIT.java

mridula-s109 · 2025-10-17T11:14:35Z

...-tests/src/javaRestTest/java/org/elasticsearch/xpack/inference/SemanticTextEISDefaultIT.java

+     *
+     * My understanding is that the @Before will be run after the node starts up and wouldn't be sufficient to handle
+     * this scenario. That is why this needs to be @BeforeClass.
+     */


I was initially unsure about keeping this comment due to potential redundancy, but decided to leave it as-is since the added verbosity makes it more explanatory.

mridula-s109 · 2025-10-17T11:31:25Z

@ioanatia @Mikep86 quick update: I’ve addressed the review feedback and pushed the latest changes.
Just a heads-up we’ve scheduled a short sync with @seanhandley and @maxjakob on Wednesday, October 22 (1:00–1:15 PM) to proceed with merging this PR.
Please let me know if there’s anything else you’d like me to adjust before then. 🙌

Mikep86

LGTM to merge once we get the 👍 from @seanhandley and @maxjakob

Mikep86 · 2025-10-17T15:43:34Z

...-tests/src/javaRestTest/java/org/elasticsearch/xpack/inference/SemanticTextEISDefaultIT.java

+    @Before
+    public void setUp() throws Exception {
+        super.setUp();
+        // Ensure the mock EIS server has an authorized response ready before each test
+        mockEISServer.enqueueAuthorizeAllModelsResponse();
+    }


Based on my understanding of the conversation in #134708 (comment) (and the linked references), I think we only need the @BeforeClass annotated method. However, this additional @Before method will be harmless at worst, so nothing to block over. As @ioanatia said earlier, we can sync with the ML team and clean this up later.

liranabn · 2025-10-20T10:42:41Z

@mridula-s109 , do we have a separate PR to update the docs?
specifically here

If you don’t specify an inference endpoint, the inference_id field defaults to .elser-2-elasticsearch, a preconfigured endpoint for the elasticsearch service.

Defaulting EIS on ELSER

328d079

mridula-s109 requested a review from ioanatia September 15, 2025 09:43

mridula-s109 self-assigned this Sep 15, 2025

mridula-s109 added >enhancement v9.2.0 labels Sep 15, 2025

mridula-s109 and others added 2 commits September 15, 2025 10:52

Extended testing

0b89373

[CI] Auto commit changes from spotless

1cc9fd4

ioanatia reviewed Sep 15, 2025

View reviewed changes

...nference/src/main/java/org/elasticsearch/xpack/inference/mapper/SemanticTextFieldMapper.java Outdated Show resolved Hide resolved

ioanatia reviewed Sep 15, 2025

View reviewed changes

...nference/src/main/java/org/elasticsearch/xpack/inference/mapper/SemanticTextFieldMapper.java Outdated Show resolved Hide resolved

mridula-s109 added 4 commits September 17, 2025 16:18

Merge branch 'main' into default_elser_on_eis_semantic

0829058

Edited to include the default

7dcd31f

Cleaned up the mapper implementation

f19972e

COmpile issue

3d58a50

mridula-s109 commented Sep 17, 2025

View reviewed changes

...rc/main/java/org/elasticsearch/xpack/inference/services/elastic/ElasticInferenceService.java Show resolved Hide resolved

elasticsearchmachine and others added 5 commits September 17, 2025 15:47

[CI] Auto commit changes from spotless

394bf99

Merge branch 'main' into default_elser_on_eis_semantic

213aa24

Added tests

c92d2b3

[CI] Auto commit changes from spotless

1301a43

Merge branch 'main' into default_elser_on_eis_semantic

6ecdb6c

elasticsearchmachine added v9.3.0 and removed v9.2.0 labels Oct 2, 2025

mridula-s109 and others added 6 commits October 3, 2025 00:24

Refactored the variable names

9f662ce

Cleanup done

904a242

Removed unnecessary files

6355bed

Unit tests and mock is working

b4ed763

[CI] Auto commit changes from spotless

4f0d7c0

Merge branch 'main' into default_elser_on_eis_semantic

4680417

elasticsearchmachine removed the Team:Search Relevance Meta label for the Search Relevance team in Elasticsearch label Oct 7, 2025

Update docs/changelog/134708.yaml

9564444

mridula-s109 and others added 3 commits October 8, 2025 12:30

Integration test

0a678ed

[CI] Auto commit changes from spotless

d23256b

Merge branch 'main' into default_elser_on_eis_semantic

f0fc256

mridula-s109 requested review from Mikep86 and ioanatia October 8, 2025 13:11

Mikep86 requested changes Oct 10, 2025

View reviewed changes

mridula-s109 and others added 3 commits October 14, 2025 20:36

Resolved all PR comments

0edfb91

Merge branch 'main' into default_elser_on_eis_semantic

f3c0da5

[CI] Auto commit changes from spotless

dd765ae

mridula-s109 requested review from Mikep86 and kderusso October 14, 2025 19:56

Merge branch 'main' into default_elser_on_eis_semantic

5d932ef

Mikep86 reviewed Oct 15, 2025

View reviewed changes

...nce/src/test/java/org/elasticsearch/xpack/inference/mapper/SemanticTextFieldMapperTests.java Outdated Show resolved Hide resolved

...e-tests/src/javaRestTest/java/org/elasticsearch/xpack/inference/InferenceSemanticTextIT.java Outdated Show resolved Hide resolved

mridula-s109 added 2 commits October 17, 2025 12:10

Cleaned up the redudant reference of TestInferencePlugin

536d326

Included both before and before test

cd98727

mridula-s109 commented Oct 17, 2025

View reviewed changes

elasticsearchmachine and others added 2 commits October 17, 2025 11:18

[CI] Auto commit changes from spotless

cf797e4

Merge branch 'main' into default_elser_on_eis_semantic

9e50bdc

Mikep86 approved these changes Oct 17, 2025

View reviewed changes

ioanatia approved these changes Oct 20, 2025

View reviewed changes

Default semantic_text fields to ELSER on EIS when available #134708

Are you sure you want to change the base?

Default semantic_text fields to ELSER on EIS when available #134708

Conversation

mridula-s109 commented Sep 15, 2025 • edited by seanhandley Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Problem

Solution

Changes

Testing

Impact

Uh oh!

mridula-s109 commented Sep 15, 2025

Uh oh!

Uh oh!

Uh oh!

ioanatia commented Sep 15, 2025

Uh oh!

ioanatia commented Sep 15, 2025

Uh oh!

Uh oh!

elasticsearchmachine commented Oct 7, 2025

Uh oh!

elasticsearchmachine commented Oct 7, 2025

Uh oh!

seanhandley commented Oct 10, 2025

Uh oh!

Mikep86 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

seanhandley commented Oct 13, 2025

Uh oh!

mridula-s109 commented Oct 13, 2025

Uh oh!

mridula-s109 commented Oct 14, 2025

Uh oh!

Uh oh!

Uh oh!

mridula-s109 Oct 17, 2025

Choose a reason for hiding this comment

Uh oh!

mridula-s109 commented Oct 17, 2025

Uh oh!

Mikep86 left a comment

Choose a reason for hiding this comment

Uh oh!

Mikep86 Oct 17, 2025

Choose a reason for hiding this comment

Uh oh!

liranabn commented Oct 20, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

mridula-s109 commented Sep 15, 2025 •

edited by seanhandley

Loading