
Conversation

codefromthecrypt
Contributor

Impact

This PR aligns openai with the specification changes around embeddings in #2162

Spec changes from 2162

  • Consistent span name: "CreateEmbeddings"
  • Standardized attribute structure: embedding.embeddings.N.embedding.{text|vector}
  • Unified invocation parameter tracking: embedding.invocation_parameters
  • Proper llm.system attribute for provider identification

Code improvements:

  • Full batch embedding support with indexed attributes
  • Separated invocation parameters from input data
  • Improved handling of token IDs vs text inputs
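The flattened attribute layout the spec describes can be sketched as follows. This is a hedged illustration only: the helper name `flatten_embeddings` and the example values are hypothetical, not part of the instrumentation's public API.

```python
import json

def flatten_embeddings(texts, vectors, params):
    """Build a flat span-attribute dict in the embedding.embeddings.N.* shape."""
    attrs = {
        # provider identification, per the spec change above
        "llm.system": "openai",
        # invocation parameters kept separate from input data
        "embedding.invocation_parameters": json.dumps(params),
    }
    for i, (text, vec) in enumerate(zip(texts, vectors)):
        attrs[f"embedding.embeddings.{i}.embedding.text"] = text
        # vectors stored as tuples rather than JSON strings
        attrs[f"embedding.embeddings.{i}.embedding.vector"] = tuple(vec)
    return attrs

attrs = flatten_embeddings(
    ["hello", "world"],
    [[0.1, 0.2], [0.3, 0.4]],
    {"model": "text-embedding-3-small"},
)
```

Indexing each batch item under `embedding.embeddings.N` is what makes full batch support possible with flat span attributes.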

@codefromthecrypt codefromthecrypt requested a review from a team as a code owner September 17, 2025 20:26
@dosubot dosubot bot added the size:L This PR changes 100-499 lines, ignoring generated files. label Sep 17, 2025

Signed-off-by: Adrian Cole <[email protected]>
Comment on lines +305 to +307
        # Use consistent span names: "CreateEmbeddings" for embeddings, class name for others
        if cast_to is self._openai.types.CreateEmbeddingResponse:
            span_name = "CreateEmbeddings"
Collaborator

Just for my understanding here: what does the consistent name buy us, rather than naming based on the function name?

Contributor Author

Backends typically index the span name field for queries, and people sometimes run aggregates or auto-complete on it. For example, Zipkin auto-completes on span name, but not on arbitrary attributes. Some tools derive metrics from spans and need something to aggregate on; typically that's the span name. So if a spec leaves the span name unconstrained, it ends up unusable for things like this.

Contributor Author

Here's an older example of this, which is subverted when span names are subtly different for the same operation: https://github.com/openzipkin/zipkin-api/blob/master/zipkin2-api.yaml#L44

Collaborator

I see. I mainly ask because I tend to face a fair amount of pressure to cram things like agent operations into names, since operators need to grok the control flow.

With embedding generation, though, I think this makes a lot of sense.

Contributor Author

Agreed. I don't think agent ops are commodity yet, and maybe won't be for a long while. Won't go trying to normalize those ;)

Comment on lines -40 to -44
try:
    _NUMPY: Optional[ModuleType] = import_module("numpy")
except ImportError:
    _NUMPY = None

Collaborator

I honestly don't remember why this was needed.

Contributor Author

Originally it was doing this:

                vector = _NUMPY.frombuffer(base64.b64decode(_vector), dtype="float32").tolist()

but we have test cases proving we can get the vectors without it at the moment. If we need to re-introduce it, we should probably do so with a failing test.
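For reference, that decode can be done with the standard library alone, which is one reason the numpy dependency could go. This is a hedged sketch: `decode_vector` is a hypothetical helper, assuming the vector is little-endian float32 data encoded as base64 (the layout the numpy call above implied).

```python
import base64
import struct

def decode_vector(b64: str) -> list:
    """Decode a base64 string of packed little-endian float32 values."""
    raw = base64.b64decode(b64)
    return list(struct.unpack(f"<{len(raw) // 4}f", raw))

# Round-trip example with values that are exact in float32
encoded = base64.b64encode(struct.pack("<3f", 0.5, -1.0, 2.0)).decode()
```

If the numpy path ever comes back, a failing test around this round trip would be the natural place to start.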

@dosubot dosubot bot added the lgtm This PR has been approved by a maintainer label Sep 18, 2025
# openai.types.completion_create_params.CompletionCreateParamsBase
# See https://github.com/openai/openai-python/blob/f1c7d714914e3321ca2e72839fe2d132a8646e7f/src/openai/types/completion_create_params.py#L11 # noqa: E501
"""
Extract attributes from parameters for the LEGACY completions API.
Contributor Author

Here's a rewording of the comments, which I think better explains the prompt union weirdness.
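The prompt union in question can be sketched like this. Hedged illustration only: `classify_prompt` is a hypothetical helper, not a real openai-python function; it just shows the four shapes the legacy completions `prompt` parameter can take (text vs token IDs, single vs batch).

```python
from typing import Union

# The legacy completions API accepts any of these prompt shapes
Prompt = Union[str, list]

def classify_prompt(prompt: Prompt) -> str:
    """Distinguish text inputs from token-ID inputs, single from batch."""
    if isinstance(prompt, str):
        return "text"
    if prompt and isinstance(prompt[0], str):
        return "text-batch"
    if prompt and isinstance(prompt[0], int):
        return "token-ids"
    return "token-id-batch"
```

Telling token IDs apart from text is what the "improved handling of token IDs vs text inputs" item above is about: the two need different span attributes.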

@codefromthecrypt
Contributor Author

gonna be away this weekend, so lemme know if there's anything else to do here. regardless, I'll chase up the others next week.

@axiomofjoy axiomofjoy merged commit df5b9d5 into Arize-ai:main Sep 22, 2025
16 checks passed
@github-actions github-actions bot mentioned this pull request Sep 19, 2025
@codefromthecrypt
Contributor Author

thanks, put to action here! envoyproxy/ai-gateway#1232

@codefromthecrypt codefromthecrypt deleted the openai-embeddings branch September 25, 2025 05:48
codefromthecrypt added a commit to codefromthecrypt/openinference that referenced this pull request Sep 28, 2025
This PR aligns litellm with the specification changes around embeddings in Arize-ai#2162

**Spec changes from 2162**
- Consistent span name: `"CreateEmbeddings"`
- Standardized attribute structure: `embedding.embeddings.N.embedding.{text|vector}`
- Unified invocation parameter tracking: `embedding.invocation_parameters`
- Proper `llm.system` attribute for provider identification

**Code improvements:**
- Full batch embedding support with indexed attributes
- Separated invocation parameters from input data
- Improved handling of token IDs vs text inputs
- Vectors stored as tuples instead of JSON strings

This is the same as Arize-ai#2210, except litellm.

Signed-off-by: Adrian Cole <[email protected]>