Embedding
Introduction
This guide demonstrates how to use the SAP AI SDK for Java to perform embedding tasks using OpenAI models deployed on SAP AI Core.
All classes under any of the ...model
packages are generated from an OpenAPI specification.
This means that these model classes are not guaranteed to be stable and may change with future releases.
They are safe to use, but may require updates even in minor releases.
New User Interface (v1.4.0)
We're excited to introduce a new user interface for OpenAI embedding calls starting with version 1.4.0. This update is designed to improve the SAP AI SDK by:
- Decoupling Layers: Separating the convenience layer from the model classes to deliver a more stable and maintainable experience.
- Staying Current: Making it easier for the SAP AI SDK to adapt to the latest changes in the OpenAI API specification.
- Consistent Design: Aligning with the Orchestrator convenience API for a smoother transition and easier adoption.
Please Note:
- The new interface is gradually being rolled out across the SAP AI SDK.
- We welcome your feedback to help us refine this interface.
- The existing interface (v1.0.0) is deprecated but available.
Prerequisites
Before using the AI Core module, ensure that you have met all the general requirements outlined in the overview. Additionally, include the necessary Maven dependency in your project.
Maven Dependencies
Add the following dependency to your pom.xml
file:
<dependencies>
<dependency>
<groupId>com.sap.ai.sdk.foundationmodels</groupId>
<artifactId>openai</artifactId>
<version>${ai-sdk.version}</version>
</dependency>
</dependencies>
See an example pom in our Spring Boot application
Usage
In addition to the prerequisites above, we assume you have already set up the following to carry out the examples in this guide:
-
A Deployed OpenAI Model in SAP AI Core
-
Refer to How to deploy a model to AI Core for setup instructions.
-
In case the model is deployed in a custom resource group, refer to this section.
-
Example deployed model from the AI Core
/deployments
endpoint{
"id": "d123456abcdefg",
"deploymentUrl": "https://api.ai.region.aws.ml.hana.ondemand.com/v2/inference/deployments/d123456abcdefg",
"configurationId": "12345-123-123-123-123456abcdefg",
"configurationName": "gpt-4o-mini",
"scenarioId": "foundation-models",
"status": "RUNNING",
"statusMessage": null,
"targetStatus": "RUNNING",
"lastOperation": "CREATE",
"latestRunningConfigurationId": "12345-123-123-123-123456abcdefg",
"ttl": null,
"details": {
"scaling": {
"backendDetails": {}
},
"resources": {
"backendDetails": {
"model": {
"name": "gpt-4o-mini",
"version": "latest"
}
}
}
},
"createdAt": "2024-07-03T12:44:22Z",
"modifiedAt": "2024-07-16T12:44:19Z",
"submissionTime": "2024-07-03T12:44:51Z",
"startTime": "2024-07-03T12:45:56Z",
"completionTime": null
}
-
Embedding
Since v1.4.0
Get the embeddings of a text input in list of float values:
var request = new OpenAiEmbeddingRequest(List.of("Hello World"));
OpenAiEmbeddingResponse response = OpenAiClient.forModel(TEXT_EMBEDDING_3_SMALL).embedding(request);
float[] embedding = embedding.getEmbeddings().get(0);
Since v1.0.0
var request = new OpenAiEmbeddingParameters().setInput("Hello World");
OpenAiEmbeddingOutput embedding = OpenAiClient.forModel(TEXT_EMBEDDING_3_SMALL).embedding(request);
float[] embedding = embedding.getData().get(0).getEmbedding();