Heroku AI SDK Provider - v0.4.3

    Class HerokuEmbeddingModel

    Heroku embedding model implementation compatible with AI SDK v5.

    This class provides embedding generation capabilities using Heroku's AI infrastructure, specifically designed to work seamlessly with the Vercel AI SDK's embedding functions.

    Basic usage with AI SDK:

    import { embed, embedMany } from "ai";
    import { heroku } from "heroku-ai-provider";

    const model = heroku.embedding("cohere-embed-multilingual");

    // Single embedding
    const { embedding } = await embed({
      model,
      value: "Hello, world!"
    });

    // Multiple embeddings
    const { embeddings } = await embedMany({
      model,
      values: ["First text", "Second text", "Third text"]
    });

    Direct model usage:

    import { HerokuEmbeddingModel } from "heroku-ai-provider";

    const model = new HerokuEmbeddingModel(
      "cohere-embed-multilingual",
      process.env.EMBEDDING_KEY!,
      "https://us.inference.heroku.com/v1/embeddings"
    );

    const result = await model.doEmbed({
      values: ["Text to embed"]
    });

    console.log(result.embeddings[0]); // [0.1, 0.2, -0.3, ...]

    Implements

    • EmbeddingModelV2<string>
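
    Because the class implements this interface, an instance is assignable wherever the AI SDK expects an embedding model. A minimal sketch, assuming the interface type is exported from @ai-sdk/provider:

    import type { EmbeddingModelV2 } from "@ai-sdk/provider";
    import { HerokuEmbeddingModel } from "heroku-ai-provider";

    // Type-checks because HerokuEmbeddingModel satisfies the interface.
    const model: EmbeddingModelV2<string> = new HerokuEmbeddingModel(
      "cohere-embed-multilingual",
      process.env.EMBEDDING_KEY!,
      "https://us.inference.heroku.com/v1/embeddings"
    );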

    Constructors

    • new HerokuEmbeddingModel(model, apiKey, baseUrl)

      Creates a new HerokuEmbeddingModel instance.

      Parameters

      • model: string

        The Heroku embedding model identifier (e.g., "cohere-embed-multilingual")

      • apiKey: string

        Your Heroku AI API key for embeddings

      • baseUrl: string

        The base URL for the Heroku embeddings API

      Returns HerokuEmbeddingModel

      Throws when parameters are invalid or missing.

      const model = new HerokuEmbeddingModel(
        "cohere-embed-multilingual",
        process.env.EMBEDDING_KEY!,
        "https://us.inference.heroku.com/v1/embeddings"
      );
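
      Since construction throws on invalid parameters, callers may want to guard it; a short sketch (the exact error type thrown is not specified here):

      let model: HerokuEmbeddingModel;
      try {
        model = new HerokuEmbeddingModel(
          "cohere-embed-multilingual",
          process.env.EMBEDDING_KEY!,
          "https://us.inference.heroku.com/v1/embeddings"
        );
      } catch (error) {
        // Reached when a parameter is invalid or missing (e.g. an empty API key).
        console.error("Could not create embedding model:", error);
        throw error;
      }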

    Properties

    specificationVersion: "v2" = ...

    Specifies which version of the embedding model interface this class implements. Versioning allows the interface to evolve while retaining backwards compatibility, and lets consumers handle the different implementation versions as a discriminated union.
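
    As an illustration of that discriminated-union idea, a hypothetical consumer could narrow on this field; the V1 shape below is invented purely for the sketch:

    // Hypothetical older interface shape, for illustration only.
    interface EmbeddingModelV1Like {
      specificationVersion: "v1";
      modelId: string;
    }

    interface EmbeddingModelV2Like {
      specificationVersion: "v2";
      modelId: string;
      maxEmbeddingsPerCall: number;
    }

    type AnyEmbeddingModel = EmbeddingModelV1Like | EmbeddingModelV2Like;

    function perCallLimit(model: AnyEmbeddingModel): number {
      // specificationVersion is the discriminant; TypeScript narrows on it.
      return model.specificationVersion === "v2"
        ? model.maxEmbeddingsPerCall
        : 1; // hypothetical fallback for the older shape
    }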

    provider: "heroku" = ...

    Name of the provider for logging purposes.

    modelId: string

    Provider-specific model ID for logging purposes.

    maxEmbeddingsPerCall: 100

    Limit of how many embeddings can be generated in a single API call.

    Use Infinity for models that do not have a limit.

    supportsParallelCalls: true

    True if the model can handle multiple embedding calls in parallel.
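
    Together with maxEmbeddingsPerCall, this means independent chunks can be dispatched concurrently. A hedged sketch using the heroku factory from the basic usage example:

    import { heroku } from "heroku-ai-provider";

    const model = heroku.embedding("cohere-embed-multilingual");
    const texts = ["first", "second", "third"]; // example inputs

    // Split inputs into chunks no larger than the per-call limit, then
    // issue one request per chunk; Promise.all preserves chunk order.
    const limit = model.maxEmbeddingsPerCall ?? 100;
    const chunks: string[][] = [];
    for (let i = 0; i < texts.length; i += limit) {
      chunks.push(texts.slice(i, i + limit));
    }
    const results = await Promise.all(
      chunks.map((values) => model.doEmbed({ values }))
    );
    const embeddings = results.flatMap((r) => r.embeddings);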

    Methods

    • doEmbed(options)

      Generate embeddings for the provided text values.

      This method implements the AI SDK v5 EmbeddingModelV2 interface, providing seamless integration with the Vercel AI SDK's embedding functions.

      Parameters

      • options: {
            values: string[];
            abortSignal?: AbortSignal;
            providerOptions?: SharedV2ProviderOptions;
            headers?: Record<string, string | undefined>;
        }

        Configuration object containing values to embed and optional settings

        • values: string[]

          Array of text strings to generate embeddings for

        • Optional abortSignal?: AbortSignal

          Optional AbortSignal for request cancellation

        • Optional providerOptions?: SharedV2ProviderOptions

          Optional provider-specific options

        • Optional headers?: Record<string, string | undefined>

          Optional additional HTTP headers

      Returns Promise<{
          embeddings: number[][];
          usage?: { tokens: number };
          providerMetadata?: SharedV2ProviderMetadata;
          response?: { headers?: Record<string, string>; body?: unknown };
      }>

      Promise resolving to embedding results with usage information

      Throws when the API request fails or input validation fails.

      Basic embedding generation:

      const result = await model.doEmbed({
        values: ["Hello, world!", "How are you?"]
      });

      console.log(result.embeddings.length); // 2
      console.log(result.embeddings[0].length); // 1024 (embedding dimension)
      console.log(result.usage?.tokens); // Token count used

      With abort signal for cancellation:

      const controller = new AbortController();

      // Cancel after 5 seconds
      setTimeout(() => controller.abort(), 5000);

      try {
        const result = await model.doEmbed({
          values: ["Long text to embed..."],
          abortSignal: controller.signal
        });
      } catch (error) {
        if (error instanceof Error && error.name === 'AbortError') {
          console.log('Request was cancelled');
        }
      }

      Error handling:

      import { APICallError } from "ai";

      try {
        const result = await model.doEmbed({
          values: [""] // Empty string will cause a validation error
        });
      } catch (error) {
        if (error instanceof APICallError) {
          console.error('API Error:', error.message);
          console.error('Status:', error.statusCode);
        }
      }
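
      Passing extra headers (the header name here is hypothetical):

      const traced = await model.doEmbed({
        values: ["Hello, world!"],
        headers: { "X-Request-Id": "example-trace-id" }
      });
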
    • embedSingle(text)

      Generate an embedding for a single text string.

      This is a convenience method that wraps doEmbed for single-text use cases.

      Parameters

      • text: string

        The text string to generate an embedding for

      Returns Promise<{ embedding: number[] }>

      Promise resolving to the embedding vector

      Throws when the API request fails or input validation fails.

      const result = await model.embedSingle("Hello, world!");
      console.log(result.embedding); // [0.1, 0.2, -0.3, ...]
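
      Conceptually the wrapper delegates to doEmbed with a one-element array; a hedged sketch, not the library's actual implementation:

      import { HerokuEmbeddingModel } from "heroku-ai-provider";

      async function embedSingleSketch(model: HerokuEmbeddingModel, text: string) {
        const { embeddings } = await model.doEmbed({ values: [text] });
        return { embedding: embeddings[0] };
      }
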
    • embedBatch(texts, chunkSize)

      Generate embeddings for multiple texts with automatic chunking.

      This method automatically splits large batches into smaller chunks to respect API limits and processes them sequentially.

      Parameters

      • texts: string[]

        Array of text strings to generate embeddings for

      • chunkSize: number = ...

        Maximum number of texts to process in each API call

      Returns Promise<{ embeddings: number[][] }>

      Promise resolving to all embedding vectors

      Throws when any API request fails or input validation fails.

      const texts = Array.from({ length: 150 }, (_, i) => `Text ${i}`);
      const result = await model.embedBatch(texts, 50); // Process in chunks of 50
      console.log(result.embeddings.length); // 150
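
      The sequential chunking described above can be pictured as follows; this is an illustrative sketch, not the library's actual implementation:

      import { HerokuEmbeddingModel } from "heroku-ai-provider";

      async function embedBatchSketch(
        model: HerokuEmbeddingModel,
        texts: string[],
        chunkSize = 100
      ): Promise<{ embeddings: number[][] }> {
        const embeddings: number[][] = [];
        for (let i = 0; i < texts.length; i += chunkSize) {
          // One API call per chunk, awaited before the next chunk starts.
          const result = await model.doEmbed({
            values: texts.slice(i, i + chunkSize)
          });
          embeddings.push(...result.embeddings);
        }
        return { embeddings };
      }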