Skip to content
All features
AI runtimepackages/ai/local-embedding

local-embedding package

On-device embedding via ONNX / WebGPU. Avoids the round-trip when the embed is short, private, or rate-limited at the provider.

Open docs
Stability
Stable
Scope
Global
Boundary
packages/ai/local-embedding

local-embedding

packages/ai/local-embedding

AI runtime · tool registry · MCP boundary

Ready

Capability graph

ai/local-embedding
local-embedding
Tool policy
Stream UI
Output

Run ledger

  1. 1
    Input
    packages/ai/local-embedding/request
  2. 2
    Plan
    message reducer -> model turn
  3. 3
    Tool policy
    tool schema · auth scope · rate limit
  4. 4
    Stream UI
    events -> UI state machine
  5. 5
    Output
    @nebutra/agents response envelope
p50 latency
93 ms
events/sec
686/s
providers
2
eval score
88
Usagelocal-embedding.ts
typescript
local-embedding.ts
1import { localEmbedding } from "@nebutra/local-embedding";
2
3const result = await localEmbedding.run({
4  tenantId: org.id,
5  // local-embedding is part of the AI runtime — composable with other AI primitives.
6  input: payload,
7});