Skip to main content

Module types

Module types 

Source
Expand description

Public DTOs and the internal EmbedRequest enum exchanged between the pool and the worker threads.

Structsยง

DualEmbedding
Paired dense + sparse embeddings produced from a single forward pass.
EmbedStats
Per-request diagnostic statistics captured inside the worker and forwarded to the handler layer for inclusion in the completion log event.
ProbeResult ๐Ÿ”’
Result of a single probe session.run() call.
SparseEmbedding
Sparse embedding output from the BGE-M3 sparse-linear projection layer.

Enumsยง

EmbedRequest ๐Ÿ”’

Constantsยง

OS_HEADROOM_BYTES ๐Ÿ”’
OS headroom reserved for kernel, stack, ORT arena, and other non-model allocations. Subtracted from available memory before computing per-worker workspace.