Add TurboQuant vector quantization algorithm by sroussey · Pull Request #354 · workglow-dev/workglow

sroussey · 2026-04-01T23:12:37Z

Fix TurboQuantizeOptions - make bits and seed optional fields
Fix createPrng seed=0 handling by XOR-mixing seed with golden-ratio constant
Fix rotation correctness: store and use paddedDimensions throughout (avoids dropping coordinates for non-power-of-2 dims)
Fix misleading docstring in getQuantizationParams (accurately describes uniform quantizer now)
Add buffer length validation to unpackCodes()
Fix TURBO branch in VectorQuantizeTask to report targetType: FLOAT32 (matches actual Float32Array output)
Change turboBits/turboSeed schema fields from type: "number" to type: "integer"
Add turbo-method test cases to VectorQuantizeTask.test.ts (type/metadata, determinism, array-of-vectors)
Update TurboQuantize.test.ts storage/compression tests for padded-dimension calculations

Implement near-optimal vector quantization based on "TurboQuant: Online Vector Quantization with Near-optimal Distortion Rate" (Zandieh et al., 2025). The algorithm uses randomized Walsh-Hadamard rotation + optimal per-coordinate scalar quantization to achieve ~2.7x of theoretical distortion limits. Data-oblivious and per-vector, making it ideal for streaming RAG pipelines. - Add turboQuantize/turboDequantize in @workglow/util/schema - Add turboQuantizedInnerProduct/turboQuantizedCosineSimilarity for direct similarity on quantized vectors - Extend VectorQuantizeTask with "turbo" method option and turboBits/ turboSeed parameters - Add 29 tests covering roundtrip quality, compression, and similarity https://claude.ai/code/session_01YD75mdbcw6ygET7hdjQdWD

…tion or class' Co-authored-by: Copilot Autofix powered by AI <223894421+github-code-quality[bot]@users.noreply.github.com>

Copilot

Pull request overview

Adds a new TurboQuant vector quantization implementation to @workglow/util, exports it via the util schema entrypoint, and wires an optional “turbo” path into VectorQuantizeTask, along with a dedicated TurboQuant test suite.

Changes:

Added TurboQuantize.ts implementing TurboQuant quantize/dequantize + similarity helpers and storage sizing utilities.
Exported TurboQuant APIs from packages/util/src/schema-entry.ts.
Added VectorQuantizeTask input options for selecting linear vs turbo behavior.
Added TurboQuantize unit tests.

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 8 comments.

File	Description
packages/util/src/vector/TurboQuantize.ts	Implements TurboQuant quantize/dequantize + quantized similarity and storage helpers.
packages/util/src/schema-entry.ts	Re-exports TurboQuant APIs for public consumption via `@workglow/util/schema`.
packages/test/src/test/util/TurboQuantize.test.ts	Adds unit coverage for TurboQuant roundtrip, similarity estimates, determinism, and utilities.
packages/ai/src/task/VectorQuantizeTask.ts	Adds method selection and TurboQuant configuration to the vector quantization task.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-04-01T23:19:30Z

+  readonly bits: number;
+  /** Seed for deterministic random rotation. If omitted, uses a fixed default seed. */
+  readonly seed: number | undefined;


TurboQuantizeOptions makes seed (and bits) required properties even though the implementation treats them as optional via defaults. This prevents callers from passing { bits: 4 } or {}. Make these fields optional (e.g., bits?: number; seed?: number) or provide a separate TurboQuantizeOptionsInput type that reflects the defaulting behavior.

Suggested change

readonly bits: number;

/** Seed for deterministic random rotation. If omitted, uses a fixed default seed. */

readonly seed: number | undefined;

readonly bits?: number;

/** Seed for deterministic random rotation. If omitted, uses a fixed default seed. */

readonly seed?: number;

Copilot · 2026-04-01T23:19:31Z

+function createPrng(seed: number): () => number {
+  let state = seed | 0 || 1;
+  return () => {
+    state ^= state << 13;
+    state ^= state >> 17;
+    state ^= state << 5;
+    // Convert to [0, 1) range
+    return (state >>> 0) / 4294967296;


createPrng coerces seed = 0 to state 1 (seed | 0 || 1), so a caller-provided seed of 0 will not be honored. Either document this explicitly or map 0 to a non-zero constant in a way that preserves the input seed’s determinism contract (e.g., hash/mix the seed instead of treating 0 specially).

Copilot · 2026-04-01T23:19:31Z

+function randomRotate(values: Float64Array, seed: number): Float64Array {
+  const d = values.length;
+  // Pad to next power of 2 for Hadamard transform
+  const paddedLen = nextPowerOf2(d);
+  const result = new Float64Array(paddedLen);
+  result.set(values);
+
+  const prng = createPrng(seed);
+
+  // Apply 3 rounds for good mixing (standard practice for randomized Hadamard)
+  for (let round = 0; round < 3; round++) {
+    // Random sign flips (diagonal Rademacher matrix)
+    for (let i = 0; i < paddedLen; i++) {
+      if (prng() < 0.5) {
+        result[i] = -result[i];
+      }
+    }
+
+    // Fast Walsh-Hadamard transform (in-place, normalized)
+    fastWalshHadamard(result);
+  }
+
+  // Return only the first d dimensions (drop padding)
+  return result.subarray(0, d);
+}


The padding/truncation in randomRotate breaks orthogonality/invertibility for non-power-of-two dimensions: you rotate in paddedLen space but then drop the padded coordinates (subarray(0, d)). This loses information and means inverseRandomRotate() cannot correctly undo the rotation (and inner products won’t be preserved) for common dimensions like 768. Consider keeping/quantizing all paddedLen coordinates (and storing paddedLen in the result) or using an orthogonal transform that supports arbitrary lengths; alternatively, explicitly require power-of-two dimensions and throw otherwise.

Copilot · 2026-04-01T23:19:31Z

+function unpackCodes(packed: Uint8Array, bits: number, count: number): number[] {
+  const codes: number[] = new Array(count);
+
+  let bitPos = 0;
+  for (let i = 0; i < count; i++) {
+    let code = 0;
+    let remaining = bits;
+    let shift = 0;
+    while (remaining > 0) {
+      const byteIdx = bitPos >> 3;
+      const bitOffset = bitPos & 7;
+      const bitsToRead = Math.min(remaining, 8 - bitOffset);
+      const mask = (1 << bitsToRead) - 1;
+      code |= ((packed[byteIdx] >> bitOffset) & mask) << shift;
+      shift += bitsToRead;
+      bitPos += bitsToRead;
+      remaining -= bitsToRead;
+    }
+    codes[i] = code;


unpackCodes() does not validate that packed.length is large enough for count * bits bits. If a truncated/invalid buffer is passed, typed-array out-of-bounds reads yield undefined which is coerced to 0, silently producing wrong codes (and potentially masking data corruption). Add an explicit length check (expected bytes = ceil(count * bits / 8)) and throw on mismatch.

Copilot · 2026-04-01T23:19:31Z

+    let quantized: TypedArray[];
+
+    if (method === QuantizationMethod.TURBO) {
+      quantized = vectors.map((v) => {
+        const result = turboQuantize(v, { bits: turboBits, seed: turboSeed });
+        return turboDequantize(result);
+      });
+    } else {
+      quantized = vectors.map((v) => this.vectorQuantize(v, targetType, normalize));
+    }


In the TURBO branch, the task returns turboDequantize(...) (a Float32Array) but still reports targetType as the requested type, and does not actually quantize to targetType. This is an observable mismatch (e.g., targetType: INT8 can return a Float32Array) and defeats the task’s “reduce storage” purpose. Either (1) change the output schema to return TurboQuant’s packed codes + metadata, (2) set targetType to FLOAT32 for the turbo path, and/or (3) post-process the dequantized vector through vectorQuantize(..., targetType, ...) if you intend turbo to be a preconditioning step.

Copilot · 2026-04-01T23:19:32Z

  override async executeReactive(input: VectorQuantizeTaskInput): Promise<VectorQuantizeTaskOutput> {
-    const { vector, targetType, normalize = true } = input;
+    const {
+      vector,
+      targetType,
+      normalize = true,
+      method = QuantizationMethod.LINEAR,
+      turboBits = 4,
+      turboSeed = 42,
+    } = input;
    const isArray = Array.isArray(vector);
    const vectors = isArray ? vector : [vector];
    const originalType = this.getVectorType(vectors[0]);

-    const quantized = vectors.map((v) => this.vectorQuantize(v, targetType, normalize));
+    let quantized: TypedArray[];
+
+    if (method === QuantizationMethod.TURBO) {
+      quantized = vectors.map((v) => {
+        const result = turboQuantize(v, { bits: turboBits, seed: turboSeed });
+        return turboDequantize(result);
+      });
+    } else {
+      quantized = vectors.map((v) => this.vectorQuantize(v, targetType, normalize));
+    }


TurboQuant support in VectorQuantizeTask isn’t covered by the existing VectorQuantizeTask tests (they only exercise the linear path). Add at least one test case that sets method: 'turbo' and asserts the returned type/metadata behavior you intend (and that it is deterministic for a fixed seed).

Copilot · 2026-04-01T23:19:32Z

+/**
+ * Computes optimal quantization boundaries and reconstruction points for
+ * coordinates of a rotated unit vector.
+ *
+ * After random rotation, each coordinate of a d-dimensional unit vector follows
+ * approximately N(0, 1/d). For practical purposes with moderate dimensions (>50),
+ * we use uniform quantization over the range [-c/sqrt(d), c/sqrt(d)] where c
+ * controls the coverage (we use c ≈ 3 for 99.7% coverage).
+ */
+function getQuantizationParams(
+  bits: number,
+  dimensions: number
+): { readonly levels: number; readonly scale: number } {
+  const levels = 1 << bits; // 2^bits quantization levels
+  // After rotation, coordinates are approximately N(0, 1/d).
+  // Standard deviation is 1/sqrt(d). Cover ±3 standard deviations.
+  const coverage = 3.0;
+  const scale = coverage / Math.sqrt(dimensions);
+  return { levels, scale };


The docstrings here describe “optimal quantization boundaries / Beta distribution” behavior, but getQuantizationParams() actually implements a fixed-range uniform quantizer (±3/√d) and doesn’t compute any boundaries or reconstruction points beyond a linear scale. Please align the documentation with the implemented quantizer, or implement the described non-uniform/optimal scalar quantization so callers aren’t misled about distortion guarantees.

Copilot · 2026-04-01T23:19:32Z

+    turboBits: {
+      type: "number",
+      title: "TurboQuant Bits",
+      description:
+        "Bits per dimension for TurboQuant method (1-8). Lower = more compression. 4 bits gives ~8x compression with near-lossless quality.",
+      default: 4,
+      minimum: 1,
+      maximum: 8,
+    },
+    turboSeed: {
+      type: "number",
+      title: "TurboQuant Seed",
+      description:
+        "Seed for the random rotation in TurboQuant. All vectors in the same collection must use the same seed for similarity search to work.",
+      default: 42,
+    },


turboBits and turboSeed are modeled as type: "number" in the input schema, but the underlying implementation treats bits as an integer (and bitwise PRNG logic will effectively truncate non-integer seeds). Consider using type: "integer" (or multipleOf: 1) for these fields to match runtime expectations and fail validation earlier.

sroussey · 2026-04-01T23:58:05Z

@copilot apply changes based on the comments in this thread

…uantizeTask Agent-Logs-Url: https://github.com/workglow-dev/workglow/sessions/6f050f81-5568-4d3e-90f0-422553b01ee8 Co-authored-by: sroussey <127349+sroussey@users.noreply.github.com>

Copilot · 2026-04-02T00:08:00Z

@copilot apply changes based on the comments in this thread

Applied all 8 review changes in commit 051f0b7:

TurboQuantizeOptions — bits and seed are now optional (?), so callers can pass {} or { bits: 4 }.
createPrng seed=0 — seed is now XOR-mixed with 0x9e3779b9 before use, giving every distinct integer seed (including 0) its own distinct non-zero PRNG state.
Rotation orthogonality — randomRotate now returns all paddedLen coordinates instead of truncating to d; TurboQuantizeResult gains paddedDimensions; turboDequantize and turboQuantizedInnerProduct use paddedDimensions throughout, preserving full invertibility for non-power-of-2 dimensions like 768.
getQuantizationParams docstring — rewritten to accurately describe the fixed-range uniform quantizer (no more "Beta distribution / optimal boundaries" language).
unpackCodes validation — throws with a descriptive message when the buffer is smaller than ceil(count * bits / 8).
turboBits/turboSeed schema — changed from type: "number" to type: "integer".
TURBO branch return type — now explicitly returns targetType: TensorType.FLOAT32 to match the actual Float32Array output.
New VectorQuantizeTask turbo tests — three cases: correct type/metadata, determinism with fixed seed, and array-of-vectors. Storage/compression ratio tests in TurboQuantize.test.ts updated for the padded-dimension calculations.

…tible output TurboQuant's rotation + optimal scalar quantization now outputs directly into byte-aligned TypedArrays (Int8Array, Uint8Array, Int16Array, Uint16Array) with the same .length as the input vector. This means the output works transparently with all existing storage backends and cosineSimilarity search — no dimensional mismatch. - Add turboQuantizeToTypedArray() that rotates then quantizes into the target integer type at its native bit width - Update VectorQuantizeTask turbo branch to call turboQuantizeToTypedArray directly instead of quantize+dequantize roundtrip - Remove turboBits parameter (bit width determined by targetType) - Add 14 tests for the new function covering type output, similarity preservation, determinism, range bounds, and edge cases https://claude.ai/code/session_01YD75mdbcw6ygET7hdjQdWD

github-code-quality bot found potential problems Apr 1, 2026

View reviewed changes

Comment thread packages/util/src/vector/TurboQuantize.ts Fixed

Potential fix for pull request finding 'Unused variable, import, func…

9b209b1

…tion or class' Co-authored-by: Copilot Autofix powered by AI <223894421+github-code-quality[bot]@users.noreply.github.com>

sroussey requested a review from Copilot April 1, 2026 23:15

Copilot started reviewing on behalf of sroussey April 1, 2026 23:15 View session

Copilot AI reviewed Apr 1, 2026

View reviewed changes

Copilot started work on behalf of sroussey April 1, 2026 23:58 View session

fix(util,ai): address PR review comments on TurboQuantize and VectorQ…

051f0b7

…uantizeTask Agent-Logs-Url: https://github.com/workglow-dev/workglow/sessions/6f050f81-5568-4d3e-90f0-422553b01ee8 Co-authored-by: sroussey <127349+sroussey@users.noreply.github.com>

Copilot finished work on behalf of sroussey April 2, 2026 00:08

sroussey force-pushed the main branch from abc663b to eca722a Compare April 9, 2026 01:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add TurboQuant vector quantization algorithm#354

Add TurboQuant vector quantization algorithm#354
sroussey wants to merge 4 commits intomainfrom
claude/integrate-arxiv-paper-VF55c

sroussey commented Apr 1, 2026 •

edited by Copilot AI

Loading

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Apr 1, 2026

Uh oh!

Copilot AI Apr 1, 2026

Uh oh!

Copilot AI Apr 1, 2026

Uh oh!

Copilot AI Apr 1, 2026

Uh oh!

Copilot AI Apr 1, 2026

Uh oh!

Copilot AI Apr 1, 2026

Uh oh!

Copilot AI Apr 1, 2026

Uh oh!

Copilot AI Apr 1, 2026

Uh oh!

sroussey commented Apr 1, 2026

Uh oh!

Copilot AI commented Apr 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

sroussey commented Apr 1, 2026 • edited by Copilot AI Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Apr 1, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 1, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 1, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 1, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 1, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 1, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 1, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 1, 2026

Choose a reason for hiding this comment

Uh oh!

sroussey commented Apr 1, 2026

Uh oh!

Copilot AI commented Apr 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

sroussey commented Apr 1, 2026 •

edited by Copilot AI

Loading