perf: Vectorize fuzzy_lookup_embedding with numpy ops by KRRT7 · Pull Request #234 · microsoft/typeagent-py

KRRT7 · 2026-04-10T19:27:47Z

Summary

- Replace the Python-level list comprehension + sort in fuzzy_lookup_embedding with numpy vectorized operations (np.flatnonzero, np.argpartition)
- Rewrite fuzzy_lookup_embedding_in_subset to compute dot products only for the subset indices, instead of scanning all vectors and filtering with a predicate
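To illustrate the subset idea, here is a minimal sketch, not the code from this PR. The function name `subset_scores` and its parameters are hypothetical, and it assumes the vectors are stored as a 2-D numpy array with one embedding per row.

```python
import numpy as np


def subset_scores(vectors: np.ndarray, query: np.ndarray, subset: list[int]) -> np.ndarray:
    """Return dot-product scores for the rows listed in `subset` only.

    Hypothetical helper for illustration; not part of typeagent-py.
    """
    idx = np.asarray(subset, dtype=np.intp)
    # Fancy indexing copies just the selected rows, so the matrix-vector
    # product costs O(len(subset) * dim) instead of O(len(vectors) * dim).
    return vectors[idx] @ query
```

The result is aligned with `subset`: element `j` is the score of row `subset[j]`, so callers can map scores back to the original ordinals without a second lookup.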

Benchmark (Azure Standard_D2s_v5, 384-dim embeddings)

| Benchmark | Before | After | Speedup |
| --- | --- | --- | --- |
| `fuzzy_lookup_embedding` (1K vecs) | 259μs | 64μs | 4.1x |
| `fuzzy_lookup_embedding` (10K vecs) | 6.1ms | 531μs | 11.5x |
| `fuzzy_lookup_embedding_in_subset` (1K of 10K) | 3.2ms | 244μs | 13.0x |

Why this matters

These functions are called on every fuzzy_lookup — the core search path for conversation queries. At 10K vectors (a long conversation), the Python iteration path takes 6ms per query. With multiple queries per request (e.g., searching across properties + timestamps + topics), this adds up.

The numpy path stays in C for the heavy lifting: score filtering via np.flatnonzero, O(n) top-k via np.argpartition, and subset dot products via fancy indexing. ScoredInt objects are only created for the final top-k results.
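The threshold-then-top-k pattern described above can be sketched roughly as follows. This is an illustrative reconstruction under stated assumptions, not the merged implementation: `top_k_above` is a hypothetical name, `scores` is assumed to be a 1-D array of precomputed dot products, and plain `(index, score)` tuples stand in for ScoredInt.

```python
import numpy as np


def top_k_above(scores: np.ndarray, max_hits: int, min_score: float) -> list[tuple[int, float]]:
    """Return up to `max_hits` (index, score) pairs with score >= min_score, best first.

    Hypothetical helper for illustration; tuples stand in for ScoredInt.
    """
    # Indices of all scores that clear the threshold (computed in C).
    candidates = np.flatnonzero(scores >= min_score)
    if candidates.size == 0 or max_hits <= 0:
        return []
    cand_scores = scores[candidates]
    if candidates.size > max_hits:
        # argpartition moves the max_hits largest scores into the last
        # max_hits slots (in arbitrary order) without a full sort.
        keep = np.argpartition(cand_scores, -max_hits)[-max_hits:]
        candidates = candidates[keep]
        cand_scores = cand_scores[keep]
    # Only the survivors are sorted, and only they become Python objects.
    order = np.argsort(-cand_scores)
    return [(int(candidates[i]), float(cand_scores[i])) for i in order]
```

The final sort touches at most `max_hits` elements, so the per-query cost is dominated by the linear threshold and partition passes rather than by a sort over every vector.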

Test plan

- `make format check test` passes (470 passed, 12 pre-existing online test failures)
- Benchmark included: tests/benchmarks/test_benchmark_vectorbase.py

Replace Python-level iteration + sort with numpy operations:

- No-predicate path: np.flatnonzero for score filtering, np.argpartition for O(n) top-k selection; avoids building ScoredInt for every vector
- Predicate path: numpy pre-filters by score, applies predicate only to candidates above threshold
- Subset lookup: numpy fancy indexing computes dot products only for subset indices instead of delegating to full-vector scan with predicate
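For the predicate path, one plausible shape is sketched below. It is an assumption-laden illustration rather than the PR's code: `filtered_hits` is a hypothetical name, tuples stand in for ScoredInt, and the best-first ordering with an early stop is a design choice of this sketch that the commit message does not confirm.

```python
from typing import Callable

import numpy as np


def filtered_hits(
    scores: np.ndarray,
    predicate: Callable[[int], bool],
    max_hits: int,
    min_score: float,
) -> list[tuple[int, float]]:
    """Apply `predicate` only to indices whose score clears `min_score`.

    Hypothetical helper for illustration; tuples stand in for ScoredInt.
    """
    if max_hits <= 0:
        return []
    # numpy does the score filter, so the Python-level predicate never
    # runs for vectors that are below the threshold.
    candidates = np.flatnonzero(scores >= min_score)
    # Visit candidates best first (assumption of this sketch) so the loop
    # can stop as soon as max_hits of them pass the predicate.
    ordered = candidates[np.argsort(-scores[candidates])]
    hits: list[tuple[int, float]] = []
    for i in ordered:
        if predicate(int(i)):
            hits.append((int(i), float(scores[i])))
            if len(hits) == max_hits:
                break
    return hits
```

The predicate is the only part that has to run in Python, so shrinking the candidate set before calling it is where the saving comes from when the threshold is selective.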

KRRT7 added 4 commits April 10, 2026 13:02

Add benchmark for fuzzy_lookup_embedding at realistic scales

9ac74b6

Fix CI: add pytest-benchmark dev dep, use create_test_embedding_model

a64145e

- Add pytest-benchmark to dev dependency group so CI has the benchmark fixture available
- Replace hand-rolled StubEmbeddingModel with create_test_embedding_model() to satisfy IEmbeddingModel protocol (fixes pyright)

Update uv.lock for pytest-benchmark

8392224

gvanrossum approved these changes Apr 11, 2026

gvanrossum merged commit 8c8f67a into microsoft:main Apr 11, 2026
16 checks passed

gvanrossum merged 4 commits into microsoft:main from KRRT7:perf/vectorbase-numpy
