Chrome's New Embedding Model: 57% Smaller, Maintains Performance in Semantic Search Tasks

Chrome’s New Embedding Model: Smaller, Faster, Same Quality

TL;DR

Chrome’s latest update incorporates a new text embedding model that is 57% smaller (35.14 MB vs 81.91 MB) than its predecessor while maintaining virtually identical performance in semantic search tasks. The size reduction was achieved primarily through quantization of the embedding matrix from float32 to int8 precision, with no measurable degradation in embedding quality or search ranking.

Discovery and Extraction

During routine analysis of Chrome’s binary components, I discovered a new vers...
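To make the float32-to-int8 quantization mentioned in the TL;DR concrete, here is a minimal sketch of one common scheme: symmetric per-row quantization with a single float32 scale stored per row. The exact scheme Chrome's model uses is not stated here, so treat the scheme, the toy table sizes (`VOCAB`, `DIM`), and the helper names (`quantize_row`, `dequantize_row`) as assumptions for illustration only.

```python
import math
import random
from array import array


def quantize_row(row):
    """Symmetric per-row int8 quantization (assumed scheme): returns (int8 values, scale)."""
    max_abs = max(abs(x) for x in row)
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    # Round to the nearest step and clamp to the int8 range used here, [-127, 127].
    q = array("b", (max(-127, min(127, round(x / scale))) for x in row))
    return q, scale


def dequantize_row(q, scale):
    """Recover approximate float values from the int8 codes."""
    return [v * scale for v in q]


def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


random.seed(0)
VOCAB, DIM = 1000, 64  # toy sizes, not the real model's dimensions
table = [[random.gauss(0.0, 1.0) for _ in range(DIM)] for _ in range(VOCAB)]

quantized = [quantize_row(row) for row in table]

# Storage: 4 bytes per float32 element vs 1 byte per int8 element,
# plus one float32 scale per row for the quantized table.
float_bytes = VOCAB * DIM * array("f").itemsize
int8_bytes = VOCAB * DIM * array("b").itemsize + VOCAB * array("f").itemsize

# How far does each row drift in direction after a quantize/dequantize round trip?
worst_cos = min(
    cosine(row, dequantize_row(q, s)) for row, (q, s) in zip(table, quantized)
)

print(f"float32 table: {float_bytes} bytes, int8 table: {int8_bytes} bytes")
print(f"worst-case cosine similarity after round trip: {worst_cos:.5f}")
```

Under this scheme each quantized element takes a quarter of the bytes, so the quantized table itself shrinks by close to 75%. A 57% reduction for the whole file would be consistent with only part of the model (the embedding matrix) being stored in int8, as the TL;DR describes.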