Chrome’s New Embedding Model: Smaller, Faster, Same Quality
TL;DR
Chrome’s latest update incorporates a new text embedding model that is 57% smaller (35.14MB vs 81.91MB) than its predecessor while maintaining virtually identical performance in semantic search tasks.
The size reduction was achieved primarily through quantization of the embedding matrix from float32 to int8 precision, with no measurable degradation in embedding quality or search ranking.
Discovery and Extraction
During routine analysis of Chrome’s binary components, I discovered a new vers...
Read more at dejan.ai