DIVE: Embedding Compression via Self-Limiting Gradient Updates

May 22, 2026 · 4:00 AM UTC ·2 min read · 0 reactions · 0 comments · 24 views

#machine learning #artificial intelligence #information retrieval

TL;DR · WeSearch summary

The paper titled 'DIVE: Embedding Compression via Self-Limiting Gradient Updates' introduces a new method for compressing high-dimensional embeddings from large language models. The proposed DIVE method utilizes a self-limiting hinge-based triplet loss and a head-wise NT-Xent contrastive loss to improve retrieval performance, especially in scenarios with limited labeled data. Results show that DIVE outperforms existing compression methods across multiple datasets and compression ratios.

Key facts

▪DIVE addresses overfitting issues in embedding compression methods when labeled data is scarce.
▪The method employs a self-limiting hinge-based triplet loss to control gradient updates.
▪DIVE outperforms three baseline adapters on six BEIR datasets at various compression ratios.

Original article

arXiv cs.AI

Read full at arXiv cs.AI →

Opening excerpt (first ~120 words) tap to expand

Computer Science > Computation and Language arXiv:2605.20689 (cs) [Submitted on 20 May 2026] Title:DIVE: Embedding Compression via Self-Limiting Gradient Updates Authors:Dongfang Zhao View a PDF of the paper titled DIVE: Embedding Compression via Self-Limiting Gradient Updates, by Dongfang Zhao View PDF HTML (experimental) Abstract:High-dimensional embeddings from large language models impose significant storage and computational costs on vector search systems. Recent embedding compression methods, including Matryoshka-Adaptor (EMNLP 2024), Search-Adaptor (ACL 2024), and SMEC (EMNLP 2025), enable dimensionality reduction through lightweight residual adapters, but their training objectives cause severe overfitting when labeled data is scarce, degrading retrieval performance below the…

Excerpt limited to ~120 words for fair-use compliance. The full article is at arXiv cs.AI.

Anonymous · no account needed

Discussion

0 comments

DIVE: Embedding Compression via Self-Limiting Gradient Updates

Discussion

More from arXiv cs.AI