WeSearch

DIVE: Embedding Compression via Self-Limiting Gradient Updates

·2 min read · 0 reactions · 0 comments · 9 views
#machine learning#artificial intelligence#information retrieval
DIVE: Embedding Compression via Self-Limiting Gradient Updates
⚡ TL;DR · AI summary

The paper titled 'DIVE: Embedding Compression via Self-Limiting Gradient Updates' introduces a new method for compressing high-dimensional embeddings from large language models. The proposed DIVE method utilizes a self-limiting hinge-based triplet loss and a head-wise NT-Xent contrastive loss to improve retrieval performance, especially in scenarios with limited labeled data. Results show that DIVE outperforms existing compression methods across multiple datasets and compression ratios.

Key facts
Original article
arXiv cs.AI
Read full at arXiv cs.AI →
Opening excerpt (first ~120 words) tap to expand

Computer Science > Computation and Language arXiv:2605.20689 (cs) [Submitted on 20 May 2026] Title:DIVE: Embedding Compression via Self-Limiting Gradient Updates Authors:Dongfang Zhao View a PDF of the paper titled DIVE: Embedding Compression via Self-Limiting Gradient Updates, by Dongfang Zhao View PDF HTML (experimental) Abstract:High-dimensional embeddings from large language models impose significant storage and computational costs on vector search systems. Recent embedding compression methods, including Matryoshka-Adaptor (EMNLP 2024), Search-Adaptor (ACL 2024), and SMEC (EMNLP 2025), enable dimensionality reduction through lightweight residual adapters, but their training objectives cause severe overfitting when labeled data is scarce, degrading retrieval performance below the…

Excerpt limited to ~120 words for fair-use compliance. The full article is at arXiv cs.AI.

Anonymous · no account needed
Share 𝕏 Facebook Reddit LinkedIn Threads WhatsApp Bluesky Mastodon Email

Discussion

0 comments

More from arXiv cs.AI