2 results for "preference learning"
ARXIV.ORG
Aligning with Your Own Voice: Self-Corrected Preference Learning for Hallucination Mitigation in LVLMs
Large Vision-Language Models (LVLMs) frequently suffer from hallucinations. Existing preference learning-based approaches largely rely on proprietary models to construct preference datasets. We identi…
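The snippet cuts off mid-sentence, but the technique it names, preference learning over (chosen, rejected) response pairs, is standard. As a hedged illustration only (not this paper's self-corrected method), here is a minimal DPO-style preference loss; the tensor names, shapes, and the `beta` temperature are assumptions:

```python
# Hypothetical sketch of a generic pairwise preference-learning objective
# (DPO-style); NOT the paper's method. Names and shapes are assumptions.
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps: torch.Tensor,
             policy_rejected_logps: torch.Tensor,
             ref_chosen_logps: torch.Tensor,
             ref_rejected_logps: torch.Tensor,
             beta: float = 0.1) -> torch.Tensor:
    """Each input is a batch of summed token log-probabilities for a
    response under the trained policy or a frozen reference model."""
    # Log-ratio of policy vs. reference for each response in the pair.
    chosen_logratio = policy_chosen_logps - ref_chosen_logps
    rejected_logratio = policy_rejected_logps - ref_rejected_logps
    # Push the chosen response's log-ratio above the rejected one's.
    logits = beta * (chosen_logratio - rejected_logratio)
    return -F.logsigmoid(logits).mean()
```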
ARXIV.ORG
Explanation Quality Assessment as Ranking with Listwise Rewards
We reformulate explanation quality assessment as a ranking problem rather than a generation problem. Instead of optimizing models to produce a single "best" explanation token-by-token, we train reward…
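The abstract is truncated, so the paper's exact reward formulation is unknown; as an assumption-laden sketch of what "ranking with listwise rewards" could mean in practice, here is a Plackett-Luce (ListMLE-style) listwise loss: given reward-model scores for k candidate explanations and a gold ordering, it maximizes the likelihood of that ordering rather than scoring candidates one at a time.

```python
# Hypothetical listwise ranking loss (Plackett-Luce / ListMLE style);
# the paper's actual reward model may differ.
import torch

def listmle_loss(scores: torch.Tensor, ranking: torch.Tensor) -> torch.Tensor:
    """Negative log-likelihood of a gold ordering under Plackett-Luce.

    scores:  (batch, k) reward-model scores for k candidate explanations.
    ranking: (batch, k) candidate indices ordered from best to worst.
    """
    # Reorder scores so column 0 holds the best candidate, and so on.
    ordered = torch.gather(scores, 1, ranking)
    # At position i, the log-probability that candidate i outranks all
    # remaining candidates is ordered[:, i] - logsumexp(ordered[:, i:]).
    # A reversed cumulative logsumexp yields all denominators in one pass.
    rev = torch.flip(ordered, dims=[1])
    denom = torch.flip(torch.logcumsumexp(rev, dim=1), dims=[1])
    return (denom - ordered).sum(dim=1).mean()
```

A pairwise loss like the one above compares two candidates at a time; the listwise form instead conditions each candidate's probability on everything ranked below it, which is what distinguishes the "listwise" framing the title names.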