Search: "diagnostic errors"

2 stories match your query across our 700+ source catalog. Ranked by relevance and recency.

CT-FineBench: A Diagnostic Fidelity Benchmark for Fine-Grained Evaluation of CT Report Generation

The evaluation of generated reports remains a critical challenge in Computed Tomography (CT) report generation, due to the large volume of text, the diversity and complexity of findings, and the prese…

Tue, 28 Apr 2026 04:13:21 GMT · 7 views

ARXIV CS.AI

DO-Bench: An Attributable Benchmark for Diagnosing Object Hallucination in Vision-Language Models

Object-level hallucination remains a central reliability challenge for vision-language models (VLMs), particularly in binary object existence verification. Existing benchmarks emphasize aggregate accu…

Wed, 29 Apr 2026 04:04:25 GMT · 4 views

