Search: "artifacts" — WeSearch Press

4 stories match your query across our 700+ source catalog. Ranked by relevance and recency.

4 results for "artifacts"

Judging the Judges: A Systematic Evaluation of Bias Mitigation Strategies in LLM-as-a-Judge Pipelines

LLM-as-a-Judge has become the dominant paradigm for evaluating language model outputs, yet LLM judges exhibit systematic biases that compromise evaluation reliability. We present a comprehensive empir…

Tue, 28 Apr 2026 04:13:21 GMT · 4 views

ARXIV.ORG

MetaGAI: A Large-Scale and High-Quality Benchmark for Generative AI Model and Data Card Generation

The rapid proliferation of Generative AI necessitates rigorous documentation standards for transparency and governance. However, manual creation of Model and Data Cards is not scalable, while automate…

Tue, 28 Apr 2026 04:13:21 GMT · 3 views

ARXIV.ORG

Information-Theoretic Measures in AI: A Practical Decision Guide

Information-theoretic (IT) measures are ubiquitous in artificial intelligence: entropy drives decision-tree splits and uncertainty quantification, cross-entropy is the default classification loss, mut…

Tue, 28 Apr 2026 04:13:21 GMT · 3 views

Y COMBINATOR

Terra API (YC W21) Hiring: Applied AI Strategist(Health Intelligence)

What this role actually is This is not “market research.” No 60‑page decks. No generic “digital health is big” observations. This is a continuous loop: market → signal → implication → decision → shi…

Sun, 26 Apr 2026 19:59:59 GMT · 7 views

Or browse by topic

World US Politics Technology AI Markets Business Science Climate Health Culture Media

Results for "artifacts".

Judging the Judges: A Systematic Evaluation of Bias Mitigation Strategies in LLM-as-a-Judge Pipelines

MetaGAI: A Large-Scale and High-Quality Benchmark for Generative AI Model and Data Card Generation

Information-Theoretic Measures in AI: A Practical Decision Guide

Terra API (YC W21) Hiring: Applied AI Strategist(Health Intelligence)

Or browse by topic