20 results for "ai bias"
SpaceX Lawsuit Forces California Agency to Admit Bias Against Elon Musk and Back Down
California's Coastal Commission has formally apologized to Elon Musk — and not just in a press statement, or off the record. This is a signed, court-enforceable federal settlement.…
The Imbalanced User-AI Relationships as an Ethical Failure of Front-End Design in Healthcare AI
Ethical discourse on AI in healthcare has focused predominantly on back-end concerns such as bias, fairness and explainability, while the front-end interface, where patients and clinicians actually en…
I made a prompt that fixes AI-written content.
I use it on everything now - Try it on your AI content and let me know if it works for you. AI SIGNALS TO FIX: Replace curly quotes (“”) with straight quotes ("") Replace em-dash (—) and en-dash (–) w…
AI prefers resumes written by itself: Self-preferencing in Algorithmic Hiring
As artificial intelligence (AI) tools become widely adopted, large language models (LLMs) are increasingly involved on both sides of decision-making processes, ranging from hiring to content moderatio…
Judging the Judges: A Systematic Evaluation of Bias Mitigation Strategies in LLM-as-a-Judge Pipelines
LLM-as-a-Judge has become the dominant paradigm for evaluating language model outputs, yet LLM judges exhibit systematic biases that compromise evaluation reliability. We present a comprehensive empir…
When AI reviews science: Can we trust the referee?
The volume of scientific submissions continues to climb, outpacing the capacity of qualified human referees and stretching editorial timelines. At the same time, modern large language models (LLMs) of…
FAIR_XAI: Improving Multimodal Foundation Model Fairness via Explainability for Wellbeing Assessment
In recent years, the integration of multimodal machine learning in wellbeing assessment has offered transformative potential for monitoring mental health. However, with the rapid advancement of Vision…
ZenBrain: A Neuroscience-Inspired 7-Layer Memory Architecture for Autonomous AI Systems
Despite a century of empirical memory research, existing AI agent memory systems rely on system-engineering metaphors (virtual-memory paging, flat LLM storage, Zettelkasten notes), none integrating pr…
Grounding Before Generalizing: How AI Differs from Humans in Causal Transfer
Extracting abstract causal structures and applying them to novel situations is a hallmark of human intelligence. While Large Language Models (LLMs) and Vision Language Models (VLMs) have shown strong …
Multi-Dimensional Evaluation of Sustainable City Trips with LLM-as-a-Judge and Human-in-the-Loop
Evaluating nuanced conversational travel recommendations is challenging when human annotations are costly and standard metrics ignore stakeholder-centric goals. We study LLMs-as-Judges for sustainable…
Ferrari’s Failed Move For Max Verstappen’s Engineer Detailed
Ferrari, according to a new report, was interested in signing Red Bull's race engineer, Gianpiero Lambiase.…
Landmark SCOTUS Redistricting Decision
For free and unbiased Medicare help, dial 279-266-0516 to speak with my trusted partner, Chapter, or go to - *Paid partnership__Wedne…
Quantifying Divergence in Inter-LLM Communication Through API Retrieval and Ranking
Large language models (LLMs) increasingly operate as autonomous agents that reason over external APIs to perform complex tasks. However, their reliability and agreement remain poorly characterized. We…
Amazon Connect Talent
Amazon Connect Talent is an agentic AI hiring solution built for talent acquisition leaders managing high-volume hiring. It delivers agentic AI-led interviews, science-backed assessments, and consiste…
Infisical (YC W23) Is Hiring Full Stack Software Engineers (Remote)
Infisical https://infisical.com/ is looking to hire exceptional talent to join our teams in building the open source security infrastructure stack for the AI era. We're building a generational compan…
Analytica: Soft Propositional Reasoning for Robust and Scalable LLM-Driven Analysis
Large language model (LLM) agents are increasingly tasked with complex real-world analysis (e.g., in financial forecasting, scientific discovery), yet their reasoning suffers from stochastic instabili…
Context-Aware Hospitalization Forecasting Evaluations for Decision Support using LLMs
Medical and public health experts must make real-time resource decisions, such as expanding hospital bed capacity, based on projected hospitalization trends during large-scale healthcare disruptions (…
Beyond the Attention Stability Boundary: Agentic Self-Synthesizing Reasoning Protocols
As LLM agents transition to autonomous digital coworkers, maintaining deterministic goal-directedness in non-linear multi-turn conversations emerged as an architectural bottleneck. We identify and for…
A systematic evaluation of vision-language models for observational astronomical reasoning tasks
Vision-language models (VLMs) are increasingly proposed as general-purpose tools for scientific data interpretation, yet their reliability on real astronomical observations across diverse modalities r…
Introducing AutoMuon, a one line drop in for AdamW [P]
Hey everyone, I've been working on a small Python package called AutoMuon that makes the Muon optimizer usable as a drop-in replacement for AdamW in arbitrary PyTorch training pipelines. The core idea…