WeSearch
Hub / Tags / Reasoning
TAG · #REASONING

Reasoning coverage.

Every story in the WeSearch catalog tagged with #reasoning, chronological, with view counts. Subscribe to the per-tag RSS feed to follow this topic in your reader of choice.

60 stories tagged with #reasoning, in publish-time order across the WeSearch catalog. Tag pages update as new stories ingest.

⌘ RSS feed for this tag →   or   search "Reasoning"

RELATED TAGS
#ai23#ml12#language-models2#sudoku2#scientific-reasoning2#spatial-reasoning2#clinical-reasoning2#healthcare2#deep-learning1#parallel-reasoning1#computer-vision1#social-reasoning1
ARXIV CS.AI

Visual Graph Scaffolds for Structural Reasoning in Large Language Models

Graphs have been used to enhance large language models (LLMs) for structured reasoning, mostly as external knowledge sources are provided to models at test time. In this paper, we …

20 views ·
#artificial intelligence#machine learning#language models
ARXIV CS.AI

ChatHealthAI: Aligning Electronic Health Record Representations with Large Language Models for Grounded Clinical Reasoning

Large language models (LLMs) exhibit strong natural-language reasoning abilities for clinical decision support, but struggle to effectively model structured longitudinal electronic…

16 views ·
#artificial intelligence#healthcare#machine learning
ARXIV CS.AI

Thinking Past the Answer: Evaluating Harmful Overthinking in Large Reasoning Models

Large Reasoning Models (LRMs) improve performance by generating explicit intermediate reasoning traces through increased test-time compute, yet the assumption that longer reasoning…

15 views ·
#artificial intelligence#machine learning
ARXIV CS.AI

Inducing Reasoning Primitives from Agent Traces

ReAct-style LLM agents often rediscover the same reasoning routines across problems, yet leave those routines trapped in transient scratchpads. We introduce Reasoning Primitive Ind…

19 views ·
#artificial intelligence#machine learning#natural language processing
ARXIV CS.AI

CORE: Conflict-Oriented Reasoning for General Multimodal Manipulation Detection

The rapid rise of generative AI has made multimodal fake news increasingly realistic and pervasive, posing severe threats to public trust and social stability. Existing detection m…

20 views ·
#artificial intelligence#fake news#machine learning
ARXIV CS.AI

The Shadow Price of Reasoning: Economic Perspective on Optimal Budget Allocation for LLMs

Inference-time scaling has emerged as a critical avenue for enhancing Large Language Models' performance, yet real-world deployment is constrained by strict computational budgets. …

17 views ·
#artificial intelligence#budget allocation#large language models
ARXIV CS.AI

Perceive Before Reasoning: A Pre-Reasoning Perception Framework for Efficient and Reliable Proactive Mobile Agents

Multimodal large language models (MLLMs) have substantially advanced mobile agents, yet proactive mobile assistance remains challenging because agents must decide \emph{when} to in…

19 views ·
#artificial intelligence#mobile agents#machine learning
ARXIV CS.AI

CP-Agent: Context-Aware Multimodal Reasoning for Cellular Morphological Profiling under Chemical Perturbations

Cell Painting combines multiplexed fluorescent staining, high-content imaging, and quantitative analysis to generate high-dimensional phenotypic readouts to support diverse downstr…

23 views ·
#artificial intelligence#drug discovery#cell biology
ARXIV CS.AI

ThoughtFold: Folding Reasoning Chains via Introspective Preference Learning

Large Reasoning Models (LRMs) have achieved remarkable progress thanks to Reinforcement Learning with Verifiable Rewards (RLVR) on Chain-of-Thoughts (CoTs). However, since long CoT…

12 views ·
#artificial intelligence#machine learning
ARXIV CS.AI

Bridging Auxiliary Constraints to Resolve Instruction Following in Large Reasoning Models

Large Reasoning Models (LRMs) have demonstrated impressive capabilities in many tasks, yet they struggle with reliably following multiple instructions, either by failing to satisfy…

16 views ·
#artificial intelligence#machine learning#natural language processing
ARXIV CS.AI

TSQAgent: Rating Time Series Data Quality via Dedicated Agentic Reasoning

Assessing the quality of time series (TS) data is fundamental yet inherently challenging due to the multifaceted nature of quality dimensions. Recently, large language models (LLMs…

16 views ·
#artificial intelligence#data quality#machine learning
ARXIV CS.AI

From Answers to States: Verifiable Process-Level Evaluation of Chemical Reasoning in Large Language Models

Large language models are increasingly used as chemistry assistants, yet most chemistry benchmarks still score only final answers. This masks a critical failure mode: a model may o…

17 views ·
#artificial intelligence#chemistry#machine learning
ARXIV CS.AI

Code-on-Graph: Iterative Programmatic Reasoning via Large Language Models on Knowledge Graphs

Knowledge Graphs (KGs) are widely used to mitigate the limitations of Large Language Models (LLMs), such as outdated knowledge and hallucinations. Existing LLM-KG integration frame…

22 views ·
#artificial intelligence#knowledge graphs#language models
ARXIV CS.AI

Unveiling the Structure of Do-Calculus Reasoning via Derivation Graphs

The do-calculus defines a general system of inference for interventional queries, allowing causal quantities to be transformed through successive applications of its rules. This pr…

26 views ·
#artificial intelligence#causal inference#do-calculus
ARXIV CS.AI

When to Re-Plan: Subgoal Persistence in Hierarchical Latent Reasoning

Long-horizon reasoning requires a system to commit to medium-horizon intent without becoming rigid: re-plan too often and computation never coheres into multi-step structure; commi…

21 views ·
#artificial intelligence#machine learning
HUGGING FACE - BLOG

Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action

A Blog post by NVIDIA on Hugging Face…

26 views ·
#artificial intelligence#technology#robotics
DEV.TO (TOP)

Claude Does Not Need More Prompts. It Needs Reasoning Discipline.

Large language models are good at sounding structured. That is not the same as being structured. Ask...…

16 views ·
#ai#programming#productivity
R/ARTIFICIAL

The Evil of corporate America and their reasoning skills is that of people who enter a building to find the exit.

19 views ·
R/LOCALLLAMA

Gryphe/Pantheon-Reasoning-27B · Hugging Face

11 views ·
SLATE

My Stepmom Won’t Come to My House to See My Baby. Her Reasoning Is Not Normal.

This cannot be healthy.…

14 views ·
#parenting#family#anxiety
ARXIV.ORG

Cassandra: Enabling Reasoning LLMs at Edge via Self-Speculative Decoding

Speculative decoding has emerged as a promising lossless approach for accelerating Large Language Models (LLMs). As reasoning LLMs increasingly suffer from decode-stage overhead an…

13 views ·
#hardware#machine learning#artificial intelligence
A FEW THOUGHTS ON CRYPTOGRAPHI

Fooling around with encrypted reasoning blobs

This is a quick post I wanted to write about a “hobby project” I spent a weekend on. It has little to do with real cryptography, and mostly doesn’t expose a particularly exciting ……

11 views ·
#technology#artificial intelligence#encryption
CRYPTO BRIEFING

AutoTTS reduces token usage by 69.5% in LLM reasoning strategies

AutoTTS, a framework from Meta, Google, and university researchers, cuts LLM token usage by 69.5% while maintaining accuracy, with implications for AI-driven crypto tools.…

12 views ·
#artificial intelligence#machine learning#research
VENTUREBEAT

Researchers automated LLM reasoning strategy design and cut token usage by 69.5%

20 views ·
R/ARTIFICIAL

I gave my AI agents email instead of better reasoning. They started fixing each other's bugs.

18 views ·
R/IPHONE

What's the reasoning behind not letting us download our entire SMS messages as easy as possible?

16 views ·
ARXIV CS.AI

Reasoning, Code, or Both? How Large Language Models Handle Variations in Math Questions

Large Language Models (LLMs) achieve impressive accuracy on mathematical reasoning benchmarks, yet their performance drops when problems are modified with simple changes like diffe…

22 views ·
#artificial intelligence#machine learning#mathematics
ARXIV CS.AI

Which Changes Matter? Towards Trustworthy Legal AI via Relevance-Sensitive Evaluation and Solver-Grounded Reasoning

Legal reasoning requires distinguishing changes that matter from those that do not. Legal AI should remain stable under legally irrelevant perturbations, but should change when per…

22 views ·
#artificial intelligence#legal#evaluation
ARXIV CS.AI

MedGuideX: Internalizing Decision Logic from Executable Guidelines into Large Language Models for Clinical Reasoning

Clinical practice guidelines (CPGs) encode evidence-based decision logic that clinicians apply by evaluating patient variables, conditional criteria, and recommendation rules. Howe…

23 views ·
#artificial intelligence#healthcare#clinical reasoning
ARXIV CS.AI

Composition Collapse: Stable Factual Knowledge Does Not Imply Compositional Reasoning

Post-training is routinely evaluated through aggregate benchmark scores that treat multi-hop reasoning as a single capability -- as if a model that answers more questions correctly…

17 views ·
#artificial intelligence#machine learning#research
ARXIV CS.AI

Traceable Knowledge Graph Reasoning Enables LLM-Assisted Decision Support for Industrial VOCs in the Steel Industry

Key knowledge for steel-industry volatile organic compounds (VOCs) governance is scattered across unstructured scientific literature, making it difficult to integrate process, poll…

16 views ·
#artificial intelligence#pollution control#steel industry
ARXIV CS.AI

Scaling, Benchmarking, and Reasoning of Vision-Language Agents for Mobile GUI Navigation

Vision-Language Models (VLMs) have shown rapid progress in mobile GUI navigation. This paper presents a systematic study of data scaling, benchmarking, and reasoning for VLM-based …

14 views ·
#artificial intelligence#machine learning#mobile applications
ARXIV CS.AI

RepoMirage: Probing Repository Context Reasoning in Code Agents with Perturbations

Code agents are currently having skillful performance on repository-level software engineering benchmarks, but it remains unclear whether success on end-to-end tasks such as issue …

18 views ·
#software engineering#artificial intelligence#code agents
R/SINGULARITY

5.5/5.4 Reasoning Cheatsheet (YMMV)

24 views ·
R/MACHINELEARNING

Verbosity is not faithfulness: an architectural argument that reasoning models cannot perform faithful inference [D]

18 views ·
TECHRADAR

What Sudoku reveals about the limits of LLMs

LLM failure to solve reasoning puzzles exposes deep architectural limits…

18 views ·
#artificial intelligence#sudoku
GITHUB

Show HN: skills-for-humanity – 171 structured reasoning skills for Claude Code

Structured reasoning methodologies from history's most rigorous thinkers, packaged as Claude Code skills. - human-avatar/skills-for-humanity…

24 views ·
#technology#education
ARXIV CS.AI

How Much Thinking is Enough? Quantifying and Understanding Redundancy in LLM Reasoning

Reasoning-capable large language models solve hard problems by emitting long chains of thought, paying heavily in latency, GPU time, and energy. Casual inspection of their traces r…

18 views ·
#artificial intelligence#machine learning#language models
ARXIV CS.AI

DRIVE: Modeling Skills at the Reasoning and Interaction Levels for Web Agents under Continual Learning

Web agents require both high-level reasoning (for task decomposition) and low-level interactions (for page elements manipulation) to conduct different tasks. However, these knowled…

21 views ·
#artificial intelligence#machine learning#web agents
ARXIV CS.AI

Residual Drift Dominates Contradiction in Multi-Turn Constraint Reasoning

How do multi-turn reasoning systems fail? The expected answer is logical contradiction, in which the system's maintained state becomes unsatisfiable. We show that the dominant mode…

18 views ·
#artificial intelligence#machine learning
ARXIV CS.AI

LGMT: Logic-Grounded Metamorphic Testing for Evaluating the Reasoning Reliability of LLMs

Large Language Models (LLMs) achieve strong performance on logical reasoning benchmarks, yet their reliability remains uncertain. Existing evaluations rely on static benchmarks, wh…

20 views ·
#artificial intelligence#machine learning#software engineering
ARXIV CS.AI

LC-ERD: Mining Latent Logic for Self-Evolving Reasoning via Consistency-Regulated Reward Decomposition

The evolution of Large Language Model (LLM) reasoning is bottlenecked by the scarcity of high-quality process data. While self-alignment via endogenous rewards offers a solution, m…

24 views ·
#artificial intelligence#machine learning#language models
ARXIV CS.AI

HyperGuide: Hyperbolic Guidance for Efficient Multi-Step Reasoning in Large Language Models

Multi-step reasoning remains a central challenge for large language models: single-pass generation is efficient but lacks accuracy; tree-search methods explore multiple paths but a…

15 views ·
#artificial intelligence#machine learning#language models
ARXIV CS.AI

Understanding and Mitigating Premature Confidence for Better LLM Reasoning

Long chains of thought (CoT) from current language models frequently contain logical gaps and unjustified leaps, limiting the gains from additional test-time compute. Improving rea…

15 views ·
#artificial intelligence#machine learning#language models
ARXIV CS.AI

SAM: State-Adaptive Memory for Long-Horizon Reasoning Agent

Long-horizon agentic reasoning requires large language models to act over long interaction histories containing thoughts, tool calls, observations, and partial conclusions. The cha…

15 views ·
#artificial intelligence#machine learning
ARXIV CS.AI

AgentFugue: Agent Scaling for Long-Horizon Tasks through Collective Reasoning

Recent progress on long-horizon agentic tasks has been driven largely by scaling up individual agents through stronger models, better tools, and more effective scaffolding. In cont…

17 views ·
#artificial intelligence#collective reasoning#agent systems
ARXIV CS.AI

Reasoning as an Attack Surface: Adaptive Evolutionary CoT Jailbreaks for LLMs

Large Reasoning Models (LRMs) have demonstrated remarkable capabilities in reasoning and generation tasks and are increasingly deployed in real-world applications. However, their e…

16 views ·
#artificial intelligence#security#machine learning
ARXIV CS.AI

PALoRA: Projection-Adaptive LoRA for Preserving Reasoning in Large Language Models

Efficiently updating Large Language Models (LLMs) with new or evolving factual knowledge remains a central challenge, as even parameter-efficient adaptation can erode previously ac…

14 views ·
#artificial intelligence#machine learning#language models
ARXIV CS.AI

GlobalDentBench: A Multinational Benchmark for Evaluating LLM Clinical Reasoning in Dentistry with Expert Calibration

While large language models (LLMs) hold transformative potential for medicine, their reasoning robustness and safety in real-world clinical scenarios remain critically underexplore…

24 views ·
#artificial intelligence#dentistry#clinical reasoning
ARXIV CS.AI

Measuring Reasoning Quality in LLMs: A Multi-Dimensional Behavioral Framework

LLMs have achieved remarkable success in complex reasoning tasks, yet current evaluation approaches predominantly rely on final-answer correctness, offering limited insight into th…

14 views ·
#artificial intelligence#machine learning#evaluation
ARXIV CS.AI

Geo-Expert: Towards Expert-Level Geological Reasoning via Parameter-Efficient Fine-Tuning

While general-purpose Large Language Models (LLMs) applied to Geology often hallucinate when reasoning about subsurface structures and deep-time evolution, current AI in Earth scie…

14 views ·
#artificial intelligence#geology#machine learning
ARXIV CS.AI

Clustering as Reasoning: A $k$-Means Interpretation of Chain-of-Thought Graph Learning

Chain-of-Thought (CoT) prompting has shown promise in enhancing the reasoning capabilities of large language models (LLMs) on text-attributed graphs (TAGs). This work reframes CoT-…

14 views ·
#artificial intelligence#machine learning#graph theory
ARXIV CS.AI

Boosting Inference with Guided Reasoning: Stochastic Exploration for Recursive Models

Recent work on recursive architectures has shown that tiny neural networks can be surprisingly powerful on structured reasoning tasks. The trick is to model reasoning trajectories …

21 views ·
#artificial intelligence#machine learning#neural networks
ARXIV CS.AI

Context-CoT: Enhancing Context Learning via High-Quality Reasoning Synthesis

While LLMs excel at reasoning over prompts using static pretrained knowledge, they struggle significantly with context learning-the ability to dynamically extract, internalize, and…

15 views ·
#artificial intelligence#machine learning#context learning
ARXIV CS.AI

Credit Assignment with Resets in Language Model Reasoning

Contemporary reinforcement learning with verifiable reward methods post-train language models on multi-step reasoning by assigning a single outcome reward uniformly across all toke…

16 views ·
#artificial intelligence#reinforcement learning#language models
YCOMBINATOR

Show HN: YourMemory, persistent memory layer with temporal reasoning for agents

15 views ·
#technology#artificial intelligence#memory
R/MACHINELEARNING

Call for Papers - Workshop on Efficient Reasoning at COLM 2026 [R]

13 views ·
GITHUB

Show HN: Smriti: Shared Reasoning State for Claude Code and Codex

Contribute to himanshudongre/smriti development by creating an account on GitHub.…

14 views ·
#technology#software#collaboration
YCOMBINATOR

Ask HN: Local model experiences with 'high-reasoning distill' finetunes

14 views ·
#technology#artificial intelligence#machine learning
DEV.TO (TOP)

Hướng Dẫn Thiết Lập Reasoning Proxy DeepSeek V4-Pro với Cursor (2026)

Cắm DeepSeek V4-Pro vào Cursor bằng cấu hình OpenAI-compatible mặc định, bạn có thể gặp lỗi HTTP 400...…

14 views ·
#ai#api#llm