WeSearch
Hub / Tags / Language Processing
TAG · #LANGUAGE-PROCESSING

Language Processing coverage.

Every story in the WeSearch catalog tagged with #language-processing, chronological, with view counts. Subscribe to the per-tag RSS feed to follow this topic in your reader of choice.

23 stories tagged with #language-processing, in publish-time order across the WeSearch catalog. Tag pages update as new stories ingest.

⌘ RSS feed for this tag →   or   search "Language Processing"

RELATED TAGS
#natural-language-processing20#ai18#ml15#ai-models2#llms1#post-training1#ai-alignment1#tokens1#temperature1#text-to-speech1#latency1#speech-synthesis1
ARXIV CS.AI

Inducing Reasoning Primitives from Agent Traces

ReAct-style LLM agents often rediscover the same reasoning routines across problems, yet leave those routines trapped in transient scratchpads. We introduce Reasoning Primitive Ind…

12 views ·
#artificial intelligence#machine learning#natural language processing
ARXIV CS.AI

DMF: A Deterministic Memory Framework for Conversational AI Agents

Conversational AI agents require memory systems that are both scalable and semantically coherent across long interaction horizons. Existing approaches rely predominantly on large l…

11 views ·
#artificial intelligence#machine learning#natural language processing
ARXIV CS.AI

Bridging Auxiliary Constraints to Resolve Instruction Following in Large Reasoning Models

Large Reasoning Models (LRMs) have demonstrated impressive capabilities in many tasks, yet they struggle with reliably following multiple instructions, either by failing to satisfy…

10 views ·
#artificial intelligence#machine learning#natural language processing
ARXIV CS.AI

A Dataset of Robot-Patient and Doctor-Patient Medical Dialogues for Spoken Language Processing Tasks

Large Language Models (LLMs) have brought huge improvements to Artificial Intelligence (AI), which can be applied to general-purpose tasks. However, their application to textual or…

16 views ·
#artificial intelligence#healthcare#machine learning
ARXIV CS.AI

From Norms to Indicators (N2I-RAG): An Agentic Retrieval-Augmented Generation Framework for Legal Indicator Computation

Computing legal indicators from normative texts is a key task in legal monitoring and policy evaluation, but presents significant challenges due to the complexity, scale, and inter…

13 views ·
#artificial intelligence#legal technology#natural language processing
ARXIV CS.AI

PathCal: State-Aware Reflection-Marker Calibration for Efficient Reasoning

The emergence of Large Reasoning Language Models (LRMs) has paved the way for tackling complex reasoning tasks through test-time scaling by generating long-form Chain-of-Thought (C…

12 views ·
#artificial intelligence#machine learning#natural language processing
ARXIV CS.AI

Parallel Context Compaction for Long-Horizon LLM Agent Serving

Long-horizon LLM agents accumulate growing conversation histories that eventually exceed the model's context window. Context compaction via LLM-based summarization keeps the conver…

17 views ·
#artificial intelligence#machine learning#natural language processing
ARXIV CS.AI

Seeing without Looking: Do Vision-Language Benchmarks Really Test Vision?

Benchmark accuracy is often implicitly assumed to reflect grounded visual understanding in vision-language models (VLMs), yet it remains unclear to what extent such scores truly re…

7 views ·
#computer vision#artificial intelligence
ARXIV CS.AI

Metacognition as Reward: Reinforcing LLM Reasoning via Knowledge and Regulation Signals

Recent RL methods have substantially improved the reasoning abilities of LLMs. Existing reward designs mainly follow two paradigms: (1) Reinforcement learning with verifiable rewar…

9 views ·
#artificial intelligence#machine learning#natural language processing
ARXIV CS.AI

SSDAU: Structured Semantic Data Augmentation for Joint Entity and Relation Extraction

Joint Entity and Relation Extraction (JERE) is highly susceptible to weak generalization due to low-quality training data. Data augmentation is a common strategy to enhance model…

11 views ·
#data augmentation#natural language processing#machine learning
ARXIV CS.AI

Chronicle: A Multimodal Foundation Model for Joint Language and Time Series Understanding

Real-world time series come with text: metadata, descriptions, news, reports. Yet time series foundation models process numerical sequences in isolation, and the multimodal text-an…

12 views ·
#machine learning#artificial intelligence#natural language processing
ARXIV CS.AI

Distributional Alignment as a Criterion for Designing Task Vectors in In-Context Learning

In-context learning (ICL) allows large language models (LLMs) to adapt to new tasks through demonstrations, yet it suffers from escalating inference costs as context length increas…

9 views ·
#artificial intelligence#machine learning#natural language processing
HUGGINGFACE

The Ettin Reranker Family

We’re on a journey to advance and democratize artificial intelligence through open source and open science.…

11 views ·
#machine learning#artificial intelligence#natural language processing
ARXIV CS.AI

AgentNLQ: A General-Purpose Agent for Natural Language to SQL

Natural language to SQL (NL2SQL) conversion is an important problem for researchers and enterprises due to the ubiquitous importance of relational databases in broad-ranging practi…

18 views ·
#artificial intelligence#natural language processing#sql
ARXIV CS.AI

Retrieve Only Relevant Tables Whether Few or Many: Adaptive Table Retrieval Method

Retrieving relevant tables from extensive databases for a given natural language query is essential for accurately answering questions in tasks such as text-to-SQL. Existing table …

13 views ·
#information retrieval#artificial intelligence#natural language processing
ARXIV CS.AI

HyperPersona: A Multi-Level Hypergraph Framework for Text-Based Automatic Personality Prediction

As a modern commodity, language has become a vast repository of socially and psychologically significant traits and concepts, reflecting the ways people encode pattern of thoughts,…

14 views ·
#artificial intelligence#personality prediction
ROSMINE ML BLOG

Fixing LLM Writing with Distribution Fine Tuning

Technical Report: How Distribution Fine Tuning (DFT) improves LLM writing quality…

11 views ·
#artificial intelligence#machine learning#natural language processing
ARXIV CS.AI

Domain-Independent Game Abstraction using Word Embedding Techniques

Many games of interest in the real world are often intractably large, thereby necessitating the use of game abstraction to shrink them in size, typically by many magnitudes. Over t…

12 views ·
#game theory#artificial intelligence#natural language processing
ARXIV CS.AI

Towards Generalization of Block Attention via Automatic Segmentation and Block Distillation

Block attention, which processes the input as separate blocks that cannot attend to one another, offers significant potential to improve KV cache reuse in long-context scenarios su…

11 views ·
#artificial intelligence#machine learning#natural language processing
DEV.TO (TOP)

Teaching an AI to Pick Its Own Brain: Building Adaptive Model Routing

Part 2 of the crab-bot series. If you missed Part 1, start here. The Problem Nobody...…

13 views ·
#ai#machine learning#chatbots
DEEPGRAM

The Streaming Latency Tradeoff: Why Some TTS Models Lose Accuracy in Real Time

Streaming TTS loses 5-20x context vs batch processing, causing pronunciation failures on alphanumeric IDs. Learn when to choose batch over real-time synthesis.…

12 views ·
#text-to-speech#ai models#latency
DEV.TO (TOP)

What Are Tokens and Temperature in AI Models?

A practical explanation of tokens, max tokens, and temperature for managers and engineers using Claude, Gemini, Llama, Gemma, and Qwen.…

10 views ·
#ai models#tokens#temperature
PYTORCH

A Primer on LLM Post-Training

8 views ·
#llms#post-training#ai alignment