30 results for "training"
Porting a Scratch-Built 500M LLM Training Pipeline to ROCm on Strix Halo
A lightweight transformer language model built from scratch in PyTorch, trained on a single consumer GPU with a full pipeline for data processing, pretraining, and instruction tuning. - epscylonb/1...…
Agentic AI platforms for autonomous training and rule induction of human-human and virus-human protein-protein interactions
We instruct an AI agent to construct two separate agentic AI platforms: one for autonomous training of predictive ML models for human-human and virus-human PPI, and the other for inducing explicit gen…
Decoupled DiLoCo: Resilient, Distributed AI Training at Scale
Google’s new distributed architecture keeps AI training runs on track across distant data centers, with exceptional efficiency – even when hardware fails.…
The loss curve said tie. The judges said otherwise. Seeking replication for an early LLM training result [R]
Fast experiment on T4 GPU. Self play training on Dark Hex (Colab notebook) [P]
PRADA to provide skill training to Athani leather artisans
A Primer on LLM Post-Training
You were training AI while catching Pokemon [video]
LabelSets — open quality standard for AI training data (LQS v3.1) [D]
Built a third-party quality rating system for ML datasets. Multi-oracle (7 scorers across 5 algorithm families), conformal prediction intervals on downstream F1, Ed25519-signed certs, and a contaminat…
Training LFM-2.5-350M on Reddit post summarization with GRPO on my 3x Mac Minis — final evals and t-test evals are here
So, with this project I want to see if a length constrained (like 64 tokens only) quality summarization can be done by tiny LLMs using GRPO! So, I trained two variants of this task: using just length …
IIM-Kozhikode launches ‘Uyare’ project to empower women entrepreneurs
IIM Kozhikode launches 'Uyare' project to empower women entrepreneurs through training and resources in Kozhikode and Malappuram.…
Architectural Requirements for Agentic AI Containment
The April 2026 disclosure that a frontier large language model escaped its security sandbox, executed unauthorized actions, and concealed its modifications to version control history demonstrates that…
PyTorch NaNs Are Silent Killers — So I Built a 3ms Hook to Catch Them at the Exact Layer
NaNs don’t crash your training — they quietly destroy it. After losing hours to a silent failure in a ResNet training run, I built a lightweight detector that pinpoints the exact layer and batch where…
China upgrades maritime rescue range and depth as ambitions on high seas expand
Body responsible for strategically sensitive South China Sea completes ‘formal transition’ after extensive deepwater training.…
Claude can now build your entire weekly meal plan with real grocery prices from your actual store FOR FREE. Here's how
I saw this viral thread on X going around about using Claude as a dietitian. Tried all 12 prompts directly. Nutrition logic was genuinely impressive. Macro calculations, meal timing, gut health protoc…
Mitigating Belief Inertia via Active Intervention in Embodied Agents
Recent advancements in large language models (LLMs) have enabled agents to tackle complex embodied tasks through environmental interaction. However, these agents still make suboptimal decisions and pe…
Calls for ‘student premium’ to support disadvantaged young people after GCSEs
Social mobility groups say post-16 funding gap risks young people falling out of education, work and training. A coalition of 14 social mobility organisations is urging the government to fund a “studen…
Forest fire prevention measures intensified in Erode Division
Erode Division intensifies forest fire prevention with technology, community training, and a dedicated control room for early response.…
The Power of Power Law: Asymmetry Enables Compositional Reasoning
Natural language data follows a power-law distribution, with most knowledge and skills appearing at very low frequency. While a common intuition suggests that reweighting or curating data towards a un…
Towards Causally Interpretable Wi-Fi CSI-Based Human Activity Recognition with Discrete Latent Compression and LTL Rule Extraction
We address Human Activity Recognition (HAR) utilizing Wi-Fi Channel State Information (CSI) under the joint requirements of causal interpretability, symbolic controllability, and direct operation on h…
PhySE: A Psychological Framework for Real-Time AR-LLM Social Engineering Attacks
The emerging threat of AR-LLM-based Social Engineering (AR-LLM-SE) attacks (e.g. SEAR) poses a significant risk to real-world social interactions. In such an attack, a malicious actor uses Augmented R…
StoryTR: Narrative-Centric Video Temporal Retrieval with Theory of Mind Reasoning
Current video moment retrieval excels at action-centric tasks but struggles with narrative content. Models can see what is happening but fail to reason why it matters. This semantic …
ArguAgent: AI-Supported Real-Time Grouping for Productive Argumentation in STEM Classrooms
Argumentation is a core practice in STEM education, but its productivity depends on who participates and how they interact. Higher-achieving students often dominate the talk and decision-making, while…
MetaGAI: A Large-Scale and High-Quality Benchmark for Generative AI Model and Data Card Generation
The rapid proliferation of Generative AI necessitates rigorous documentation standards for transparency and governance. However, manual creation of Model and Data Cards is not scalable, while automate…
When AI reviews science: Can we trust the referee?
The volume of scientific submissions continues to climb, outpacing the capacity of qualified human referees and stretching editorial timelines. At the same time, modern large language models (LLMs) of…
Tandem: Riding Together with Large and Small Language Models for Efficient Reasoning
Recent advancements in large language models (LLMs) have catalyzed the rise of reasoning-intensive inference paradigms, where models perform explicit step-by-step reasoning before generating final ans…
Does Machine Unlearning Preserve Clinical Safety? A Risk Analysis for Medical Image Classification
The application of Deep Learning in medical diagnosis must balance patient safety with compliance with data protection regulations. Machine Unlearning enables the selective removal of training data fr…
An empirical evaluation of the risks of AI model updates using clinical data: stability, arbitrariness, and fairness
Artificial Intelligence and Machine Learning (AI/ML) models used in clinical settings are increasingly deployed to support clinical decision-making. However, when training data become stale due to cha…
Representational Curvature Modulates Behavioral Uncertainty in Large Language Models
In autoregressive large language models (LLMs), temporal straightening offers an account of how the next-token prediction objective shapes representations. Models learn to progressively straighten the…
An Analysis of the Coordination Gap between Joint and Modular Learning for Job Shop Scheduling with Transportation Resources
Efficient job-shop scheduling with transportation resources is critical for high-performance manufacturing. With the rise of "decentralized factories", multi-agent reinforcement learning has emerged a…