14 results for "fine tuning"
Finetuning Activates Verbatim Recall of Copyrighted Books in LLMs
The official code repo of Alignment Whack-a-Mole: Finetuning Activates Verbatim Recall of Copyrighted Books in Large Language Models - cauchy221/Alignment-Whack-a-Mole-Code…
Parameter Efficiency Is Not Memory Efficiency: Rethinking Fine-Tuning for On-Device LLM Adaptation
Parameter-Efficient Fine-Tuning (PEFT) has become the standard for adapting large language models (LLMs). In this work, we challenge the widespread assumption that parameter efficiency equates to memory …
Neural Network Optimization Reimagined: Decoupled Techniques for Scratch and Fine-Tuning
With the accumulation of resources in the era of big data and the rise of pre-trained models in deep learning, optimizing neural networks for various tasks often involves different strategies for fine…
Going from 3B/7B dense to Nemotron 3 Nano (hybrid Mamba-MoE) for multi-task reasoning — what changes in the fine-tuning playbook? [D]
Following up on something I posted a few days back about fine-tuning for multi-task reasoning. Read a lot since then, and I've moved past the dense 3B vs 7B question — landing on Nemotron 3 Nano (the …
Sources and memos: Tencent employees used Claude Code to assist them with evaluating and fine-tuning the company's new Hy3 model to improve its performance (Juro Osawa/The Information)
By Juro Osawa / The Information.
OpenGame: Open Agentic Coding for Games
Game development sits at the intersection of creative design and intricate software engineering, demanding the joint orchestration of game engines, real-time loops, and tightly coupled state across ma…
Stochastic KV Routing: Enabling Adaptive Depth-Wise Cache Sharing
Serving transformer language models with high throughput requires caching Key-Values (KVs) to avoid redundant computation during autoregressive generation. The memory footprint of KV caching is signif…
PivotMerge: Bridging Heterogeneous Multimodal Pre-training via Post-Alignment Model Merging
Multimodal Large Language Models (MLLMs) rely on multimodal pre-training over diverse data sources, where different datasets often induce complementary cross-modal alignment capabilities. Model mergin…
SGP-SAM: Self-Gated Prompting for Transferring 3D Segment Anything Models to Lesion Segmentation
Large segmentation foundation models such as the Segment Anything Model (SAM) have reshaped promptable segmentation in natural images, and recent efforts have extended these models to medical images a…
MAE-Based Self-Supervised Pretraining for Data-Efficient Medical Image Segmentation Using nnFormer
Transformer architectures, including nnFormer, have demonstrated promising results in volumetric medical image segmentation by capturing long-range spatial interactions. Although they have …
Probing Visual Planning in Image Editing Models
Visual planning represents a crucial facet of human intelligence, especially in tasks that require complex spatial reasoning and navigation. Yet, in machine learning, this inherently visual problem is…
A Systematic Approach for Large Language Models Debugging
Large language models (LLMs) have become central to modern AI workflows, powering applications from open-ended text generation to complex agent-based reasoning. However, debugging these models remains…
Does Machine Unlearning Preserve Clinical Safety? A Risk Analysis for Medical Image Classification
The application of Deep Learning in medical diagnosis must balance patient safety with compliance with data protection regulations. Machine Unlearning enables the selective removal of training data fr…
A2DEPT: Large Language Model-Driven Automated Algorithm Design via Evolutionary Program Trees
Designing heuristics for combinatorial optimization problems (COPs) is a fundamental yet challenging task that traditionally requires extensive domain expertise. Recently, Large Language Model (LLM)-b…