#post-training — Tagged Stories

Every story in the WeSearch catalog tagged with #post-training, chronological, with view counts. Subscribe to the per-tag RSS feed to follow this topic in your reader of choice.

2 stories tagged with #post-training, in publish-time order across the WeSearch catalog. Tag pages update as new stories ingest.

⌘ RSS feed for this tag → or search "Post Training"

ARXIV CS.AI

Complementing reinforcement learning with SFT through logit averaging in the post training of LLMs

We introduce a novel method that averages the logits of a frozen reference policy (e.g., SFT) and a trainable policy, and incorporate the method into Group Relative Policy Optimiza…

15 views · Fri, 22 May 2026 04:02:00 GMT

#machine learning #artificial intelligence #reinforcement learning

X (FORMERLY TWITTER)

Distribution Fine Tuning (DFT): A post training step that fixes LLM writing

I fixed why LLMs write so poorly, and I have a demo to prove it Announcing Distribution Fine Tuning (DFT): A post training step that fixes LLM writing Model outputs fooled pangra…

23 views · Mon, 18 May 2026 19:19:56 GMT

Browse more

All tags Search "Post Training" RSS feed World US Technology Markets

Post Training coverage.

Complementing reinforcement learning with SFT through logit averaging in the post training of LLMs

Distribution Fine Tuning (DFT): A post training step that fixes LLM writing

Browse more