OMLX v0.3.9 Stable Merges Native MTP (Multi-Token Prediction)
OMLX v0.3.9 introduces several enhancements, including native Multi-Token Prediction (MTP) for supported models. The update aims to improve decoding speed and efficiency for image and text requests. It also includes stability improvements and new features.
- OMLX v0.3.9 consolidates previous pre-releases and includes post-release stabilization fixes.
- Native MTP allows supported models to predict multiple tokens simultaneously, enhancing decoding speed.
- The update features major stability improvements for low-memory machines, ensuring better performance under pressure.
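
To make the MTP bullet concrete, here is a toy sketch of one common way multi-token prediction is used: the MTP head drafts several tokens at once and the base model verifies them, keeping the matching prefix. The summary does not say how OMLX implements MTP, so everything here is an assumption for illustration: `base_next`, `mtp_draft`, and `decode` are hypothetical names, and the "model" is a trivial counter rather than a real network.

```python
from typing import List, Tuple


def base_next(tokens: List[int]) -> int:
    """Hypothetical base model: the next token is (last + 1) mod 10."""
    return (tokens[-1] + 1) % 10


def mtp_draft(tokens: List[int], k: int) -> List[int]:
    """Hypothetical MTP head: proposes k future tokens in one call.

    The third proposal is deliberately wrong so the rejection path runs.
    """
    last = tokens[-1]
    draft = [(last + i) % 10 for i in range(1, k + 1)]
    if k >= 3:
        draft[2] = (draft[2] + 5) % 10  # injected error
    return draft


def decode(prompt: List[int], n_new: int, k: int) -> Tuple[List[int], int]:
    """Greedy decode with draft-and-verify; returns (tokens, decode steps)."""
    tokens = list(prompt)
    steps = 0
    while len(tokens) - len(prompt) < n_new:
        steps += 1
        accepted: List[int] = []
        # A real system would verify all drafted positions in one batched
        # forward pass; this toy calls the base model once per position.
        for tok in mtp_draft(tokens, k):
            expected = base_next(tokens + accepted)
            if tok != expected:
                accepted.append(expected)  # keep the verified token, stop
                break
            accepted.append(tok)
        tokens.extend(accepted)
    return tokens[: len(prompt) + n_new], steps


if __name__ == "__main__":
    print(decode([0], 6, 1))  # one token per step
    print(decode([0], 6, 3))  # same tokens, fewer steps
```

In this sketch the output is identical to plain one-token-at-a-time greedy decoding; only the number of decode steps changes, which is where the speedup in the bullet above would come from.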
Opening excerpt (first ~120 words)
jundot / omlx · Releases · v0.3.9 (Latest)
jundot released this 21 May 16:42 · 1 commit to main since this release · v0.3.9 · 8cad121
This is the stable 0.3.9 release, consolidating the 0.3.9.dev1, 0.3.9.dev2, and 0.3.9rc1 pre-releases plus the post-rc stabilization fixes.
…
Excerpt limited to ~120 words for fair-use compliance. The full article is at GitHub.