LM Studio finally added support for MTP Speculative Decoding
LM Studio added beta support for MTP speculative decoding, giving local model users a new inference acceleration option.
Excerpt
https://preview.redd.it/1uuzjm0ll72h1.png?width=923&format=png&auto=webp&s=1af7d7594be1e08ff7ad6797e2bc53e9410769a3
update to 0.4.14 Build 2 (Beta) and make sure your llama.cpp engine is 2.15.0
https://preview.redd.it/x0vdwjb3n72h1.png?width=742&format=png&auto=webp&s=6367de44208004d2f50194d78a542c46b040dceb
you also must select "Manually choose model load parameters" and enable MTP in those before loading the model it is NOT on by default
Read at source: https://www.reddit.com/r/LocalLLaMA/comments/1ti99an/lm_studio_finally_added_support_for_mtp/
Discussions
- reddit · 112 points · 30 comments
- reddit · 131 points · 33 comments
- reddit · 141 points · 43 comments
- reddit · 151 points · 46 comments
- reddit · 163 points · 53 comments
- reddit · 169 points · 53 comments
- reddit · 177 points · 58 comments
- reddit · 180 points · 62 comments
- reddit · 184 points · 63 comments
- reddit · 187 points · 65 comments
- reddit · 194 points · 67 comments
- reddit · 199 points · 68 comments
- reddit · 205 points · 73 comments
- reddit · 204 points · 76 comments
- reddit · 208 points · 76 comments
- reddit · 213 points · 82 comments
- reddit · 216 points · 84 comments
- reddit · 219 points · 86 comments
- reddit · 218 points · 86 comments
- reddit · 218 points · 87 comments
- reddit · 221 points · 89 comments
- reddit · 227 points · 89 comments
- reddit · 225 points · 89 comments
- reddit · 233 points · 89 comments
- reddit · 233 points · 90 comments
- reddit · 232 points · 91 comments
- reddit · 233 points · 91 comments
- reddit · 234 points · 92 comments
- reddit · 231 points · 92 comments
- reddit · 233 points · 92 comments
- reddit · 234 points · 93 comments
- reddit · 238 points · 93 comments
- reddit · 238 points · 94 comments
- reddit · 239 points · 95 comments