AMD Hipfire - a new inference engine optimized for AMD GPU's
Hipfire is a new open-source inference engine targeting AMD GPUs with a custom mq4 quantization method, releasing models on HuggingFace and claiming significant speedups per community benchmarks.
Excerpt
Came across hipfire the other day. It's a brand new inference engine focused on all AMD GPU's (not just the latest).
[Github.](https://github.com/Kaden-Schutt/hipfire)
It uses a special mq4 quantization method. The hipfire creator is pumping out [models on huggingface.](https://huggingface.co/schuttdev)
I don't know enough about quantization to know how good these quants are in terms of quality, but as an RDNA3 aficionado I'm happy AMD is getting some attention.
[Localmaxxing](https://www.localmaxxing.com/) is a new LLM benchmarking site, and shows some pretty dramatic speedups for hipfire inference.
Edit: I should have just said hipfire - I don't think this is connected to AMD officially.
Read at source: https://www.reddit.com/r/LocalLLaMA/comments/1swpsv0/amd_hipfire_a_new_inference_engine_optimized_for/
Discussions
- reddit · 104 points · 21 comments
- reddit · 119 points · 29 comments
- reddit · 129 points · 33 comments
- reddit · 141 points · 38 comments
- reddit · 156 points · 39 comments
- reddit · 171 points · 44 comments
- reddit · 183 points · 46 comments
- reddit · 191 points · 48 comments
- reddit · 198 points · 48 comments
- reddit · 204 points · 49 comments
- reddit · 211 points · 50 comments
- reddit · 218 points · 50 comments
- reddit · 219 points · 52 comments
- reddit · 224 points · 53 comments
- reddit · 233 points · 54 comments
- reddit · 237 points · 55 comments
- reddit · 243 points · 57 comments
- reddit · 248 points · 60 comments
- reddit · 254 points · 62 comments
- reddit · 257 points · 63 comments
- reddit · 261 points · 67 comments
- reddit · 265 points · 67 comments
- reddit · 265 points · 68 comments
- reddit · 269 points · 68 comments
- reddit · 270 points · 69 comments
- reddit · 269 points · 70 comments
- reddit · 271 points · 70 comments
- reddit · 273 points · 70 comments
- reddit · 274 points · 70 comments
- reddit · 273 points · 70 comments
- reddit · 277 points · 70 comments
- reddit · 276 points · 70 comments