Luce DFlash: Qwen3.6-27B at up to 2x throughput on a single RTX 3090

· r/LocalLLaMA ·

Luce DFlash is an inference acceleration tool that achieves up to 2x throughput for running Qwen3.6-27B on a single RTX 3090, targeting local LLM enthusiasts on consumer hardware.

Categories: OSS & Tools

Excerpt

r/LocalLLaMA · 107 points · 23 comments · i.redd.it

Discussions