@doo - megumin.org

doo@sh.itjust.works

0 Posts
1 Comment

Joined 3 years ago

Cake day: June 19th, 2023

You are not logged in. If you use a Fediverse account that is able to follow users, you can follow this user.

OverviewCommentsPosts

doo@sh.itjust.workstoTechnology@lemmy.ml•Luce DFlash: Qwen3.6-27B at up to 2x throughput on a single RTX 3090
link
fedilink
English
arrow-up
4·
29 days ago
The irony. Before llamacpp the only way to run llama was using other and on Nvidia GPUs. Then llamacpp expanded to other models, introduced gguf, added backends to run on GPUs and now we’re taking about running qwen using just python on a single Nvidia. Ouroboros is complete.

link
fedilink