2 points by ahendest 2 hours ago | 1 comment

julia-kafarska 2 hours ago
I run Qwen 35B on my local machine daily, and occasionally models with over 200B params using flash-moe. With all the open models available today, spending a lot of money makes sense if your needs are bigger than those of a couple of people.
- ahendest 38 minutes ago
  What tokens/s do you get for Qwen and for flash-moe? What system are you using? And are you satisfied with them? Thanks for the reply!