Hacker News
new
top
best
ask
show
job
Zyphra releases the ZAYA1-8B MoE model optimized for intelligence density
(
huggingface.co
)
3 points
by
mirzap
3 hours ago
2 comments
GorbachevyChase
11 minutes ago
Seems like a big deal. Surprised there isn’t much engagement here.
mirzap
3 hours ago
Technical report:
https://www.zyphra.com/zaya1-8b-technical-report
Announcement post:
https://www.zyphra.com/post/zaya1-8b