Hacker News
new
top
best
ask
show
job
DiffusionBlocks: Training Neural Networks One Block at a Time
(
pub.sakana.ai
)
3 points
by
sebg
3 hours ago
1 comment
billconan
2 hours ago
I do not understand.
how is this different from building smaller transformer layers, and each layer just denoises less?