However it seems the software part is always a blocker, and these architecture only address well a handful of program types.
Is this time different? I think it is. The paradigm here is about small threads that yield very often, and we have that way of programming: Erlang, Go, Java's virtual threads.