4 pointsby seventeen292 hours ago1 comment

da-x2 hours ago
I've read elsewhere that it's 10x less electricity for inference workloads compared to standard GPUs. It is not clear to me, are the model weights built into the silicon (e.g. per model tapeout), or is this a new kind of chip architecture that still has weights in DRAM/SRAM?
- slongfield2 hours ago
  Full disclosure: I work at Etched.
  Weights are not burnt into silicon per-model. They're in SRAM/HBM. There's some more info on the website (etched.com) and we'll be sharing more details about model benchmarks this summer.