The companies building giant AI compute for themselves have started renting it out: SpaceX leased all of Colossus to Anthropic, and Meta says a cloud business is on the table. Apple is about to have the most efficient, most private inference fleet on earth. Can it keep that to itself?
Why would Apple’s inference fleet be more efficient? They would need to get into the TPU and water-cooling business to make that happen. Apple’s silicon is great, but remember it’s based on ARM, an existing ISA with reference designs. The burden of building Apple’s CPU silicon is lower than what it would take to build a TPU.