https://developer.chrome.com/docs/ai/prompt-api
I just checked the stats:
Model Name: v3Nano
Version: 2025.06.30.1229
Backend Type: GPU (highest quality)
Folder size: 4,072.13 MiB
Different use case but a similar approach.
I expect that at some point this will become a native web feature, but not anytime soon, since the model download is many times the size of the browser itself. Maybe at some point these APIs could use LLMs built into the OS, the way we do for graphics drivers.
Summarizer.create()
(requires user activation)
Every webpage I've ever visited has full JS execution privileges, and I trust half of them less than an LLM.
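For anyone wondering what the Summarizer.create() call mentioned above looks like in practice, here is a rough sketch. It assumes the API shape I remember from Chrome's docs (an availability() check, then create(), then summarize()), so verify it against the current documentation. The interfaces and the helper name summarizeWithBuiltInModel are made up for illustration; the factory is passed in as a parameter so the snippet also runs where the global does not exist.

```typescript
// Minimal shape of the API surface this sketch relies on.
// Assumption: mirrors Chrome's built-in Summarizer; check the current docs.
interface SummarizerLike {
  summarize(text: string): Promise<string>;
}

interface SummarizerFactory {
  // Expected values: "unavailable" | "downloadable" | "downloading" | "available"
  availability(): Promise<string>;
  create(options?: { type?: string; length?: string }): Promise<SummarizerLike>;
}

// Hypothetical helper. Call it from a click (or similar) handler so that
// create() runs under user activation, which is needed when the model
// still has to be downloaded.
async function summarizeWithBuiltInModel(
  factory: SummarizerFactory | undefined,
  text: string,
): Promise<string | null> {
  if (!factory) return null; // API not exposed in this browser

  const status = await factory.availability();
  if (status === "unavailable") return null; // device or browser cannot run the model

  const summarizer = await factory.create({ type: "tldr", length: "short" });
  return summarizer.summarize(text);
}
```

In a page you would wire it up roughly as `button.addEventListener("click", () => summarizeWithBuiltInModel((globalThis as any).Summarizer, articleText))`, where `button` and `articleText` are placeholders. A `null` result means the caller should fall back to something else.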
And what you call sketchy is what billions of people default to every day when they use web applications.
It's usually too much for an app to ask someone to set up a local LLM, but I believe this could solve that problem?
If you want to see an example of this, https://querylight.tryformation.com/ is where I put my search library and demo. It does vector search in the browser.
Anyone know if this is somehow possible without going through an extension?