The progressive encoding system should take the 90-minute cold start (encoding 68K text terms through SigLIP) down to ~30 seconds by encoding a seed vocabulary first, then background-encoding the rest while you're already processing images.
It's pure Rust, single binary, pip install photon-imager or build from source.
Would love feedback, contributions, and forks. Some areas where help would be especially welcome: - Windows support (currently macOS + Linux only) - Additional model backends beyond SigLIP - Frontend/UI for browsing tagged collections - Database integration examples (pgvector, Qdrant, etc.)