MatejSprogar 2 hours ago
I’m the author. AGITB is a small, header-only C++ benchmark for testing predictive models on raw binary streams.
It’s a deterministic, pass/fail test suite intended as a necessary (not sufficient) step toward general intelligence. Most systems fail by design; reports of models that pass are especially interesting to me.
Feedback on the benchmark design is welcome.