4 pointsby tulpa8 hours ago3 comments

armcat3 hours ago
This person did a great comparison against Qwen models, and despite them having 8x less active params, they outperform the Cohere model in every category: https://x.com/DJLougen/status/2057196012918149368?s=20
james2doyle6 hours ago
I’m always surprised by how performant the Cohere models are. They output quick. I tested out the BF16 and it seems pretty good. I tried out the FP8 one and it did seem a bit dumber. Curious to see how this ranks in benchmarks
tulpa8 hours ago
New open-source model from Cohere