Hacker News
DeepSeek's mHC: Stabilizing Training Divergence from 3,000x to 1.6x
2 points by Research_Brief 8 hours ago