1 pointby Yuriy_Bakhvalova month ago1 comment
  • Yuriy_Bakhvalova month ago
    Hi HN, I'm the author. This is a 4-paper cycle where I derive the kernel from first principles. Key features: No SGD, 500 layers. Happy to answer questions!