1 point by gpu_systems 8 hours ago | 1 comment
  • gpu_systems 8 hours ago
    Ever had a CUDA workload mysteriously slow down and then burned hours figuring out why?

    This tool measures actual Unified Memory behavior: cold path (page faults), warm path (resident), and pressure pass (sustained load). It lets you compare two runs on the same GPU under different system conditions to see how memory migration and residency change.
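
    For anyone who wants to see the cold/warm gap for themselves, here is a minimal sketch of the idea. It is not the tool's actual code. The kernel name `touch`, the helper `time_pass`, the 256 MiB buffer size, and the choice of 50 back-to-back launches as the "pressure" pass are all assumptions made for illustration. It uses only standard CUDA runtime calls (`cudaMallocManaged`, CUDA events). On systems without on-demand page migration, the cold pass pays for a bulk migration at launch instead of page faults, so the numbers will look different.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Abort with a readable message if a CUDA runtime call fails.
#define CHECK(call)                                                   \
    do {                                                              \
        cudaError_t e = (call);                                       \
        if (e != cudaSuccess) {                                       \
            fprintf(stderr, "%s:%d: %s\n", __FILE__, __LINE__,        \
                    cudaGetErrorString(e));                           \
            exit(1);                                                  \
        }                                                             \
    } while (0)

// Hypothetical kernel: a grid-stride loop that touches every element once.
__global__ void touch(float *buf, size_t n) {
    size_t i = blockIdx.x * (size_t)blockDim.x + threadIdx.x;
    size_t stride = (size_t)gridDim.x * blockDim.x;
    for (; i < n; i += stride) buf[i] += 1.0f;
}

// Hypothetical helper: time `launches` back-to-back kernel launches with
// CUDA events and return the average milliseconds per launch.
static float time_pass(float *buf, size_t n, int launches) {
    cudaEvent_t start, stop;
    CHECK(cudaEventCreate(&start));
    CHECK(cudaEventCreate(&stop));
    CHECK(cudaEventRecord(start));
    for (int k = 0; k < launches; ++k) touch<<<256, 256>>>(buf, n);
    CHECK(cudaGetLastError());
    CHECK(cudaEventRecord(stop));
    CHECK(cudaEventSynchronize(stop));
    float ms = 0.0f;
    CHECK(cudaEventElapsedTime(&ms, start, stop));
    CHECK(cudaEventDestroy(start));
    CHECK(cudaEventDestroy(stop));
    return ms / launches;
}

int main() {
    const size_t n = (size_t)64 << 20;  // 64 Mi floats = 256 MiB (assumed size)
    float *buf = nullptr;
    CHECK(cudaMallocManaged(&buf, n * sizeof(float)));

    // First touch on the CPU so the pages start out host-resident.
    for (size_t i = 0; i < n; ++i) buf[i] = 0.0f;

    float cold = time_pass(buf, n, 1);       // pages must migrate to the GPU
    float warm = time_pass(buf, n, 1);       // pages should now be GPU-resident
    float pressure = time_pass(buf, n, 50);  // sustained back-to-back launches

    printf("cold:     %.3f ms\n", cold);
    printf("warm:     %.3f ms\n", warm);
    printf("pressure: %.3f ms per launch\n", pressure);

    CHECK(cudaFree(buf));
    return 0;
}
```

    Build with something like `nvcc -O2 um_passes.cu -o um_passes` (the file name is arbitrary), then run it twice on the same GPU under different system conditions and compare the three numbers between the runs.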