Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Ooops, missed one sentence in my previous response. Stanford's MegaKernel project tackles a similar challenge but focuses on manual CUDA implementation. While MPK takes a compiler-driven approach—users express their LLMs at the PyTorch level, and MPK automatically compiles them into optimized megakernels. Our goal is to make programming megakernels much more accessible.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: