Tong WU
Hi!š
Iām Tong WU (å“ē«„), currently a final year undergraduate in EECS, Peking University, supervised by Prof. Zhi Yang.
My research interests lie in:
- Co-design of algorithms and systems for efficient LLM training and inferenceāØ
- AI hardwares, compilers and DSLs (still learningā¦)āļø
- Designing and optimizing AI operators to the speed-of-lightš
Currently as a member of Tile-AI, Iām actively contributing to TileLang, a popular DSL for streamline the development of efficient kernels, as well as relevant open-source projects. They include TileScale, a distributed programming language in progress, and the high-performance operator library TileOps. Iām also a part-time intern in ByteDance now.
If youāre interested in my work, please feel free to contact me via my email: wutong1109 [AT] stu [DOT] pku [DOT] edu [DOT] cn
selected publications
- NeurIPSArkVale: Efficient Generative LLM Inference with Recallable Key-Value EvictionIn Proceedings of the 38th International Conference on Neural Information Processing Systems, Vancouver, BC, Canada, 2025
- ICLRTileLang: Bridge Programmability and Performance in Modern Neural KernelsThe Fourteenth International Conference on Learning Representations (ICLR 2026), 2026