Tong WU

avatar.jpg

Hi!šŸ‘‹

I’m Tong WU (吓竄), currently a final year undergraduate in EECS, Peking University, supervised by Prof. Zhi Yang.

My research interests lie in:

  • Co-design of algorithms and systems for efficient LLM training and inference✨
  • AI hardwares, compilers and DSLs (still learning…)āš™ļø
  • Designing and optimizing AI operators to the speed-of-lightšŸš€

Currently as a member of Tile-AI, I’m actively contributing to TileLang, a popular DSL for streamline the development of efficient kernels, as well as relevant open-source projects. They include TileScale, a distributed programming language in progress, and the high-performance operator library TileOps. I’m also a part-time intern in ByteDance now.

If you’re interested in my work, please feel free to contact me via my email: wutong1109 [AT] stu [DOT] pku [DOT] edu [DOT] cn

selected publications

  1. NeurIPS
    ArkVale: Efficient Generative LLM Inference with Recallable Key-Value Eviction
    Renze Chen, Zhuofeng Wang, Beiquan Cao, Tong Wu, Size Zheng, Xiuhong Li, Xuechao Wei, Shengen Yan, Meng Li, and Yun Liang
    In Proceedings of the 38th International Conference on Neural Information Processing Systems, Vancouver, BC, Canada, 2025
  2. ICLR
    TileLang: Bridge Programmability and Performance in Modern Neural Kernels
    Lei Wang, Yu Cheng, Yining Shi, Zhiwen Mo, Zhengju Tang, Wenhao Xie, Tong Wu, Lingxiao Ma, Yuqing Xia, Jilong Xue, Fan Yang, and Zhi Yang
    The Fourteenth International Conference on Learning Representations (ICLR 2026), 2026