Tong WU

Hi! 👋

I’m Tong WU (吴桐), currently a senior undergraduate in EECS at Peking University, supervised by Prof. Zhi Yang. I’m also a research intern at AIGCIC.

My research interests lie in:

  • Co-design of algorithms and systems for efficient LLM training and inference 🚀
  • AI hardware, compilers, and DSLs (still learning…) ⚙️

As a member of Tile-AI, I’m actively contributing to TileLang, a popular DSL for streamlining the development of efficient kernels, as well as to related open-source projects. These include TileScale, a distributed programming language still in progress, and TileOps, a high-performance operator library.

If you’re interested in my work, please feel free to contact me via email: wutong1109 [AT] stu [DOT] pku [DOT] edu [DOT] cn

selected publications

  1. NeurIPS
    ArkVale: Efficient Generative LLM Inference with Recallable Key-Value Eviction
    Renze Chen, Zhuofeng Wang, Beiquan Cao, Tong Wu, Size Zheng, Xiuhong Li, Xuechao Wei, Shengen Yan, Meng Li, and Yun Liang
    In Proceedings of the 38th International Conference on Neural Information Processing Systems, Vancouver, BC, Canada, 2025