publications

2026

  1. ICLR
    TileLang: Bridge Programmability and Performance in Modern Neural Kernels
    Lei Wang, Yu Cheng, Yining Shi, Zhiwen Mo, Zhengju Tang, Wenhao Xie, Tong Wu, Lingxiao Ma, Yuqing Xia, Jilong Xue, Fan Yang, and Zhi Yang
    The Fourteenth International Conference on Learning Representations (ICLR 2026), 2026

2025

  1. NeurIPS
    ArkVale: Efficient Generative LLM Inference with Recallable Key-Value Eviction
    Renze Chen, Zhuofeng Wang, Beiquan Cao, Tong Wu, Size Zheng, Xiuhong Li, Xuechao Wei, Shengen Yan, Meng Li, and Yun Liang
    In Proceedings of the 38th International Conference on Neural Information Processing Systems, Vancouver, BC, Canada, 2025