Bingyao Li 「李冰瑶」

Ph.D. Candidate
Department of Computer Science
School of Computing and Information
University of Pittsburgh
bil35 [at] pitt [dot] edu
Curriculum Vitae

Hi! I am currently a fourth-year Ph.D. student in the Department of Computer Science at the University of Pittsburgh, advised by Dr. Xulong Tang. I received my B.E. and M.S. degrees from the College of Intelligence and Computing, Tianjin University, in 2017 and 2020, respectively.

My research interests lie primarily in computer architecture, with a focus on virtual memory and address translation for multi-GPU systems.

I am currently exploring system-level optimizations for Large Language Model serving as part of my internship at NVIDIA Research in Summer 2024.


  • 07/2024: One paper is accepted by MICRO 2024. Thanks to all collaborators!
  • 06/2024: Gave a talk on "Towards Efficient and Scalable Computing for Multi-GPUs" at NVIDIA.
  • 05/2024: Started an internship with the NVIDIA Architecture Research Group.
  • 10/2023: One paper is accepted by HPCA 2024. Thanks to all collaborators!
  • 07/2023: One paper is accepted by MICRO 2023. Thanks to all collaborators!
  • 06/2023: Awarded the CS50 Outstanding Research Fellowship.
  • 04/2023: Gave a talk on address translation in multi-GPU at Tianjin University.
  • 02/2023: One paper is accepted by DAC 2023. Thanks to all collaborators!
  • 10/2022: One paper is accepted by HPCA 2023. Thanks to all collaborators!
  • 04/2022: Awarded the CS50 Outstanding Research Fellowship.
  • 03/2022: One paper is accepted by WWW 2022 workshop. Thanks to all collaborators!
  • 07/2021: One paper is accepted by MICRO 2021. Thanks to all collaborators!


STAR: Sub-Entry Sharing-Aware TLB for Multi-Instance GPU
Bingyao Li, Yueqi Wang, Tianyu Wang, Lieven Eeckhout, Jun Yang, Aamer Jaleel, and Xulong Tang
In Proceedings of the 57th IEEE/ACM International Symposium on Microarchitecture
MICRO 2024
GRIT: Enhancing Multi-GPU Performance with Fine-Grained Dynamic Page Placement
Yueqi Wang*, Bingyao Li*, Aamer Jaleel, Jun Yang, and Xulong Tang
The 30th IEEE International Symposium on High-Performance Computer Architecture
* These authors contributed equally.
HPCA 2024
IDYLL: Enhancing Page Translation in Multi-GPUs via Light Weight PTE Invalidations
Bingyao Li, Yanan Guo, Yueqi Wang, Aamer Jaleel, Jun Yang, and Xulong Tang
In Proceedings of the 56th IEEE/ACM International Symposium on Microarchitecture
MICRO 2023
Orchestrated Scheduling and Partitioning for Improved Address Translation in GPUs
Bingyao Li, Yueqi Wang, and Xulong Tang
The 60th Design Automation Conference
DAC 2023
Trans-FW: Short Circuiting Page Table Walk in Multi-GPU Systems via Remote Forwarding
Bingyao Li, Jieming Yin, Anup Holey, Youtao Zhang, Jun Yang, and Xulong Tang
The 29th IEEE International Symposium on High-Performance Computer Architecture
HPCA 2023
Optimizing Data Layout for Training Deep Neural Networks
Bingyao Li*, Qi Xue*, Geng Yuan*, Sheng Li, Xiaolong Ma, Yanzhi Wang, and Xulong Tang
The ACM Web Conference Workshop
* These authors contributed equally.
WWW 2022 workshop
Improving Address Translation in Multi-GPUs via Sharing and Spilling aware TLB Design
Bingyao Li, Jieming Yin, Youtao Zhang, and Xulong Tang
In Proceedings of the 54th IEEE/ACM International Symposium on Microarchitecture
MICRO 2021
mcatCS: A Highly Efficient Cross-Matching Scheme for Multi-Band Astronomical Catalogs
Bingyao Li, Ce Yu, Chen Li, Xiaoteng Hu, Jian Xiao, Shanjiang Tang, Chenzhou Cui, and Dongwei Fan
Publications of the Astronomical Society of the Pacific, 2019, 131(999)
Astronomical Data Fusion: Recent Progress and Future Prospects - A Survey
Ce Yu, Bingyao Li, Jian Xiao, Chao Sun, Shanjiang Tang, Chongke Bi, Chenzhou Cui, and Dongwei Fan
Springer Experimental Astronomy, 2019(6)
An Efficient Retrieval Method for Astronomical Catalog Time Series Data
Bingyao Li, Ce Yu, Xiaoteng Hu, Jian Xiao, Shanjiang Tang, Lianmeng Li, and Bin Ma
18th International Conference on Algorithms and Architectures for Parallel Processing
ICA3PP 2018
GAIDR: An Efficient Time Series Subsets Retrieval Method for Geo-Distributed Astronomical Data
Xiaoteng Hu, Ce Yu, Bingyao Li, Shanjiang Tang, Jian Xiao, and Yanyan Huang
20th IEEE International Conference on High Performance Computing and Communications
HPCC 2018


  • CS1550: Introduction to Operating Systems
  • Fall 2021, Teaching Assistant, University of Pittsburgh

Awards & Honors

  • CS50 Outstanding Research Fellowship, University of Pittsburgh, 2022, 2023
  • SCI Fellowship, University of Pittsburgh, 2020
  • National Scholarship, Ministry of Education of China, 2019
  • Graduate First Prize Scholarship, Tianjin University, 2017, 2019