Bingyao Li 「李冰瑶」

Ph.D. Candidate at the University of Pittsburgh

Hi! I am a final-year Ph.D. student in the Department of Computer Science at the University of Pittsburgh, advised by Dr. Xulong Tang. I received my B.E. and M.S. degrees from the College of Intelligence and Computing, Tianjin University, in 2017 and 2020, respectively.

My research interests lie broadly in the areas of GPU architecture and systems. Specifically, my work focuses on designing architectures and system features for next-generation GPU systems, aimed at enabling and supporting emerging large-scale applications on both single- and multi-GPU platforms.

Currently, I am extending my research to system-level optimizations for large language model serving, as part of my internship at NVIDIA Research in Summer 2024.

I am currently on the job market for tenure-track faculty positions and research scientist positions in industry. Please contact me about any opportunities.

News

11/2024 One paper was accepted to HPCA 2025 (shepherding). Thanks to all collaborators!
07/2024 One paper was accepted to MICRO 2024. Thanks to all collaborators!
06/2024 Gave a talk on "Towards Efficient and Scalable Computing for Multi-GPUs" at NVIDIA.
05/2024 Started an internship at the NVIDIA Architecture Research Group.
10/2023 One paper was accepted to HPCA 2024. Thanks to all collaborators!
07/2023 One paper was accepted to MICRO 2023. Thanks to all collaborators!
06/2023 Awarded the CS50 Outstanding Research Fellowship.
04/2023 Gave a talk on "Towards Efficient and Scalable Computing for Multi-GPUs" at Tianjin University.
02/2023 One paper was accepted to DAC 2023. Thanks to all collaborators!
10/2022 One paper was accepted to HPCA 2023. Thanks to all collaborators!
04/2022 Awarded the CS50 Outstanding Research Fellowship.
07/2021 One paper was accepted to MICRO 2021. Thanks to all collaborators!

Publications (Full list)

MICRO 2024
STAR: Sub-Entry Sharing-Aware TLB for Multi-Instance GPU
Bingyao Li, Yueqi Wang, Tianyu Wang, Lieven Eeckhout, Jun Yang, Aamer Jaleel, and Xulong Tang
In Proceedings of the 57th IEEE/ACM International Symposium on Microarchitecture

HPCA 2024
GRIT: Enhancing Multi-GPU Performance with Fine-Grained Dynamic Page Placement
Yueqi Wang*, Bingyao Li*, Aamer Jaleel, Jun Yang, and Xulong Tang
(* These authors contributed equally)
The 30th IEEE International Symposium on High-Performance Computer Architecture

MICRO 2023
IDYLL: Enhancing Page Translation in Multi-GPUs via Light Weight PTE Invalidations
Bingyao Li, Yanan Guo, Yueqi Wang, Aamer Jaleel, Jun Yang, and Xulong Tang
In Proceedings of the 56th IEEE/ACM International Symposium on Microarchitecture

DAC 2023
Orchestrated Scheduling and Partitioning for Improved Address Translation in GPUs
Bingyao Li, Yueqi Wang, and Xulong Tang
The 60th Design Automation Conference

HPCA 2023
Trans-FW: Short Circuiting Page Table Walk in Multi-GPU Systems via Remote Forwarding
Bingyao Li, Jieming Yin, Anup Holey, Youtao Zhang, Jun Yang, and Xulong Tang
The 29th IEEE International Symposium on High-Performance Computer Architecture

MICRO 2021
Improving Address Translation in Multi-GPUs via Sharing and Spilling aware TLB Design
Bingyao Li, Jieming Yin, Youtao Zhang, and Xulong Tang
In Proceedings of the 54th IEEE/ACM International Symposium on Microarchitecture
