Bingyao Li ​「李冰瑶」

Ph.D. Candidate in University of Pittsburgh

Hi! I am currently a final-year Ph.D. student in Department of Computer Science at University of Pittsburgh, advised by Dr. Xulong Tang. I received my B.E. and M.S. degree from the College of Intelligence and Computing, Tianjin University in 2017 and 2020, respectively.

My research interests lie broadly in advanced computer architectures, high-performance computing systems, and emerging parallel applications, with a specific focus on GPU ecosystem from architecture to application. My work aims to design architectures and system features for next-generation GPU systems, enabling and supporting emerging large-scale applications on both single- and multi-GPU platforms.

Currently, I am actively engaged in extending my research field to system-level optimizations for large language models serving, as part of my internship at NVIDIA Research in Summer’24.

I am currently on the job market for tenure-track fac​ulty positions and research scientist positions in the industry. Please kindly contact me if there are any opportunities.

News !

11/2024 One paper is accepted by HPCA 2025. Thanks to all collaborators!
07/2024 One paper is accepted by MICRO 2024. Thanks to all collaborators!
06/2024 Gave a talk on "Towards Efficient and Salable Computing for Multi-GPUs" at NVIDIA.
05/2024 Start intern at NVIDIA Architecture Research Group.
10/2023 One paper is accepted by HPCA 2024. Thanks to all collaborators!
07/2023 One paper is accepted by MICRO 2023. Thanks to all collaborators!
06/2023 Awarded the CS50 Outstanding Research Fellowship.
04/2023 Gave a talk on ​"Towards Efficient and Salable Computing for Multi-GPUs" at Tianjin University.
02/2023 One paper is accepted by DAC 2023. Thanks to all collaborators!
10/2022 One paper is accepted by HPCA 2023. Thanks to all collaborators!
04/2022 Awarded the CS50 Outstanding Research Fellowship.
07/2021 One paper is accepted by MICRO 2021. Thanks to all collaborators!

Publication (Full list)

MICRO 2024 STAR: Sub-Entry Sharing-Aware TLB f​or Multi-Instance GPU
Bingyao Li , Yueqi Wang, Tianyu Wang, Lieven Eeckhout, Jun Yang, Aamer Jaleel, and Xulong Tang
In Proceedings of the 57th IEEE/ACM International Symposium on​ Microarchitecture
HPCA 2024 GRIT: Enhancing Multi-GPU Performance with Fine-Grained Dynamic Page Placement
Yueqi Wang*, Bingyao Li* , Aamer Jaleel, Jun Yang, and Xulong Tang  
(* The authors contribute equally)
The 30th IEEE International Symposium on High-Performance Computer Architecture
MICRO 2023 IDYLL: Enhancing Page Translation in Multi-GPUs via Light Weight PTE Invalidations
Bingyao Li, Yanan Guo, Yueqi Wang, Aamer Jaleel, Jun Yang, and Xulong Tang
In Proceedings of the 56th IEEE/ACM International Symposium on Microarchitecture
DAC 2023 Orchestrated Scheduling and Partitioning for Improved Address Translation in GPUs
Bingyao Li , Yueqi Wang, and Xulong Tang
The 60th Design Automation Conference
HPCA 2023 Trans-FW: Short Circuiting Page Table Walk in Multi-GPU Systems via Remote Forwarding
Bingyao Li
, Jieming Yin, Anup Holey, Youtao Zhang, Jun Yang, and Xulong Tang
The 29th IEEE International Symposium on High-Performance Computer Architecture
MICRO 2021 Improving Address Translation in Multi-GPUs via Sharing and Spilling aware TLB Design
Bingyao Li
, ​Jieming Yin, Youtao Zhang, and Xulong Tang
In Proceedings of the 54th IEEE/ACM International Symposium on Microarchitecture
11/2024 One paper is accepted by HPCA 2025. Thanks to all collaborators!
07/2024 One paper is accepted by MICRO 2024. Thanks to all collaborators!
06/2024 Gave a talk on "Towards Efficient and Salable Computing for Multi-GPUs" at NVIDIA.
05/2024 Start intern at NVIDIA Architecture Research Group.
10/2023 One paper is accepted by HPCA 2024. Thanks to all collaborators!
07/2023 One paper is accepted by MICRO 2023. Thanks to all collaborators!
06/2023 Awarded the CS50 Outstanding Research Fellowship.
04/2023 Gave a talk on ​"Towards Efficient and Salable Computing for Multi-GPUs" at Tianjin University.
02/2023 One paper is accepted by DAC 2023. Thanks to all collaborators!
10/2022 One paper is accepted by HPCA 2023. Thanks to all collaborators!
04/2022 Awarded the CS50 Outstanding Research Fellowship.
07/2021 One paper is accepted by MICRO 2021. Thanks to all collaborators!

News !

MICRO 2024 STAR: Sub-Entry Sharing-Aware TLB f​or Multi-Instance GPU
Bingyao Li , Yueqi Wang, Tianyu Wang, Lieven Eeckhout, Jun Yang, Aamer Jaleel, and Xulong Tang
In Proceedings of the 57th IEEE/ACM International Symposium on​ Microarchitecture
HPCA 2024 GRIT: Enhancing Multi-GPU Performance with Fine-Grained Dynamic Page Placement
Yueqi Wang*, Bingyao Li* , Aamer Jaleel, Jun Yang, and Xulong Tang  
(* The authors contribute equally)
The 30th IEEE International Symposium on High-Performance Computer Architecture
MICRO 2023 IDYLL: Enhancing Page Translation in Multi-GPUs via Light Weight PTE Invalidations
Bingyao Li, Yanan Guo, Yueqi Wang, Aamer Jaleel, Jun Yang, and Xulong Tang
In Proceedings of the 56th IEEE/ACM International Symposium on Microarchitecture
DAC 2023 Orchestrated Scheduling and Partitioning for Improved Address Translation in GPUs
Bingyao Li, Yueqi Wang, and Xulong Tang
The 60th Design Automation Conference
HPCA 2023 Trans-FW: Short Circuiting Page Table Walk in Multi-GPU Systems via Remote Forwarding
Bingyao Li
, Jieming Yin, Anup Holey, Youtao Zhang, Jun Yang, and Xulong Tang
The 29th IEEE International Symposium on High-Performance Computer Architecture
MICRO 2021 Improving Address Translation in Multi-GPUs via Sharing and Spilling aware TLB Design
Bingyao Li
, ​Jieming Yin, Youtao Zhang, and Xulong Tang
In Proceedings of the 54th IEEE/ACM International Symposium on Microarchitecture

Publication (Full list)

News !

11/2024 One paper is accepted by HPCA 2025. Thanks to all collaborators!
07/2024 One paper is accepted by MICRO 2024. Thanks to all collaborators!
06/2024 Gave a talk on "Towards Efficient and Salable Computing for Multi-GPUs" at NVIDIA.
05/2024 Start intern at NVIDIA Architecture Research Group.
10/2023 One paper is accepted by HPCA 2024. Thanks to all collaborators!
07/2023 One paper is accepted by MICRO 2023. Thanks to all collaborators!
06/2023 Awarded the CS50 Outstanding Research Fellowship.
04/2023 Gave a talk on ​"Towards Efficient and Salable Computing for Multi-GPUs" at Tianjin University.
02/2023 One paper is accepted by DAC 2023. Thanks to all collaborators!
10/2022 One paper is accepted by HPCA 2023. Thanks to all collaborators!
04/2022 Awarded the CS50 Outstanding Research Fellowship.
07/2021 One paper is accepted by MICRO 2021. Thanks to all collaborators!
MICRO 2024 STAR: Sub-Entry Sharing-Aware TLB f​or Multi-Instance GPU
Bingyao Li , Yueqi Wang, Tianyu Wang, Lieven Eeckhout, Jun Yang, Aamer Jaleel, and Xulong Tang
In Proceedings of the 57th IEEE/ACM International Symposium on​ Microarchitecture
HPCA 2024 GRIT: Enhancing Multi-GPU Performance with Fine-Grained Dynamic Page Placement
Yueqi Wang*, Bingyao Li* , Aamer Jaleel, Jun Yang, and Xulong Tang  
(* The authors contribute equally)
The 30th IEEE International Symposium on High-Performance Computer Architecture
MICRO 2023 IDYLL: Enhancing Page Translation in Multi-GPUs via Light Weight PTE Invalidations
Bingyao Li, Yanan Guo, Yueqi Wang, Aamer Jaleel, Jun Yang, and Xulong Tang
In Proceedings of the 56th IEEE/ACM International Symposium on Microarchitecture
DAC 2023 Orchestrated Scheduling and Partitioning for Improved Address Translation in GPUs
Bingyao Li, Yueqi Wang, and Xulong Tang
The 60th Design Automation Conference
HPCA 2023 Trans-FW: Short Circuiting Page Table Walk in Multi-GPU Systems via Remote Forwarding
Bingyao Li
, Jieming Yin, Anup Holey, Youtao Zhang, Jun Yang, and Xulong Tang
The 29th IEEE International Symposium on High-Performance Computer Architecture
MICRO 2021 Improving Address Translation in Multi-GPUs via Sharing and Spilling aware TLB Design
Bingyao Li
, ​Jieming Yin, Youtao Zhang, and Xulong Tang
In Proceedings of the 54th IEEE/ACM International Symposium on Microarchitecture

Publication (Full list)

News !

11/2024 One paper is accepted by HPCA 2025. Thanks to all collaborators!
07/2024 One paper is accepted by MICRO 2024. Thanks to all collaborators!
06/2024 Gave a talk on "Towards Efficient and Salable Computing for Multi-GPUs" at NVIDIA.
05/2024 Start intern at NVIDIA Architecture Research Group.
10/2023 One paper is accepted by HPCA 2024. Thanks to all collaborators!
07/2023 One paper is accepted by MICRO 2023. Thanks to all collaborators!
06/2023 Awarded the CS50 Outstanding Research Fellowship.
04/2023 Gave a talk on ​"Towards Efficient and Salable Computing for Multi-GPUs" at Tianjin University.
02/2023 One paper is accepted by DAC 2023. Thanks to all collaborators!
10/2022 One paper is accepted by HPCA 2023. Thanks to all collaborators!
04/2022 Awarded the CS50 Outstanding Research Fellowship.
07/2021 One paper is accepted by MICRO 2021. Thanks to all collaborators!
MICRO 2024 STAR: Sub-Entry Sharing-Aware TLB f​or Multi-Instance GPU
Bingyao Li , Yueqi Wang, Tianyu Wang, Lieven Eeckhout, Jun Yang, Aamer Jaleel, and Xulong Tang
In Proceedings of the 57th IEEE/ACM International Symposium on​ Microarchitecture
HPCA 2024 GRIT: Enhancing Multi-GPU Performance with Fine-Grained Dynamic Page Placement
Yueqi Wang*, Bingyao Li* , Aamer Jaleel, Jun Yang, and Xulong Tang  
(* The authors contribute equally)
The 30th IEEE International Symposium on High-Performance Computer Architecture
MICRO 2023 IDYLL: Enhancing Page Translation in Multi-GPUs via Light Weight PTE Invalidations
Bingyao Li , Yanan Guo, Yueqi Wang, Aamer Jaleel, Jun Yang, and Xulong Tang
In Proceedings of the 56th IEEE/ACM International Symposium on Microarchitecture
DAC 2023 Orchestrated Scheduling and Partitioning for Improved Address Translation in GPUs
Bingyao Li, Yueqi Wang, and Xulong Tang
The 60th Design Automation Conference
HPCA 2023 Trans-FW: Short Circuiting Page Table Walk in Multi-GPU Systems via Remote Forwarding
Bingyao Li
, Jieming Yin, Anup Holey, Youtao Zhang, Jun Yang, and Xulong Tang
The 29th IEEE International Symposium on High-Performance Computer Architecture
MICRO 2021 Improving Address Translation in Multi-GPUs via Sharing and Spilling aware TLB Design
Bingyao Li
, ​Jieming Yin, Youtao Zhang, and Xulong Tang
In Proceedings of the 54th IEEE/ACM International Symposium on Microarchitecture

Publication (Full list)

11/2024 One paper is accepted by HPCA 2025. Thanks to all collaborators!
07/2024 One paper is accepted by MICRO 2024. Thanks to all collaborators!
06/2024 Gave a talk on "Towards Efficient and Salable Computing for Multi-GPUs" at NVIDIA.
05/2024 Start intern at NVIDIA Architecture Research Group.
10/2023 One paper is accepted by HPCA 2024. Thanks to all collaborators!
07/2023 One paper is accepted by MICRO 2023. Thanks to all collaborators!
06/2023 Awarded the CS50 Outstanding Research Fellowship.
04/2023 Gave a talk on ​"Towards Efficient and Salable Computing for Multi-GPUs" at Tianjin University.
02/2023 One paper is accepted by DAC 2023. Thanks to all collaborators!
10/2022 One paper is accepted by HPCA 2023. Thanks to all collaborators!
04/2022 Awarded the CS50 Outstanding Research Fellowship.
07/2021 One paper is accepted by MICRO 2021. Thanks to all collaborators!

News !

MICRO
2024
STAR: Sub-Entry Sharing-Aware TLB f​or Multi-Instance GPU
Bingyao Li, Yueqi Wang, Tianyu Wang, Lieven Eeckhout, Jun Yang, Aamer Jaleel, and Xulong Tang
In Proceedings of the 57th IEEE/ACM International Symposium on​ Microarchitecture
HPCA
2024
GRIT: Enhancing Multi-GPU Performance with Fine-Grained Dynamic Page Placement
Yueqi Wang*, Bingyao Li*, Aamer Jaleel, Jun Yang, and Xulong Tang  
(* The authors contribute equally)
The 30th IEEE International Symposium on High-Performance Computer Architecture
Publication (Full list)
MICRO
2023
IDYLL: Enhancing Page Translation in Multi-GPUs via Light Weight PTE Invalidations
Bingyao Li, Yanan Guo, Yueqi Wang, Aamer Jaleel, Jun Yang, and Xulong Tang
In Proceedings of the 56th IEEE/ACM International Symposium on​ Microarchitecture
DAC
2023
Orchestrated Scheduling and Partitioning for Improved Address Translation in GPUs
Bingyao Li, Yueqi Wang, and Xulong Tang
The 60th Design Automation Conference
HPCA
2023
Trans-FW: Short Circuiting Page Table Walk in Multi-GPU Systems via Remote Forwarding
Bingyao Li, Jieming Yin, Anup Holey, Youtao Zhang, Jun Yang, and Xulong Tang
The 29th IEEE International Symposium on High-Performance Computer Architecture
MICRO
2021
Improving Address Translation in Multi-GPUs via Sharing and Spilling aware TLB Design
Bingyao Li, ​Jieming Yin, Youtao Zhang, and Xulong Tang
In Proceedings of the 54th IEEE/ACM International Symposium on Microarchitecture

News !

11/2024 One paper is accepted by HPCA 2025. Thanks to all collaborators!
07/2024 One paper is accepted by MICRO 2024. Thanks to all collaborators!
06/2024 Gave a talk on "Towards Efficient and Salable Computing for Multi-GPUs" at NVIDIA.
05/2024 Start intern at NVIDIA Architecture Research Group.
10/2023 One paper is accepted by HPCA 2024. Thanks to all collaborators!
07/2023 One paper is accepted by MICRO 2023. Thanks to all collaborators!
06/2023 Awarded the CS50 Outstanding Research Fellowship.
04/2023 Gave a talk on ​"Towards Efficient and Salable Computing for Multi-GPUs" at Tianjin University.
02/2023 One paper is accepted by DAC 2023. Thanks to all collaborators!
10/2022 One paper is accepted by HPCA 2023. Thanks to all collaborators!
04/2022 Awarded the CS50 Outstanding Research Fellowship.
07/2021 One paper is accepted by MICRO 2021. Thanks to all collaborators!

Publication (Full list)

MICRO 2024 STAR: Sub-Entry Sharing-Aware TLB f​or Multi-Instance GPU
Bingyao Li , Yueqi Wang, Tianyu Wang, Lieven Eeckhout, Jun Yang, Aamer Jaleel, and Xulong Tang
In Proceedings of the 57th IEEE/ACM International Symposium on​ Microarchitecture
HPCA 2024 GRIT: Enhancing Multi-GPU Performance with Fine-Grained Dynamic Page Placement
Yueqi Wang*, Bingyao Li* , Aamer Jaleel, Jun Yang, and Xulong Tang  
(* The authors contribute equally)
The 30th IEEE International Symposium on High-Performance Computer Architecture
MICRO 2023 IDYLL: Enhancing Page Translation in Multi-GPUs via Light Weight PTE Invalidations
Bingyao Li , Yanan Guo, Yueqi Wang, Aamer Jaleel, Jun Yang, and Xulong Tang
In Proceedings of the 56th IEEE/ACM International Symposium on Microarchitecture
DAC 2023 Orchestrated Scheduling and Partitioning for Improved Address Translation in GPUs
Bingyao Li, Yueqi Wang, and Xulong Tang
The 60th Design Automation Conference
HPCA 2023 Trans-FW: Short Circuiting Page Table Walk in Multi-GPU Systems via Remote Forwarding
Bingyao Li
, Jieming Yin, Anup Holey, Youtao Zhang, Jun Yang, and Xulong Tang
The 29th IEEE International Symposium on High-Performance Computer Architecture
MICRO 2021 Improving Address Translation in Multi-GPUs via Sharing and Spilling aware TLB Design
Bingyao Li
, ​Jieming Yin, Youtao Zhang, and Xulong Tang
In Proceedings of the 54th IEEE/ACM International Symposium on Microarchitecture