Hi! I am an Assistant Professor in the Department of Computer Science and Engineering, University of California, Riverside. I received my Ph.D. in Computer Science from the University of Pittsburgh, where I was advised by Dr. Xulong Tang. I earned my B.E. and M.S. degrees from the College of Intelligence and Computing at Tianjin University in 2017 and 2020, respectively.

My research spans advanced computer architectures, high-performance computing, and emerging parallel applications, with a particular emphasis on the GPU ecosystem from architecture to application. I design architectures and system features for next-generation GPU platforms, enabling and supporting emerging large-scale applications on both single- and multi-GPU platforms. I am also actively working on building high-performance LLM infrastructure and systems.

Prospective Students: I am always looking for self-motivated Ph.D. students. Feel free to contact me at bingyaol@ucr.edu with CV and transcript.

Updates
  • 03/2026 One paper is accepted by ISCA 2026!
  • 11/2024 One paper is accepted by HPCA 2025. Thanks to all collaborators!
  • 07/2024 One paper is accepted by MICRO 2024. Thanks to all collaborators!
  • 06/2024 Gave a talk on "Towards Efficient and Salable Computing for Multi-GPUs" at NVIDIA.
  • 05/2024 Start intern at NVIDIA Architecture Research Group.
  • 10/2023 One paper is accepted by HPCA 2024. Thanks to all collaborators!
  • 07/2023 One paper is accepted by MICRO 2023. Thanks to all collaborators!
  • 06/2023 Awarded the CS50 Outstanding Research Fellowship.
  • 04/2023 Gave a talk on "Towards Efficient and Salable Computing for Multi-GPUs" at Tianjin University.
  • 02/2023 One paper is accepted by DAC 2023. Thanks to all collaborators!
  • 10/2022 One paper is accepted by HPCA 2023. Thanks to all collaborators!
  • 04/2022 Awarded the CS50 Outstanding Research Fellowship.
  • 07/2021 One paper is accepted by MICRO 2021. Thanks to all collaborators!
Selected Publications (view all )
ConServe: Contiguity-Preserving Memory Management for Multi-Turn LLM Serving
Bingyao Li
ISCA 2026. The 53rd IEEE/ACM International Symposium on Computer Architecture
OASIS: Object-Aware Page Management for Multi-GPU Systems
Yueqi Wang, Bingyao Li, Mohamed Tarek Ibn Ziad, Lieven Eeckhout, Jun Yang, Aamer Jaleel, and Xulong Tang
HPCA 2025. The 31th IEEE International Symposium on High-Performance Computer Architecture
Bingyao Li, Yueqi Wang, Tianyu Wang, Lieven Eeckhout, Jun Yang, Aamer Jaleel, and Xulong Tang
MICRO 2024. In Proceedings of the 57th IEEE/ACM International Symposium on Microarchitecture
Yueqi Wang*, Bingyao Li*, Aamer Jaleel, Jun Yang, and Xulong Tang (*The authors contribute equally)
HPCA 2024. The 30th IEEE International Symposium on High-Performance Computer Architecture
Bingyao Li, Yanan Guo, Yueqi Wang, Aamer Jaleel, Jun Yang, and Xulong Tang
MICRO 2023. In Proceedings of the 56th IEEE/ACM International Symposium on Microarchitecture
Bingyao Li, Yueqi Wang, and Xulong Tang
DAC 2023. The 60th Design Automation Conference
Bingyao Li, Jieming Yin, Anup Holey, Youtao Zhang, Jun Yang, and Xulong Tang
HPCA 2023. The 29th IEEE International Symposium on High-Performance Computer Architecture
Bingyao Li, Jieming Yin, Youtao Zhang, and Xulong Tang
MICRO 2021. In Proceedings of the 54th IEEE/ACM International Symposium on Microarchitecture