About Me
I’m Bin Gao (高彬) from China. I’m a final year PhD Student of School of Computing (SoC) at the National University of Singapore (NUS). I’m working with Prof. Weng-Fai Wong. Before this, I received my bachelor degree and master degree from Huazhong University of Science and Technology (HUST), China in 2017 and 2020, respectively.
I am actively looking for postdoctoral positions starting in mid-2025. Please feel free to reach out to me!
Research Interests
Computer Systems; Computer Architecture; LLM Systems; Edge Computing; Geo-distributed Data Analysis
Publications
(* indicates the corresponding author)
Conferences
- Bin Gao, Zhuomin He, Yizhen Yao, Zhanzhi Lou, Zhi Zhou, Weng-Fai Wong, “Online Context Caching for Distributed Large Language Models Serving”, in the Proceedings of IEEE International Conference on Computer Communications (INFOCOM), London, UK, 2025.
- Zhuomin He, Yizhen Yao, Pengfei Zuo, Bin Gao, Qinya Li, Zhenzhe Zheng, Fan Wu, “AdaSkip: Adaptive Sublayer Skipping for Accelerating Long-Context LLM Inference”, in the Proceedings of Annual AAAI Conference on Artificial Intelligence (AAAI), Philadelphia, Pennsylvania, USA, 2025.
- Bin Gao, Zhehui Wang, Zhuomin He, Tao Luo, Weng-Fai Wong, Zhi Zhou, “IMI: In-memory Multi-job Inference Acceleration for Large Language Models”, in the Proceedings of International Conference on Parallel Processing (ICPP), Gotland, Sweden, 2024.
- Bin Gao, Zhuomin He, Puru Sharma, Qingxuan Kang, Djordje Jevdjic, Junbo Deng, Xingkun Yang, Zhou Yu, Pengfei Zuo, “Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention”, in the Proceedings of USENIX Annual Technical Conference (ATC), Santa Clara, CA, US, 2024.
- Bin Gao, Qingxuan Kang, Hao-Wei Tee, Kyle Timothy Ng Chu, Alireza Sanaee, Djordje Jevdjic, “Scalable and Effective Page-table and TLB management on NUMA Systems”, in the Proceedings of USENIX Annual Technical Conference (ATC), Santa Clara, CA, US, 2024.
- Puru Sharma, Gary Goh Yipeng, Bin Gao, Longshen Ou, Dehui Lin, Deepak Sharma, Djordje Jevdjic, “DNA Storage Toolkit: A Modular End-to-End DNA Data Storage Codec and Simulator”, in the Proceedings of IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), Indianapolis, Indiana, 2024.
- Qingyuan Wang, Bin Gao*, Zhi Zhou, Fei Xu, Chenghao Ouyang, “DAG-Aware Optimization for Geo-Distributed Data Analytics ”, in the Proceedings of International Conference on Parallel Processing (ICPP), Salt Lake City, UTAH, USA, 2023.
- Guohao Ying, Xin He, Bin Gao, Bo Han, and Xiaowen Chu, “EAGAN: Efficient Two-stage Evolutionary Architecture Search for GANs”, in the Proceedings of European Conference on Computer Vision (ECCV), Tel Aviv, 2022
- Bin Gao, Zhi Zhou, Fangming Liu, Fei Xu, “Winning at the Starting Line: Joint Network Selection and Service Placement for Mobile Edge Computing”, in the Proceedings of IEEE International Conference on Computer Communications (INFOCOM), Paris, France, 2019.
Journals
- Bin Gao, Zhi Zhou, Fangming Liu, Fei Xu, Bo Li, “An Online Framework for Joint Network Selection and Service Placement in Mobile Edge Computing”, IEEE Transaction on Mobile Computing, 2021.
- Kongyang Zhao, Bin Gao, Zhi Zhou, “Cost-Effective Task Scheduling for Effective Task Scheduling for Collaborative Cross Collaborative Cross-Edge Analytics”, ZTE Communications, 2021.
Workshops and Posters
- Yunkai Liang, Bin Gao, Pengfei Zuo, Zhi Zhou, Xu Chen, “PipeDecode: Efficient LLM Inference using Pipelines within Decoding”, in the Proceedings of USENIX Symposium on Operating Systems Design and Implementation (OSDI), Santa Clara, CA, US, 2024.
- Bin Gao, Hao-Wei Tee, Alireza Sanaee, Soh Boon Jun, Djordje Jevdjic, “OS-level Implications of Using DRAM Caches in Memory Disaggregation”, in the Proceedings of IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2022.
- Bin Gao, Hejing Li, Jialin Li, and Antoine Kaufmann, “Improving Disaggregated System Evaluation with Modular End-to-End Simulation”, in Workshop On Resource Disaggregation and Serverless Computing (WORDS), 2022.
Teaching
Service
- NUS SoC Faculty Recruitment Student Committee.
- EuroSys 2025, shadow committee.
- SOSP 2021 Student Volunteer.
- EuroSys 2022 AE Committee.
- OSDI/ATC 2022 AE Committee.
- Reviewer for IEEE Transaction on Mobile Computing.
- Reviewer for IEEE Transactions on Services Computing.
Awards
- Research Achievement Award, National University of Singapore, 2025.
- USENIX ATC Student Grant, 2024.
- Silver Prize, “Internet+” Innovation Entrepreneurship Competition, Ministry of Education of the People’s Republic of China, 2022. Click for news.
- Excellent Graduate, Huazhong University of Science and Technology, 2020.
- National Scholarship, Ministry of Education of the People’s Republic of China, 2019.
- Merit Student, Huazhong University of Science and Technology, 2019.
- Huawei Scholarship, Huawei Technologies Co., Ltd., 2018.
- Excellent Student Cadre, Huazhong University of Science and Technology, 2018.
- First-class Academic Scholarship, Huazhong University of Science and Technology, 2017-2020.
- Excellent Graduate, Huazhong University of Science and Technology, 2017.
- National Encouragement Scholarship, Ministry of Education of the People’s Republic of China, 2016.