蔡振坤
AWS Shanghai AI Lab
Email: zkcai [at] gmail.com
I received my Ph.D. degree from CUHK in 2022, advised by Prof. James Cheng. Before that, I obtained my B.Eng. degree from SCUT in 2017. My research interests cover the area of large-scale machine learning frameworks and GNN systems.
Currently, I am in the job market.
I joined AWS Shanghai AI lab as an applied scientist in 2022, working closely with Dr. Minjie Wang and Prof. Zheng Zhang.
gSampler: General and Efficient GPU-based Graph Sampling for Graph Learning. SOSP 2023
Ping Gong, Renjie Liu, Zunyao Mao, Zhenkun Cai, Xiao Yan, Cheng Li, Minjie Wang, Zhuozhao Li
FEC: Efficient Deep Recommendation Model Training with Flexible Embedding Communication. SIGMOD 2023
Kaihao Ma, Xiao Yan, Zhenkun Cai, Yuzhen Huang, Yidi Wu, James Cheng
DGI: Easy and Efficient Inference for GNNs. KDD 2023
Peiqi Yin, Xiao Yan, Jinjing Zhou, Qiang Fu, Zhenkun Cai, James Cheng, Bo Tang, Minjie Wang
DSP: Efficient GNN training with multiple GPUs. PPoPP 2023
Zhenkun Cai*, Qihui Zhou*, Xiao Yan, Da Zheng, Xiang Song, Chenguang Zheng, James Cheng, George Karypis
TensorOpt: Exploring the Tradeoffs in Distributed DNN Training with Auto-Parallelism. TPDS 2022
Zhenkun Cai, Kaihao Ma, Xiao Yan, Yidi Wu, Yuzhen Huang, James Cheng, Teng Su, Fan Yu
DGCL: An Efficient Communication Library for Distributed GNN Training. Eurosys 2021
Zhenkun Cai, Xiao Yan, Yidi Wu, Kaihao Ma, James Cheng, Fan Yu
Seastar: Vertex-Centric Programming for Graph Neural Networks. Eurosys 2021
Yidi Wu, Kaihao Ma, Zhenkun Cai, Tatiana Jin, Boyang Li, Chenguang Zheng, James Cheng, Fan Yu
Elastic Deep Learning in Multi-Tenant GPU Clusters. TPDS 2021
Yidi Wu, Kaihao Ma, Xiao Yan, Zhi Liu, Zhenkun Cai, Yuzhen Huang, James Cheng, Han Yuan, Fan Yu
Improving Resource Utilization by Timely Fine-Grained Scheduling. Eurosys 2020
Tatiana Jin, Zhenkun Cai, Boyang Li, Chenguang Zheng, Guanxian Jiang, James Cheng
FlexPS: Flexible Parallelism Control in Parameter Server Architecture. VLDB 2018
Yuzhen Huang, Tatiana Jin, Yidi Wu, Zhenkun Cai, Xiao Yan, Yuying Guo, Fan Yang, Jinfeng Li, James Cheng
Scalable De Novo Genome Assembly Using Pregel. ICDE 2018
Da Yan, Hongzhi Chen, Zhenkun Cai, James Cheng, Bin Shao
TensorOpt: Training Large-scale DNNs with Auto-parallell
EDL: An Elastic Deep Learning System on GPUs
FlexPS: A Parameter Server with Flexible Parallelism Control
DGCL: A Distributed Graph Communication Library for GNN systems
Seastar: A Vertex-centric GNN System
PPS: Fair And Efficient Scheduling for Multi-Tenant GPU Clusters
Ursa: A Framework for both Resource Scheduling and Execution for OLAP Jobs