蔡振坤
Amazon Web Service
Email: zekucai@gmail.com
I am an Applied Scientist at AWS, working on large-scale machine learning infrastructure. I joined AWS in 2022, starting in the Shanghai office, where I worked on Graph Neural Network (GNN) systems. Later, I moved to Santa Clara, AWS US, where I now focus on LLM infrastructure. Before joining AWS, I earned my PhD from The Chinese University of Hong Kong (CUHK), advised by Prof. James Cheng.
gSampler: General and Efficient GPU-based Graph Sampling for Graph Learning. SOSP 2023
Ping Gong, Renjie Liu, Zunyao Mao, Zhenkun Cai, Xiao Yan, Cheng Li, Minjie Wang, Zhuozhao Li
FEC: Efficient Deep Recommendation Model Training with Flexible Embedding Communication. SIGMOD 2023
Kaihao Ma, Xiao Yan, Zhenkun Cai, Yuzhen Huang, Yidi Wu, James Cheng
DGI: Easy and Efficient Inference for GNNs. KDD 2023
Peiqi Yin, Xiao Yan, Jinjing Zhou, Qiang Fu, Zhenkun Cai, James Cheng, Bo Tang, Minjie Wang
DSP: Efficient GNN training with multiple GPUs. PPoPP 2023
Zhenkun Cai*, Qihui Zhou*, Xiao Yan, Da Zheng, Xiang Song, Chenguang Zheng, James Cheng, George Karypis
TensorOpt: Exploring the Tradeoffs in Distributed DNN Training with Auto-Parallelism. TPDS 2022
Zhenkun Cai, Kaihao Ma, Xiao Yan, Yidi Wu, Yuzhen Huang, James Cheng, Teng Su, Fan Yu
DGCL: An Efficient Communication Library for Distributed GNN Training. Eurosys 2021
Zhenkun Cai, Xiao Yan, Yidi Wu, Kaihao Ma, James Cheng, Fan Yu
Seastar: Vertex-Centric Programming for Graph Neural Networks. Eurosys 2021
Yidi Wu, Kaihao Ma, Zhenkun Cai, Tatiana Jin, Boyang Li, Chenguang Zheng, James Cheng, Fan Yu
Elastic Deep Learning in Multi-Tenant GPU Clusters. TPDS 2021
Yidi Wu, Kaihao Ma, Xiao Yan, Zhi Liu, Zhenkun Cai, Yuzhen Huang, James Cheng, Han Yuan, Fan Yu
Improving Resource Utilization by Timely Fine-Grained Scheduling. Eurosys 2020
Tatiana Jin, Zhenkun Cai, Boyang Li, Chenguang Zheng, Guanxian Jiang, James Cheng
FlexPS: Flexible Parallelism Control in Parameter Server Architecture. VLDB 2018
Yuzhen Huang, Tatiana Jin, Yidi Wu, Zhenkun Cai, Xiao Yan, Yuying Guo, Fan Yang, Jinfeng Li, James Cheng
Scalable De Novo Genome Assembly Using Pregel. ICDE 2018
Da Yan, Hongzhi Chen, Zhenkun Cai, James Cheng, Bin Shao
TensorOpt: Training Large-scale DNNs with Auto-parallell
EDL: An Elastic Deep Learning System on GPUs
FlexPS: A Parameter Server with Flexible Parallelism Control
DGCL: A Distributed Graph Communication Library for GNN systems
Seastar: A Vertex-centric GNN System
PPS: Fair And Efficient Scheduling for Multi-Tenant GPU Clusters
Ursa: A Framework for both Resource Scheduling and Execution for OLAP Jobs