Accelerated Auto-Tuning of GPU Kernels for Tensor Computations
Chendi Li*, Yufan Xu*, Sina Mahdipour Saravani, P. Sadayappan
ICS 2024
CoNST: Code Generator for Sparse Tensor Networks
Saurabh Raje, Yufan Xu, Atanas Rountev, Edward F. Valeev, P. Sadayappan
TACO 2024
PEAK: Generating High-Performance Schedules in MLIR
Amir Tavakkoli*, Sameeran Joshi*, Shreya Singh, Yufan Xu, P. Sadayappan, Marry Hall
LCPC 2023
Effective Performance Modeling and Domain-Specific Compiler Optimization of CNNs for GPU
Yufan Xu, Qiwei Yuan, Erik Curtis Barton, Rui Li, P. Sadayappan, Aravind Sukumaran-Rajam
PACT 2022
Training of Deep Learning Pipelines on Memory-Constrained GPUs via Segmented Fused-Tiled Execution
Yufan Xu, Saurabh Raje, Atanas Rountev, Gerald Sabin, Aravind Sukumaran-Rajam, P. Sadayappan
CC 2022
Efficient Distributed Algorithms for Convolutional Neural Networks
Rui Li, Yufan Xu, Aravind Sukumaran-Rajam, Atanas Rountev, P. Sadayappan
SPAA 2021
Analytical characterization and design space exploration for optimization of CNNs
Rui Li, Yufan Xu, Aravind Sukumaran-Rajam, Atanas Rountev, P. Sadayappan
ASPLOS 2021
Dependence-Aware, Unbounded Sound Predictive Race Detection
Kaan Genç, Jake Roemer, Yufan Xu, Michael D. Bond
OOPSLA 2019
Software Engineer II, Programming Systems
PhD Software Engineer Intern (Summer 2023)
Compiler Intern (Summer 2022)
Research Assistant (Fall 2019-May 2024)
Teaching Assistant (Fall 2017-Spring 2019)
Software Engineer Intern (Summer 2019)
Application Engineer Intern (Summer 2014)