Follow
Xiaonan Tian
Xiaonan Tian
Compiler Engineer@NVIDIA
Verified email at uh.edu - Homepage
Title
Cited by
Cited by
Year
Compiling a High-level Directive-Based Programming Model for GPGPUs
X Tian, R Xu, Y Yan, Z Yun, S Chandrasekaran, B Chapman
The 26th International Workshop on Languages and Compilers for Parallel …, 2013
592013
Nas parallel benchmarks for gpgpus using a directive-based programming model
R Xu, X Tian, S Chandrasekaran, Y Yan, B Chapman
Languages and Compilers for Parallel Computing: 27th International Workshop …, 2015
382015
Compiler transformation of nested loops for general purpose GPUs
X Tian, R Xu, Y Yan, S Chandrasekaran, D Eachempati, B Chapman
Concurrency and Computation: Practice and Experience 28 (2), 537-556, 2016
162016
Multi‐GPU support on single node using directive‐based programming model
R Xu, X Tian, S Chandrasekaran, B Chapman
Scientific Programming 2015 (1), 621730, 2015
162015
The OpenACC data model: Preliminary study on its major challenges and implementations
M Wolfe, S Lee, J Kim, X Tian, R Xu, B Chapman, S Chandrasekaran
Parallel Computing 78, 15-27, 2018
102018
Implementing the OpenACC data model
M Wolfe, S Lee, J Kim, X Tian, R Xu, S Chandrasekaran, B Chapman
2017 IEEE International Parallel and Distributed Processing Symposium …, 2017
102017
OpenACC Parallelization and optimization of NAS parallel benchmarks
R Xu, X Tian, S Chandrasekaran, Y Yan, B Chapman
Proc. GPU Technol. Conf, 1-27, 2014
92014
Optimizing GPU Register Usage: Extensions to OpenACC and Compiler Optimizations
X Tian, D Khaldi, D Eachempati, R Xu, B Chapman
2016 45th International Conference on Parallel Processing (ICPP), 572 - 581, 2016
82016
OpenUH: open source OpenACC compiler
X Tian, R Xu, B Chapman
GTC2014, HPCTools Group Computer Science Department University of Houston, 2014
72014
Reduction operations in parallel loops for GPGPUs
R Xu, X Tian, Y Yan, S Chandrasekaran, B Chapman
Proceedings of Programming Models and Applications on Multicores and …, 2014
72014
Performance and power characteristics of matrix multiplication algorithms on multicore and shared memory machines
Y Yan, J Kemp, X Tian, AM Malik, B Chapman
2012 SC Companion: High Performance Computing, Networking Storage and …, 2012
72012
Assessing one-to-one parallelism levels mapping for openmp offloading to gpus
C Shen, X Tian, D Khaldi, B Chapman
Proceedings of the 8th International Workshop on Programming Models and …, 2017
62017
An analytical model-based auto-tuning framework for locality-aware loop scheduling
R Xu, S Chandrasekaran, X Tian, B Chapman
High Performance Computing: 31st International Conference, ISC High …, 2016
62016
Acceleration of bulk memory operations in a heterogeneous multicore architecture
JH Lee, Z Liu, X Tian, DH Woo, W Shi, D Boumber, Y Yan, KA Kwon
Proceedings of the 21st international conference on Parallel architectures …, 2012
32012
A Compiler Optimization Framework for Directive-Based GPU Computing
X Tian
2016
The system can't perform the operation now. Try again later.
Articles 1–15