Publications


“Using GPUs to Compute Large Out-of-card FFTs”, Liang Gu, Jakob Siegel and Xiaoming Li. 25th International Conference on Supercomputing (ICS 2011), Tucson, Arizona, USA, June, 2011. (accepted)

“Soft Error Propagation in Floating-Point Programs”, Sha Li and Xiaoming Li. Proceedings of International Performance Computing and Communications Conference (IPCCC 2010), Albuquerque, New Mexico, USA, December, 2010.

“Efficient Sparse Matrix-Matrix Multiplication on Heterogeneous High Performance Systems”, Jakob Siegel, Oreste Villa, Sriram Krishnamoorthy, Antonino Tumeo and Xiaoming Li. Pro- ceedings of The Workshop on Application/Architecture Co-design for Extreme-scale Computing (AACEC) in conjunction with the IEEE International Conference on Cluster Computing 2010 (Cluster 2010). Crete Greece, September 2010.

“A Micro-benchmark Suite for AMD GPUs”, Ryan Taylor and Xiaoming Li. Proceedings of the Third International Workshop on Parallel Programming Models and Systems Software for High- End Computing (P2S2) in conjunctioin with The 39th International Conference on Parallel Processing (ICPP'10), San Diego, CA, September 2010.

“Software-based predication for AMD GPUs”, Ryan Taylor, Xiaoming Li. Proceedings of Inter- national Workshop on Highly-Efficientcient Accelerators and Recon gurable Technologies (HEART) in conjunction with The 24th International Conference on Supercomputing (ICS'10), Tsukuba, Japan, June, 2010.

“An Empirically Tuned 2D and 3D FFT Library on CUDA GPU”, Liang Gu, Xiaoming Li and Jakob Siegel. Proceedings of International Conference on Supercomputing (ICS 2010). Tsukuba, Japan. June, 2010.

“Context-aware Code Optimization”, Murat Bolat and Xiaoming Li. Proceedings of Interna- tional Performance Computing and Communications Conference (IPCCC 2009), Phoenix, Arizona, USA, December, 2009.

“Iterative Layer-Based Raytracing on CUDA”, Alejandro Segovia, Xiaoming Li and Guang Gao. Proceedings of International Performance Computing and Communications Conference (IPCCC 2009), Phoenix, Arizona, USA, December, 2009.

“DFT Performance Prediction in FFTW”, Liang Gu and Xiaoming Li. Proceedings of Lan- guages and Compilers for Parallel Computing, 22nd International Workshop, (LCPC 2009), Newark, Delaware, USA, October, 2009.

“CUDA Memory Optimizations for Large Data-Structures in the Gravit Simulator”, Jakob Siegel, Juergen Ributzka and Xiaoming Li. International Workshop on Simulation and Mod- elling. Proceedings of The 38th International Conference on Parallel Processing (ICPP) 2009, September 2009.

“An Empirically Optimized Radix Sort for GPU”, Bonan Huang, Jinlan Gao and Xiaoming Li. Proceedings of the IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA) 2009, August, 2009.

“A Model-driven Optimization for FFTW”, Liang Gu and Xiaoming Li. Poster. Proceedings of the 23rd International Conference on Supercomputing (ICS) 2009, June, 2009.

“A Control-structure Splitting Optimization for GPGPU”, Snaider Carillo, Jakob Siegel and Xiaoming Li, Proceedings of ACM International Conference on Computing Frontier (CF) 2009, March, 2009.

“Dynamic Optimization Option Search in GCC”, Eunjung Park, Mihailo Kaplarevic, Yingping Zhang, Xiaoming Li and Guang R. Gao, GCC Developers’ Summit, July, 2007.

“Automatic Program Segment Similarity Detection in Targeted Program Performance Improvement”, Haiping Wu, Eunjung Park, Mihailo Kaplarevic, Yingping Zhang, Murat Bolat, Xiaoming Li, Guang R. Gao, Workshop on Performance Optimization for High-Level Languages and Libraries, in conjunction with 21st IEEE International Parallel & Distributed Processing Symposium (IPDPS). March 2007.

“Experience of Optimizing FFT on Intel Architectures”, Daniel Orozco, Liping Xue, Murat Bolat, Xiaoming Li, Guang R. Gao. Workshop on Performance Optimization for High-Level Languages and Libraries, in conjunction with 21st IEEE International Parallel & Distributed Processing Symposium (IPDPS). March 2007.

"Analyzing the Use of a Software Modeling Tool". Xiaoming Li, Daryl Shannon, Jabari Walker, Sarfraz Khurshid, Darko Marinov. The Sixth Workshop on Language Descriptions, Tools and Applications (LDTA 2006). April 2006.

"Optimizing Sorting with Genetic Algorithm''. Xiaoming Li, María Jesús Garzarán, and David Padua.  In Proc. of the 3rd International Symposium on Code Generation and Optimization (CGO-2005), pages 99-110, San Jose, CA, USA, 2005.

"Is Search Really Necessary to Generate High-Performance BLAS?''. Kamen Yotov, Xiaoming Li, Gang Ren, Maria Garzaran, David Padua, Keshav Pingali and Paul Stodghill. Proceedings of the IEEE Special Issue on Program Generation, Optimization, and Platform Adaptation, Vol. 93, No. 2, pages 358-386, February, 2005.

"Optimizing Sorting with Genetic Algorithm''. Xiaoming Li, María Jesús Garzarán, and David Padua. The 12th International Workshop on Compilers for Parallel Computers (CPC 2006), A Coruna, Spain, January, 2006. (Invited paper).

"Optimizing Matrix Multiplication with a Classifier Learning System''.Xiaoming Li and María Jesús Garzarán. Languages and Compilers for Parallel Computing, 16th International Workshop, (LCPC 2005), New York, NY, USA, 2005.

"Analytic Models and Empirical Search: A Hybrid Approach to Code Optimization''. Arkady Epshteyn, María Jesús Garzarán, Gerald DeJong, David Padua, Gang Ren, Xiaoming Li, Kamen Yotov and Keshav Pingali. Languages and Compilers for Parallel Computing, 16th International Workshop, (LCPC 2005), New York, NY, USA, 2005.

"A Dynamically Tuned Sorting Library''. Xiaoming Li, María Jesús Garzarán, and David Padua. In Proc. of the International Symposium on Code Generation and Optimization (CGO-2004), pages 111-124, March 2004.

"A Comparison of Empirical and Model-driven Optimization''. Kamen Yotov, Xiaoming Li, Gang Ren, Michael Cibulskis, Gerald DeJong, María Jesús Garzarán, David Padua, Keshav Pingali, Paul Stodghill, and Peng Wu. In Proc. of the International Conference on Programming Language Design and Implementation (PLDI 2003), pages 63-76, June 2003.

"Data Dependence Analysis In Presence Of Inheritance and Polymorphism''. Xiaoming Li, Daoxu Chen, Li Xie. Proceedings of HPC-Asia2000, Vol. 1, pages 220-228, IEEE Computer Society Press, Beijing, May 2000.

"The Design and Implementation of The Scheduling Protocol in JAPS''.Xiaoming Li, Daoxu Chen, Li Xie. Journal of Computer Science (Chinese), Vol. 28, No. 1, January 2001.

"PTSP: The Parallel Task Support Platform in JAPS''. Xiaoming Li, Daoxu Chen, Li Xie. Journal of Computer Science (Chinese), Vol. 27, No. 7, pages 5-8, July 2000.