Title | Authors | Venue | Cited by | Year |
CSR5: An Efficient Storage Format for Cross-Platform Sparse Matrix-Vector Multiplication | W Liu, B Vinter | Proceedings of the 29th ACM International Conference on Supercomputing (ICS …, 2015 | 330 | 2015 |
An efficient GPU general sparse matrix-matrix multiplication for irregular data | W Liu, B Vinter | 2014 IEEE 28th International Parallel and Distributed Processing Symposium …, 2014 | 166 | 2014 |
A Framework for General Sparse Matrix-Matrix Multiplication on GPUs and Heterogeneous Processors | W Liu, B Vinter | Journal of Parallel and Distributed Computing 85, 47-61, 2015 | 114 | 2015 |
A Synchronization-Free Algorithm for Parallel Sparse Triangular Solves | W Liu, A Li, J Hogg, IS Duff, B Vinter | Euro-Par 2016: Parallel Processing, 2016 | 106 | 2016 |
Locality-Aware CTA Clustering for Modern GPUs | A Li, SL Song, W Liu, X Liu, A Kumar, H Corporaal | Proceedings of the 22nd ACM International Conference on Architectural …, 2017 | 90 | 2017 |
Fast Segmented Sort on GPUs | K Hou, W Liu, H Wang, W Feng | Proceedings of the 31st ACM International Conference on Supercomputing (ICS '17), 2017 | 70 | 2017 |
swSpTRSV: a fast sparse triangular solve with sparse level tile layout on sunway architectures | X Wang, W Liu, W Xue, L Wu | Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of …, 2018 | 68 | 2018 |
Speculative Segmented Sum for Sparse Matrix-Vector Multiplication on Heterogeneous Processors | W Liu, B Vinter | Parallel Computing 49, 179-193, 2015 | 65 | 2015 |
Exploring and Analyzing the Real Impact of Modern On-Package Memory on HPC Scientific Kernels | A Li, W Liu, MRB Kristensen, B Vinter, H Wang, K Hou, A Marquez, ... | International Conference for High Performance Computing, Networking, Storage …, 2017 | 57 | 2017 |
Unsupervised Behavior-Specific Dictionary Learning for Abnormal Event Detection | H Ren, W Liu, SI Olsen, S Escalera, TB Moeslund | British Machine Vision Conference (BMVC), 28.1-28.13, 2015 | 57 | 2015 |
IA-SpGEMM: An Input-aware Auto-tuning Framework for Parallel Sparse Matrix-Matrix Multiplication | Z Xie, G Tan, W Liu, N Sun | Proceedings of the 33rd ACM International Conference on Supercomputing (ICS '19), 2019 | 55 | 2019 |
Fast Synchronization-Free Algorithms for Parallel Sparse Triangular Solves with Multiple Right-Hand Sides | W Liu, A Li, J Hogg, IS Duff, B Vinter | Concurrency and Computation: Practice and Experience, 2017 | 55 | 2017 |
Parallel Transposition of Sparse Data Structures | H Wang, W Liu, K Hou, W Feng | Proceedings of the 30th ACM International Conference on Supercomputing (ICS '16), 2016 | 52 | 2016 |
TileSpMV: A Tiled Algorithm for Sparse Matrix-Vector Multiplication on GPUs | Y Niu, Z Lu, M Dong, Z Jin, W Liu, G Tan | 2021 IEEE 35th International Parallel and Distributed Processing Symposium …, 2021 | 39 | 2021 |
TileSpGEMM: A Tiled Algorithm for Parallel Sparse General Matrix-Matrix Multiplication on GPUs | Y Niu, Z Lu, H Ji, S Song, Z Jin, W Liu | 27th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel …, 2022 | 36 | 2022 |
Warp-Consolidation: A Novel Execution Model for GPUs | A Li, W Liu, L Wang, K Barker, SL Song | Proceedings of the 32nd ACM International Conference on Supercomputing (ICS …, 2018 | 33 | 2018 |
Parallel and Scalable Sparse Basic Linear Algebra Subprograms | W Liu | University of Copenhagen, 2015 | 28 | 2015 |
clMF: A Fine-Grained and Portable Alternating Least Squares Algorithm for Parallel Matrix Factorization | J Chen, J Fang, W Liu, T Tang, C Yang | Future Generation Computer Systems 108, 1192-1205, 2020 | 20 | 2020 |
Efficient and Portable ALS Matrix Factorization for Recommender Systems | J Chen, J Fang, W Liu, T Tang, X Chen, C Yang | Proceedings of the 6th International Workshop on Parallel and Distributed …, 2017 | 19 | 2017 |
Register-Aware Optimizations for Parallel Sparse Matrix-Matrix Multiplication | J Liu, X He, W Liu, G Tan | International Journal of Parallel Programming, 2018 | 18 | 2018 |