Grus: Toward unified-memory-efficient high-performance graph processing on gpu P Wang, J Wang, C Li, J Wang, H Zhu, M Guo ACM Transactions on Architecture and Code Optimization (TACO) 18 (2), 1-25, 2021 | 32 | 2021 |
Excavating the potential of GPU for accelerating graph traversal P Wang, L Zhang, C Li, M Guo 2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2019 | 18 | 2019 |
Tapping into nfv environment for opportunistic serverless edge function deployment L Zhang, W Feng, C Li, X Hou, P Wang, J Wang, M Guo IEEE Transactions on Computers 71 (10), 2698-2704, 2021 | 16 | 2021 |
Skywalker: Efficient alias-method-based graph sampling and random walk on gpus P Wang, C Li, J Wang, T Wang, L Zhang, J Leng, Q Chen, M Guo 2021 30th International Conference on Parallel Architectures and Compilation …, 2021 | 16 | 2021 |
Oversubscribing gpu unified virtual memory: Implications and suggestions C Shao, J Guo, P Wang, J Wang, C Li, M Guo Proceedings of the 2022 ACM/SPEC on International Conference on Performance …, 2022 | 14 | 2022 |
Characterizing and orchestrating NFV-ready servers for efficient edge data processing L Zhang, C Li, P Wang, Y Liu, Y Hu, Q Chen, M Guo Proceedings of the international symposium on quality of service, 1-10, 2019 | 14 | 2019 |
Performance of training sparse deep neural networks on GPUs J Wang, Z Huang, L Kong, J Xiao, P Wang, L Zhang, C Li 2019 IEEE High Performance Extreme Computing Conference (HPEC), 1-5, 2019 | 11 | 2019 |
PSL: exploiting parallelism, sparsity and locality to accelerate matrix factorization on x86 platforms W Deng, P Wang, J Wang, C Li, M Guo International Symposium on Benchmarking, Measuring and Optimization, 101-109, 2019 | 10 | 2019 |
Excavating the potential of graph workload on rdma-based far memory architecture J Wang, C Li, T Wang, L Zhang, P Wang, J Mei, M Guo 2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2022 | 9 | 2022 |
ACE-GCN: A Fast data-driven FPGA accelerator for GCN embedding J Romero Hung, C Li, P Wang, C Shao, J Guo, J Wang, G Shi ACM Transactions on Reconfigurable Technology and Systems (TRETS) 14 (4), 1-23, 2021 | 8 | 2021 |
Memory system optimization for graph processing: A survey J Wang, L Zhang, PY Wang, J XU, C LI, H ZHU, X QIAN, M GUO Scientia Sinica Informationis 49 (3), 295-313, 2019 | 5 | 2019 |
Dragon: dynamic recurrent accelerator for graph online convolution J Romero Hung, C Li, T Wang, J Guo, P Wang, C Shao, J Wang, G Shi, ... ACM Transactions on Design Automation of Electronic Systems 28 (1), 1-27, 2023 | 4 | 2023 |
Fargraph+: Excavating the parallelism of graph processing workload on RDMA-based far memory system J Wang, C Li, Y Liu, T Wang, J Mei, L Zhang, P Wang, M Guo Journal of Parallel and Distributed Computing 177, 144-159, 2023 | 3 | 2023 |
Optimizing GPU-Based Graph Sampling and Random Walk for Efficiency and Scalability P Wang, C Xu, C Li, J Wang, T Wang, L Zhang, X Hou, M Guo IEEE Transactions on Computers 72 (9), 2508-2521, 2023 | 2 | 2023 |
面向图计算的内存系统优化技术综述 王靖, 张路, 王鹏宇, 徐嘉鸿, 李超, 朱浩瑾, 钱学海, 过敏意 中国科学: 信息科学 49 (3), 295-313, 2019 | 2 | 2019 |
HyFarM: Task Orchestration on Hybrid Far Memory for High Performance Per Bit J Wang, C Li, J Mei, H He, T Wang, P Wang, L Zhang, M Guo, H Wu, ... 2022 IEEE 40th International Conference on Computer Design (ICCD), 33-41, 2022 | 1 | 2022 |
Graph sampling and random walk acceleration method and system on GPU C Li, P Wang, J Wang, H Zhu, M Guo US Patent 11,875,426, 2024 | | 2024 |
Adaptive unified memory management method and system for large-scale graphs C Li, P Wang, S Chuanming, J Wang, J Guo, H Zhu, M Guo US Patent App. 18/040,905, 2023 | | 2023 |
High-Throughput GPU Random Walk with Fine-Tuned Concurrent Query Processing C Xu, C Li, P Wang, X Hou, J Wang, S Sun, M Guo, H Wu, D Chen, X Liu Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and …, 2023 | | 2023 |
GreenCom 2023 N Arshad, M Brocanelli, Q Chen, T Hoefler, X Hou, X Hou, Y Jiang, C Li, ... | | |