Locality analysis through static parallel sampling | D Chen, F Liu, C Ding, S Pai | ACM SIGPLAN Notices 53 (4), 557-570, 2018 | 28 | 2018 |
LD: low-overhead GPU race detection without access monitoring | P Li, X Hu, D Chen, J Brock, H Luo, EZ Zhang, C Ding | ACM Transactions on Architecture and Code Optimization (TACO) 14 (1), 1-25, 2017 | 23 | 2017 |
Cache exclusivity and sharing: Theory and optimization | C Ye, C Ding, H Luo, J Brock, D Chen, H Jin | ACM Transactions on Architecture and Code Optimization (TACO) 14 (4), 1-26, 2017 | 14 | 2017 |
Write locality and optimization for persistent memory | D Chen, C Ye, C Ding | Proceedings of the Second International Symposium on Memory Systems, 77-87, 2016 | 11 | 2016 |
Automated transformation of GPU-specific OpenCL kernels targeting performance portability on multi-core/many-core CPUs | D Huang, M Wen, C Xun, D Chen, X Cai, Y Qiao, N Wu, C Zhang | European Conference on Parallel Processing, 210-221, 2014 | 11 | 2014 |
CLAM: Compiler lease of cache memory | I Prechtl, B Reber, C Ding, D Patru, D Chen | Proceedings of the International Symposium on Memory Systems, 281-296, 2020 | 8 | 2020 |
Scalable parallel motion estimation on muti-GPU system | D Chen, HY Su, W Mei, LX Wang, CY Zhang | Applied Mechanics and Materials 347, 3708-3714, 2013 | 6 | 2013 |
CARL: Compiler assigned reference leasing | C Ding, D Chen, F Liu, B Reber, W Smith | ACM Transactions on Architecture and Code Optimization (TACO) 19 (1), 1-28, 2022 | 5 | 2022 |
Uniform lease vs. LRU cache: Analysis and evaluation | D Chen, C Ding, F Liu, B Reber, W Smith, P Li | Proceedings of the 2021 ACM SIGPLAN International Symposium on Memory …, 2021 | 5 | 2021 |
Improving performance portability for GPU-specific OpenCL kernels on multi-core/many-core CPUs by analysis-based transformations | M Wen, D Huang, C Xun, D Chen | Frontiers of Information Technology & Electronic Engineering 16, 899-916, 2015 | 4 | 2015 |
Poster: Static reuse time analysis using dependence distance | D Chen, F Liu, C Ding, C Lim | International Workshop on Languages and Compilers for Parallel Computing …, 2017 | 3 | 2017 |
Automatic mapping single-device OpenCL program to heterogeneous multi-device platform | D Chen, C Xun, D Huang, M Wen, C Zhang | 2013 IEEE 10th International Conference on High Performance Computing and …, 2013 | 3 | 2013 |
CodeR: Issue Resolving with Multi-Agent and Task Graphs | D Chen, S Lin, M Zeng, D Zan, JG Wang, A Cheshkov, J Sun, H Yu, ... | arXiv preprint arXiv:2406.01304, 2024 | 2 | 2024 |
CodeS: Natural Language to Code Repository via Multi-Layer Sketch | D Zan, A Yu, W Liu, D Chen, B Shen, W Li, Y Yao, Y Gong, X Chen, ... | arXiv preprint arXiv:2403.16443, 2024 | 2 | 2024 |
Efficient fine-grained shared buffer management for multiple OpenCL devices | C Xun, D Chen, Q Lan, C Zhang | Journal of Zhejiang University Science C 14 (11), 859-872, 2013 | 2 | 2013 |
CLAM: Compiler leasing of accelerator memory | D Chen, C Ding, D Patru | Languages and Compilers for Parallel Computing: 32nd International Workshop …, 2021 | 1 | 2021 |
A Lightweight Framework for Adaptive Retrieval In Code Completion With Critique Model | W Zhang, T Fu, T Yuan, G Zhang, D Chen, J Wang | arXiv preprint arXiv:2406.10263, 2024 | | 2024 |
VIDGCN: Embracing input data diversity with a configurable graph convolutional network accelerator | H Ming, T Pan, D Chen, C Ye, H Liu, L Tang, X Liao, H Jin | Journal of Systems Architecture 141, 102924, 2023 | | 2023 |
PLUM: static parallel program locality analysis under uniform multiplexing | F Liu, D Chen, W Smith, C Ding | Proceedings of the 25th ACM SIGPLAN Symposium on Principles and Practice of …, 2020 | | 2020 |
Statistical caching for near memory management | D Chen, F Liu, M Jiao, C Ding, S Pai | Proceedings of the International Symposium on Memory Systems, 411-416, 2019 | | 2019 |