Measuring and relieving the over-smoothing problem for graph neural networks from the topological view D Chen, Y Lin, W Li, P Li, J Zhou, X Sun Proceedings of the AAAI conference on artificial intelligence 34 (04), 3438-3445, 2020 | 1002 | 2020 |
Modeling the stock relation with graph network for overnight stock movement prediction W Li, R Bao, K Harimoto, D Chen, J Xu, Q Su Proceedings of the twenty-ninth international conference on international …, 2021 | 156 | 2021 |
Topology-imbalance learning for semi-supervised node classification D Chen, Y Lin, G Zhao, X Ren, P Li, J Zhou, X Sun Advances in Neural Information Processing Systems 34, 29885-29897, 2021 | 74 | 2021 |
DeepSeek LLM: Scaling open-source language models with longtermism X Bi, D Chen, G Chen, S Chen, D Dai, C Deng, H Ding, K Dong, Q Du, ... arXiv preprint arXiv:2401.02954, 2024 | 65 | 2024 |
Label words are anchors: An information flow perspective for understanding in-context learning L Wang, L Li, D Dai, D Chen, H Zhou, F Meng, J Zhou, X Sun arXiv preprint arXiv:2305.14160, 2023 | 57 | 2023 |
CascadeBERT: Accelerating Inference of Pre-trained Language Models via Calibrated Complete Models Cascade L Li, Y Lin, D Chen, S Ren, P Li, J Zhou, X Sun Findings of EMNLP 2021, 2020 | 42* | 2020 |
Incorporating fine-grained events in stock movement prediction D Chen, Y Zou, K Harimoto, R Bao, X Ren, X Sun ECONLP 2019, 2019 | 40 | 2019 |
DeepSeekMoE: Towards ultimate expert specialization in mixture-of-experts language models D Dai, C Deng, C Zhao, RX Xu, H Gao, D Chen, J Li, W Zeng, X Yu, Y Wu, ... arXiv preprint arXiv:2401.06066, 2024 | 31 | 2024 |
Towards codable text watermarking for large language models L Wang, W Yang, D Chen, H Zhou, Y Lin, F Meng, J Zhou, X Sun arXiv preprint arXiv:2307.15992, 2023 | 27 | 2023 |
Group, extract and aggregate: Summarizing a large amount of finance news for forex movement prediction D Chen, K Harimoto, R Bao, Q Su, X Sun ECONLP 2019, 2019 | 25 | 2019 |
Math-Shepherd: Verify and reinforce LLMs step-by-step without human annotations P Wang, L Li, Z Shao, RX Xu, D Dai, Y Li, D Chen, Y Wu, Z Sui CoRR, abs/2312.08935, 2023 | 15 | 2023 |
Rethinking the Promotion Brought by Contrastive Learning to Semi-Supervised Node Classification D Chen, Y Lin, L Li, X Ren, P Li, J Zhou, X Sun IJCAI 2022, 2020 | 15* | 2020 |
Math-Shepherd: A label-free step-by-step verifier for LLMs in mathematical reasoning P Wang, L Li, Z Shao, RX Xu, D Dai, Y Li, D Chen, Y Wu, Z Sui arXiv preprint arXiv:2312.08935, 2023 | 13 | 2023 |
Leveraging word-formation knowledge for Chinese word sense disambiguation H Zheng, L Li, D Dai, D Chen, T Liu, X Sun, Y Liu Findings of the Association for Computational Linguistics: EMNLP 2021, 918-923, 2021 | 10 | 2021 |
Diffusion theory as a scalpel: Detecting and purifying poisonous dimensions in pre-trained language models caused by backdoor or bias Z Zhang, D Chen, H Zhou, F Meng, J Zhou, X Sun arXiv preprint arXiv:2305.04547, 2023 | 5 | 2023 |
Integrating local real data with global gradient prototypes for classifier re-balancing in federated long-tailed learning W Yang, D Chen, H Zhou, F Meng, J Zhou, X Sun arXiv preprint arXiv:2301.10394, 2023 | 5 | 2023 |
HighwayGraph: Modelling long-distance node relations for improving general graph neural network D Chen, X Liu, Y Lin, P Li, J Zhou, Q Su, X Sun arXiv preprint arXiv:1911.03904, 2019 | 4 | 2019 |
Predicting Popular News Comments Based on Multi-Target Text Matching Model D Chen, S Ma, P Yang, Q Su Natural Language Processing and Chinese Computing: 8th CCF International …, 2019 | 3* | 2019 |
Improving Node Classification by Co-training Node Pair Classification: A Novel Training Framework for General Graph Neural Networks D Chen, X Liu, Y Lin, P Li, J Zhou, Q Su, X Sun arXiv preprint arXiv:1911.03904, 2019 | 2 | 2019 |
Fed-FA: Theoretically modeling client data divergence for federated language backdoor defense Z Zhang, D Chen, H Zhou, F Meng, J Zhou, X Sun Advances in Neural Information Processing Systems 36, 2024 | 1 | 2024 |