A survey on evaluation of large language models Y Chang, X Wang, J Wang, Y Wu, L Yang, K Zhu, H Chen, X Yi, C Wang, ... ACM TIST, 2024 | 752 | 2024 |
On the robustness of ChatGPT: An adversarial and out-of-distribution perspective J Wang, X Hu, W Hou, H Chen, R Zheng, Y Wang, L Yang, H Huang, ... ICLR 2023 Workshop, 2023 | 162 | 2023 |
PromptBench: Towards evaluating the robustness of large language models on adversarial prompts K Zhu, J Wang, J Zhou, Z Wang, H Chen, Y Wang, L Yang, W Ye, ... arXiv preprint arXiv:2306.04528, 2023 | 127 | 2023 |
HTML: Hierarchical Transformer-based Multi-task Learning for Volatility Prediction L Yang, TLJ Ng, B Smyth, R Dong WWW 2020, 2020 | 101 | 2020 |
PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning Optimization Y Wang, Z Yu, Z Zeng, L Yang, C Wang, H Chen, C Jiang, R Xie, J Wang, ... International Conference on Learning Representations (ICLR 2024), 2024 | 94 | 2024 |
USB: A Unified Semi-supervised Learning Benchmark for Classification Y Wang, H Chen, Y Fan, W Sun, R Tao, W Hou, R Wang, L Yang, Z Zhou, ... NeurIPS 2022 Datasets and Benchmarks, 2022 | 89* | 2022 |
Survey on factuality in large language models: Knowledge, retrieval and domain-specificity C Wang, X Liu, Y Yue, X Tang, T Zhang, C Jiayang, Y Yao, W Gao, X Hu, ... arXiv preprint arXiv:2310.07521, 2023 | 75 | 2023 |
Generating Plausible Counterfactual Explanations for Deep Transformers in Financial Text Classification L Yang, EM Kenny, TLJ Ng, Y Yang, B Smyth, R Dong COLING 2020, 2020 | 68 | 2020 |
Exploring the Efficacy of Automatically Generated Counterfactuals for Sentiment Analysis L Yang, J Li, P Cunningham, Y Zhang, B Smyth, R Dong ACL 2021, 2021 | 55 | 2021 |
Explainable Text-Driven Neural Network for Stock Prediction L Yang, Z Zhang, S Xiong, L Wei, J Ng, L Xu, R Dong CCIS 2018 (Best Paper Nomination), 2018 | 54 | 2018 |
GLUE-X: Evaluating Natural Language Understanding Models from an Out-of-distribution Generalization Perspective L Yang, S Zhang, L Qin, Y Li, Y Wang, H Liu, J Wang, X Xie, Y Zhang ACL 2023 Findings, 2023 | 45 | 2023 |
MAEC: A Multimodal Aligned Earnings Conference Call Dataset for Financial Risk Prediction J Li*, L Yang*, B Smyth, R Dong CIKM 2020, 2020 | 40 | 2020 |
Causal inference meets machine learning P Cui, Z Shen, S Li, L Yang, Y Li, Z Chu, J Gao Proceedings of the 26th ACM SIGKDD International Conference on Knowledge …, 2020 | 39 | 2020 |
Deepfake text detection in the wild Y Li, Q Li, L Cui, W Bi, L Wang, L Yang, S Shi, Y Zhang ACL 2024, 2024 | 31 | 2024 |
A Rationale-Centric Framework for Human-in-the-loop Machine Learning J Lu*, L Yang*, BM Namee, Y Zhang ACL 2022, 2022 | 30 | 2022 |
Leveraging BERT to Improve the FEARS Index for Stock Forecasting L Yang, Y Xu, J Ng, R Dong IJCAI 2019, 2019 | 28 | 2019 |
Fast-DetectGPT: Efficient zero-shot detection of machine-generated text via conditional probability curvature G Bao, Y Zhao, Z Teng, L Yang, Y Zhang International Conference on Learning Representations (ICLR 2024), 2024 | 27 | 2024 |
NumHTML: Numeric-Oriented Hierarchical Transformer Model for Multi-task Financial Forecasting L Yang, J Li, R Dong, Y Zhang, B Smyth AAAI 2022, 2022 | 23 | 2022 |
FactMix: Using a Few Labeled In-domain Examples to Generalize to Cross-domain Named Entity Recognition L Yang, L Yuan, L Cui, W Gao, Y Zhang COLING 2022, 2022 | 17 | 2022 |
Multi-level attention-based neural networks for distant supervised relation extraction L Yang, TLJ Ng, C Mooney, R Dong AICS 2017, 2017 | 14 | 2017 |