Multimodal pre-training based on graph attention network for document understanding Z Zhang, J Ma, J Du, L Wang, J Zhang IEEE Transactions on Multimedia, 2022 | 25 | 2022 |
Query-driven Generative Network for Document Information Extraction in the Wild H Cao, X Li, J Ma, D Jiang, A Guo, Y Hu, H Liu, Y Liu, B Ren ACM-MM 2022, 4261-4271, 2022 | 10 | 2022 |
GMN: Generative Multi-modal Network for Practical Document Information Extraction H Cao, J Ma, A Guo, Y Hu, H Liu, D Jiang, Y Liu, B Ren NAACL 2022, 2022 | 10 | 2022 |
HRDoc: Dataset and Baseline Method Toward Hierarchical Reconstruction of Document Structures J Ma, J Du, P Hu, Z Zhang, J Zhang, H Zhu, C Liu AAAI 2023, 2023 | 5 | 2023 |
Count, Decode and Fetch: A New Approach to Handwritten Chinese Character Error Correction P Hu, J Ma, Z Zhang, J Du, J Zhang arXiv preprint arXiv:2307.16253, 2023 | 3 | 2023 |
SEMv2: Table Separation Line Detection Based on Conditional Convolution Z Zhang, P Hu, J Ma, J Du, J Zhang, H Zhu, B Yin, B Yin, C Liu arXiv preprint arXiv:2303.04384, 2023 | 3 | 2023 |
An Open-Source Library of 2D-GMM-HMM Based on Kaldi Toolkit and Its Application to Handwritten Chinese Character Recognition J Ma, Z Wang, J Du ICIG 2021, 235-244, 2021 | 3 | 2021 |
SEMv2: Table separation line detection based on instance segmentation Z Zhang, P Hu, J Ma, J Du, J Zhang, B Yin, B Yin, C Liu Pattern Recognition 149, 110279, 2024 | 2 | 2024 |
USTC-iFLYTEK at DocILE: a multi-modal approach using domain-specific GraphDoc Y Wang, J Du, J Ma, P Hu, Z Zhang, J Zhang Working Notes of CLEF, 2023 | 2 | 2023 |
Generate, transform, and clean: the role of GANs and transformers in palm leaf manuscript generation and enhancement N Thuon, J Du, Z Zhang, J Ma, P Hu International Journal on Document Analysis and Recognition (IJDAR), 1-18, 2024 | 1 | 2024 |
Hierarchical Audio-Visual Information Fusion with Multi-label Joint Decoding for MER 2023 H Wang, Y Xi, H Chen, J Du, Y Song, Q Wang, H Zhou, C Wang, J Ma, ... ACM-MM 2023, 9531-9535, 2023 | 1 | 2023 |
SEMv3: A Fast and Robust Approach to Table Separation Line Detection C Qin, Z Zhang, P Hu, C Liu, J Ma, J Du arXiv preprint arXiv:2405.11862, 2024 | | 2024 |
Viewing Writing as Video: Optical Flow based Multi-Modal Handwritten Mathematical Expression Recognition H Cheng, J Du, P Hu, J Ma, Z Zhang, M Xue ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024 | | 2024 |
A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition Y Dai, H Chen, J Du, R Wang, S Chen, J Ma, H Wang, CH Lee CVPR 2024, 2024 | | 2024 |
Bidirectional Trained Tree-Structured Decoder for Handwritten Mathematical Expression Recognition H Cheng, C Liu, P Hu, Z Zhang, J Ma, J Du arXiv preprint arXiv:2401.00435, 2023 | | 2023 |
Enhancing Math Word Problem Solving Through Salient Clue Prioritization: A Joint Token-Phrase-Level Feature Integration Approach J Xie, J Ma, X Zhang, J Zhang, J Du 2023 International Conference on Asian Language Processing (IALP), 252-257, 2023 | | 2023 |
Group, Contrast and Recognize: A Self-supervised Method for Chinese Character Recognition X Jiang, J Du, P Hu, M Xue, J Ma, J Wu, J Zhang International Conference on Document Analysis and Recognition, 411-427, 2023 | | 2023 |