Multimodal tree decoder for table of contents extraction in document images P Hu, Z Zhang, J Zhang, J Du, J Wu 2022 26th international conference on pattern recognition (ICPR), 1756-1762, 2022 | 10 | 2022 |
SEMv2: Table separation line detection based on instance segmentation Z Zhang, P Hu, J Ma, J Du, J Zhang, H Zhu, B Yin, B Yin, C Liu Pattern Recognition 149, 110279, 2024 | 7* | 2024 |
HRDoc: Dataset and Baseline Method Toward Hierarchical Reconstruction of Document Structures J Ma, J Du, P Hu, Z Zhang, J Zhang, H Zhu, C Liu AAAI 2023, 2023 | 6 | 2023 |
Count, decode and fetch: a new approach to handwritten Chinese character error correction P Hu, J Ma, Z Zhang, J Du, J Zhang arXiv preprint arXiv:2307.16253, 2023 | 3 | 2023 |
USTC-iFLYTEK at DocILE: A Multi-modal Approach Using Domain-specific GraphDoc. Y Wang, J Du, J Ma, P Hu, Z Zhang, J Zhang CLEF (Working Notes), 598-610, 2023 | 3 | 2023 |
Generate, transform, and clean: the role of GANs and transformers in palm leaf manuscript generation and enhancement N Thuon, J Du, Z Zhang, J Ma, P Hu International Journal on Document Analysis and Recognition (IJDAR), 1-18, 2024 | 2 | 2024 |
Hierarchical audio-visual information fusion with multi-label joint decoding for mer 2023 H Wang, Y Xi, H Chen, J Du, Y Song, Q Wang, H Zhou, C Wang, J Ma, ... Proceedings of the 31st ACM International Conference on Multimedia, 9531-9535, 2023 | 2 | 2023 |
Group, Contrast and Recognize: A Self-supervised Method for Chinese Character Recognition X Jiang, J Du, P Hu, M Xue, J Ma, J Wu, J Zhang International Conference on Document Analysis and Recognition, 411-427, 2023 | 2 | 2023 |
SEMv3: A Fast and Robust Approach to Table Separation Line Detection C Qin, Z Zhang, P Hu, C Liu, J Ma, J Du arXiv preprint arXiv:2405.11862, 2024 | 1 | 2024 |
Bidirectional Trained Tree-Structured Decoder for Handwritten Mathematical Expression Recognition H Cheng, C Liu, P Hu, Z Zhang, J Ma, J Du arXiv preprint arXiv:2401.00435, 2023 | 1 | 2023 |
UniTabNet: Bridging Vision and Language Models for Enhanced Table Structure Recognition Z Zhang, S Liu, P Hu, J Ma, J Du, J Zhang, Y Hu arXiv preprint arXiv:2409.13148, 2024 | | 2024 |
DocMamba: Efficient Document Pre-training with State Space Model P Hu, Z Zhang, J Ma, S Liu, J Du, J Zhang arXiv preprint arXiv:2409.11887, 2024 | | 2024 |
ICDAR 2024 Competition on Recognition of Chemical Structures M Chen, H Wu, Q Chang, H Cheng, J Ma, P Hu, Z Zhang, C Liu, C Pi, J Hu, ... International Conference on Document Analysis and Recognition, 397-409, 2024 | | 2024 |
Radical Similarity Based Model Optimization and Post-correction for Chinese Character Recognition Z Han, J Du, M Xue, J Ma, P Hu, Z Zhang International Conference on Document Analysis and Recognition, 152-168, 2024 | | 2024 |
Exploring Audio-Visual Information Fusion for Sound Event Localization and Detection In Low-Resource Realistic Scenarios Y Jiang, Q Wang, J Du, M Hu, P Hu, Z Liu, S Cheng, Z Nian, Y Dong, ... arXiv preprint arXiv:2406.15160, 2024 | | 2024 |
SRFUND: A Multi-Granularity Hierarchical Structure Reconstruction Benchmark in Form Understanding J Ma, Y Wang, C Liu, J Du, Y Hu, Z Zhang, P Hu, Q Wang, J Zhang arXiv preprint arXiv:2406.08757, 2024 | | 2024 |
Viewing Writing as Video: Optical Flow based Multi-Modal Handwritten Mathematical Expression Recognition H Cheng, J Du, P Hu, J Ma, Z Zhang, M Xue ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | | 2024 |
MATHS: MULTIMODAL TRANSFORMER-BASED HUMAN-READABLE SOLVER Y Pan, Z Zhang, J Ma, P Hu, J Du, Q Wang, J Zhang, D Liu, S Wei | | |