关注
Pengfei Hu
Pengfei Hu
在 mail.ustc.edu.cn 的电子邮件经过验证
标题
引用次数
引用次数
年份
Multimodal tree decoder for table of contents extraction in document images
P Hu, Z Zhang, J Zhang, J Du, J Wu
2022 26th international conference on pattern recognition (ICPR), 1756-1762, 2022
102022
SEMv2: Table separation line detection based on instance segmentation
Z Zhang, P Hu, J Ma, J Du, J Zhang, H Zhu, B Yin, B Yin, C Liu
Pattern Recognition 149, 110279, 2024
7*2024
HRDoc: Dataset and Baseline Method Toward Hierarchical Reconstruction of Document Structures
J Ma, J Du, P Hu, Z Zhang, J Zhang, H Zhu, C Liu
AAAI 2023, 2023
62023
Count, decode and fetch: a new approach to handwritten Chinese character error correction
P Hu, J Ma, Z Zhang, J Du, J Zhang
arXiv preprint arXiv:2307.16253, 2023
32023
USTC-iFLYTEK at DocILE: A Multi-modal Approach Using Domain-specific GraphDoc.
Y Wang, J Du, J Ma, P Hu, Z Zhang, J Zhang
CLEF (Working Notes), 598-610, 2023
32023
Generate, transform, and clean: the role of GANs and transformers in palm leaf manuscript generation and enhancement
N Thuon, J Du, Z Zhang, J Ma, P Hu
International Journal on Document Analysis and Recognition (IJDAR), 1-18, 2024
22024
Hierarchical audio-visual information fusion with multi-label joint decoding for mer 2023
H Wang, Y Xi, H Chen, J Du, Y Song, Q Wang, H Zhou, C Wang, J Ma, ...
Proceedings of the 31st ACM International Conference on Multimedia, 9531-9535, 2023
22023
Group, Contrast and Recognize: A Self-supervised Method for Chinese Character Recognition
X Jiang, J Du, P Hu, M Xue, J Ma, J Wu, J Zhang
International Conference on Document Analysis and Recognition, 411-427, 2023
22023
SEMv3: A Fast and Robust Approach to Table Separation Line Detection
C Qin, Z Zhang, P Hu, C Liu, J Ma, J Du
arXiv preprint arXiv:2405.11862, 2024
12024
Bidirectional Trained Tree-Structured Decoder for Handwritten Mathematical Expression Recognition
H Cheng, C Liu, P Hu, Z Zhang, J Ma, J Du
arXiv preprint arXiv:2401.00435, 2023
12023
UniTabNet: Bridging Vision and Language Models for Enhanced Table Structure Recognition
Z Zhang, S Liu, P Hu, J Ma, J Du, J Zhang, Y Hu
arXiv preprint arXiv:2409.13148, 2024
2024
DocMamba: Efficient Document Pre-training with State Space Model
P Hu, Z Zhang, J Ma, S Liu, J Du, J Zhang
arXiv preprint arXiv:2409.11887, 2024
2024
ICDAR 2024 Competition on Recognition of Chemical Structures
M Chen, H Wu, Q Chang, H Cheng, J Ma, P Hu, Z Zhang, C Liu, C Pi, J Hu, ...
International Conference on Document Analysis and Recognition, 397-409, 2024
2024
Radical Similarity Based Model Optimization and Post-correction for Chinese Character Recognition
Z Han, J Du, M Xue, J Ma, P Hu, Z Zhang
International Conference on Document Analysis and Recognition, 152-168, 2024
2024
Exploring Audio-Visual Information Fusion for Sound Event Localization and Detection In Low-Resource Realistic Scenarios
Y Jiang, Q Wang, J Du, M Hu, P Hu, Z Liu, S Cheng, Z Nian, Y Dong, ...
arXiv preprint arXiv:2406.15160, 2024
2024
SRFUND: A Multi-Granularity Hierarchical Structure Reconstruction Benchmark in Form Understanding
J Ma, Y Wang, C Liu, J Du, Y Hu, Z Zhang, P Hu, Q Wang, J Zhang
arXiv preprint arXiv:2406.08757, 2024
2024
Viewing Writing as Video: Optical Flow based Multi-Modal Handwritten Mathematical Expression Recognition
H Cheng, J Du, P Hu, J Ma, Z Zhang, M Xue
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
2024
MATHS: MULTIMODAL TRANSFORMER-BASED HUMAN-READABLE SOLVER
Y Pan, Z Zhang, J Ma, P Hu, J Du, Q Wang, J Zhang, D Liu, S Wei
系统目前无法执行此操作,请稍后再试。
文章 1–18