He Huang
Verified email at nvidia.com
Title
Cited by
Year
Generative dual adversarial network for generalized zero-shot learning
H Huang, C Wang, PS Yu, CD Wang
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019
Cited by: 266, Year: 2019
An introduction to image synthesis with generative adversarial nets
H Huang, PS Yu, C Wang
Cited by: 223, Year: 2018
Fast conformer with linearly scalable attention for efficient speech recognition
D Rekesh, NR Koluguri, S Kriman, S Majumdar, V Noroozi, H Huang, ...
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
Cited by: 41, Year: 2023
Deep latent factor model with hierarchical similarity measure for recommender systems
J Han, L Zheng, H Huang, Y Xu, PS Yu, W Zuo
Information Sciences 503, 521-532, 2019
Cited by: 28, Year: 2019
dpMood: exploiting local and periodic typing dynamics for personalized mood prediction
H Huang, B Cao, PS Yu, CD Wang, AD Leow
2018 IEEE International Conference on Data Mining (ICDM), 157-166, 2018
Cited by: 28, Year: 2018
SALM: Speech-augmented language model with in-context learning for speech recognition and translation
Z Chen, H Huang, A Andrusenko, O Hrinchuk, KC Puvvada, J Li, S Ghosh, ...
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by: 16, Year: 2024
Passive sensing of affective and cognitive functioning in mood disorders by analyzing keystroke kinematics and speech dynamics
F Hussain, JP Stange, SA Langenecker, MG McInnis, J Zulueta, ...
Digital phenotyping and mobile sensing: New developments in …, 2019
Cited by: 11, Year: 2019
Efficient sequence transduction by jointly predicting tokens and durations
H Xu, F Jia, S Majumdar, H Huang, S Watanabe, B Ginsburg
International Conference on Machine Learning, 38462-38484, 2023
Cited by: 8, Year: 2023
Multi-label Zero-shot Classification by Learning to Transfer from External Knowledge
H Huang, Y Chen, W Tang, W Zheng, QG Chen, P Yu
BMVC'2020 (oral), 2020
Cited by: 8, Year: 2020
Addressing Class Imbalance in Scene Graph Parsing by Learning to Contrast and Score
H Huang, S Saito, Y Kikuchi, E Matsumoto, W Tang, PS Yu
ACCV'2020, 2020
Cited by: 7, Year: 2020
Property-aware multi-speaker data simulation: A probabilistic modelling technique for synthetic data generation
TJ Park, H Huang, C Hooper, N Koluguri, K Dhawan, A Jukic, J Balam, ...
arXiv preprint arXiv:2310.12371, 2023
Cited by: 5, Year: 2023
Leveraging pretrained ASR encoders for effective and efficient end-to-end speech intent classification and slot filling
H Huang, J Balam, B Ginsburg
arXiv preprint arXiv:2307.07057, 2023
Cited by: 4, Year: 2023
Passive sensing of affective and cognitive functioning in mood disorders by analyzing keystroke kinematics and speech dynamics
F Hussain, JP Stange, SA Langenecker, MG McInnis, J Zulueta, ...
Digital Phenotyping and Mobile Sensing: New Developments in …, 2022
Cited by: 4, Year: 2022
The CHiME-7 Challenge: System Description and Performance of NeMo Team's DASR System
TJ Park, H Huang, A Jukic, K Dhawan, KC Puvvada, N Koluguri, N Karpov, ...
arXiv preprint arXiv:2310.12378, 2023
Cited by: 3, Year: 2023
Translational concept embedding for generalized compositional zero-shot learning
H Huang, W Tang, J Zhang, PS Yu
arXiv preprint arXiv:2112.10871, 2021
Cited by: 3, Year: 2021
Dapred: Dynamic attention location prediction with long-short term movement regularity
J Liu, Q Yuan, C Yang, H Huang, C Zhang, P Yu
Cited by: 3, Year: 2019
DeSTA: Enhancing speech language models through descriptive speech-text alignment
KH Lu, Z Chen, SW Fu, H Huang, B Ginsburg, YCF Wang, H Lee
arXiv preprint arXiv:2406.18871, 2024
Cited by: 2, Year: 2024
Less is More: Accurate Speech Recognition & Translation without Web-Scale Data
KC Puvvada, P Żelasko, H Huang, O Hrinchuk, NR Koluguri, K Dhawan, ...
arXiv preprint arXiv:2406.19674, 2024
Cited by: 1, Year: 2024
BESTOW: Efficient and Streamable Speech Language Model with the Best of Two Worlds in GPT and T5
Z Chen, H Huang, O Hrinchuk, KC Puvvada, NR Koluguri, P Żelasko, ...
arXiv preprint arXiv:2406.19954, 2024
Cited by: 1, Year: 2024
Resource-Efficient Adaptation of Speech Foundation Models for Multi-Speaker ASR
W Wang, K Dhawan, T Park, KC Puvvada, I Medennikov, S Majumdar, ...
arXiv preprint arXiv:2409.01438, 2024
Year: 2024