Shuai Wang

Senior Researcher

Tencent

Biography

I obtained my Ph.D. degree from Shanghai Jiao Tong University in September 2020, under the supervision of Kai Yu and Yanmin Qian. During my Ph.D., my research interests included deep learning based approaches for speaker recognition, speaker diarization, and voice activity detection.

I serve as a regular reviewer for speech-related conferences and journals, including Interspeech, ICASSP, ICME, and TASLP.

Currently, I work at Tencent as a senior researcher, and my research area has extended to speech synthesis, which is a fascinating task.

Interests

  • Sound generation
  • Speech synthesis
  • Speaker recognition
  • Speaker diarization
  • Bayesian methods

Education

  • PhD in Computer Science and Technology, 2020

    Shanghai Jiao Tong University

  • BSc in Software Engineering, 2014

    Northwestern Polytechnical University

Experience

Research assistant

Speech@FIT in Brno University of Technology

Feb 2019 – Oct 2019 Brno, Czech Republic

Worked on several research papers and contributed to:

  • The VoxSRC 2019 speaker recognition challenge (1st place in 2 tracks)
  • The DIHARD 2019 speaker diarization challenge (1st place in 4 tracks)
  • The NIST SRE 2019 speaker recognition challenge

Researcher

AISPEECH

Jun 2018 – Dec 2018 Suzhou, China

Worked on deep learning based speaker recognition systems for real-world applications such as:

  • Smart phones
  • Car equipment
  • Smart-home devices

Researcher

AISPEECH

May 2017 – Aug 2018 Suzhou, China

Worked on single-channel multi-speaker recognition; see https://www.youtube.com/watch?v=YFHboRGedY4 for more details.

Recent Publications

Speaker Embedding Augmentation with Noise Distribution Matching

SynAug: Synthesis-Based Data Augmentation for Text-Dependent Speaker Verification

Unit Selection Synthesis based Data Augmentation for Fixed Phrase Speaker Verification