35 / 2021-06-22 21:17:35
Pathological Voice Feature Generation Using Generative Adversarial Network
终稿
jinyang Qian / Soochow University
Denghuang Zhao / Soochow University
Ziqi Fan / Soochow University
di wu / Soochow University
Yishen Xu / Soochow University
Zhi Tao / Soochow University
Due to the limitation of the establishment of pathological voice database, the sample size is often insufficient and imbalanced, which leads to the defect of many deep learning methods to be used in pathological voice database. In this paper, generative adversarial network (GAN) is used to generate voice data in the feature aspect to improve the imbalanced distribution of samples. GAN uses a generator and a discriminator for adversarial training to generate data similar to the original data distribution. Back propagation generative adversarial network (BPGAN) and deep convolution generative adversarial network (DCGAN) are used to generate vector features and matrix features respectively. It is found that the recall of minority sample is significantly higher when using balanced training set added data generated by GAN than imbalanced training set. Scatter diagram of generated feature vector and grayscale diagram of feature matrix are drawn. By adding the feature vector generated by BPGAN and DCCGAN, the recall of the test set for the minority sample is increased by 4.75%, and 21% respectively. The results show that GAN can generate different feature sets of pathological voice effectively.
重要日期
  • 会议日期

    10月21日

    2021

    10月23日

    2021

  • 10月26日 2021

    注册截止日期

主办单位
Southeast University, China
移动端
在手机上打开
小程序
打开微信小程序
客服
扫码或点此咨询