The acoustic pattern monitoring of a wind turbine has the advantages of being intuitive and contactless, but it is susceptible to noise interference. For this reason, this paper proposes a drive train fault diagnosis method based on the fusion of acoustic and vibration information. First, nine vibration sensors and three acoustic sensors are arranged in the nacelle of the doubly-fed wind turbine to collect vibration information from the main shaft, gearbox, generator, and yaw reducer, as well as acoustic information in the nacelle and yaw platform, respectively. An acoustic-vibration information fusion method, based on drive train spectrum similarity, is constructed, combining time-domain, spectral, and envelope analyses to reveal the operational status and potential fault characteristics of each component. The results show that acoustic-vibration fusion monitoring improves drive train fault diagnosis reliability, laying the foundation for both continuous fault monitoring and independent acoustic pattern monitoring applications in similar units.