株式会社キャンパスクリエイト

Aiming to be a Global Open Innovation Hub
centered on Japanese universities
-Campus Create Co., Ltd―

Speech processing technology capable of extracting clear voice even in noisy environments with time variation


Speech processing technology capable of extracting clear voice even in noisy environments with time variation

Organization Name

Tetsuya Shimamura Graduate school of Science and Engineering, Saitama University, Professor

Technical field

There are various noises in the everyday environment. Although noise removal technologies have advanced, it has been difficult to remove noise that has a large time change. In our laboratory, we have developed new noise reduction/removal technologies, such as in-frame processing method that works even with time-varying noise. This is effective not only for call quality such as telephone calls, but also for personal authentication and device control by voice, and is applicable not only to voice, but also to images. It can also support the use of deep learning. We welcome companies that are willing to develop applications and businesses utilizing this technology.

Contact us

Details

Key point

  • We have successfully developed new noise reduction/removal technologies, such as in-frame processing method that works even with time-varying noise, which is effective not only for call quality such as telephone calls, but also for personal authentication and device control by voice, and is applicable not only to voice, but also to images.
  • It can also support the use of deep learning.

Benefit

The advantages of the noise reduction/removal technology developed in this laboratory are as follows.
・A noise suppression method that uses only the current frame.
・Can be used for various frame-based noise reduction techniques in real-time processing.
・Comprises of multiple methods and can effectively emphasize voice (signal-to-noise ratio improvement) in various noisy environments including the conditions with time-varying noise.
・Little distortion in musical noise (residual noise) and sound spectrum.
・Application to the neural network is possible. ・Applicable not only to voice but also to image.

Market Application

This technology is suitable for extracting a clear voice in a real-life environment with many noises.
It is useful not only for improving the quality of voice calls in a real-life environment, but also for improving voice recognition performance.
・Voice recognition in mobile devices
・Instructions and conversations with electrical appliances and AI devices at home.
・Applicable to exporting audio to text in a real-life environment and uses that require precise voice recognition.
・Speech recognition and speaker recognition for automatic driving of a car
・Applicable to voice security systems.
Moreover, the noise reduction/removal technology developed in this laboratory is applicable to various fields where the signal and noise are separated (e.g., ocean, human body, living organism, music, etc.) and also to images.

Contact us

Mail form

Please fill in the following items and click the "Send" button.
The person in charge will call you back in contact.
Please see this one on handling personal information.