Google Deep Learning Model Separates Individual Voices From A Video
To prove that, the company has discussed a new research project that isolates the voice of a single person in a video, even when the voice is in a crowd or marred by background noise. Using deep learning intelligence, a model computationally produces a video in which some speech is enhanced. Using audio and visual signals from speakers, the model can detect mouth movement. By doing this, Google can replicate the ability of humans to focus on a single sound even if there are others in the environment....