Business

Google AI can focus on individual speakers in a crowd

Fri, Apr 13 2018 11:43:04 AM

San Francisco, Apr 13 (IANS): Just as most smartphone cameras now allow users to focus on a single object among many, it may soon be possible to pick out individual voices in a crowd by suppressing all other sounds, thanks to a new Artificial Intelligence (AI) system developed by Google researchers.

This is an important development as computers as not as good as humans at focusing their attention on a particular person in a noisy environment.

Known as the cocktail party effect, the capability to mentally "mute" all other voices and sounds comes natural to us humans.

However, automatic speech separation -- separating an audio signal into its individual speech sources -- remains a significant challenge for computers, Inbar Mosseri and Oran Lang, software engineers at Google Research, wrote in a blog post this week.

In a new paper, the researchers presented a deep learning audio-visual model for isolating a single speech signal from a mixture of sounds such as other voices and background noise.

"In this work, we are able to computationally produce videos in which speech of specific people is enhanced while all other sounds are suppressed," Mosseri and Lang said.

The method works on ordinary videos with a single audio track, and all that is required from the user is to select the face of the person in the video they want to hear, or to have such a person be selected algorithmically based on context.

The researchers believe this capability can have a wide range of applications, from speech enhancement and recognition in videos, through video conferencing, to improved hearing aids, especially in situations where there are multiple people speaking.

"A unique aspect of our technique is in combining both the auditory and visual signals of an input video to separate the speech," the researchers said.

"Intuitively, movements of a person's mouth, for example, should correlate with the sounds produced as that person is speaking, which in turn can help identify which parts of the audio correspond to that person," they explained.

The visual signal not only improves the speech separation quality significantly in cases of mixed speech, but, importantly, it also associates the separated, clean speech tracks with the visible speakers in the video, the researchers said.

Follow Daijiworld News Network on

Latest

UNESCO, IT Ministry to help India create AI policy with global ethical standards

Centre approves uniform protection protocol for users of Indian grid

S&P revises South Africa's outlook to positive

I’m a fool essentially providing free funding to OpenAI: Musk once told Altman

FIIs to reduce selling in India towards year-end, fresh allocations to occur

Centre launches campaign for assessment year 2024-25 to help taxpayers

Russia's energy giant to stop gas deliveries to Austria

Business

Google AI can focus on individual speakers in a crowd

Top Stories

Mangaluru: Nitte DU’s 14th convocation ceremony held

Leave a Comment Your Email address will not be published.

Title: Google AI can focus on individual speakers in a crowd

You might also like

Sushmita Sen believes in putting intentions into action

Elton John calls becoming a dad to kids later in life, the ‘greatest things’

Camera Assistant dies in tragic accident on ‘Anupamaa’ sets

Will.i.am reveals why he lives in a hotel

‘Mahavatar Narsimha’ motion poster promises high-voltage animated film based on scriptures

Manushi Chhillar shares checklist for the week, ticks off small joys

Sharvi says Badshah gave the design idea for her outfit in their collaboration ‘Morni’

Jigar Saraiya recollects how Anu Malik offered him their first paid gig

Rithvik Dhanjani expresses his love for furry friends with 'Pet Stories By The Pet Station'

Tension brews between Nayanthara, Dhanush as actress points gun at him over usage of footage

Manipur govt urges MHA to withdraw AFSPA from six police stations

TN Police to issue Red Corner Notice against elusive gangster in BSP leader murder case

MP: NHAI engineers suspended for lifting Madhavrao Scindia’s statue in ‘disrespectful’ manner

RJD leader questions Nitish Kumar’s repeated clarifications on staying in NDA

Andhra college student jumps to death from hostel building after petty spat

Bengal: ED investigating bank accounts of Bangladeshi citizens arrested in Hawala scam

Even Trinamool leaders not safe in Bengal: BJP

Jamaat-e-Islami Hind President asks cadre to reach out to larger society beyond community

‘Militant’ bodies airlifted to Manipur after post-mortem in Assam

Jumbo deaths: MP govt planning major reshuffle of Bandhavgarh Tiger Reserve staff