MIT’s PixelPlayer can isolate the sounds of instruments using AI

Equalizers are one way to pump up the bass in your favorite tunes, but researchers at the Massachusetts Institute of Technology’s Computer Science and Artificial Intelligence Lab (CSAIL) have a better solution. Their system, PixelPlayer, uses artificial intelligence to distinguish between the sounds of individual instruments, isolate them, and make them louder or softer.
Given a video as input, the fully trained PixelPlayer system splits the accompanying audio, identifies the sources of sound, calculates the volume of each pixel in the image, and “spatially localizes” it, i.e., identifies the regions in the clip that generate similar sound waves.
It’s detailed in “The Sound of Pixels,” a new paper accepted at the upcoming European Conference on Computer Vision, scheduled for September in Munich, Germany.
“We expected a best-case scenario where we could recognize which instruments make which kinds of sounds,” Hang Zhao, a Ph.D. student at CSAIL and a coauthor on the paper, said. “We were surprised that we could actually spatially locate the instruments at the pixel level. Being able to do that opens up a lot of possibilities, like being able to edit the soundtrack audio of individual instruments by a single click on the video.”

Above: PixelPlayer learned to associate sound waves with pixels in video frames.

At the core of PixelPlayer is a neural network trained on MUSIC (Multimodal Sources of Instrument Combinations), a dataset of 714 untrimmed, unlabeled videos from YouTube. (Five hundred videos — 60 hours’ worth — were used for training, and the rest were used for validation and testing.) During the training process, the researchers fed the algorithm clips of performers playing acoustic guitars, cellos, clarinets, flutes, and other instruments.
That network is just one part of PixelPlayer’s multipronged machine learning framework. After the trained video analysis algorithm extracts visual features from the clips’ frames, a second neural network, an audio analysis network, splits the sound into components and extracts features from them. Finally, an audio synthesizer network uses the output of the two networks to associate specific pixels with sound waves.
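To make that division of labor concrete, here is a minimal sketch of how three such networks could fit together, written in PyTorch. The layer sizes, the number of audio components K, and the use of spectrogram masks are illustrative assumptions, not the paper’s exact configuration.

```python
# Minimal sketch of a PixelPlayer-style three-network layout (illustrative only).
import torch
import torch.nn as nn

K = 16  # assumed number of audio components

class VideoNet(nn.Module):
    """Extracts a K-dim feature vector at each spatial location of a frame."""
    def __init__(self):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, K, 3, stride=2, padding=1),
        )

    def forward(self, frames):              # frames: (B, 3, H, W)
        return self.features(frames)        # (B, K, H/4, W/4)

class AudioNet(nn.Module):
    """Splits the mixture spectrogram into K component feature maps."""
    def __init__(self):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 32, 3, padding=1), nn.ReLU(),
            nn.Conv2d(32, K, 3, padding=1),
        )

    def forward(self, spec):                # spec: (B, 1, F, T)
        return self.features(spec)          # (B, K, F, T)

class Synthesizer(nn.Module):
    """Combines one pixel's visual features with the audio components to
    predict a spectrogram mask, i.e., the sound that pixel 'emits'."""
    def forward(self, pixel_feat, audio_feat):
        # pixel_feat: (B, K) for one chosen pixel; audio_feat: (B, K, F, T)
        weights = pixel_feat.unsqueeze(-1).unsqueeze(-1)          # (B, K, 1, 1)
        mask = torch.sigmoid((weights * audio_feat).sum(dim=1))   # (B, F, T)
        return mask

# Toy forward pass on random data, just to show how the pieces connect.
frames = torch.randn(1, 3, 224, 224)
spec = torch.randn(1, 1, 256, 64)
video_net, audio_net, synth = VideoNet(), AudioNet(), Synthesizer()

pixel_features = video_net(frames)            # (1, K, 56, 56)
audio_components = audio_net(spec)            # (1, K, 256, 64)
one_pixel = pixel_features[:, :, 28, 28]      # features at a single pixel
mask = synth(one_pixel, audio_components)     # that pixel's predicted sound
print(mask.shape)                             # torch.Size([1, 256, 64])
```

Selecting a different pixel simply selects a different feature vector, which is what lets the system attribute distinct sounds to distinct regions of the frame.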
PixelPlayer is entirely self-supervised, meaning that it doesn’t require humans to annotate the data, and it’s capable of identifying the sounds of more than 20 instruments. (Zhao said a larger dataset would allow it to recognize more, but that it would have trouble handling subtle differences between subclasses of instruments.) It can also recognize elements of music, like harmonic frequencies from a violin.
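Self-supervision here means the training signal comes from the data itself. The paper trains with a “mix-and-separate” strategy: audio tracks from two different videos are mixed, and the network must recover each original track from the mixture, so the solo recordings act as their own ground truth. Below is a small sketch of that idea; the stand-in model, the ratio-mask targets, and the MSE loss are assumptions made for the example rather than the paper’s exact formulation.

```python
# Illustrative mix-and-separate training step: no human labels required.
import torch
import torch.nn as nn
import torch.nn.functional as F

class TinySeparator(nn.Module):
    """Stand-in for PixelPlayer: predicts a spectrogram mask for the
    instrument shown in the given frames. Purely illustrative."""
    def __init__(self):
        super().__init__()
        self.audio = nn.Conv2d(1, 8, 3, padding=1)
        self.video = nn.Linear(3 * 32 * 32, 8)

    def forward(self, mixture, frames):              # (B,1,F,T), (B,3,32,32)
        a = self.audio(mixture)                      # (B, 8, F, T)
        v = self.video(frames.flatten(1))            # (B, 8)
        mask = (a * v[:, :, None, None]).sum(1, keepdim=True)
        return torch.sigmoid(mask)                   # (B, 1, F, T)

def mix_and_separate_loss(model, spec_a, spec_b, frames_a, frames_b):
    mixture = spec_a + spec_b                        # synthetic mixture
    # "Ground-truth" ratio masks come from the solo tracks themselves,
    # which is why no annotation is needed.
    target_a = spec_a / (mixture + 1e-8)
    target_b = spec_b / (mixture + 1e-8)
    pred_a = model(mixture, frames_a)
    pred_b = model(mixture, frames_b)
    return F.mse_loss(pred_a, target_a) + F.mse_loss(pred_b, target_b)

model = TinySeparator()
spec_a, spec_b = torch.rand(2, 1, 1, 64, 16)         # two solo spectrograms
frames_a, frames_b = torch.rand(2, 1, 3, 32, 32)     # matching video frames
loss = mix_and_separate_loss(model, spec_a, spec_b, frames_a, frames_b)
loss.backward()
```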
The researchers think PixelPlayer could aid in sound editing, or be used on robots to better understand environmental sounds that animals, vehicles, and other objects make.
“We expect our work can open up new research avenues for understanding the problem of sound source separation using both visual and auditory signals,” they wrote.
Source: VentureBeat