The end to out-of-tune karaoke! AI auto-corrects notes that are sung off key (but it doesn’t make them sound robotic like conventional autotune)
- The study using AI looked specifically at pitch correction in a karaoke setting
- An algorithm learns to estimate the amount of tuning required to stay in tune
- It works without using a musical score as reference unlike commercial systems
- The system corrects off-key notes to harmonise with the accompanying music
- Nuances in vocal are maintained and stops corrected vocals sounding robotic
Listening to your friends butcher your favourite songs during karaoke could become a thing of the past, thanks to software created by scientists.
Their AI system can bring your pitch closer to the original artist’s intentions, without making your voice sound robotic or artificial.
That means nuances in your voice are kept and that intentional vocal flourishes aren’t completely erased.
Any minor wobbles, or even massive deviations from the pitch of a song, are simply moved closer to how the song was meant to be sung.
The software, which is not yet commercially available, does this by shifting the pitch of individual sung notes to align them more closely with the accompanying music.
Scroll down for audio clips
Listening to your friends butcher your favourite songs during karaoke nights could become a thing of the past, thanks to software created by scientists. Their AI system can bring your pitch closer to the original artist’s, without making it sound overly robotic or artificial (stock image)
Most commercial autotuning systems require the user to input a melody score, or instructions to modulate pitch by a particular pitch or scale.
Sanna Wager, a PHD candidate and main author of the study at Indiana University, told New Scientist: ‘When looking at how to correct the current note, we look at what the singer did over the past few seconds.’
The current tool must be applied to recordings after they have been made, but the end product could used to make changes on-the-fly.
Her paper, published on the pre-print repository Arxiv.org, contains audio samples of how a voice altered by the commercial version of the product could sound (below).
The AI system can bring your pitch closer to the original artist’s intentions, without making your voice sound robotic or artificial. That means nuances in your voice are kept and that intentional vocal flourishes aren’t completely erased (stock image)
To create the system, researchers at Indiana University Bloomington used 4,702 amateur voice recordings from the online karaoke platform Smule to ‘train’ their AI algorithm to recognise and correction off-key notes.
The team selected 500 tracks that were performed ‘in-tune’ and split the tracks into separate files, one for voice and one for the accompanying music.
They then intentionally created an ‘out-of-tune’ version of the voice track by randomly shifting notes up to a semitone higher, while the accompaniment music was kept the same.
The AI learnt to predict the amount that each voice note needed to be adjusted in order to stay ‘in-pitch’ with the instrumental accompaniment.
This modulation was then applied to all the off-key notes in each solo voice recording to correct the entire voice track.
Writing in the paper, its authors said: ‘This approach differs from commercially used automatic pitch correction systems, where notes in the vocal tracks are shifted to be centered around notes in a user-defined score or mapped to the closest pitch among the twelve equal-tempered scale degrees.’