So Fake It's Real

Written by Eric Olsen
Published November 24, 2003

New vocal technology called Vocaloid:

    Developed at Pompeu Fabra University in Spain and financed by the Yamaha Corporation, the software, which is due to be released to consumers in January, allows users to cast their own (or anyone else's) songs in a disembodied but exceedingly life-like concert-quality voice. Just as a synthesizer might be programmed to play a series of notes like a violin one time and then like a tuba the next, a computer equipped with Vocaloid will be able to "sing" whatever combination of notes and words a user feeds it. The first generation of the software will be available for $200. But its arrival raises the prospect of a time when anyone with a laptop will be able to repurpose any singer's voice or even bring long-gone virtuosos back to life. In an era when our most popular singers are marketed in every conceivable way - dolls, T-shirts, notebooks, make-up lines - the voice may become one more extension of a pop-star brand.

    The human voice has proven the most difficult of all sounds to synthesize. Digital technology can produce something clear enough to convey meaning, but only in a clipped monotone that sounds more like a robot than a real live person. A convincing human voice, spoken or sung, with all its complex, flowing articulations and quivering uncertainties has been unattainable. Yamaha has not yet made Vocaloid available for scrutiny, but judging by some early samples and demonstrations, the company seem to have made that quantum leap.

    You can think of the software as a kind of audio font: musical notation and lyrics can be translated into the chosen voice, then saved for replay, just as a word processor might translate a text into Helvetica or Times New Roman and print it out as many times as you like.

    These fonts are made up of a database of phonemes, the basic sounds that make up any language. To create the database, technicians record a singer performing as many as 60 pages of scripted articulations (like "epp, pep, lep"). Assorted pitches and techniques like glissandos and legatos are also thrown in the mix; with all the combinations, the process takes a week of five-hour singing days. The resultant font is "reminiscent" of the singer's voice, says Ed Stratton, the managing director of Zero-G Limited, a London-based company that has licensed the Vocaloid technology.

    ....Hit music producers like Dan (The Automator) Takemura (a creator of the Gorillaz, a band that appeared only in an animated form, but sold several million albums anyway) and the Matrix (the trio of Scott Spock, Graham Edwards and his wife, Lauren Christy, that produced the three No. 1 hits from Avril Lavigne's last album) say they are likely at least to try recording with Vocaloid instead of backup singers. "As producers, you run into some artists and oh god, it's so hard to get the right vocal," Mr. Spock said. "It's intriguing, this idea of `O.K., just give me all your vowels and all your consonants and I'll see you later.' "

    page 1 | 2
Career media professional Eric Olsen is honored to be the founder and publisher of Blogcritics.org, which, quite frankly, rules - as do his wife and four children.
Keep reading for information and comments on this article, and add some feedback of your own!
So Fake It's Real
Published: November 24, 2003
Type:
Section: Sci/Tech
Filed Under: Music: News
Writer: Eric Olsen
Eric Olsen's BC Writer page
Eric Olsen's personal site
Spread the Word
Like this article?
Email this
Submit to del.icio.us Save to del.icio.us
RSS Feeds
All RSS Feeds (240+)
Comments on this article
BC articles by Eric Olsen
Music: News
All Sci/Tech Articles
Eric Olsen's personal weblog
All BC articles
All BC Comments

Comments

#1 — November 25, 2003 @ 11:12AM — Tom Johnson [URL]

What a shame they couldn't have used a normal audio format. I couldn't make heads or tails of the all-Japanese site for the "Mid Radio Player" you have to use.

But wow, is this creepy. I'm already bothered by the software that keeps people's voices on key. I can see real abuses of this in the future - not only applying dead celebrities vocals to things they might never have had anything to do with were they alive, but also using these impersonations to spruce up half-finished demos or vault songs for release to the public. It's just wrong, man, wrong!

#2 — November 25, 2003 @ 11:17AM — Eric Olsen

Clapton font: "It's all wrong but it's alright.."

#3 — November 25, 2003 @ 11:29AM — TDavid [URL]

You can find windows media file format at the vocaloid site samples available here: http://www.vocaloid.com/en/sample.html

That first sample is downright eerie. Despite it being in Japanese and me not understanding a word, it sounds fantastic.

#4 — November 25, 2003 @ 11:49AM — Eric Olsen

thanks TD! The machines are coming to deal with difficult divas, recalcitrant harmonists, and pitch-challenged poptarts. Somewhere down the line we are going to have to decide what we really value about "humanness."

Want comments emailed to you? No spam, promise! Address:

Add your comment, speak your mind

(Or ping: http://blogcritics.org/mt/tb/10410)

Personal attacks are not allowed. Please read our comment policy.





Remember Name/URL?

Please preview your comment!

Fresh
Articles
Fresh
Comments