Thursday, January 31, 2008

Search Inside the Music from Sun

Sun's Search Inside the Music

For those of us who love music, wishful thinking always includes the creation of a playlist that's specifically tailored to our interests and mood. Current recommender systems utilize one of two ways to sort music: collaborative filtering or content matching. Amazon uses collaborative filtering and offers suggestions based on what other's have found interesting. Pandora, as it turns out, uses content matching by hiring musicians to analyze songs and group them by hand into micro-genres. The Search Inside the Music poject from Sun aims to bring together the best of both worlds!!

Being the greedy individuals that we are, the best approach is one that offers the best of both worlds. Enter, supposedly, Sun's Search Inside the Music project! This project, led by principal investigator Paul Lamere, is is believed by team members to be the best music similarity algorithm because it's based on the actual sound. Leading recommenders categorize music based on artist, album, song and genre. Search Inside the Music analyzes features such as frequency, beats per minute, pitch, harmony, key, timbre, instrumentaion, tempo, intensity and energy level to map out the rhtyhm structure and determine the genre and which instruments are playing. In essence, it uses acoustic similarity to help people find music that "sounds similar" to music that they already like. According to Lamere, it takes a Pandora staffer about 20 minutes to categorize each song. The Sun system, on the other hand, is fully automated -- it's a computer-driven algorithm capable of whipping along at a clip of 3 seconds per CPU per song.

One of the more impressive capabilities of the Search Inside the Music system is its ability to generate a visualization of the acoustic distance among songs of different genres. This display shows which specific songs are similar to other songs of the same genre, but it also illustrates the degree of acoustic similarity between songs of different genres. This type of visualization provides a way to quickly create customized playlists based on acoustic similarity.

"For examples, let's say you've had a rough day at work; you're leaving the office and heading into heavy rush-hour traffic, and you want to hear myusic that will help you reduce your stress level as you drive home. The Search Inside the Music system can quickly generate a playlist that serves as a 'musical journey,' starting with higher-intensity songs that match your current stress level, and gradually diminshing the intesity of the songs as you make your way home. So during your 50-minute commute, you make the transition from Rage Against the Machine and led Zeppelin to Schumann's piano music--smoothly and seamlessly. And all the songs that are played are songs that you like."

Sun's other innovation is a tagging system that categorizes music based not on who's purchased it but on its attributes, described with tags like "quirky", "indie", "rock", "fast", "frenzied", "90's", or "cute" and "fun". For example, querying Sun's prototype search engine for Led Zeppelin brings up "tagomendations" such as the Rolling Stones and Jimi Hendrix. The user can then click a "why" button to find out why a particular song was recommended. Hendrix is recommended for Led Zeppelin based on tags like 'guitar gods', 'classic rock', 'guitar virtuoso' and 'psychedelic'. Sun is compiling these tags by searching reviews, lyrics, music blogs, social tagging sites and artist biographies, and incorporating the information into a prototype search engine. Compiling the tags based on a comprehensive search of the Web prevents people from gaming the system by generating their own tags to enhance the popularity of certain tracks.

In addition to recommendations for other music, the search engine provides links to videos, pictures and upcoming concerts, if the artist search for is alive and touring.

While there are certainly aspects/feelings that you can't get from analyzing the audio, it can help artists who are so new that they fly under the radar.

The more prevalent concerns about such a system stem from scalability. Where would analytical data be stored, how can the system be scaled to handle billions of songs and how can we find the computer power necessary to support such a system. Sun's computing model called grid computing, which lets you plug into a huge network of extremely power computers and draw on their combined CPU power on demand, provides a promising solution. Future advancements to the computerized analysis will enable recognition of major and minor chords, bridges and choruses, and the rhythm patterns of reggae, pop and ska according to Lamere. Though the Search Inside the Music project is not likely to be released as a commercial product, Sun officials say they plan to make the software available as open source, perhaps within six months.

A new music recommendation system from Sun (November 5, 2007)
Machine learning fuels Sun music recommendation technology (October 31, 2007)
Sun Micro spins its music software (April 9, 2007)
Search Inside the Music (September 26, 2006)

1 comment:

Ajeet said...

on the topic of pandora ... it is intrersting to note that pandora cannot play music on demand ... so i can request all tracks that sound like "XYZ" but cant play the track XYZ itself ... they say its the result of licensing restrictions.

on the topic of recommendations for music ... check this out ... http://blogs.sun.com/plamere/entry/collaborative_playlists

i found this on the same blog as the other tagomendations link ... dont know if this project is tied up to pandora or music genome .. but the concept of sharing playlists with friends is awesome ... we all do it by sharing and talking about music ... spotify makes it a lot easier and convenient