Quantcast

NetFlix: Predictive Analytics & Recommendations

The NetFlix Contest begins today — the contest to improve the Netflix recommendation engine, with a $1MM grand prize and a $50K progress awards. To win the grand prize, the contestant must show an improvement of 10% better than the current Cinematch algorithm. Here’s their explanation:

The Netflix Prize seeks to substantially improve the accuracy of predictions about how much someone is going to love a movie based on their movie preferences. Improve it enough and you win one (or more) Prizes. Winning the Netflix Prize improves our ability to connect people to the movies they love.

Here’s the best part — they’ve provided training data set containing 500,000 past and current recommendations. This should be fun — I’ve downloaded the 650 MB file and will begin playing with it tonight. I’ll get to use some python and revisit the good ‘ol graduate school days of machine learning and neural nets.

Netflix, Inc.

More explanation of the Rules here:

Netflix is all about connecting people to the movies they love. To help customers find those movies, we’ve developed our world-class movie recommendation system: CinematchSM. Its job is to predict whether someone will enjoy a movie based on how much they liked or disliked other movies. We use those predictions to make personal movie recommendations based on each customer’s unique tastes. And while Cinematch is doing pretty well, it can always be made better.

Now there are a lot of interesting alternative approaches to how Cinematch works that we haven’t tried. Some are described in the literature, some aren’t. We’re curious whether any of these can beat Cinematch by making better predictions. Because, frankly, if there is a much better approach it could make a big difference to our customers and our business.

So, we thought we’d make a contest out of finding the answer. It’s “easy” really. We provide you with a lot of anonymous rating data, and a prediction accuracy bar that is 10% better than what Cinematch can do on the same training data set. (Accuracy is a measurement of how closely predicted ratings of movies match subsequent actual ratings.) If you develop a system that we judge most beats that bar on the qualifying test set we provide, you get serious money and the bragging rights. But (and you knew there would be a catch, right?) only if you share your method with us and describe to the world how you did it and why it works.

Serious money demands a serious bar. We suspect the 10% improvement is pretty tough, but we also think there is a good chance it can be achieved. It may take months; it might take years. So to keep things interesting, in addition to the Grand Prize, we’re also offering a $50,000 Progress Prize each year the contest runs. It goes to the team whose system we judge shows the most improvement over the previous year’s best accuracy bar on the same qualifying test set. No improvement, no prize. And like the Grand Prize, to win you’ll need to share your method with us and describe it for the world.

If you enjoy what you find here, please consider subscribing to shmula.com or you can enter your email address here to be subscribed via email:




5 Comments »

  1. Hans said,

    October 2, 2006 @ 7:15 am

    This looks like a lot of fun. The chances of winning are probably pretty small unless you’re deep in the Machine Learning/AI field (look at who came up with their system in the FAQ, then consider who your competition will be), but it can’t hurt to try and would be a lot of fun.

    I’m very impressed by the way the contest it layed out. Good evaluation techniques, sizeable data set, results back to the world, etc. Kudos to netflix.

  2. james said,

    October 2, 2006 @ 9:14 am

    Good luck! I love Nerflix and would love to see them get even better. I always wondered how their recommendations worked.

  3. Bill said, said,

    October 3, 2006 @ 8:29 am

    Good Luck to all

  4. Jordy said,

    October 18, 2006 @ 8:52 pm

    That sounds like a lot of fun.

    One of the first things Netflix needs to do is allow users to rate things with half or even quarter stars. If people didn’t have to round (I do all the time), the input would be a lot better, and so would the output.

  5. Jordy Blog » Blog Archive » Improving Cinemax, the Netflix Recommendation Engine said,

    October 18, 2006 @ 10:25 pm

    [...] Well now it looks like Netflix has placed a bounty on an algorithm to make the Recommendation Engine even better.  My friend Peter Abilla indicates he might give it a try. He’s also posted about the rules and the $1 million dollar prize. I wish I had more time because this is one problem I would love to help solve. (I’m pretty nerdy like that.) Plus it would be cool to have a million bucks.  :) One thing I would recommend (kind of unrelated to the algorithm) is to allow users to rate movies with half, or even quarter starts.  More precise input means better output, and I know I’ve had to round many times when I thought a movie deserved 3 and a half stars, etc. [...]

RSS feed for comments on this post · TrackBack URI

Leave a Comment