KDD 2011: KDD Cup

KDD-Cup 2011: Recommending Music Items based on the Yahoo! Music Dataset

We challenge participants to identify user tastes in music by analyzing real ratings from anonymized Yahoo! Music users. The dataset represents a snapshot of the community's preferences for various musical items. A distinctive feature of this dataset is that user ratings are given not to a single type of entity, as is usual, but to four different types: tracks, albums, artists, and genres, tied together within a hierarchy. Thus, any given track is associated with its album, performing artist, and genres. Similarly, any given album is associated with an artist and one or more genres.
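
The hierarchy described above can be pictured with a small data structure. The following Python sketch is only an illustration: the class name `Item`, its field names, and the helper `linked_items` are hypothetical and do not reflect the actual file format of the dataset.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Item:
    """One rated entity. Field names are illustrative, not the dataset's schema."""
    item_id: int
    kind: str  # "track", "album", "artist", or "genre"
    album_id: Optional[int] = None   # set for tracks only
    artist_id: Optional[int] = None  # set for tracks and albums
    genre_ids: tuple = ()            # set for tracks and albums


def linked_items(item, catalog):
    """Return the ids of catalog items that `item` points to in the hierarchy."""
    linked = []
    if item.album_id is not None:
        linked.append(item.album_id)
    if item.artist_id is not None:
        linked.append(item.artist_id)
    linked.extend(item.genre_ids)
    # Keep only ids that are actually present in the catalog.
    return [i for i in linked if i in catalog]


# A toy catalog: one genre, one artist, one album, one track.
catalog = {
    1: Item(1, "genre"),
    2: Item(2, "artist"),
    3: Item(3, "album", artist_id=2, genre_ids=(1,)),
    4: Item(4, "track", album_id=3, artist_id=2, genre_ids=(1,)),
}
```

With such links in place, a rating given to an album or artist could, for example, inform the prediction for a track that has few ratings of its own.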

The competition offers two tracks, which differ in dataset size and accuracy metric. The first track employs a dataset containing over 260M ratings. For this dataset the task is to predict test set ratings as accurately as possible. The second track concentrates on a smaller training set with about 62M training ratings. Here, the goal is to separate items rated highly by the users from items never rated by the users.
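
The announcement states only that the two tracks use different accuracy metrics, without naming them. As a sketch under that caveat, the Python functions below show two metrics commonly used for tasks of this shape: root mean squared error for rating prediction, and the fraction of misclassified items for the "highly rated versus never rated" separation. Both choices, and the function names `rmse` and `error_rate`, are assumptions made for illustration.

```python
import math


def rmse(predicted, actual):
    """Root mean squared error between predicted and true ratings."""
    if len(predicted) != len(actual) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = sum((p - a) ** 2 for p, a in zip(predicted, actual))
    return math.sqrt(squared / len(actual))


def error_rate(predicted_labels, true_labels):
    """Fraction of items whose predicted class differs from the true class.

    Labels are assumed to be 1 for "rated highly" and 0 for "never rated".
    """
    if len(predicted_labels) != len(true_labels) or not true_labels:
        raise ValueError("inputs must be non-empty and of equal length")
    wrong = sum(1 for p, t in zip(predicted_labels, true_labels) if p != t)
    return wrong / len(true_labels)
```

Lower is better for both functions: `rmse` penalizes large rating errors more heavily than small ones, while `error_rate` treats every misclassified item equally.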

The main technique we expect participants to use is collaborative filtering. We believe that more successful attempts will require novel techniques and approaches, as this challenge dataset pushes the limits of current recommender systems in several dimensions:

  • Structure: The Yahoo! Music dataset comprises typed items, which are all linked together within a defined hierarchy.
  • Time: The Yahoo! Music dataset reports rating times, which allow session analysis of user activity and make it possible to infer the exact chronological order of ratings.
  • Scale: The Yahoo! Music dataset is very large, with as many as 625K items, far more than any similar public dataset, where usually only the number of users is large. It also includes significantly more ratings than the largest such dataset to date, the Netflix Prize dataset.
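
As an illustration of the session analysis mentioned under "Time", the sketch below groups one user's rating times into sessions by splitting wherever the gap between consecutive ratings exceeds a threshold. The time unit (minutes), the 30-minute default, and the function name `split_sessions` are assumptions for the example, not properties of the dataset.

```python
def split_sessions(timestamps, max_gap_minutes=30):
    """Group rating times (assumed to be in minutes) into sessions.

    A new session starts whenever the gap to the previous rating
    is larger than `max_gap_minutes`.
    """
    sessions = []
    for t in sorted(timestamps):
        if sessions and t - sessions[-1][-1] <= max_gap_minutes:
            sessions[-1].append(t)   # close enough: same session
        else:
            sessions.append([t])     # first rating or large gap: new session
    return sessions
```

Sorting first also recovers the chronological order of ratings, so the input does not need to be ordered.
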
Important Dates (tentative)
  • March 1, 2011: Registration opens
  • March 15, 2011: Competition begins
  • June 30, 2011: Competition ends
  • August 21, 2011: Workshop and winners presentation
Workshop
Following KDD-Cup’s tradition, we will hold a workshop during KDD’s opening day. During the workshop we will invite the competition leaders to present their techniques, together with several external speakers. Workshop details will be published later.
Organizing Committee
  • Gideon Dror: Yahoo! Labs, Israel
  • Yehuda Koren: Yahoo! Labs, Israel
  • Markus Weimer: Yahoo! Labs, US