[Air-L] ACM WSDM Cup Tasks Announced

Ugur Kursuncu ugurkursuncu at gmail.com
Mon Oct 4 10:31:51 PDT 2021


WSDM Cup 2022 is a machine learning competition-style event co-located with the leading WSDM (Web Search and Data Mining) conference.  
 
Task Competitions Start: October 15, 2021
Task Competitions End: January 24, 2022
Winner Reports Submitted: February 15, 2022
WSDM Cup’22 Conference Event: February 21-25, 2022
 
We are excited to announce the following confirmed tasks for the WSDM Cup 2022. 
 
Task sponsor: IQIYI
Task focus: User Retention Score Prediction
 
Task abstract: iQIYI is the world's leading movie and video streaming platform, with nearly 8 billion hours spent on iQIYI app each month and over 500 million monthly active users <https://urldefense.proofpoint.com/v2/url?u=https-3A__en.wikipedia.org_wiki_Monthly-5Factive-5Fusers&d=DwMFaQ&c=ncDTmphkJTvjIDPh0hpF_w&r=f__6UenGa2tD_QmIUP1F_w&m=7Ke-27pfRyiaZZwEhgVjiaIHYKAtEVAs57LduBOzf5s&s=pAi35JLe0PxOO_wll019Vz3CLqhmPtKlFWAKFHV6RXo&e=>. It features highly popular original content, as well as a comprehensive library of other professionally-produced content and user-generated content.  iQIYI app uses deep-learning AI algorithms and massive user data, to produce content that caters to various user tastes, and to deliver superior entertainment experience to them. User retention is a key indicator to measure the users' satisfaction. It uses a retention score for the next N days to evaluate the retention. For example, a user having retention score 3 for the next 7 days means this user would launch the iQIYI app in 3 separate days during the period. It is very challenging to predict the retention score. It not only depends on the individual, but also on the trends of the entertainment, e.g., the number of daily users fluctuates severely when some blockbuster original contents are released. The task’s goal is to predict user retention scores using deep models.
 
-----

Task sponsor: Intel / Amazon
Task focus: Temporal Link Prediction

Task abstract: Real world datasets can often be expressed as graphs, with entities as nodes and interactions as edges.  Examples include user-user interaction in social networks, user-item interactions in recommender systems, etc.  Moreover, these graphs are often in practice temporal, with new edges coming in with timestamps.  Contrary to link prediction which asks if an edge exists between two nodes on a partially observed graph, temporal link prediction asks if an edge will exist between two nodes within a given time span.  It is more useful than traditional link prediction as one can then build multiple applications around the model, such as forecasting the demand of customers in E-commerce, or forecasting what event will happen in a social network, etc.  We are expecting a single model that can handle two kinds of data simultaneously: a dynamic event graph with entities as nodes and events as edges, and a user-item graph with users and items as nodes and various interactions as edges.  The task will be predicting whether an edge of a given type will exist between two given nodes before a given timestamp.
 
-----
 
Task sponsor: University of Amsterdam / University of Massachusetts Amherst / Amazon
Task focus: Cross-market Recommendation
 
Task abstract: E-commerce companies often operate across markets; for instance Amazon has expanded their operations and sales to 18 markets (i.e. countries) around the globe. The cross-market recommendation concerns the problem of recommending relevant products to users in a target market (e.g., a resource-scarce market) by leveraging data from similar high-resource markets, e.g. using data from the U.S. market to improve recommendations in a target market. The key challenge, however, is that data, such as user interaction data with products (clicks, purchases, reviews), convey certain biases of the individual markets. Therefore, the algorithms trained on a source market are not necessarily effective in a different target market.  Despite its significance, small progress has been made in cross-market recommendation, mainly due to a lack of experimental data for the researchers. In this WSDM Cup challenge, we provide user purchase and rating data on various markets, enriched with review data in different languages, with a considerable number of shared item subsets. The goal is to improve individual recommendation systems in these target markets by leveraging data from similar auxiliary markets.


—
Ugur Kursuncu, Ph.D.
Publicity Co-Chair, WSDM 2022

Assistant Professor
Georgia State University





More information about the Air-L mailing list