TY - GEN
T1 - Effective crowd expertise modeling via cross domain sparsity and uncertainty reduction
AU - Xie, Sihong
AU - Hu, Qingbo
AU - Shao, Weixiang
AU - Zhang, Jingyuan
AU - Gao, Jing
AU - Fan, Wei
AU - Yu, Philip S.
N1 - Publisher Copyright: Copyright © by SIAM.
PY - 2016
Y1 - 2016
N2 - Characterizations of crowd expertise is vital to online applications where the crowd plays a central role, such as StackExchange for question-answering and Linkedln as a workforce market. With accurately estimated worker expertise, new jobs can be assigned to the right workers more effectively and efficiently. Most existing methods solely rely on the sparse worker-job interactions, leading to poorly estimated expertise that does not generalize well to a large amount of unseen jobs. Though transfer learning can utilize external domains to mitigate the sparsity, the auxiliary domains can themselves suffer from incomplete information, leading to inferior performance. There is a lack of principled framework to handle the sparse and incomplete data to achieve better expertise modeling. Based on multitask learning, we propose a framework that uses the knowledge learned from one domain to gradually resolve the data sparsity or incompleteness problem in the other alternatively. Experimental results on several question-answering datasets demonstrate the effectiveness and convergence of the iterative framework.
AB - Characterizations of crowd expertise is vital to online applications where the crowd plays a central role, such as StackExchange for question-answering and Linkedln as a workforce market. With accurately estimated worker expertise, new jobs can be assigned to the right workers more effectively and efficiently. Most existing methods solely rely on the sparse worker-job interactions, leading to poorly estimated expertise that does not generalize well to a large amount of unseen jobs. Though transfer learning can utilize external domains to mitigate the sparsity, the auxiliary domains can themselves suffer from incomplete information, leading to inferior performance. There is a lack of principled framework to handle the sparse and incomplete data to achieve better expertise modeling. Based on multitask learning, we propose a framework that uses the knowledge learned from one domain to gradually resolve the data sparsity or incompleteness problem in the other alternatively. Experimental results on several question-answering datasets demonstrate the effectiveness and convergence of the iterative framework.
UR - https://www.scopus.com/pages/publications/84991710654
U2 - 10.1137/1.9781611974348.73
DO - 10.1137/1.9781611974348.73
M3 - Conference contribution
T3 - 16th SIAM International Conference on Data Mining 2016, SDM 2016
SP - 648
EP - 656
BT - 16th SIAM International Conference on Data Mining 2016, SDM 2016
A2 - Venkatasubramanian, Sanjay Chawla
A2 - Meira, Wagner
PB - Society for Industrial and Applied Mathematics Publications
T2 - 16th SIAM International Conference on Data Mining 2016, SDM 2016
Y2 - 5 May 2016 through 7 May 2016
ER -