Supervised by: Ministry of Culture of PRC

Sponsored by:National Library of China
  Library Society of China

ISSN 1001-8867    CN 11-2746/G2

Investigating Weak Supervision in Deep Ranking

Abstract: A number of deep neural networks have beenproposed to improve the performance of documentranking in information retrieval studies. However,the training processes of these models usually needa large scale of labeled data, leading to data shortagebecoming a major hindrance to the improvement ofneural ranking models’ performances. Recently, severalweakly supervised methods have been proposed toaddress this challenge with the help of heuristics or users’interaction in the Search Engine Result Pages (SERPs)to generate weak relevance labels. In this work, weadopt two kinds of weakly supervised relevance, BM25-based relevance and click model-based relevance, andmake a deep investigation into their differences in thetraining of neural ranking models. Experimental resultsshow that BM25-based relevance helps models capturemore exact matching signals, while click model-basedrelevance enhances the rankings of documents that maybe preferred by users. We further proposed a cascaderanking framework to combine the two weakly supervisedrelevance, which significantly promotes the rankingperformance of neural ranking models and outperformsthe best result in the last NTCIR-13 We Want Web (WWW)task. This work reveals the potential of constructing betterdocument retrieval systems based on multiple kinds of weak relevance signals.

Keywords: document ranking, ad hoc retrieval, neuralranking model, weak supervision.


怀集县| 灯塔市| 休宁县| 弥渡县| 雷波县| 鄂温| 霍山县| 屏山县| 延吉市| 田阳县| 旬阳县| 石家庄市| 丰县| 密山市| 姜堰市| 垣曲县| 朝阳区| 手机| 策勒县| 怀来县| 夏河县| 赤城县| 永寿县| 工布江达县| 邵武市| 德安县| 潞城市| 中宁县| 津市市| 靖安县| 黔西| 镇安县| 通州市| 镇原县| 武定县| 凤凰县| 大余县| 贵南县| 平果县| 太湖县| 万山特区|