项目作者： Sam3448

项目描述：
Word2Vec and Pseudo Relevance Feedback for Elasticsearch in IR

高级语言： Java

项目主页：

项目地址: git://github.com/Sam3448/PseudoFeedback.git

创建时间： 2018-02-14T18:50:39Z
项目社区：https://github.com/Sam3448/PseudoFeedback
开源协议：
下载

PseudoFeedback

Pseudo Feedback for Elasticsearch in Information Retrieval.

Basic structure

This work focuses on query expansion for first step, and pseudo feedback for second.

For query expansion, I experiments on both locally trained word embedding (MT bi-lingual English source) and FastText pre-trained Wiki word embedding. The results get better when using pre-trained embedding.

For retrieval part, I also indexed and searched MT and GOLD (manually) translated data. Though vocabulary matches for MT data, GOLD works better, with 3% better of miss probabillity.

Further experiments will be conducted for Pseudo Feedback.


