项目作者: cleanzr

项目描述 :
Cora data set for Entity Resolution
高级语言: R
项目地址: git://github.com/cleanzr/cora.git
创建时间: 2018-05-25T23:53:20Z
项目社区:https://github.com/cleanzr/cora

开源协议:

下载


cora

Package Description

This package provides cleaned and formatted data for for entity resolution (record linkage or de-duplication) from the Cora data set. The Cora data set contains 1879 records with citation information on published papers, which includes features such as titles, authors, year published, and other information. The data set has a respective “gold” data set that provides information on which records are a match based on the id.

Package Installation

  1. # Install the development version from GitHub
  2. devtools::install_github(“resteorts/cora”)