Skip to content

re-index DBpedias by not indexing disambiguation pages #34

@m1ci

Description

@m1ci

Currently, we index any label-URI pairs, however some pairs point to disambiguation pages. This issue freme-project/e-Entity#49 a results from this.
We need to re-index DBpedias by removing the disambiguation pages.

To distinguish whether URL is disambiguation page or not we can use the DBpedia disambiguation pages dataset http://downloads.dbpedia.org/2015-04/core/disambiguations_en.nt.bz2

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions