generate institution acronym from homepage #125

@VladimirAlexiev

(Follow-up to #105)
This query

PREFIX dbprop: <https://dbpedia.org/property/>
PREFIX soa: <https://semopenalex.org/ontology/>
select (count(?x) as ?institutions) (count(?page) as ?pages) (count(?acro) as ?acronyms) {
  ?x a soa:Institution
  optional {?x <http://xmlns.com/foaf/0.1/homepage> ?page}
  optional {?x dbprop:acronym ?acro}
}

shows that SOA has 111k institutions and 109k homepages, but only 47k acronyms.

I don't know whether there are official rules on how acronyms are formed.
But looking at the homepage values, it seems quite feasible to extract good acronyms from them.

You need to find the "business-meaningful" word in the homepage URL:

  • e.g. "www", "edu" and "fr" are not business-meaningful.
  • which words count as meaningful will depend a bit on the TLD (i.e. the country)
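
The idea above could be sketched roughly as follows in Python. This is only an illustration, not a proposed implementation: the function name `acronym_candidate` and the `GENERIC_LABELS` set are hypothetical, and the set is a small hand-picked assumption that would need extending per country (a real version might consult the Public Suffix List instead).

```python
from typing import Optional
from urllib.parse import urlparse

# Assumption: a small, hand-picked and incomplete set of host labels that
# carry no "business meaning". Second-level labels such as "ac" (as in
# ".ac.uk") are country-specific, so this set would need tuning per TLD.
GENERIC_LABELS = {"www", "web", "edu", "ac", "co", "com", "org", "net", "gov"}


def acronym_candidate(homepage: str) -> Optional[str]:
    """Return an upper-cased acronym candidate taken from a homepage URL.

    Returns None when no host can be parsed or no meaningful label remains.
    """
    host = urlparse(homepage).hostname
    if host is None:
        return None
    labels = host.lower().split(".")
    # Drop the last label: it is the TLD (e.g. "fr", "edu", "uk").
    labels = labels[:-1]
    meaningful = [label for label in labels if label not in GENERIC_LABELS]
    if not meaningful:
        return None
    # Heuristic: the right-most remaining label is usually the institution's
    # own name; anything further left tends to be a sub-site.
    return meaningful[-1].upper()


print(acronym_candidate("https://www.mit.edu/"))
print(acronym_candidate("http://www.ox.ac.uk"))
print(acronym_candidate("https://www.cnrs.fr/"))
```

The candidates would still need checking against the 47k existing acronyms before being trusted, since many homepages use a full or abbreviated name rather than the acronym itself.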
