Skip to content

[EN] Collect dataset of Armenian ancient cultural places from Pleiades project #30

@ivbeg

Description

@ivbeg

Goal

The goal is to collect a dataset of ancient Armenian cultural places.

Tasks

There is a database of ancient places called Pleiades https://pleiades.stoa.org. It has more than 40k
It's possible to extract data related to Armenia and to create a dataset.

Tasks are:

  1. Download the full dump of the Pleiades database https://pleiades.stoa.org/downloads
  2. Find objects related to Armenia in the Pleiades database. Keywords are Urartu, Armenia, Armenian, and maybe other words too.
  3. Extract these objects from the data dump.
  4. Save them as CSV and JSON datasets.
  5. Publish results in the Github repository.

Context

Ancient places are essential to understanding the country and the nation's culture. Pleiades is one of the essential digital humanities projects with data related to antiques. This data will help to create more Armenian-related cultural projects.

Requirements

A public GitHub repository should be created to store and publish the code and possibly the data under one of the free and open licenses, such as Creative Commons or MIT. Please make the code as reusable and maintainable as possible and provide it with some instructions and requirements.

Wishes

It would be best to comment on your code so that even beginners can understand what it does.

Resources

Prepared by

The Open Data Armenia team prepared this task.

Metadata

Metadata

Assignees

No one assigned

    Labels

    analysisTasks that require data analysis skillsextractionTask that require data extraction (scraping) skillstopic-cultureTasks dedicatated Armenian culture, language and history

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions