This tool is a command line interface to manage the web archive setup at the German National Library (DNB).
It's tasks are:
- get meta-data of archived webpages from the catalog graph
- retrieve archive files from the repository
- execute the indexers to make the archived webpages available
It is still in prototype state and under construction.
This tool gets archived webpages from the DNB catalog graph.
It queries for items (snapshots) that are part of (dcterms:isPartOf
) online resource media.
This projects uses poetry.
$ poetry install
First you need to set the environment variables:
$ source default.env
List IDNs of the snapshots
$ poetry run wacli list