Skip to content

Registry of published datasets in the Datasets Project

Notifications You must be signed in to change notification settings

cmsdroff/registry

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

A tabular Data Package of Data Packages including the core datasets in the Frictionless Data Project.

Currently two registers maintained here:

  • catalog-list.csv - catalog of all the community data packages we can find (at the moment largely those found on github via automatic search)
  • core-list.csv - "Core" Datasets (hand-maintained)

Preparation

Catalog List

The main Catalog list is scraped using the python script scripts/scrape.py:

# install deps
pip install -r scripts/requirements.txt
# scrape data
python scripts/scrape.py

Note we'd prefer not to scrape and use the API but we can't do the relevant query via the API - see http://developer.github.com/changes/2013-10-18-new-code-search-requirements/

Core List

To add a dataset please add it to the core-list.csv - we recommend fork and pull.

Discussion of proposals for new datasets and for incorporation of prepared datasets takes place in the issues.

To propose a new dataset for inclusion, please create a new issue.

About

Registry of published datasets in the Datasets Project

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%