Skip to content

The Project CCHAIN dataset is a validated, open-sourced linked dataset measuring 20 years (2003-2022) of health, climate, environment, and socioeconomic variables at the barangay (village) level across 12 Philippine cities.

License

Notifications You must be signed in to change notification settings

thinkingmachines/project-cchain

Repository files navigation

The Project CCHAIN Dataset

Python Code style: black



📜 Description

The Project CCHAIN dataset is a validated, open-sourced linked dataset measuring 20 years (2003-2022) of climate, environmental, socioeconomic, and health variables at the barangay (village) level across 12 Philippine cities. We provide here the code used to generate the barangay level tabular extracts from our various geospatial sources.

Please see our main documentation page for more details

🗄 Notebooks used

  1. 01-aoi_generation - Prepares official administrative boundaries for the target cities.
  2. 02-dataset_alignment - Contains notebooks that process various data sources (in vector, raster, and tabular formats) into consistent barangay-level tabular extracts.
  3. 03-baseline_model - Contains notebooks for the sample outbreak detection model.
  4. 04-analytics - Obtains insights from the produced datasets in the form of visualizations

📄 Licensing

This repository is under Apache 2.0, which allows all developers to freely use, modify, and share software on an “as-is” basis, meaning without any warranties. When making changes, developers are required to document changes to the original code.

⚙️ Local Setup for Development

This repo assumes the use of conda for simplicity in installing GDAL.

Requirements

  1. Python 3.9
  2. make
  3. conda

🐍 One-time Set-up

Run this the very first time you are setting-up the project on a machine to set-up a local Python environment for this project.

  1. Install miniconda for your environment if you don't have it yet.
wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh
bash Miniconda3-latest-Linux-x86_64.sh
  1. Create a local conda env and activate it. This will create a conda env folder in your project directory.
make conda-env
conda activate <env name>
  1. Run the one-time set-up make command.
make setup

🐍 Testing

To run automated tests, simply run make test.

📦 Dependencies

Over the course of development, you will likely introduce new library dependencies. This repo uses pip-tools to manage the python dependencies.

There are two main files involved:

  • requirements.in - contains high level requirements; this is what we should edit when adding/removing libraries
  • requirements.txt - contains exact list of python libraries (including depdenencies of the main libraries) your environment needs to follow to run the repo code; compiled from requirements.in

When you add new python libs, please do the ff:

  1. Add the library to the requirements.in file. You may optionally pin the version if you need a particular version of the library.

  2. Run make requirements to compile a new version of the requirements.txt file and update your python env.

  3. Commit both the requirements.in and requirements.txt files so other devs can get the updated list of project requirements.

Note: When you are the one updating your python env to follow library changes from other devs (reflected through an updated requirements.txt file), simply run pip-sync requirements.txt

About

The Project CCHAIN dataset is a validated, open-sourced linked dataset measuring 20 years (2003-2022) of health, climate, environment, and socioeconomic variables at the barangay (village) level across 12 Philippine cities.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages