One-Two-Syzygy

This repository contains materials for setting up a Kubernetes-based Syzygy instance. There is some terraform/terragrunt code in ./infrastructure to define a cluster (on AWS for now), and there is a helm chart in ./one-two-syzygy to configure the Syzygy instance.
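The layout is roughly as follows (approximate; see the repository itself for the authoritative structure):

.
├── infrastructure/      # terraform/terragrunt code defining the cluster
│   └── terraform/
│       ├── eks/         # AWS/EKS instances, one directory per cluster
│       └── aks/         # Azure/AKS instances
└── one-two-syzygy/      # helm chart wrapping zero-to-jupyterhub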

The one-two-syzygy helm chart is a thin wrapper around the zero-to-jupyterhub (z2jh) chart (installed as a dependency). It includes a Shibboleth service provider (SP)/proxy which allows a customized hub image to use Shibboleth for authentication. Chartpress is used to manage the helm repository and the necessary images:

  • hub: A minor modification of the z2jh hub image to include a remote-user authenticator

The intention for this project is that it should be able to run on any cloud provider, but to date only AWS/EKS and Azure/AKS have been tested. Pull requests and suggestions for this (and any other enhancements) are very welcome.

Usage

Terraform/Terragrunt

Terraform code to define a Kubernetes cluster is kept in provider-specific repositories for now: aws/eks, microsoft/aks.

Organizationally, we create instances using terragrunt so that state configuration can be shared.

AWS/EKS Kubernetes cluster with autoscaling and EFS

New instances are created by defining a terragrunt.hcl in a new directory of infrastructure/terraform/eks. The file is basically a collection of inputs for our eks terraform module, which does the heavy lifting of defining a VPC, a Kubernetes cluster and an EFS share. The inputs include things like your preferred region name, your worker group size, etc.; see the module variables file for details. ./infrastructure/prod/terragrunt.hcl defines an s3 bucket to hold the tfstate file for terragrunt. This should be customized to use an s3 bucket you control.
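A minimal terragrunt.hcl for EKS follows the same pattern as the AKS example further below. This is only a sketch: the module source is assumed to be the k8s-syzygy-eks repository and the input names shown are illustrative, so check them against the module variables file.

# ./infrastructure/terraform/eks/k8s1/terragrunt.hcl (sketch)
terraform {
    source = "git::https://github.com/pimsmath/k8s-syzygy-eks.git"
}

include {
    path = find_in_parent_folders()
}

inputs = {
    # Illustrative input names; see the module variables file for the real ones
    region           = "ca-central-1"
    desired_capacity = 3
}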

$ cd infrastructure/terraform/eks/k8s1
$ terragrunt init
$ terragrunt apply

The output of terragrunt apply (or terragrunt output) includes the cluster name and the filesystem ID of the EFS filesystem which was created. Both of these values will be needed by helm below.
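For example (the output variable names here are illustrative; run terragrunt output to see the real ones):

$ terragrunt output
cluster_name      = "syzygy-eks-qiGa7B01"
efs_filesystem_id = "fs-0000000"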

Use the AWS CLI to update your ~/.kube/config with the authentication details of your new cluster:

$ aws eks list-clusters
...
{
    "clusters": [
        "syzygy-eks-qiGa7B01"
    ]
}

$ aws eks update-kubeconfig --name=syzygy-eks-qiGa7B01

When your cluster has been defined you can proceed to the K8S Cluster section.

AKS

New instances are created by defining a terragrunt.hcl in a new directory of ./infrastructure/terraform/prod/:

# ./infrastructure/terraform/prod/aks/k8s2
terraform {
    source = "git::https://github.com/pimsmath/k8s-syzygy-aks.git"
}

include {
    path = find_in_parent_folders()
}

inputs = {
   prefix    = "jhub"
   location  = "canadacentral"
}

This file references ./infrastructure/prod/terragrunt.hcl, which defines an s3 bucket to hold the tfstate file. This should be customized to use an s3 bucket you control.

You will also need to define a few variables:

mv infrastructure/terraform/aks/k8s2/env.auto.tfvars.json.dist infrastructure/terraform/aks/k8s2/env.auto.tfvars.json

Edit the file infrastructure/terraform/aks/k8s2/env.auto.tfvars.json and fill in the missing variables. See https://github.com/pimsmath/k8s-syzygy-aks/blob/master/README.md for details on how to find out those variables.

$ terragrunt init
$ terragrunt apply

Once the above commands complete successfully, you can set up the credentials for your kubectl config.

# to get the resource group and name of the cluster
az aks list
az aks get-credentials --resource-group RESOURCE_GROUP --name CLUSTER_NAME

K8S Cluster

Once the K8S cluster is provisioned, check that you can interact with it:

$ kubectl version
Client Version: version.Info{Major:"1", Minor:"22", GitVersion:"v1.22.4", GitCommit:"b695d79d4f967c403a96986f1750a35eb75e75f1", GitTreeState:"clean", BuildDate:"2021-11-17T15:48:33Z", GoVersion:"go1.16.10", Compiler:"gc", Platform:"darwin/amd64"}
Server Version: version.Info{Major:"1", Minor:"22+", GitVersion:"v1.22.12-eks-6d3986b", GitCommit:"dade57bbf0e318a6492808cf6e276ea3956aecbf", GitTreeState:"clean", BuildDate:"2022-07-20T22:06:30Z", GoVersion:"go1.16.15", Compiler:"gc", Platform:"linux/amd64"}

$ kubectl get nodes
NAME                                          STATUS   ROLES    AGE   VERSION
ip-10-1-1-165.ca-central-1.compute.internal   Ready    <none>   18m   v1.17.9-eks-4c6976
ip-10-1-1-178.ca-central-1.compute.internal   Ready    <none>   18m   v1.17.9-eks-4c6976
ip-10-1-2-178.ca-central-1.compute.internal   Ready    <none>   18m   v1.17.9-eks-4c6976

If you don't see any worker nodes you may need to check your AWS IAM role configuration.
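On EKS, worker nodes join the cluster via the aws-auth ConfigMap in kube-system, which maps the worker IAM role to a Kubernetes identity. You can inspect it with:

$ kubectl -n kube-system get configmap aws-auth -o yaml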

Helm

Install the latest release of Helm.

$ helm version
version.BuildInfo{Version:"v3.9.2", GitCommit:"1addefbfe665c350f4daf868a9adc5600cc064fd", GitTreeState:"clean", GoVersion:"go1.18.4"}

AutoScaler

We deploy the autoscaler as a separate component in the kube-system namespace. It keeps track of which nodes are available and compares that to what has been requested. If it finds a mismatch it has permission to scale the number of nodes up or down (within limits). These operations require some special permissions, and setting them up properly can be tricky. Our configuration is specified in the irsa.tf file of our terraform module; basically it adds a new IAM role called cluster-autoscaling with the necessary permissions. See the AWS-IAM section of the autoscaler documentation for more details - we use the limited setup where the cluster must be explicitly set. The autoscaler looks for special tags on your resources to learn which nodes it can control (the tags are also assigned by the terraform module: https://github.com/pimsmath/k8s-syzygy-eks/blob/ba0f23703a9653135df4a124c66eaf604aa60c93/main.tf#L159-L170).

# autoscaler.yaml
awsRegion: ca-central-1

cloudConfigPath: ''

rbac:
  create: true
  serviceAccount:
    # This value should match local.k8s_service_account_name in locals.tf
    name: cluster-autoscaler
    annotations:
      # This value should match the ARN of the role created by module.iam_assumable_role_admin in irsa.tf
      eks.amazonaws.com/role-arn: "arn:aws:iam::USERIDHERE:role/syzygy-eks-f7LISI3z-cluster_autoscaler-role"

autoDiscovery:
  clusterName: "syzygy-eks-f7LISI3z"
  enabled: true

Install the chart

$ helm install cluster-autoscaler --namespace kube-system \
  autoscaler/cluster-autoscaler --values=autoscaler.yaml
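To confirm the autoscaler came up and can assume its IAM role, check its pod and logs (substitute the actual pod name reported by the first command):

$ kubectl -n kube-system get pods | grep cluster-autoscaler
$ kubectl -n kube-system logs <autoscaler-pod-name> --tail=20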

One-Two-Syzygy

Create a config.yaml at the root of this repository. A sample configuration file is included as ./config.yaml.sample. There are two dependent charts we will need (zero-to-jupyterhub and efs-provisioner).

$ helm repo add jupyterhub https://jupyterhub.github.io/helm-chart/
$ helm repo add jetstack https://charts.jetstack.io
$ helm repo add autoscaler https://kubernetes.github.io/autoscaler
$ helm repo add isotoma https://kubernetes.github.io/charts
$ helm repo update
$ cd one-two-syzygy && helm dependency update && cd ..

z2jh options

See the z2jh configuration documentation. Since z2jh is a dependency of this chart, remember to wrap these options in a jupyterhub block inside config.yaml, e.g.

jupyterhub:
  proxy:
    secretToken: "output of `openssl rand -hex 32`"
    service:
      type: ClusterIP

efs-provisioner options

See the efs-provisioner chart for details

efs-provisioner:
  efsProvisioner:
    efsFileSystemId: fs-0000000
    awsRegion: us-west-2
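If you want single-user home directories to be provisioned from EFS, you can point z2jh at the storage class created by the provisioner. This is only a sketch: aws-efs is the chart's usual default class name, so adjust it to whatever your values actually create.

jupyterhub:
  singleuser:
    storage:
      dynamic:
        storageClass: aws-efs   # assumption: default class name from efs-provisioner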

one-two-syzygy options

For the one-two-syzygy chart you will need the following (see the sketch after this list):

  • shib.acm.arn: The ARN of your ACM certificate as a string
  • shib.spcert: The plain text of your SP certificate
  • shib.spkey: The plain text of your SP key
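In config.yaml this might look like the following sketch (placeholder values; the key nesting is assumed from the names above):

shib:
  acm:
    arn: "arn:aws:acm:REGION:ACCOUNT:certificate/CERTIFICATE-ID"
  spcert: |
    -----BEGIN CERTIFICATE-----
    ...
    -----END CERTIFICATE-----
  spkey: |
    -----BEGIN PRIVATE KEY-----
    ...
    -----END PRIVATE KEY-----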

For the shibboleth configuration you will need some configuration from the identity provider. Typically you can specify the service configuration with the following 3 files: shibboleth2.xml, attribute-map.xml and idp-metadata.xml. These are included for the sp deployment as a ConfigMap with the following keys

  • shib.shibboleth2xml
  • shib.idpmetadataxml
  • shib.attributemapxml

Helm will look for these in one-two-syzygy/files/etc/shibboleth/ and they can be overridden with the usual helm tricks (--set-file or config.yaml). Default values are given, but these are specific to the UBC IdP, so you will almost certainly want to override them.
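For example, to supply your own IdP files at install time (the local file names here are hypothetical):

$ helm upgrade --install syzygy one-two-syzygy --namespace syzygy \
    -f config.yaml \
    --set-file shib.shibboleth2xml=./shibboleth2.xml \
    --set-file shib.idpmetadataxml=./idp-metadata.xml \
    --set-file shib.attributemapxml=./attribute-map.xml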

The apache configuration for the sp is given as another ConfigMap with the keys being the apache config files usually kept under /etc/httpd/conf.d/*.conf. The actual web content can be specified as a ConfigMap (with structure corresponding to the conf.d/*.conf files).

$ kubectl create namespace syzygy
$ helm upgrade --cleanup-on-fail --wait --install syzygy one-two-syzygy \
  --namespace syzygy --create-namespace -f config.yaml

If everything has worked, you can extract the address of the public SP with

$ kubectl -n syzygy get svc/sp

Depending on your provider this may be a DNS entry or an IP address, and you will need to publish it through your DNS service.
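On AWS, for example, the SP service is typically exposed through an ELB hostname, which you can extract directly (assuming the service is of type LoadBalancer):

$ kubectl -n syzygy get svc sp \
    -o jsonpath='{.status.loadBalancer.ingress[0].hostname}'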

When you are done with the cluster and ready to recover the resources, you will want to do something like the following (N.B. this may delete all files, including user data!):

$ cd one-two-syzygy
$ helm --namespace=syzygy del syzygy

$ cd infrastructure/terraform/eks/k8s1
$ terragrunt destroy

Development

Try the instructions for z2jh. If you already have a kubernetes cluster up and running you should only need

$ python3 -m venv .
$ source bin/activate
$ python3 -m pip install -r dev-requirements.txt

When you make changes in the images or templates directory, commit them and run chartpress:

# To build new images and update Chart.yaml/values.yaml tags
$ chartpress

# To push tagged images to Dockerhub
$ chartpress --push

# To publish the repository to our helm repository
$ chartpress --publish

If you want to make local modifications to the underlying terraform code, you can feed these to terragrunt via the --terragrunt-source option. There are some subtleties when doing this, but something like the following should work if you have your modules in e.g. ~/terraform-modules/k8s-syzygy-eks:

  $ terragrunt apply \
    --terragrunt-source=../../../../../terraform-modules//k8s-syzygy-eks

Helm Repository

Releases of this chart are published via chartpress to the gh-pages branch, which serves as a Helm repository.
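To consume released charts from that repository, you can add it to helm. The URL below is the usual GitHub Pages address for this repository and is an assumption; adjust it if the pages site lives elsewhere.

$ helm repo add one-two-syzygy https://pimsmath.github.io/one-two-syzygy/
$ helm repo update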

Tear Down

To tear everything down, run the following command:

terragrunt destroy-all
