(debug)=

# Debugging

Sometimes your JupyterHub deployment doesn't behave the way you'd expect.
This section provides some tips on debugging and fixing common problems.

## Debugging commands

In order to debug your JupyterHub deployment, you need to be able to inspect
the state of the resources it uses. The following are a few common commands
for debugging.

**Real world scenario:** Let's say you've got a JupyterHub deployed, and a user
tells you that they are experiencing strange behavior. Let's take a look
at our deployment to figure out what is going on.

```{note}
For our real world scenario, we'll assume that our Kubernetes namespace
is called `jhub`. Your namespace may be called something different.
```
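
If you'd rather not pass `--namespace=jhub` to every command, you can make it
the default namespace for your current kubectl context. This is an optional
convenience and modifies your local kubeconfig:

```
kubectl config set-context --current --namespace=jhub
```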
### `kubectl get pod`

To list all pods in your Kubernetes deployment:

```
kubectl --namespace=jhub get pod
```

This will output a list of all pods being used in the deployment.

**Real world scenario:** In our case, we see two pods for the JupyterHub
infrastructure (`hub` and `proxy`) as well as one user
pod that was created when somebody logged in to the JupyterHub.

Here's an example of the output:

```
$ kubectl --namespace=jhub get pod
NAME                     READY     STATUS         RESTARTS   AGE
hub-3311438805-xnfvp     1/1       Running        0          2m
jupyter-choldgraf        0/1       ErrImagePull   0          25s
proxy-1227971824-mn2wd   1/1       Running        0          5h
```

Here we can see the two JupyterHub pods, as well as a single user pod. Note
that all user pods begin with `jupyter-`.

In particular, keep an eye on the `STATUS` column. If a given
pod shows anything other than `Running`, something may be wrong.
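
If a pod keeps changing state, it can help to watch the list update live
rather than re-running the command. The `--watch` flag streams changes until
you interrupt it with `Ctrl-C`:

```
kubectl --namespace=jhub get pod --watch
```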
In this case, we can see that our user's pod is in the `ErrImagePull` state.
This generally means that there's something wrong with the Docker image
defined in the `singleuser` section of our helm chart config. Let's dig further...

### `kubectl describe pod`

To see more detail about the state of a specific pod, use the following
command:

```
kubectl --namespace=jhub describe pod <POD_NAME>
```

This will output several pieces of information, including configuration and
settings for the pod. The final section is a list of recent
events. These can be particularly informative, as an error will often
show up here.
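
You can also list recent events for the whole namespace in one place, which is
handy when you don't yet know which pod is misbehaving. Sorting by timestamp
puts the newest events last:

```
kubectl --namespace=jhub get events --sort-by='.lastTimestamp'
```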
**Real world scenario:** In our case, one of the lines in the events section
displays an error:

```
$ kubectl --namespace=jhub describe pod jupyter-choldgraf
...
2m 52s 4 kubelet, gke-jhubtest-default-pool-52c36683-jv6r spec.containers{notebook} Warning Failed Failed to pull image "jupyter/scipy-notebook:v0.4": rpc error: code = 2 desc = Error response from daemon: {"message":"manifest for jupyter/scipy-notebook:v0.4 not found"}
...
```

It seems there is indeed something wrong with the Docker image. Let's confirm
this by getting another view on the events that have transpired in the pod.

### `kubectl logs`

If you only want to see the latest logs for a pod, use the following command:

```
kubectl --namespace=jhub logs <POD_NAME>
```

This will show you the logs from the pod, which often contain useful
information about what is going wrong. Scan these logs
to see if anything is generating an error.
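
Two flags are worth knowing here: `--follow` streams new log lines as they
arrive, and `--previous` shows logs from the previous container instance,
which helps when a pod is crash-looping:

```
kubectl --namespace=jhub logs --follow <POD_NAME>
kubectl --namespace=jhub logs --previous <POD_NAME>
```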
**Real world scenario:** In our case, we get this line back:

```
$ kubectl --namespace=jhub logs jupyter-choldgraf
Error from server (BadRequest): container "notebook" in pod "jupyter-choldgraf" is waiting to start: trying and failing to pull image
```

Now we are sure that something is wrong with the Docker image we've specified.
Let's check our `config.yaml` file for the section where we specify the user's
Docker image. Here we see our problem:

```
singleuser:
  image:
    name: jupyter/scipy-notebook
```

We haven't specified a `tag` for our Docker image! Not specifying a tag
will cause it to default to `v0.4`, which isn't what we want and is causing
the pod to fail.

To fix this, let's add a tag to our `config.yaml` file:

```
singleuser:
  image:
    name: jupyter/scipy-notebook
    tag: ae885c0a6226
```
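
Before upgrading, you can sanity-check that the tag really exists by pulling
the image locally (this assumes you have Docker installed on your machine):

```
docker pull jupyter/scipy-notebook:ae885c0a6226
```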
Then run a helm upgrade:

```
helm upgrade --cleanup-on-fail jhub jupyterhub/jupyterhub --version=v0.6 -f config.yaml
```

where `jhub` is the helm release name (substitute the release name that you
chose during setup).

```{note}
Depending on the size of the Docker image, this may take a while to complete.
```
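
To wait until the upgrade has finished rolling out, you can track the hub's
Deployment status. This assumes the Deployment is named `hub`, as it is in
recent versions of the chart:

```
kubectl --namespace=jhub rollout status deployment/hub
```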
Right after you run this command, let's once again list the pods in our
deployment:

```
$ kubectl --namespace=jhub get pod
NAME                                READY     STATUS              RESTARTS   AGE
hub-2653507799-r7wf8                0/1       ContainerCreating   0          31s
hub-3311438805-xnfvp                1/1       Terminating         0          14m
jupyter-choldgraf                   0/1       ImagePullBackOff    0          12m
proxy-deployment-1227971824-mn2wd   1/1       Running             0          5h
```

Here we can see one `hub` pod being destroyed, and another (based
on the upgraded helm chart) being created. We also see our broken user pod,
which will not be deleted automatically. Let's manually delete it so a newer
working pod can be started:

```
kubectl --namespace=jhub delete pod jupyter-choldgraf
```

Finally, we'll tell our user to log back in to the JupyterHub. Then let's
list our running pods once again:

```
$ kubectl --namespace=jhub get pod
NAME                                READY     STATUS    RESTARTS   AGE
hub-2653507799-r7wf8                1/1       Running   0          3m
jupyter-choldgraf                   1/1       Running   0          18s
proxy-deployment-1227971824-mn2wd   1/1       Running   0          5h
```

And now we see that we have a running user pod!

Note that many debugging situations are not as straightforward as this one.
It will take some time before you get a feel for the errors that Kubernetes
may throw at you, and how these are tied to your configuration files.

## Troubleshooting examples

The following sections contain case studies that illustrate some of the
more common bugs and gotchas you may experience when using JupyterHub with
Kubernetes.

### Hub fails to start

**Symptom:** following `kubectl get pod`, the `hub` pod is in an
`Error` or `CrashLoopBackOff` state, or appears to be running but accessing
the JupyterHub website returns an error message in the browser.

**Investigating:** the output of `kubectl --namespace=jhub logs
hub...` shows something like:

```
  File "/usr/local/lib/python3.5/dist-packages/jupyterhub/proxy.py", line 589, in get_all_routes
    resp = yield self.api_request('', client=client)
tornado.httpclient.HTTPError: HTTP 403: Forbidden
```
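
Since the hub pod's name includes a random suffix, it can be easier to select
it by label instead of copying the full name. This assumes the chart's usual
`component=hub` label on the hub pod:

```
kubectl --namespace=jhub logs -l component=hub
```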
**Diagnosis:** This likely means that the `hub` pod cannot
communicate with the proxy pod's API, often because of a problem with the
`secretToken` that was set in `config.yaml`.

**Fix:** Follow these steps:

1. Create a secret token:

   ```
   openssl rand -hex 32
   ```

2. Add the token to `config.yaml` like so:

   ```
   proxy:
     secretToken: '<output of `openssl rand -hex 32`>'
   ```

3. Redeploy the helm chart:

   ```
   helm upgrade --cleanup-on-fail jhub jupyterhub/jupyterhub -f config.yaml
   ```
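
If you'd like to generate the token and the matching YAML snippet in one step,
a small shell sketch like this can help. It only prints the snippet; paste it
into `config.yaml` yourself:

```
# Print a proxy.secretToken snippet with a freshly generated token
echo "proxy:"
echo "  secretToken: '$(openssl rand -hex 32)'"
```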