Data Science course developed at the Düsseldorf University of Applied Sciences (HSD) and the Centre for Digitalization and Digitality (ZDD).
For the rendered book use this link!
A PDF version of this book is available on Zenodo.
This book and all material used to create the book are freely available and can be used according to the CC license (BY-NC-SA 4.0). If you use (parts of) this book for your work, please cite the book as:
Huber, F. (2024).
Hands-on Introduction to Data Science with Python.
v0.21, 2024, Zenodo. https://doi.org/10.5281/zenodo.10074474
We recommend creating a new environment for this course that contains the Python libraries used in the live coding sessions. You can either download the environment.yml file from this repository, or clone the repository using:
git clone https://github.com/florian-huber/data_science_course.git
Then, in the folder containing the environment.yml file, run:
conda env create -f environment.yml
This should create a Python 3.12 environment with the packages listed in the YAML file.
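To confirm that the new environment is usable, a small check like the following can be run from inside the activated environment. This is only a sketch: the function name `check_environment` is hypothetical, and the default package names (numpy, pandas, matplotlib) are assumptions for illustration, since the actual list lives in environment.yml.

```python
import importlib.util
import sys


def check_environment(expected_python=(3, 12),
                      packages=("numpy", "pandas", "matplotlib")):
    """Return a list of problems found; an empty list means all checks passed.

    The default package names are assumptions for illustration only.
    Replace them with top-level import names of packages from environment.yml.
    """
    problems = []

    # Compare only the major and minor version, e.g. (3, 12).
    found = tuple(sys.version_info[:2])
    if found != tuple(expected_python):
        problems.append(
            f"Python {found[0]}.{found[1]} found, "
            f"expected {expected_python[0]}.{expected_python[1]}"
        )

    # find_spec returns None when a top-level module cannot be located.
    for name in packages:
        if importlib.util.find_spec(name) is None:
            problems.append(f"package not importable: {name}")

    return problems


if __name__ == "__main__":
    for line in check_environment() or ["environment looks fine"]:
        print(line)
```

If the script reports a missing package, the environment was most likely not activated or not fully created.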
The classic conda environment solver can be slow, so it may make sense to switch to a faster alternative: either a faster solver within conda (see https://www.anaconda.com/blog/a-faster-conda-for-a-growing-community) or mamba.
This book is meant to be built with Jupyter Book. Once Jupyter Book is installed, run the following terminal command from the cloned folder:
jupyter-book build .
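After a successful build, the HTML version can be opened locally. The sketch below assumes that the HTML builder writes its start page to `_build/html/index.html` inside the book folder. The helper names `built_index` and `open_book` are hypothetical and not part of Jupyter Book.

```python
from pathlib import Path
import webbrowser


def built_index(book_dir="."):
    """Return the assumed location of the built start page.

    Assumption: the HTML build output lands in <book_dir>/_build/html/.
    """
    return Path(book_dir).resolve() / "_build" / "html" / "index.html"


def open_book(book_dir="."):
    """Open the built book in the default browser.

    Returns False (and opens nothing) if the start page does not exist yet.
    """
    index = built_index(book_dir)
    if not index.is_file():
        return False
    webbrowser.open(index.as_uri())
    return True
```

Calling `open_book(".")` from the cloned folder returns `False` when the book has not been built yet, which is a quick hint that the build step should be run first.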
One option for creating a PDF of the entire book is:
jupyter-book build . --builder pdflatex