Tentabrella/spark_ml

Learning PySpark

This repo is based on the book Learning PySpark and is used for self-study.

Contains:

  • Basic DataFrame manipulation
  • Basic data cleaning
  • Basic use of MLlib, with one example project
