Tansu Dasli edited this page Sep 18, 2023 · 25 revisions

general ML steps

  • Gathering data (sampling)
  • EDA (exploratory data analysis)
  • Preprocessing
    • handling missing, wrong, null, and duplicate values
    • feature scaling (standardization vs. normalization)
    • feature selection
    • feature extraction (PCA, SVD)
    • encoding (dummy variables for categorical fields)
    • discretization (binning continuous fields)
  • Sampling (train-test split)
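
A few of the preprocessing and sampling steps above can be sketched in plain Python. This is a minimal illustration, not a reference implementation: the function names, the toy `ages` and `colors` data, the bin edges, and the 25% test ratio are all assumptions made for the example, and in practice a library such as scikit-learn or pandas would usually do this work.

```python
import random
import statistics

def standardize(values):
    """Standardization (z-score): result has mean 0 and population std 1.
    Assumes the values are not all identical (std would be 0)."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)
    return [(v - mean) / std for v in values]

def normalize(values):
    """Normalization (min-max): rescales values into the range [0, 1].
    Assumes max(values) > min(values)."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]

def one_hot(values):
    """Dummy encoding: one 0/1 column per distinct category, in sorted order."""
    categories = sorted(set(values))
    return [[1 if v == c else 0 for c in categories] for v in values]

def bin_values(values, edges):
    """Discretization: bin index = number of edges the value is at or above."""
    return [sum(v >= e for e in edges) for v in values]

def train_test_split(rows, test_ratio=0.25, seed=0):
    """Shuffle a copy of the rows, then hold out the first part as the test set."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    n_test = int(len(shuffled) * test_ratio)
    return shuffled[n_test:], shuffled[:n_test]

# Hypothetical toy data: one continuous field and one categorical field.
ages = [20, 30, 40, 50, 60]
colors = ["red", "blue", "red", "green", "blue"]

scaled_z = standardize(ages)               # centered on 0, unit variance
scaled_01 = normalize(ages)                # squeezed into [0, 1]
encoded = one_hot(colors)                  # columns: blue, green, red
binned = bin_values(ages, edges=[35, 55])  # three bins: <35, 35-54, >=55
train, test = train_test_split(list(zip(ages, colors)))
```

Standardization keeps the shape of the distribution and is less distorted by a single extreme value than min-max normalization, which pins the smallest and largest observations to 0 and 1. In a real workflow the split would normally come before fitting any scaler, so that test-set statistics do not leak into training.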