Skip to content
This repository has been archived by the owner on Jul 22, 2024. It is now read-only.
GregDritschler edited this page Apr 1, 2019 · 1 revision

Short title

Machine learning using synthesized patient health records

Author

URLs

Github repo

Summary

This notebook explores how to train a machine learning model to predict type 2 diabetes using synthesized patient health records. The use of synthesized data allows us to learn about building a model without any concern about the privacy issues surrounding the use of real patient health records.

Technologies

  • Machine Learning

Description

This project is part of a series of code patterns pertaining to a fictional health care company called Summit Health. This company stores electronic health records in a database on a z/OS server. Before running the notebook, the synthesized health records must be created and loaded into this database. Another project, https://github.com/IBM/summit-health-synthea, provides the steps for doing this. The records are created using a tool called Synthea, transformed and loaded into the database.

When the reader has completed this Code Pattern, they will understand how to:

  • Prepare data using Apache Spark.
  • Visualize data relationships using Pixiedust.
  • Train a machine learning model and publish it in the Watson Machine Learning (WML) repository.
  • Deploy the model as a web service and use it to make predictions.

Flow

  1. Log in to IBM Watson Studio
  2. Load the provided notebook into Watson Studio
  3. Load data in the notebook
  4. Transform the data with Apache Spark
  5. Create charts with PixieDust
  6. Publish and deploy model with Watson Machine Learning

Instructions

Find the detailed steps for this pattern in the readme file. The steps will show you how to:

Sign up for Watson Studio

  1. Sign up for IBM Watson Studio.
  2. Create a project
  3. Create a Watson Machine Learning instance
  4. Add the notebook to your project
  5. Run the notebook

Components and services

  • Apache Spark
  • Watson Machine Learning

Runtimes

  • Python
Clone this wiki locally