Loading Patient Info no longer works with latest data #62

Open
jqnatividad opened this issue May 20, 2020 · 1 comment

jqnatividad commented May 20, 2020

The date validation logic below:

# Convert both to datetimes
patients.Confirmed = pd.to_datetime(
    patients.Confirmed, format='%d.%m.%Y')
patients.Onset = pd.to_datetime(
    patients.Onset, format='%d.%m.%Y')

# Only keep records where confirmed >= onset
patients = patients[patients.Confirmed >= patients.Onset]

fails because some dates in the latest version of the data are invalid.

Only the data up to May 13 works.

Further, the data file is now gzipped because of GitHub limits, so the notebook needs to be updated to read the compressed file.
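For the gzipped file, a minimal sketch of one way the notebook could read it, assuming the data is a gzip-compressed CSV. The file name `patients.csv.gz` and the sample row are hypothetical stand-ins, not the repository's actual path or data:

```python
import gzip
import os
import tempfile

import pandas as pd

# Hypothetical file name; the real path in the repository may differ.
path = os.path.join(tempfile.mkdtemp(), "patients.csv.gz")

# Write a tiny gzipped CSV so this example is self-contained.
with gzip.open(path, "wt", encoding="utf-8") as f:
    f.write("Confirmed,Onset\n05.05.2020,01.05.2020\n")

# pandas decompresses on the fly; the default compression="infer"
# would also detect gzip from the .gz suffix.
loaded = pd.read_csv(path, compression="gzip")
print(loaded)
```

With a `.gz` suffix the explicit `compression` argument is optional, but spelling it out makes the intent clear to anyone reading the notebook.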

tuchandra commented May 21, 2020

There's a date in the Onset column entered as 31.04.2020, which doesn't exist (April has only 30 days). I'm not sure what it's supposed to be (perhaps 31.03.2020), but it affects just two rows, so you could drop them without it mattering much.
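One possible way to drop such rows, sketched here under the assumption that discarding unparseable dates is acceptable: pass `errors="coerce"` to `pd.to_datetime` so invalid dates become `NaT`, then drop those rows before comparing. The sample frame below is made up for illustration and is not the real patient data:

```python
import pandas as pd

# Hypothetical sample standing in for the real patient data.
patients = pd.DataFrame({
    "Confirmed": ["05.05.2020", "02.04.2020", "10.05.2020"],
    "Onset": ["31.04.2020", "30.03.2020", "12.05.2020"],
})

# Convert both columns; unparseable dates (e.g. 31.04.2020) become NaT
# instead of raising an error.
for col in ("Confirmed", "Onset"):
    patients[col] = pd.to_datetime(
        patients[col], format="%d.%m.%Y", errors="coerce")

# Count the rows that will be dropped, so the loss is visible.
n_invalid = int(patients[["Confirmed", "Onset"]].isna().any(axis=1).sum())
print(f"Dropping {n_invalid} row(s) with invalid dates")

# Drop rows with invalid dates, then keep records where confirmed >= onset.
patients = patients.dropna(subset=["Confirmed", "Onset"])
patients = patients[patients.Confirmed >= patients.Onset]
```

Printing the count of coerced rows is a design choice: it keeps the drop from being silent if later data versions contain more bad dates.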
