Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Identified issues with tfdv_basic_spending.ipynb #2642

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

ylnhari
Copy link

@ylnhari ylnhari commented Jul 18, 2024

training-data-analyst/courses/machine_learning/deepdive2/production_ml/solutions/tfdv_basic_spending.ipynb might be having mistakes

  1. without disclosing data location , asked user to write code to read data (train and test) in the first few cells of the notebook
  2. unnecessary comment confusing user's (could be from tfdv taxi example but relevant here ) .
    Here are those comments:-
    - Notice that there are no examples with values for pickup_census_tract. This is an opportunity for dimensionality reduction!
    - Try switching between the log and linear scales, and notice how the log scale reveals much more detail about the payment_type categorical feature

training-data-analyst/courses/machine_learning/deepdive2/production_ml/solutions/tfdv_basic_spending.ipynb might be having mistakes
- data location not disclosed asked candidate to write code to read data (train and test) in the first few cells of the notebook
- unnecessary comment confusing candidate (could be from taxi example but relevant here )-> 
1)  Notice that there are no examples with values for pickup_census_tract. This is an opportunity for dimensionality reduction!
2) Try switching between the log and linear scales, and notice how the log scale reveals much more detail about the payment_type categorical feature
Copy link

google-cla bot commented Jul 18, 2024

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

Copy link

Check out this pull request on  ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.


Powered by ReviewNB

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant