Use dvc to track my data generation pipeline #10456
Unanswered
ZuoyunZheng
asked this question in
Help
Replies: 1 comment
-
Hi @ZuoyunZheng! If the generated data is part of the |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi,
as the title suggest, i have a pipeline that depended on some data resources (tracked normally with dvc) to generate the data that i want to use for my seperate model training pipeline. I would like dvc to track my generated data. However it was not allowing me to with the error message:
I have since used
dvc commit
to explicitly track this "artifect" (not sure if this is the correct term).I have seen the discussion here however i don't quite understand the rationale behind why dvc not allowing for explicit tracking of the output of a stage similar to data tracking? Secondly is there any implication to my forcibly doing
dvc commit
? Any help is appreciated. Thanks in advance! :)Beta Was this translation helpful? Give feedback.
All reactions