Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] Tensorboard not logging any metrics for Autotrain on dgx #805

Open
2 tasks done
jmparejaz opened this issue Nov 13, 2024 · 0 comments
Open
2 tasks done

[BUG] Tensorboard not logging any metrics for Autotrain on dgx #805

jmparejaz opened this issue Nov 13, 2024 · 0 comments
Labels
bug Something isn't working

Comments

@jmparejaz
Copy link

jmparejaz commented Nov 13, 2024

Prerequisites

  • I have read the documentation.
  • I have checked other issues for similar problems.

Backend

Hugging Face Space/Endpoints

Interface Used

UI

CLI Command

No response

UI Screenshots & Parameters

No response

Error Logs

image

Additional Information

I have run multiple times finetuning LLM-ORPO with autotrain dgx and it is not logging to tensorboard the metrics
Deciding which hyperparameters to tweak without logs or when to stop training without looking the performance of the loss is not technically good. It would be amazing to have logging again.
I noticed that using the parameter "report" with WANDB doesnt work either. Any advice to make this work?

@jmparejaz jmparejaz added the bug Something isn't working label Nov 13, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant