New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Add script for visualizing RaFTS predictions of performance metrics #32

Closed

bolotinl wants to merge 7 commits into main from data_viz

Collaborator

bolotinl commented Nov 22, 2024

A new script allows the user to plot a map of RaFTS predicted module performance and/or a scatter plot comparing RaFTS predicted performances to actual performances

Additions

Script fs_perf_viz.py, which just requires a viz_config.yaml
viz_config.yaml

Testing

The viz_config.yaml should require little to no customization, though it is customizeable in terms of which plots will be generated.

python fs_perf_viz.py "/full/path/to/viz_config.yaml"

Todos

Finish and commit .ipynb that demonstrates the utility of fs_perf_viz.py (in progress)


          Add scripts and associated cfg file for model performance viz

874ae40

bolotinl assigned glitt13


          Remove scratch code

55a4147

glitt13 reviewed

View reviewed changes

Collaborator

glitt13 left a comment

Hi @bolotinl, I didn't try running this yet, but have some suggestions. Let's chat on your availability to work through these.

pkg/fs_algo/fs_algo/fs_perf_viz.py Outdated Show resolved Hide resolved

pkg/fs_algo/fs_algo/fs_perf_viz.py Outdated

+                                  plt.title("Predicted Performance: {}".format(ds), fontsize = 28)
+                                  # Save the plot as a .png file
+                                  output_path = f'{dir_out}/data_visualizations/{ds}_{algo}_{metric}_performance_map.png'

Collaborator

glitt13 Nov 25, 2024

I suggest we 1) create a function for writing data visualization files and 2) a function for generating the data visualization directory and filepath (e.g. std_viz_path). 1) When we switch to the cloud, we'll know to modify the read/write functions to provide a) local read/write or b) cloud read/write (no need to develop this yet). 2) A single function to create the file path will eliminate guesswork on filenames when reading whatever was written. Prioritize item #2 and we can get back to #1 if you're limited in time - we'll need a major overhaul to prepare for cloud integration.

Collaborator Author

bolotinl Nov 27, 2024

Return a matplot lib figure as an object (plt.gcf())
Create functions that define paths
Create functions that do the file I/O so that when we migrate to AWS we can just update the function
Move the creation of the data_visualizations folder into a function that creates the path for it
dir_ * if a directory
path_* if a path
Could include smaller functions for creating the path to the performance map, for example: fsate lines 492, 474
Look at plot learning curve for example: https://github.com/glitt13/formulation-selector/blob/agu_24/pkg/fs_algo/fs_algo/fs_algo_train_eval.py

Collaborator Author

bolotinl Nov 27, 2024

Classes are mostly for conveniently passing along data, so you can probably just do functions for now and either make a separate plotting file or add to fsate

pkg/fs_algo/fs_algo/fs_perf_viz.py Show resolved Hide resolved

pkg/fs_algo/fs_algo/fs_perf_viz.py Outdated Show resolved Hide resolved

pkg/fs_algo/fs_algo/fs_perf_viz.py Outdated Show resolved Hide resolved

pkg/fs_algo/fs_algo/fs_perf_viz.py Show resolved Hide resolved

pkg/fs_algo/fs_algo/fs_perf_viz.py Outdated Show resolved Hide resolved

pkg/fs_algo/fs_algo/fs_perf_viz.py

+                                  plt.title('Observed vs. Predicted Performance: {}'.format(ds))
+                                  # Save the plot as a .png file
+                                  output_path = f'{dir_out}/data_visualizations/{ds}_{algo}_{metric}_obs_vs_sim_scatter.png'

Collaborator

glitt13 Nov 25, 2024

Since this is a different visualization, seems like the std_viz_path function could use the true_keys object for creating the full file name.

glitt13 reviewed

View reviewed changes

pkg/fs_algo/fs_algo/fs_perf_viz.py

+                                  plt.title('Observed vs. Predicted Performance: {}'.format(ds))
+                                  # Save the plot as a .png file
+                                  output_path = f'{dir_out}/data_visualizations/{ds}_{algo}_{metric}_obs_vs_sim_scatter.png'

Collaborator

glitt13 Nov 26, 2024

@bolotinl A new suggestion - create a subdirectory based on the dataset inside output/data_visualizations, mimicking the structures created by fs_algo_train_eval.std_pred_path, e.g.

Path(dir_out)/Path('data_visualizations')/Path(dataset_id)/Path(insert specific plottype_algo_metric_dataset_id here)

bolotinl added 5 commits

November 26, 2024 13:19


          Include download of US map

2d885a7


          Get ds_type, write_type from pred cfg; convert os to Pathlib

aa1c3ba


          Use existing functions for pulling info from attr config

3c0cc87


          Rename 'perf_map' to 'pred_map' for clarity

635dc8f


          Rename 'perf_map' to 'pred_map' for clarity

4d970df

bolotinl closed this

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet