Model testing #700
-
We need to better define what we mean by "test a model", provide example files, and write a checklist that contributors can follow step by step.
Replies: 6 comments 4 replies
-
Hi @GemmaTuron. (1) First, I think we should define a standard input file, as Ersilia did with eml_canonical.csv: a file named, e.g., test_smiles.csv that contains at least 10-20 SMILES, so that everyone uses the same SMILES for testing. (2) It would be better to put one invalid SMILES string at the end of the file to check how the model behaves on bad input (I don't know how Ersilia models behave when an invalid input is passed to them). Maybe for this we could define some checks when reading SMILES from test_smiles.csv, so that if an invalid input is passed, it is ignored and NaN is put in the output.
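To make point (2) concrete, here is a minimal sketch of the "invalid input gives NaN" idea. It is only an illustration under stated assumptions: the `smiles` column name, the helper names, and `dummy_model` are all hypothetical, and `looks_like_smiles` is a crude character and parenthesis check standing in for a real SMILES parser. It is not how Ersilia currently handles invalid inputs.

```python
import csv
import io
import math
import re

# Assumption: a rough whitelist of characters that can appear in a SMILES string.
_ALLOWED = re.compile(r"^[A-Za-z0-9@+\-\[\]\(\)=#$/\\.%:*]+$")


def looks_like_smiles(smiles):
    """Crude stand-in validity check (NOT a real SMILES parser)."""
    if not smiles or not _ALLOWED.match(smiles):
        return False
    depth = 0
    for ch in smiles:
        if ch == "(":
            depth += 1
        elif ch == ")":
            depth -= 1
            if depth < 0:
                return False
    return depth == 0


def read_smiles(handle):
    """Read SMILES from a CSV that has a 'smiles' header (hypothetical layout)."""
    return [row["smiles"].strip() for row in csv.DictReader(handle)]


def predict_with_nan(smiles_list, predict, is_valid=looks_like_smiles):
    """Run `predict` on valid inputs; record NaN for invalid ones."""
    results = []
    for smi in smiles_list:
        value = predict(smi) if is_valid(smi) else math.nan
        results.append((smi, value))
    return results


def dummy_model(smiles):
    """Hypothetical model: returns the string length as a fake prediction."""
    return float(len(smiles))


# Three plausible SMILES followed by one deliberately invalid row.
EXAMPLE_CSV = "smiles\nCCO\nc1ccccc1\nCC(=O)O\nnot a smiles\n"
results = predict_with_nan(read_smiles(io.StringIO(EXAMPLE_CSV)), dummy_model)
for smi, value in results:
    print(smi, value)
```

Keeping the invalid row in the output (with NaN) rather than dropping it means the output always has the same number of rows as the input, which makes it easier to compare results across contributors.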
-
Hi @emmakodes, @samuelmaina, @HellenNamulinda, please give your views on this issue as well.
-
This is late,
I'm insisting on two inputs for the CLI because I have tested models that only work when given string inputs.
-
Hello @everyone, having two files to test a model is fair enough (one contains correct SMILES and the other includes some incorrect SMILES), but some models are pretty large and take time to run. What I would suggest is that we still have the two files, but reduce the number of SMILES in each file to, say, 5. These five SMILES in each of the two files should be the standard, most common inputs used to test a model.
-
Thanks @emmakodes @emmakodes @HellenNamulinda @ZakiaYahya, this is very valuable, let me add a few extra thoughts:
Thoughts
I have written a placeholder at
Strategy
There is a lot of work to be done here, so let's go step by step. I would start with "Input-output consistency". @GemmaTuron, what do you think? In parallel, @emmakodes and @leoank can explore the AMD64/ARM64 point.
-
Late to the party, sorry. Everyone has said pretty much what I would suggest. Thanks.
Thanks @emmakodes @emmakodes @HellenNamulinda @ZakiaYahya, this is very valuable, let me add a few extra thoughts:
Thoughts
I have written a placeholder at ersilia.publish.test, named LocalModelTester, that should have multiple methods to test models before doing the PR. In particular, one could potentially evaluate the following:
BaseInformation class, which can read the metadata.json file and will raise exceptions if something does not look good (for example, a wrong URL, or a description that is too short).
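As a rough illustration of the kind of check described above, a metadata validator could look something like the sketch below. This is not the actual BaseInformation implementation: the `url` and `description` keys, the `MetadataError` class, the function names, and the 50-character minimum are all assumptions made up for the example.

```python
import json
from urllib.parse import urlparse

# Assumption: an arbitrary threshold, chosen only for this example.
MIN_DESCRIPTION_LENGTH = 50


class MetadataError(ValueError):
    """Raised when a metadata field does not look right (hypothetical)."""


def check_url(url):
    """Accept only http(s) URLs that include a host name."""
    parsed = urlparse(url)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise MetadataError(f"URL does not look valid: {url!r}")


def check_description(text, min_length=MIN_DESCRIPTION_LENGTH):
    """Reject descriptions shorter than `min_length` characters."""
    if len(text.strip()) < min_length:
        raise MetadataError(
            f"Description is too short ({len(text.strip())} < {min_length} characters)"
        )


def check_metadata(metadata):
    """Run all checks on an already-loaded metadata dictionary."""
    check_url(metadata.get("url", ""))
    check_description(metadata.get("description", ""))


def check_metadata_file(path):
    """Load a metadata.json file and run the checks on its contents."""
    with open(path, encoding="utf-8") as handle:
        check_metadata(json.load(handle))


# Example: this dictionary passes; a bad URL or short description would raise.
good = {"url": "https://example.org/model", "description": "d" * 60}
check_metadata(good)
```

Raising an exception on the first problem keeps the sketch short; a tester that collects all problems and reports them together might be friendlier for contributors preparing a PR.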