Fix rom dataobj #2051

dylanjm · 2023-02-01T19:26:03Z

Pull Request Description

What issue does this change request address? (Use "#" before the issue to link it, i.e., #42.)

What are the significant changes in functionality due to this change request?

Implements changes found in #1718

Allows the option to pass training data sets directly to ROM SupervisedLearning algorithms rather than converting everything to dictionaries.

A flag is used to allow the SVL to self-identify whether it needs legacy training (dictionaries) or can handle training via DataSet.

For Change Control Board: Change Request Review

The following review must be completed by an authorized member of the Change Control Board.

1. Review all computer code.
2. If any changes occur to the input syntax, there must be an accompanying change to the user manual and xsd schema. If the input syntax change deprecates existing input files, a conversion script needs to be added (see Conversion Scripts).
3. Make sure the Python code and commenting standards are respected (camelBack, etc.) - See on the wiki for details.
4. Automated Tests should pass, including run_tests, pylint, manual building and xsd tests. If there are changes to Simulation.py or JobHandler.py the qsub tests must pass.
5. If significant functionality is added, there must be tests added to check this. Tests should cover all possible options. Multiple short tests are preferred over one large test. If new development on the internal JobHandler parallel system is performed, a cluster test must be added setting, in XML block, the node <internalParallel> to True.
6. If the change modifies or adds a requirement or a requirement based test case, the Change Control Board's Chair or designee also needs to approve the change. The requirements and the requirements test shall be in sync.
7. The merge request must reference an issue. If the issue is closed, the issue close checklist shall be done.
8. If an analytic test is changed/added is the the analytic documentation updated/added?
9. If any test used as a basis for documentation examples (currently found in raven/tests/framework/user_guide and raven/docs/workshop) have been changed, the associated documentation must be reviewed and assured the text matches the example.

moosebuild · 2023-02-01T21:11:17Z

Job Mingw Test on 7679bed : invalidated by @joshua-cogliati-inl

wangcj05

In addition to the comments I provided inside the code, I have the following comments:

It is not clear to me how to utilize the DataSet directly as training input, I do not see an example, this may be because I do not familiar with TSA module, could you explain it?
I do not see updated test or new test to check the proposed modifications. Is it checked in the existing TSA tests?

wangcj05 · 2023-02-06T18:42:21Z

ravenframework/Models/ROM.py

    else:
      # TODO: The following check may need to be moved to Dummy Class -- wangc 7/30/2018
-      if type(trainingSet).__name__ != 'dict' and trainingSet.type == 'HistorySet':
+      if type(trainingSet) != dict and trainingSet.type == 'HistorySet':


First, could you add a description to list all possible data structures for trainingSet?
Second, could you add checks for different data structures for trainingSet?

This looks like a specific check for history set alignment, right? I don't know if we need to find out all the different approaches to ROMs within this PR, do we? This sounds like a bigger issue.

wangcj05 · 2023-02-06T18:43:19Z

ravenframework/Models/ROM.py


+      self._replaceVariablesNamesWithAliasSystem(self.trainingSet, 'inout', False)


Could you check to see if this line works with your proposed data structure? In Model.py, this method only accept dict or list as input.

wangcj05 · 2023-02-06T18:44:42Z

ravenframework/SupervisedLearning/SupervisedLearning.py

+    if self.needsDictTraining:
+      self.trainOnDictionary(trainingData, indexMap)
+    else:
+      self.amITrained = True
+      self.muAndSigmaFeatures = dict((f, (0,1)) for f in self.features)


These lines is not clear to me. When dataset is needed, I do not see a training process for the ROM. Could you explain it?

I agree, was a line missed from the old PR? If I recall correctly, we were directly overloading the "train" method if self.needsDictTraining is False.

wangcj05 · 2023-02-06T18:45:41Z

ravenframework/SupervisedLearning/SupervisedLearning.py

@@ -239,15 +255,15 @@ def train(self, tdict, indexMap=None):
      for feat in self.features:
        for index in indexMap.get(feat, []):
          if index not in needFeatures and index not in needTargets:
-            needFeatures.append(feat)
+            needFeatures.append(index)


Could you add an explanation here for the change?

moosebuild · 2023-02-15T20:30:34Z

Job Mingw Test on f4edc15 : invalidated by @joshua-cogliati-inl

computer rebooted

PaulTalbot-INL · 2023-03-08T18:30:07Z

ravenframework/Models/Model.py

+          if oldName in sampledVars:
+            value = sampledVars.pop(oldName)
+            sampledVars[newName] = value
+        elif isinstance(sampledVars, list):


I realize originalVariables is a deepcopy of sampledVars, but it would be nice if this set of if isinstance checked on the same variable instead of the two different ones.

moosebuild · 2023-07-25T16:03:33Z

Job Test qsubs sawtooth on 76a9732 : invalidated by @joshua-cogliati-inl

timed out in Test Plugins

dylanjm requested a review from wangcj05 February 2, 2023 16:28

wangcj05 requested changes Feb 6, 2023

View reviewed changes

aalfonsi mentioned this pull request Feb 7, 2023

Alfoa/feature selection #1301

Merged

9 tasks

dylanjm force-pushed the fix-rom-dataobj branch from 7679bed to f4edc15 Compare February 14, 2023 21:26

PaulTalbot-INL reviewed Mar 8, 2023

View reviewed changes

dylanjm force-pushed the fix-rom-dataobj branch from f4edc15 to 7fe54f8 Compare June 1, 2023 16:58

dylanjm force-pushed the fix-rom-dataobj branch from 4495b14 to 3a988ac Compare June 20, 2023 15:30

dylanjm added 7 commits July 24, 2023 14:44

Handle merge conflicts

d55f078

Small changes to test and custom sampler test

fdf98df

TSA Updates

415bf7f

Small Change to TSAUser

47fa43d

Fix Clustered and Interpolated Tests

28c6855

Remove _train abstract method

b39f8ec

Modify RWD to use np svd

76a9732

dylanjm force-pushed the fix-rom-dataobj branch from 2bad1e4 to 76a9732 Compare July 24, 2023 20:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix rom dataobj #2051

Fix rom dataobj #2051

dylanjm commented Feb 1, 2023

moosebuild commented Feb 1, 2023

wangcj05 left a comment

wangcj05 Feb 6, 2023

PaulTalbot-INL Mar 7, 2023

wangcj05 Feb 6, 2023

wangcj05 Feb 6, 2023

PaulTalbot-INL Mar 7, 2023

wangcj05 Feb 6, 2023

moosebuild commented Feb 15, 2023

PaulTalbot-INL Mar 8, 2023

moosebuild commented Jul 25, 2023


		self._replaceVariablesNamesWithAliasSystem(self.trainingSet, 'inout', False)

Fix rom dataobj #2051

Are you sure you want to change the base?

Fix rom dataobj #2051

Conversation

dylanjm commented Feb 1, 2023

Pull Request Description

What issue does this change request address? (Use "#" before the issue to link it, i.e., #42.)

What are the significant changes in functionality due to this change request?

For Change Control Board: Change Request Review

moosebuild commented Feb 1, 2023

wangcj05 left a comment

Choose a reason for hiding this comment

wangcj05 Feb 6, 2023

Choose a reason for hiding this comment

PaulTalbot-INL Mar 7, 2023

Choose a reason for hiding this comment

wangcj05 Feb 6, 2023

Choose a reason for hiding this comment

wangcj05 Feb 6, 2023

Choose a reason for hiding this comment

PaulTalbot-INL Mar 7, 2023

Choose a reason for hiding this comment

wangcj05 Feb 6, 2023

Choose a reason for hiding this comment

moosebuild commented Feb 15, 2023

PaulTalbot-INL Mar 8, 2023

Choose a reason for hiding this comment

moosebuild commented Jul 25, 2023