
Towards Handling Big LHS #361

Open
wants to merge 2 commits into base: main
Conversation

@zhengyang92 (Contributor) commented Oct 3, 2018

Always feed getGuesses() with the Vars in the LHS: the original findCands() for generating synthesis candidates is called with MaxCand = 15, which may miss input variables declared at the beginning of a large program. This patch fixes that.

Add a switch for the big query: in some cases the huge big query may cause Souper to abort.

Enable the ctpop test case.

@zhengyang92 changed the title from Handle Big LHS to Towards Handling Big LHS on Oct 3, 2018
@zhengyang92 (Author)

Resolving the merge conflicts.

findCands(LHS, Inputs, /*WidthMustMatch=*/false, /*FilterVars=*/false, MaxLHSCands);
std::vector<Inst *> Vars;
findVars(LHS, Vars);

Collaborator

we should talk about this part. why do you think it's a good idea? do you have data supporting that it does a better job?

@zhengyang92 (Author) commented Oct 3, 2018

I did this because findCands() stops when it reaches MaxLHSCands (set to 15 by default). However, this can miss input variables declared at the top of the program. Due to this limitation, the synthesizer cannot give the correct ctpop result, since the input %x is not harvested as a synthesis component.

Exhaustive synthesis already calls findVars() after doing the big query, so hoisting this call up does not have much impact on compilation performance, compared with the potential benefit of getting better synthesis results.

I admit this approach is naive: it basically runs the traversal on the LHS twice. I can write a better version of findCands() that runs only once.

@zhengyang92 (Author) commented Oct 3, 2018

This code basically guarantees that all the program inputs are collected as components for synthesis.

Collaborator

I agree that tuning MaxLHSCands would be useful, but just harvesting all the vars doesn't sound like the right solution. For one thing, what if there are 500 vars? Then synthesizing just one select instruction is going to involve evaluating 500x500x500 guesses.

Collaborator

the idea behind the hard bound is that it prevents these degenerate cases, but your patch will just open us up to them again

@zhengyang92 (Author) commented Oct 3, 2018

Okay, this makes sense. But instead of stopping at a hard bound while doing findCands(), can we traverse the whole tree, rank the candidates in descending order of their "benefit", and hard-bound the resulting list? A "benefit" tagging algorithm is already implemented in findCands().

@zhengyang92 (Author) commented Oct 3, 2018

The current "benefit" tagging algorithm follows a simple rule: the deeper the traversal goes, the higher the benefit a candidate gets. We could further customize this benefit function to fit our needs. Say some heuristic can tell that certain kinds of candidates are more likely to be used as components in synthesis; then we could increase the "benefit" of those candidates to reduce synthesis time.

Collaborator

well, the benefit is a property of the RHS cost vs. LHS cost, not a property of the choice of inputs, which is what we're talking about here.

my observation is that most rewrites are going to occur at the bottom of a souper LHS, which is why I wrote the code to get inputs using DFS on the LHS.

if we pick the ones likely to lead to the highest benefit, you're saying to pick inputs from the top of the LHS. I'm not opposed to this, but I think it is a mistake to make the change before we have data showing that the change is a good one.

Collaborator

anyway I'm saying:

  • this is really two patches, please split them
  • for each patch, I want you to justify it using data instead of making guesses about what will work best

@zhengyang92 (Author)

Sure, I will split this PR and collect data.

@@ -32,6 +32,9 @@ namespace {
"The larger the number is, the more fine-grained debug "
"information will be printed"),
cl::init(0));
static cl::opt<bool> NoBigQuery("souper-exhaustive-synthesis-no-big-query",
Collaborator

let's get some data about this also -- maybe run synthesis with and without it and see how it changes the CPU time used

@regehr (Collaborator) commented Oct 3, 2018

I think you should separate the bigquery part of this PR from the changes to how LHSs are chosen

@regehr (Collaborator) commented Oct 3, 2018

for bigquery, we should choose the default to be the one that runs faster -- but to make that decision we need data
