loop tilling/blocking in SCA forward transformation #19

clementval · 2018-06-21T08:47:05Z

Support for loop-tiling/blocking

clementval · 2018-09-28T06:34:34Z

@mlange05 Do you have an example of how you would like this to be?

mlange05 · 2018-09-28T15:50:03Z

Ok, this is just a preliminary sketch, but basically what I would like for the SCA case is to add an option to insert a single level of blocking around the root call. Something like:

!$claw parallelize forward blocked="bsize"
DO p = 1, nproma     
  CALL compute_column(nz, q(p,:), t(p,:), z(p,:))                                                                                             
END DO

then transforms into something like

p_blocks = <number of blocks as a function of "bsize">                                                                                                                                                                                                                                                 
DO p_i = 1, p_blocks                                                                                                                                                                                                                                                          
   p0 = <first index of block in global array>                                                                                                                                                                                                                                
   p1 = <final index of block in global array>                                                                                                                                                                                                                                
   CALL compute_column ( nz , q (p0:p1 , : ) , t (p0:p1, : ) , z (p0:p1, : ), nproma=p1-p0 )   
END DO

The details of how to define the bsize variable in the pragma or the derivation of block sizes/indices are still a bit fuzzy (sorry), but I'll provide a more concrete example soon...

One key detail to note then is that the OpenMP-parallel loop would live on the outer loop in this calling routine (to avoid false sharing), and would need to be removed from all kernels under this root call for the C backend.

clementval · 2018-10-02T11:36:10Z

@mlange05 Thanks for the input. I will try to draft something in the document and we can iterate on it.

clementval · 2018-10-02T11:54:57Z

@mlange05 Is it fine if this notion is tied with the Single Column Abstraction or would you see a use case where you would used this loop tilling/blocking as low-level transformation like a loop-fusion?

mlange05 · 2018-10-02T14:00:48Z

Hmmm, good question. My primary use case is the root loop of SCA, but I can see how making it a general loop transformation makes conceptually more sense. Is there a way to combine/compose the two, eg. the SCA transformation triggering the loop transformation automatically if given a particular keyword?

That being said, if it's easier to do as part of SCA I would be very happy already.

clementval · 2018-10-03T06:41:21Z

This would be possible. In the loop-interchange directive for example, the fusion clause trigger a fusion transformation automatically.

clementval · 2018-10-25T10:36:53Z

PR #24

clementval added specification SCA LOW-LEVEL labels Jun 21, 2018

clementval added this to the Specification 1.0 milestone Jun 21, 2018

clementval modified the milestones: Specification 1.0, v2.0 Sep 28, 2018

clementval added New feature and removed SCA labels Sep 28, 2018

clementval changed the title ~~loop tilling/blocking~~ loop tilling/blocking in SCA forward transformation Oct 2, 2018

clementval added SCA and removed LOW-LEVEL labels Oct 2, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

loop tilling/blocking in SCA forward transformation #19

loop tilling/blocking in SCA forward transformation #19

clementval commented Jun 21, 2018 •

edited

Loading

clementval commented Sep 28, 2018

mlange05 commented Sep 28, 2018

clementval commented Oct 2, 2018

clementval commented Oct 2, 2018

mlange05 commented Oct 2, 2018

clementval commented Oct 3, 2018

clementval commented Oct 25, 2018

loop tilling/blocking in SCA forward transformation #19

loop tilling/blocking in SCA forward transformation #19

Comments

clementval commented Jun 21, 2018 • edited Loading

clementval commented Sep 28, 2018

mlange05 commented Sep 28, 2018

clementval commented Oct 2, 2018

clementval commented Oct 2, 2018

mlange05 commented Oct 2, 2018

clementval commented Oct 3, 2018

clementval commented Oct 25, 2018

clementval commented Jun 21, 2018 •

edited

Loading