Performance: Combining flats is slow #645

Open
pllim opened this issue Aug 21, 2018 · 2 comments

@pllim
Member

pllim commented Aug 21, 2018

This is taken from the Astropy project performance survey. If this is not the correct repo for this issue, please advise.


I have not tried this in recent astropy versions, but the early code to combine flats was FAR slower than IRAF imcombine. I did some reading and discovered that IRAF was very proud of how it did memory management to do this efficiently. Sorry I cannot be more specific or useful than that.
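For reference, the kind of call I mean is roughly the following (the file names and the mem_limit value are just illustrative, not a real benchmark setup):

```python
# Illustrative only: median-combine a stack of flat frames with ccdproc.
# File names and mem_limit are made up for this example.
import ccdproc

flat_files = ["flat_001.fits", "flat_002.fits", "flat_003.fits"]
combined_flat = ccdproc.combine(flat_files, method="median",
                                unit="adu", mem_limit=1e9)
```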

@MSeifert04
Contributor

Thanks for bringing this to our attention @pllim.

There are several places in combine and Combiner that are quite slow and memory inefficient. We are currently working on a few of these issues, but it's unlikely that we'll be faster than (or even as fast as) imcombine. 😅

@jpmorgen

jpmorgen commented Dec 14, 2020

It occurs to me that considerable speed might be gained by using multiprocessing: each tile could be subdivided into sub-tiles and farmed out to individual combiners, so that those of us with multi-core machines (pretty much everyone these days) could enjoy a considerable boost in performance. I have started to design a system that I think would work and would preserve the original memory-handling subdivisions, using queues to provide instructions to the subprocesses and memory-mapped files (the astropy.io.fits default) to avoid opening the same file multiple times while creating the sub-tiles. Opening each file multiple times to create the chunks will still, of course, be necessary.
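To make the idea concrete, here is a very rough sketch. A multiprocessing.Pool stands in for the queue-based dispatch described above, and names like combine_subtile and combine_flats are placeholders, not existing ccdproc API:

```python
# Rough sketch: each worker re-opens the inputs with memmap=True, reads only
# its sub-tile window, median-combines it, and the parent stitches the tiles.
from multiprocessing import Pool

import numpy as np
from astropy.io import fits


def combine_subtile(args):
    """Median-combine one sub-tile window across all input files."""
    paths, (y0, y1, x0, x1) = args
    planes = []
    for path in paths:
        # memmap=True (the astropy.io.fits default) means only the sub-tile
        # window is pulled into memory, not the whole image.
        with fits.open(path, memmap=True) as hdul:
            planes.append(hdul[0].data[y0:y1, x0:x1].astype(np.float32))
    return (y0, y1, x0, x1), np.median(np.stack(planes), axis=0)


def combine_flats(paths, shape, subtile=512, nproc=4):
    """Farm sub-tiles out to a process pool and stitch the results back."""
    ny, nx = shape
    jobs = [(paths, (y, min(y + subtile, ny), x, min(x + subtile, nx)))
            for y in range(0, ny, subtile)
            for x in range(0, nx, subtile)]
    out = np.empty(shape, dtype=np.float32)
    with Pool(nproc) as pool:
        for (y0, y1, x0, x1), tile in pool.imap_unordered(combine_subtile, jobs):
            out[y0:y1, x0:x1] = tile
    return out
```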

Is there some larger design decision that discourages the use of multiprocessing at this level? Yes, it means that upstream multiprocessed callers need to have their daemon flag set to False and to divide the maximum memory and number of processes passed to the combiner by the number of parent processes, but with those considerations, I think it should work OK.

If there is interest in the multiprocess route, let me know.

Thanks!

jpm

@mwcraig mwcraig modified the milestones: 2.1.1, 2.2.0 Mar 15, 2021
@mwcraig mwcraig modified the milestones: 2.2.0, 2.2.1 May 25, 2021
@mwcraig mwcraig modified the milestones: 2.2.1, 2.3 Nov 21, 2021
@mwcraig mwcraig modified the milestones: 2.3, 3.0 Jan 19, 2022