MAINT: suggested speedups #1

andyfaff · 2021-09-14T03:28:01Z

I was browsing PyPI and came across the package. It's great to know that there's a community building up around refnx. I knew that @arm61 was working on stuff around this area, I guess this is to do with that.

The project infrastructure looks great. Automated testing, pypi, etc.

I've made this PR to suggest some possible speedups via removal of loops, vectorisation, etc. It's not currently passing tests, possibly because the PDFs are being calculated differently. That should only require minor tweaks though.

andyfaff · 2021-09-14T03:28:57Z

parameter/gauss_class.py

-        for d in self.data:
-            self.distros.append(norm(loc=d[0], scale=d[1]))
+        # truncnorm takes care of each gaussian contribution integrating
+        # to unity, etc.


truncnorm a much better way of handling a clipped normal distribution, and it should be fairly fast.

andyfaff · 2021-09-14T03:29:39Z

parameter/gauss_class.py

-
-        return _pdf
+
+        arrs = [d.pdf(x) for d in self.distros]


Removing a double nested loop should be a hell of a lot faster. Each of the d.pdf calls is vectorised.

andyfaff · 2021-09-14T03:37:14Z

parameter/gauss_class.py

+        arrs = [d.cdf(x) for d in self.distros]
+        return np.sum(arrs, axis=0) / len(self.data)
+
+    def _ppf_single(self, q):


This approach to calculating ppf is taken from scipy.stats. If it works well there, it should work well here. Of course I may have ignored improved bracketing that you've already worked out for these distributions.

q is normally used as the argument for ppf (at least in scipy.stats). Changing the argument name to q would benefit me (at least), and possibly others who are interested in contributing.

andyfaff · 2021-09-14T04:06:10Z

parameter/gauss_class.py

-                                      args=[v]).root
+        vfun = np.vectorize(self._ppf_single, otypes='d')
+        _ppf = vfun(np.atleast_1d(q))
+
        if len(_ppf) == 1:


Probably not a good idea to have this, you probably need if _ppf.size == 1

>>> x = np.random.random(size=(1, 10)) >>> len(x) 1

MAINT: suggested speedups

eeebb4d

andyfaff commented Sep 14, 2021

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MAINT: suggested speedups #1

MAINT: suggested speedups #1

andyfaff commented Sep 14, 2021

andyfaff Sep 14, 2021

andyfaff Sep 14, 2021

andyfaff Sep 14, 2021

andyfaff Sep 14, 2021

andyfaff Sep 14, 2021

MAINT: suggested speedups #1

Are you sure you want to change the base?

MAINT: suggested speedups #1

Conversation

andyfaff commented Sep 14, 2021

andyfaff Sep 14, 2021

Choose a reason for hiding this comment

andyfaff Sep 14, 2021

Choose a reason for hiding this comment

andyfaff Sep 14, 2021

Choose a reason for hiding this comment

andyfaff Sep 14, 2021

Choose a reason for hiding this comment

andyfaff Sep 14, 2021

Choose a reason for hiding this comment