What did you do this week?
- I have opened a PR on cupy adding a scipy.fft backend
- Fixed a few small incompatibilities with the numpy.fft interface
- Updated next_fast_len, a utility function to find the fastest lengths for computing the FFT (pypocketfft!17, scipy#10570)
- Various clean-ups and refactoring of pypocketfft's multi-dimensional transform functions (pypocketfft!16, pypocketfft!20)
- Restarted a discussions about use of OpenMP in SciPy. pypocketfft already supports OpenMP for parallelising transforms over multidimensional arrays, however there are a number of issues preventing SciPy from building with this feature enabled.
What is coming up next?
- Replace OpenMP in pypocketfft with a custom thread-pool. The discussions on SciPy-dev have suggested that OpenMP isn't going to be a suitable solution so I will attempt to replace it.
- Finishing up my CuPy backend PR. It is mostly completed but needs a few more tests.
- Implement shape-changing transforms in pypocketfft.
Did you get stuck anywhere?
I found it quite difficult to get cuda and CuPy working properly on my computer and still occasionally have problems with it stopping recognising my cuda device. However, it's at least functional now if not quite perfect.