What's Changed
- Improve BLAS detection in configure for ARM
- Control of numerical accuracy to speed up calculations: auto switch to single precision
- Compiles with cray compiler
- Fixed bug in HPC implementation of Jastrow
- Improved FMAs in += statements
- Reduced number of mallocs and memsets
- Added some OpenMP on small functions becoming important with large numbers of threads
New Contributors
Full Changelog: v0.5.4...v1.0.0