Move entire calculation to index functions
At the moment, the aggregator calls an appropriate apply_along_axis
with the index function only providing the inner part of this. This restricts the index functions unnecessarily in many cases and can lead to sub-optimal performance.
Edited by Klaus Zimmermann