Hey @ritchieng, thanks for implementing this method so quickly! I'm curious how its performance compares to other standard methods implemented in SciPy or NumPy. Would it be possible to see some benchmarks for this implementation?