Skip to content

Conversation

@jimburtoft
Copy link
Contributor

Description of changes:

  • Hardware-accelerated 1D FFT using Tensor Engine
  • Supports arbitrary heights and power-of-2 widths (128-4096)
  • Radix-2 Cooley-Tukey algorithm with 128-point DFT base case
  • High accuracy (< 0.003% error vs NumPy)
  • Comprehensive unit tests with performance benchmarks"

Testing:

Just covered by function specific unit testing.

Pull Request Checklist

  • [x ] I have filled in all the required field in the template
  • [ x] I have tested locally that all the tests pass
  • [x ] By submitting this pull request, I confirm that my contribution is made under the terms of the MIT-0 license.

@jimburtoft jimburtoft changed the title Add 1D FFT NKI kernel implementation Add 1D Fast Fourier Transform FFT NKI kernel implementation Nov 5, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant