Hello Team,
I am currently investigating FFT-based neural network approaches, particularly block-circulant matrix methods (e.g., CirCNN), and evaluating their suitability for deployment on modern AI accelerator hardware.
I would appreciate your insights regarding your platform’s support for such workloads.
Specifically, I would like to ask:
- Does your accelerator architecture provide any dedicated FFT hardware support, specialized FFT instructions, or FFT-oriented processing pipelines?
- Are there optimized FFT kernels, libraries, or software components available within your SDK or software stack?
- Does the platform support custom operators or low-level kernel development that would allow implementation of FFT and IFFT operations?
- Is native complex-number support available, or would real and imaginary components need to be handled separately?
- In your opinion, would FFT-based neural network models, such as block-circulant networks, be an efficient use case for your hardware compared to conventional matrix-multiplication-based models?
I am particularly interested in understanding both the architectural capabilities and software support available for implementing and evaluating FFT-based neural network layers.
Thank you very much for your time and assistance.
Best regards,
Sowmiya Arumugam