I recently use your awesome flash bidirectional linear attention on my tasks. Since images are of different resolution and native resolution training is critical. I would like to know that if you have plans to develop the varlen API to support packing.
Thank you very much!