Hi, I'm passing in exactly the same dataset as for the simple model, which works, but the attention mechanism model doesn't work. I give the algorithm a dataset with 336 features, and it asks me for a 336*336 input. Can you help me, please?