in lade/decoding/py line1175 the original code: ```python else: all_old_tokens.append(hits[max_hit]) ``` should be changed to ```python else: all_old_tokens.append(hits[hit_idx]) ``` Otherwise it is just appending the same thing