https://github.com/Wan-Video/Wan2.2/blob/031a9be56cec91e86d140d3d3a74280fb05a9b1c/wan/textimage2video.py#L373

Does anyone know why it is extended to the token length here? Why isn't this handled the same way as in Wan2.1?