),"if use_fsq is set to True, `fsq_levels` must be set (and not `codebook_size`). the effective codebook size is the product of all the FSQ levels"
),f"number of frames {frames} minus the first frame ({frames-int(video_contains_first_frame)}) must be divisible by the total downsample factor across time {self.time_downsample_factor}"