API Guide ========= .. toctree:: :maxdepth: 4 models tensor_parallel context_parallel pipeline_parallel custom_fsdp fusions transformer moe dist_checkpointing dist_optimizer distributed datasets multi_latent_attention num_microbatches_calculator optimizer_param_scheduler optimizer_cpu_offload encoder_decoder_parallelism