<!--Copyright 2022 The HuggingFace Team. All rights reserved.

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with
the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on
an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the
specific language governing permissions and limitations under the License.
-->

# Schedulers

Diffusers contains multiple pre-built schedule functions for the diffusion process.

## What is a scheduler?

The schedule functions, called *Schedulers* in the library, take in the output of a trained model, the sample that the diffusion process is iterating on, and a timestep, and return a denoised sample.

- Schedulers define the methodology for iteratively adding noise to an image or for updating a sample based on model outputs.
    - For training, the different ways of adding noise to images correspond to the different algorithms used to train a diffusion model.
    - For inference, the scheduler defines how to update a sample based on the output of a pretrained model.
- Schedulers are often defined by a *noise schedule* and an *update rule* for solving the underlying differential equation.
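
As a rough illustration of these two ingredients, the sketch below pairs a linear noise schedule with a deterministic, DDIM-style update rule in plain Python. The function names (`linear_betas`, `cumulative_alphas`, `add_noise`, `step`) and the default values are assumptions made for this example rather than the library's implementation, and the model output is assumed to be the predicted noise.

```python
import math


def linear_betas(num_train_timesteps=1000, beta_start=1e-4, beta_end=2e-2):
    """Noise schedule: per-step noise variances beta_t, linearly spaced."""
    delta = (beta_end - beta_start) / (num_train_timesteps - 1)
    return [beta_start + i * delta for i in range(num_train_timesteps)]


def cumulative_alphas(betas):
    """alpha_bar_t = product over s <= t of (1 - beta_s)."""
    alpha_bars, running = [], 1.0
    for beta in betas:
        running *= 1.0 - beta
        alpha_bars.append(running)
    return alpha_bars


def add_noise(x0, noise, t, alpha_bars):
    """Training side: jump from the clean sample x0 to the noisy sample x_t."""
    a = alpha_bars[t]
    return [math.sqrt(a) * x + math.sqrt(1.0 - a) * n for x, n in zip(x0, noise)]


def step(model_output, t, sample, alpha_bars):
    """Inference side: deterministic update from timestep t to t - 1.

    model_output is assumed to be the noise predicted by the model.
    """
    a_t = alpha_bars[t]
    a_prev = alpha_bars[t - 1] if t > 0 else 1.0
    # Estimate the clean sample implied by the predicted noise.
    pred_x0 = [
        (x - math.sqrt(1.0 - a_t) * e) / math.sqrt(a_t)
        for x, e in zip(sample, model_output)
    ]
    # Re-noise that estimate to the (lower) noise level of the previous timestep.
    return [
        math.sqrt(a_prev) * p + math.sqrt(1.0 - a_prev) * e
        for p, e in zip(pred_x0, model_output)
    ]


x0 = [0.5, -1.0, 2.0]
noise = [0.3, -0.7, 1.1]  # fixed values instead of random draws, for reproducibility
alpha_bars = cumulative_alphas(linear_betas())
x_t = add_noise(x0, noise, 500, alpha_bars)
x_prev = step(noise, 500, x_t, alpha_bars)  # pretend the model predicted the noise exactly
```

With a perfect noise prediction, `step` lands on the same point that `add_noise` would produce for timestep 499, which is a convenient sanity check for an update rule of this kind.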

### Discrete versus continuous schedulers

All schedulers take in a timestep to predict the updated version of the sample being diffused.
The timestep indicates where in the diffusion process the current step lies: noisy training data is generated by iterating forward in time, and inference is executed by propagating backwards through the timesteps.
Different algorithms use timesteps that are either discrete (accepting `int` inputs), such as the [`DDPMScheduler`] or [`PNDMScheduler`], or continuous (accepting `float` inputs), such as the score-based schedulers [`ScoreSdeVeScheduler`] or [`ScoreSdeVpScheduler`].
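
To make the distinction concrete, here is a small sketch of how the two kinds of timestep sequences might be built for inference. The helper names, the even spacing, and the `t_min` cutoff are assumptions for illustration; each scheduler in the library defines its own spacing.

```python
def discrete_timesteps(num_train_timesteps, num_inference_steps):
    """Integer timesteps: an evenly strided subset of the training steps, high to low."""
    stride = num_train_timesteps // num_inference_steps
    return [i * stride for i in range(num_inference_steps)][::-1]


def continuous_timesteps(num_inference_steps, t_max=1.0, t_min=1e-3):
    """Float timesteps: evenly spaced points in [t_min, t_max], high to low."""
    if num_inference_steps == 1:
        return [t_max]
    delta = (t_max - t_min) / (num_inference_steps - 1)
    return [t_max - i * delta for i in range(num_inference_steps)]


int_steps = discrete_timesteps(1000, 4)   # indices into a 1000-step training schedule
float_steps = continuous_timesteps(4)     # points on a continuous time axis
```

In both cases the sequence runs from the noisiest timestep to the cleanest, matching the backwards direction used at inference.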

## Designing Re-usable schedulers

The core design principle behind the schedule functions is to be model, system, and framework independent.
This allows for rapid experimentation and cleaner abstractions in the code, where the model prediction is separated from the sample update.
To this end, the design of schedulers is such that:

- Schedulers can be used interchangeably between diffusion models in inference to find the preferred trade-off between speed and generation quality.
- Schedulers currently default to PyTorch, but are designed to be framework independent (partial NumPy support currently exists).
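
The separation between model prediction and sample update can be sketched with a toy inference loop that accepts any object exposing `set_timesteps` and `step`. Everything below is hypothetical: `EulerToyScheduler`, `TwoStepToyScheduler`, `toy_model`, and `run_inference` are stand-ins written for this example, and the "model" simply returns the derivative of the ODE dx/dt = x so that the exact answer is known.

```python
import math


class EulerToyScheduler:
    """First-order update that uses only the current model output."""

    def set_timesteps(self, num_inference_steps):
        self.dt = 1.0 / num_inference_steps
        # Continuous timesteps from t = 1 down towards (but excluding) t = 0.
        self.timesteps = [1.0 - i * self.dt for i in range(num_inference_steps)]

    def step(self, model_output, timestep, sample):
        return sample - self.dt * model_output


class TwoStepToyScheduler(EulerToyScheduler):
    """Second-order linear multistep update that reuses the previous model output."""

    def set_timesteps(self, num_inference_steps):
        super().set_timesteps(num_inference_steps)
        self.previous_output = None  # reset the history for a new run

    def step(self, model_output, timestep, sample):
        if self.previous_output is None:
            derivative = model_output  # no history yet: fall back to an Euler step
        else:
            derivative = 1.5 * model_output - 0.5 * self.previous_output
        self.previous_output = model_output
        return sample - self.dt * derivative


def toy_model(sample, timestep):
    """Stand-in for a trained network: returns dx/dt for the ODE dx/dt = x."""
    return sample


def run_inference(model, scheduler, sample, num_inference_steps):
    scheduler.set_timesteps(num_inference_steps)
    for t in scheduler.timesteps:
        model_output = model(sample, t)                   # model prediction
        sample = scheduler.step(model_output, t, sample)  # sample update
    return sample


exact = math.exp(-1.0)  # integrating dx/dt = x from t = 1 back to t = 0, starting at 1.0
euler_result = run_inference(toy_model, EulerToyScheduler(), 1.0, 20)
two_step_result = run_inference(toy_model, TwoStepToyScheduler(), 1.0, 20)
```

The loop and the model are untouched when the scheduler is swapped; only the update rule changes. In this toy setting the two-step rule ends closer to `exact` than the Euler rule for the same number of model calls, which is the kind of speed versus quality trade-off the first bullet refers to.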


## API

The core API for any new scheduler must follow a limited structure.
- Schedulers should provide one or more `step(...)` methods that are called to update the generated sample iteratively.
- Schedulers should provide a `set_timesteps(...)` method that configures the parameters of a schedule function for a specific inference task.
- Schedulers should be framework-agnostic, but provide a simple way to convert the scheduler into a specific framework, such as PyTorch, with a `set_format(...)` method.
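
A minimal skeleton following this three-method structure might look like the sketch below. `ToyScheduler` is hypothetical and does not subclass anything from the library; the `"list"` and `"tuple"` formats are stand-ins for real tensor frameworks such as NumPy or PyTorch, and the update rule in `step` is a placeholder.

```python
class ToyScheduler:
    """Hypothetical scheduler skeleton; not a class from the library."""

    # Stand-ins for framework-specific tensor constructors (assumption for this sketch).
    _converters = {"list": list, "tuple": tuple}

    def __init__(self, num_train_timesteps=1000):
        self.num_train_timesteps = num_train_timesteps
        self.tensor_format = "list"
        self.timesteps = list(range(num_train_timesteps))[::-1]

    def set_format(self, tensor_format):
        """Convert the scheduler's internal state to the requested container type."""
        if tensor_format not in self._converters:
            raise ValueError(f"unsupported format: {tensor_format!r}")
        self.tensor_format = tensor_format
        self.timesteps = self._converters[tensor_format](self.timesteps)

    def set_timesteps(self, num_inference_steps):
        """Pick an evenly strided subset of the training timesteps for inference."""
        stride = self.num_train_timesteps // num_inference_steps
        steps = [i * stride for i in range(num_inference_steps)][::-1]
        self.timesteps = self._converters[self.tensor_format](steps)

    def step(self, model_output, timestep, sample):
        """Placeholder update rule: subtract an equal share of the model output per step."""
        return sample - model_output / len(self.timesteps)


scheduler = ToyScheduler()
scheduler.set_format("tuple")
scheduler.set_timesteps(4)
updated = scheduler.step(4.0, scheduler.timesteps[0], 10.0)
```

Keeping the format conversion in one method means the schedule logic in `set_timesteps` and `step` never needs to know which framework is in use.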

The base class [`SchedulerMixin`] implements low level utilities used by multiple schedulers.

### SchedulerMixin
[[autodoc]] SchedulerMixin

### SchedulerOutput
The class [`SchedulerOutput`] contains the outputs from any scheduler's `step(...)` call.

[[autodoc]] schedulers.scheduling_utils.SchedulerOutput
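
The idea of a dedicated output container can be sketched with a small dataclass. `ToySchedulerOutput` is a hypothetical stand-in modeled on a single `prev_sample` field, and the `return_dict` flag is an assumption about the calling convention; consult the generated reference above for the actual fields and signature.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ToySchedulerOutput:
    """Hypothetical stand-in for the output of a scheduler's step call."""

    prev_sample: List[float]  # sample for the previous timestep, fed into the next step


def step(model_output, timestep, sample, return_dict=True):
    # Placeholder update rule: subtract the model output element-wise.
    prev_sample = [x - m for x, m in zip(sample, model_output)]
    if not return_dict:
        return (prev_sample,)
    return ToySchedulerOutput(prev_sample=prev_sample)


output = step([1.0, 0.5], 10, [3.0, 2.0])
next_input = output.prev_sample  # named access instead of positional unpacking
```

Returning a named container lets a scheduler add further fields later without breaking callers that only read `prev_sample`.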

### Implemented Schedulers

#### Denoising diffusion implicit models (DDIM)

Original paper can be found [here](https://arxiv.org/abs/2010.02502).

[[autodoc]] DDIMScheduler

#### Denoising diffusion probabilistic models (DDPM)

Original paper can be found [here](https://arxiv.org/abs/2006.11239).

[[autodoc]] DDPMScheduler

#### Variance exploding, stochastic sampling from Karras et al.

Original paper can be found [here](https://arxiv.org/abs/2206.00364).

[[autodoc]] KarrasVeScheduler

#### Linear multistep scheduler for discrete beta schedules

Original implementation can be found [here](https://github.com/crowsonkb/k-diffusion/blob/481677d114f6ea445aa009cf5bd7a9cdee909e47/k_diffusion/sampling.py#L181).


[[autodoc]] LMSDiscreteScheduler

#### Pseudo numerical methods for diffusion models (PNDM)

Original paper can be found [here](https://arxiv.org/abs/2202.09778).

[[autodoc]] PNDMScheduler

#### Variance exploding stochastic differential equation (SDE) scheduler

Original paper can be found [here](https://arxiv.org/abs/2011.13456).

[[autodoc]] ScoreSdeVeScheduler

#### Variance preserving stochastic differential equation (SDE) scheduler

Original paper can be found [here](https://arxiv.org/abs/2011.13456).

<Tip warning={true}>

Score SDE-VP is under construction.

</Tip>

[[autodoc]] schedulers.scheduling_sde_vp.ScoreSdeVpScheduler