add gradient checkpointing #20

@SauravMaheshkar

Description

Running flux-schnell currently requires 40 GB of VRAM (thus, an A100 or higher is needed). Gradient checkpointing could enable inference on smaller GPUs.
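
For context, here is a minimal sketch of the idea behind gradient checkpointing, written in plain Python rather than against this project's model code. The toy chain of `tanh` layers and the names `layer`, `layer_grad`, `grads_full`, `grads_checkpointed`, and `segment` are all hypothetical and exist only for illustration. The forward pass keeps one activation per segment and the backward pass recomputes the rest, trading extra compute for a lower peak number of stored activations. The saving comes from activations held for the backward pass, so it is worth checking how much of the 40 GB is activations rather than weights.

```python
import math

# Hypothetical toy "network": a chain of scalar layers y = tanh(w * x).
def layer(w, x):
    return math.tanh(w * x)

def layer_grad(w, x):
    """Return (dy/dx, dy/dw) for y = tanh(w * x)."""
    s = 1.0 - math.tanh(w * x) ** 2
    return s * w, s * x

def grads_full(ws, x0):
    """Baseline: store the input of every layer during the forward pass."""
    xs = [x0]
    for w in ws:
        xs.append(layer(w, xs[-1]))
    g, gws = 1.0, [0.0] * len(ws)
    for i in reversed(range(len(ws))):
        dx, dw = layer_grad(ws[i], xs[i])
        gws[i] = g * dw   # gradient of the output w.r.t. ws[i]
        g *= dx           # propagate the gradient to the previous layer
    return xs[-1], gws, len(ws)  # len(ws) layer inputs were stored

def grads_checkpointed(ws, x0, segment=4):
    """Store only the input of each segment; recompute the rest on demand."""
    n = len(ws)
    checkpoints = {}
    x = x0
    for i, w in enumerate(ws):
        if i % segment == 0:
            checkpoints[i] = x
        x = layer(w, x)
    out = x
    peak = len(checkpoints)
    g, gws = 1.0, [0.0] * n
    # Walk the segments from last to first, rebuilding inputs inside each one.
    for start in reversed(range(0, n, segment)):
        end = min(start + segment, n)
        xs = [checkpoints[start]]
        for i in range(start, end - 1):
            xs.append(layer(ws[i], xs[-1]))  # recomputed, not stored earlier
        # xs[0] is already counted among the checkpoints.
        peak = max(peak, len(checkpoints) + len(xs) - 1)
        for i in reversed(range(start, end)):
            dx, dw = layer_grad(ws[i], xs[i - start])
            gws[i] = g * dw
            g *= dx
    return out, gws, peak

if __name__ == "__main__":
    ws = [0.5 + 0.1 * i for i in range(16)]
    _, _, stored_full = grads_full(ws, 0.3)
    _, _, stored_ckpt = grads_checkpointed(ws, 0.3, segment=4)
    print("stored activations:", stored_full, "vs", stored_ckpt)
```

In a real model this would normally be done with the framework's built-in wrapper, for example `jax.checkpoint` in JAX or `torch.utils.checkpoint.checkpoint` in PyTorch, rather than by hand.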

Metadata

Assignees: No one assigned
Type: No type
Project status: Todo
Milestone: No milestone
Relationships: None yet
Development: No branches or pull requests