See Hugging Face’s activity on LinkedIn

ML @ Hugging Face 🤗

Users of `torch.compile`. Some small performance tips: 1. Default to `fullgraph=True` to catch graph breaks as early as possible. 2. Check for recompilation triggers. Put your code under `torch._dynamo.config.patch(error_on_recompile=True)` context. 3. Use regional compilation almost always to cut down cold-start timing significantly. Graph-breaks and frequent recompilations can easily come in the way of performance. Eliminate them as much as possible. In Diffusers, we have a dedicated test suite for checking these things. Reference: https://lnkd.in/gK3DqscU

diffusers/tests/models/test_modeling_common.py at 941b7fc0843139e52419a65b7fa850169fde0360 · huggingface/diffusers

github.com

2 Comments

Daniel Svonava

Vector Compute @ Superlinked | xYouTube

How do you balance fullgraph=True for catching breaks early with the risk of slowing down development feedback loops?

2 Reactions

To view or add a comment, sign in

Hugging Face’s Post

diffusers/tests/models/test_modeling_common.py at 941b7fc0843139e52419a65b7fa850169fde0360 · huggingface/diffusers

github.com

More from this author

What you may have missed from the 🤗 open source community gathering in Paris 🕹️

Accompagnement renforcé de la CNIL et protection des données "by design" 🤗

Explore topics