C4G-HKUST commited on
Commit
d0973b6
·
1 Parent(s): e69c1a8

Add free-tier user limitation note: Fast mode can generate ~6s two-person video max

Browse files
Files changed (2) hide show
  1. README.md +5 -4
  2. app.py +1 -1
README.md CHANGED
@@ -212,11 +212,12 @@ python app.py
212
  #### Generation Modes
213
  The Gradio demo provides two generation modes:
214
 
215
- - **Fast Mode (up to 240s GPU budget)**:
216
- - Fixed 12 denoising steps for quick generation
217
  - Suitable for single-person videos or quick previews
218
  - Lower GPU usage quota consumption
219
- - The 240s is the maximum GPU allocation time (budget), not the actual generation time
 
220
 
221
  - **Quality Mode (up to 720s GPU budget)**:
222
  - Custom denoising steps (adjustable via "Diffusion steps" slider)
@@ -225,7 +226,7 @@ The Gradio demo provides two generation modes:
225
  - The 720s is the maximum GPU allocation time (budget), not the actual generation time
226
  - With 40 denoising steps, approximately 10 seconds of video can be generated
227
 
228
- **Design Rationale**: Multi-person videos generally have longer duration and require more computational resources. To achieve better quality, especially for complex multi-person interactions, more denoising steps and longer GPU allocation time are needed. The Quality Mode provides sufficient Usage Quota (up to 720 seconds) to accommodate these requirements, while the Fast Mode offers a quick preview option with fixed 12 steps for faster iteration. Note that the GPU duration values (240s/720s) represent the maximum budget allocated, not the actual generation time.
229
 
230
 
231
 
 
212
  #### Generation Modes
213
  The Gradio demo provides two generation modes:
214
 
215
+ - **Fast Mode (up to 210s GPU budget)**:
216
+ - Fixed 10 denoising steps for quick generation
217
  - Suitable for single-person videos or quick previews
218
  - Lower GPU usage quota consumption
219
+ - The 210s is the maximum GPU allocation time (budget), not the actual generation time
220
+ - **For free-tier users: Fast mode can generate approximately 6 seconds of two-person video at most; longer videos may timeout.**
221
 
222
  - **Quality Mode (up to 720s GPU budget)**:
223
  - Custom denoising steps (adjustable via "Diffusion steps" slider)
 
226
  - The 720s is the maximum GPU allocation time (budget), not the actual generation time
227
  - With 40 denoising steps, approximately 10 seconds of video can be generated
228
 
229
+ **Design Rationale**: Multi-person videos generally have longer duration and require more computational resources. To achieve better quality, especially for complex multi-person interactions, more denoising steps and longer GPU allocation time are needed. The Quality Mode provides sufficient Usage Quota (up to 720 seconds) to accommodate these requirements, while the Fast Mode offers a quick preview option with fixed 10 steps for faster iteration. Note that the GPU duration values (210s/720s) represent the maximum budget allocated, not the actual generation time.
230
 
231
 
232
 
app.py CHANGED
@@ -768,7 +768,7 @@ def run_graio_demo(args):
768
  )
769
  gr.Markdown("""
770
  **Generation Modes:**
771
- - **Fast Mode (up to 210s GPU budget)**: Fixed 10 denoising steps for quick generation. Suitable for single-person videos or quick previews. The 210s is the maximum GPU allocation time, not the actual generation time.
772
  - **Quality Mode (up to 720s GPU budget)**: Custom denoising steps (adjustable via "Diffusion steps" slider). Recommended for multi-person videos that require higher quality. The 720s is the maximum GPU allocation time, not the actual generation time. With 40 denoising steps, approximately 10 seconds of video can be generated.
773
 
774
  *Note: The GPU duration (210s/720s) represents the maximum budget allocated, not the actual generation time. Multi-person videos generally require longer duration and more Usage Quota for better quality.*
 
768
  )
769
  gr.Markdown("""
770
  **Generation Modes:**
771
+ - **Fast Mode (up to 210s GPU budget)**: Fixed 10 denoising steps for quick generation. Suitable for single-person videos or quick previews. The 210s is the maximum GPU allocation time, not the actual generation time. **For free-tier users: Fast mode can generate approximately 6 seconds of two-person video at most; longer videos may timeout.**
772
  - **Quality Mode (up to 720s GPU budget)**: Custom denoising steps (adjustable via "Diffusion steps" slider). Recommended for multi-person videos that require higher quality. The 720s is the maximum GPU allocation time, not the actual generation time. With 40 denoising steps, approximately 10 seconds of video can be generated.
773
 
774
  *Note: The GPU duration (210s/720s) represents the maximum budget allocated, not the actual generation time. Multi-person videos generally require longer duration and more Usage Quota for better quality.*