feat(gradio_app): implement dynamic concurrency limit based on VRAM
- Add get_concurrency_limit function to calculate concurrency limit based on VRAM
- Update clean_vram function and rename to get_vram for better clarity
- Apply concurrency limit to the to_markdown function in the Gradio app
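
For illustration only, a minimal sketch of how such a limit might be derived. The signatures, the 4 GB per-job estimate, and the cap of 8 are assumptions and are not taken from this commit; only the function names get_vram and get_concurrency_limit come from the message above.

```python
def get_vram() -> float:
    """Return free GPU memory in GB, or 0.0 if it cannot be queried."""
    try:
        import torch  # optional dependency; assumed, not confirmed by the commit

        if not torch.cuda.is_available():
            return 0.0
        free_bytes, _total_bytes = torch.cuda.mem_get_info()
        return free_bytes / 1024**3
    except Exception:
        # torch missing or the CUDA query failed: report no usable VRAM
        return 0.0


def get_concurrency_limit(
    free_vram_gb: float,
    vram_per_job_gb: float = 4.0,  # hypothetical per-request estimate
    max_limit: int = 8,            # hypothetical upper bound
) -> int:
    """Map free VRAM to a number of concurrent jobs, always at least 1."""
    if vram_per_job_gb <= 0:
        raise ValueError("vram_per_job_gb must be positive")
    if free_vram_gb <= 0:
        return 1  # no GPU memory reported: process one request at a time
    jobs = int(free_vram_gb // vram_per_job_gb)
    return max(1, min(max_limit, jobs))


limit = get_concurrency_limit(get_vram())
```

In Gradio 4.x, event listeners accept a concurrency_limit argument, so a value like `limit` could be passed where to_markdown is registered; how this commit wires it up is not shown here.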