Comfyui speed up github. 5 is normal, but it's very slow when using the XL model.
Comfyui speed up github New. Acknowledgements frank-xwang for creating the original repo, training models, etc. There is no progress at all, ComfyUI starts hogging 1 CPU core 100%, and my computer becomes unusably slow (to the point of freezing). Since updating on September 3rd, generations have become extremely slow, but I have a suspicion as to why. A small 10MB default model, 320n. What sampler/scheduler are you seeing the speed increase with? Euler - Simple, try using --force-fp32 or --force-fp16 and if no improvement then --use-split-cross-attention in my GPU cloud service, it takes ~40 seconds to launch ComfyUI. model, The zooming speed is too fast, I'm just gently scrolling the mouse. change some code for lowram,The inference speed of 4070 12G is 20 times faster than before(21s at 20 steps ); 修改了一些代码,目前4070 12G how to increase speed GGUF model? GGUF model 4 step one image generated time 34 second 6gb model but unet 6gb model generated time 18/19 second city96 / ComfyUI-GGUF Public. Old. More than 100 million people use GitHub to discover, fork, and contribute to over 420 million projects. 32s (37. Install the ComfyUI dependencies. Inputs. I posted all the various GPU speeds in the 2B main question thread (still opened in that forum) Feature Idea Allow memory to split across GPUs. js; Search for scale *= 1. 9-8it/s model size ~1. This node adapts the original model and inference code from nudenet for use with Comfy. Best. Write better code with AI Security. Find and fix vulnerabilities Expected Behavior The inference speed and VRAM usage should have remained the same. Follow these steps for fully custom prediction: You will need to use the sampling > prediction > Sample Predictions node as your sampler. Launch ComfyUI by running python main. core. Added "no uncond" node which completely disable the negative and doubles the speed while rescaling the latent space in the post-cfg function up until the sigmas are at 1 (or really Follow the ComfyUI manual installation instructions for Windows and Linux. upd. I have two accelerators, and I used to rely on one with fast internet speed for updating ComfyUI and plugins. Also adds a 30% speed increase. Not su fp16: whether to load model in fp16. - Speed up Sharpen node. During this time, ComfyUI will stop, without any errors or information in the log about the stop. I hope someone finds this useful. (App Already Does It Every Time You Open It, If You Are Using comfyui. regardless of which upscale model - experienced slow speed/inactivity with models like the 4xUltraSharp and 4xFFHQDAT 使用 Kaggle 白嫖 ComfyUI 什么是 Kaggle? 这不妨碍我们白嫖 GPU 算力~ 先上效果图。 使用 ComfyUI 1. Notifications You must be signed in New issue Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community. All my point was managing them individually can easily get impractical. 24. Vast. · comfyanonymous/ComfyUI@e0c0029 The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. Already have an account? 👍 1 reacted with thumbs up emoji The ODE Solver node is available at sampling > custom_sampling > samplers > ODE Solver. github. For ComfyUI / StableDiffusion. 04. 0-runtime-22. ai has gi The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. Actual Behavior The inference speed is about 20% slower and VRAM usage is lower as well. ComfyUI executes ClipTextEncode, after a while computer hangs for 3sec, then automatically reboots (with --lowvram) ComfyUI-cpu is a trimmed down version of ComfyUI that only uses the cpu. An example workflow is in examples/avoid_and_erase. 输出中有一行被笑脸 Emoji 包起来的以 pinggy. Getting 1. 7gb 64% speed increase tensorrt dynamic speed 7. Turns out that with "flux1-dev-bnb-nf4" is a new Flux model that is nearly 4 times faster than the Flux Dev version and 3 times faster than the Flux Schnell version. GitHub community articles Repositories. · comfyanonymous/ComfyUI All custom nodes are provided under Add Node > sampling > prediction. Unless you're planning on running a public server I guess there's not really much information here. It should be at least as fast as the a1111 ui if you do that. 0, 2. the area for the sampling) around the original mask, in pixels. That should speed things up a bit on newer cards. I have successfully load the vision understanding fuction of the GLM4 in COMFYUI. · comfyanonymous/ComfyUI@4ee9aad Write better code with AI Code review What should I do to improve speed and shorten time? thanks. Collaborate outside of code Write better code with AI Code review. · comfyanonymous The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface. The only issue if you do this is that the CFG function will weight your positive prediction times your CFG scale against nothing and you will get a black image. Conditioning deltas are conditioning vectors that are obtained by subtracting one prompt conditioning from another. (7900 XTX, 32GB RAM, Windows 10, Radeon 24. in my local computer, it takes ~10 seconds to launch, also it has wayyy more custom nodes. this is of course when the catvton's models are already loaded. Every time I start comfyui the 1st image is processed quickly. Better compatibility with third-party checkpoints (we will continuously collect compatible free third GitHub community articles Repositories. ; invert_mask: Whether to fully invert the Comfyui windows portable, fully up to date 13900k, 32 GB ram windows 11 h2 4090 newest drivers works fine with 1. 1 with a larger number like 1. You signed in with another tab or window. 51s → 0. DocVQA allows you to ask questions about the content of document images, and the model will provide answers based on the visual and textual information in the document. · comfyanonymous/ComfyUI@58c9838 Up to 28. · comfyanonymous/ComfyUI@ae197f6 Can be added after any node to clean up vram and memory - T8star1984/comfyui-purgevram The most powerful and modular stable diffusion GUI with a graph/nodes interface. 1 driver). bat is used. Stable-Fast Unet compilation to change a model's weights without triggering a recompilation while still keeping the speed benefits from a compiled model. 7 Public Latest comfyanonymous / ComfyUI Public. 7 gb 60% speed increase. ComfyUI design patterns and model management is used where possible now. 7g of VRAM, with a peak of about 16g of RAM, and both of them are at about the same speed, and the reduction of video memory usage doesn't seem to have been as much as I Saved searches Use saved searches to filter your results more quickly context_expand_pixels: how much to grow the context area (i. 📖 Nodes reference. json Using the full workflow with faceid, until 60 seconds, the drawing did not start, and all nodes were working at a very slow speed, which was very frustrating. 5 for a faster zooming in speed, or use a smaller number like 1. Added support for onnxruntime to speed-up DWPose (see the Q&A) Fixed TypeError: expected size to be one of int or Tuple[int] or Tuple[int, As mentioned here after many tests ltdrdata/ComfyUI-Inspire-Pack#135: Everything works fine when I have these commits: ComfyUI: 17bbd83 Inspire pack: ltdrdata/ComfyUI-Inspire-Pack@cf9bae0. There is no need to make a new pattern, it is a very ba Contribute to Comfy-Org/ComfyUI-Mirror development by creating an account on GitHub. Sign up for a free GitHub account to open an issue and contact its maintainers and the community. i have roughly 100 ComfyUI extensions installed. nvidia. 20K subscribers in the comfyui community. 56/s. Controversial. I tried different GPU drivers and nodes, the result is always the same. ***************************************************** "bitsandbytes_NF4" custom --dont-upcast-attention Disable upcasting of attention. When using one LORA, I didnt notice a drop in speed (Q8). It's not obvious but hypertiling is an attention optimization that improves on xformers / etc. Open 4lt3r3go opened this issue Dec 20, 2024 · 0 comments Open speed #191. comfy stable Four specialized nodes for freeing memory: Free Memory (Image): Cleans up memory while passing through image data Free Memory (Latent): Cleans up memory while passing through latent data Free Memory (Model): Cleans up memory while passing through model data Free Memory (CLIP): Cleans up memory while passing through CLIP model data Attempts to free comfyanonymous / ComfyUI Public. If you have two gpus this would be a massive s 90 votes, 23 comments. 1 is grow 10% of the size of the mask. 选择 gpu -》 run all 3. · comfyanonymous/ComfyUI@ae197f6 Contribute to JettHu/ComfyUI_TGate development by creating an account on GitHub. b. The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. # This is the GitHub Workflow that drives automatic full-GPU-enabled tests of all new commits to the master branch of ComfyUI You can also try setting this env variable PYTORCH_TUNABLEOP_ENABLED=1 which might speed things up at the cost of a very slow initial run. checkpoint : the model you select, zero123-xl is the lates one, and stable-zero123 claim to be the best, but licences required for commercially use. Note Convert Model using stable-fast (Estimated speed up: 2X) Train a LCM Lora for denoise unet (Estimated speed up: 5X) Training a new Model using better dataset to improve results quality (Optional, we'll see if there is any need for me to do it ;) Continuous research, always moving towards something better & faster🚀 Write better code with AI Security. - comfyorg/comfyui GitHub is where people build software. context_expand_pixels: how much to grow the context area (i. Contribute to nonnonstop/comfyui-faster-loading development by creating an account on GitHub. Contribute to huanngzh/ComfyUI-MVAdapter development by creating an account on GitHub. - Try to speed up the test-ui workflow. 1-dev and CogVideoX-5b(-I2V). The model speed of SD 1. The most powerful and modular diffusion model GUI and backend. Am I doing anything wrong? I thought I got all the settings right, but the results are straight up Up to 28. context_expand_factor: how much to grow the context area (i. It can be done without any loss in quality when the sigma are low enough (~1). py). Sign up for free to join this conversation on GitHub. If you experience any issues you did not have before, please report them so I can fix them quickly! Notable changes: Slightly lower VRAM usage (0. The node calculates the cosine similarity between the u-net's conditional and unconditional output ("positive" and "negative" prompts) and once the similarity crosses the specified threshold, it sets CFG to 1. sdxl model 768x1024 I am having a problem with very slow generation speed when using AutoCFG. Notifications You must be signed in to change notification settings; By clicking “Sign up for GitHub”, as soon as I switch back to CheckpointLoaderSimple, my generation speeds shoot back up to 3-5it/s. bf Write better code with AI Code review. 8k. Sort by: Best. - How to use PyTorch 2. e. 点击 "Open In Kaggle" 转到 kaggle 2. The main disadvantage compared to the alternatives I mentioned is that it is relatively slow and VRAM hungry since it requires multiple iterations at high res while Deep Shrink/HiDiffusion actually speed up generation while the scaling effect is active. Please allow us to use GPU's in series for greater speeds in image and animation generations, when it comes to running multiple gpu's. bat if you are using the standalone. bat, So This Way It Is Always Up To Date With Whatever Is On My GitHub Page) Only use comfy manager to update the Points, segments, and masks are planned todo after proper tracking for these input types is implemented in ComfyUI. 25% faster) Try using an fp16 model config in the CheckpointLoader node. If it isn't let me know because it's something I need Run ComfyUI with --disable-cuda-malloc may be possible to optimize the speed further. ttulttul opened this issue May The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. Note that --force-fp16 will only work if you installed the latest pytorch nightly. 25 votes, 14 comments. Beta Was this translation helpful? Give feedback. · comfyanonymous/ComfyUI@ae197f6 The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface. My assumption was the filename prefix loop or the repeated regex. From initial testing, the filtering effect is better than classifier models such as The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. --fp16-vae Run the Speed up the loading of checkpoints with ComfyUI. All Plan and track work Discussions. I needed to run comfyui on cpu only device so I created this fork of ComfyUI optimized for cpu usage. If you have another Stable Diffusion UI you might be able to reuse the dependencies. Notifications You must New issue Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the I'd also suggest trying out the "HyperTiling" node under _nodes_for_testing. json. Reload to comfyui v2-rocm-6. kijai / ComfyUI-MMAudio Public. · comfyanonymous/ComfyUI@58c9838 The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. Face Masking feature is available now, just add the "ReActorMaskHelper" Node to the workflow and connect it as shown below: A ComfyUI custom node designed for advanced image background removal and object segmentation, utilizing multiple models including RMBG-2. Actual Behavior. This enhances the user experience and processing The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface. I'm seeing iterations go from 2-3s/it to 40-70s/it+ Running on an i9 11900k, 32GB Ram, NVidea RTX 4070 12GB (I know I'm kinda pushing it on VRam so not sure if this sampler is just a bit more strict with VRam requirements) Keybind Explanation; Ctrl + Enter: Queue up current graph for generation: Ctrl + Shift + Enter: Queue up current graph as first for generation: Ctrl + Alt + Enter You can also try setting this env variable PYTORCH_TUNABLEOP_ENABLED=1 which might speed things up at the cost of a very slow initial run. On Windows using directml I get 1it/s, usually less, using (py -3 The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. Steps to Reproduce Used a default SDXL workflow with lora Debug my catvton takes about 12 seconds to generate an image on A100 GPU. https://developer. 1. Contribute to ccssu/ComfyUI-Workflows-Speedup development by creating an account on GitHub. 8. - Speed up inference on nvidia 10 series on Linux. 3-0. With four LORA, the speed drops x3. But with two or more, the speed drops several times. 0, INSPYRENET, BEN, SAM, and GroundingDINO. Thanks to city96 for active development of the node. 7 seconds in auto1111 with 512x512 20 steps euler comfy gets me 3 seconds to do same image with same settings, thats half the speed, and its pretty big slowdown from auto1111 Any chance t When I use the single file version of FP8, generating a 1024*1024 graph takes up about 14g of VRAM, with a peak of 31g of RAM; when I use the nf4 version, it takes up about 12. Nodes for better inpainting with ComfyUI: Fooocus inpaint model for SDXL, LaMa, MAT, and various other tools for pre-filling inpaint & outpaint areas. You signed out in another tab or window. · comfyanonymous/ComfyUI Add the node via Ollama-> Ollama Image Describer. HyperTiling increases speed as the image size increases. Speed up the loading of checkpoints with ComfyUI. When I generate the base pic it usually takes 20-30 seconds to generate one, and now it bar cubiq / ComfyUI_essentials Public. 70it/s. 11. 0 flows, but sdxl loads the checkpoints, take up about 19GB vram, then pushes to 24 GB vram upon running a prompt, once the prompt finishes, (or if I cancel it) it just sits at 24 GB until I close out the comfyui command prompt The problem is that everyone has different configurations, and my ComfyUI setup was a mess. 2. I'm talking about after the container spins up. ai. · comfyanonymous The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. You can open up your favorite terminal and activate it: this command line setting to disable the upcasting to fp32 in some cross attention operations which will increase your speed. 📢 REGARDING KEEPING THE APP UP TO DATE. Reload to refresh your session. Any pointers on what I could do to speed this up? cheers. · comfyanonymous/ComfyUI@58c9838 use_kv_cache: Enable kv cache to speed up the inference seed: A random seed for generating output. onnx, is provided. Notifications New issue Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community. For instance if some has a bank of gpu's they could run them together to complete a generation. So, you are only seeing ComfyUI crash, or are you seeing your video card disappear from the PCIe bus as well? Anyuser could use their own API-KEY to use this fuction - JcandZero/ComfyUI_GLM4Node. 8GB) depending on workflow; Motion model caching - speeds up consecutive sampling ComfyUI was started with --lowvram --disable-all-custom-nodes. Contribute to Comfy-Org/ComfyUI-Mirror development by creating an account on GitHub. · comfyanonymous/ComfyUI@58c9838 Keybind Explanation; Ctrl + Enter: Queue up current graph for generation: Ctrl + Shift + Enter: Queue up current graph as first for generation: Ctrl + Alt + Enter: Cancel current generation: Ctrl + Z/Ctrl + Y: Undo/Redo stuck issue sample. . sampler: euler scheduler: normal. · comfyanonymous/ComfyUI@ae197f6 The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. - Speed up fp8 matrix mult by using better code. What comfy is talking about is that it doesn't support controlnet, GLiGEN, or any of the other fun and fancy stuff, LoRAs need to be baked into the "program" which means if you chain them you begin accumulating a I'm experiencing slow network speeds when using ComfyUI, especially during operations that involve downloading data. I'll preface by saying that I updated the GGUF loader and ComfyUI at the same time, so I'm not 100% sure which is to blame. Sign up for GitHub By clicking Open the file in a text editor: ComfyUI\web\lib\litegraph. --force-fp16 Force fp16. So if you test the 2B and get a certain speed - copy the same switches (all off really) and re-run to match in the i2v. In ubuntu I am getting around 10it/s with my 6900xt on default settings (py -3. model: Select one of the models, 7b, 13b or 34b, the greater the number of parameters in the selected model the longer The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. link 结尾的链接,打开此链接并选择 Enter site。 附图: I have a dual boot system Ubuntu/Win11. if using higher or lower than 1, speed is only around 1. The problem was solved after the last update, at least on Q8. If you have another UI installed and working with its own python venv you can use that venv to run ComfyUI. 2it/s model size ~1. Contribute to kijai/ComfyUI-FramerWrapper development by creating an account on GitHub. Notes Only parts of the graph that have an output with all the correct inputs will be executed. Contribute to asagi4/comfyui-utility-nodes development by creating an account on GitHub. 10 main. However, a Saved searches Use saved searches to filter your results more quickly Contribute to kijai/ComfyUI-HunyuanVideoWrapper development by creating an account on GitHub. 0, effectively skipping negative prompt calculations and speeding up inference. Added "no uncond" node which completely disable the negative and doubles the speed while rescaling the latent space in the post-cfg function up until the sigmas are at 1 (or really By clicking “Sign up for GitHub”, I just did one that are 3 minute at first generation, and normal speed on upscale and face detailer, but then I redo the same setting, prompt, same seed and it got slower replacing the input image 1 time slowed down the processing. 5 and 2. Run ComfyUI with --disable-cuda-malloc may be possible to optimize the speed further. I also noticed there is a big difference in speed when I changed CFG to 1. Manage code changes I haven't used it for a while, and today when I used ComfyUI to render images, it became very slow. Open Song367 opened this issue Jun 5, 2024 · 3 comments Open How The nodes containes an "unload_model" option which frees up VRAM space and makes it suitable for workflows that requires larger VRAM space, like FLUX. · comfyanonymous/ComfyUI@4ee9aad Custom nodes for using MV-Adapter in ComfyUI. This provides more context for the sampling. 1 gb tensorrt static speed 8. 10%-50% speed up for different diffusion models. There has been a number of big changes to the ComfyUI core recently which should improve performance across the board but there might still be some bugs that slow This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository. ; The sampler input comes from sampling > custom_sampling > samplers. I am using fp16 precision. com Open. ComfyUI Cuda Toolkit 12. How to control video motion speed #56. i don't know what the average number of nodes is for each, most have at least 3-4 nodes in them, some have 20+. Improved expression consistency between the generated video and the driving video. main The speed is the same though - about 2 seconds per it. --force-fp32 Force fp32 (If this makes your GPU work better please report it). enable it can speed up and save GPU mem. This has a very slight hit on inference speed and zero hit on memory use, initial tests indicate it's absolutely worth using. ComfyUI Flux Accelerator can generate images up to 37. Is there a way to get the speed of CheckpointLoaderSimple while being able to set clip skip to 2? Saved searches Use saved searches to filter your results more quickly Experimental use of stable-video-diffusion in ComfyUI - kijai/ComfyUI-SVD An extension to integrate ComfyUI workflows into the Webui's pipeline - ComfyUI CLI arguments · ModelSurge/sd-webui-comfyui Wiki Important. Its features include: a. Top. Avoid using the update function from the manager, instead use git pull, which we are doing on every start if start. Notifications You must be signed in to change New issue Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community. · comfyanonymous/ComfyUI@4ee9aad Make sure you update ComfyUI to the latest, update/update_comfyui. 5 Python 3. Wonder if this might have anything to do with below warning or is it just my 6GB VRAM to little for this node? Sign up for a free GitHub account to open an The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. py . 12/08/2024 Added HelloMemeV2 (select "v2" in the version option of the LoadHelloMemeImage/Video Node). py --force-fp16. However, the generation speed drops significantly with each added LORA. - Speed up hunyuan dit inference a bit. rebatch image, my openpose - AInseven/ComfyUI-fastblend GitHub community articles Repositories. 5 model (realisticvisionV51) resolution 512x768 base speed 5it/s model size ~4. Note: running this command line non-comfyui nets you roughly 8min if all configured properly. The result of this is a latent vector between the two prompts that can be added to another prompt at The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. In the same case, just delete the node The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface. This speeds up your generation speed by two for the steps where there is no negative. 2024-11-11. · comfyanonymous/ComfyUI@e0c0029 I think I remember somewhere you were looking into supporting tensortRT models, is that still in the backlog somewhere? or would implementing support for tensortRT require too much rework of the ex The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. just for example, i personally install nodes (in practice, currently most are node packs) that seem like they may be useful. RAM Speed Power (W) Python PyTorch +cu Xformers Time To Render (s) 4070 Ti Super: 4070 I haven't used Comfyui since the last day of the last month and I just use it again yesterday but I notice a huge difference in generation speed. 5% speed increase with my latest "automatic CFG" update! In short: Turning off the guidance makes the steps go twice as fast. com/comfyanonymous/ComfyUI/issues/1992. Q&A. I have a 3080 GPU, but it takes 9 seconds to infer from one image to text. A100 didn't support the fp8 types and presumably at some point TransformerEngine will get ported to Windows / integrated Hi, is there a chance to speed up the installation process? Unfortunately the environment uses only one cpu core for the pip install process, which can take a long time (up to 2 hours) depending on the instance of vast. Hold spacebar, cursor turns into the hand icon, then click and drag to pan canvas, when done panning release spacebar. ReActorBuildFaceModel Node got "face_model" output to provide a blended face model directly to the main Node: Basic workflow 💾. sd1. images: Image that will be used to extract/process information, some models accept more than one image, such as llava models, it is up to you to explore which models can use more than one image. Notifications You must be signed in to change notification settings; Fork 6. Anything to speed up my workflows. 1; Replace the 1. Topics Trending Collections Enterprise (720p, 24G video memory, batch_size can be adjusted to 40), speed up about 40%. Notifications You must be signed in to change notification settings; By clicking “Sign up for GitHub”, Sign in to your account Jump to bottom. Share Add a Comment. Updates following these commits crash the KSamplerAdvancedProgress //Inspire node when using AYS Scheduler with LCM sampler. Note FreeU and PatchModelAddDownscale are now supported experimentally, Just use the comfy It now has a ComfyUI extension: https://github. Notifications You must be signed in to change notification New issue Have a question about this project? Sign up for a free GitHub account to open an issue and kijai / ComfyUI-HunyuanVideoWrapper Public. 5 is normal, but it's very slow when using the XL model. Sign up for GitHub it improves speed for Flux fp8 on 40xx cards, people have reported anywhere from 10% to 40% practical speed difference. TGate Apply. g. 04-v0. The choice of scheduler will generally make no difference, since the adaptive solvers only take a start point and an end point, and those should always be 1 and 0 respectively for SD3. 05 for a slower zooming in speed. 30it/s with these settings: 512x512, Euler a, 100 steps, 15 cfg and ComfyUI with the same settings is only 9. A bit ago I tried saving in batches asynchronously and then changing the date metadata post-save so everything was in their correct order, but couldn't get the filename/date stuff right and gave up. 24: Updated to latest ComfyUI version. speed #191. Please match every other software out there to pan the canvas. the area for the sampling) around the original mask, as a factor, e. 25% faster than the default settings. - Speed up TAESD preview. It never overheats and honestly generating in comfyui for hours on auto queue doesn't give the same load as benchamrk, I mean comfyui doesn't give a constant 100% load like a synthetic tests or other rendering, or gaming at high resolutions like 5120. Sign up for GitHub Use Torch primitives for Gaussian blur to vastly speed it up #41. If you wish to use other models from that repository, download the ONNX model and place it in the models/nsfw directory, then set the appropriate detect_size. Sign up Reseting focus. 4lt3r3go opened this issue Dec 20, 2024 · 0 comments Comments. A1111 generates an image with the same settings (in spoilers) in 41 seconds, and ComfyUI-Workflows-Speedup. com/gameltb/ComfyUI_stable_fast. You switched accounts on another tab or window. 5% faster generation speed than normal; Negative weighting; 05. pythongosssss / ComfyUI-WD14-Tagger Public. 5 version and CUDA to modify related code to speed up image generation · Issue #5535 · comfyanonymous/ComfyUI By clicking “Sign up for GitHub”, when running this, it seems abnormally slow. Topics Trending Collections Enterprise comfyanonymous / ComfyUI Public. com/blog/unlock-faster-image-generation-in-stable-diffusion-web-ui-with-nvidia-tensorrt/ Is anyone Up to 28. 12/17/2024 Support modelscope (Modelscope Demo). · comfyanonymous fastblend for comfyui, and other nodes that I write for generate video. In Forge i can run it with around 4s/it in comfy it only runs with around 6,5-7s/it. Will provide feedback later if you like Speed is fairly comparable between models for me usually only going up a percentage for FP16 Dev. With the arrival of Flux, even 24gb cards are maxed out and models have to be swapped in and out in the image creation process, which is slow. control_after_generate: Seed value change option every time it runs. If you get an error: update your ComfyUI; 15. Manage code changes Contribute to kijai/ComfyUI-DynamiCrafterWrapper development by creating an account on GitHub. Find and fix vulnerabilities Thank you, it works. T-GATE implementation for ComfyUI. I specifically wrote simple tips or took others in the public domain and used the DPM++ 2M sampler and others, in the end I came to the conclusion that I have a strange problem, in all models I have very poor detailing of characters even at a short distance, I used Contribute to asagi4/comfyui-utility-nodes development by creating an account on GitHub. Topics via a URL, then user could use this function to chat with GLM4 agent. Hey im just wondering why i get in comparison to comfyui such a speed gain with the normal flux dev model. Contribute to JettHu/ComfyUI_TGate development by creating an account on GitHub. 40 which is what I normally get with SDXL. - 1038lab/ComfyUI-RMBG cd ComfyUI/custom_nodes git clone https: Good balance between speed and accuracy; Effective on both simple and complex scenes; This fork includes support for Document Visual Question Answering (DocVQA) using the Florence2 model. Here are some examples (tested on RTX 4090): 512x512 4steps: 0. FWIW, I always use a batch size of 3 as batching offers a reasonable speed boost. The FLUX model took a long time to load, but I was able to fix it. Welcome to the unofficial ComfyUI subreddit. I'm using 512 * 768 image for both clothes and a model, so not a big image. Contribute to kijai/ComfyUI-HunyuanVideoWrapper development by creating an account on GitHub. See bottom section for ELI5. See https://github. 6k; Star 61. Sign up for GitHub By clicking These two examples demonstrate the different speed of generation with the same I use Stabiliti Matrix and I tested in ComfyUI and Automatic 1111 with different models such as the perfect world etc. You can also try setting this env variable PYTORCH_TUNABLEOP_ENABLED=1 which might speed things up at the cost of a very slow initial run. The prompt enhancer is based on this example from THUDM convert_demo. · comfyanonymous/ComfyUI@ae197f6 I can confirm, everything false still sees extremely slow save speed. Generally you'll use KSamplerSelect. Can boost speed but increase the chances of black images. Open comment sort options. Sign in to your account Jump to bottom. window, chrome comfyanonymous / ComfyUI Public. The crash usually happens when ComfyUI visually executes ClipTextEncode but when running it on it's own it doesn't seem to be the issue. To use Just leave ComfyUI and wait 6-10 hours. ; fill_mask_holes: You signed in with another tab or window. · comfyanonymous/ComfyUI@58c9838 A1111 gives me 10. Contribute to Fannovel16/comfyui_controlnet_aux development by creating an account on GitHub. Note FreeU and PatchModelAddDownscale are now supported experimentally, Just use the comfy node normally. My PC Specifications: Processor: Intel The fact it works the first time but fails on the second makes me think there is something to improve, but I am definitely playing with the limit of my system (resolution around 1024x768 and other things in my workflow). hvsisleqbkgixzpttwogpexnpjfdfbhtauocqwtklmpiarmayxl