/g/ - Technology

Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106625151

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
AniStudio: https://github.com/FizzleDorf/AniStudio

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2122326
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
File: 773.jpg (3.55 MB, 3072x3072)
Seedream thread
>>
hello, im a newbabretardfuckingidiotmongoloid who just started messing with onetrainer, is this the base model for sdxl i should download?

https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0/tree/main
>>
>>106628597
i wish you the best of luck on your quest, newbabretardfuckingidiotmongoloid !
>>
what nag node do i use
>>
>>106628619
but you didnt answer the question AAAAAIIIIIIIIIIIIIEEEEEEEEEEEEEEEEE

i'll figure it out ;)
>>
File: api won.png (30 KB, 1019x329)
Total ComfyCloud API Node Victory
>>
File: 1739222244170329.png (838 KB, 1160x896)
>>
>>106628594
this scares the chroma foot faggot kek
>>
>>106628640
How does a 14B model need that much memory to run?
>>
>>106628642
computer, add a sonichu medalion without changing the rest of the image
>>
>>106628594
from the thumbnail, i thought it was a giant ribbed dildo.
>>
For the lulz here's all 68 images from previous in a single collage.
>>
>>106628650
VRAM requirements increase to widen the SaaS moat. Do not let those filthy localhoards cross!
>>
>>106628640
>64 × 180 GB = 11,520 GB
is this a joke or something?
>>
>>106628640
>5B is 20 gigs
>14B needs 11TB
??
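
For scale, a rough weights-only estimate. This is a sketch under stated assumptions: the bytes-per-parameter values are the standard dtype sizes, `weight_gb` is a made-up helper name, and activations, text encoder and VAE are ignored. Reading the quoted figure as 64 units of 180 GB each is also an assumption, since only the multiplication itself is given.

```python
# Weights-only memory estimate. Ignores activations, text encoder, VAE, etc.
BYTES_PER_PARAM = {"fp32": 4.0, "bf16": 2.0, "fp8": 1.0}


def weight_gb(params_billions: float, dtype: str) -> float:
    """Approximate weight size in GB (1 GB = 1e9 bytes). Hypothetical helper."""
    return params_billions * 1e9 * BYTES_PER_PARAM[dtype] / 1e9


for size in (5, 14):
    for dtype in ("fp32", "bf16", "fp8"):
        print(f"{size}B @ {dtype}: {weight_gb(size, dtype):.0f} GB")

# The quoted total is just the product of the two numbers in the quote.
quoted_total_gb = 64 * 180
print(f"quoted total: {quoted_total_gb} GB")
```

By this estimate 5B at fp32 is about 20 GB and 14B at bf16 is about 28 GB, so the 11,520 GB figure is hard to read as the weight footprint of a single copy of the model.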
>>
>>106628640
I guess he means to serve every request? Because otherwise this makes zero sense.
>>
>>106628669
>>106628671
Just stop thinking about it, you cant run it regardless so lets all just calm down and subscribe to ComfyUI API.
>>
File: 1742337294651098.png (762 KB, 1088x952)
>>
>>106628597
If you pick the SDXL preset, the field will be filled in automatically, and when you start training for the first time it will automatically download the model
>>
>>106628691
I mean poland and serbia are really white, but no one want to go there lol
>>
>>106628622
none theyre all snake oil
>>
>>106628694
i already downloaded every file here

https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0/tree/main

so i'll find out soon enough
>>
File: 1737620177815893.png (981 KB, 904x1152)
>>
>>106628642
Is this the banana
>>
>>106628704
i installed it and it's not, it works as advertised. what i'm not understanding now is why you would use it instead of using one of the speed loras that let you use cfg>1, since nag basically doubles gen times. maybe i'm missing something though
>>
>>106628744
nope qwen + this https://civitai.com/models/1934100/anime-to-realism?modelVersionId=2189067
>>
File: 00020-1543133919.png (768 KB, 896x1152)
>>
https://huggingface.co/fredconex/SongBloom-Safetensors
https://github.com/fredconex/ComfyUI-SongBloom
we got Suno at home?
https://files.catbox.moe/96i90x.flac
https://files.catbox.moe/olajtj.flac
>>
>>106628662
Lovely
>>
Can you train the same lora with same settings and datasets and get different results or does retraining do nothing?
>>
>>106628827
Not bad, from these samples it's better than Ace-Step 1.0 (will 1.5 ever be released?)

Wonder what the range of music is, and most importantly if it can be effectively finetuned with other music
>>
>>106628865
>Wonder what the range of music is
you can put a real music in there and it'll remix it, I find it fun to play with
>>
>>106628873
Does it require music input ?
>>
>>106628895
it's not mandatory
>>
File: 1735825321643662.mp4 (994 KB, 640x640)
>>
File: RA_NBCM_00012.jpg (617 KB, 1872x2736)
>>
File: ComfyUI_00291_.jpg (1.74 MB, 4096x2048)
hunyuan image looks better now in comfy

https://github.com/comfyanonymous/ComfyUI/pull/9882
>>
>>106628863
If you are talking about the same model, a training run with the same dataset will make pretty much the same lora at the same epoch.
>>
does nag not work with chroma flash?
>>
>>106628938
Yeah same model. So it's just about resolution and scheduler?
>>
File: mistyloop-420_69.mp4 (784 KB, 624x954)
>>106628733
thank god
someone finally trained a lora that i actually WANT (sure feels like forever ;D)
>>106628642
>he makes posts like this but gets mad at migusama ;3
>>106628662
n e a t !
>>106628691
ew!
>>
>>106628931
>hey look Comfy I fixed your implementation of that model!
I bet 20 dollars he won't merge it, again
https://github.com/comfyanonymous/ComfyUI/pull/7965
>>
File: 1737490255548824.jpg (1.76 MB, 4096x2242)
>>106628931
impressive, that mf saved HunyuanImage
>>
more and more people realizing comfyui is no longer about local models
>>
File: 1750960818031515.mp4 (1.36 MB, 672x480)
The man in the red tie raises his arms, and the various trays of fast food on the table in front of him float in the air.

behold my power!
>>
File: LET HIM COOK.gif (383 KB, 220x147)
https://xcancel.com/LodestoneE621/status/1968687032605065528#m
>Peak GPU mem: 16,139 1,736 MB (on dummy mlp forward pass)
>Speed ratio: 0.99× (compute & comms perfectly interleaved)
GET OUT!
>>
>>106628999
I don't know how he managed to stay healthy after spending 80 years of his fatass life eating only McDonald's.
>>
>>106628999
that looks like something i shat out with sd1.5 circa 2023 what are you DOING nigger?
>>
>>106628926
Nice Huke style.
>>
>>106629000
So what’s the obvious drawback he is choosing to ignore? Because we know from chroma that there were many
>>
>>106628982
No, it's only you, since all local models are supported, often within hours, and it even supports local models that aren't even close to being finished training

There is a LOT to complain about when it comes to Comfy, but it's not local model support, which is stellar

Go lie somewhere else
>>
>>106629023
Sorry meant this for rocketpajeet
>>
>>106629041
truth nuke, and I say this as someone who don't really like this autistic bitch
>>
File: 1728846963539395.mp4 (1.41 MB, 672x480)
>>106629023
the source image isn't very good/high quality.
>>
>>106629046
didn't notice him, pretend i quoted him too.

>>106629053
wan can take images from the 1900s and turn them into (masterpiece:1.5), its just (You)
>>
after the botched implementation of hunyuan image and chroma, more and more people are switching to better UIs where output quality comes before model quantity
>>
>>106629000
Has the furry solved the 'too little vram' problem ?

Big if true
>>
>>106629053
McDonald Trump
>>
>>106629000
as much as I question the furry's decisions, he honestly seems like a good researcher
>>
>>106629000
How is this gonna work when even DDR5 is glacially slow compared to gpu vram?
>>
>>106629053
the amount of salt this originally caused will always be so funny to me
>>
>>106629053
So this is it. This is the true power of Americans. My god..
>>
>>106629088
He's not a fucking researcher. He just half-bakes shit he saw in a paper or that some guy on discord forwarded to him, and never explains his reasoning before autistically moving on to the next snakeoil.

Why am I the only person who sees this?
>>
>>106629131
>Why am I the only person who sees this?
you aren't, haven't you noticed the amount of seething everytime he made a retarded move on chroma's training? kek
>>
>>106629000
isnt this what flash attention does? you can only move things from ram->vram so fast, i dont see how this will work
>>
>>106629131
You aren't
>>
>>106629095
It's funny because I don't know anyone in the entire history of ever who didn't love going to McDonalds after playing sports as a kid. Hell, even as an adult.
Nobody wants to be sucking down on weird french shit after a big game.
>>
https://github.com/comfyanonymous/ComfyUI/pull/9898
>Reduce Peak WAN inference VRAM usage
>The first git commit alone improves performance some and the second further increases it. I standardized on 1024x1024 for the image size and varied the frames. Before the changes the maximum number of frames it can handle is 49 and this increases it to 65 for my setup.
based
>>
>>106629142
>WAN2.2 I2V 14B Q4_K_S GGUF + lightx2v 4steps LoRA (based on video_wan2_2_14B_i2v template) 1024x1024x61frames video generation
>wan q4
>61 frames
grim, also is 1024x1024 one of the "officially" listed resolutions? i dont think so
>>
>>106629163
>grim, also is 1024x1024 one of the "officially" listed resolutions? i dont think so
it's not, I guess he just wanted to do a test
>>
>>106629131
>he just half-bakes shit he saw in a paper or that some guy on discord forwarded to him, and never explains his reasoning before autistically moving on to the next snakeoil
that's what most professional researchers do
>>
>>106629139
fav timeline: finding out the supersize me guy gained weight and was destroying himself from being a wastoid\boozer not from eggmcmuffs

>free unlimited mcdonalds when they first launched the app
>i ate so much mcdond i should be dead
>i lost around 3lbs kek
eggmcmuff has real egg >;3


good, simple, pure, fun times
>>
what does that have to do with image generation? nb4 spergout
>>
>>106629180
They also document their shit.
>>
>>106629193
the images generated were about\of mcdond and the time trump was being cheeky

its too bad about the face detail from far away
i would imagine things will finetune\tighten in the next few quarters\months
>>
>>106629186
>supersize me guy
The guy was an absolute fraud.
Tbh, I think McDonald's gets way too bad a reputation for no real reason. Their chicken McNuggies are as close as you get to bare basic "Human food" and I don't mean that in a bad way.
>>
File: RA_NBCM_00015.jpg (895 KB, 1872x2736)
>>
>>106629135
No, Flash attention is all about keeping things in the GPU as much as possible

This is about being as efficient as possible when you have to offload parts of a model to ram. Offloading itself is not a new optimization; it's been around in most trainers and inference tools for quite a while. The difference is that this claims to have near-zero overhead

If the claims hold up, this would be enormous
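
A toy timing model of why overlapping the transfer with compute matters. This is a sketch under stated assumptions, not how the linked code works: the function names and every timing below are hypothetical, and it assumes the model runs block by block with the next block prefetched while the current one computes.

```python
# Toy model of block-wise offloading. All timings are hypothetical.

def sequential_time(n_blocks: int, compute_s: float, transfer_s: float) -> float:
    """Copy a block to VRAM, then compute it, strictly one after another."""
    return n_blocks * (compute_s + transfer_s)


def overlapped_time(n_blocks: int, compute_s: float, transfer_s: float) -> float:
    """Prefetch block i+1 while block i computes (double buffering)."""
    # The first transfer cannot be hidden and the last compute has nothing
    # left to overlap with. Every step in between costs whichever of
    # compute/transfer is slower.
    return transfer_s + (n_blocks - 1) * max(compute_s, transfer_s) + compute_s


n, compute = 40, 0.05  # made-up: 40 blocks, 50 ms of compute each
for transfer in (0.04, 0.10):  # made-up: transfer faster vs slower than compute
    seq = sequential_time(n, compute, transfer)
    ovl = overlapped_time(n, compute, transfer)
    print(f"transfer={transfer:.2f}s sequential={seq:.2f}s "
          f"overlapped={ovl:.2f}s no_offload={n * compute:.2f}s")
```

In this model the overhead only gets close to zero while moving a block takes no longer than computing it. Once the transfer is the slower side, the run is bound by ram-to-vram bandwidth no matter how well the two are interleaved.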
>>
>>106629186
You cannot pay me enough to try an egg mcmuffin
>>
>>106629220
>If the claims hold up, this would be enormous
like, he's using a paper to make this node or something?
>>
File: caitBURGERbb.mp4 (938 KB, 768x958)
>>106629222
you can make one at home with a cookie cutter and egg and 2 slices of cheddar...
surely you eat breakfast sandwiches anon ;3
>>
>>106629211
>vegan girlfriend guy was a drunk fraud
WOW hahahah
>>
File: FluxKrea_Output_2115151.png (876 KB, 1024x1024)
>>
>>106629220
>No, Flash attention is all about keeping things in the GPU as much as possible
>This is about being as efficient as possible when you have to offload parts of a model to ram
These are not different. The highest efficiency possible IS to keep everything in vram as much as possible and offload only when you need to. The gpu is about 10x faster than ram, and moving data from one to the other has a big cost too, so this can't really be anything new. I doubt it's even a better FA
>>
>>106628594
Miyazaki chill
>>
>>106629142
just tested it, went from 22.3gb of usage to 21.7gb, it's not much but I'll take it
>>
>>106629252
speed diff?
>>
>>106629239
bigot
>>
>>106629261
it's the same, but now I can make it slightly faster by offloading less to the ram I guess
>>
>>106629243
>These are not different
Yes they are. Flash Attention optimizations are ALL about optimizing WITHIN the GPU. It has the benefit of fitting more into vram, but it has no strategy whatsoever for when it doesn't fit into vram

Offloading optimizations are specifically for when it doesn't fit into vram

So no, they are very different
>>
>>106629131
Anyone who has ever trained the exact same NSFW concept dataset in a lora at both 512x512 and 1024x1024 on Flux for the same number of epochs is aware that the rate of anatomy errors will always be drastically higher with the 512x512 one unless you actually inference at 512x512. Chroma simply didn't get anywhere remotely close to enough training at 1024x1024.
>>
File: FluxKrea_Output_224245.jpg (3.82 MB, 1664x2496)
>>
>>106629272
You're right, I must have confused FA with some other tech I read about a long time ago
>>
File: 1743399892034104.mp4 (1.79 MB, 640x640)
>ACK!
>>
>>106629327
BASED
>>
>>106629285
More HD training would have been exponentially more expensive, but I'm not sure it's that. The HD version is fucky and slops prompts, but the anatomy is noticeably better than Base. Base is still better because you can fix a fucked hand with more steps and better prompting.
>>
Is there an ideal resolution ratio I should set for WAN videos to not come out fuzzy as shit or otherwise lose their shit in Comfy gens?
>>
>>106629233
I'll lose the cheese and double-side fry the egg
also streaky bacon
cheese and egg, especially cheddar, do not mix imho
>>
>slow mo video
>>
>>106629327
fk yeah
>>
>>106629303
Flux Krea full or fp8?
>>
>>106629351
just dont use anything below q8 and it shouldnt be a big problem, but anyway, 720x1280 or 1280x720
>>
File: 1740997136112356.mp4 (1.78 MB, 640x640)
wan is slowly making the troon more female
>>
i take back what i said many threads ago, i in fact love chroma again and just had a workflow skill issue. not only that but chroma is really good at inpainting.

i would post an example but my gens are too strong for you, proompter.
>>
File: YWNP.webm (409 KB, 524x362)
>>106629327
>>
>>106629373
kek, it's true

based Wan
>>
>>106628754
>since nag basically doubles gen times
no it doesn't??
>>
>>106628754
NAG is a buffed neg prompt, not a cfg1 hack.
>>
File: 1750673569337511.png (3.42 MB, 3828x1133)
>>106629457
>not a cfg1 hack.
it works for models with cfg 1 though
>>
>>106629465
working with=/=enabling
>>
File: 1741616771274582.png (760 KB, 835x1634)
>SPRO is 1st on the trending page
>HunyuanImage isn't even on the list
I hope Tencent is gonna learn from that and will give us kino next time


