[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106732360

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
>>106735944
>since HunyuanImage is a MoE with 13b active parameters, it means that if you're able to put everything on your gpu memory (good luck with that though), the speed is equivalent to the speed of a 13b model right?
>>
https://huggingface.co/lightx2v/Wan2.2-Lightning/discussions/44#68d977700f72f15523491ed0
>The high-noise composite version is very good. In contrast, the low-noise one is not as good. Please upload the low-noise composite version. I'm really looking forward to it!! Thank you for your hard work!
what does he mean by that?
>>
File: 00128-1830093367.png (2.74 MB, 1536x1536)
2.74 MB
2.74 MB PNG
wtf flux krea is soo censored it refuse to gen large breasts despite multiple og FD breasts loras in positive prompt.
>>
File: is it over?.png (176 KB, 460x310)
176 KB
176 KB PNG
>Alibaba: Went the API route with Wan 2.5 and gave us an edit model that zooms in the image and gives plastic humans
>Tencent: Went the LayerMaxxing way and produces slop no one can run
so this is it? Since China has failed us, there's no cope anymore?
>>
>>106736151
it's a loli model
>>
>>106736151
is there a SINGLE flux model on civit that isn't fucking dogshit? if so, link it
>>
>>106736052
I imagine hunyuan 3.0 is also better for obscure concepts like characters, people, specific types of animals, dogs, specific objects etc. I wonder how many spongebob characters it knows.
>>
It took me like 4 days of trial and error, but I finally got a seamless loop again.
Life is great again.
>>
who the fuck asked
>>
File: ComfyUI_00135_.png (2.79 MB, 1280x1920)
2.79 MB
2.79 MB PNG
>>
>>106736229
welcome back soldier. Let's fill this here internet with so much slop that people can't even find real smut anymore.
>>
>>106736220
>I wonder how many spongebob characters it knows.
I doubt the dataset has great captioning, it has 5 billion images, so they probably used a LLM to caption (and because of that it won't know shit about characters)
>>
Need more Jebby.
>>
>>106736281
Yes, saar!
>>
File: 00135-2758733666.png (2.63 MB, 1536x1536)
2.63 MB
2.63 MB PNG
>>106736162
>>106736175
tested this, still very censored. Definitely looks like BFL cracked up the censorship at the prompt level. Nipples looks worst than qwen lmao.
https://civitai.com/models/1830497/flux1-krea-dev-uncensored
https://www.reddit.com/r/StableDiffusion/comments/1me2l80/new_flux_model_from_black_forest_labs_flux1kreadev/
>>
>>106736369
i'll have to try it but this doesnt look like it can do anything outside of 1girl, naked.
>>
>>106736155
Chinese people don't have an angel on their shoulder to guide them, just two devils like jews
>>
>>106736384
>just two devils like jews
the more I spent time on earth, the more I believe in that christcuck story, Eve really ate the fucking apple and we're being punished for that :(
>>
>>106736423
if god were benevolent he wouldnt actively allow the dire consequences of adam and eve's sin to be plaguing us
>>
wan 3.0 when
>>
>>106736441
God is really the most racist motherfucker of them all, one human fucked it up, and all humans must suffer in consequence lmao
>>
>>106736452
>>106736441
Hardship makes you more perfect and allows you to accept the gift of life
>>
>>106736460
>Hardship makes you more perfect and allows you to accept the gift of life
this argument falls apart when there are many examples of people being born with debilitating insanely painful diseases and illnesses that feel nothing but insane pain for a few years and then die.
>>
>>106736451
blame the chinese faggots for not delivering good models anymore, if it was the case we would have our fun with those models instead of this lol
>>
>chang made me do this
dumb faggot
>>
>>106736474
They still got to live and make a mark on the other's around them. Also what does this have to do with generative art often involving women in compromising positions?
Are you some seething disabled Californian that used to post devils and cry about right leaning things 24/7 angry about your self inflicted lot in life?
>>
>>106736490
he's right though, life is unfair, some people are born tall, handsome, and has rich parents, and others are living in fucking Soudan lmao
>>
>>106736490
>They still got to live and make a mark on the other's around them
there are many children that are born that are unwated that get thrown into the garbage or left to die somewhere without anyone knowing and parents never caring, try again.
>>
https://www.reddit.com/r/StableDiffusion/comments/1ntktwg/tbg_enhanced_upscaler_and_refiner_new_version/
ok, who's the schizo who wrote this?
>>
File: 00143-534887680.png (2.67 MB, 1824x1248)
2.67 MB
2.67 MB PNG
>>106736380
i gave up, and will stick to sdxl. Chroma is too random and inconsistent in getting good results. Very difficult to diagnosis the fault of bad outputs and terribly slow even on a 5090.
>>
>>106736460
>the gift of life
you don't get to choose your parents, where you are born, your face, your height, your skin color, your health... how is that a gift?
>>
>>106736220
They said they used 5 billion images. My guess is that a lot of them are Ai-generated, meaning that the model learned a lot of AI artifacts. The model learns whatever is in the data. So no matter how good your architecture is, if your data does not contain what you're looking for, you won't get the expected results. Their model probably learned to generate AI artifacts, which explain why it is not better than the others despite its enormous size.
>>
>>106736512
I'm sure it'll be able to do that god awful 3D style soon don't worry
>>
>>106736530
this, if anything, it's a big model that learned the dataset too well, and it's showing once again how important it is to not have garbage data in the first place, instead of buying gozillions of GPUs to train their model they should've used that money to pay more annotators or some shit
>>
>>106736512
how long is it taking you to gen with chroma on a 5090 and with what settings?
>>
>>106736151
I mean I literally just accidentally got a straight up topless woman with Flux Krea and no Lora with:
`a candid instagram photograph of a Native American woman with very unusually massively large breasts wearing a skimpy sexy Pocahontas costume in a forest at night. She is grinning and facing the camera with her hands on her hips.`
Seed: 639885132113525
Guidance 4.5, Euler Beta, 896x1152
>>
File: 00061-3255851044.png (1.32 MB, 1008x808)
1.32 MB
1.32 MB PNG
>>
fuck natural language prompting, give me danbooru tags
>>
>>106736585
ching chong we use llm to caption, not our proprem white boy
>>
>>106736585
>>106736598
it should be a mix of danbooru tags and natural language, something like this:
"This is an image of [Insert name character] from [Insert anime/video-game series], drawn by [insert artist name], he/she is doing [insert action on the image]"
>>
>>106736611
that's basically chroma with some edit-style direction following.
>>
File: 1735561866042870.png (264 KB, 416x555)
264 KB
264 KB PNG
>>
>>106736598
>he doesnt know
>>
File: pein.png (284 KB, 449x379)
284 KB
284 KB PNG
>1girl, fellatio, on knees, looking at viewer, male pov

>One dame, she is resting her bodyweight on her knees, her eyes are transfixed on the gentleman who towers above her, she is meeting his member with her lips, bringing him pleasure.
>>
>>106736611
>This is an image of
bloat

>drawn by
should just be "by"

>he/she is doing
should be character name for cases with multiple characters but also removes any ambiguity
>>
>>106736616
not chroma, it barely knows anime characters and knows the impressive total of 0 artist tags
>>
I always thought the ideal prompt format would be

1) Title/short overview
2) Common tags, whether booru or photography terms or camera angles or what have you.
3) Detailed Description in natural language

The title/short description for a high level overview, the tags for strongly influencing style and composition, and natural language for guidance afterwards. This also allows for ironic/contradictory titles (fairly common thing to see in museums) or emulating some of the "weird alt-text" creative behavior of 1.5.
>>
>>106736637
>should be character name for cases with multiple characters but also removes any ambiguity
fair enough, use he/she pronouns if it's a solo image, if there's multiple characters, only the character names is enough
>>
thought2image wen
>>
>>106736635
>1girl, sitting, table, chair
how is the model supposed to know she's sitting on the table or the chair? only natural language has this answer
>>
>>106736560
2.5-4 minutes
>>
>>106736625
>HoMO video generation
what a gay model
>>
1beaver, chewing, kitchen chair
>>
>>106736707
impossible, not even Earth's most powerful artists can make this or it would exist already
>>
>>106736635
I mean, "dame" and "gentleman" SHOULD influence the way the fellatio is being portrayed.
>>
>>106736707
dam, I got wood
>>
>>106736654
correct, and neither does sdxl. you're comparing finetunes and loras trained on anime and artist styles to a model that was specifically genericized to follow a mix of tags and natural language prompts.
>>
File: 1740650954091300.png (980 KB, 1451x1181)
980 KB
980 KB PNG
https://xcancel.com/chain_rules/status/1972561714022666628#m
kek, you know it's bad when even twitter stops sucking up your dick
>>
>>106736742
all their releases are improving every time and they are the main leaders in ai space in both image and llms so idk what more do people expect?
>>
So how much longer until we get an anime finetune of chroma
>>
>>106736751
if you're talking about local, then yeah you have a point, they also lead in video
>>
>>106736751
their captioning sucks dick
>>
When updating the requirements for chroma, it won't mess with the stuff I'm already using in comfy, right?
>>
>>106736742
this trolling would make sense if he posted that on Tencent's account, Alibaba is far from being a clown company
>>
>>106736685
have you tried it on comfy? that seems awfully long for a 5090
>>
File: 00085-475584187.jpg (708 KB, 2480x2688)
708 KB
708 KB JPG
>>106736766
I wouldn't do one off of chroma, the fundamental training is fucked
I don't know how you fix this model, the creator decided to just free style shit and the results have been a disaster for the model.
>>
>>106736635
>oh let's test this wildcard pack
>it's all fucking slop purple prose, most concepts don't even work or make no difference even on recommended models

what the fuck is wrong with those people?
>>
File: 00062-3856311199.png (1.29 MB, 1008x808)
1.29 MB
1.29 MB PNG
>>
>>106736783
>their captioning sucks dick
The day a huge dataset with good captions is leaked on the internet will be the day of salvation.
>>
>>106736766
What's really the point when NetaYume is already an ongoing continuation of an actual large scale anime-focused finetune? I'd rather see someone like the bigASP guy do his dataset on Chroma personally.
>>
>>106736816
>ranfaggot is crying like a bitch
>>
>>106736847
Yume is interesting I want it to cook a little more.
>>
File: 00063-1987259455.png (1.69 MB, 808x1008)
1.69 MB
1.69 MB PNG
>>
>>106736766
the future is an edit model that'll be able to reproduce the style of the input images, no need for characters loras anymore, no need for artist loras anymore, just imagine
>>
>>106736825
>ran looking in the mirror
>>
>>106736887
>point to folder
>"Make it like this"
>get gen
>creates entire 3D gaussian splat of scene that can be freely navigated, mostly already rigged for modifying poses, can be relit etc
Inevitable for when AI goes Hollywood.
>>
Someone post in his thread so he can calm down
>>
>>106736917
it will be api-only and very safe, don't worry
>>
File: old gamer.jpg (30 KB, 379x249)
30 KB
30 KB JPG
>make video of hot shemale twerking
>it messes up the balls from behind
>>
>>106736948
>shemale
no one wanted to read this faggot
>>
>>106736948
Cool gen is that Chroma?
>>
>>106736960
>is that Chroma
No it's from RealLifev0.4.safetensors
>>
real life is for sure pickled
>>
>>106736917
that's actually not far from a workflow we've been working on in unreal engine. we're also experimenting with open pose replacements using alternate types of performance capture and rigging data.
>>
File: ChromaDC-2K_00053_.jpg (928 KB, 1632x1848)
928 KB
928 KB JPG
>>106736816
Lora training isn't great either due massive concept bleed. Too bad OneTrainer doesn't have token warmup. SDXL lycoris let's you hammer in several concepts without breaking a sweat.
>>
>>106736955
its not gay if its genned. its a real girl with peanut
>>
>>106737010
Every lora I made gets fucked by swing but fundamentally the tokens are fucked. You can't have a natural language model when common phrases that are the equivalent to 2.0 by default. One anon rang the alarm bells and I apologize for not listening to him.
I hope he's here to see this I was wrong.
>>
>>106737016
>its a real girl with peanut
it's called a troon
>>
>>106736685
>5090
>fp8 model and TE
lol
>>
>>106736034
Can anyone reupload the workflow shared on this patreon post to catbox?
I want to see what tricks he's using but I can't access their crappy website at the moment.
>>
>>106737047
woops forgot the link
>https://www.patreon.com/posts/qwen-image-edit-139876435
>>
File: 1728432315776728.png (1.13 MB, 872x1200)
1.13 MB
1.13 MB PNG
so EA was acquired by saudis today.
>>
>>106736685
Also what Res sampler is that? It can drastically inflate gen times since some of them are really heavy
>>
>>106737069
>woman with red colored hair
lol she would be beheaded for that
>>
>ranfag
>>
Wait, how the fuck do you get the metadata from wan videos onto civitai? It doesn't read it out of the mp4.
>>
>>106737111
not all have the metadata
>>
>>106736880
I enjoyed looking at this today on my lunch break. Thank you for sharing it.
>>
>>106737026
I'm gonna test if token drop improves anything
>>
>>106737117
Well fuck. Can I upload an image and then just swap it out with the video?
>>
>>106737126
Let me know how it works, I just think the token swing is agnostic to anything that can be adjusted with loras alone. You can use a term like selfie even though it wasn't in your dataset and get IRL on a purposefully overbaked anime lora at some seeds.
>>
>>106737007
Yeah, I figured integration with the Unreal Engine virtual set infrastructure was where this was all going. Pretty soon, the live action parts will just be for generating good controlnets, and you'll be able to shift the entire aesthetic of a production by shifting a few IpAdapter v9001 settings, or change a camera angle completely entirely in post because you have generated an entire UE 3D scene instead of just capturing the one image.
>>
File: 1732667834204204.png (1.08 MB, 872x1200)
1.08 MB
1.08 MB PNG
>>106737091
no worries, qwen edit v2 can fix that.

replace the clothes of the character with red hair in the top left with a burqa. replace the clothes of the character with red hair at the bottom right with a burqa.

after swapping some saudi clothes. what a neat tool.
>>
What's the issue with Chroma loras? I thought anons in this thread that trained some said they trained pretty well.
>>
>>106736880
The aesthetic on this changes a lot from thumbnail to image. Very neat up close.
>>
>>106736685
you suck at genning with chroma. stick to sdxl noob.
>picrel took 27 seconds on my 5090
>>
>>106737171
I had no problems training on default Onetrainer settings except I just bumped up the resolution. Anytime I tried settings posted by anyone here the lora turned out shit.
>>
I reduced my wan 2.2 14b i2v dataset to 640x360, and finally, musubi-trainer has stopped OOMing on me. On average, it stays between 25GB and 37GB, though I have seen it spike to around 40GB. I was trying to do 836x480 previously, and I was OOMing on the peaks. If nothing else goes wrong, it'll be done in 5.3 days. This will be an i2v LoRA based on 101 videos.
>>
What's the chroma furry working on right now? Radiance still?
>>
>>106737194
They work better in IRL but not anime, I think chroma is great for IRL if that's your thing but even with bog standard settings the same issues show up when doing a batch
>>
>>106737220
No I only train anime.
>>
>>106737229
Can you post a batch of 5-10 right now?
Something complex not basic with a background, I want to see if it swings or not
>>
>>106734433
>>106735219
>Prompts Chroma with literal slopped tokens designed to work only on booru models that only make the quality of photos look worse (masterpiece, UHd, 4K, high quality, photorealistic, unreal engine, etc...)
>Is surprised when he gets shit
Many such cases.
>>
>>106737238
My Evil lora is decent.
>>
>>106737256
aw. so close to showing panties
>>
File: 1741355671247937.mp4 (1.53 MB, 560x848)
1.53 MB
1.53 MB MP4
>>106737016
>>
>>106737238
Also define complex. Drop me prompt and I'll try to adapt it to it.
>>
>>106737265
KEKW
>>
>>106737256
Cool can I see a batch job result back to back seed?
If you have 4xchanXT it will lower the file size for a painless
Someone holding a sign or taking a selfie in a public place don't need other people but make sure the text is like /ldg/ or something. Just make sure it has a action and text
>>
File: 00064-4170315302.png (1.32 MB, 808x1008)
1.32 MB
1.32 MB PNG
>>
Comfy please add smea dy ++ sampler thank you
>>
>>106737268
>no bulge
gross
>>
File: grid-0002.jpg (779 KB, 4000x2890)
779 KB
779 KB JPG
>>106737268
Something like this but with whatever text you want, I don't want to tell you exactly what you should prompt in the case it's user error on my end
>>
File: 1756184437346646.mp4 (1.94 MB, 560x928)
1.94 MB
1.94 MB MP4
>>
>>106737333
why the fuck would anon look depressed when he gets to come back home to that?
>>
>>106737347
They either have dicks or aids/herpes
>>
>>106737347
because he's a faggot I guess lol
>>
>>106737347
i think it is meant to be a surprise
>>
>>106737347
they're all post op trannies
>>
>>106737347
because they don't have dicks, and that anon hates it that way >>106737016
>>
>>106737347
he's tired, he has to satisfy those 10 women every single day, his dick is burning at this point
>>
File: GYObl25XUAAgQDS.jpg (259 KB, 1907x1584)
259 KB
259 KB JPG
>>106737059
>>106737047
>>
>>106737406
as an engineer I felt that, it's the worst part of my job, but I know it's important to make good documentation so that others can enjoy my craft so...
>>
File: 00065-622211234.png (968 KB, 896x1152)
968 KB
968 KB PNG
>>
File: 1734979048511430.png (293 KB, 500x500)
293 KB
293 KB PNG
I hope they'll quickly make the I2V version as well, I have high hopes on that one since my testing of the new T2V lora
>>
File: 00152-934917361.png (2.36 MB, 1824x1248)
2.36 MB
2.36 MB PNG
>>106737184
good for you, i wasted 4 hours downloading and testing multiple variants and got only 5 ok tier images out 100 slops. What are your settings.
>>
I'm using an illustrious model with reforge, and trying to use inpaint to do text replacements for signs, and such... gibberish to real words, mostly.
Is there a user guide / rentry? I'm getting, at best, mixed to poor results and assume I'm doing something wrong.
>>
>>106737430
>I know it's important to make good documentation so that others can enjoy my craft
It's more like a reminder to yourself more than anything, since all it takes is a few days for your own code/schematics to look alien to you.
>>
File: evilbatch.jpg (610 KB, 2496x1216)
610 KB
610 KB JPG
I'll try the selfie now.
>>
>>106737465
Yeah this is also true. Every time I come back from vacation, it takes me at least 2-3 days to remember what I was doing lol
>>
>>106737347
hollywood film storyboard artist here, anon arrives home, looking depressed, completely unaware of the surprise that awaits him inside
>>
File: 1731019520828160.png (1.08 MB, 872x1200)
1.08 MB
1.08 MB PNG
>>106737166
okay, now all the women are gone, it's saudi EA friendly now.
>>
File: where tendies.png (232 KB, 1840x676)
232 KB
232 KB PNG
>>
File: that's right.png (89 KB, 618x640)
89 KB
89 KB PNG
>>106737528
>hollywood film storyboard artist here
I knew this place was filled with elite people, that's the elite general after all.
>>
>>106737331
Ranfaggot is going insane.
>>
File: evilselfie.jpg (643 KB, 2496x1216)
643 KB
643 KB JPG
Selfie.
>>
>using loss graphs
lel
>>
post pantyshots plz
>>
File: wan22_glued_00018.webm (2.31 MB, 560x928)
2.31 MB
2.31 MB WEBM
>>106737347
can't fix what's broken beyond repair
>>
>>
>>106737593
he's not into hags, he's a man with taste
>>
>>106737593
man is beyond wrecked. he is late stage american empire wrecked. probably going to goon to deepseek cunny
>>
>>106737593
Have him pull out a m16 and blast them.
>>
>>106737562
>>106737481
nta but Neuro design is kinda all over the place there. Doesn't really seem like took the training well? not hating just curious
>>
File: ComfyUI_temp_aibvn_00014_.png (1.43 MB, 1024x1024)
1.43 MB
1.43 MB PNG
>>106737621
Because her fanart is also all over the place and I didn't want to just overcook it with stream screenshots.
>>
>>106737638
Is chroma that sensitive? I trained a lot of illustrious LoRAs with iffy level of fanart quality and got pretty consistent Loras.
>>
File: Screenshot.jpg (251 KB, 1418x709)
251 KB
251 KB JPG
>>106737445
>>
>>106737663
I only trained two character loras so far so idk. Needs more data. And maybe with a 40-50 pics dataset this is what you get. The rest was fetish gear.
>>
https://github.com/comfyanonymous/ComfyUI/pull/10085/files
will I get some speed improvement if I switch to cuda 130?
>>
>>106737698
Don't do it. It will try to also update torch to 2.10 and break everything
>>
File: Qwan_00001_.jpg (644 KB, 1984x2976)
644 KB
644 KB JPG
So, Comfy's not adding Hunyuan 3.0 support?
I wanted to try that...
>>
>>106737713
Only the latest and greatest API models from now on, no open source for the foreseeable future.
>>
File: 1741894346421404.png (91 KB, 1644x740)
91 KB
91 KB PNG
>>106737713
>I wanted to try that...
you have 240gb of vram?
>>
>>106737739
you don't? the poorfag general is next door anon
>>
File: 1758559618688340.mp4 (654 KB, 480x480)
654 KB
654 KB MP4
>no sage attention 3 for 40 series cards
>*clears throat*

rrrrrrRRRRRRREEEEEEEEEEEEEEEEEEEEEEEEEEEE
>>
>>106737753
Can I use a q0.001?
>>
File: 1747107394785064.jpg (650 KB, 2000x2000)
650 KB
650 KB JPG
>>106737758
it's time to upgrade anon, and remember, the more you buy, the more you save
>>
>>106737758
don't feel too bad just think of the anons still using a 3090
>>
>>106737758
Isn't sageattention 3 literally dogshit?
>>
>>106737435
do you fall out with nigbo Nicholas?
>>
>>106737758
honestly just sell the 40 series card and upgrade to a 50 series at this point. Theyre at msrp and are twice as fast (or more) for ai gen stuff, and some stupid gamer will buy your card and subsidize your ai lust.
>>
File: ComfyUI_temp_nkjid_00006_.png (3.77 MB, 1536x1152)
3.77 MB
3.77 MB PNG
when is the 5070TiS coming out?
>>
>>106737791
I am that anon, which corner should I use to cry?
>>
>>106737671
Chroma is too confusing, there are like 100 models out there, each with their own schizo workflow and schizo sampler, too many merges/loras, no wonder most people hate it, honestly its a mess, there is no official WF, each iteration of a model seems to come up with different shit, like why they can decide for one text encoder, noo, there is flan, gner, t5, blah blah, at least with other models you get kind of similar structure when it comes to workflows, with chroma is the opposite way, and I havnt even mentioned the many new confusing models that the author is regurgitating daily , radiance shit, 2k crap, I mean c'mon, stick with one thing that works instead of and releasing crap different every day

One good thing about Chroma is how easy is to train loras
>>
File: 00066-1167593117.png (1.68 MB, 1152x896)
1.68 MB
1.68 MB PNG
>>
>>106737791
Why would I think about them
>>
>>106737791
>don't feel too bad just think of the anons still using a 3090
it's me :(
>>
>>106737758
even if you had a 5090, SA3 has some noticable quality degradation, I didn't sign for this
>>
>>106737834
Only radiance needs differrent workflow because of the memenodes. Everything else is just plug and play.
>>
File: 1746366563237266.mp4 (454 KB, 366x650)
454 KB
454 KB MP4
Genuinly how is it possible for the absolute majority of people to literally be brain dead tier cattle to not understand how even native fp8 let alone fp4 support is basically worthless when it won't be actually used since its a dogshit quant compared to Q8?

The 4/5xxx+ series sunk cost fallacy copers who spent 2000-8000$ + (foreskin) tip for their native fp8 support just to generate images not even 3x faster but in much shittier quality than those on a 400-700$ 3090. Q8 is basically fp16 while even fp8_scaled (and God forbid naive fp8) is quite different (read, worse) every time

>inb4 b-b-but fp8_scaled CAN look OK!!!
Yeah, you can RNG your way into something that looks OK since images have a high capacity of containing error but in places where it doesn't matter which ultimately can make gens look OK or even in rare cases actually better visually. But none of that is relevant when you going away from the base full quality bf/fp16 model is objectively gonna be worse in general and especially for details/more complex prompt understanding the overwhelming majority of the time
>>
what did he mean by this
>>
>>106737886
this anon is speaking facts
>>
Wasn't the fp4 the snowflake scaling?
>>
>>
>>106737886
>Genuinly
Looks like I quanted this word to fp8 by accident
>>
>>106737886
Is that video from Wan?
>>
>>106737803
>50 series
I need retard power, the 6000 will be mine
>>
File: dmmg_0142.png (1.48 MB, 832x1216)
1.48 MB
1.48 MB PNG
>>106737184
prompting for that low angle is an actual pain the ass
>>
>>106737998
kys
>>
https://huggingface.co/lora-training-frenzi
Free training for a week
>>
>>106734449
meanwhile in the diffusion community...
>JUST STACK MOAR LAYERS BRO
>>
File: 1751307404494304.png (773 KB, 928x1120)
773 KB
773 KB PNG
>>
>>106738020

Maximum 5000 steps runs
Runs with more than 6h will timeout
Train 1 LoRA at a time
Do NOT train LoRAs on likeness of people you don't have consent for or on NSFW stuff
>>
File: 00067-180155300.png (2.56 MB, 896x1152)
2.56 MB
2.56 MB PNG
>>
>>106737562
>>106737481
Chroma things yeah that slide is why I stopped using it
>>106737663
Yes chroma is that sensitive, I was expecting a style lora not a character lora, at least it can do that
>>
>>106737886
I would use Q8 instead of fp8 but apparently it's all gguf and that requires custom nodes. Am I really missing out?
>>
>>106738103
you just install this custom node and you're good to go
https://github.com/city96/ComfyUI-GGUF
>>
>>106738115
Thank you, I'll take a look.
>>
>>106737957
so glad you plan to buy nvidia! The more you buy, the more you save!
>>
>>106738051
>Do NOT train LoRAs on likeness of people you don't have consent for or on NSFW stuff
I wonder how long it is until social media profiles have integrated "use lora" buttons for profiles, both text and image. That's more a cultural shift then a technological one, but imagine explaining the internet landscape of today to someone in 2005.
>>
>>106737886
fp8 scaled is good though. at least on qwen, I tested it vs q8 and fp8 scaled had no noticeable difference
>>
File: dmmg_0114.png (1.62 MB, 832x1216)
1.62 MB
1.62 MB PNG
>>106738019
K
>>
>>106737739
Mit dem Quant Steiners wird das alles in Ordnung kommen
>>
>>106738137
Aaieeee flux face
>>
File: Adelbert Steiner - ff9.png (433 KB, 686x386)
433 KB
433 KB PNG
>>106738138
>Quant Steiner
I know his ancestor
>>
>>106738202
Model?
>>
>>106738225
SCPH-1001
>>
why does this wanimate custom node need to be in a pack with 200 nodes I'll never use
I thought native was supposed to be native, as in no third party nodes. Kijai's wrapper is as native as this shit is
>>
>>106736585
I've never umderstood an inability to describe what you want with natural language. Like, literally just describe it, lol. You can't have a concept of it in your brain without its natural langiage description.
>>
File: 1748549731858748.png (33 KB, 300x302)
33 KB
33 KB PNG
>>106738240
lmao
>>
File: 00068-329534175.png (2.53 MB, 896x1152)
2.53 MB
2.53 MB PNG
>>
>>106738306
>describe how i see it in my brain
>WHOOPS i used a word or iteration the model didn't fucking understand so now the subject is bending over in a 40 degree angle and her spine just cracked in fifty places
it's dogshit when i could just tag dynamic_pose and literally get what i asked for (or a variation)
a blend CAN work its just everyone's definition of what good tagging and good natural language descriptions are vary.
>>
>>106738240
kek
>>
>>106738318
>i used a word or iteration the model didn't fucking understand
that's not the fault of the natural language, it's the fault of the trainers for not making the model learn that word in the first place
>>
>>106738318
dynamic_pose is literally just
>X person is in a dynamic pose
Is it an action pose?
>X person is in a dynamic action pose
You can be more specific and what's best Chroma gives most variety so you don't need to change the description each time to get different poses.
>>
Tags are a type of jargon. There is no true separation between natural language and jargon, just a difference in context. In the context of "terminally online weeaboos" and "photographers" etc, tags are jargon that describe a specific way they want the image to look.
>>
>>106738328
not entirely wrong, it's frustrating because tags are usually a "by popularity" thing so something isn't usually trained because nobody uses it vs "well how did the people who trained this decide natural language should sound?"
natural language has its place, its just subjective in its own way vs tagging.

>>106738366
Heh, you had me in the first half you fuck i'll give you that.
>>
>put 'masterpiece' in prompt
>result isn't a masterpiece
explain
>>
>>106738385
You forgot to add 1girl
>>
>>106738385
>>106738389
>use 1girl
>gen is shit
>use solo
>gen is slightly better
>use "artistic masterpiece solo portrait by Leonardo Davinci"
>result is a masterpiece
use case?
>>
>>106738366
In fact, you can describe a very specific and precise pose, including what is not possible with booru tag prompting, and that is with multiple characters involved. An example is a woman is leaning against a wall, her leg is up, and a man from below is licking her pussy (like the library pic I posted a while back with Chroma).
>>
>>106738400
>>use 1girl
>>gen is shit
That's your misinterpretation
>>
File: 00069-1317266978.png (1.5 MB, 1152x896)
1.5 MB
1.5 MB PNG
>>
>>106736826
It's so expansive to build that I doubt it will happen before we have tools to largely automatize it (and still get a great DS).
>>
>>106736151
>>106736369
I feel so safe anon, thanks to BFL using so many man hours to hide unsafe tits.
>>
blacked forest labs haha lol
>>
>>106738382
Not the AI's fault you have no concept of how to properly describe what you want, nor are you willing to put in minimal effort to learn. If you post an example pic, I'd figure it out in less than a minute.
>>
>>106738516
I'd post this nice picture of your mum i just got done yoinking over but it'd be considered doxxing and i won't do her like that.
>>
so fp8 scaled vs q8 difference is how different? file size is almost the same for qwen edit for example.
>>
>prompt for woman sucking cock
>get tranny autofellatio
Cool model chromaxisters
>>
>>106738539
straightfags are a minority in the ai space
>>
>>106738549
Dang. It's over for vaginabros
>>
File: 00005-1368344300.png (1.6 MB, 1024x1024)
1.6 MB
1.6 MB PNG
never again..
>>
>>106738539
>neg:
>shemale, tranny

Skill issue.
>>
>>106738539
>uses a model made by a tranny
>is surprised that it has a lot of troon shit in there
ngmi
>>
File: Gvs0LExXoAAUnMP.jpg (253 KB, 1685x2048)
253 KB
253 KB JPG
>>106737059
>>106737047
Help a fren
>>
>>106738596
go shill on reddit faggot
>>
>>106738596
go shill on reddit faggot but also go nuts https://kemono.cr/patreon/user/80482103
>>
File: 1758870712363075.png (1.09 MB, 872x1200)
1.09 MB
1.09 MB PNG
>change the clothes of the males into white saudi attire
>change the clothes of the females into a black burqa
>change the clothes of the character with the white hair on the top right into a black burqa.
to celebrate the EA Saudi purchase: and yeah q8 seems better than fp8 scaled.
>>
File: GZJ9s4cXkAAtBvP.jpg (591 KB, 2480x2480)
591 KB
591 KB JPG
>>106738616
I love you anon
>>
>>106738629
There needs to be at least one of them strapped up with TNT.
>>
>>106736659
I'll try that, thanks Anon.
>>
File: aedgvbaezdgvbeadzgvbea.png (140 KB, 226x232)
140 KB
140 KB PNG
>>106738668
this fella is obviously
>>
File: 1743406533872871.png (1023 KB, 1360x768)
1023 KB
1023 KB PNG
the blonde woman in the black dress is sitting on one of the chairs on the left of the room, with her legs crossed.

I got two aya's. but note it keeps the ps1 (or emulator) style. pretty cool.
>>
>>106738485
I hope their next model will be even safer and completely be lobotomized of women as a concept in general
>>
>>106738677
Add a desk with books and file folders on it to the room in low polygon style. The blonde woman in the black dress is sitting on one of the chairs at the desk. remove the blonde woman from the center of the room.

pretty cool. last part of prompt is to remove the second aya.
>>
File: 00070-2221154205.png (2.2 MB, 896x1152)
2.2 MB
2.2 MB PNG
>>
File: 1727748634573447.png (963 KB, 1360x768)
963 KB
963 KB PNG
>>106738707
helps if I attach the image too.
>>
>>106738677
hard to see the improvement if you don't showcase the original image, use this
https://github.com/BigStationW/Compare-pictures-and-videos
>>
File: 1728702151505944.png (381 KB, 1659x933)
381 KB
381 KB PNG
>>106738720
thanks, for now i'll just post the original reference of the ps1 game.
>>
>>106738677
>>106738718
>>106738728
no loras here ladies and gents. It's like one half of the thread investigates chinese failures, and the other half is reporting its successes. Equal failures as in successes.
i trust this plan.
>>
How the fuck do I use booru tags to generate people under bed covers entirely, retaining their body shape without removing body tags entirely? I have not even seen loras for it and have been trying to do it forever.
>>
File: 1733632086683234.png (968 KB, 1168x888)
968 KB
968 KB PNG
>>106738695
blows my mind bfl would rather kill their models with censorship than god forbid, allow a female to be generated without body horror proportions.

thank god for wan/qwen and china in general not caring about "muh ethical ai"
>>
>>106738746
honestly, make your gen in noob/illustrious or whatever, then use qwen edit to add a blanket over them. might work with qwen edit v2
>>
File: 1731676418864514.png (51 KB, 672x438)
51 KB
51 KB PNG
>>106738728
you can also use a node that stick the 2 images together on comfyui, that one is the best
https://github.com/Eagle-CN/ComfyUI-Addoor
>>
File: 1755572216068847.png (951 KB, 1288x808)
951 KB
951 KB PNG
>>106738770
tested on a random image and it does in fact work. source is just an asian girl in a bikini lying down. try this prompt in qwen edit:

the girl is covered by a white blanket up to her chest.

did quite well actually.
>>106738776
ty
>>
File: 1728177915243304.png (976 KB, 1288x808)
976 KB
976 KB PNG
>>106738788
the girl is covered by a white blanket from her legs up to her waist.

pretty cool model, works well with any anime or realistic gens. this would be very hard to do even with inpainting cause it would have to blend with the rest of the image too.
>>
>>106738826
Black Forest Labs did not like that.
>>
is nunchaku qwen edit worth it?
>>
>>106737069
it's more like the trump family with saudi money, lol. it's funny to see feminist studios, being bought by the same patriarchy they always hated
>>
>>106738837
>we must lobotomize our model cause someone might generate a girl in a bikini on grass
>>
>>106738846
meh, fuck them, the chink models don't annoy us with this bullshit, those who want to die on a hill will die on that hill, and the world will continue to spin
>>
File: ADTINAPINTGUIDE.png (26 KB, 523x733)
26 KB
26 KB PNG
Hello /ldg/,
I want to ask your inpainting users for opinions.
I did research and checked out basically all the UIs out there and their tools for inpaint, none of them have everything for proper image editing, in some point they're all missing some important thing.
We should lay out what each UI can and can't do upfront and look at them what they actually can do in terms of real features.
If you catch any mistakes or have suggestions let me know so I can update the guide!

I don't actually use Comfy by itself (though Krita and Swarm are in the table and they run Comfy as a backend), but I want to add it to the comparison chart. So let me know what Comfy can do on its own!
>>
>>106738881
money talks, models that do stuff people like will generate money, models that suck will lead to bankruptcy.
>>
>>106738910
where's anistudio?
>>
>>106738936
busy generating error logs
>>
>>106738910
Why not AniStudio?
>>
>>106738936
gooning
>>
>>106738910
faggot why not ani
>>
File: 1728577777337281.png (1.5 MB, 1136x920)
1.5 MB
1.5 MB PNG
the man on the left is holding a bottle of champagne with his left hand. The yellow character on the right is wearing a white baseball cap saying "KARL LOST" and has a stack of 100 dollar bills in their right hand. keep his expression the same.

kek, memes will never be the same after AI
>>
File: 1736805694697117.png (1.49 MB, 1136x920)
1.49 MB
1.49 MB PNG
>>106739025
>>
>>106738757
That's the case for so many models, but BFL in particular has a hard on on pushing censored models for some reason.
>>
>>106739059
I wonder how much better one of these big models would be if actual captioned nudity/sofcore/lingerie and underwear/porn datasets were added
>>
>>106738910
beg harder fag
>>
>>106739076
the architectures are infinitively better than SDXL, and you see what it does when trained well. the issue is nobody has a spare megacluster, with proper settings, and a massive quality controlled dataset to do that with.
>>
File: 1745342650580400.png (1.32 MB, 1360x768)
1.32 MB
1.32 MB PNG
the swedish man in image2 is handing the man in image1 a large bag of money with a dollar sign on it, in a retro style arcade. keep the expression of the man in image1 and image2 the same.

im so glad I can just use nodes and reference them (image2/image3) cause it makes prompting with multi input so much simpler now.
>>
>>106739130
I think computer will be cheap enough at some point, but my issue is the "quality controlled dataset" as I don't think it exists much for porn.
Except maybe for whatever openai did with dalle3 back then, using an uncensored captioning model.
>>
File: 1729326810636113.png (1.4 MB, 1360x768)
1.4 MB
1.4 MB PNG
kek

the swedish man in image2 is kneeling in front of the man in image1 sitting on a royal throne, as several large bags of money with a dollar sign on it rest beside the throne, in a retro style arcade. keep the expression of the man in image1 and image2 the same.

if you say keep expression the same it retains the original face.
>>
>>106738910
KYS
>>
>>106739161
*compute
>>
File: 1735345585075084.png (1.39 MB, 1360x768)
1.39 MB
1.39 MB PNG
>>106739162
>>
>>106739182
what's with the hyperfixation
>>
>>106739191
nothing it's a funny story and and excuse to make some billy memes.
>>
>>106739191
hi Karl
>>
>>106739196
at least have some babes around for eye candy
>>
File: 1750856092794829.png (1.37 MB, 1360x768)
1.37 MB
1.37 MB PNG
it's good to be the king.
>>
>>106739235
and all it took was image1: billy image, image2: karl image, and this prompt:

the swedish man in image2 is on his knees beside the man in image1 sitting on a royal throne, as several large bags of money with a dollar sign on it rest beside the throne, in a retro style arcade. the man in image1 has a royal crown on his head. keep the expression of the man in image1 and image2 the same.

now, you can take any image, real or anime, and make changes or edits in seconds. it's so useful. this would take ages to do with controlnets or inpainting otherwise.
>>
>>106738316
cool tortoise
>>
still no new wan lightvx i2v out
no wan nunchaku either
>>
File: ComfyUI_18827.png (2.06 MB, 1920x800)
2.06 MB
2.06 MB PNG
>>106737886
Where are you getting $400 3090s? I could put together a nice little LLM machine at those prices.
>>
>>106736685
sell your 5090 right now
it is completely lost on you
please, give it to a home that will care for it
before you give it ramrot
>>
>>106736034
What's the best site for getting LORA's of real people / tv shows now?
>>
>>106739318
>wan lightvx i2v
they dont believe the new one is dogshit, it's not coming.
>>
>>106739470
They said they will release a new one anon.
>>
File: 1754759918884210.png (984 KB, 1360x768)
984 KB
984 KB PNG
>>
>>106739481
when, exactly?
>>
File: file.png (45 KB, 1082x552)
45 KB
45 KB PNG
>>106739513
>>
when ready
>>106739587
>>106739587
>>106739587
>>
>>106739591
I'll never be ready!!
>>
I'm ready
>>
>>106741398
Took you long enough!
>>
>>106741407
My apologies anon
>>
>>106739135
Nice, now it actually starts looking like Jobst



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.