[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Perpetual Melty Edition

Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>106950276

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
Blessed thread of frenship
>>
File: IMG_2370.png (695 KB, 678x907)
695 KB
695 KB PNG
>>106952799
me
>>
>>106952799
what a terrible quality collage
>>
I wonder if ran has debo nudes as well
>>
>>106952799
>crying_github_trani.jpg
well done, i expect an uber melty now lmao
>>
>>106952848
probably made a shrine to ani and debo
>>
>troll general
>most of the users hate you
>get triggered into threatening cops
>will still get made fun of
>>106952857
Part of me wishes it was debo but the obsession is something only ani could do
>>
i'm scared
will trani send muttoids to my country now for using StableDiffusion(tm) to animate some juliens?
>>
Let's say I have a prompt that says e.g. "blah blah blah woman in a {blue|green|red} dress"

If I run this overnight, the additional time to load the CLIP model and remake the conditioning 300 times significantly slows down my genning. Ideally this should only need to be done 3 times, if conditioning objects are not too enormous to save and load.

Are there any nodes for ComfyUI that can help me do something like this? Let's say I had multiple wildcards of 3-4 choices each, let's say total of 144 possibilities, and I'd like to randomly generate let's say 20 of these, create all the conditioning files, and then just load one randomly each gen.

I know that for some kinds of generating with controlnets and other constraints this is much more difficult, but I am just thinking of regular Chroma gens.
>>
>>106952907
He's going to try some lolsuit on the wrong person at the rate he's going.
>>
>>106952907
>i'm scared
i'm not last thread just showed me how much anons are pussies these days
>>
>>106952936
no
>>
File: NetaYumeV35_Output_534451.png (2.52 MB, 1280x1536)
2.52 MB
2.52 MB PNG
>>
i can't train wan2.2 loras on civit
wtf did i farm all this buzz for
>>
>>106953168
Civitai ruined itself
>>
>>106953168
>how do we fix Civitai?
by having better base models
>>
>>106953168
make a new civit with hookers and blackjack. I don't get being cheerleaders for people who are fucking awful at their job
>>
>>106953071
Netashill, how's the work shift going? How much do they pay you per hour?

Also, where can I discuss local models in an impartial, free, and objective way without being constantly shilled to death by people like him?
I want an image board, not an AI trade show where all the devs are trying to sell me their products.
>>
File: 1235215.png (1.97 MB, 1432x774)
1.97 MB
1.97 MB PNG
>>106953184
>>106953187
Sorry for that, there was some naked booba and I didnt want to get banned
>>
File: 00028-3441419356.png (2.18 MB, 1024x1280)
2.18 MB
2.18 MB PNG
>>
File: 12.gif (671 KB, 245x175)
671 KB
671 KB GIF
>>106953206
>illustrious, illustrious, illustrious, pony, illustrious, illustrious, noobai, pony, illustrious, illustrious...
>>
>>106953168
>>106953206
you will gen masterpiece 1girl cleft chin american physiognomy awful semi-real style 1216x832 and you will be happy
>>
so then where do i go?
anybody else feel the void in the ai space?
>>
File: 396.png (209 KB, 774x564)
209 KB
209 KB PNG
>>106953272
>>
File: ComfyUI_06302_.png (1.38 MB, 1072x968)
1.38 MB
1.38 MB PNG
>>
File: ComfyUI_06303_.png (1.15 MB, 800x1304)
1.15 MB
1.15 MB PNG
>>
File: ComfyUI_06305_.png (1.24 MB, 1224x848)
1.24 MB
1.24 MB PNG
>>
They tell me this is the blessed thread. Is that true?
>>
File: ComfyUI_06306_.png (1.48 MB, 1024x1024)
1.48 MB
1.48 MB PNG
>>
>>106953359
least retarded chroma pic
>>
File: ComfyUI_06307_.png (1009 KB, 864x1208)
1009 KB
1009 KB PNG
>>
File: ComfyUI_00068_.png (1.1 MB, 1024x1088)
1.1 MB
1.1 MB PNG
>>
File: ComfyUI_06311_.png (1.27 MB, 856x1216)
1.27 MB
1.27 MB PNG
>>106953403
its actually qwen
>>
>>106953423
my brain had an automatic reaction just seeing the thumbnail
>>
File: wan22___0062.png (1.73 MB, 832x1216)
1.73 MB
1.73 MB PNG
>>
File: NetaYumeV35_Output_675343.png (1.87 MB, 1280x1536)
1.87 MB
1.87 MB PNG
>>
File: ComfyUI_07530_.png (1.73 MB, 1152x1152)
1.73 MB
1.73 MB PNG
A model is either good at realism, or it's good at anime. It can't be good at both. Chroma proves that more than anything (not that an anime specific tune couldn't make it good, but it's probably outside of most budgets).
>>
>>106953429
you dont mean you popped a boner? r-right?
>>
>>106953204
here I made this one just for you
>>
>>106953479
I felt my penos shoot into my chest from pure revulsion
>>
>>106953457
chroma proves that its good at nothing in particular, maybe understanding prompts and text insertion, but thats it
>>
>>106953204
>Discussion of Free and Open Source Text-to-Image/Video Models
>>
File: ComfyUI_07536_.png (2.1 MB, 1152x1152)
2.1 MB
2.1 MB PNG
>>106953457
Though I do wonder if Chroma training kept on going at a large scale, would it eventually learn proper anime artists? Who knows.
>>
File: ComfyUI_00069_.png (621 KB, 1024x514)
621 KB
621 KB PNG
>>106953495
damn, its good that i cant post the others without getting banned then
>>
File: ComfyUI_00001_.mp4 (608 KB, 480x832)
608 KB
608 KB MP4
>>106953437
>>
>>106953457
a finetune could be good at both if it was like a ~30 to 35 million image dataset used minimum
>>
>>106952936
Ask LLMs to write your own, clip output is a tiny tensor.
>>
File: ComfyUI_00002_.mp4 (819 KB, 480x832)
819 KB
819 KB MP4
>>106953457
>>
File: ComfyUI_07539_.png (2.66 MB, 1152x1152)
2.66 MB
2.66 MB PNG
>>106953507
Oh, you know, like the only model that properly understands both what photos look like (which even the latest API models still struggle to) and in terms of open source, where the objects go in relation to each other, proper spatial awareness, as well as coherence regardless of how simple or complex your prompt is.
>>
File: wan22___0001.png (1.55 MB, 832x1216)
1.55 MB
1.55 MB PNG
>>106953535
noice
>>
>>106953545
I've been using a little node I made with a multiline string input and a conditioning input that evals the string to poke around in a typical conditioning object, looks like the structure is a list with one entry, which is a list with a ~2mb tensor and a dict which looks like {"pooled_output": None} lol

So I guess saving and loading an object like this would be pretty trivial
>>
i just trained a wan 2.2 lora on 768x768 which turned out fine. would there be any problems training a lora on images at 640x928? that's native bucket resolution that works well for Chroma loras, but i'm unsure what happens when try to train portrait images at a specific resolution like that for wan.
>>
>>106953583
nice toes lmao
>>
>>106953583
>that properly understands both what photos look like
pic unrelated?
>>
>>106953583
man there are so many mistakes in this image, makes you fucking wonder how chroma users can write COPE such as what you wrote. fucking unbelievable
>>
File: ComfyUI_07541_.png (2.28 MB, 1152x1152)
2.28 MB
2.28 MB PNG
>>106953583
Of course, as far uncensored txt2img models are concerned. It's still better than Qwen for even mundane softcore tasks.
>>
File: pag2.png (1.59 MB, 1024x1024)
1.59 MB
1.59 MB PNG
>>106953583
im not knocking on your efforts, your pics are nice, but still, the more you look at any chroma pic, the more it falls apart
>>
File: wan22___0010.png (1.71 MB, 720x1280)
1.71 MB
1.71 MB PNG
>>106953595
i don't even crop or caption my images for lora training anymore
>>
File: file.png (2.26 MB, 1328x1328)
2.26 MB
2.26 MB PNG
>>
>>106953683
lol'd
>>
File: 00052-4030035914.png (1.6 MB, 1024x1280)
1.6 MB
1.6 MB PNG
>>
>>106953679

after doing like hundreds of loras i can tell you not captioning leads to better results
>>
File: ComfyUI_00066_.png (1.8 MB, 1024x1072)
1.8 MB
1.8 MB PNG
>>106953683
i literally spinned like 20 spiderman pics in chroma and it didnt do proper hands even once, this is the closest it got
>>
>>106953709
but if you dont caption you cant really mix'n'match loras for doing 2girls :(
>>
File: carlos aislop.png (353 KB, 600x600)
353 KB
353 KB PNG
>>106953720
>gen black spiderman
>it doesn't work
woah who could've seen that coming!
>>
File: chroma_flux__0034.png (1.07 MB, 832x1216)
1.07 MB
1.07 MB PNG
>>106953693
>>
File: ComfyUI_07544_.png (1.74 MB, 1152x1152)
1.74 MB
1.74 MB PNG
>>106953602
>>106953636
>There are tiny mistakes that can be inpainted on an image that is my first try and where I'm not being specific by enhancing the prompt with LLM, just prompting

>Amateur photograph, a beautiful young Japanese idol woman with short pink hair on a swing outdoors, with her bare feet in the air


Cope all you want. Chroma filters you.
>>
I bet he shat himself and passed out

8 more hours before the other drunk schizo wakes up...
>>
>>106953423
i hate that i know what this image references
>>
File: chromapig.png (186 KB, 350x225)
186 KB
186 KB PNG
>uuuahh american piggu chwoma filturs yuu~

chroma has soul, like an old 1.5 checkpoint i used to run circa late 2023 lmao
>>
>>106953679
>>106953709
i've never tried without captions, but i've been curious. also if i want to capture a specific hairstyle, outfit or makeup it's good to be able to refer to them with something. however cropping to include what you want it to learn is obviously super important and you wont convince me otherwise.
>>
For all the problems that local models have at least they can produce output that isn't the color of piss
>>
what if i make a trailer using some of my gens and grok imagine. that would be fun
>>
File: ComfyUI_07545_.png (2.21 MB, 1152x1152)
2.21 MB
2.21 MB PNG
>>106953551
>Owwww
>>
>>106953734
wanna bet Illustrious does it better?
>>
File: ostrich.png (1.2 MB, 899x897)
1.2 MB
1.2 MB PNG
>>106953753
is that an ostrich? where is her pelvis and hips
>>
File: NetaYumeV35_Output_536465.png (2.53 MB, 1280x1536)
2.53 MB
2.53 MB PNG
NetaYume can be pushed back into something close to realism way more easily than e.g. Illustrious can, at least. I'm not specifically sure it was actually trained on some amount of photo data beyond whatever existed in base Lumina 2.0, hard to tell.
>>
>>106953816
the skin under the fishnet gives me the heebie-jeebies
>>
has anyone trained a netayume lora yet? how2do?
>>
>>106953860
installing ai toolkit as we speak to do exactly this, dunno what style i'll train yet but probably disney renaissance.
https://github.com/ostris/ai-toolkit?tab=readme-ov-file

>if someone wants to spoonfeed a good dataset this is the best time to do it because its gonna take me forever
>>
>>106953721
Yeah not captioning means it just does whatever it wants in a way you cannot control at all beyond adjusting strength, I've never understood why people think it's useful to train that way. Also cropping was never something that was necessary in any trainer with proper autobucketing either.
>>
>>106952799
hello /g/, is there a prompt that can remove that sort of AI-face and skin complexion that people get, so far I've put in the negative-prompt, "artificial-looking", but maybe there's a better phrase for it, it looks kind of like 2010 era popular deviantart digital paintings, I want to get this woman with more of a hand-drawn, hand-painted feeling and the sort of digital concept thing going on, I've seen it on cheap AI ads is just throwing a wrench in it, thank you
>>
>>106953877
srry i dont have my sets handy but godspeed anon and i look forward to reading your updates along the way
ill have to wrangle aitoolkit to work on my 50 series later and follow in your footsteps
>>
File: 1749268315677453.png (331 KB, 499x429)
331 KB
331 KB PNG
>>106953860
literally why?
>>
You wouldn't understand.
>>
>>106953923
Why not?
>>
File: dmmg_0008.png (1.34 MB, 896x1152)
1.34 MB
1.34 MB PNG
>>106953794
if i am training for an overall style, or body type, i don't bother captioning. if it's something the model doesn't already know like a novel hairstyle, i'll name it so i can reference it.

i only crop if i'm making a lora to apply in a detailer, i find otherwise it makes no sense out of context
>>
>>106953966
He lacks the skill to use it, probably.
>>
>>
>>106954008
butiful lightx2v slowmo
>>
>>106953903
I've tried synthetic-looking, it seems a bit better..
>>
0/10
>>
File: wanvideo__00001_.mp4 (711 KB, 720x1280)
711 KB
711 KB MP4
i thought that whole lightning slowmo thing was fixed now
>>
>>106954225
nope. same shit with the new one
>>
>>106954225
What page did you use to animate it?
>>
>>106954263
what page? the fuck u saying to me?
>>
>>106954263
BROWN
>>
I think the pictures are cleaner, the videos of the Lakers girls kissing the obese guy is funny, but there's a kind of romanticism in the image. Just like when you're a kid and you saw some big tits in a magazine or on a search engine.
>>
File: wanvideo__00002_.mp4 (817 KB, 720x1280)
817 KB
817 KB MP4
>>106954225
>>
>>106954225
according to the turbo autist who's completely clueless. jokes on you if you believed him
>>
>>106954225
The only "fix" is to disable lightxv's lora for the high noise phase. 6 steps high noise, 3.5cfg. 4 steps low noise, lighx2v, 1cfg. No slomo.
>>
>>106954300
>wait 10 minutes for a gen
im good
>>
>>106954304
Enjoy your slomo, reduced prompt adherence and deadened motion then, I guess.
>>
>>106954300
still shift 5?
>>
>>106954304
>wait 3 minutes for a shit gen
>delete
>wait 3 minutes for a shit gen
>delete
>wait 3 minutes for a shit gen
>delete
>>
>>106954319
5-8, arguable whether there's much of a difference
>>
>>106954287
>>106954225
at what point are you mfs just better off watching some normal porn?
>>
>>106954360
stan, y u so mad, try to understand, that i do want u as a fan
>>
>>106954360
I know, I'm into AI because I want to make surreal porn.
>>
File: 1753521099700725.mp4 (1.12 MB, 720x1008)
1.12 MB
1.12 MB MP4
>>106953428
>>
File: dmmg_0022.png (1.59 MB, 896x1152)
1.59 MB
1.59 MB PNG
>>106954360
it is my constitutional right to manufacture women with bad hands and absolute dump trucks
>>
File: 1731643934405715.mp4 (3.9 MB, 2048x786)
3.9 MB
3.9 MB MP4
babe wake up, the Krea fags have finetuned Wan
https://huggingface.co/krea/krea-realtime-video
>>
>>106954398
maybe as a big breast lover I'm just that unsophisticated but I like women with R-cups, wouldn't you want her to look more like a rare occurance?
>>
>>106954447
>distilled
who cares
>>
>>106954447
>wan 2.1
https://www.youtube.com/shorts/0vOLxSU6QlY
>>
>>106954447
>real time video
who gives a fuck? I want quality first, not a turd that can be rendered quickly
>>
>still no training code for Ovi
DOA
>>
File: dmmg_0138.png (1.45 MB, 832x1216)
1.45 MB
1.45 MB PNG
>>106954452
i'm just here to gen, i don't fap to this shit
>>
File: ComfyUI_06312_.png (1.01 MB, 824x1264)
1.01 MB
1.01 MB PNG
>>
>>106954615
catbox?
>>
>>106954551
nice, I can respect that
>>
>>106954633
its qwen with the lora to turn drawings into cosplay
>>
File: ComfyUI_00509_.mp4 (1.02 MB, 1024x1024)
1.02 MB
1.02 MB MP4
>>
>>106953545
>>106952936
>>106953588
Quick update on this: I made the node and it works. ComfyUI remains undefeated.

Did a quick test comparing gen times when pre-genning the conditionings, which takes about 0.5s per conditioning (so you could prepare 80 variants in about 45 seconds, adding a few seconds to load the CLIP model) vs. genning the conditioning as part of the image workflow.

I ran off 5 gens with each method, not counting the first one for obvious reasons, and on average,
Regular method: 46.85s
My new method: 37.03s

Granted the gains will be less significant on longer gens (I usually do 30-40 steps, not 18), but I think this was worth the effort.

Some people with a better VRAM situation may be able to keep their text model loaded at all times, in which case the gains might be negligible (or negative). But in my case this is a big improvement.
>>
reminder to try your chroma workflows with the deis sampler instead, its good
>>
>>106954762
that's cool. I used chatgpt to modify crystools so it showed vram usage in GB instead of a percent
>>
>>106954447
>realtime
>on b200
>>
File: ComfyUI_00510_.mp4 (1.57 MB, 720x1280)
1.57 MB
1.57 MB MP4
>>
>>106954796
What scheduler is it meant to be used with? Fails to denoise correctly with DDIM uniform (shows large influence of input image with 1.0 denoising)
>>
>>106954860
deis simple 20 steps 3 cfg chroma hd bf16
>>
File: what.gif (3.15 MB, 388x224)
3.15 MB
3.15 MB GIF
is there a reason i can't just train lumina using a .safetensors in ai toolkit? why do i need all this huggingface shit? do i really need to download the entire original lumina image 2.0 repo verbatim to train a lora?


were the FUCK is onetrainer when i need it
>>
>>106954868
I don't think you should be recommending CFG numbers devoid of context. It's very much dependent on prompt and image dimensions. E.g. 3.0 is way too high for my current workflow, in which I'm using 1.9
>>
File: ComfyUI_07558_.png (1.92 MB, 1152x1152)
1.92 MB
1.92 MB PNG
Seems like 3D. Though the model is a better candidate than SDXL for models like bigASP or Lustify (though now they have Chroma). But if modelmakers want to train realism at 2B, they should target the best possible model for that size. Tuning SDXL at all right now is a waste of time.
>>
>>106953842
>Seems like 3D. Though the model is a better candidate than SDXL for models like bigASP or Lustify (though now they have Chroma). But if modelmakers want to train realism at 2B, they should target the best possible model for that size. Tuning SDXL at all right now is a waste of time.
>>106954918
>>
>>106954883
>were the FUCK is onetrainer when i need it
i like how they still havent added wan but have hunyuan
>>
>>106954868
Simple works, thanks.
>>
File: 00013-2012533364.png (2.35 MB, 1248x1824)
2.35 MB
2.35 MB PNG
>>106953206
crody model's are barely an improvement from one another iteration. There really does seem to be hard limit of what sdxl can visual achieve in 3d,3dcg and hyper anime- realism department.
>>
File: ComfyUI_00008_.mp4 (1.36 MB, 640x832)
1.36 MB
1.36 MB MP4
>>106954918
>>
>>106954937
I assume it's probably a lot a matter of like, just how Neta Lumina was captioned and then NetaYume Lumina was captioned in the process of being trained over base Lumina 2.0, combined with the better text encoders and it not being distilled, that lead to it not "erasing" quite as much of the original Lumina 2.0 realism knowledge.
>>
File: 1754391068851921.png (1.26 MB, 1208x856)
1.26 MB
1.26 MB PNG
>>
>>106955040
ha
>>
File: 00055-3775134980.png (1.2 MB, 1344x1728)
1.2 MB
1.2 MB PNG
>>
>>106954883
Use the "SD3" branch of Kohya, with this PR:
https://github.com/kohya-ss/sd-scripts/pull/2225
>>
File: dmmg_0039.png (1.13 MB, 896x1152)
1.13 MB
1.13 MB PNG
>>
File: 00020-3561054976.png (2.38 MB, 1248x1824)
2.38 MB
2.38 MB PNG
>>
File: 1757397508485765.png (1.73 MB, 768x1280)
1.73 MB
1.73 MB PNG
chroma version 33 with deis simple 20 steps, some rng thats needed aside, no model matches chroma for realism thats not contrast boosted ten times over
>>
>>106955137
This is like V1 BigASP tier lmao, not even V2
>>
>>106954447
>was thinking it would be cool for some kind of flux or chroma video model of some sort
>this drops

We must think harder bros, if we think it, it will come to pass
>>
>>106955147
>look up bigASP
>uses ponyv6 style score tags
LMAO
>>
>>106955067
literally my cat looking at the fish tank
heh
>>
>>106955137
>one arm is vascular
>the other is bloated
>>
File: ComfyUI_00015_.mp4 (633 KB, 640x832)
633 KB
633 KB MP4
>>106955067
>Prompt executed in 136.90 seconds
Meow
>>
File: berkut cosplay.png (917 KB, 872x1200)
917 KB
917 KB PNG
>>
>>106955159
What does that have to do with anything
>>
File: 1758345780723944.png (1.18 MB, 968x1072)
1.18 MB
1.18 MB PNG
>>
>>106955175
Gives me insight to how shit the model is without me having to actually use it.
>>
>>106955169
vascularity drops when you elevate the angle of your arm against gravity
>>
File: 00061-3901493713.png (926 KB, 1024x1024)
926 KB
926 KB PNG
>>
>>106955185
BigASP v2 and v2.5 are literally the largest scale photo finetunes of SDXL ever done, by a lot, at like 6 million and 13 million images
>>
>>106955233
When will he train on an architecture that isn't a dead end?
>>
File: NetaYumeV35_Output_663675.png (3.14 MB, 1280x1536)
3.14 MB
3.14 MB PNG
Yeah I guess just boomer prompting really brings out the realism in NetaYume pretty easily

`You are an assistant designed to generate images based on textual prompts. <Prompt Start> a digital photograph of a woman approximately 25 years of age, with long, wavy, golden-brown hair spread out around her head, lying on her back in a field of green grass dotted with numerous small white and yellow wildflowers. Her head is positioned towards the top of the frame. Her left hand is resting gently by her side, while her right hand is slightly raised, with her fingers curled loosely. She is wearing a simple, strappy, knee-length white dress with three buttons down the front of the bodice. Her legs are slightly bent at the knees and crossed at the ankles, with her right leg overlapping her left. Her bare feet are visible towards the bottom of the frame. The grass is lush and vibrant, and the small flowers are scattered evenly throughout the frame, with some larger green leaves also visible in the bottom left corner. This is an overhead shot of the woman.`
>>
>>106955251
I think he wanted to do Chroma or Wan 5B next
>>
File: dmmg_0050.png (1.17 MB, 896x1152)
1.17 MB
1.17 MB PNG
thanks to the anon for the krea tips, those plus a lora are doing some heavy lifting
>>
>>106955273
You people have developed some really fucking retarded styles of prompting.
>>
>>106955285
It's just a Gemini caption output for an image that already existed, with the standard Gemma boilerplate for NetaYume in front
>>
File: ComfyUI_00018_.mp4 (955 KB, 640x832)
955 KB
955 KB MP4
>>106955114
Are her hands re-attached at the wrists?
>>
File: image_00002_.jpg (296 KB, 984x1192)
296 KB
296 KB JPG
>>106955137
res multistep + beta with small upscale gives pretty decent results
>>
File: file.png (2 KB, 157x36)
2 KB
2 KB PNG
anyway to get groks imagine locally?
>>
>>106955315
hack into their systems and steal the weights
>>
>>106955315
nope
>>
>>106955315
If you buy 10 cybertrucks Elon personally delivers you the weights on a USB
>>
File: ComfyUI_07560_.png (2.23 MB, 1152x1152)
2.23 MB
2.23 MB PNG
>>106955036
If that's what the original is capable of then seems heavily slopped unfortunately, but still a good candidate for 2B model realism tune.

>>106955137
>>106955310
When I prompted HD I would use res multistep/35 steps (minimum of 30), or heun whenever I needed better results at the cost of speed (still very RNG). HD Flash only needs 8 steps though and it achieves superior results to either setup
>>
>>106955285
What do you mean "you people"?
>>
>>106955358
nthusiasts
>>
File: QwenEdit_00166_.png (1.15 MB, 1080x968)
1.15 MB
1.15 MB PNG
what am I even doing anymore
>>
>look at wf of a mid image
>like 20 snakeoil loras used
Surely you don't do this, Anon?
>>
>alright teto, where did you hide the goofs?
>>
File: dmmg_0045.png (1.5 MB, 896x1152)
1.5 MB
1.5 MB PNG
>>106955308
honestly that would be a sick tattoo
>>
>>106955394
sexo declared
>>
Why are they renaming Chroma to streak?
>>
File: image_00009_.jpg (335 KB, 984x1192)
335 KB
335 KB JPG
>>106955345
I need my negative prompt
>>
>>106955422
just NAG it bro
>>
File: 00034-2907251700.png (2.65 MB, 1824x1248)
2.65 MB
2.65 MB PNG
>>
File: ComfyUI_07567_.png (1.88 MB, 1152x1152)
1.88 MB
1.88 MB PNG
>>106955422
There's NAG for Chroma, though I don't need negs most of the time so I haven't tried it.
>>
File: 1739962764103146.jpg (381 KB, 832x1216)
381 KB
381 KB JPG
>>
File: ComfyUI_06334_.png (1.3 MB, 784x1336)
1.3 MB
1.3 MB PNG
>>
File: 1732546905910078.mp4 (737 KB, 720x1040)
737 KB
737 KB MP4
>>106954986
>>
File: 00037-2137800070.png (2.41 MB, 1248x1824)
2.41 MB
2.41 MB PNG
>>
File: ComfyUI_06345_.png (1.19 MB, 784x1336)
1.19 MB
1.19 MB PNG
>>
>>106954447
>extreme slow motion
why? even my 2.1 has normal movements. and i'm vramlet
>>
>>106955137
I'm legit disgusted some guys find this shit attractive, what the fuck.
>>
File: QwenEdit_00169_.png (821 KB, 1368x760)
821 KB
821 KB PNG
>>
File: ComfyUI_00019_.mp4 (1.72 MB, 640x640)
1.72 MB
1.72 MB MP4
>>106955345
^^

>>106955431
nta, I have grid noise issues with Chroma NAG but probably skill issue

>>106955394
>reattached hand
>stick tattoo
Agreed

>Prompt executed in 90.81 seconds
>>
File: image_00015_.jpg (667 KB, 1672x1240)
667 KB
667 KB JPG
>>106955451
damn I guess I have to try it
>>
>>106955507
post your hot ai woman
>inb4 no reply
many such cases
>>
>>106955532
Any other woman posted in this thread is better than that shit. Atleast they look natural, god damn anon. Get some help.
>>
>>106955507
it would look attractive with those tits or pillow lips wrapped around your cock
>>
>>106955552
No it wouldn't. Body dysmorphia is not hot.
>>
>>106955548
>no image
as predicted, concession accepted, nogen
>>
File: 00044-2138738231.png (2.14 MB, 1824x1248)
2.14 MB
2.14 MB PNG
>>
File: ComfyUI_07584_.png (2 MB, 1152x1152)
2 MB
2 MB PNG
>>106955522
Meant to be in Tokyo
>>
>>106955566
I'm not pausing my current gen and creating something for (You) just to appease your mental illness. You're free to think you won all you want. I hope you get the help you need.
>>
>>106955564
Sure but her body dysmorphia is not my problem.
>>
File: image_00017_.jpg (648 KB, 1672x1240)
648 KB
648 KB JPG
>>
File: ComfyUI_00024_.mp4 (1.35 MB, 832x640)
1.35 MB
1.35 MB MP4
>>106955531
^^

>>106955579
>Tokyo
Oops, I put "city"
>>
>>106955552
>>106955564
>Body dysmorphia
lol, a fag that thinks every mature slightly plumper woman with big lips is literally disgusting while probably only genning the most generic tranime 1girls imaginable that hes too embarassed to even post lol
>>106955581
>implying that he doesnt have even a singular already generated image of a woman he would say is attractive
i do love it when absolute retards with the lowest iq opinions imaginable prove so obviously that they are absolute retards
>>
>>106954762
Couldn't you pre-calculate different conditioning vectors and then simply mix them together in a kind of wildcard system using Conditioning Concat?
So instead of calculating each sentence individually, you could plan atomic sentence parts, so to speak, and then concat these vectors as you wish, like different hair colors, etc.
>>
it's amazing what difference these threads experience depending on if the dev larper is here or not
just noticing, ignore me
>>
File: QwenEdit_00173_.png (894 KB, 1344x776)
894 KB
894 KB PNG
qwen qwen qwen
fun with qwen!
>>
File: image_00019_.jpg (801 KB, 1672x1240)
801 KB
801 KB JPG
>>106955597
lol
>>
File: ComfyUI_07585_.png (1.89 MB, 1152x1152)
1.89 MB
1.89 MB PNG
>>106955531
On some occasions, Chroma HD (v50) has better prompt following, especially with multiple subjects, etc... But for almost every type of prompt HD Flash will look better first try because it fixes fine details. As for multiple subjects and prompt following, that can be somewhat remediated by mixing the two models and prompting at 2k
https://files.catbox.moe/pg1c1o.png
>>
comfy should be dragged out on the street and shot
>>
miku should be dragged out on the street and shot
>>
>>106955678
based
>>
Local is a joke, but at least it’s a funny one
>>
File: QwenEdit_00176_.png (1.14 MB, 1000x1048)
1.14 MB
1.14 MB PNG
BILLIONS
>>
>>106955384
comfyui encourages snake oils. it's THE snake oil UI.
>>
>>106953508
Yeah, but this space has become commercialized. I'm noticing inorganic behavioral patterns that regular hobbyists don't exhibit. Don't you find it suspicious when someone posts 6-8 hours daily, every single day, but only posts images of Neta Lumina via filename? And that's their only form of engagement with the community?
>>
Are Sora gens allowed here?
>>
File: 1738889162509720.mp4 (674 KB, 720x1216)
674 KB
674 KB MP4
>>106955505
>>
>>106955748
The autistic adherence to one single pose makes me think otherwise. It's not a great strategy to shill your model.
>>
>>106955765
Sure, if you've hacked into ClosedAi's servers and got the weights that you just published on a torrent whose magnet link will be in your next reply, but sure
>>
>>106955781
I have the weights but im not sharing them. Too many browns itt
>>
reminder: sam will give us the 18+ update in december
>>
File: image_00023_.jpg (662 KB, 1240x1672)
662 KB
662 KB JPG
>>106955724
must smile
>>
File: ComfyUI_temp_skscp_00003_.jpg (592 KB, 1432x1432)
592 KB
592 KB JPG
>>106955137
Chroma-unlocked-v41-few-steps is still my GOAT
>>
File: screenshot.1761002422.jpg (162 KB, 854x629)
162 KB
162 KB JPG
Backups are important.

>daily local backups to my HDD using restic
>loras/models/output/input/comfy all backed up
>daily remote backups to my 500TB server
>Additional windows scheduled backup archiving the entire directory of comfy in tar.bz2 format
>server has RAID redundancy + snapshots
>desktop has mirror HDD to backup my main HDD incase it fails
>Considering using AWS to keep cloud backups incase of catastrophic family(house fire/etC)

I haven't lost data in over 20 years. I am the backup GOD.
>>
>>106955678
no
migu :(
>>
>>106955663
Quite interesting but vram hungry workflow
>>
>>106955869
impressive, very nice, now lets see your qbitorrent stats and how much of those TB is rare content permaseeded online
>>
>>106955869
>cloud backups
>not cold storage
>>
This Mortal Coil is temporary and thus so is my Digital Carapace
>>
>>106955914
My 500TB server has a unit in the rack that only powers on once a month to do backups, so I believe that counts as cold storage. Still, eventually I will need cloud backups incase a tornado or something destroys everything. Hosting that much data in the cloud will be expensive though so I don't know if I'll go through with it.
>>
>>106955945
but the kino asian 1girl gens can live forever
>>
>>106955913
I lost interest in private trackers/torrenting community in around 2014ish, so I don't perma-seed anymore.
>>
File: ComfyUI_00513_.mp4 (1.55 MB, 720x1280)
1.55 MB
1.55 MB MP4
>>
>>106955869
fbi gets warrant, treasure trove of cp most likely
>>
bros what's the best model for T2I lewds? basically never done anything outside of anime so i'm clueless as to what's good for realistic gens
>>
>>106955953
calculate out tape storage placed in some other location irl you can store things at vs amazon glacier or alternatives
also obviously encrypt things before storing online
>>106955979
i never said anything about private trackers, torrents are still generally the best and most popular way to share larger files and are the most likely place to find old or obscure things compared to any other file hosters, sharing useful data online is one of the most important things one can do while at the same time being basically free
>>
>>106955987
chroma for image gen nothing else comes close.
>>
>>106955771
why only 2 seconds long?
>>
>>106956027
what variant
>>
File: ComfyUI_00514_.mp4 (2.32 MB, 720x1280)
2.32 MB
2.32 MB MP4
>>
>>106956035
i'm finding chroma DC2k to be better than chroma-HD, but that's just me. For training loras, definitely use chroma-HD though.
>>
>>106955853
more breasty milfs please
>>
File: ComfyUI_00515_.mp4 (2.55 MB, 720x1280)
2.55 MB
2.55 MB MP4
>>
File: ComfyUI_00027_.mp4 (2.18 MB, 1280x1280)
2.18 MB
2.18 MB MP4
>>106955663
^^

>>106956040
Chroma has what we knead
>>
File: ComfyUI_07587_.png (2.14 MB, 1152x1152)
2.14 MB
2.14 MB PNG
>>106955907
Yes, it's based on official comfy workflow for adding 2 models, but I unfortunately have never been able to merge the 2, which is why I seldom use that workflow in particular. Though merging them properly is probably not too hard by looking at the code.
>>
>>106956074
The bench is both cursed and zen
>>
>>106956033
Was quickly testing first frame last frame looping with two different color match nodes using the gens from here but there is still an obvious seam on the loop without VACE.
>>
>>106956091
New BlackRock anti-homeless design being imported in Japan.
>>
https://github.com/1038lab/ComfyUI-QwenVL

Time to run hundreds of hot images through the abliterated version of qwen 3 vl model and then gen multiple variations with multiple styles with multiple models for each created prompt
>>
>>106955824
the one that will let us write erotica? wow, big whoop
>>
File: image_00036_.jpg (490 KB, 1240x1672)
490 KB
490 KB JPG
>massive gravity defying hyper-fat-ass
>>
>>106956246
Bass boosted brap
>>
>>106955824
>us
No, they're letting the API models write porn. Nothing for us.
>>
>>106956127
i'd rather use llama.cpp
>>
>>106953071
gives off louise vibes
>>
>>106954300
1 step high without the light lora
3 steps high with high lora
4 steps low with low lora (nvidia rcm if i2v)

Works most of the time and fast.
>>
>>106955869
>my 500TB server
How much is that?
>>
File: 1731994836720541.mp4 (1.97 MB, 720x960)
1.97 MB
1.97 MB MP4
>>106956246
>>
>>106956664
catbox?
>>
>>106956685
nta but it's just wan first last frame 2 video
>>
>>106956705
so wan does hyper bouncy asses out of the box?
>>
>>106956710
No, it's a lora.
>>
>>106956714
What one?
>>
Time to DIFFUSE
>>
>>106956720
Probably "slop twerk".
>>
I'm a DIFFUSER. ama
>>
new wan open source release coming never
>>
>>106956727
the first to songbloom is king of USA.

Oh yeah, I'm songblooming.

Rules: Has to say something that sono etc would ban
>>
>>106956804
>it's not a music diffuser
*yawns*
>>
>>106956817
>native audio
LITERALLY is
>>
>>106956820
CAN YOU TURN OFF THE STUPID VIDEO
>>
*summons the spirit of video*
*stabs it to death*
DIE DIE DIE
>>
>literally just one guy in the thread
This place really has gone to shit, huh?
>>
>>106955358
ncels who ai image gen
>>
>>106955597
Kek, catbox?
>>
File: 1734792637249251.mp4 (2.19 MB, 720x960)
2.19 MB
2.19 MB MP4
>>106956664
Damn color shift is really noticeable when using RifleXRope in the last 5 frames.
>>106956685
What this anon said >>106956705 with this LoRA >>106956727
>>
Jesus Christ, I'm eating.

VOMIT
>>
https://www.reddit.com/r/StableDiffusion/comments/1obws1z/invokeai_was_just_acquired_by_adobe/
Lmaooo, Comfy must have the day of his life here
>>
songbloom likes to skip lyrics, and it's very annoying. I'll work on it. This is nice, though:

https://files.catbox.moe/9w0a06.mp3
>>
so for the WAN2.2 models, do the quantified models degrade in animation quality as you go down the scale? like does the full unquantified model produce more fluid, dynamic, and realistic smooth motion over the entire image than, say, a Q4 model? I'm trying out some quantified models as I can't run the full ones and the animation I'm seeing is all pretty stiff, with usually only 1 part of the image being animated, like it's being tweened in flash rather than a fully animated scene.
>>
>>106956992
Q8 is the absolute bare minimum you should use. Even q6 is way worse.
>>
File: ComfyUI_00036_.mp4 (1.7 MB, 1280x1664)
1.7 MB
1.7 MB MP4
>>106956894
https://files.catbox.moe/wm4tp9.mp4

>>106956963
F

Re-rolling this one.. so close
>>
>>106956998
I can barely gen on Q5 :(
>>
why does photoreal gens itt look laminated?
>>
File: ComfyUI_00038_.mp4 (2.1 MB, 1280x1664)
2.1 MB
2.1 MB MP4
Is catbox down?

Extra smokey
>>
File: screenshot.1761011513.jpg (237 KB, 816x811)
237 KB
237 KB JPG
>>106956963
I mean it couldn't compete with ComfyUI anyway, so that makes sense. Imagine being that retarded anon shilling Invoke a couple of weeks ago. What a dumbass.
>>
>>106956992
>do the quantified models degrade in animation quality as you go down the scale
Yes.

>like does the full unquantified model produce more fluid, dynamic, and realistic smooth motion over the entire image than, say, a Q4 model?
Yes.

>>106957023
IMO, If you can't run Q8, either stick to API nodes or wait until you have money to buy a real GPU.
>>
>>106957089
comfyorg plans to sell off as well. who's going to make a UI after that?
>>
>>106957012
>>106957060
vomit. I'm eating.
>>
>>106957060
Litterbox works but regular catbox is fucked rn with video.
>>
>>106957114
Let's spread a rumor that comfyui is antisemitic. It won't sell :^)
>>
>>106957114
as much as I don't"t want to believe that it's exactly the kind of thing to expect in a year or two. we need something else
>>
>>106957114
>comfyorg plans to sell off as well
Do you have a source to back this claim, Anon? Why would they sell off when they're getting plenty of money from investors and growing rapidly. They are the top UI right now, and selling off to a large company would only result in a net loss for the comfy team. That would be foolish.
>>
Yeah catbox isn't working with mp3 either
>>
>>106957127
let me guess, ani is supposed to be the savior with his shitty cpp project right
>>
>>106957138
https://www.reddit.com/user/comfyanonymous/
just check his replies he says he doesn't plan to at all, he posts on reddit now not here.
>>
>>106957138
Cmon on now anon, it is for profit. Do you think it's gonna end well? (Cough, cough, ClosedAI, cough, cough)
>>
>>106957144
if ani has a grift chink as CEO I would write him off as well but it's not the case. we will see what happens in a year but I wouldn't get optimistic about comfyorg with the way they conduct themselves
>>
File: ComfyUI_00039_.mp4 (3.75 MB, 1280x1280)
3.75 MB
3.75 MB MP4
>>106957119
Wouldn't suppose vomit tastes very nice

>>106957124
It worked for long enough to upload it https://litter.catbox.moe/13pqlz80e41jsb8l.mp4
>>
The 5070, that's like a 4090, right?
>>
A111
dead

Forge
assassinated

reForge
suicide

InvokeAI
defected

AniStudio
forever on a drunken rage against comfy
>>
>>106957150
comfy is not the majority shareholder. the grift chink is. anything comfy says about company direction is not in his control
>>
>>106957160
I just mean VOMIT.

Like put on some CLOTHES, bitch.
>>
>>106956856
>he doesnt filter the retard who posts a million times per thread
ISHYGDDT
>>
>>106957169
you forgot:
ComfyUI
enshitified
>>
File: ComfyUI_00519_.mp4 (954 KB, 720x1280)
954 KB
954 KB MP4
>>106956710
here's wan with no jiggly loras
>>
>>106957170
either way, the moment they sell out is the moment another ui will take their place. it's as simple as that. there are plenty of devs waiting for comfy to die anyway so there will be alternatives.
>>
>>106957186
open sourcing really saved this model because holy censorship
>>
File: 1739776399512581.jpg (55 KB, 656x627)
55 KB
55 KB JPG
>>106957169
What's the lore on the creator of AniStudio hating Comfy?
>>
>>106957169
Very soon

>Comfy
>defected
>>
>>106957138
>Do you have a source to back this claim, Anon
why would a source matter?
>>
>>106957186
honestly this is better than without the lora. i think you need to lower the strength because that lora jiggle is so unrealistic
>>
>>106957196
Why are you gay?
>>
>>106957195
what's censored?
>>
>>106957169
it was all orchestrated by comfyanon, he doesn't want competition
>>
>>106957200
I need the company that lied about many things before to tell me everything will be ok
>>
>>106957203
i didn't do the jiggle lora.. i just ran the same pic on wan to show what it looks like without any jiggle loras
>>
>>106957114
>comfyorg plans to sell off as well.
I highly doubt that, he made 17 millions out of investments, he doesn't need to be bought to survive
>>
>>106957196
ani tried to convince comfy to get rid of the grift chink but comfy likes grifting for money so he didn't
>>
comfyanon sent death squads to competitors
>>
>>106957190
yeah anistudio is starting to look better by the day
>>
baker bake baker bake
bakers bake
bake me a bake as bake as you bake
>>
>>106957224
if the dev team wants an early retirement and doesn't feel motivated to work on it anymore it's always a possibility.
>>
>>106957212
anything sexual or sensual. it has no clue what jiggly titties are
>>
>>106957224
he just wants money. if a company offered for 200 mil he'd do it
>>
comfy is slowing down your gens on purpose if you don't use api
>>
imagine taking the death of another UI and using that to shill and promote fucking anistudio

this guy is fucking cancer.
>>
>>106957233
oh.. well yeah.. but do any of them really? i dunno. it's not their primary focus when generating models for one thing, but sure maybe they do censor that stuff too, but it can be fixed
>>
>>106957230
most already seemed burnt out
>>
>>106957225
we don't deserve ani
>>
>>106957233
>jiggly titties
Well it can do them but it's incidental (a girl jumps), you can't ask for them.
>>
>>106957244
lots of SaaS models do, kling is pretty notable for it. they're just shackled down like dogs so you can hardly get it out of them
>>
>>106957244
>but do any of them really
You'd have to go full BFL puritan to ban any bounce, it's naturally present in videos.
>>
File: 1760983557042631.mp4 (1.89 MB, 720x1280)
1.89 MB
1.89 MB MP4
>>106957244
>oh.. well yeah.. but do any of them really?

Jiggly titties shouldn't be a rarity, they can happen naturally even when the model isn't focused on it.
>>
>>106957247
>burnt out
>they are creating an entire OS for comfy
>posting more than ever now
??
>>
>>106957265
they are prepping to be bought by a huge company, this is why they log all prompts secretly
>>
>>106957268
mind showing me the exact code where they do this anon. its open source after all.
im sure you didn't just make it up.
>>
>>106957259
>full BFL puritan
man it would be funny if they ever made a video model, where suddenly physics completely breaks down and breasts are like ps1 bricks
>>
>>106957265
ComfyUI gets basically daily commits and merges too. idk what this Democrat-style fake narrative takedown is, but it's 2nd hand embarrassing
>>
>>
>>106957277
just the usual anti-comfy shitposting by the usual suspect. instead of working on their own project they spend all day shitposting in /ldg/ thinking it'll make a difference. a moron if you will
>>
>>106957260
Yeah but a lot of models don't know what it is, they just know they should bounce, at least I'm pretty sure most captions don't have this description.
Nothing a lora or finetune can't solve though.
>>
File: 00109-766801950.png (1.16 MB, 1432x744)
1.16 MB
1.16 MB PNG
>>
>>106957281
SDXL cannot do this
>>
>>106957281
based
>>
>>106957281
I'm garbage blind. There is nothing in that picture.
>>
>>106955748
I'm a person who isn't that guy you're talking about but also posts images with Neta in the filename. Like the dragon girl one in the collage was me, not him. The speculative realistic gens earlier were me, not him. The BLACKED joke one just now was me, not him. etc.
>>
>>106957275
Imagine using compute to "de-giggle" bouncing breasts lol.
>>
>>106957308
if have to understand anon, everybody here is one particular schizo so take your pick
>>
This stupid website needs mp3s
>>
Just stop posting
>>
>>106957317
i usually like to go as debo#2 or 3 myself
>>
File: vampire.jpg (299 KB, 1280x1754)
299 KB
299 KB JPG
>>
>>106957314
puritans know no bounds to their depravity
>>
Okay I'll bake
>>
File: ComfyUI_00042_.mp4 (2.92 MB, 832x832)
2.92 MB
2.92 MB MP4
Baker save us
>>
File: ComfyUI_00043_.mp4 (2.82 MB, 832x832)
2.82 MB
2.82 MB MP4
>>
>>106957360
no sound
>>
>>106957326
Is it good? Post to vocaroo
>>
new
>>106957370
>>106957370
>>106957370
>>106957370
>>
File: ComfyUI_00520_.mp4 (1.02 MB, 720x1280)
1.02 MB
1.02 MB MP4
>>106957337
>>
>>106953423
I'm so upset.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.