[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


Janitor applications are now being accepted. Click here to apply.


[Advertise on 4chan]


Sometime It's Too Much Edition

Discussion and development of local image and video models and UI

Prev: >>106580216

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2122326
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
>>106585705
finally, a good collage untainted
>>
WHERE NUNCHAKU QWEN LORA FIX
WHERE NUNCHAKU WAN2.2
>>
So, which Chroma Model is the go-to model now? 2k? Base? HD? Flash?
>>
>>106585620
Chroma
prodigy
batch 1
150 epochs
~60 pics dataset
A second lora I trained this way but I bumped up the resolution from 512 to 640 this time.
>>
>>106585767
Base has better fine detail but 2K handles higher mpx count better. I use 2K for gens and Base for second pass.
>>
Blessed thread of frenship
>>
>>106585724
Previous was untainted at least
>>
Qwen SRPO waiting room
>>
File: ComfyUI_09878_.png (1.46 MB, 1200x1200)
1.46 MB
1.46 MB PNG
>>
why are wanchads abandoning /ldg/?
>>
>>106585912
They'll be back once the nunchaku version is released
>>
>>106585912
I'm out of ideas.
>>
she nun on my chaku
>>
>>106585727
this so much this
>>106585842
this so much this
>>106585934
this so much this
>>
I hate the muttjeets shilling all their bananas or sneedeam shit in local generals, both are not different of I get with a good workflow for Chroma or Qwen
>>
File: ComfyUI_06926_.jpg (889 KB, 2048x2048)
889 KB
889 KB JPG
>>106585767
2k is not done yet so I go to Flash all the time (this is a mix I use for 2k)
>>
>>106584840
>>106584921
Rescale is more difficult for anon to identify as snake oil because it's aesthetic not performance based. There's a reason it's not discussed anymore.
>>
>>106586194
>(this is a mix I use for 2k)
Is this the 2k weights with the Flash delta applied?
>>
File: ComfyUI_09892_.png (2.19 MB, 1312x1312)
2.19 MB
2.19 MB PNG
>>
>>106586220
HD (v50) with Flash delta weights applied
>>
>>106586194
Do you run second pass/upscale or are you rawdogging this resolution?
>>
>>106586269
ads used to look good
>>
>>106585470
what are your gen times?
>>
>>106586277
Anyone have a workflow or even a screenshot of how they set up a second pass ?

Not sure what it means, you run the same seed on the generated image at a low denoise, or ?
>>
File: ComfyUI_09894_.png (1.73 MB, 1312x1312)
1.73 MB
1.73 MB PNG
>>106586293
Indeed
>>
>>106586316
https://comfyanonymous.github.io/ComfyUI_examples/2_pass_txt2img/
>>
File: comfy1451.jpg (2.41 MB, 2048x2048)
2.41 MB
2.41 MB JPG
>>
>>106586358
Thank you my man!
>>
>>106586371
very nice
>>
>>106586194
Mind sharing that image's catbox?
>>
>>106586277
>second pass/upscale
No, this is the workflow that I use to mix them
https://files.catbox.moe/xh9gv2.png

No second pass. It is quite fast (1:51 per gen, about as fast as a regular Chroma HD gen at 1024 for 30 steps).
>>
Can anyone share a working set of lightning i2v lora for wan2.2?
The default (2.2 i2v lightning at 1 strength for both high and low) is shit, and while I found t2v tricks like reusing wan2.1 loras alongside 2.2 ones to make it better and playing with weights, I don't know about i2v.
>>
>>106586421
I just use the AIO build, it's faster than base 2.2 5b and offers a much higher quality
https://huggingface.co/Phr00t/WAN2.2-14B-Rapid-AllInOne
>>
>>106586371
>>106586401
>>106565868
>>
File: ComfyUI_06930_.jpg (865 KB, 2048x2048)
865 KB
865 KB JPG
>>106586410
Sure
https://files.catbox.moe/dwwfc5.png
Keep in mind I default to 8 steps and may take more sometimes to fix imperfections (same as previous Chromas). Though I have had to do that way less with HD, with the mix at 2k it does mess up fingers a bit more but is easy to fix.
>>
File: 1757870032101457.jpg (23 KB, 201x240)
23 KB
23 KB JPG
>>106586479
>>
>>106586493
no multiple choice?
>>
>>106586447
I use the 14B, so not for me.
Thanks anyway anon.
>>
>>106586417
Any reason you use SD clip with padding removel?
>>
Cooking a pepe jonasan lora for Chroma, attempted to do both natural language description and booru tags within each text file to see if it works better with chorma,
>>
File: 00166-2898845303.png (2.04 MB, 1824x1248)
2.04 MB
2.04 MB PNG
>>
>>106586642
>and booru tags within each text file to see if it works better with chorma,
Spoiler alert: it won't.
Don't try training Chroma to make images with booru tags in non-natural language, it ain't gonna happen
>>
>>106586642
I haven't done both in the same training, but booru style tags work just as well as natural language for Chroma in my tests, despite the model being trained exclusively on Gemini captions AFAIK
>>
File: img_00021_.png (2.35 MB, 1088x1344)
2.35 MB
2.35 MB PNG
>>
File: ComfyUI_06934_.jpg (723 KB, 2048x2048)
723 KB
723 KB JPG
>>106586592
No idea, for some reason the Chroma one gives me an error "ChromaPaddingRemoval
'attention_mask' " which I have never bothered to fix
>>
>>106586687
I agree mine worked well which is why I want to mix it now
>>106586672
It worked well for me so I'm mixing both it was easy to setup with joycaption
>>
>>106586753
>joycaption
nigga, why using that shit at all when a free gemini api exists?
>>
>>106586762
Why the fuck would I use non local to caption porn when I can easily run the model at max performance?
>>
>>106586762
>free gemini api
try to make it caption nsfw stuff
>>
>>106586762
>>106565868
>>
>>106586767
because joycaption is subpar compared to the cloud models?
And gemini can caption images featuring nudity as long as it isn't hardcore porn and doesn't feature lolis

>>106586773
Works fine on my end
>>
>>106586780
booru tagging is not a fine art and it has done a great job describing the images. You would only do that if your system couldn't run joycaption at max performance.
>>
>>106586780
>isn't hardcore porn and doesn't feature lolis
So it's useless
>>
>>106586642
Of all the gens you made, there isn't a single one that caught my attention, much less that I couldn't do with SDXL and a text editor in a third of the time it takes you (5min according to previous threads). But at the same time you're a damn namefag so I wish you the worst. Therefore keep training Chroma, keep wasting time on that, and experiment with Booru tags.
>>
File: ComfyUI_00255_.png (3.8 MB, 1088x1344)
3.8 MB
3.8 MB PNG
>>106586742
>>
>>106586790
I'm sorry that my existence and learning how to work with loras while learning chroma hurts you
>>
>>106586809
Seedream?
>>
File: ComfyUI_00243_.png (2.71 MB, 1088x1344)
2.71 MB
2.71 MB PNG
>>106586827
Good old Flux
>>
File: ComfyUI_09916_.png (1.73 MB, 1328x1152)
1.73 MB
1.73 MB PNG
>>
File: ComfyUI_temp_dqavv_00004_.png (3.59 MB, 1728x1344)
3.59 MB
3.59 MB PNG
>>106586642
why are you always bloging posting you dirty avatarfag, you're like 3 years behind lora traning, stfu
>>
API nodes status?
>>
>>106586858
He wants to be like Ani
>>
>>106586858
Can you define a avatar because I don't avatar post. Shit I leave this general for months. Also your hands are mangled.
>>
File: ComfyUI_temp_arbiz_00007_.png (3.49 MB, 1824x1248)
3.49 MB
3.49 MB PNG
https://files.catbox.moe/mjfg3t.png
>>
File: ComfyUI_00268_.png (3.18 MB, 1088x1344)
3.18 MB
3.18 MB PNG
>>106586839
>>
File: ComfyUI_temp_arbiz_00003_.png (2.54 MB, 1248x1824)
2.54 MB
2.54 MB PNG
https://files.catbox.moe/baueij.png
>>
>>106586824
Trying things out for yourself and experimenting is illegal, anon
>>
Charlie kirk assassination video but big knockers
>>
>>106586642
Imagine being such a namefag that you train garbage loras in a garbage model just to avoid admitting you failed at making anything decent. Peak attention seeking behavior, absolutely retarded.
>>
>>106586873
What crimes has this roastie committed?
>>
File: ComfyUI_temp_lcqvx_00043_.png (3.89 MB, 1824x1248)
3.89 MB
3.89 MB PNG
https://files.catbox.moe/ulqww0.png
>>
File: 00276-1803632336.png (3.36 MB, 2480x2048)
3.36 MB
3.36 MB PNG
>>106586893
It seems that way,
I want to deprive the schizo so I'm going to post again once the lora is done
>>106586899
>>
>>106586902
Playing league
>>
>106586869
KYS nobody needs you here, you are trash
>>
>>106586902
slopposting
>>
File: ComfyUI_00276_.png (3.2 MB, 1088x1344)
3.2 MB
3.2 MB PNG
>>106586887
>>
>>106586907
You will never be Ani
>>
>>106586858
prompt?
>>
>>106586869
everybody knows you're ran, aka another annoying avatarfag, you keep posting the same gens over and over, now you finally learn how to train a lora and for some reason you wanna share that info with us, you're always blog posting whatever new stuff you're into, like when you bought a 5090 and you said you were going to be a menace, what happened with that? I remember.
If you wanna blog post so much, you better create a pixiv/X account and share your progress with your followers, not here, we don't care and its annoying
>>
>>106586938
1girl, pink leotard, large breasts, on one leg, backrooms, vhs style
>>
File: ComfyUI_temp_vusjp_00149_.png (2.79 MB, 1824x1248)
2.79 MB
2.79 MB PNG
>>106586902
homicide with a 3d printed gun (this gen is a homage to the Luigi perp walk photos)

https://files.catbox.moe/mrhq12.png
>>
>>106586873
>>106586905
Man, how could they have shat the bed so hard with Qwen-img when Wan (from the same company and possibly the same team) is so good?

We were so fucking close to making it to paradise if only they didn't completely destroy the model with 4o slop during post-training...
>>
File: ComfyUI_00259_.png (3.04 MB, 1088x1344)
3.04 MB
3.04 MB PNG
>>106586959
damn cool!
I'm a sucker for retro anime. Is this chroma?
>>
File: ComfyUI_temp_vusjp_00118_.png (2.57 MB, 1824x1248)
2.57 MB
2.57 MB PNG
>>106586962

I agree with you there. I use Wan for its realism and Chroma for porn, when the Wan porn loras can't follow my prompt correctly

Have you seen tencent/SRPO btw? It unslops Flux, basically

https://files.catbox.moe/jtrxzh.png
>>
>>106586972
Nigga, he literally posted the catbox...
>>
>>106586959
>>106586980
I just saw it! Wan huh, nice! Thanks
>>
>the perfect UI doesn't exi-
>>
File: ComfyUI_temp_vusjp_00147_.png (2.56 MB, 1824x1248)
2.56 MB
2.56 MB PNG
>>106586972
Wan, using the Goldenboy lora
https://files.catbox.moe/eijfo1.png
>>
>>106586987
nice cars nigga
>>
>>106586951
thx
>>
>>106586978
>Have you seen tencent/SRPO btw? It unslops Flux, basically
I have, and I am waiting for a hero to train that on Qwen
>>
File: collage.png (3.94 MB, 3611x768)
3.94 MB
3.94 MB PNG
>>106587009
me too
----
henlo 'puter frens, i need assistance on prompt engineering.
Try as I might, I can't replicate the leftmost image, my best successes are the middle and rightmost images of the collage.
The catboxes with the configuration and prompts are below. Any tips or hints here? tks in advance!
https://files.catbox.moe/ug3nlx.png
https://files.catbox.moe/xjqcyv.png
>>
hey all, anyone have any info on a decent nudify workflow? i've been using qwen and flux kontext but im still getting pretty shitty results. also looking for any discord's or decent forums where i can get help/feedback?
>>
>>106587059
Feed the youtube screenshot to gemini?
>>
>>106587071
just use Wan and any of the clothes stripping loras
>>
>>106587072
Hmm, good idea. I'll try that out. But I prefer using FOSS models for that, and using text-to-image. I want to pick the ... how should I put this? Chaotic? Improvised? Unstaged? composition/blocking/mood of the original image for other pictures as well. If it could be reliably generated via a text prompt, that would be ideal
>>
>>106586780
It can caption hardcore porn
>>
>>106586790
This level of seething...

Mental illness doesn't even begin to describe it
>>
>>106586856
>punished maid / godzilla crossover
Has this never been done ? Wasted opportunity if not
>>
>>106586767
Because who the fuck cares?? Google already knows you’re a degenerate. You’re being paranoid
>>
>>106587095
im trying to do images specifically, does wan do images? what are the best loras/where can i find them?
>>
>>106587152
Go away, jew

Seek your shekels elsewhere
>>
>>106586856
Love this image, good stuff!

>>106587163
Ask Wan to do a one-frame "video", and you turned it into an img genner. Pick any of the catboxes I posted (like this one >>106586988) for a Wan T2I flow. It includes a 2x upscaler pass as well
>>
>>106587163
I was talking about videos. Base Wan can do nipples and butts just fine, and nudifying with a video model since it has temporal+spatial awareness and things like breast sizes etc will be more accurate
>>
>>106583457
>>106585705
Based
>>
>>106586642
Please share your good quality gens in /adt/ too. We're trying to clean up the general from trolls.
We also need more people who don’t use SDXL.

>>106587093 See?
We’re trying to start fresh and keep things good here. Please give us another opportunity.
>>
>>106587184
awesome! thank you for the info, is there anywhere i can get more info on this workflow? i'm sort of a hack and pretty new to this
>>
File: 44145124512545.jpg (607 KB, 3576x1352)
607 KB
607 KB JPG
>>106586744
>>106586592
So I tried the default Comfy workflow that has the padding stuff. Turns out that it messes up the output (introduces fuzziness). To me, this retardation explains why Chroma hasn't taken off yet to civit/Plebbit normies. They are using a broken official workflow. So they conclude the model can only make shitty pics.

This is what comfy says on the workflow
>min_padding 1 is supposed to be the official way to inference chroma but I think the results are better with min_padding 0

But even that has fuzziness.

I can reproduce the fuzziness across every picture on official comfy workflow E.G.
https://files.catbox.moe/txieok.png

Here are the two comparison workflows
https://files.catbox.moe/obeqbv.png
https://files.catbox.moe/z9txlj.png

Not sure if anyone here has reported on this issue before, or maybe it's a Flash only issue.
>>
>>106587186
nudify a video model? is there anywhere i can get more info on this as well?
>>
Seedream 4 local when
>>
>>
whats the best nodepack for joycaption? I just want to run it on a dir and produce txt files to go along with my dataset :)
>>
File: Qwan_00003_.jpg (916 KB, 1984x2976)
916 KB
916 KB JPG
>All this negativity
Good vlñes only, Anon. Good vlñes only.
>>
>>106587224
I used a workflow from the Mystic NSFW Lora

It's a bit hard to wrap your head around it at first due to the subgraphs and the fact that ComfyUI does not signal where the error is if it comes from a node inside a graph

For example: changing the address of the Lora gives an error on the image subgraph (can't post a screenshot since the machine I'm on doesn't have CUI installed), in the LoraStackerAdv node. You need to click the circle to expand and then correct the lora's address. I set its strength to zero so I wouldn't need to bother changing Loras in two places at once, and it doesn't seem to affect final img quality, prompt following, etc at all, not sure why it's there

Also added a img preview node so that the workflow submenu would display the generated imgs thumbnails, and have a more convenient way to save imgs

https://civitai.com/models/1295758?modelVersionId=2149217
>>
>>106587234
Why no neg prompt?
>>
File: 2loras_test__00088_.png (1.51 MB, 1024x1024)
1.51 MB
1.51 MB PNG
>>
>>
>>106587234
Hmm, I thought it was a me issue when I stumbled upon this exact same issue, using ChromaHDV10. Got it a touch better using heun + beta scheduler. res_2s + bong_tangent works decently as well

https://files.catbox.moe/eem0ca.png
>>
File: ComfyUI_09922_.png (1.84 MB, 1024x1024)
1.84 MB
1.84 MB PNG
I am doing some tests with Flux SRPO + NAG (for negative promtps). It does unslop Flux a lot. Might completely replace Chroma for my non-nsfw use cases, especially if I can train Loras on it.

I sure hope someone does SRPO training on Qwen, then we'll be in heaven.
>>
Does anyone knows how to bypass the safety guidelines in Gemini when it comes to creating the 3D figurines pictures that I've seen going around?
>>
File: eval loss.png (86 KB, 867x613)
86 KB
86 KB PNG
Do I just wait until it starts to increase again or what? Is it supposed to flatline?
>>
File: ComfyUI_temp_vusjp_00215_.png (2.64 MB, 1824x1248)
2.64 MB
2.64 MB PNG
>>106587335
Could you kindly share your workflow? Or is it just replacing Flux with SRPO on the CheckpointLoader and fingers crossed your GPU has the VRAM to take it?

https://files.catbox.moe/fj6vs4.png
>>
>>106586269
>>106586341
hot. box?
>>
>>106587234
>>106587319
Yeah, nvm, seems to have been a beta scheduler issue
>>
>>106587353
Here:
https://files.catbox.moe/ljzpd9.png
>>
>>106587395
thanks, fren
>>
File: ComfyUI_temp_tpjvl_00013_.png (3.08 MB, 1152x1152)
3.08 MB
3.08 MB PNG
>>106587234
>>
File: ComfyUI_06948_.png (1.63 MB, 1152x1152)
1.63 MB
1.63 MB PNG
>>106587301
HD Flash doesn't accept one
>>
fuck kohya ss
>>
>>
File: ComfyUI_00044_.png (1.47 MB, 1024x1024)
1.47 MB
1.47 MB PNG
>>
File: comfy12351.jpg (2.4 MB, 2048x2048)
2.4 MB
2.4 MB JPG
>>
What is the current state of NSFW generation from the big companies? Iirc Open AI said they would allow some nsfw stuff, has that happened? Is stable diffusion the only good nsfw still?

Have any of these companies realized all people want to do is make porn with it?
>>
File: ComfyUI_06949_.png (1.68 MB, 1152x1152)
1.68 MB
1.68 MB PNG
>>106587414
Prompt:
>A beautiful Korean idol woman is taking a selfie, flashing a peace sign and a bright smile. She has shoulder-length brown hair and is wearing a white t-shirt. In the background, a screen shows her performing on stage, wearing a crop top and a skirt. The crowd behind her is filled with fans holding up their phones, capturing the moment. The atmosphere is lively and celebratory, with her joyful expression reflecting the excitement of the event.

Pic rel:
>A beautiful Korean idol woman is taking a selfie, flashing a peace sign and a bright smile. She has shoulder-length brown hair and is wearing a white t-shirt. In the background, a screen shows her performing on stage, wearing a crop top and a skirt. The crowd behind her is facing away from the woman, only their backs visible, filled with fans holding up their phones, capturing the moment. The atmosphere is lively and celebratory, with her joyful expression reflecting the excitement of the event.

Chroma just needs some prompt engineering to fix itself kek
>>
File: ComfyUI_00052_.png (1.66 MB, 1024x1024)
1.66 MB
1.66 MB PNG
>>
I am not sure how to handle multiview character sheets in a dataset. should I split them all up?
what about expression sheets, that show head shots of a character making like 10 different expressions?
>>
>>
File: sushicat_raw_00001_.png (2.49 MB, 1088x1632)
2.49 MB
2.49 MB PNG
>>106587319
why chroma?
>>
>>106587447
>What is the current state of NSFW generation from the big companies?
lol
>>
I enjoy Chroma.
>>
>>106587341
It has nothing left to learn
>>
File: ComfyUI_00247_.mp4 (563 KB, 832x480)
563 KB
563 KB MP4
Prompt:
>>
File: ComfyUI_01690_.png (1.56 MB, 1216x832)
1.56 MB
1.56 MB PNG
>>106587447
WAN + Porn Loras have great anatomy, but it has that "stock photography" look, which can be corrected with creative lighting loras, but those can sometimes break WAN's otherwise perfect human anatomy

Otherwise, Chroma is best for porn, no need for Loras, and better prompt adherence too

old flux workflow
https://files.catbox.moe/r7p9oz.png
>>
File: ComfyUI_00248_.mp4 (579 KB, 832x480)
579 KB
579 KB MP4
>>106587448
Pic rel:
>>
>>106587447
As for paid companies, I don't think they'll go through it, too risky
>>
im gunna traaaaaiiinnn
>>
is qwen still broken with sage attention? i haven't pulled in a long time
>>
>>106587482
Porn out of the box, basically. Also, It's the only one that can do a penis correctly
>>
File: ComfyUI_06952_.png (1.73 MB, 1152x1152)
1.73 MB
1.73 MB PNG
>>106587447
>Iirc Open AI said they would allow some nsfw stuff, has that happened?

It's just a grift to attract local users to their API slop. That will never happen as long as there are ways to prompt for someone that resembles a real person or a child. I'd be impossible because celebs are used in their dataset. 4o is very hard to jailbreak. Jailbroken Dalle is unironically still more uncensored than 4o.

>Is stable diffusion the only good nsfw still?

No, Chroma happened anon (pic rel). It's uncensored Dalle.
>>
>>106587480
Nine 海楼石, anon
>>
>>
File: ComfyUI_06953_.png (1.82 MB, 1152x1152)
1.82 MB
1.82 MB PNG
>>
>>106587545
>Porn out of the box, basically. Also, It's the only one that can do a penis correctly

valid point
>>
>>106587547
flash? what settings are you using?
>>
>>
>>106587560
box onegai?
>>
File: meido_00001_.png (2.81 MB, 1088x1632)
2.81 MB
2.81 MB PNG
>>106587335
>>
File: comfy01222.jpg (1.95 MB, 2048x2048)
1.95 MB
1.95 MB JPG
>>
File: samples.jpg (928 KB, 3070x1024)
928 KB
928 KB JPG
Man... This shit (Flux SRPO) is basically Chroma minus mangled anatomy (and minus NSFW). If loras have a good effect on it, then Chroma is officially dead to me. And this is coming from one of Chroma's biggest shills ITT

Someone must apply SRPO to Qwen ASAP
>>
>>106587234
>So I tried the default Comfy workflow that has the padding stuff. Turns out that it messes up the output (introduces fuzziness). To me, this retardation explains why Chroma hasn't taken off yet to civit/Plebbit normies. They are using a broken official workflow. So they conclude the model can only make shitty pics.
I don't know why lodestone doesn't care about that, the fix is here, all Comfy has to do is to merge this shit
https://github.com/comfyanonymous/ComfyUI/pull/7965
>>
>>106587316
Dinner is served
>>
>>106587616
Redpill me on qwen, I've only used it (the edit model) once and told it "see this image? add some text right THERE"
It didn't really work too well, is the base model better for pure t2i? What about lora variety for characters? 2D?
>>
File: ComfyUI_06959_.png (1.44 MB, 1152x1152)
1.44 MB
1.44 MB PNG
>>106587564
Yeah, Flash
>>106587571
https://files.catbox.moe/g6iq7z.png
>>
File: 23x23_00002_.png (2.03 MB, 1088x1632)
2.03 MB
2.03 MB PNG
>>106587432
>>
>>106587616
Isn't the entire point of Chroma the NSFW?
>>
>>
File: ComfyUI_09929_.png (1.5 MB, 1024x1024)
1.5 MB
1.5 MB PNG
>>106587647
Qwen is the largest and theoretically the most powerful open T2I model, and its prompt alignment is really good.
The problem is that Alibaba completely shat the bed by fine-tuning the model on gpt4o slop and they obliterated the model's ability to make diverse images and photorealism.
>>
>>106587701
NSFW is the only reason why the mankind did not extinct yet
>>
File: 1752896380869552.jpg (32 KB, 702x720)
32 KB
32 KB JPG
I want to make a lora of a realistic person, and then use it with illustrious.

I've created images of anime characters with style loras and I can get the character with another style, and it's cool. But I don't know how to make the same with people (example, one of the JD Vance memes with Akira Toyama style). Whenever I try to make a lors I get generic brown hair anime guy or the character doesn't translates into the style.
>>
>>106587714
OpenAI should have never released GPT, you're telling me that in addition to slopping the shit out of text gen, it's also ruined image gen?
>>
>>106587735
I could use flux but I've heard that it is not compatible with sdxl models.
>>
>>106587314
this is the stuff I'd like to visit in 20 years when you will be able to get into generated images like a game
>>
>>106585470
was looking forward to these, thanks!
>>
>>106586293
it's not just ads, office approved wardrobe was that before everyone decided suits and skirts were bad in the west for some reason
sad but at least I can gen hot OLs locally
>>
File: ComfyUI_09932_.png (1.53 MB, 1024x1024)
1.53 MB
1.53 MB PNG
>>106587701
For many people in this thread (myself included), it was mostly about its ability to do photorealism and "unslopped" stuff without any particular bias. Once I can do that with other models without the mangled anatomy, I am never looking back.

(once again, picrel is Flux SRPO with NAG and a bunch of negative prompts)
>>
after using both qwen and chroma i can confidently say that i like neither of them



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.