[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107405841

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe
https://github.com/ostris/ai-toolkit

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://comfyanonymous.github.io/ComfyUI_examples/z_image/t

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
nigbo
i
g
b
o
>>
how's the controlnet?
>>
Comfy must be dragged out onto the streets and shot
>>
tensions seem very high in the tongueass lab coomcord
>>
comfy should be dragged into the sea on a yacht
>>
File: 1761248685434585.gif (1.58 MB, 600x273)
1.58 MB
1.58 MB GIF
>z-image
>the loli is sitting in the man's lap
>the man is holding her stomach
>they are both sweaty
>wan i2v
>gyrating hips lora
UOHHHHH
OHHHHHHHHHHH
>>
File: 1759368799409944.jpg (3.5 MB, 1872x2736)
3.5 MB
3.5 MB JPG
>>
>>107408185
>https://comfyanonymous.github.io/ComfyUI_examples/z_image/t
Need to remove the trailing t, it was added by mistake.
>>
File: deFA_zi_00033_.png (2.5 MB, 1840x1096)
2.5 MB
2.5 MB PNG
>mfw
>>
>>107408217
And raped, coz of the implication
>>
File: AnimateDiff_00001-1.mp4 (3.64 MB, 1072x1072)
3.64 MB
3.64 MB MP4
Haven't tried much realistic wan genning, but it looks real nice.
>>
>>107408224
finally some kino
>>
>>107408229
nice gen
>>
File: 1738039412322066.png (2.22 MB, 1957x1035)
2.22 MB
2.22 MB PNG
>>107408220
>this is worse than Flux2 and QIE
>this single image input needs to stop.
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
So it's just one image input on the edit model? Uh oh...
>>
>Z-Image-Base – The non-distilled foundation model. By releasing this checkpoint, we aim to unlock the full potential for community-driven fine-tuning and custom development.
>By releasing this checkpoint, we aim to unlock the full potential for community-driven fine-tuning and custom development.
>Z-Image-Base
>unlock the full potential for community-driven fine-tuning and custom development.
>
>>
"beautiful woman"

Glorious china.
>>
File: 1746759997379307.png (531 KB, 832x1216)
531 KB
531 KB PNG
https://civitai.com/models/1332651?modelVersionId=2460872
Gonna bring that Midjourney feel
>>
File: Zurbo_00107_.jpg (1.27 MB, 2816x2176)
1.27 MB
1.27 MB JPG
>>107408010
Based shoulderpad enjoyer.
>Messed up the lapels
A shame.
>>
base never (ever)
>>
>>107408010
or it's te/embedding thing
>>
>>
has anyone shared turbo training settings or has furk already locked it behind pateron
>>
File: ComfyUI_00973_.png (2.17 MB, 1072x1920)
2.17 MB
2.17 MB PNG
Seek Christ, anons.
>>
File: ZImg_00211_.png (1.8 MB, 1152x1440)
1.8 MB
1.8 MB PNG
>>
>>107408394
default ostris should work
>>
>>107408394
>https://www.youtube.com/watch?v=Kmve1_jiDpQ
tldw: The defaults, sigmoid if training a character
>>
>>107408257
yeah it's SDXL after all.
on the plus side you can abuse the large canvas size to abuse cramming multiple images in.
>>
File: 00183-3299250893.png (1.31 MB, 1216x832)
1.31 MB
1.31 MB PNG
>>107407694
what were the settings? negative prompts? thanks for sharing :)
>>
>>107408403
>>107408410
defaults suck though i was hoping for a bespoke ldg anon guide
oh well
>>
>>107408394
Is it worth training loras on turbo model? I've been waiting for the base model. I tried few loras from civitai and wasn't impressed at all, like >>107408348
I haven't tried anons 80s fantasy and 2000s camera loras yet
>>
>>107408422
nigga we don't even have a wan guide
>>
>>107408379
> bug face
bruh
>>
File: z-turbo_00017_.png (3.27 MB, 1536x1536)
3.27 MB
3.27 MB PNG
probably a skill issue.

printed on the center of a woman's shirt in a sans-serif font:
"
a b c
d e f g h
i j k l m
n o p q r
s t u v w
x y z
", the letters "l" "d" and "g" are red, the rest are black.
the woman smiles with both thumbs up.
>>
File: 1756557149785292.png (1.51 MB, 1024x1024)
1.51 MB
1.51 MB PNG
>>107408348
That's why I love how good Z-image turbo is at details, you're not obligated to go for some zoom in of humans to make the image look good
>>
>>107408422
For real person training 0.0001 (1e-4) works fine, but if you use any synthetic data, which always trains faster, you should drop down to ~0.00005 (5e-5), this range is likely the best for reasonably complex artstyles (as in not anime)

I use logit normal and flow shifting, but I'm training with Diffusion Pipe, not AI Toolkit
>>
>>107408422
its been like a few days no one is experienced enough
also, how do you know it sucks?
>>
File: Zurbo_00108_.jpg (954 KB, 2816x2176)
954 KB
954 KB JPG
>>107408400
Imagine those claws... no. No!
>>
File: 1737749946438601.png (1.68 MB, 1280x720)
1.68 MB
1.68 MB PNG
>>107408469
>>
>>107408185
ANIME DIFFUSION NEWS!
>Noob Models!
SeeleNoobAI (2048 native resolution): https://civitai.com/models/1445275/seele-noobai-sdxl
Chenkin Noob XL:(NoobAI ESP with new dataset of character)
https://civitai.com/models/2167995/chenkin-noob-xl
WAI Shuffle Noob
https://civitai.com/models/989367/wai-shuffle-noob
>Anime Lora Making Guide!
https://civitai.com/models/22530/guide-make-your-own-loras-easy-and-free
>Model News!
ZiT Zeta Image Turbo Model: 6b model, fast, open source, doesn't understand booru tags.
UIs that supports it: Comfy, Krita AI Diffusion, Neo Forge, Swarm, SD Next
>Anime ZiT LoRas!:
Frieren LoRA
https://civitai.com/models/2176854/frieren-beyond-journeys-end-sousou-no-frieren-z-image-lora
Flat Anime Style:
https://civitai.com/models/2175307/z-image-flatanimestyle
Ra Lilium Style:
https://civitai.com/models/2125529/ra-lilium-style
Nyalia Style:
https://civitai.com/models/2180136/nyalia-style
Anime Flat Style:
https://civitai.com/models/1952560/anime-flat-style
Teto:
https://civitai.com/models/2175612/kasane-teto-z-image-lora
ANIME CHARACTER LORA REQUESTS HERE!
>>
File: 00138-3921217583.png (2.2 MB, 1080x1920)
2.2 MB
2.2 MB PNG
>>
File: 1764460193362884.jpg (2.49 MB, 3072x3072)
2.49 MB
2.49 MB JPG
>>107408448
Majority of Civitai loras for Turbo are just of synthetic slop output from other models, as in retarded.

It trains really well, both people and artstyles, BUT base will undoubtably train better given that it's a model made to be a base for further training.

So you might as well wait unless you are REALLY eager or just likes to experiment, like me.

I trained this Z-Image Turbo lora a bunch of threads ago, picked up the person with no problems:

https://files.catbox.moe/4pfomp.safetensors
>>
>>107408522
no frickin way this is ai
>>
>>107408531
put this shit on civitai omg what's your problem
>>
File: 00061-833700208.png (1.37 MB, 1200x1200)
1.37 MB
1.37 MB PNG
>>
>>107408543
No celebrities allowed on CivitAI dude
>>
please understand sir i need to use the pony outputs for training
>>
>>107408557
oh yeah my b
>>
>>107408229
Good style debo!
>>
>>107408531
who is it (so I can tag it)?
>>
>>107408531
does it train well with 1024 resolution? hows the time vs chroma for example? Have you tried large rank 1gb loras like Emma Watson one you made earlier on zImage?
>>
>>107408577
it looks like cara delevigne, or however the hell you spell it
>>
I hope I made it in time before the release of base.
>>
inpainting on zit still ass?
give me your best z-image inpainting workflows
haloing results are an instant disregard
please and thank you
>>
File: ComfyUI_00997_.jpg (847 KB, 2048x2048)
847 KB
847 KB JPG
>>
File: combined_0129.jpg (634 KB, 4066x2040)
634 KB
634 KB JPG
>>
>>107408593
Why not just rin the gen through SDXL with a denoise of like 0.3 with a detailed controlnet, then do a cycle of upscale and downscale?
>>
z shitmix status? dedistillation status?
>>
>>107408584
Haven't tried 1024 yet, this (Cara Delevigne) was trained at 512 which seems well enough and really fast ~1.25 s/it on a 5060 ti

>large rank 1gb loras like Emma Watson one you made earlier
Wasn't me, I only train people lora at rank 16

This model was trained using Diffusion-Pipe: adamw, LR:1e-4, rank 16, logit normal + flux shifting, 25 images, 100 epochs
>>
>>107408595
laughed
>>
>>107408531
i'm curious how you're captioning. i'm using old captions from chroma with 70-100 tokens, but i've also heard people are training without captions at all.
>>
>>107408595
Based, more happy endings please!
>>
>>107408651
>logit normal + flux shifting
Completely new settings for me. Here we go again.

>Wasn't me, I only train people lora at rank 16
My bad. I hope that dude tries to train that lora on zImage to see if it works
>>
>>107408584
>does it train well with 1024 resolution? hows the time vs chroma for example?
i'd compare it to chroma for 1024. it learns slowly at 1024 but the end results are very good. 6000 steps is where i usually stop, but i'm pretty sure i could go even further for further quality increase.
>>
hey guys base was just released.
>>
psych
>>
>>107408693
>>107408702
you devil you

roguish behaviour
>>
>>107408637
why do I only ever get engagement bot tier replies to this question
you don't need to reply if you have such bad ideas
>>
File: Zurbo_00113_.jpg (978 KB, 3328x1792)
978 KB
978 KB JPG
>>107408693
>>107408702
Shame on you.
>>
>>107408674
For this I only used 'rcng' as caption so basically no caption at all, you don't need rcng in the prompt and none of the example images used it, for a bunch of images of a person it's perfectly fine, the model will easily spot the human being pattern to focus on

For anything more varied, like an artstyle or clothing / photography styles I just use JoyCaption 'Write a long detailed description of this image.' and check that it doesn't hallucinate stuff and if so edit the resulting caption.
>>
https://www.youtube.com/watch?v=iNM5z8cCH8w
>A 600k subscribers youtuber is making the promotion of Z-image turbo
goddam
>>
You should gen more cute maids
>>
>>107408462
Is this what you were looking for, anon?
>>
File: zimg_0014.png (1.6 MB, 1024x1496)
1.6 MB
1.6 MB PNG
you can prompt some pretty raw shit on zim turbo without loras???

https://files.catbox.moe/1vrfjc.png
>>
If they trained it on so many real images why do anons gens look so synthetic?
>>
>>107408723
I need non pepperoni nipples
>>
>>107408729
AI is an image synthetisizer.
>>
File: 1745373692465771.png (1.58 MB, 1280x720)
1.58 MB
1.58 MB PNG
>>
I downloaded all this stuff and I dont know what to do lol
>>
File: 1751125395532578.png (3.03 MB, 1536x1536)
3.03 MB
3.03 MB PNG
zit dark fantasy lora plus one other
>>
File: ComfyUI_01035_.png (3.82 MB, 1488x2192)
3.82 MB
3.82 MB PNG
>zit doesnt seem to know what a vampire is

It's so over.
>>
>>107408681
>logit normal + flux shifting
flux shifting is typically known as timestep shifting or flow shifting, so maybe you're already using it.

I've only seen it referred to as 'flux shifting' in Diffusion-Pipe
>>
>>107408691
Good to know. I'm still gonna wait before training. I have this gut feeling that the base model might be massive and super slow. I hope I'm wrong
>>
File: lol?.png (2.14 MB, 3552x1382)
2.14 MB
2.14 MB PNG
>>107408762
what?
>>
File: 1758648031576236.png (3.04 MB, 1536x1536)
3.04 MB
3.04 MB PNG
>>107408760
that looked more like stark girl actually
>>
>>107408777
Nice teeth, bro.
>>
>>107408762
That's Konoru-chan, not a vampire you baka.
>>
>>107408751
sovl
>>
>>107407500
catbox this, please. I must be fucking something up because even with the bracket shit I never get anything close.
>>
>>107408762
teenage jennifer connelly in compromising positions
>>
>>107408778
>>107408760
What's the prompt for cleavages? Or is it a lora?
Zit either gives me nothing or nipples straight to my face
>>
>>107408801
>She wears a dark indigo velvet robe with a deep center split extending to her navel, revealing her torso and the cleavage between her gigantic breasts
>>
File: 1740738784100607.png (1.25 MB, 1280x720)
1.25 MB
1.25 MB PNG
>>107408780
nothing wrong with the teeth I have no idea what you're talking about :^)
>>
>>107408813
>gigantic breasts
>Still medium at best
Pity
Thanks I'll try that
>>
Anyone able to gen hitler well with zit?
Having trouble getting the hair/mustache right.

I thought it'd be easy with such a public figure
>>
>>107408828
chinese model, that size is gigantic for them
>>
File: 1752701518041713.png (1.52 MB, 1280x720)
1.52 MB
1.52 MB PNG
>>
Z doesn't know Greta Thunberg :(
>>
File: ComfyUI_00020_.png (1.78 MB, 1504x1024)
1.78 MB
1.78 MB PNG
>>107408762
false

genned yesterday. well, this morning.
>>
It knows vampires are jews.

>>107408842
keek
>>
File: that's right.png (3.69 MB, 3840x1183)
3.69 MB
3.69 MB PNG
https://civitai.com/models/833507/apple-quicktake-150-digital-camera-style-zit-qwen-and-flux?modelVersionId=2461241
>sovl -> sovless
>>
>>107408821
Is Brad Pitt still in hospital? I hope he is ok.
>>
>>107408729
Because the training was of photoshopped "real images" of women who have been more plastic than person since entering university.
>>
Every day that passes I see soul being used as a synonym for old more and more
>>
>>107408894
many think old = good you are correct
>>
File: sdxl-biglustydonutmix.jpg (785 KB, 5568x1330)
785 KB
785 KB JPG
>>107408531
tested it in an sdxl checkpoint for science
>>
File: HELP ME ANON.png (1.38 MB, 1280x720)
1.38 MB
1.38 MB PNG
>>107408886
>Is Brad Pitt still in hospital?
he still is :(
>>
>>107408870
>ugly -> beautiful
>>
>>107408944
Damn! I need to donate some money to him...
>>
>>107408940
Huh, it actually picks it up somewhat, interesting
>>
File: 1752289735702750.jpg (1.88 MB, 1664x1216)
1.88 MB
1.88 MB JPG
>>107408593
Here's an example where I turned the girl's pendant into a heart-shaped one.
The workflow: https://files.catbox.moe/rb153f.png
>>
>>107408944
>>
so what happens now
>>
>>107408940
and positive weights
using a forge that doesn't support zit
reducing the strength of the lora like
> <lora:4pfomp-(03a9d4d29935):0.5>
all the way down to 0.1 changed absolutely nothing so that's why i went with changing the weight on the name
my prompting probably messes with the likeness a bit too
>>
>>107408976
no remorse for anne, she deserved it
>>
>>107408397
that's timotay
>>
>>107409020
better, but no
trying to hide the problem by using tons of feathering isn't it anon
this isn't the way to get this solved
>>
>>107409020
what's after 1.5, she is becoming progressively more grinch-like
>>
File: ComfyUI_21431_.jpg (336 KB, 1382x1036)
336 KB
336 KB JPG
>>
>>107409046
>>107409020
go 2.0 and prompt her with green skin
>>
File: five.jpg (98 KB, 861x1076)
98 KB
98 KB JPG
>>107409046
(cara delevingne:5.0)
>>
File: ComfyUI_00064_.mp4 (3.52 MB, 480x640)
3.52 MB
3.52 MB MP4
>>
>>107409085
fuckin kek
>>
>>107409085
Nepo-baby that stole the Christmas!
>>
>>107407877
What prompt did he use for the text???
>>
>>107408604
zimage is such slop for troonime
>>
>>107409121
Its like SDXL at 500x500 10 steps, reminds me of neta
>>
File: ComfyUI_temp_sispg_00003_.png (3.29 MB, 1024x1536)
3.29 MB
3.29 MB PNG
>>
File: 00153-10467573012706.png (1.68 MB, 1024x1536)
1.68 MB
1.68 MB PNG
>>
Hitler remained elusive, so I had to settle for the next person in the shadow cabal
>>
File: 1753380270144229.png (1.49 MB, 1280x720)
1.49 MB
1.49 MB PNG
>>107409062
nice one anon
>>
>tfw AMD

What's the best option for someone stuck with an AMD card?
>>
>>107409184
suicide
>>
>>107409168
>pony
uh oh janjan aint gunna like tht one
>>
>>107409184
Doesn't Comfy have a AMD portable release ? If so that's your best bet.
>>
>>107409166
That's the face of a girl who just stole your doughnut.
>>
>>107409189
no but for real mah dogg, drop the shit ahh answers and tell me
>>
File: zimg_0048.png (3.72 MB, 2048x1496)
3.72 MB
3.72 MB PNG
which one is better?

>>107409085
my sides
>>
File: ComfyUI_21432_.jpg (376 KB, 1382x1036)
376 KB
376 KB JPG
>>107409170
you can just slap in whatever and it'll happen
>>
>>107409205
Eat hers later
>>
File: 1736662223117659.jpg (44 KB, 1200x672)
44 KB
44 KB JPG
>>107409213
>>
File: ComfyUI_00332.png (2.59 MB, 1200x1808)
2.59 MB
2.59 MB PNG
>>107408584
I got about 1.5s/it at 1024 with my 4090. R16, 2900 steps and very verbose captioning. AI-Toolkit is really fucking shitty though, Z will probably fair a lot better if they release the base model and you can train on something that's not so completely ass.
>>
>>107409213
right, except for the "canon"
>>
File: combined_0085.jpg (505 KB, 3557x2040)
505 KB
505 KB JPG
>>
>>107409184
buy an nvidia card
>>
File: hands.jpg (294 KB, 1310x1356)
294 KB
294 KB JPG
>>107409213
right has better nails
>>
>>107409206
realistically sell your AMD card and get something better. If you're broke then be prepared to spend the whole day researching how to get the best out of it, and knowing that even then, it'll still be shit.
>>
>>107409250
This is nice!
>>
File: ComfyUI_temp_sispg_00007_.jpg (536 KB, 1024x1536)
536 KB
536 KB JPG
>>
>>107409166
definitely AI. a girl that skinny would never eat a donut.
>>
>>107409274
do you see a bite out of that donut? she's pretending to eat it
>>
>>107409250
Dunno Z just has that dead AI look to it, flux and qwen looks convincingly ''artistic''
>>
File: ComfyUI_00072_.mp4 (1.19 MB, 720x1280)
1.19 MB
1.19 MB MP4
>>
File: 1741598277558521.png (1.01 MB, 824x1264)
1.01 MB
1.01 MB PNG
still love qwen edit 2509 (v2). it's so good. especially since the new one allows multi image and easy referencing with no latent stitching needed. (image1/2/3)

replace the police officer in blue with the pink hair anime girl in image2, who is wearing a blue police uniform and badge, and kneeling on the black man on the floor. keep the anime girl's expression the same. Add the text "Bocchi the Cop!" to the top of the image.
>>
>>107409290
it's clearly trained on seedream slop
>>
>>107409268
This is candid shot. In the early 2000s Brad always carried a sword with him when he was in LA.
>>
>>107409225
Diffusion-Pipe has Z-Image Turbo support

OneTrainer seems to be waiting for Base
>>
>>107409302
could you share a flow for that?
>>
File: ComfyUI_08466_.png (2.86 MB, 1280x2048)
2.86 MB
2.86 MB PNG
>>
>>107409260
This autist knows his shit
>>
>>107409313
>Brad always carried a sword with him when he was in LA.
Who doesn't ?
>>
>>107409250
Z-image anime is pretty generic looking but holy fuck man what does the BFL guys have against anime? Shit always is so melty and anatomy downright awful when it tries to do anime.
>>107409290
It's a real anime image captioned by qwen. qwen has the same generic style as Z.
>>
File: 1743035093581159.png (226 KB, 447x693)
226 KB
226 KB PNG
>>107409321
it's just the default comfy template for qwen image edit. if you updated comfy it's there, didnt change any settings.
>>
File: 890346345.png (1.62 MB, 1024x1536)
1.62 MB
1.62 MB PNG
>>
>Z is supposed to be poorfag friendly
>can't even get Turbo running on my 2070S in comfy
haha...
>>
>>107409352
oh, nice. I rarely look at the templates

thank you
>>
>>107409290
Z turbo only really looks nice when you find a good pocket of training since they essentially slapped a lora on top of it before releasing
>>
>>107408185
>https://comfyanonymous.github.io/ComfyUI_examples/z_image/t
ded link
>>
>>107409355
>2070S
yeah, poor. not completely destitute, living in a crack-shack eating great value rice crispies with no milk
>>
>>107409348
I rely on rooftop Koreans.
>>
>>107409389
2070S is better than 3060 though. SDXL forks work fine...
>>
>>107409383
delete the last "t". don't know why that's there. baker fucking with it probably, fucking faggot
>>
File: zimg_0064.png (2.67 MB, 2048x1496)
2.67 MB
2.67 MB PNG
>>107409260
>>107409232
>>107409221
it's subtle but noise injection does seem to have an effect
>>
File: combined_0079.jpg (596 KB, 3916x2040)
596 KB
596 KB JPG
>>
>>107409403
>noise injection does seem to have an effect
you can also use this
https://github.com/BigStationW/ComfyUi-RescaleCFGAdvanced
>>
File: 1739371615742748.jpg (568 KB, 2560x979)
568 KB
568 KB JPG
When you increase nag_sigma_end you stop NAG earlier than expected, not only it makes it faster but it lets the model more time to do some cleaning on "normal mode", NAG is important only at the begining when it has to create the scene and add the relevant characters with its superior prompt adherence, but the details should be handled without it
>>
File: 1733247629285070.png (1.68 MB, 1024x2048)
1.68 MB
1.68 MB PNG
>>107408828
>>
>>107408970
>>
File: ComfyUI_21438_.jpg (361 KB, 1382x1036)
361 KB
361 KB JPG
anon's dark fantasy lora
>>
File: 1757837492208020.png (495 KB, 860x640)
495 KB
495 KB PNG
>>107409471
even migu is tired of his bullshit
>>
File: 0923523.png (1.86 MB, 1024x1536)
1.86 MB
1.86 MB PNG
>>
File: Zurbo_00125_.jpg (792 KB, 3328x1792)
792 KB
792 KB JPG
>>107409471
That's me! I am the one who is depicted in this image!
>>
>>107409485
That looks good, why doesn't he put that on civitai? >>107406069
>>
File: .png (1.38 MB, 1152x656)
1.38 MB
1.38 MB PNG
Ladies, I think we jumped into the wrong system..
>>
>>107409471
Where did you get this image of??
>>
File: zimg_0071.png (3.71 MB, 2048x1496)
3.71 MB
3.71 MB PNG
>>107409423
interesting
>>
>>107409294
>hummingbird kiss
Cute
>>
>>107409503
that's pretty good, you're using a lora for that one?
>>
>>107409471
is that zimage? cute migu
>>
I NEED to know the size of base!
>>
File: ComfyUI_00396.png (2.3 MB, 1200x1808)
2.3 MB
2.3 MB PNG
>>107409314
>Diffusion-Pipe
Do I have to jump through a a bunch of hoops and mirror HF to work offline like I do with AI-Toolkit?
>>
>>107408774
For the last time since you guys have brain damage and can't read the paper, the base model is the same size as the turbo model at 6B. It will be slower because you can't use 8 steps, you'll need 3-4x the steps to get an image or even higher like prior models.
>>
File: zimg_0075.png (3.86 MB, 2048x1496)
3.86 MB
3.86 MB PNG
>>107409577
eat a dick nigga
>>
>>107409446
nag sigma end being backwards from what you expect is so on brand for comfy
>>
>>107409589
512x512 images of only her face right?
>>
>>107409591
Then why is it taking so long to release if its just a worser turbo??
>>
File: .png (2.24 MB, 1152x1072)
2.24 MB
2.24 MB PNG
>>107409547
nah just zit and a ludicrously long prompt
>>
File: shrug.png (551 KB, 750x1000)
551 KB
551 KB PNG
>>107409577
Why, it won't be released anyway
>>
File: 1745713452904618.mp4 (1.31 MB, 832x480)
1.31 MB
1.31 MB MP4
The anime girl with large breasts sits at a desk in a Japanese classroom.

even with minimal details wan does well. the kijai MoE lora for high helps a lot too (latest lightx2v update, but fixed)

https://huggingface.co/Kijai/WanVideo_comfy/blob/main/LoRAs/Wan22_Lightx2v/Wan_2_2_I2V_A14B_HIGH_lightx2v_MoE_distill_lora_rank_64_bf16.safetensors
>>
File: 1737369861148409.png (1.42 MB, 1280x720)
1.42 MB
1.42 MB PNG
>>
File: ComfyUI_08442_.png (2.76 MB, 1280x2048)
2.76 MB
2.76 MB PNG
>>
>neoforge
>like 3 downloads
>zit workflow for comfyUI
>1 GORLLION DILLON DOWNLOADS
>>
>>107409020
any idea why cfg isn't configurable on a step by step basis? It's the same idea. So is nag. mess with the latent, after a step is finished.
>>
>>107409591
>>107409606
It's not meant to gen on, it's meant to train loras on, and then gen those loras on Turbo
>>
File: 98275346345'.png (1.92 MB, 1024x1536)
1.92 MB
1.92 MB PNG
>>
File: 1763126861920664.png (1.4 MB, 1280x720)
1.4 MB
1.4 MB PNG
>>107409613
>it won't be released anyway
>>
holy shit i just got access to wan 2.5 from a friend. it's 12 different 28gb models, with a gradient of noise levels. I just trained a lora of a cat (12x 1.2gb safetensors) and the results are magical.
>>
>>107409663
Tree branches don't grow 1 ft from the ground. Whoever made this image has literally never stepped foot outside.
>>
File: 1734925878367847.mp4 (1.31 MB, 832x480)
1.31 MB
1.31 MB MP4
>>107409620
The anime girl with sits and reads a book with the title "LDG" in a Japanese classroom.
>>
>>107409606
>Then why is it taking so long to release if its just a worser turbo??
the potential is too big, if someone finetunes that monster it'll end up being too powerful for the goyims
>>
File: ComfyUI_08359_.png (3.05 MB, 1280x2048)
3.05 MB
3.05 MB PNG
still no base Z image?
>>
File: zimg_0082.png (2.34 MB, 1024x1496)
2.34 MB
2.34 MB PNG
>>
File: 1738924086766300.jpg (2.25 MB, 2048x2048)
2.25 MB
2.25 MB JPG
>>
>>107409591
Paper is just a paper. Until I can run it on my own pc I treat it like it doesn't exist.
>>
>>107409687
can you share the flow for this?
it's so damn clean
>>
>>107409608
>ludicrously long prompt
But can it depict ludicrously high speed?
>>
>>107409721
uh no? lol
>>
>>107409423
>>107409523
it does sound nice to be able to control cfg better.
>>
File: 1737126961496420.png (1.32 MB, 1280x720)
1.32 MB
1.32 MB PNG
>>
>>107409691
they're wrapping it and tying the bow.
>>
>Loading checkpoint shards: 100%|##########| 3/3 [06:24<00:00, 128.13s/it]
does this take ages for anyone else?
>>
File: zimg_0086.png (2.63 MB, 1024x1496)
2.63 MB
2.63 MB PNG
>>
>>107409721
it's just wan 2.2 from the templates

Lora setup for wan 2.2:

HIGH:
https://huggingface.co/Kijai/WanVideo_comfy/blob/main/LoRAs/Wan22_Lightx2v/Wan_2_2_I2V_A14B_HIGH_lightx2v_MoE_distill_lora_rank_64_bf16.safetensors

LOW: 2.2 lightning low:
https://huggingface.co/Kijai/WanVideo_comfy/blob/main/LoRAs/Wan22-Lightning/old/Wan2.2-Lightning_I2V-A14B-4steps-lora_LOW_fp16.safetensors

1 strength for both
>>
File: u_00161_.png (742 KB, 608x1024)
742 KB
742 KB PNG
Finally got this setup to work, didn't think it ever would
> First person view, sitting on a camp mattress inside a tent, the male legs of the subject are visible, slightly spread. A cute young woman sits across him on the other end of the same mattress. Her legs are intertwining with his. She is looking at the camera.
>>
File: 1745048567469540.png (1.32 MB, 1280x720)
1.32 MB
1.32 MB PNG
>>107409765
kek that one is better
>>
File: ComfyUI_00420.png (2.39 MB, 1200x1808)
2.39 MB
2.39 MB PNG
>>107409600
No, 1024+ all around.
>>
File: ComfyUI_21445_.jpg (349 KB, 1382x1036)
349 KB
349 KB JPG
0.4 or so is the sweet spot for the fantasy lora. otherwise it seems like it is deforming stuff but this depends of course
>>
so whats currently the best workflow for Z image turbo?
>>
File: 1759661099267287.png (1.4 MB, 1280x720)
1.4 MB
1.4 MB PNG
>>107409811
>0.4 or so is the sweet spot for the fantasy lora.
Oh shit I was at 0.8 that explains why it couldn't do text
>>
>>107409811
What lora are we talking about?
>>
>>107409633
> why does no one want to use inferior tool?
>>
Has anyone trained a lora that actually works well at 1.0 strength
>>
>>107409849
-> >>107409499
>>
>>107409519
wut
>>107409566
yeah it's z
>>
>>107409811
prompt?
>>
>>107409880
>wut
Of me, fuck. This image of me.
>>
File: 00066-814520533.jpg (1023 KB, 2048x2304)
1023 KB
1023 KB JPG
>>107408185
proper workflow in SD reforge to upscale images?
i'm using the standard 1024x1024 proper resolution but i want to upscale at least x3.
I have 24GB vram.
>>
>>107409403
the one on the right is better
>>
>>107409811
>>107409832
Do you think Comfy fucked up the lora "fix"? It's weird we have to go for lower strength of 1 to get the normal effect of loras
>>
>>107409184
Just use linux and comfyui. It works. I gen with wan, Z, qwen, SDXL, chroma etc on my 7900 XTX. LLMs with llama.cpp work great too, I have been running GLM AIR and GPT OSS 120B with weights split between GPU and CPU.

>>107409261
>>107409189
nvidia FUD. I get why you would prefer Nvidia since it has better support, but why would you be such a mental slave that you actively encourage others to be stuck with a monopoly?
>>
File: ComfyUI_00073_.mp4 (1.64 MB, 720x1280)
1.64 MB
1.64 MB MP4
>>
>>107409964
It's a turbo model anyway and would make sense to lower the strength but I don't know anything.
>>
>>107409981
FUD lol... go back to slashdot
>>
>>107409964
I think this is all ZIT Turbo loras. If you read civitai descriptions, they all recommend really low weights. It probably has something to do with making a lora for a turbo model, which is why people are waiting for basedot.
>>
File: ComfyUI_00155_.png (3.71 MB, 1440x1920)
3.71 MB
3.71 MB PNG
>>107409693
>>107409785
Interesting theme
>>
File: zi_00028.png (1.93 MB, 1024x1536)
1.93 MB
1.93 MB PNG
>>
>>107409410
bruh wtf, how can flux 2 fuck up the fingers like that?
>>
>>107410021
For a second, I misread that as "breastfeeding stall." Hot concept either way though.
>>
>>107409811
did you cook at 8k steps and high lr yet again?
>>
>>107410049
i didn't make the lora >>107409872
>>
>>107409355
Switched to this
https://civitai.com/models/2169712/z-image-turbo-quantized-for-low-vram
and it ran fine. it ain't flipping over for my 2070S till its flipping over
>>
File: combined_0049.jpg (808 KB, 3956x2040)
808 KB
808 KB JPG
>>
>>107409880
what did you prompt? comic of ___? any style prompts?
>>
File: 1.png (1.42 MB, 1024x1024)
1.42 MB
1.42 MB PNG
>>
File: ComfyUI_00041_.png (2.23 MB, 1504x1024)
2.23 MB
2.23 MB PNG
>>107409799
why's she so.... demonic looking, the missing nose, and the alien eyes?
>>
>>107410147
You are playing demonic games sir

where does it end sir
>>
File: 1741692612094697.mp4 (1.11 MB, 832x480)
1.11 MB
1.11 MB MP4
The camera zooms out on the anime girl wearing a japanese school uniform and she drinks a glass of water in a Japanese classroom.
>>
>>107410041
It's called panel van.
>>
File: ComfyUI_temp_horsg_00022_.png (2.32 MB, 1024x1344)
2.32 MB
2.32 MB PNG
Zimg is kinda back to old days of warped spacetime and unreal distances.
>>
>>107410072
How did you run the caption? Through API or local? I want to try a LoRa with boomer captions.
>>
>>107410182
it's called sovl anon
>>
>>107410133
>A horizontal two-panel web comic rendered in clean digital art style with crisp black outlines, flat colors, and expressive character animation
you can use llmarena to enhance your prompts. it's free but be careful what you put there as it's basically public
>>
>>107410182
stop! employees aren't allowed to pocket cash, you find anything.
>>
File: 1737088143987054.jpg (2.09 MB, 1536x2048)
2.09 MB
2.09 MB JPG
>>
File: 1747990976321964.mp4 (1.19 MB, 832x480)
1.19 MB
1.19 MB MP4
>>107410172
one more

The camera zooms out on the anime girl who is wearing a japanese school uniform in a Japanese classroom.

this turned out nice, no cut!
>>
>>107409892
I try to follow a shopping list structure of
>general description of the image in couple of sentences, listing the characters in the image and locations/qualities of items etc.
Followed by 2-3 sentence simple descriptions for each thing in the scene:
>character description 1
>character description 2
>character description 3
>asset(s) description(s)
>background description
Then keep working with the descriptions until the image looks ok or funny. No word salads.
>>
>bro A11111/stable diffusion is outdated shit it's all about comfyui
>ok
>install everything
>get a good workflow
>plug in the same model/lora/etc I was using in A11111
>run
>the result is literally almost 1 to 1 the same thing A11111 spits out

Wow real fucking amazing comfy shills
*smack*
>>
File: ComfyUI_00042_.png (2.14 MB, 1504x1024)
2.14 MB
2.14 MB PNG
>>107410157
when the power of the cross is too strong, it splits the demon into two non-demons even.
>>
>>107410273
if you don't need to change anything, then you don't need comfyui.

but

the moment you do...
>>
>>107410273
I thought that's not supposed to happen unless you're using those nodes that replicates A1111 settings
>>
File: file.png (2.11 MB, 2218x1059)
2.11 MB
2.11 MB PNG
>>107408717
>https://www.youtube.com/watch?v=iNM5z8cCH8w
ouhh mama mia!
>>
>>107410273
I thought it was supposed to be worse, hmmm?
>>
>>107410304
there's some minor differences but hardly worth using all these nodes and workflows when A11111 just worked fine with a generic simple GUI. I'm just not seeing the point of why I'd use comfy at all when the result is basically the same shit.
>>107410297
I mean idk? what's the best workflow that does regional prompting? maybe I'll give that a shot and see if it wins me over on comfy. This workflow seems to just gen with no special features besides all the adetailers just being included
>>
>>107410273
>the result is literally almost 1 to 1 the same thing A11111 spits out
>>
File: ComfyUI_00043_.png (1.83 MB, 1504x1024)
1.83 MB
1.83 MB PNG
>>107410282
>>
>>107410333
Because you replicated the A111 workflow to do basic 1girl shit. If you had an imagination, what could you build in comfy?
>>
File: 1745704843627084.png (985 KB, 1024x1024)
985 KB
985 KB PNG
>>
File: combined_0130.jpg (659 KB, 2040x4096)
659 KB
659 KB JPG
>>107410186
Local, with a simple python script
>>
>>107410147
Nice crat.
>>
Can someone tell me how to get zit to give me a fucking side view?
Probably skill issue on my part but it's always the subject starting directly at the viewer
>>
File: 1758717759928061.png (292 KB, 800x450)
292 KB
292 KB PNG
>>107410356
she even lost her left leg and is still happy, that's a girl of focus, commitment and sheer fucking will!
>>
>>107410215
illustrious?
>>
is it just me or is zit overhyped
fine tuned sdxl models look much nicer and are only slightly less controllable
>>
File: 1752775425295102.png (1.12 MB, 1024x1024)
1.12 MB
1.12 MB PNG
>>107410373
anything for the onion
>>
>check civitai for other people's prompts
>they're littered with useless fluff prose and LLM-isms
How hard is it to just write a basic paragraph of your 1girl
>>
>>107410400
what more do you need other than
>1girl, tomboy, flat chest
>>
If you had to choose between Wan 2.2 and Z-Image for the "Model of the Year" award, what would be your pick?
>>
>>107410409
>1girl, tomboy,
BASED
>flat chest
you're describing a man anon
https://www.youtube.com/watch?v=Zd8vzIRQLLM
>>
>>107410423
Wan 2.1
>>
https://civitai.com/models/958009
Is this good?
>>
File: 1746358059015179.jpg (1.44 MB, 1536x2048)
1.44 MB
1.44 MB JPG
>>107410378
Zimage.
>>
>>107410429
I woman isn't just a chest, it's the shoulder to hip ratio that makes a woman
>>
>>107410466
there's no women that exist with perfect flat chest, what are you smoking m8
>>
>>107410423
wan is more versatile, does animation obviously, also kinda works as an edit model and it probably has the best anatomy of any model. praying for a wan 2.5 christmas
>>
File: 1759146948096809.png (2.42 MB, 832x1536)
2.42 MB
2.42 MB PNG
>>107410452
I don't see what this lora can do that the turbo model can't lol
>>
>>107410486
>no women that exist with perfect flat chest
No 3D women maybe, that's why 3D sucks
>>
>>107410465
Wow so cool anon!
>>
>>107410423
>If you had to choose between Wan 2.2 and Z-Image for the "Model of the Year" award, what would be your pick?
Z-Image, local image models are now so close to the best API models, can't say the same for video models, we still don't have sound bruh
>>107410499
you're just describing a femboy dude lol
>>
>>107410508
>we still don't have sound bruh
LTX-2 will be open-sourced next month and it does have sound. Let's hope it will be good
>>
>>107410382
>is it just me or is zit overhyped
It's underhyped. There's still performance to be had if you could disable the safety and asianizer.
>>
>>107410465
how'd you convince it to output that style and texture?
>>
File: sad.png (153 KB, 1414x980)
153 KB
153 KB PNG
Can someone check on trooncord and see if they really said that? Can't believe we're getting the "2 more weeks" meme again :(
>>
File: b62-2502901854.jpg (27 KB, 600x450)
27 KB
27 KB JPG
>>107409905
Uhmmm... m'lady, why art thou alone in a place such as this? *tips fedora nervously*
>>
>>107410016
hot
>>
>>107410465
for a model focused on realism it can produce way better artistic shit than flux and Qwen Image lool
>>
>prompt literally anything in Z
>get a woman staring directly at camera
I see, so this is the future of AI...
>>
>>107410543
Chinese don't celebrate Christmas.
>>
>>107410543
>>107410565
that remind me of last year when we were waiting for wan to be released and we had to wait until next year because of some chink christmas or something lol
>>
File: Zurbo_00140_.jpg (845 KB, 3328x1792)
845 KB
845 KB JPG
>>107410543
POOP!
>>
File: 1751896152437762.png (577 KB, 1021x746)
577 KB
577 KB PNG
>>107410543
its
so
fucking
over
>>
>>107410565
The Chinese mostly just think Christmas stuff is American-ish stuff. Sort of like how we think of their lamp festival as just Chinese stuff.
>>
>>107410543
>>107410630
don't doom we don't know if they really said that, I want a discord screen right now!
>>
Time for my hourly reboot because comfy get inexplicably slow.
>>
>>
New to anime genning, which is your favorite Z lora and SDXL lora and model? What should I learn more about, Z models or SDXL? Pros and cons of each one?
>>
Comfyui newfag from earlier here, how exactly do I tell this thing to spit out 10 images per gen? similar to batch genning from A1111. I can't seem to find the node that handles that?
>>
File: file.png (1.23 MB, 2130x1261)
1.23 MB
1.23 MB PNG
>>107410308
let's hope the base model will be more receptive to styles, turbo is too restrictive on its choices
>>
File: YunsitaLittleRedhood.jpg (995 KB, 2048x2048)
995 KB
995 KB JPG
I love doing prude girls being cute but also hot, does anybody knows how can i increase details in lingerie using adetailer or i'm forced to do it with manual inpaint
(i wish i could do models for adetailer)
>>
>>107410648
just use the unload all models node
>>
File: Capture.jpg (8 KB, 300x85)
8 KB
8 KB JPG
>>107410675
This?
>>
File: 1740367144590184.png (18 KB, 428x200)
18 KB
18 KB PNG
>>107410675
there should be a batch size setting somewhere
>>
File: ComfyUI_00074_.mp4 (1.61 MB, 720x1280)
1.61 MB
1.61 MB MP4
>>
>>107410672
NTDMix/GENESIS lora, NlxlMix, Art-illustrious, 2DN.
>>
>>107410704
>does anybody knows how can i increase details in lingerie using adetailer or i'm forced to do it with manual inpaint >>107409423
>>
>>107410713
no it's not that
>>
File: 1girl,solo-3159004684.png (1.06 MB, 832x1216)
1.06 MB
1.06 MB PNG
>>107410704
Nice style!
>>
>>107410713
>>107410715
I will look into those
>>107410722
idk who this is
>>
>>107410672
And RouWei, massive finetune of illustrious with more concepts/artstyles/better natural language prompting.
>>
I thought Batch was for genning multiple workflows simultaneously, whereas the Queue was for genning multiple workflows serially.
>>
>>107410704
>natural language prompting
Snake oil
>>
>>107410713
>>107410715
nta but what's the difference here? wtf is batch size vs batch count
>>
>>107410736
>idk who this is
it's me.... the batch count setting functions as if you pressed the run button ten times. that's not what you want, right?
>>
>>107410753
>>107410745
You
>>
>>107410716
>Yeah no, 3 inches is totally fine
>>
>>107410765
I mean it is, but the thing is this just spits out 10 different generations on 10 different workflows I need to shuffle through. Is there a way to just give me 10 images? A11111 would give them to me in a tiled list (that I can individually go through 1 by 1)
>>
>>107410704
M'lady... *tips fedora(
Download segmenter model from CivitAI... *adjusts glasses*...for the lingerie detection, if you know what I mean. *winks awkwardly(
Then select it in ADetailer.
>>
>>107410769
Sure, so use booru prompting and enjoy the 14m image finetune.
>>
>>107410753
how should i do it and do you get better results?
>>
File: 1753274590122350.png (1.24 MB, 1024x1024)
1.24 MB
1.24 MB PNG
first person perspective from above a japanese woman dressed as Hatsune Miku standing in the water of a swimming pool at night who is wearing a sleeveless white blouse and black miniskirt, in Japan. Miku's arms are outstretched, and she is smiling.
>>
>>107410776
Batch size set with latent image is not the same as pressing the run button x times. Set batch size 4, seed increment, run. Run again, the first image from the second batch will not be the second image from the first batch.
>>
File: 1749445373869022.png (1.18 MB, 1024x1024)
1.18 MB
1.18 MB PNG
>>107410794
added: anime style image
>>
>>107410540
I think it's a combo of "pale color, muted color, painting \(medium\), oil painting \(medium\)"
This prompt was basically random, i threw my old noobai prompts at the wall and looked what whill stick.
>>107410559
I'd say Z has a decent amount of non-realistic art styles in it. But if we get a Chroma styled finetune it's gonna be insane.
>>
Poll

https://poal.me/kse2wp
https://poal.me/kse2wp
https://poal.me/kse2wp
https://poal.me/kse2wp
https://poal.me/kse2wp
>>
File: 1744352943822971.png (1.31 MB, 1024x1024)
1.31 MB
1.31 MB PNG
>>107410807
in the style of an oil painting:
>>
RouWei is shit at both natural language and Booru tags.
End of the diacussion.
>>
>>107410818
>No Flux 2
come on anon, give it a chance and show to everyone how everyone don't give a fuck about that model kek
>>
>>107410818
wait what is illustrious v2?
>>
>>107410818
zimage improved everything a lot but we could still do almost all of the things it offered, just with more effort and a lot slower
wan 2.1 on the other hand actually allowed video generation to be of any reasonable quality to be usable by anyone at all, and later did so at 10x the speed with loras
>>
File: ComfyUI_00070_.png (1.07 MB, 1120x1008)
1.07 MB
1.07 MB PNG
>>
>>107410818
>no Qwen
>no Flux 2
>no Hunyuan 2 or 3
>chroma
>>
>>107410851
>wan 2.1 on the other hand actually allowed video generation to be of any reasonable quality to be usable by anyone at all
I'd argue this was Hunyuan Video. Did they ever release the 720p version btw?
>>
>>107410818
What about Qwen Image Edit? I have a lot of fun with that model
>>
File: 1girl,solo-2159004648.png (1.3 MB, 832x1216)
1.3 MB
1.3 MB PNG
>>107410818
Why not SDXL?
>>
>>107410878
in another world hunyuan would have gotten the community effort and probably would have taken that place, but that just didnt happen, it was a worse model
>>
>>107410879
qie was an incremental improvement over kontext dev but was also objectively worse for some things like smaller changes on a persons face, so it cant be the model of the year
>>
>>107410891
A lot of the things that made Wan usable at decentish speeds before lightning loras came along were developed for Hunyuan Video, like teacache and torchcompile.
>>
File: ComfyUI_00072_.png (848 KB, 1120x1008)
848 KB
848 KB PNG
>>
a next level cfg would basically run qwen image edit every step, to remove unwanted things, and enforce wanted things, also with knowledge of prior steps.
>>
File: ComfyUI_00051_.png (2.17 MB, 1504x1024)
2.17 MB
2.17 MB PNG
you NERDS need to stop SCARING bass.
>>
File: ComfyUI_00060_.png (1.76 MB, 1520x1008)
1.76 MB
1.76 MB PNG
Chroma can do early CGI pretty nicely, I was going for the fallout 1 death screen with this one
>>
>>107411074
looks like Conan Exiles
>>
>>107411074
proompt
>>
>>107410870
Does anyone here would actually use Qwen-Image as a "daily driver" for T2I ( not editing) ?
It's just too slopped to be useful, and if you care about doing advanced/ high end stuff you may as well just use the nano banana pro API
>>
File: zimg_0095.png (1.76 MB, 1024x1496)
1.76 MB
1.76 MB PNG
tf kinda tent is this
>>
File: 00253-1017770128.png (1.31 MB, 1024x1024)
1.31 MB
1.31 MB PNG
>>
>>
File: YunyunDate.jpg (1014 KB, 2048x2048)
1014 KB
1014 KB JPG
>>107411136
>>
File: 1754782702767190.png (1.19 MB, 1024x1024)
1.19 MB
1.19 MB PNG
A group of people in an unemployment line outside a building in the city named "UNEMPLOYMENT OFFICE". the people are all wearing tshirts that say "flux 2".
>>
File: ComfyUI_00059_.png (2.27 MB, 1520x1008)
2.27 MB
2.27 MB PNG
>>107411114
A landscape of a desert in the style of the videogame Fallout 1. There are buildings in ruins and ancient technology scattered along. There is a human ribcage and skull lying in the foreground. Daylight. Detailed, volumetric lighting, 4k, high res, old CGI style

Chroma 1-HD is worse with this btw, seems like its a fine tune to speed up high res gens
>>
So what's the value/node/whatever in comfyui I have to tweak to make the generation "randomize" more of the final result while still mostly adhering to the prompt? I want to see some more variety in the end results without changing the core character/theme
>>
File: really?.png (695 KB, 1080x636)
695 KB
695 KB PNG
https://www.reddit.com/r/StableDiffusion/comments/1pchpjb/quick_psa_the_stablediffusioncpp_implementation/
>With my 2060 6GB and the fp16 hack I get about 7.5-8s/it on z-image with comfyui. I've tried several different speedups like cache-dit and the gguf nodes (to limit offloading), but they either look noticeably worse (cache-dit) or make no difference (gguf).
>Now with StableDiffusioncpp I'm getting 4s/it, nearly a 2x speed increase without any noticeable quality degradation.
>>
>>107411130
what lora?
>>
File: ComfyUI_00053_.png (2.23 MB, 1504x1024)
2.23 MB
2.23 MB PNG
leave bass alone
>>
File: ComfyUI_00077_.png (1.27 MB, 1120x1008)
1.27 MB
1.27 MB PNG
>>
tried to install NAG for a basic z workflow, but im getting an error. anyone knows what do i have to look up to fix this? i'm running the patientx fork of comfy https://files.catbox.moe/hxkzcj.json
>ValueError: Model type <class 'comfy.ldm.lumina.model.NextDiT'> is not support for NAGCFGGuider
>>
>>107411204
that's because you didn't install the good NAG branch, it's this one
https://github.com/scottmudge/ComfyUI-NAG
>>
all these fucking NAGgers i swear to god
>>
I am not afraid anymore..I am a NAGger
>>
I get OOMs with NAG.
>>
File: 2608627578.png (1.66 MB, 832x1216)
1.66 MB
1.66 MB PNG
>>
File: ComfyUI_00080_.png (1.13 MB, 1120x1008)
1.13 MB
1.13 MB PNG
>>
>>107411166
makes sense, comfy is trash
>>
>>107411229
thanks! it's working now
>>
File: 891363297.png (1.43 MB, 832x1216)
1.43 MB
1.43 MB PNG
>>
>>107411300
>it's working now
if you don't know the right parameters for NAG, I recommend you this one
>cfg 1, nag_scale 3, nag_tau 1, nag_alpha 0.25, nag_sigma_end 0.75
>>
>>107411309
>shoes
>>
File: zimg_0118.png (1.8 MB, 1024x1496)
1.8 MB
1.8 MB PNG
>>107411167
psxam
>>
File: I SAID BASE.png (1.88 MB, 1024x1024)
1.88 MB
1.88 MB PNG
Close enough kek
>>
>>107411324
>>107411190
kino
>>
>>107410023
it's not a fined tuned model anon. Jesus vramlet hours today
>>
File: 1751836583221175.png (140 KB, 250x250)
140 KB
140 KB PNG
>>107411324
dont worry /ldg/, i will hunt down the bass
>>
>>107411341
>it's not a fined tuned model anon.
no one is gonna finetune a distilled 32b model with a shit licence when Z-Image exists, are you fucking retarded?
>>
File: file.png (2.34 MB, 1344x1344)
2.34 MB
2.34 MB PNG
FOR ZE VATERLAND
>>
File: ComfyUI_00016_.png (766 KB, 730x1024)
766 KB
766 KB PNG
>>107411309
hey, thats pretty cool, sexy school girl shoes too
>>
File: ComfyUI_00055_.png (1.63 MB, 1504x1024)
1.63 MB
1.63 MB PNG
>>
>>107411165
PLEASE REPLY
>>
>>107411341
What...a fucking 32B model will need a finetune to be good?
>>
>>107411353
A-ALL I EVER WANTED
>>
>>107411229
Why hasnt the original devs just update their repo? Whats going on?
>>
>>107411366
That's pretty cool. First I thought it was the ComfyUI dev.
>>
>>107411341
>muh vramlets
dead model enjoyer
>>
>>107411400
>Why hasnt the original devs just update their repo?
maybe they got them
>*insert bogdanov phone meme since the max limit of image replies has been reached :(*
>>
>>107411375
Was to see you smiling
>>
>>107411400
multiple devs are sick of the comfy update humiliation ritual. they might be fed up as well
>>
bake so I can post the good spidermans already
>>
Baker?
>>
>>107411401
>>107411366
sorry, fixed
>>
This is the last /ldg/ thread.
>>
ahhhh image limit ahhh
>>
>>107411436
Comfy broke NAG because he changed the names (layers, blocks, etc) of things that nothing actually uses yet.
>>
i want my money back, this baker isn't doing his job
>>
>>107411475
yeah but there's some PR that are fixing that and are just waiting to be merged, all they have to do is to press one button and they're good to go...
>>
>>107411158
Black farting logs studio still got 300 million in investments because of flux 2, you don't hate boomers enough.
>>
Ahh Baker-Sama ahh... ahh...
I- hmmmg... I-I have slop to p-post...
Ahh.. B-Baker-Sama please don't tease me like this...
>>
No bread until bass
>>
Rehydrate yourselves.
>>
NOOO NOT THE IMAGE LIMIT!!
>>
>>107411158
holy same face, and they all look underage
>>
wait, there's a limit on how many images you can post? since when?
>>
it's over, this is how /ldg/ dies
>>
>>107411523
Since forever.
>>
>>107411522
That would explain how they released such a shitty model
>>
>>107411528
nah, never happened in the past couple years
>>
I want newfags to leave.
>>
>>107411314
thanks, i'll try those
>>
new thred
>>107407830
>>107407830
>>107407830
>>
>>107411558
nigger
>>
>>107411558
oh debo you sneaky bastard
>>
Nevermind this image limit, why cant I post an image from incognito anymore? This site keeps getting shittier mang
>>
>>107411610
its so mossad can track you, they don't finna mess with incognito
>>
No base.
No bake.
This is the end.
>>
>>107411632
pack it up folks, its been... something
>>
fine, I'll do it myself.
>>
Come on I need to post anime girls
>>107411653
>>
/ldg/...
Forgive me...
*dies*
>>
>>107411725
>>107411725
>>107411725
>>
>>107411727
heroic bake



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.