/g/ - Technology



Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107648831

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2485296
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
Blessed thread of XMAS bakes
>>
File: z-image_01268_.png (2.34 MB, 1280x1280)
>>
File: z-image_01280_.png (3.14 MB, 1440x1440)
>>
>>
>>107651582
arr rook same

>>107651612
all look same
>>
File: ComfyUI_00188_.jpg (118 KB, 1024x1024)
why did they train qwen to do this
>>
File: 04smpb.jpg (53 KB, 1048x654)
>>107651567
>>Maintain Thread Quality
>https://rentry.org/debo
>https://rentry.org/animanon
is there a good reason for these to be in the OP?
>https://rentry.org/ranfaggot
>>
File: file.png (12 KB, 828x153)
>>107651657
>>
>>107651658
12 > 10 but you didn't subtract the botted nos. no wonder why you don't have a job. simple math is beyond you
>>
>>107651670
ok feel free to make your own thread without the rentry and use it alongside the other 20 french anons who said they want to remove the rentries :D
>>
>>107651657
You're a sad pathetic internet troll who will never amount to anything in life. You will not be remembered. Your UI will never be used by anyone and no one cares about you, regardless of how many years of your pointless life you spend spamming these threads.
>>
>>107651618
>>
>>107651607
another miss but still nice.
https://vocaroo.com/11A7XKNqTmkl
>>
>>107651683
>Your UI
what UI?
>>
File: 1760247853379471.webm (438 KB, 794x794)
jarvis, enhance image
>>
ani got whiskey amnesia
>>
I should be a lora.

I SHOULD BE A LORA.
I. SHOULD. BE. A. LORA.

I should be the most famous lora.

I am the most popular lora.
A AM POPULAR. I AM THE BEST LORA. I AM THE KING OF LORAS.
>>
I wonder if the Christmas market this year will have the throw the poop at the jeet booth again this year. I hope so, I love participating in all the exciting foreign things.
>>
>>107651640
>day
>not ruined
>>
>>107651640
because you are based
>>
File: 1745492168513798.png (77 KB, 1904x509)
What is this gay shit bros
>>
>>107651657
Reality check. You think everyone ignores your rentry-free threads, again and again, because they just happened to see the real one first?
No. They ignore you because they hate you. Everyone ITT hates you, and if spamming "b-but jakjak is a jani and is banning all the pro-ani anons at the same time to make it look like it's all me!!" had convinced anyone, they would have used your threads by now.
You want to fruitlessly spend your precious time on Earth during Christmas week making 100 posts per single thread defending your honor in a crowd that hates you, only to have them all nuked every single day? Ok. Good luck. I don't think that's gonna work, but then again, you ignored me when I warned you that your obvious samefagging was just tanking your own reputation, and look where we are now.
>>
>>107651741
Okay Furk
>>
>>107651775
just wait. it gets worse
>>
>>107651703
https://vocaroo.com/1k2uhbI65nWQ
idk, working on it. [inst] messing things up sometimes kind of. text becomes less accurate?
>>
File: z-image_01294_.png (3.13 MB, 1440x1440)
>>
>>107651781
lol ur mad
>>
>no access to computer all week
I have gone insane-o
I lust for coome-o
>>
>>107651794
I'm not the one who makes 100 posts per thread seething.
>>
File: z_mod_00528_.jpg (630 KB, 1928x1728)
>>107651795
>>
>>107651552
>>107651587
>>anti-AI troons getting so mindbroken that the color yellow scares them now
that is real """ai psychosis"""
>>
File: 00009-2525051340.png (1.69 MB, 1480x944)
>>
File: z-image_01297_.png (3.25 MB, 1440x1440)
>>
File: 1737080787301364.mp4 (1.9 MB, 2048x730)
Would you look at that.
https://github.com/BigStationW/ComfyUi-TextEncodeQwenImageEditAdvanced
>>
>>107651842
always the stealth mvp in these threads... ;)
>>
>>107651842
>>107651856
?
>>
>>107651557
prompt or wf?
>>
>>107651871
pretty sure that anon does heavy gimp edits
>>
>>107651842
wtf. I'm assuming the older QIE versions worked the same way and no one figured this out until now?
>>
>>107650857
Thanks.
>>
>>107651897
I have no idea why Comfy decided to downsample the images to 384 * 384; Qwen 2.5 VL can see better at higher resolutions.
>>
>>107651787
>pip install "triton-windows<3.5"
simply as
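To confirm the pin actually took effect in the environment ComfyUI runs in, here is a minimal sketch. It assumes the package is installed under the distribution name `triton-windows` (as in the command above); `parse_version` and `satisfies_pin` are hypothetical helper names, and the parsing is deliberately crude (it only looks at the leading numeric parts).

```python
from importlib import metadata


def parse_version(text: str) -> tuple[int, ...]:
    """Turn '3.4.0.post20' into (3, 4, 0); stops at the first non-numeric part."""
    parts = []
    for piece in text.split("."):
        if not piece.isdigit():
            break
        parts.append(int(piece))
    return tuple(parts)


def satisfies_pin(installed: str, upper_bound: str = "3.5") -> bool:
    """True when `installed` is strictly below `upper_bound`."""
    return parse_version(installed) < parse_version(upper_bound)


if __name__ == "__main__":
    try:
        version = metadata.version("triton-windows")
    except metadata.PackageNotFoundError:
        print("triton-windows is not installed in this environment")
    else:
        verdict = "ok" if satisfies_pin(version) else "too new for the <3.5 pin"
        print(version, verdict)
```

Run it with the same Python interpreter that launches ComfyUI, otherwise it checks the wrong environment.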
>>
File: z-image_01312_.png (3.42 MB, 1536x1536)
>>
1100s to 351s fucking insane
>>
File: z-image_01313_.jpg (2.62 MB, 1536x2048)
>>
>>
>>107651997

multiple requests
>>
File: ComfyUI_temp_sivsg_00005_.png (2.02 MB, 1152x1920)
>>
File: 00043-2274768803.png (1.81 MB, 1480x944)
>>
>>107652023
What a waist ratio, damn
>>
Stop posting porn on main you disgusting faggots
I just got fired for cumming on cat
>>
File: ComfyUI_temp_sivsg_00007_.png (2.04 MB, 1152x1920)
>>
>>107652115
nice
>>
File: z-image-q_00009_.png (2.68 MB, 2048x1536)
>>
File: ComfyUI_temp_sivsg_00010_.png (2.19 MB, 1152x1920)
>>
1girl Infinity
>>
File: z-image-q_00015_.png (2.48 MB, 2048x1536)
>>
>>107652110
>Stop posting porn on main you disgusting faggots
>I just got fired for cumming on cat
Just masturbate at work like I did today pussy faghot
>>
File: The legend of Migu.png (1.41 MB, 1448x720)
>>
>>107652213
The cat was my boss why do u think I got fired
>>
File: 00079-2120665467.png (1.32 MB, 1216x832)
>>
File: ComfyUI_temp_sivsg_00013_.png (2.01 MB, 1152x1920)
>>
>>107651807
what workflow to make basedjacks
>>
File: ComfyUI_temp_sivsg_00014_.png (2.42 MB, 1152x1920)
>>
File: ComfyUI_temp_sivsg_00016_.png (2.07 MB, 1152x1920)
fucking chroma when Im just about to drop it as a model, I always find ways to gen some good stuff
>>
>>107652311
Please do share the trick for Chroma
>>
>>107652311
>good

anon...i...
>>
>>107652311
it's so good, you better post this exact image in next 20 more threads
>>
>>107652311
Is the implication that this is widely removed from what you've posted previously
>>
>>107652333
>thick mommy thighs
he better
>>
File: z-image-q_00061_.png (3.61 MB, 1788x1340)
>>
real women are not built like that you are lusting over a fabrication
>>
>>107652344
thats sounds like a problem for women
>>
>>107652344
>lusting
define lust.
>>
>>107652344
>nigga aint seen fat young hos before they get cellulite everywhere
>>
>>107652351
kys pedo
>>
>>107652353
shes like 23 or 24 nigga
>>
so now that the dust has settled,
qwen edit... what happened there??
>>
>>107652358
did I stutter?
>>
Can I make a request here ?
I'm trying to generate a very high quality deep space image for my BabylonJS skybox sphere. It doesn't HAVE to be equirectangular, but it must be very high definition for it to look good.
No planets, no nebulas, no galaxies, just pure jet black space with lots of distant twinkling stars.
If anyone can help I'd greatly appreciate it.
If you can't post it here because it's too high-res, maybe post it in /hr/ or upload it somewhere and link me. Thanks again.
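A plain black sky with nothing but stars does not strictly need a diffusion model. As a rough sketch (not a tested BabylonJS asset pipeline), the script below scatters single-pixel stars uniformly over the sphere and writes them into an equirectangular grayscale image. The function names, the star count, and the brightness curve are all assumptions picked for illustration; the output is a PGM file, which will most likely need converting to PNG or JPG before an engine will load it.

```python
import math
import random


def make_starfield(width: int, height: int, star_count: int, seed: int = 0) -> bytearray:
    """Return width*height grayscale bytes: black sky with single-pixel stars."""
    rng = random.Random(seed)
    pixels = bytearray(width * height)  # all zeros = jet black
    for _ in range(star_count):
        # Uniform direction on the sphere: longitude uniform, sin(latitude) uniform.
        lon = rng.uniform(0.0, 2.0 * math.pi)
        lat = math.asin(rng.uniform(-1.0, 1.0))
        # Equirectangular mapping: x follows longitude, y follows latitude.
        x = min(int(lon / (2.0 * math.pi) * width), width - 1)
        y = min(int((0.5 - lat / math.pi) * height), height - 1)
        # Cubing the random value makes most stars faint and a few bright.
        brightness = int(40 + 215 * rng.random() ** 3)
        index = y * width + x
        pixels[index] = max(pixels[index], brightness)
    return pixels


def write_pgm(path: str, width: int, height: int, pixels: bytearray) -> None:
    """Write the pixel buffer as a binary PGM (P5) file."""
    with open(path, "wb") as handle:
        handle.write(f"P5\n{width} {height}\n255\n".encode("ascii"))
        handle.write(pixels)


if __name__ == "__main__":
    w, h = 8192, 4096  # 2:1 aspect ratio, as equirectangular maps expect
    write_pgm("starfield.pgm", w, h, make_starfield(w, h, star_count=60000))
```

Sampling the latitude through `asin` rather than uniformly keeps the stars from bunching up at the poles once the image is wrapped onto the sphere.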
>>
File: ComfyUI_temp_sivsg_00023_.png (2.79 MB, 1440x1120)
>>107652333
not quite the same images, too bad your eyes are not trained /experienced enough to notice the big differences that each image has
>>
>>107652406
https://skyboxgen.firebaseapp.com/
>>
File: z_mod_00552_.jpg (763 KB, 1344x1728)
>>107652282
Lora test to see if I can mix crude dataset with different art styles
>>
>>107652406
https://github.com/numz/ComfyUI-SeedVR2_VideoUpscaler
>>
>>107652424
please release it soon senpai sama
>>
>>107652403
faggot
>>
>>107652442
ok boomer
>>
>>107652418
cool site but doesnt do stars NTA
>>
>>107652418
That's actually very useful for me, but not for this situation, as >>107652455 said, it does not generate a starry night sky.
>>107652432
I appreciate your help but I have neither the hardware nor the know-how to operate this. Plus this upscales so I need a base image anyways.
>>
File: ComfyUI_temp_sivsg_00029_.png (2.79 MB, 1440x1120)
>>
>>
>>107652496
Incredible gen
>>
>>
>>
>>107652507
thumbnails are way less powerful in grayscale mode.

I feel a lot more important in grayscale mode. Refined. The American Literati.
>>
>>107652508
lmao
>>
>>107652496
fuk u btich https://www.the-sun.com/news/15635710/mary-magdalene-turbulent-life-surgery-addict-botched-ops/
>>
Huh... Turns out using abliterated Qwen-4b as ZImage text encoder actually improves porn generation. Improvements aren't that big, but abliterated can generate A and B-cup tits (regular one only generates either flat or d+ cup) and dicks are slightly better. Abliterated is less likely to generate bikini that wasn't requested, etc.
I thought since qwen was used only as an encoder, its censoring wouldn't affect image generation, but it did.
>>
Behold, a man of power. He asserts grayscale mode. All before his presence kneel.
>>
File: z_mod_00568_.jpg (451 KB, 1344x1728)
>>
fixing my prompt lol. Tough job. I'm a hard worker!!!
>>
>>107652536
It takes enormous power to bear the cross of action.
>>
File: 00000-2789092111.png (1.4 MB, 1216x832)
>>
alright.

comparison time.

discreet_euler / global
https://vocaroo.com/16YxtSZzZCcr
>>
File: 55.mp4 (2.32 MB, 848x1264)
>>
>>107652525
Interesting trick. I can't even coax flatties out.
>>
File: 00000-2789092112.mp4 (637 KB, 1280x720)
>>107652631
>>
>>107652631
>>107652631
Maybe just try other qwens? Maybe their "censorship" is very crude.
>>
>>107652597
spiral / global
https://vocaroo.com/1gMEWd90spC5
>>
>>107652664
What are system requirements for this model? Can it sing in Japanese?
>>
>>107652676
sorry but I'm not sharing anything, took me weeks searching in the japanese AI forums to figure this shit out
>>
>>107652679
lmao! I'm feeling like making it work in a day just to piss you off.
>>
File: z_mod_00605_.jpg (457 KB, 1344x1728)
>>
>>107652597
>>107652664
:-)
>>
>>107652693
good luck, i bet if you manage you won't share either
>>
>>107652676
requirements seem low. idk about Japanese. I wouldn't know if it was getting it right.
>>
>>107652679
Nigga just share the name of the fucking model you sperg. Seriously, you're on a local model thread, tf are you doing gatekeeping shit?

Are you Emad Mostaque? How'd the crypto scheme go?
>>
>>107652664
pingpong / global
https://vocaroo.com/1n01OPk4jBSC

I had previously stuck to pingpong. Quality is lower, I think, but better prompt following?
>>
>>107652597
>>107652664
>>107652720
you are autistic (in the bad way)
>>
File: z_mod_00608_.jpg (474 KB, 1344x1728)
>>
File: 1758248467153461.png (2.2 MB, 2285x904)
>>
>>107652717
Isn't he using SongBloom?
>>
>>107652769
Yeah I had to go back a thread to find it. Still though, gatekeeping in local is certainly quite a choice.
>>
File: 1739185437421210.jpg (88 KB, 1360x768)
>>
File: 1741410004109073.png (575 KB, 1112x936)
replace the black man on the left in image1 with the anime girl in image2, who is wearing a black trenchcoat and is looking at the man on the right. leave the text unchanged.
>>
File: 1765627080676118.jpg (298 KB, 1280x1280)
>>
File: 1749526865987576.png (1.25 MB, 1024x1024)
>>
>>107651842
interesting, thanks for pointing this one out!
>>
File: 1744664669383721.png (1.04 MB, 1360x768)
>>
File: 1753262691159393.jpg (574 KB, 2560x1280)
abliterated-qwen-4b vs puritan-qwen-4b. Same seed, same prompt.
>>
File: xm.jpg (161 KB, 832x1488)
>>
>>107651842
interesting, how does it work/why does it work?
>>
>>107652901
NTA but comfy default feeds a small image into the "seeing" part of the model and that extension makes that input bigger, which seems to help things be more accurate.
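For a feel of what "bigger input" means in pixels, here is a small sketch of aspect-preserving resizing to a total-pixel budget. It is not ComfyUI's or the extension's actual code: `scale_to_pixel_budget` is a hypothetical helper, and treating the 384 * 384 figure mentioned earlier in the thread as an area budget (rather than a fixed square) is an assumption.

```python
import math


def scale_to_pixel_budget(
    width: int, height: int, budget_pixels: int, multiple: int = 1
) -> tuple[int, int]:
    """Resize (width, height) to roughly budget_pixels total, keeping aspect ratio.

    `multiple` optionally snaps each side to a multiple (e.g. of a patch size).
    """
    scale = math.sqrt(budget_pixels / (width * height))
    new_w = max(multiple, round(width * scale / multiple) * multiple)
    new_h = max(multiple, round(height * scale / multiple) * multiple)
    return new_w, new_h


if __name__ == "__main__":
    source = (1920, 1080)
    # Small budget, matching the 384 * 384 area quoted in the thread.
    print(scale_to_pixel_budget(*source, 384 * 384))
    # A roughly 1-megapixel budget for comparison.
    print(scale_to_pixel_budget(*source, 1_000_000))
```

At the small budget a 1080p source shrinks to roughly a quarter of its side length, which gives an idea of how much fine detail the vision encoder never gets to see.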
>>
>>107652925
interesting, this node is even more useful cause you can add a weight to it, with 0.15 as the default or higher if you want it to copy the original more.
>>
File: 1764448134464295.png (1 MB, 1360x768)
>>
File: file.png (30 KB, 634x268)
>>107653002
do you have this behind positive/negative? allegedly it helps especially with multiple image references. I am not that far into testing the new version yet that I can tell.
>>
File: 1759046267827953.png (1.27 MB, 1360x768)
>>
>>107652889
https://files.catbox.moe/f279cl.mp4
>>
>>107653021

https://i.4cdn.org/wsg/1766560486782458.mp4
>>
File: IL_00007_.png (2.03 MB, 1920x1080)
>>107653021
migu
>>
File: 1737535117451352.png (1.29 MB, 1000x1048)
>>
>>107653092
cute, also try the kijai MoE lora for high noise if using wan 2.2, it works pretty well, your video is smooth so that might be using it.
>>
the man has is leaning on a table with his face beside a line of cocaine. the table is filled with white bags of powder. keep his facial expression and appearance the same.

not bad, 4 steps with the new lora.
>>
File: 1739085311776260.png (1.33 MB, 1000x1048)
>>107653132
helps if I attach the image
>>
>>107653136
aight I'm gonna ask, obviously you aren't wired quite right, so how are you even affording the ram/vram/compute?
>>
>>107653182
very deboesque question
>>
>>107653182
I had my 4080 for a while now and its only 16gb

qwen edit doesnt require a lot, 32 or 64gb ram can load models that overflow into ram without issue too
>>
File: b.mp4 (1.7 MB, 720x544)
>>107652889
Forgot to change the aspect ratio so the model only had the boobs to work with lmao
>>
File: 1752669612477778.png (1.34 MB, 1024x1024)
>>
>>107653021
give workflow
>>
>>107653222
https://files.catbox.moe/946g50.json
>>
File: ComfyUI_03499_.png (1.19 MB, 1024x1024)
>>
>>107653229
nta but tanks
>>
File: 1743608211394458.png (1.05 MB, 1296x808)
replace the text "MARIO KART" with "FENT MAN". add the black man in image2 to image1 behind the center logo in image1. keep his expression the same.
>>
>>107653229
wait so you genned the text in a different one? damn
>>
>>107653244
I had an edit of the original blue archive image and I wanted to test with the new edit.
>>
finall-
>>
File: 1766556129070643.png (1.06 MB, 992x1048)
replace the white man in image1 with the black man in image2, in the same pose. keep the expression of the black man the same.

not bad desu
>>
File: file.png (1.55 MB, 1360x768)
>>107653021
>>
>>107653287
nice, edit model is so fun/versatile, esp with the multi image inputs.
>>
>>107653277
ultra slopped desu.
waiting for fagchaku
>>
File: 1762531944053648.png (1.16 MB, 992x1048)
>>
File: 1764448134464295.mp4 (1.83 MB, 720x720)
>>107653299
>>
>>107653302
reply to me instead
>>
File: 1762531944053648.mp4 (1.65 MB, 672x864)
>>107653351
>>
File: 1739143874340371.png (1.4 MB, 1024x1024)
add the black man from image2 to the right of image1. keep the expression of the black man the same. the black man is holding a large bag of white powder. Change the "E" rating to "F". replace the text "Yu-Gi-Oh!" in the center to "Fen-Tan-YL!".

success!
>>
File: 1752139332342760.png (1.25 MB, 856x1216)
>>107653373
>>
what causes someone to spam his shitty sloppy negroid edits?
>>
>>107653406
it's like hello world for a new model, fent man is a test case.
>>
>>107653406
he has not had a new idea in four years. he deserves our pity. he is the worst of us.
>>
>>107653415
that's a wild cope for having the mental illness to gen an ugly dead nigger on Christmas eve
>>
File: 1745642824126933.png (948 KB, 856x1216)
>>107653429
well im done with that anyways, here's something more festive.

the asian girl is wearing a red christmas bikini with white furry trim. She is holding a candy cane.
>>
File: ComfyUI_03301_.png (968 KB, 1024x1024)
>>
File: 1759608036228000.png (863 KB, 856x1216)
>>107653442
edit continues to be the most versatile model, cause it can gen stuff and also manipulate.
>>
>>107653442
>>107653449
holy fucking slop, the skin looks like literal shit. are you sure you have this setup correctly?
>>
File: ComfyUI_00114_.jpg (386 KB, 1280x1950)
>>
File: 1744858153786745.png (1.11 MB, 832x1248)
>>107653451
looks festive to me
>>
File: z_image_03412.jpg (297 KB, 1024x1024)
>>
File: 1756088123355183.png (1.06 MB, 832x1248)
>>107653463
the asian girl is wearing a red christmas bikini with white fur trim. she is making a snowman, in the snow.
>>
>>107653470
just using 4 steps with the 4 step 2511 lightning lora today, cfg 1, using Q8 qwen edit (new from today, both)

fast and works, cant complain
>>
File: file.png (86 KB, 1508x409)
so z-image-turbo has high image quality due to RLHF maxxing, and the other variants wouldn't be as good?
>>
Since QiE is running Qwen2.5 to rewrite user prompts, it means that censorship can refuse the prompt and the user won't even know it, huh?
>>
File: 1751557882082885.png (1.51 MB, 1248x832)
the asian girls are wearing a red christmas bikini with white fur trim, and a tiara with reindeer antlers. the setting is snowy.

neat, from business in spring to christmas.
>>
File: 1744287924948703.png (1.12 MB, 832x1248)
>>107653502
one more test, anri but make it christmas:
>>
>>107651842
>my honest to god reaction seeing this information
https://youtu.be/hmpJqJLsR48

will have to give this a try in a sec.
>>
File: 1737940285772864.png (1.01 MB, 832x1248)
>>107653525
>>
>>107653498
use an abliterated model if you're concerned
>>
>>107651567
Instead of moving, ephemeral imageboard threads, are there some Discourse forum communities around Local Diffusion tools, setups, workflows?
>>
File: ComfyUI_00026_.png (2.06 MB, 1088x1920)
>>107653546
we weave baskets here sir.
>>
>>107653482
where is this table from?
>>
File: 1738903627145389.png (1.38 MB, 856x1216)
the anime girls are wearing red santa hats, and the setting is snowy. they are in front of a christmas tree with many presents in front. a sign in front says "Merry Christmas LDG!"

new edit versatile as always.
>>
File: 1737968281564630.png (1.13 MB, 1176x880)
>>107653561
>>
>boost MP to 2.1 since i'm genning at 1080p
>now for my nudify prompts its not nudifying, and its adding some random arr rook da same asian chick in WITH the subject in like a selfie format
thats hilariously strange. thought maybe setting it too high would cause abstract patterns or make the whole image noisy, not fuck with prompt adherence.
>>
>>107653554
z-image's repo
>>
tanks
>>
File: 1758850209845794.png (780 KB, 848x1232)
make the black dress a red christmas dress, make the black blindfold red, and give the girl a santa hat.
>>
File: 1758849587791679.png (821 KB, 744x1392)
>>107653661
pretty clean for a low quality png
>>
>>107653661
it's so over for artists yet again
>>
>latest comfy removed the inpaint workflow template
HELP
>>
File: 071517_00001.mp4 (1.46 MB, 1024x1024)
>>107653684
>he pulled?
>fuck his sheet up
>>
>>107653684
>he doesn't have a bajillion customized workflows that have nothing to do with the default workflows anymore
>>
File: 1741999763569380.png (1.07 MB, 1024x1024)
>>
File: burn.png (663 KB, 1380x604)
Talking about inpainting, I'm trying to use flux fill and getting this weird burn effect around the mask, any clues?
>>
File: 1735576749803892.png (739 KB, 936x1112)
>>107653718
>>
>>107653482
yeah base will mostly be useful for training, we'll have to wait for finetunes once it's released so will take a while
>>
I'm using scottmudge's comfy-nag with zit and it doesn't work for me. It's literally no different than the regular cfg guider
>>
File: 1757365175065013.png (745 KB, 936x1112)
make the image in the style of an 8 bit videogame.

almost like the old school pc adventure games, sorta:
>>
>>107653553
some of us prefer to follow discussions asynchronously.
>>
What did he mean by this?
https://github.com/huggingface/diffusers/pull/12857#issuecomment-3689226117
>>
File: 1747887061343707.png (702 KB, 744x1392)
>>107653773
also neat cause I didnt prompt pixel art.
>>
oo-ee-oo
>>
>>107653799
the image literally doesn't change regardless of the parameters. Even tau = 2, which should normally result in garbled output, does nothing
>>
File: arr rook da same chart.png (133 KB, 606x685)
and chart explanation by our favorite insider
>>
with all this training done on the base model, will any loras made on it still work with the version of zit we have?
>>
>>107653847
bi bew releases? who cares
>>
>>107653828
post the images here so we can help
>>
File: merged.png (22 KB, 1023x299)
>>107653877
*grabs your balls and twists*
>>
>>107653887
can i download? no? then fuck off
>>
File: 1758849587791679.png (474 KB, 500x646)
>>
>>107653847
as long as its real base and not ultra prefiltered and cucked stabilityai """""""""""""""""""base"""""""""""""""""""""""
>>
>>107653909
I imagine that's why they're going out of their way to make sure people don't get their hopes up, when they say base they MEAN base like "don't try to gen your 1girls on this one".
>>
>>107653482
>>107653738
Ok but how would distillation from Base to Turbo work? Will we be able to make our finetunes Turbo compatible? "Medium" quality sounds pretty bad, and it's already bad that the model is worse.
>>
>>107653907
post workflow then
>>
File: adv.webm (3.92 MB, 1440x1080)
>>
What the fuck did the chinks mean by this?
>>
>>107653933
Just copy the values from KSamplerWithNag. Everything else doesn't matter.
>>
>>107653943
knew you wouldn't, because it doesn't work
>>
>>107653684
>>107653714
There's default workflow templates accessible from Comfy itself? I always assumed I had to go on the wiki to find them.
>>
>>107653937
wow what the fuck
>>
>>107653961
not only that they always keep them up to date with the latest models
>>
>>107653967
Then what is the point of
https://comfyanonymous.github.io/ComfyUI_examples/
?
>>
>>107653984
legacy and beta
>>
>>107653907
>>107653828
OK, it works if, and only if, I keep the cfg at 1.0. If I set the cfg to anything else, its output will be identical to the regular cfg guider at 1.0. Strange
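For reference, this is the textbook classifier-free guidance formula written out on plain lists. It is not the code of the NAG node or of ComfyUI's guider, and it does not explain the behaviour described above; it only shows what is special about cfg 1.0, namely that the unconditional (negative) branch cancels out entirely. `cfg_combine` is a hypothetical name.

```python
def cfg_combine(cond: list[float], uncond: list[float], scale: float) -> list[float]:
    """Classifier-free guidance: uncond + scale * (cond - uncond), elementwise."""
    return [u + scale * (c - u) for c, u in zip(cond, uncond)]


if __name__ == "__main__":
    cond = [1.0, 2.0]     # stand-in for the positive-prompt prediction
    uncond = [0.5, -1.0]  # stand-in for the negative-prompt prediction
    # At scale 1.0 the result equals `cond`, so `uncond` has no influence.
    print(cfg_combine(cond, uncond, 1.0))
    # Above 1.0 the result is pushed away from `uncond`.
    print(cfg_combine(cond, uncond, 2.0))
```

Because the negative branch has no effect at cfg 1.0, that is the one setting where a separate mechanism for negative prompts would be the only source of negative guidance.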
>>
>>107653989
that's a boolean mistyped as double
>>
>>107653235
anon, HOW??
prompt please
>>
>>107654016
Text-to-image AI
is a type of generative artificial intelligence that creates unique visual content (photos, illustrations, art, etc.) from simple text descriptions, often called "prompts".
>>
so what's the latest alternative to something like diffusion toolkit for viewing prompt metadata really quickly? apparently https://github.com/receyuki/comfyui-prompt-reader-node this is busted forever and https://github.com/PanicTitan/ComfyUI-Gallery this fucking blows
i don't know how a node for this isn't already standard.
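In the meantime, a stopgap sketch that needs no custom node: ComfyUI's stock image saver embeds the graph as PNG text chunks (keywords `prompt` and `workflow`), so a few lines of standard-library Python can dump them without opening the UI. The assumptions here are that the images are unmodified PNGs, that the metadata sits in uncompressed `tEXt` chunks (text stored as `iTXt` or `zTXt` is skipped), and that prompt text lives in string inputs named `text`, as it does for the usual text-encode nodes. `read_png_text_chunks` and `extract_text_inputs` are hypothetical names.

```python
import json
import struct
import sys

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def read_png_text_chunks(data: bytes) -> dict[str, str]:
    """Collect keyword -> text from every uncompressed tEXt chunk in a PNG."""
    if not data.startswith(PNG_SIGNATURE):
        raise ValueError("not a PNG file")
    found: dict[str, str] = {}
    offset = len(PNG_SIGNATURE)
    while offset + 8 <= len(data):
        length, chunk_type = struct.unpack(">I4s", data[offset:offset + 8])
        body = data[offset + 8:offset + 8 + length]
        if chunk_type == b"tEXt":
            keyword, _, text = body.partition(b"\x00")
            found[keyword.decode("latin-1")] = text.decode("latin-1")
        if chunk_type == b"IEND":
            break
        offset += 12 + length  # 4 length + 4 type + body + 4 CRC (CRC not checked)
    return found


def extract_text_inputs(prompt_json: str) -> list[str]:
    """Pull string-valued 'text' inputs out of a ComfyUI-style prompt graph."""
    graph = json.loads(prompt_json)
    texts = []
    for node in graph.values():
        value = node.get("inputs", {}).get("text") if isinstance(node, dict) else None
        if isinstance(value, str):
            texts.append(value)
    return texts


if __name__ == "__main__":
    for path in sys.argv[1:]:
        with open(path, "rb") as handle:
            chunks = read_png_text_chunks(handle.read())
        if "prompt" in chunks:
            print(path, extract_text_inputs(chunks["prompt"]))
        else:
            print(path, "<no 'prompt' chunk found>")
```

Pass it any number of PNG paths on the command line. Positive and negative prompts come out in the same list, since the graph alone does not label which is which.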
>>
>>107654030
you drag the pic to the app and it opens the workflow
>>
>>107653941
Based chinks,I was working all night on a Christmas image of me and my waifu kissing under a mistletoe. It was wholesome.
>>
File deleted.
>>107654027
yes no shit, I have generated plenty of images.
wondering how that prompt was written
>>
>>107654032
if i could punch your teeth in IRL, i'd really like to not gonna lie
now explain to me, honest to god usecase, why you want to fucking zoom into a workflow EVERY SINGLE TIME YOU WANT TO VIEW AN IMAGE'S PROMPT?
AND FOR HUNDREDS/THOUSANDS OF IMAGES?

>>107654042
Holy moly this simmered me down
>>
>>107654040
My waifu is a pony btw
>>
>>107654047
i don't get your issue
>>
File deleted.
heres another good one
just want some help refining my prompts, here is one I use now:

candid amateur photo of an adult white woman, blonde hair, nude, reclining on a couch in a lived-in living room, taking a sexy mirrorless selfie at arm’s length, smiling softly at the camera, relaxed and confident expression, natural body proportions, realistic skin texture, subtle imperfections
wooden floor, television in background, fireplace, shelves with personal items, colorful clothes scattered casually on the floor, cozy domestic environment
indoor ambient lighting, warm tones, slightly dim evening light, soft shadows, natural contrast
medium close-up framing, shallow depth of field, realistic perspective, handheld phone photo look, informal composition
high quality but non-studio, unposed, intimate, real-life atmosphere, authentic, unfiltered, modern smartphone photography, natural color balance
>>
BLOO BOORD
>>
>>107654062
z image turbo?
you don't need most of that quality descriptor shit, just describe the scene and subject. this isn't sdxl.
candid shot/amateurishly shot image of, etc is all you need.
>>
File: training_pipeline.jpg (246 KB, 2302x678)
>>107653928
>how would distillation from Base to Turbo work
they discussed it in these 2 papers (https://arxiv.org/abs/2511.22677, https://arxiv.org/abs/2511.13649)
basically they trained a few-step distillation with RLHF to surpass the base model quality
however they didn't publish the training script or how to obtain the RL reward model, so we'll have to wait and see if any anon attempts to recreate that, though i doubt we could distill one with local resources
>Will we be able to make our finetunes Turbo compatible
seems not, but a realistic style Lora might somewhat work, since it has a similar style to what the turbo was trained for
>>
>>107654030
*Looks like diffusion toolkit's been updated quite a bit the past few months, and works way better now. Issue solved.
hopefully :^)
>>
>>107654075
Just using Chroma text to image template on ComfyUI
>>
>>107654084
It's over is what you're saying. Welp, at least there's still that booru tune coming.
>>
File: ComfyUI_03609.jpg (3.81 MB, 2160x1536)
>>107654084
I thought SRPO was the "new" hotness... wouldn't it be cheaper/faster to use that on Base instead of trying to recreate their training?
>>
File: Chroma-0115.jpg (142 KB, 832x1488)
>>107653082
hot, ty
>>
>>107654179
>use that on Base
There won't be a base.
>>
>>107654212
you ruined it
>>
File: 1766573995099.jpg (433 KB, 1179x1546)
What's the current best model and/or LORA for NSFW realism? Also since when are NSFW pics allowed on /g/?
>>
>>107654230
they aren't, but it's christmas eve so jannies are probably busy
>>
File: ComfyUI_03754.jpg (2.81 MB, 1536x2160)
>>107654201
C'mon... now when have the Chinese ever lied?
>>
File: mp.jpg (1.4 MB, 1951x4018)
>>107651842
vl_megapixels seems to just determine how close to the source images you want it to be. The higher the value the more it tries to retain the look of the original images but gets more distorted and stops following the prompt. If you're doing basic editing then higher values might be better but if you're trying to prompt an entirely different scenario then you'll want lower values.
>>
>>107654230
Most flexible is probably SDXL or Chroma. E.g. you'd likely get more NSFW poses, NSFW outfits and so on.

Best in terms of like, hair and eye and room details on "just nude" NSFW could be Qwen or even flux2 dev with Lora.
>>
>>107653847
>delayed by a month
>was trained for longer
>takes longer to gen
>requires more steps
>produces worse results
OH NOOOOOOO LOCALKEKKIES IT LOOKS LIKE THE COPE IS STARTING!
>i-it's supposed to be bad, it's a base model!
and ponyv7 totally saved auraflow, right?
>>
>>107654251
As for SDXL, which checkpoints or model would you recommend? Big Lust was the last one which really impressed me, but maybe there's something better I've missed along the way.
>>
>>107654261
You're retarded

Lots of people in these threads have been pointing out ad nauseam that Base is a model to train on, not generate with, meaning it will be created with a focus on being easily finetuned to whatever you want, not with a focus on generating pretty pictures fast like the distilled Turbo model

This is EXACTLY what fine-tuners want, a relatively small model (thus fast to train) with great pre-training which can be finetuned for any purpose, lodestone will be making a Z-Image Chroma and undoubtedly we will see the anime/artstyle finetuners do the same
>>
File: 1744874397563403.jpg (665 KB, 2016x1152)
>>
File: 1743476821265145.jpg (1.47 MB, 1248x1824)
>>
>>107654084
Why didn't they do SFT + RLHF (without the distillation)? The quality would've been even better, just imagine
>>
>>107654400
>50 steps
How about...no...
>>
>>107654261
retard
>>107654344
truth nuke
>>
>>107654409
they're giving us SFT (which is also 50 steps) because they want us to get some quality out of the base model, so why not go further and give us SFT + RLHF? ultimately people can choose, if you want to stay on turbo it's your choice
>>
>>107654400
They're releasing the SFT as well, just look at the chart
>>
>>107654421
I'm asking for SFT + RLHF, not SFT alone, learn to read anon ;-;, it's the RLHF part that made Turbo so special
>>
>>107654409
SDXL was 50 steps as well, but you easily get away with ~20
>>
>>107653722
post the actual workflow
>>
>>107654400
RLHF biases the model toward a style that humans consider 'good', so mixing RLHF into that training phase might lock the model into a particular style, which is not what they want when creating z-image (a foundational model)
>>
>>107653847
What would be the use case of Z-Image Omni then, if Z-Image SFT seems to be just as easy to finetune and that base model seems more solid?
>>
>>107654432
But you don't want that for a model made for further training, you don't want aesthetic bias in it, that is something you as a finetuner want to provide
>>
>>107654442
>>107654452
I didn't imply I wanted a variant of a foundational model (we'll already get the foundational model), what I said is that they had the possibility to get a model even better than Turbo, which is basically their finetuned method without the distillation, I'm really surprised they didn't go that path

In case you still don't understand it, I don't want base2, I wanted them to give a finetuned variant as well
>>
stop arguing you dinguses, its crimbus
>>
>>107654400
>SFT + RLHF (without the distillation)
you remove the distillation and you make the model bigger (let's say 15b), and that model is Nano Banana tier, I'm not joking
>>
File: 1740838562343545.jpg (758 KB, 1248x1824)
>>
>>107654459
>which is basically their finetuned method without the distillation
For what purpose would that be better than Turbo ? Distillation, as in cherry-picking aesthetically pleasing images from a base model and training on it is what makes Turbo good for its intended purpose.
>>
>>107654510
>Why would a double (guidance + steps) distilled model be worse than a normal model?
Asking that in the year 2025 of our lord is crazy
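
For anyone lost on what "guidance distilled" means here, a minimal toy sketch in Python. This is not Z-Image's actual training code: `teacher_eps`, `cfg_teacher`, `student` and `distill` are made-up names, and the "denoiser" is just a linear function. Classifier-free guidance needs two teacher passes per step; guidance distillation trains a student to reproduce the guided output in a single pass. In this toy the student can match the teacher exactly, which says nothing about how close a real distilled network gets.

```python
import random


def teacher_eps(x, cond):
    """Toy stand-in for one denoiser forward pass (assumption: linear model)."""
    prompt_term = 0.3 * cond if cond is not None else 0.0
    return 0.5 * x + prompt_term


def cfg_teacher(x, cond, scale):
    """Classifier-free guidance: two teacher passes combined with a scale."""
    uncond = teacher_eps(x, None)
    guided = teacher_eps(x, cond)
    return uncond + scale * (guided - uncond)


def student(w, x, cond, scale):
    """Single-pass student that takes the guidance scale as an input."""
    return w[0] * x + w[1] * cond * scale


def distill(steps=5000, lr=0.02, seed=0):
    """Fit the student to the guided teacher output with plain SGD."""
    rng = random.Random(seed)
    w = [0.0, 0.0]
    for _ in range(steps):
        x = rng.uniform(-1.0, 1.0)
        cond = rng.uniform(-1.0, 1.0)
        scale = rng.uniform(1.0, 5.0)
        err = student(w, x, cond, scale) - cfg_teacher(x, cond, scale)
        # Gradient of 0.5 * err**2 with respect to each weight.
        w[0] -= lr * err * x
        w[1] -= lr * err * cond * scale
    return w


if __name__ == "__main__":
    w = distill()
    print("student weights:", w)
    print("teacher (2 passes):", cfg_teacher(0.2, 0.4, 3.0))
    print("student (1 pass): ", student(w, 0.2, 0.4, 3.0))
```

Step distillation is the same idea applied to the sampler: the target becomes what the teacher produces after many steps, and the student is trained to get there in a few.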
>>
>>107654459
maybe because there is no reason to. they didn't report whether RLHF alone surpass the turbo version, so we can assume that even if it had better quality, it wasn't by enough to justify releasing a separate model
>>
>>107654519
It will be slower and generate less good looking images, are you retarded ?
>>
>>107654510
>Distillation, as in cherry-picking aesthetically pleasing images from a base model and training on it
This is not distillation, you just described RLHF, distillation is when you lobotomize the model to be faster (the lightning loras are distillations for example), and the quality is always worse than a normal model, it's the definition
>>107654524
>they didn't report whether RLHF alone surpass the turbo version
you have a point, maybe they tried it and didn't say so in the paper, but still, an undistilled SFT + RLHF finetuned model would've been easier to finetune, it's the distillation that makes training impossible
>>107654533
are you retarded I just told you that the distillation hurts the model quality, you are low IQ to understand that YES TURBO IS GOOD BUT IT COULD'VE BEEN EVEN BETTER IF IT WASNT DISTILLED

What the fuck is your problem? If you want to keep using turbo just do it, my point was that it's cool to have choice and people who want max quality could've went the non distilled path, fucking retard
>>
File: 00017-3264607982.png (2.64 MB, 1920x1080)
>>
>>107654533
>It will be slower
yes
>and generate less good looking images
hey pal, are you from stupid town? since when distilled models give better quality images than their non distilled variants?
>>
>>107654544
>the distillation hurts the model quality
No it doesn't for its intended purpose, which is to generate very good looking images fast

A non-distilled model of the same size will not have some magic properties, it will be slower and have a lower ratio of good looking images, you could argue that by not being trained on cherry-picked results it will have more variety but that also means more shit, including busted anatomy

The one big advantage would be for training, but we're already getting the best possible models for training, pre-trained and SFT
>>
>>107654562
>it will be slower and have a lower ratio of good looking images, you could argue that by not being trained on cherry-picked results it will have more variety but that also means more shit, including busted anatomy
what the fuck are you talking about? do you even read posts or something? the "trained on cherry picked results" IS THE RLHF PART NOT THE DISTILLATION, I NEVER SAID I WANTED TO REMOVE RLHF I SAID I WANTED TO REMOVE DISTILLATION

FUCKING LOW IQ RETARD REEEEEEEE
>>
>>107654555
>since when distilled models give better quality images than their non distilled variants?
Since forever, do you even know what a distilled model is ?

The main reason for distilling is to get better ratio of good results, typically the model is smaller as well but that's not necessary.
>>
File: 00039-2793464622.png (2.65 MB, 1536x1536)
>>
File: ComfyUI_04267_.png (2.15 MB, 1184x1544)
>>
>>107654573
>I WANTED TO REMOVE DISTILLATION
Why ? You will have the same model cherry picking bias but SLOWER ?

How fucking stupid are you ?
>>
File: 1759095307579564.png (301 KB, 2009x814)
>>107654574
>Since forever, do you even know what a distilled model is ?
ok this dude has to be a troll, we spent the whole 2025 using distilled loras on models and we always noticed they gave worse results than when we're not using the distilled loras and this fucker is saying such nonsense and is confident about it

you have no idea what you're talking about don't you? distillation will always hurt the quality of a model (and it makes the distilled model impossible to finetune), it's the tradeoff for speed you retarded fuck
https://arxiv.org/pdf/2510.14974
>>
>>107654574
>Since forever
that doesn't sound right. they always have something weird about them compared to the teacher model
>>
>>107654593
>space before the question mark
Europoor spotted. Opinion rejected
>>
>>107654574
are you fucking retarded or something?
>>
>>107654574
When a model is distilled:
- It's way worse at diversity
- The quality is a bit worse and more slopped
- It is impossible to finetune

Speed is the only good thing out of distillation.
>>
>>107654574
That's a nice quality bait, take my (You) kind sir.
>>
>>107654614
>It is impossible to finetune
not impossible, but it's so much fucking harder to get things to stick that you have to blow something like $200k of rented compute on a model like chroma to get anything
>>
>>107654625
kek
>>
>>107654626
still retarded lodestone even tried but I guess nobody saw z-image coming
>>
File: nuff said.png (31 KB, 224x225)
>>107654574
>>since when distilled models give better quality images than their non distilled variants?
>Since forever
>>
File: file.png (108 KB, 1372x701)
>>107654614
>>107654626
you niggers are so annoying.
it depends on how much it's distilled, and what was taken out.
if you take the water out of a lemon, you can still rehydrate it and make fucking lemonade.
if you take out the citric acid, you're gonna get some weird tasting shit.
>>
nu bake
>>107654651
>>107654651
>>
>>107654649
>it depends on how much it's distilled
turbo is literally double (steps + guidance) distilled, you can't make something more distilled than that
>if you take the water out of a lemon, you can still rehydrate it and make fucking lemonade.
oh yeah? like we definitely saved Flux Schnell with Chroma? Oh wait...
>if you take out the citric acid, you're gonna get some weird tasting shit.
Now you're saying distillation is essential to train a model? are you fucking retarded or something?
>>
>>107654594
>using distilled loras
What the fuck are you babbling about ? What is a 'distilled lora' ?

You don't know what the fuck you're talking about.

>distillation will always hurt the quality of a model
No it doesn't, it effectively prunes the model of a lot of shit results, which is the main reason you distill to begin with, to create a model that gives better results, typically with a specific focus

An example is Z-Image Turbo, which has a focus on very aesthetically pleasing images, particularly photography

Bigger is NOT automatically better, it's all about the data quality

>and it makes the distilled model impossible to finetune
No it doesn't, but it makes it a lot harder and more time consuming, like with Chroma which was trained on distilled Flux Schnell
>>
>>107654658
>Ani removed the rentries again
you can try over and over you're not gonna win lol
>>
File: (You).png (105 KB, 519x519)
>>107654668
>What is a 'distilled lora' ?
>>
File: keekkk.png (437 KB, 976x549)
>>107654668
>>distillation will always hurt the quality of a model
>No it doesn't, it effectively prunes the model of a lot of shit results, which is the main reason you distill to begin with, to create a model that gives better results
lmaoooooooo
>>
>>107654662
holy shit, way to misinterpret everything i said.
you're too retarded to save, anon. it's terminal, i'm afraid.
>>
>>107654682
Concession Accepted.
>>
File: 1757545150462880.png (251 KB, 1640x1279)
>>107654682
Try to guess why Flux 2 pro is 4nd on the leaderboard and Flux 2 dev is 15th, the only difference is that Flux 2 dev is the distilled version of Flux 2 pro, are you retarded or something?
>>
>>107653984
these are what comfy makes, the ones in the app templates are made by the "team"
>>
>>107653358
It's a jizz-splotion in your mouth.
>>
>>107654682
troll
>>
>>107654585
Grand Theft Auto - Epstein Island


