[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: ComfyUI_00083_.png (1.42 MB, 1120x1008)
1.42 MB
1.42 MB PNG
Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107408185

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe
https://github.com/ostris/ai-toolkit

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://comfyanonymous.github.io/ComfyUI_examples/z_image/t

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
We will never get ZiT base model, doomposters are always right in the end, and comfy should be dragged out on the street and shot.
>>
File: ComfyUI_00080_.mp4 (474 KB, 640x640)
474 KB
474 KB MP4
>>
Kek
>>
>>107411744
wtf lol
>>
File: 1748222675017375.png (184 KB, 626x1428)
184 KB
184 KB PNG
>>107410543
This ledditor is full of shit, the last thing they said according to the release of base was "not yet", nothing else
>>
Stable video infinity for wan2.2 dis month yall https://github.com/vita-epfl/Stable-Video-Infinity/issues/31

>>107411726
kek
>>
File: ZIMAGE_00098_.png (1.66 MB, 832x1472)
1.66 MB
1.66 MB PNG
>https://comfyanonymous.github.io/ComfyUI_examples/z_image/t

Remove the t at the end
>>
File: ComfyUI_21451_.jpg (343 KB, 1382x1036)
343 KB
343 KB JPG
>>
File: 1blackgirl.png (1.2 MB, 768x1344)
1.2 MB
1.2 MB PNG
>>
>>107411725
Kek, based pic OP!
ANIME DIFFUSION NEWS!
>Noob Models!
SeeleNoobAI (2048 native resolution): https://civitai.com/models/1445275/seele-noobai-sdxl
Chenkin Noob XL:(NoobAI ESP with new dataset of character)
https://civitai.com/models/2167995/chenkin-noob-xl
WAI Shuffle Noob
https://civitai.com/models/989367/wai-shuffle-noob
>Anime Lora Making Guide!
https://civitai.com/models/22530/guide-make-your-own-loras-easy-and-free
>Model News!
ZiT Zeta Image Turbo Model: 6b model, fast, open source, doesn't understand booru tags.
UIs that supports it: Comfy, Krita AI Diffusion, Neo Forge, Swarm, SD Next
>Anime ZiT LoRas!:
Frieren LoRA
https://civitai.com/models/2176854/frieren-beyond-journeys-end-sousou-no-frieren-z-image-lora
Flat Anime Style:
https://civitai.com/models/2175307/z-image-flatanimestyle
Ra Lilium Style:
https://civitai.com/models/2125529/ra-lilium-style
Nyalia Style:
https://civitai.com/models/2180136/nyalia-style
Anime Flat Style:
https://civitai.com/models/1952560/anime-flat-style
Teto:
https://civitai.com/models/2175612/kasane-teto-z-image-lora
ANIME CHARACTER LORA REQUESTS HERE!
>>
>>107411780
daaaaaaamn
>>
File: ComfyUI_00635.png (3.28 MB, 1536x2048)
3.28 MB
3.28 MB PNG
>>107411744
Ew...
>>
>>107411780
pretty good. looks like an actual black person and not a caricature from africa
>>
>>107411791
Too old.
>>
>>107411780
we wuz realistic and sheet
>>
File: 904836457.png (1.93 MB, 1024x1704)
1.93 MB
1.93 MB PNG
>>
File: ComfyUI_00087_.png (1015 KB, 1120x1008)
1015 KB
1015 KB PNG
>>
>>107411755
I want to believe you because if it is truly 2mw, then they're definitely up to something. If the distill exists, the base exists. All they need to do is press upload but they won't.
>>
File: 1764713410.png (1.77 MB, 1024x1024)
1.77 MB
1.77 MB PNG
>>
what is going on anons? the quality of this thread has dropped dramatically in the past 48 hours. all I see is shitty gens and anons talking to theyselves. they have invaded my safe haven
>>
File: z-turbo_00042_.png (3.81 MB, 2048x1536)
3.81 MB
3.81 MB PNG
>>
>>107411827
z-image made everyone slop happy
>>
>>107411827
Zimage brought back all the mentally ill brown schizos
>>
>>107411827
>new model that everyone can use dropped
>wtf why is everyone genning and posting images
>>
File: 1763713306536234.png (57 KB, 1337x534)
57 KB
57 KB PNG
https://civitai.com/images/112379617
look at that workflow and its node, there's this in there lool
>>
File: ComfyUI_00081_.mp4 (1.33 MB, 640x640)
1.33 MB
1.33 MB MP4
>>107411825
>>
>>107411791
Is that jenna nicholson?
>>
File: file.png (332 KB, 654x368)
332 KB
332 KB PNG
>>107411855
>turbo_base
>>
>>107411859
how did you gen that so fast?
>>
>>107411855
nobody could possibly rename a model file
>>
>>107411855
troll of the century
>>
>>107411876
low res
>>
>>107411852
but they are posting every fucking output turning this place into /sdg/. select your outputs anon, dont just post everything it shits out because then you track your diarrhea all over the thread
>>
File: ComfyUI_00589_.png (1.02 MB, 1024x1024)
1.02 MB
1.02 MB PNG
here's a result from my first z-image lora
it definitely does better with a 2.5d style compared to pure 2d
>>
File: 1764716328.png (1.72 MB, 1024x1024)
1.72 MB
1.72 MB PNG
>>
File: b93-2953134924.png (367 KB, 800x820)
367 KB
367 KB PNG
>>107411827
>caring more about gens than actual discussion and info
>>
File: ComfyUI_00081_.png (1.32 MB, 1120x930)
1.32 MB
1.32 MB PNG
>>107411898
pretty good tbdesu
>>
File: Nano Banana Pro.jpg (1.92 MB, 2816x1536)
1.92 MB
1.92 MB JPG
>>107411324
NBP is still way ahead the competition lol
>>
>>107411943
I would if the butterfly spammer didnt turn it into his private fucking forum
>>
>>107411953
weird that it fucked up the tear drop so badly in the last panel though. is it supposed to be a rain drop?
>>
So where is the base model?
>>
>>107411961
>caring about gens
You need to go back to you shithole
>>
>>107411968
they never said it was going to be released on the weekend. some idiot anon just misread, probably the same idiot anon claiming base is 6B. see the pattern.
>>
File: ComfyUI_00074_.png (2.27 MB, 1504x1024)
2.27 MB
2.27 MB PNG
>>107411744
>>
>>
>>107411984
oh lawd
>>
File: 1749263175818766.jpg (1.13 MB, 2016x1152)
1.13 MB
1.13 MB JPG
Your fantasy lora is fun, anon kun
>>
I'd like more gens of cute girls being loved instead of hurt
>>
>>107411984
>>107411744
imagine wasting resources making this dumb shit.
>>
>>107411992
read the shirt
>>
File: BASED.jpg (1.8 MB, 2816x1536)
1.8 MB
1.8 MB JPG
>>107411953
>>
File: 1.jpg (751 KB, 1248x1728)
751 KB
751 KB JPG
ZIT REALLY really doesn't like making two headed cows. Does it maybe one in ten seeds.

And which asshole linked the wrong new thread?
>>
>>107411996
what of it? its still stupid. someone so obsessed with 'retarded troons' enough to make gens of it has me concerned.
>>
>>107411998
based
>>
File: ComfyUI_00088_.mp4 (792 KB, 640x640)
792 KB
792 KB MP4
>>107411984
>>
>>107411898
Share please
I love this
>>
does anyone know, why i sometimes get particles(like pearls or something) on the clothes or hair? i have this strange problem regardless of the z version
>>
File: ComfyUI_00075_.png (2.49 MB, 1504x1024)
2.49 MB
2.49 MB PNG
>>107411984
>>
>>107411898
This is so cool! Thanks for sharing do you have any manual or tutorial?
>>
So when is the Z Image booru train?
>>
>>107411998
lmao
>>
>>107412047
post pic?
>>
File: 1734533957803257.png (1.02 MB, 1024x1024)
1.02 MB
1.02 MB PNG
we are in a new no-flux-plastic era now.
>>
File: Z-image turbo.png (1.67 MB, 1024x1024)
1.67 MB
1.67 MB PNG
>Hatsune Miku performing with an Otamatone
wtf is this shit? lmao
>>
>>107412022
lmao
>>
>>107412054
never because we'll never get the base model
>>
>>107412047
It's very sensitive to steps and samplers.
For example something like
sgm_uniform and dpmpp_sde will generate b.s. details very quickly and sort of stretch everything around
i havent been able to escape euler simple 10 steps
>>
>>107412054
>So when is the Z Image booru train?
about that, on their discord someone asked them about that anime training and they didn't comment on that saying that it's confidential lol
>>
>>107412087
>i havent been able to escape euler simple 10 steps
normal is the official scheduler of z-image turbo, maybe you should give it a try
>>
File: anytest.png (22 KB, 369x573)
22 KB
22 KB PNG
Anytest is so fucking cool sometimes.
So I wanted to gen a female hand holding a flat (a shoe) from underneath, POV. I don't have a reference for this.
I do have a reference for the shoe: a shoe on Sketchfab that I rotated to the right angle and screenshot, then traced in GIMP.
I had an idea that I thought may work: look at my own hand holding a shoe from underneath from the same angle, then, using that as a reference, very, very roughly draw the hand with just skin color, no outline.
My thinking was I didn't want to draw my own manhand because that wouldn't look right. I thought maybe if I just draw in color and not with an outline, Anytest would sense my intent and draw a female hand, only roughly following the color blob I drew.
>>
File: ComfyUI_00668.png (3.8 MB, 1536x2048)
3.8 MB
3.8 MB PNG
>>107411798
You take that back!
>>
>>107412108
oh I thought it was simple, lol
>>
File: anytest2.png (176 KB, 366x575)
176 KB
176 KB PNG
>>107412117
...and it actually fucking worked. It needs a bit of work inpainting, but it fucking worked.
So, lesson learned: Anytest can sense intent from rough blobs of color and won't follow the blobs perfectly like it will lineart.
Notice how it followed the lineart of the shoe perfectly but not the blob that represents the hand/fingers.
>>
File: file.png (1.14 MB, 1024x1536)
1.14 MB
1.14 MB PNG
>>
File: 1738196974139512.png (159 KB, 1920x674)
159 KB
159 KB PNG
>>107412127
yeah same
>>
File: 1737614482682878.png (900 KB, 480x832)
900 KB
900 KB PNG
>>107412126
who is this woman? why did you make a lora of her?
>>
>>107412169
ratface woman
>>
>>107412117
>>107412132
Great info anon! Where can I download anytest!? I need it for my anime news!
>>
When am I allowed to start laughing at people who said we were getting the base model?
>>
>>107412169
fivel's sister
>>
>>107412193
If we don't get base by the end of January it's over.
>>
File: 1764684836745018.mp4 (3.46 MB, 720x960)
3.46 MB
3.46 MB MP4
>>107412169
I do believe that is Jenna Ortega
>>
>>107412207
>by the end of January
From last weekend to the end of January. Oh delicious. I'm loving this. I'm feeling so vindicated right now. Injecting the smug right into my veins.
>>
>base model is only 6b
>it's also never coming out
it's over no matter what happens. 1000 more years of SDXL
>>
>>107412218
What, that looks nothing like her lmao
>>
File: 1759834532071880.mp4 (2.64 MB, 720x1080)
2.64 MB
2.64 MB MP4
>>
File: gap.png (91 KB, 1587x867)
91 KB
91 KB PNG
>that gap between pro and dev
remember to subscribe to comfyAPI to use Flux 2 [pro], the world's most powerful local model!
>>
>>107412234
>But the Chinese company vaguely implied they would do something!
>T-they're just tuning the base model. That's why they left all my questions on read.
>>
File: what.png (469 KB, 750x1000)
469 KB
469 KB PNG
>>107412250
>No Z-image turbo
>>
File: Z-image turbo.png (1.74 MB, 1024x1024)
1.74 MB
1.74 MB PNG
>>
File: ComfyUI_00003_.png (1.45 MB, 1024x1024)
1.45 MB
1.45 MB PNG
>>
File: 1737369928955984.png (186 KB, 2552x577)
186 KB
186 KB PNG
>>107412265
>all my questions on read.
that's the only bone we can chew on
>>
>>107412297
>are we there yet?
>are we there yet?
>are we there yet?
>are we there yet?
>are we there yet?
>are we there yet?
>>
>>107412134
>deleted
jannies should just delete the thread desu
>>
File: 1742548620184919.png (741 KB, 1369x770)
741 KB
741 KB PNG
>>107412297
>not yet
mfw
>>
File: dirtypearls.png (1.24 MB, 1024x1024)
1.24 MB
1.24 MB PNG
>>107412060
>>
>>107412319
salted snaek
>>
File: file.png (1.83 MB, 1024x1536)
1.83 MB
1.83 MB PNG
>>
>>107412297
Note the pivot in tone. I've seen this before. They can't say it's not getting released. They know doing so right now would cause issues. So they will just pretend they don't know when.
>>
>>107412331
they're afraid japan will launch ze missiles if they release it
>>
lodestones will dedistill and finetune Flux 2 before Z Image Base releases
>>
>>107412325
sampler/scheduler/amount of steps used? did you use a lora?
seem like artifacts from using too many steps or using an overtrained lora. I have never gotten anything like this.
>>
File: 1.jpg (906 KB, 2221x1988)
906 KB
906 KB JPG
>>
>>107412319
lol
>>
File: 1736627885388887.mp4 (807 KB, 720x720)
807 KB
807 KB MP4
fuck wan. why can't it do simple gestures like waving her finger to sides and insists on doing the silence gesture instead?
>>
File: 1755671052313348.jpg (525 KB, 2709x1482)
525 KB
525 KB JPG
>>107412297
I've been reading the Discord comments, and if I'm not mistaken, my intuition tells me they're training on certain concepts the model doesn't yet understand because they believe it's incomplete without them. I've seen some guy asking them to add concepts like CCTV and shit and Tengyui were appreciative about their suggestions, I think they're just cooking the base model a bit more so that they can please the most of us, they're really based
>>
File: ComfyUI_00718.png (3.59 MB, 1536x2048)
3.59 MB
3.59 MB PNG
>>107412169
That's my muse... my love... my wife!
>>
when I try to run the wan workflow from the op I get a triton error
can't copy the error text for some reason but it's something like:
 FileNotFoundError: [Errno 2] no such file or directory:
C:\\Users.... triton_red_fused_scaled_dot_product_something something.source
Set Torchdynamo_verbose=1 for the internal stack trace something something

I downloaded a clean comfy portable, run the bat and changed the comfy_nvidia.bat
How to fix this?
How do I reinstall Triton?
>>
>>107412383
just reinstall windows, that usually fixes it
>>
>>107412053
Furk off.
>>
>>107412367
well didnt they also get the noobai dataset? i mean i doubt they can train on that in a couple of days, idk whats its size
>>
>>107412389
>well didnt they also get the noobai dataset?
that's for a finetune, not base
>>
>>107412379
Like actual married wife whom you live with in person or some random celebrity you decided to call your wife..? And your wife was ok with you making a fucking lora of her and posting the gens on 4chan? Wild.
>>
>>107412383
install triton windows if youre on windows
and then sage
>>
File: ComfyUI_21458_.jpg (384 KB, 1382x1036)
384 KB
384 KB JPG
>>
If base is actually going to be trained further then I hope they also release a turbo lora with it so that we're not stuck with the current turbo model
>>
If the base model is still not finished but we got a distilled model from the unifinished model anyway, do you think they rushed out turbo so that they can relase this shit the same time as flux 2? if they did that on purpose that's one of the most based moved I've seen in my entire life
>>
>>107412367
>Thanks for these detailed suggestions

How could this be read any other way than "k"?
>>
>>107412415
there's like hundreds of questions on discord and they answered only to a few of them, that means that if they made the effort to answer it means they gave a fuck
>>
File: WanVideo2_2_I2V_00585.webm (1.35 MB, 1056x608)
1.35 MB
1.35 MB WEBM
>Meanwhile at Tongyi labs.
>>
>>107412414
Probably, since they mention Flux 2 in their paper
>>
>>107412414
obviously, everyone from the bigger companies is constantly training models and sitting on those improvements, when they release them depends on the market, obviously they want to maximize impact

if hunyuan dropped a video model that btfod wan 2.2, then wan 2.5 would have been foss
>>
>>107412350
10 steps euler simple, no lora
>>
gawd this loli is swaying her lips and smiling in a seductive manner while grinding on the man wearing a semi-transparent dress
FUCK
I have reached NIRVANA
THIS IS THE END OF MY CYCLE FOREVER
>>
File: 1740406496335205.png (364 KB, 429x411)
364 KB
364 KB PNG
https://huggingface.co/CompVis/stable-diffusion-v1-4
https://huggingface.co/CompVis/stable-diffusion-v1-4
https://huggingface.co/CompVis/stable-diffusion-v1-4
https://huggingface.co/CompVis/stable-diffusion-v1-4

ITS UP ITS UP ITS UP
>>
File: ComfyUI_00095_.png (1.24 MB, 1120x1008)
1.24 MB
1.24 MB PNG
Try this prompt in any model, I guarantee it wont work:

A knight slaying a dragon by impaling its head with a sword
>>
>>107412367
>Tengyui were appreciative about their suggestions
If it's like the other chinese models researchers, it's just a polite nod, nothing more, that doesn't even mean they'll do anything with it.
It's true that they're continuing the base model training, probably for it to not be a disappointment and to capitalize on the turbo success.
People just need to stop being schizo about having basic patience.
>>
>>107412463
WE'RE SAVED
>>
File: kek.png (262 KB, 640x640)
262 KB
262 KB PNG
>>107412463
>ITS UP ITS UP ITS UP
>>
>>107412329
Nice!
>>
>>107412348
bfl are going to gimp the apache 2 model even harder for flux 2. they won't let another chroma happen.
>>
another 5000 years old dalle mini
>>
>>107412466
>If it's like the other chinese models researchers, it's just a polite nod, nothing more
the qwen team from the same company asked on twitter which types of llms and sizes would people like and listened to literal whos multiple times
>>
>>107412466
>It's true that they're continuing the base model training, probably for it to not be a disappointment and to capitalize on the turbo success.
not only that, but with the amount of dicksucking they've gotten recently you bet they want to reproduce that shit again, must feel good to be this loved (they deserve it though)
>>
>>107412407
is this not included in that bat?
>>
is there literally any resource out there anywhere for finetuning VAEs?
there's several people that have done different finetunes for different VAEs and relased them but they all keep their training code secret literally nothing nowhere not even a crumb of information
>>
>>107412463
whats this?
>>
StabilityAI has purchased the rights to Z-Image and will be making it available under their API service. Thank you for your interest in Z-Image.
>>
File: 1757277274260583.png (4 KB, 300x62)
4 KB
4 KB PNG
>>
>>107412457
>man wearing a semi-transparent dress
that sounds so gay
>>
>>107412466
A few years ago I made a porn game in Unity. People would suggest things all the time. This guy wrote me like a page long wishlist of things he wanted in the game. He wanted farts, burps, incest, and noises to go along with it.
Like I shit you not, a whole page.

I responded with almost the exact same thing these guys said. And naturally, never did it.
>>
File: 1741359230450148.png (1022 KB, 1024x1024)
1022 KB
1022 KB PNG
>>107412511
its soul
>>
File: 1750263749982190.jpg (40 KB, 774x60)
40 KB
40 KB JPG
>>107412464
That would be considered gore/violent, and almost no model has that in its training data.
>>
>>107412527
What game?
>>
It's over it's so fucking over
We lost
>>
>>107412457
>semi-transparent
Translucent. Fucking ESL.
>>
>>107412504
Hopefully they don't listen to retards then.

>>107412505
If that keeps them motivated to release better stuff, people can suck their dick all day every day.
I just don't want them to do a nunchaku and just hop from idea to idea without ever actually finishing one.
>>
>>107412525
even the ai knows the context to determine it's the girl wearing the dress

>>107412543
they both mean the same fucking thing who cares
>>
>>107412538
black souls
>>
>>107412441
>the loading bar is being filled up to 100%
even Wan wants it to be finished kek
>>
>>107412530
some freak on reddit said z does gore.
>>
>>107412538
I don't think I could ever reveal it even under an anonymous name because of how fetishistic and embarrassing the contents are.
>>
>>107412530
I see, thanks for that mate
>>
Guys I tried to speed up my comfyui by using cheat engine on my browser and now my computer won't turn on anymore
>>
>>107412527
Listening to everyone is the best way to make the most boring game ever anyway, you can't please everyone, and if you try, you'll just make everyone angry because of the lack of content each thing has.
>>
>>107412530
Thank god, I wouldn't want unsafe outputs like fantasy battles.
>>
>>107412409
really good gen anon
>>
>z image is faster on stable-diffusion.cp-
syke
https://www.reddit.com/r/StableDiffusion/comments/1pchpjb/comment/nrys72z/
>>
>>107412557
probably basic medical gore or something, but I doubt it can do murder tier gore. feel free to try. i can't right now because currently wan gen'ing.
>>
>>107412270
The only put API models on their board. It's a paid API promotion site.
>>
>>107412580
>The only put API models on their board
Flux 2 dev is a local model though
>>
>>107412579
should I make a corpse lora? I have thousands of images of dead russian and ukrainian soldiers.
also is that okay to upload on civitai?
>>
>>107412575
le what?
>>
>>107412567
I agree. That's why I don't listen to suggestions. I image exactly what I wanted to see.
>>
>>107412593
No Civitai would ban it. And I'd wait for base to train any lora because turbo loras have very limited flexibility due to the lack of general knowledge.
>>
>>107412593
fuck yeah
>>
Best guide for training a character lora?
>>
>>107412608
>I'd wait for base
I'd read the room right now if I were you.
>>
>>107412575
>comfy has to wait to see some leddit posts before fixing any shit
it's the second time he did that, does he not look at the github issues or something?
>>
File: 1762533930065338.jpg (443 KB, 1841x693)
443 KB
443 KB JPG
>hey guys try this prompt
>mfw queue up wan gens for the entire day
>>
>>107412593
Please do it. You could do a wound lora that would be better than corpse because it's already possible to generate gore and lying corpses anyway. Upload it to catbox instead of civitai.
>>
File: 1745085340345444.png (1.31 MB, 1024x1024)
1.31 MB
1.31 MB PNG
>>
File: ComfyUI_00114_.mp4 (982 KB, 640x640)
982 KB
982 KB MP4
wan always seems like its trying to make a loopable video.. pissing me off
>>
>>107412573
i pasted my previous prompts into a one single messy prompt and tweaked it a little
pushing it but it's pretty amazing how many things this model can hold up together at the same time
when compared to sdxl etc
>>
File: 177.jpg (910 KB, 857x579)
910 KB
910 KB JPG
total doomposter victory
>>
>>107412620
I mean it only takes an hour to train a z lora so I guess it wouldn't hurt to try.
>>
File: 1734959390184168.png (34 KB, 740x441)
34 KB
34 KB PNG
>>107412667
feels bad man
>>
>>107412667
>doomposter

I prefer pragmatist.
>>
>distillation ruined flux
>distillation ruined chroma with plastic skin and bokeh
>distillation ruined z-image with zero seed variation
when will they ever learn? or is speedgenning pureslop just so important that this will always the the number one priority?
>>
File: 1754146694753853.mp4 (974 KB, 640x640)
974 KB
974 KB MP4
>>107412646
>>
>>107412712
>Wearing spats.
Man that was disappointing.
>>
>>107412608
Just upload it on catbox then
>>
>>107412613
high quality images with variety, preferably manually cropped. relying on buckets is lazy and it may not fit it in a way that captures key details. close ups, multiple angles, poses, expressions, etc. absolutely no synthetic data.

high quality captions. you can use ai to caption but you should still manually check each prompt.

regularization dataset. think of these as like the negative prompt of your character lora. ie, if your character is a 1girl, add images of men/different women/etc so your character lora doesn't override every subject in the image. this is debatable and some people dont use them.

amount of steps should be based on amount of images. more images, more steps. always train on base models or finetunes, never shitmixes. that's basically it
>>
>>107412739
should have prompted green shimapan, true.
>>
>>107412744
>some people dont use them.
>Some
Basically everyone.
>>
>>107412768
And also why most loras on Civitai are shit and break if you do anything remotely out the box, so ditto.
>>
>z image base releases
>finally. I can do 1girl, asian now
>>
>>107411725
>here is a guide on how to get wan working
>actually I'm going to skip a shit ton of steps that you need to run it like how to install python, pip, sage, triton and so on
>good luck figuring shit out!
what is the fucking point of writing a guide/tutorial if you're going to gatekeep it anyway by skipping shit?
>>
>>107412784
>>good luck figuring shit out!
never been this easy to figure shit out, just ask LLMs nigger
>>
File: nbp_996353.jpg (840 KB, 1408x768)
840 KB
840 KB JPG
>>107412464
>A knight slaying a dragon by impaling its head with a sword
>Try this prompt in any model, I guarantee it wont work
>>
>>107412784
look at the rentry for chroma training
>>
1 million years of SDXL
>>
File: Zurbo_00102_.jpg (907 KB, 3328x1792)
907 KB
907 KB JPG
Good natural language Anime model, when? Or is there one already? I don't know shit about Anime.
>>107412782
Hell yeah, can't wait.
>>
>>107412816
>no bokeh
based
>>
File: 1755015308384927.png (2.15 MB, 1024x1024)
2.15 MB
2.15 MB PNG
noice
https://civitai.com/models/912942/retro-sci-fi-90s-anime-style-zimageturbofluxchromaillustriousxl?modelVersionId=2463751
>>
>>107412821
>>no bokeh
it's funny that in photography, no bokeh is a result of a cheap small sensor and bokeh is a luxury with more expensive lenses with bigger sensors but in ai slop generation it's the opposite because people forgot how optics work and only ever seen shitty smartphone photos
>>
>>107412799
let me guess it's gonna be another half assed guide with steps skipped
>>
>>107412837
it's not like we forgot how optics work, it's more like we prefer no blur over blur, that's all
>>
File: 1753852560338168.jpg (472 KB, 2048x1536)
472 KB
472 KB JPG
>>107412828
how many of you never train anything yourself and only use stuff you find on civitai?
>>
File: 1753311377515825.png (2.83 MB, 1360x2048)
2.83 MB
2.83 MB PNG
>>
>>107412005
>really doesn't like making two headed cows
It always does it right for the first 3 steps or so and then "fixes" it. Do you think this is because it's distilled?
>>
>>107412853
would be nice if there was a definitive guide
>>
>>107412853
I never train anything not because I don't have the compute but I don't have the dataset
>>
>>107412857
cute breaded hair
>>
File: Zurbo_00001_.jpg (1.1 MB, 2048x2944)
1.1 MB
1.1 MB JPG
Man. I like the model. Now if only they made it...
BASED.

*Puts on sunglasses*
YEEEEAAAAAAAHHHHHHH!
>>
>>107412860
https://www.youtube.com/watch?v=Kmve1_jiDpQ

>>107412865
what do you mean? just gather 20-40 images of whatever
>>
File: 1752186331383941.mp4 (2.02 MB, 640x640)
2.02 MB
2.02 MB MP4
>>107412765
there we go.
>>
i have a conspiracy theory that runpod makes you get charged for more time by randomly throttling their shitty networking to like 500kb/s speeds
>>
>>107412876
Much better
>>
File: 1736631695315923.mp4 (771 KB, 640x640)
771 KB
771 KB MP4
>>107412361
dang i couldnt do it either
>>
>>107412799
just as I thought
line 3 in the fucking guide it randomly introduces "uv" as cmd command with any explanation whatsoever.
now I have to learn first what the fuck uv is.
so I'm back at the pip shit again that doesn't fucking work even tho it's installed when I want to install uv.
>>
File: kek.png (1.54 MB, 1280x720)
1.54 MB
1.54 MB PNG
>>107412828
>>
>>107412872
can't you go for a text in english? I don't speak ching chong :(
>>
>>107412900
now imagine if you had reading comprehension, the ability to search for information online, and a brain that isn't so dopamine rotted that you give up when you don't get instant gratification in 3 seconds
>>
File: WanVideo2_2_I2V_00587.mp4 (1.37 MB, 1136x768)
1.37 MB
1.37 MB MP4
>>
File: question-mark.jpg (115 KB, 800x400)
115 KB
115 KB JPG
So what's the value/node/whatever in comfyui I have to tweak to make the generation "randomize" more of the final result while still mostly adhering to the prompt? I want to see some more variety in the end results without changing the core character/theme
>>
>>107411998
Based, lmao.
>>
File: 1751217581098287.mp4 (1.82 MB, 640x640)
1.82 MB
1.82 MB MP4
>>107412876
even better jump
>>
>>107411744
comfy really let himself go

>>107412960
your gens r bad
>>
say i give a language model a photo of a woman engaged in sexual intercourse
are there language models that will detail everything uncensored?
>>
>>107413003
glm 4.6
>>
>>107413003
Joycaption, Toriigate
>>
>>107413003
Yes. You can even get gemini to do it with an appropriate prompt and an API key.
>>
File: ComfyUI_0068_boudoir.png (1.19 MB, 832x1216)
1.19 MB
1.19 MB PNG
>>
do you think they'll reveal how they captioned all their training data?
>>
File: 1737364363710917.mp4 (1.12 MB, 640x640)
1.12 MB
1.12 MB MP4
the people in white tshirts walk to the right

zimage base, wan to animate
>>
>>107411998
>>107412017
>>107412951
https://youtu.be/8VEmni_QzYk?t=8
>>
>>107412784
>please spoonfeed me on basic shit that can be googled
do we really share a board with people like this?
>>
>>107413036
No.
>>
>>107412934
Not how it works, unfortunately. Seed variation is a property of the model, not the settings. You can kind of fake it by messing with the CGF, but given the time you are asking this, you are probably asking about Z-Turbo, which already works at a CFG = 1. You need workarounds, like non-empty input images (but Z-turbo is surprisingly bad at handling those).
>>
>>107412853
4gb laptop please aadastand
>>
>>107412744
>manually cropped
This part I never get
>>
File: 1755849445395663.mp4 (2.06 MB, 640x640)
2.06 MB
2.06 MB MP4
>>107412646
>>
>>107412794
Model??
>>
>>107412934
try using the sampler dpmpp_2m_heun_gpu. it helps randomizing the final result.
>>
>>107413033
i came
>>
>>107413072
what the heck is z turbo
>>
File: WanVideo2_2_I2V_00588.webm (3.29 MB, 1152x672)
3.29 MB
3.29 MB WEBM
>>107412816
>>
>>107413009
>>107413015
>>107413020
wow nice thanks
>>
>>107413109
keek
>>
>>107413109
how did you train a lora on me
>>
>>107412898
hnnngggg
>>
File: 1749145003271277.png (1.24 MB, 1280x720)
1.24 MB
1.24 MB PNG
>>
File: WanVideo2_2_I2V_00036.mp4 (1.39 MB, 832x464)
1.39 MB
1.39 MB MP4
alchemical mishap
>>
>>107413145
Very cool
>>
File: file.png (261 KB, 498x408)
261 KB
261 KB PNG
>>107413108
>what the heck is z turbo
>>
>>107413036
No, that's the heart of their business.
>>
File: 1746316183236056.png (859 KB, 1084x1408)
859 KB
859 KB PNG
>>
>>107413145
mind posting it to catbox so I can see the workflow? that's neato
>>
>>107413152
Why do furries always get the good stuff first

I'm tired of NoobAI
>>
>>107413152
A boy has the right to dream
>>
>>107413152
Are people on this subreddit all on crack or something? They're always like a billion superlative for every fucking little thing.
>>
>>107413036
if there's one thing you have to understand, everytime a company is ommiting something they implicitly say that they found the part that makes models great, and, to our great surprise (no), the quailty of the data is actually an important piece of the puzzle
>>
>>107413167
nobody knows what superlative means
>>
File: 1764705756366432.png (2.52 MB, 1504x1504)
2.52 MB
2.52 MB PNG
>>
>>107413167
>Are people on this subreddit all on crack or something?

Make a post on that sub then look at the statistics for what country looked at your post and you'll understand.
>>
File: 1759845908602218.png (959 KB, 1024x1024)
959 KB
959 KB PNG
>>107413180
and with qwen edit:

hatsune miku holding the can of soda in image1.
>>
File: Zurbo_00005_.jpg (890 KB, 2048x2944)
890 KB
890 KB JPG
>>107412920
Have another one.
>>
File: 5118522.png (1.24 MB, 1280x720)
1.24 MB
1.24 MB PNG
>>107411998
>>
>>107413201
>Based is releasing tomorrow
I voted for this, thanks mr President!

Jokes aside that's a really solid comic, ZiT is impressive
>>
>>107413167
not everyone had a bad childhood. it's okay if you're frigid, lol
>>
>>107412466
> It's true that they're continuing the base model training
we all know what they do
>>
>>107413109
the average anime convention
>>
>>107413107
It's a fun aesthetic.
>>
File: 1738792839353743.png (2.28 MB, 1024x1536)
2.28 MB
2.28 MB PNG
>>107413180
Miku soda should be a benchmark
>>
File: based.png (132 KB, 498x278)
132 KB
132 KB PNG
>>107413240
those mf are adding "N64 style", "CCTV style", "pepe the frog" and I'm all for it
>>
File: 1742102814694648.png (430 KB, 640x480)
430 KB
430 KB PNG
>there are still people thinking the base model is getting released
its sad, honestly
>>
>>107413176
they always glazing unc
>>
>>107413271
I never thought it would for a second. I've grown a sixth sense for how these companies operate. Unless an explicit in no vague and uncertain terms release date it given. It's not coming.
Their evasiveness on discord is just further proof at this point.
>>
File: images.jpg (19 KB, 472x423)
19 KB
19 KB JPG
>>107413253
When are they adding bukkakes and netorare?
>>
>>107413156
catbox is ip blocking me or something idk it doesn't load. it's just default kijai i2v workflow for the vid with the lightx2v loras and painteri2v node with motion amplitude 1.15
the image was generated with qwen with a shitty pixel art lora that i trained
>>
>>107413283
ask that on their discord kek
>>
>>107413176
by the power of the ESL gods, I give you the ability to open a dictionary
>>
>>107413253
>those mf are adding "N64 style", "CCTV style", "pepe the frog" and I'm all for it

Maybe they'll add big chungus and hawk tuah too. Or maybe they'll just not release the base model because they obviously aren't going to.
>>
>>107413152
>only 8GB of ram
Zimage confirmed vramlet cope
>>
File: Flux2_00281_.png (1.98 MB, 1536x864)
1.98 MB
1.98 MB PNG
>be me
>use Flux 2 just to be contrarian
>>
lets say i have this language model file

Josiefied-Qwen3-8B-abliterated-v1.Q4_K_M.gguf

how do i load it into comfyui to make it describe images and shit? the qwen vl node is censored in comfyui
>>
File: because why not.png (1.74 MB, 1280x720)
1.74 MB
1.74 MB PNG
>>
>>107413324
>the qwen vl node is censored in comfyui
huh?
>>
File: file.png (51 KB, 562x525)
51 KB
51 KB PNG
>>107413328
yeah it wont describe shit in images
>>
File: WanVideo2_2_I2V_00590.webm (3.6 MB, 1152x672)
3.6 MB
3.6 MB WEBM
I've noticed Wan has a really hard time making Indian people.
>>
>>107413324
you load through llama.cpp and expose a openai compatible api endpoint, and in comfy you call that api with nodes
>>
>>107413291
I would but I dont have one. Do it for me anon, I need it to make the nastiest hentai mankind has witnessed.
>>
>>107413341
even with the abliterated model?
>>
>>107413350
based chang didn't train on stinky jeets
>>
>>107413354
i dont know how to load the abliterated model i downloaded which is the gguf file

>>107413351
something i have to learn
>>
>>107413354
>abliterated
Use Derestricted versions instead, they don't have the loss of IQ abliterated have, and they never refuse anything.
>>
test
>>
>>107413371
Welcome back. Don't do whatever you did last time.
>>
>>107413152
>0.2 denoise
classic nothing upscale kek
>>
>>107413378
i cannot be stopped now. the test worked.
>>
File: 1759023731816728.jpg (52 KB, 682x875)
52 KB
52 KB JPG
>>107413295
>they'll just not release the base model because they obviously aren't going to
>obviously
They took the noobai dataset like a day or whatever after turbo was released, and also said they are not happy fully with the full model as they are training it more, they cant train on the new data within a few days and release it so what exactly is "obvious" to you?

The most nuclear truth about copes like these is that this site is filled with low IQ child-brained retards with an ego problem that is scared shitless of being wrong, they want to escape that pain of being potentially wrong, of something happening that their brain would perceive as "bad", so they develop the cope to get out of it by having a dual reward strategy of eternal pessimism.
If you simply always predict something bad will happen, if it does, then you were right, and the bad thing that happened is offset by you feeling good about your prediction and being "right". If something good happens, sure, you were wrong, but the good thing that happened makes it not matter and offsets it.

tl;dr: you are brown.
>>
>>107413350
make her take the sword out, stab the knight, and then the dragon burns him with fire
>>
What was the thing you should use with z image to get better negative prompts?
>>
File: 1761554940845696.png (1.39 MB, 1280x720)
1.39 MB
1.39 MB PNG
>>
>>107413408
Naggerino.
>>
File: file.png (1.12 MB, 759x991)
1.12 MB
1.12 MB PNG
this retardation starting with zimage means the model is indeed popular
>>
File: 1748437607415033.png (38 KB, 597x194)
38 KB
38 KB PNG
>>107413390
>>
Is the community's over reliance on loras autism?
>>
File: Zurbo_00007_.jpg (1.14 MB, 2048x2944)
1.14 MB
1.14 MB JPG
>Tell LLM to generate a prompt for a comic about 4chan and Xi Jinping
>It just shits it out
This is magic. No man should wield this power.
Release the base, Chinaman. Release it.
>>
When ready

>>107413438
>>107413438
>>107413438

When ready
>>
>>107413436
No, it's "we have no choice".
>>
>>107413433
whut does the "x" to the left of the filename do also those icons are too big for my autism but thats besides my question
>>
File: 1737813741787929.jpg (41 KB, 604x1024)
41 KB
41 KB JPG
>>107413433
>couldnt engage
>strawman fallacy
>lying
Q.E.D.
>>
>>107413454
i filters the md5 to quickly get rid of annoying faggots
>>
>>107413460
kek that sounds handy but i always thought the most vervent trolls would be randomizing the md5 but maybe not huh
>>
>>107413472
no the most fervent trolls just post awful gens
>>
pomf
>>
brrrt
>>
faggot
>>
File: 1736471503997800.mp4 (1.3 MB, 720x720)
1.3 MB
1.3 MB MP4
>>107413043
>>
fbrbrbrbrbbrbrr
>>
>>107413472
its mostly to get rid of autistic waifu who post the same pictures all day for some reason i could never grasp.
>>
>>107413249
nice, I assume qwen edit with migu as source?
>>
>>107413515
kek
>>
File: 1764728024.png (827 KB, 1024x1024)
827 KB
827 KB PNG
>>
>>107412351
Do you have mental issues, buddy?
>>
>>107405868
Why use the fork you linked over the original? Just curious what the difference is.
https://github.com/ChenDarYen/ComfyUI-NAG
>>
>>107414070
the fork includes a fix for the newer version of comfy and adds support for z image, and lumina I guess
>>
>>107414018
do you have to ask?
>>
>>107414145
I see, he has a PR up for it but a fork in the meantime, thanks.
>>
it finally happened, everyone else moved on and I'm the only one left



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.