[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107536415

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
comfy should be dragged out on the street and shot
>>
weekend thread of using my gpu as a room heater
>>
Blessed thread of cute girls
>>
File: ComfyUI_00177_.png (998 KB, 1200x1056)
998 KB
998 KB PNG
base status?
>>
>didn't make it into the collage
it's over
>>
we always make the collage
>>
File: img_00026_.jpg (519 KB, 1212x1616)
519 KB
519 KB JPG
>>
>>107538293
I like this
>>
Blessed thread of frenship
>>
>>107538552
Thanks baker, good collage
>>
>buy 6000 blackwell
>still just gen 1girl in illustrious but now i can have a trillion tabs open
was i just a simple man all along?
>>
>>107538622
1girl is eternal
>>
can onetrainer train ZiT yet
>>
>>107538635

I think there's a branch that can currently.
>>
>>107538622
>upgrade from 1080 to 5060 16gb
>still gen 1girls but now in in the best models
>nothing changes yet everything is different
we're just normal men.
>>
>>107538628
1girl is all you need
>>
>>107538635
Yes, but you need to checkout this branch: https://github.com/Nerogar/OneTrainer/pull/1195

That said it should be merged very soon, it's just going through testing
>>
How do I install llama.cpp for portable..?
>>
>>107538654
>grug gets shiny pointy stone
>grug still makes big titty girls on wall
things never really change
>>
Any news with the new NoobAI update? Is it comfy compatible? I forgot the name but it was something similar
>>
File: zimg_0028.png (1.99 MB, 1600x1064)
1.99 MB
1.99 MB PNG
fun lil vhs style lora
>>
>>107538686
rawtime
>>
File: ComfyUI_temp_lufha_00019_.png (3.01 MB, 1088x1856)
3.01 MB
3.01 MB PNG
>>
>>107538552
>>Maintain Thread Quality
>https://rentry.org/debo
>https://rentry.org/animanon
bad bake. what is the purpose of these links?
>>
>>107538712
there we go, prompt perfected. absolute bollywood.
>>
>>107538714
Hehe boring day in your general? Stay here, we'll keep you company while your friends aren't around.
>>
I successfully installed llama.cpp into comfy portable. Why is it being a cunt?
The link is installing it straight into windows.
>>
Why is schizo so mean to ani? What has he done to deserve this?
>>
who's your designated 1girl?
>>
>>107538750
Do you understand what PATH environment variable is?
>>
File: ComfyUI_00242_.png (1.23 MB, 1200x1056)
1.23 MB
1.23 MB PNG
still no z-base?
>>
File: ComfyUI_temp_lufha_00028_.png (2.43 MB, 1792x1088)
2.43 MB
2.43 MB PNG
>>
>>107538765
With a google and limited knowledge, it wants me to install it to windows and add the install path to somewhere?
Weird that the github page has nothing on this.
>>
>>107538714
since drama is thread relevant, we can talk about ran's sonic diaper scat addiction

>>107538750
the jeethon devs that made the bindings are retarded.
>>
>>107538760
For me Mikasa
>>
>>107538779
migu no :(
>>
>>107538780
>Weird that the github page has nothing on this.
isnt it linking you to a literal github page for this in your image? are you being dense?
>>
>>107538750
I dunno what the hell this node this
It either needs to be installed under your venv or added to your bash depending on how it is calling it
>>107538756
We all wonder that
>>107538714
Debo deserves to be there
Ani is there due to ran's schizophrenia
I wonder if we should replace it one made about ran
>>
>>107538786
For me, it's Levi Ackerman, 1girl (male).
>>
>>107538779
Can you make them grope migu?
>>
>>107538760
Power is a cute :3
>>
File: img_00027_.jpg (1.03 MB, 1332x1776)
1.03 MB
1.03 MB JPG
>>107538610
thanks, it's chroma + lora
>>
File: ComfyUI_temp_lufha_00033_.png (2.89 MB, 1792x1088)
2.89 MB
2.89 MB PNG
>>
>>107538791
>It either needs to be installed under your venv or added to your bash depending on how it is calling it
the anon that vibe coded it didn't use the shitty python binding. it's the full llama.cpp server and the node just connects to it. no venv or wheel
>>
>>107538791
>I wonder if we should replace it one made about ran
if you rebaked with it in there I would move there especially to make tRan have the funniest melty of the year
>>
Neither ran, debo or ani exists.
They are all LLMs instructed to play a character and sporadically post here.
It's an experiment conducted by researchers and the results will be published soon.
>>
File: zimg_0038.png (2.21 MB, 1440x1080)
2.21 MB
2.21 MB PNG
this prompt gen shit is kinda wild, but a pain in the dick to setup
>>
>we
>>
>>107538813
power is made for bullying and gaslighting.
>>
>>107538790
https://github.com/ggml-org/llama.cpp/blob/master/docs/install.md

That's the link. No idea what the fuck a winget is.

>>107538782
Makes sense.

>>107538791
I did a .\python etc install which installed and didn't throw an error. But still no good.
I'll figure out what bash is tomorrow. I'm done after the 2hours of a different problemsolving this morning.
>>
finally.. taking a second from 1girls to gen.. 1rat.
>>
Requesting some animated juliens
>>
>>107538829
>double click bat file for comfy
>drag workflow
>1girl, on jetski, ocean, beach,
WOW so HARD
>>
>>107538760
Miyo's simplicity is absolutely adorable I can't stop genning her
>>
>>107538750
>I successfully installed llama.cpp into comfy portable.
no, you had to install llama ccp on cmd without caring about comfy's environement at all
https://github.com/BigStationW/ComfyUI-Prompt-Manager?tab=readme-ov-file#installation
>>
>>107538839
Based and ratpilled
>>
>>107538839
what compelled you to gen a rat? are you that bored?
>>
File: 1739387421173612.png (2.61 MB, 1536x1536)
2.61 MB
2.61 MB PNG
zimage and qwen edit are fun but I also have tried some of the latest illustrious updates on civitai. nova anime v14 is pretty good imo.

what illustrious models do other anons recommend?
>>
>>107538827
That would be based but I am too lazy to go through archives and grabbing enough of his schizo babble to write a rentry.
Maybe another day.
>>
>>107538760
>designated 1girl
>no supperior Yor
>>
>>107538786
>>107538813
>>107538846
Why would you post your waifus for other men to prompt and defile?
>>
>>107538862
>illustrious updates
all snakeoil lora merges
>>
File: ComfyUI_temp_lufha_00041_.png (2.69 MB, 1792x1088)
2.69 MB
2.69 MB PNG
>>
>>107538862
>what illustrious models do other anons recommend?
Noob v-pred 1.0
>>
>>107538859
because i love funny rats as much as i love princess peach and other assorted pretty women

Prompt for other rat enjoyers

A hyper-realistic, amateurish smartphone photograph taken covertly in a dimly lit, upscale restaurant. The focus is on a deep, ceramic crock filled with dark, rich broth. Inside the deep bowl, a large, wet brown rat is fully floating on its back in the liquid, performing a relaxed backstroke. The rat's body is partially submerged in the soup, with its wet fur matted and its tiny pink paws paddling clearly in the air above the broth surface. Ripples surround the rat as it moves. A half-eaten bread roll sits on the white tablecloth nearby. The lighting is poor and grainy, creating a gross, unappetizing atmosphere.
>>
>>107538862
WAI 15 is godtier also NTRMixWAI merge
>>
>>107538883
can you prompt that rat with goku's hair, a ratku?
>>
>>107538862
Nice gen
>>
File: ComfyUI_temp_lufha_00043_.png (2.45 MB, 1792x1088)
2.45 MB
2.45 MB PNG
>>
File: 1752879794824701.png (2.24 MB, 1536x1536)
2.24 MB
2.24 MB PNG
>>107538886
okay will check them out, nova seems decent so far though
>there is a grimoire in this chest, anon
>>
>>107538780
You should get a game console instead. Bye.
>>
>>107538886
>WAI 15 is godtier
i went back to v14, what makes 15 godtier? genuinely asking
>>
>>107538872
>t. mental illnes
>>
>>107538866
>I am too lazy to go through archives and grabbing enough of his schizo babble to write a rentry.
unfortunate but I understand. who the fuck would waste hours of their life digging through threads specifically to make a rentry nobody will read? just seems a bit too mentally unhinged for any normal person
>>
>>107538906
>t.cuck
>>
>>107538900
Yes, nova is godtier you can stay there if you want
>>
File: 1758010644221259.png (3.88 MB, 1024x1536)
3.88 MB
3.88 MB PNG
>>
>>107538869
>redditx1girl
>>
>>107538900
lewd frieren isn't that exciting. fern on the other hand...
>>
>>107538920
They are not real bro
>>
File: 1762776698602758.png (2.43 MB, 1536x1536)
2.43 MB
2.43 MB PNG
>>107538900
to compare, this is wai15, I have that, hassaku, and nova downloaded so far plus base noob.

nice colors, also since models are only 6gb or so you can get a bunch since they aren't using much space. same with loras which are very small.
>>
>>107538889
tried, z-image has no clue what that even means. best i can do is spiked hair.
>>
>>107538760
BABY MAKING FACTORY WITH CHIZURU
>>
Seriously tho... stil no Z-image base? No ETA in sight?
>>
>>107538903
Latest
>>
File: img_00031_.jpg (1.06 MB, 1332x1776)
1.06 MB
1.06 MB JPG
>>
>>107538893
>>107538881
>>107538818
>>107538779
It amazes me how the model is able to generate exactly how they are, their expressions, intent, everything, I wonder if some real photos were used for training, it looks exactly like a real life situation
>>
>>107538935
realer than you
>>
are the anime posters here threatened by the better quality /adt/ posts?
>>
File: 1735972135782053.jpg (459 KB, 1250x1566)
459 KB
459 KB JPG
>>107538969
>I wonder if some real photos were used for training, it looks exactly like a real life situation
they only trained on real photos
>>
>>107538816
Nice, very non-ai look and good composition

It's like someone in these threads actually try
>>
>>107538960
They added the mention of it back to their website around 15 hours ago.
So it will be soon.
I expect it before new year.
>>
File: 1737086690277525.png (2.38 MB, 1536x1536)
2.38 MB
2.38 MB PNG
>>107538934
breast envy is a powerful tag:

also this extension is very useful for reforge prompting: https://github.com/DominikDoom/a1111-sd-webui-tagcomplete

super easy for getting booru tags for characters or concepts, fern became fern \(sousou no frieren\) for example.

used wai15 for this one.
>>
File: ZIT_00006_ (1)-1.jpg (694 KB, 2400x2277)
694 KB
694 KB JPG
>>107538852
Man am I tired.. It works now, thanks. Going to dream of all the possibitilies ahead of me.

>>107538902
Saar.
>>
>>107538951
>z-image has no clue what that even means
dang, a super saiyan rat would've been really funny. thanks
>>
>>107538988
>breast envy is a powerful tag
based
>>
>>107538975
Why are you mentioning your general's name? I don't go to yours to promote this general.
>>
File: 1746436924648436.png (2.53 MB, 1536x1536)
2.53 MB
2.53 MB PNG
>>107538988
>>
>>107538995
if its any consolation, nano banana fucking failed hard at this by rendering a 2d png over its head kek. hey maybe base can handle it.
>>
>>107538990
Retarded faggot.
>>
>>107538988
Thanks for the addon!
>>
>>107538988
Aloof look misses the point.
You need to make her blush or look frustrated.
(with regional prompting.)
>>
>>107539003
Unacceptable, shameful display.
>>
File: 1743709694347969.png (2.46 MB, 1536x1536)
2.46 MB
2.46 MB PNG
>>107539003
oops, should have used adetailer to fix frieren there. nevertheless, fern is fine.

this is also a good one.
>>
>>107538975
I don't vibe well with them, they're not very friendly and don't really discuss tech stuff. They spend most of their time doing cringe RP like little girls.
>>
>>107539018
philistine, gownou_Eren is renowned
>>
This guy is so stupid he doesn't know what an environment variable is.
>>
>>107538990
Seems like the third image is ignored and faces are generic but comfy gen otherwise
>>
>>107539003
Kek
>>
>>107539018
yeah honestly if you're gonna get mad, go get mad at gownou_Eren on xitter >>107539036
>>
File: 1763047825261695.png (2.39 MB, 1536x1536)
2.39 MB
2.39 MB PNG
>>107539028
prompt: masterpiece, best quality, amazing quality, 2girls, frieren, fern \(sousou no frieren\), white and black striped bikini, beach, breast envy

first 3 are just default recommended for the model, rest are just basic tags.

bonus: got symmetrical docking for one gen.
>>
>>107539036
death to gownou_Eren
>>
>>107539031
I don't go there because the creator kept bumping it with stolen images and his favorite pastime is getting into petty cat fights 24/7 and samefagging almost as hard as Ani.
>>
>>107538990
is that a promptgen wf?
>>
>>107538760
Although I prefer the manga to the anime adaptation, the artists did a good job adapting her.
>>
>>107539060
>2girls
>gens 3girls
illustrious SUCKS
>>
things we know about the lolcow catjak:
>half-black amerimutt
>on welfare
>not a single contribution to anything open source
>schizophrenic off their meds
>sonic and ben10 diaper scat enthusiast
>samefags
>uggo slopstyle gilfs
>worse than debo
>jealous of ani's talent
>is malding right now

anything else?
>>
File: that's right.png (1.47 MB, 1408x768)
1.47 MB
1.47 MB PNG
>>107539077
>>2girls
>>gens 3girls
>illustrious SUCKS
tag prompting sucks you mean
>>
File: img_00039_.jpg (791 KB, 1332x1776)
791 KB
791 KB JPG
>>107538984
ty, non-ai look is the goal. Too bad Chroma is so heavy/slow especially with loras
>>
File: however.png (402 KB, 412x444)
402 KB
402 KB PNG
>>107539077
see this is one of the positives of archaic models; sometimes you just get more bang for your gen buck!
sometimes it even happens with wan, i animate a video of 5 girls with their titties out kissing, and it generates an extra suddenly jumping into frame to kiss one of the girls.

>ask for 5girls half nude kissing
>get 6girls
>>
>>107538934
Knowing she's a bonafide granny kills the joy no matter how hot she looks. Fern on the other hand...
>>
>>107539077
it's probably because anon genned at 2mp.
>>
>>107539089
>>is malding right now
should be
>malding for three years (and counting)
>>
>>107539066
Luckily there is this general for more nornal people.
>>
File: zimg_0044.png (2.64 MB, 1080x1440)
2.64 MB
2.64 MB PNG
prompt gen is overpowered
>>
>>107539106
No, you see, we're all "ranfaggot" here
>>
>>107539098
>prompt 1girl with wan kissing another 1girl
>gens 1girl listing my full name and social security number
did they stealth patch wan2.2 to have audio? weird
>>
>>107538985
2 more weeks...
>>
>>107539089
add
>splitbakes
>falseflags as the people (s)he hates
>>
>>107539046
I prompted it to do img3 as the area, the style as 4. Probably went hard on some ghibli style. The women has their characteristics, just defaulted to asian because the style.

>>107539071
Yeah, just usual llm stuff, but it can be fed multiple images. Prompt Manager iirc. Check previous thread for a link.
>>
>>107539116
>right after you said this my wan gen text input suddenly changed to my full name, address, social security number, and bank information
wut duh
>>
>>107538760
Maybe it's superficial, but I like how she dresses.
>>
Anyone managed to run the Newbie model yet?
>>
>>107539136
>>107538760
Image
>>
>>107539140
Not yet,
>>
File: 100957_00001.mp4 (2.62 MB, 1280x720)
2.62 MB
2.62 MB MP4
anyway time for 1rat animated
>>
>>107539089
>>107539105
>>107539122
the sad reality is someone like this exists. they are larping as a reddit mod on 4chan. sad she bakes all the threads to ensure her schizo manifestos taint the OP
>>
>>107539140
It is a very undertrained alpha model. Not sure why you would bother unless very bored with a lot of time to kill.
I guess you can follow the instructions on their hf repo if you can't wait until comfy support.
>>
File: 1567195715489.png (380 KB, 642x803)
380 KB
380 KB PNG
When you are feeding a VL an image for description is it better to scale it down? Does size matter for direct computer vision?
>>
>>107539128
>my wan gen text input suddenly changed to my full name, address, social security number, and bank information
gen it and see what happens
>>
love imagegen but videogen shit like wan makes you go "damn, we might be fucked when this gets better lmao"
>>
>>107539180
wut duh
>>
>>107539136
>>107539144
>Maybe it's superficial
can it ever be anything else?
>>
>>107539178
the bigger it is, the more tokens it's gonna eat, you'll be fine if you resize all your images at 1 megapixel max
https://github.com/BigStationW/ComfyUi-Scale-Image-to-Total-Pixels-Advanced
>>
>>107539190
Whoa...that is literally me...it even sounds like me. How are you doing that?
>>
File: 1757507646637341.png (2.3 MB, 1536x1536)
2.3 MB
2.3 MB PNG
isn't it neat how you can pick a lora for any outfit and get that result on any character, no wonder patreon artists are mad.
>>
>>107539185
>might
will
Enjoy seeing video of a crime you never committed admitted as evidence in court.
>>
>>107538975
Yeah better quality, if I want better quality I go to /edg/ or /hdg/.
Though all things considered I think the "anime" general does a decent job giving newbies their first steps until those newbies level up and move on to generals with more dedicated anons.
>>
>>107539227
if someone went through the trouble of training a lora of you to implicate you, then you might have bigger problems than genning 1girl
>>
>>107539209
Nice, WAI 15?
>>
File: ComfyUI_temp_mpscm_00008_.png (3.61 MB, 1344x1856)
3.61 MB
3.61 MB PNG
>>
>>107539230
/adt/ is just sfw /edg/ and /hdg/. most poster are in both those threads
>>
>>107539227
yeah, the most obvious one that'll play out. good luck to us all in the future
>>
File: ComfyUI_00231_.png (1.66 MB, 1280x1280)
1.66 MB
1.66 MB PNG
>tfw prompt enhancer can't make a proper description for GigaMigu
>>
>>107538760
AI
>>
File: 1761364156651416.png (2.55 MB, 1536x1536)
2.55 MB
2.55 MB PNG
>>107539209
choose your character:
>>
>>107539089
HAHAHAHAHAHA WHAT A FUCKING TRANNY LOSER!
>>
>>107539236
I would expect video edit models where you can edit a person into a video from their image will arrive eventually.
>>
>>107539238
yes, wai15 is consistently good and knows lots of characters/concepts even without loras (but you can use those too). lineart/colors are consistently good too.
>>
>>107539236
i don't doubt it will get so good you don't even need to train anymore
>>
>>107539252
Her multiple eye sockets are ruining it for me.
>>
>best way I can describe using comfyui moving forward is like sitting on a 12" dildo leaving it in and saying "fuck it, I'm gay now" instead of pulling it out with some dignity and saying "what's the next steps"
>>
>>107539140
Yes but I had to change a line of code in comfy because some conditioning to the model wasn't being set up right and it was throwing an error.

Honestly, don't bother. The model isn't good. In terms of raw knowledge, it is slightly better than NetaYume, but still significantly behind Illustrious and Noob. In terms of style, it looks like shit from an ass. No amount of quality or artist tags help it. It also is very noisy, messy, "fried", I don't know how to call it. Half the time you try to make an image with a simple white background, it collapses and gives you a solid white image. They fucked something up with the training.

Caveat: this is all using normal tag list and natural language prompting. I refuse to do the XML autism (and so will everyone else).
>>
>>107539242
Maybe one or two of them crosspost one every 5 days, but you will notice the abysmal quality difference when you go from /adt/ to other anime centered generals.

Though I repeat /adt/ does an excellent job being the entry thread for newfags. Because people who are starting out need tech centered help as well as help at the basic diffusion level, which is what /adt/ knows how to provide.
>>
>>107539252
Both
>>
>>107539285
>Honestly, don't bother.
anybody with working eyeballs made that choice just from looking at the samples and xml prompting
>>
File: ComfyUI_temp_uhkpp_00001_.png (2.86 MB, 1417x1080)
2.86 MB
2.86 MB PNG
ty prompt gen anon, this is kinda neat
>>
wainsfw illustrious and wai ntr mix look exactly the same to me, what the fuck is the difference between them, they just look like anime
>>
>>107539304
The image on the right is really incoherent if you look at it for more than two seconds
>>
generating prompts takes the entire fun out of genning, it's very soulless
>>
>>107539321
Ntrmix has a legacy
>>
>>107539304
it's impressive how close to reality Z-image turbo is desu
>>
>>107539330
what the fuck does that mean?
>>
File: ComfyUI_temp_uhkpp_00005_.png (3.87 MB, 1744x1216)
3.87 MB
3.87 MB PNG
>>107539324
no i truly wanted her to have the shower curtain as a dress and noticed nothing wrong

>>107539333
that we can do this shit for free* at home is nuts
>>
>>107539252
Moar
>>
>>107539066
Page status?
>>
AI toolkit is supposed to download the required files and models automatically right? You just select the model you want to train a lora for, and no further humiliation rituals required right?
>>
File: 1746421001364807.png (2.45 MB, 1536x1536)
2.45 MB
2.45 MB PNG
fun in the sun.

kek I got a chibi frieren, cute!
>>
File: ComfyUI_temp_uhkpp_00008_.jpg (659 KB, 2840x1040)
659 KB
659 KB JPG
>>107539379
yup
>>
>>107539384
Very impressive to be WAN
>>
>>107539398
so what do if nothing happens, even after reinstall it's just stuck starting the job and doing nothing, 0 byte/s
>>
File: flower1.mp4 (1.7 MB, 816x1080)
1.7 MB
1.7 MB MP4
>>107537295
>>
File: push_00001_.mp4 (1.48 MB, 1080x1080)
1.48 MB
1.48 MB MP4
>>107538449
>>
>>107539304
>right image
it did not follow the prompt, did it?
>>
File: 1747096679118211.png (2.73 MB, 1536x1536)
2.73 MB
2.73 MB PNG
>>107539384
>>
>>107539520
Nice
>>
Out of the game for quite some time
Which forge variant is currently the best? Bonuspoints for working on AMD and linus
>>
>>107539550
>use comfy
>pull new update
>don't have to worry about this reinvention of the wheel bs
>>
File: 00012-2204115164.jpg (407 KB, 1440x2560)
407 KB
407 KB JPG
>>107539550
classic for xl, debatably neo for everything else. amd will always get you laughed at but i'm 99% sure all forges always supported amd and have their own separate instructions.
>>
>>107539569
>don't have to worry about this reinvention of the wheel bs
but that's what workflow do all the time. it's actually worse
>>
File: 1741824357039403.png (2.68 MB, 1536x1536)
2.68 MB
2.68 MB PNG
>>107539520
>>
>>107539569
>use comfy
>pull new update
>DRAGGED AND SHOT
>>
interesting. seems like cumfart is using bot farms with throwaway sim cards in china to inflate the GitHub stars and social media likes. hilarious considering none of these dumb fake tricks were used with auto and it still has more stars
>>
>the stop button is back on the gen bar
yay.
>>
>>107539619
>the jeets are still around
uh oh
>>
>>107539623
what jeets, I did a comfy update. I dont have a computer from 1980, how could I be indian.
>>
>>107539599
I feel fortunate the most dedicated idiots that have issues with me are alcoholic low functioning losers that have been exposed a lowcows that can't even match the average /ldg/ poster in skill.
They also like to larp pretending to be girls with other men and can't even read filenames to realize who's posting what
>>
File: 1736951131220310.png (2.14 MB, 3202x1422)
2.14 MB
2.14 MB PNG
>>107539619
>>the stop button is back on the gen bar
and Z-image base can do edit and will be released soon, the dark days will be over
https://github.com/Tongyi-MAI/Z-Image-blog/commit/e67bafb673fa19d301f903ac62de26c48b4cc1c4
>>
>>107539600
Either those are backwards or both AI

Just look at that deformed car
>>
>deformed car
You have to be 18
>>
>>107539642
you're right I didn't even verify it was from an AI site, my b
>>
>>107539640
"soon" could be months or never, but we'll see
>>
File: ComfyUI_00188_.mp4 (970 KB, 488x488)
970 KB
970 KB MP4
>>107539588
>>
>>107539638
hello tRan
>>
File: Comparison.jpg (834 KB, 2160x1200)
834 KB
834 KB JPG
>>
>>107539640
>the dark days will be over
cumfart will always bring darkness. we need something new. tired of being lied to btly the grift chink and cumfart insults researchers that know more than him
>>
File: ComfyUI_00193_.mp4 (578 KB, 488x488)
578 KB
578 KB MP4
>>
File: ComfyUI_00265_.png (1.13 MB, 1280x1280)
1.13 MB
1.13 MB PNG
>>
>>107539755
kill it
>>
>>107539721
prefer the big bottom lady. also chroma might do a good img 2 img for this
>>
>>107539485
i like how she jungles her bobs saar
>>
>>107539759
feel free to make a chroma gen with this prompt so that we can see if it's closer
https://files.catbox.moe/3w663d.txt
>>
>>107539721
zimage is so much better.. thank god for llms and diffusion
>>
File: 1755208089826413.png (544 KB, 1494x1185)
544 KB
544 KB PNG
https://xcancel.com/bdsqlsz/status/1999878087832633354#m
>There is bad news and good news.
>The bad news is that the Omni-Base model may not have higher portrait realism like the Turbo.
>The good news is that the Base model is a rough stone that has not yet been carved out and we can fine-tune it into anime variants more easily.
I think that was expected, it's a base model after all
>>
File: 666666666666.png (1.66 MB, 1376x768)
1.66 MB
1.66 MB PNG
>>107539304
>>
>>107539114
whats that?
>>
>>107539778
I'll believe it when I'm holding the base in my hands
>>
>>107538312
>patchworking tits and vagene in
It's actually a good technique when you want a specific boob shape. it's really hard to prompt for a slim body with massive tits or a fat body with tiny tits.

And for some reason boob tags massively affect the way a face will look..
>>
>>107539781
>whats that?
>>107537803
https://github.com/BigStationW/ComfyUI-Prompt-Manager
>>
>>107539778
Its been said since day 1. will be fun to see all the slopmerges/"tunes" that make it look ten times worse though kek
it really needs to be beaten into people's heads that the only reason z-image is as good as it is, is the LACK OF SYNTHETIC DATA TRAINING.
>>107539785
base, will hold you in his hands.
>>
>>107539778
it means that he was able to test out the final version, release SOON incomming
>>107539794
>the only reason z-image is as good as it is, is the LACK OF SYNTHETIC DATA TRAINING.
truth nuke
>>
File: goodmorningsaar.mp4 (789 KB, 480x842)
789 KB
789 KB MP4
>>107539637
>what jeets
>>
inb4 32b 24GB base model
>>
>>107539778
I wonder why alibaba didn't give a finetune with the base? after all we got a finetune + distillation
>>
is IP-adapter snake oil? I just want it to keep a face consistent between gens.
>>
>>107539807
you have to understand saar, text is really ressource intensive, more than Ray Tracing, trust the izzat
>>
>>107539807
yeah, fuck cumfart forever and ever. drugged and shooted
>>
>>107539778
base might not look as good as turbo, but I have suspicious that Edit will make great images, even without images input, they know how to finetune their model well and Edit will be such thing
>>
File: ComfyUI_00332_.mp4 (812 KB, 640x832)
812 KB
812 KB MP4
>>
https://www.reddit.com/r/StableDiffusion/comments/1pllpaf/the_upcoming_zimage_base_will_be_a_unified_model/
>"Hum guys, Alibaba made a commit where they added one <div> to a html page"
>544 likes
>97 comments
kek, I think it's fair to say Z-image turbo is the most hyped model of all time, not even Flux or Wan had this much impact
>>
File: ComfyUI_00194_.mp4 (417 KB, 416x576)
417 KB
417 KB MP4
>>
File: 00046-1936221472.jpg (354 KB, 1440x2560)
354 KB
354 KB JPG
>>107539864
is this how the internet reacted when 1.5/XL came out? I joined in after the initial hype trains of 1.5/XL.
>>
>>107539778
>omni base
>rough stone not carved
I WANT TO BELIEVE!
>>
>>107539878
1.x was busting a nut for the first time. sdxl was learning about sex toys. it's more like the xl hype
>>
File: push_00002_.mp4 (1.44 MB, 1080x1080)
1.44 MB
1.44 MB MP4
>>107539768
Good taste
>>
File: 1751455173592443.png (309 KB, 653x565)
309 KB
309 KB PNG
>>107539778
>he still hasn't shown a single image from Base
it will look like absolute shit right?
>>
>>107539778
now the question is, do we have what it takes to finetune Base so that it reaches Turbo's level?
>>
File: 1761714909838333.png (32 KB, 417x231)
32 KB
32 KB PNG
ok its actually happening properly now. training twinflow into SD1.5 with 200k images. the twinflow paper did 19 epochs but that's like 5 days so I'll try with just 4 first and see if it works at all

after that it's looking into how to make it a lora like how lightx2v is usually used as a lora
>>
File: ComfyUI_00208_.mp4 (377 KB, 488x488)
377 KB
377 KB MP4
>>
>>107539807
Speaking of shitty UI, am I the only one who gets insane slowdown when typing text in cumfart?
I type text somewhere else and just copy paste at this point.
>>
File: nxyz_20251312_134308_01.png (1.68 MB, 1200x1400)
1.68 MB
1.68 MB PNG
Das crazy, mane
>>
File: ComfyUI_00333_.mp4 (1.36 MB, 720x1280)
1.36 MB
1.36 MB MP4
>>
>>107539937
Good luck twinflow anon.
>>
>>107539778
Isn't that how base models are in general, bad for genning but good for finetuning and training?
>>
>>107539252
>>107539703
>>107539384
Booba
>>
>>107539778
I will take it.
Just let us easily make a coom tune.
>>107539937
AI likely hallucinated some important detail(s) and very decent chance you will just waste time due to that.
You can't rely on an LLM for shit like this.
Never trained SD 1.5 but I am interested in taking a look at what it spat out with that yaml.
>>
File: trust the furry.png (223 KB, 400x400)
223 KB
223 KB PNG
>>107539927
>now the question is, do we have what it takes to finetune Base so that it reaches Turbo's level?
he can do it
>>
>>107539943
bro it's what the video is showing. nodes 2.0 is twice as slow
>>
>>107539808
>inb4 32b 24GB base model
nah the paper said their foundational model (base) was 6b
>>
File: Z-image turbo.png (1.51 MB, 1280x720)
1.51 MB
1.51 MB PNG
>>
>>107540008
He will just make another broken model that has insane amount of shit crammed into it, but too schizo and unstable to be actually useful.
BigASP guy will deliver it.
>>
File: img_00068_.jpg (876 KB, 1680x1336)
876 KB
876 KB JPG
>>107539878
I remember we tried to gen nudes by prompting women surrounded by mirrors and got some horrible body-horror which lead the devs coming here and begging us to stop doing it
>>
let me get this straight
did d*** trigger some schizo? is that whats going on?
>>
>>107540018
>nodes 2.0
I forgot they were doubling down on this cancer
>>
>>107540048
>which lead the devs coming here and begging us to stop doing it

no fuckin way really? i hope someone has screencaps, thats hilarious.
>>
File: z-image_00874_.png (2.4 MB, 2048x1152)
2.4 MB
2.4 MB PNG
>>
File: bestgirl.png (2.31 MB, 1080x1920)
2.31 MB
2.31 MB PNG
>>107539060
>>
>>
>>107540074
It looked like that, but I have no actual proof since it's this place.
>>
>>107540098
Feeta
>>107540091
Booba
>>
>>107540114
pittu next pls
>>
Also I am not sure how we missed this as well but z-image training script has been already merged into musubi-tuner... PR'd by the dev himself.
https://github.com/kohya-ss/musubi-tuner/pull/778
It's coming SOON. Like next few days soon.
>>
File: z-image_00011_.png (1.66 MB, 1008x1408)
1.66 MB
1.66 MB PNG
>ava addams body lora released
holy booba anon
>>
>>
File: z-image_00878_.png (2.69 MB, 2048x1152)
2.69 MB
2.69 MB PNG
>>
This >>107540098 (feet) is more sexo than this >>107539252 (boobs), flat or big
>>
>>107540146
>https://github.com/kohya-ss/musubi-tuner/pull/778
wait, it's the training script for z-image base? LETS GOOOOOO
>>
>>107540146
DOOMPOSTERS GET FUCKING RAPED
THE TIME HAS CUM LADS
>>
File: 1734993863821806.png (82 KB, 1252x361)
82 KB
82 KB PNG
>>107540146
>>107540178
WERE SO FUCKING BACK
>>
I can't believe they've decided to publish Base on civit first
https://civitai.com/models/2212509
>>
>too many people checking the pr all at once
>>
>>107540006
>You can't rely on an LLM for shit like this.
SD1.5 is simple enough where Sonnet 4.5 100% understands it at this point, and Twinflow is simple enough where this can be implemented without hallucination. I am also following the original paper as close as possible

Here's the yaml
https://files.catbox.moe/v733hb.yaml

Here's the slop
https://rentry.org/uwsw9hwo

Here's the Twinflow paper for reference
https://arxiv.org/abs/2512.05150


and whatever mane, im not horny right now so why not try something out. if twinflow wasnt so simple and claimed to be better than lightx2v I'd probably just go play vidya
>>
File: Get fucked doomers.png (1.31 MB, 1280x720)
1.31 MB
1.31 MB PNG
>>107540146
>but muh chinese cultu-ACK
I told ya, nothing ever happened
>>
When SD1.5 came out, a group called unstablediffusion raised about $80k on Kickstarter in a few days for a 1.5 porn finetune. However, it was blocked by the Kickstarter operators after public criticism.
At that time, the community size was certainly only 1% of what it is today.

I wish we could get something big going for z base.
>>
pretty sure that's not one of the z-image devs thoughbeithowever, that's the insider fella from twitter
https://x.com/bdsqlsz/status/1999878087832633354
either way, still confirms its 100% asap because he's been able to train for a while.
>>
>>107540146
wait that's the chink from twitter, so he's working at Alibaba after all?
>>
>>107540219
>unstablediffusion
To be fair their arch changes were retarded
>>
File: ComfyUI_00043_.png (2.1 MB, 1440x1000)
2.1 MB
2.1 MB PNG
https://huggingface.co/Owen777/UltraFlux-v1/blob/main/vae/diffusion_pytorch_model.safetensors
>>
File: oops.png (149 KB, 708x800)
149 KB
149 KB PNG
>>107540146
>N-no you don't get it, in chinese culture that means... uh... uh... AIIE SOMEONE HELP ME
>>
can zit do realistic fantasy girls like orcs or cat girls? a lot of realistic models go into "fake mode" when you prompt for fantasy stuff
>>
>>107540146
>it's real
Oh man, the chinks will really save us, I wanna cry ;-;
>>
>>107540146
Can't believe such an obvious clue has not been discovered for more than a day, that's quite impressive desu lmao
>>
>>107540290
probably because everyone here uses onetrainer and ai toolkit kek

and no one here stalks a rando insider's socials for hints.
>>
File: that's right.png (319 KB, 463x462)
319 KB
319 KB PNG
>>107540146
I'm sure we'll get this shit in less than a week now
>>
File: stench.gif (4 MB, 354x550)
4 MB
4 MB GIF
>>
>>107540206
I mean sure, feel free to spend your time however you please. I just think that's prone to failure. AI knows about SD 1.5, but there isn't enough good quality data about finetuning/distilling it to prevent AI from hallucinations
Some suggestions:
Probably want bf16, I am skeptical that constant with warmup is the most optimal lr scheduler here though I don't know enough to suggest anything else, allow tf32 can be double edged sword in terms of performance, increase save total, or disable it, who knows maybe it converges early and gets fried.
Good luck though
>>
>>107540293
I actually use musubi but have been waiting for base release.
Just checked the guy's xitter and saw it.
>>
>>107540165
Nice
>>
File: push_00003_.mp4 (1.67 MB, 1080x1080)
1.67 MB
1.67 MB MP4
wan2.2 was a mistake

>filthy beggar of utterly neglected appearance in dirty worn-down cloths enters the scene while girl welcomes him
>>
>>107540330
Thanks
>>
>>107540293
>>107540323
what was the tweet that was referencing that PR?
>>
File: kohya balls.png (97 KB, 626x494)
97 KB
97 KB PNG
>>107540323
interesting, i skimmed over that shit this morning when checking his twitter. not sure how i did that.

either way, i imagine he'd only share this because they're about to release base.
>40gb of vram
lmao
>>
Chinese culture status?
>>
File: ZiMG_01397_.jpg (532 KB, 1728x1344)
532 KB
532 KB JPG
>>
File: 636363.png (1.91 MB, 1376x768)
1.91 MB
1.91 MB PNG
>>107540306
>>
>>107540146
>https://github.com/kohya-ss/musubi-tuner/pull/778
it says z-image though? it doesn't specify it's working for base right? maybe it's just a script only for turbo?
>>
File: NEVER DOUBT CHINA.png (431 KB, 800x582)
431 KB
431 KB PNG
>>107540352
kek
>>107540345
>Chinese culture status?
buck broken
>>
>>107540344
40gb VRAM for full fledged finetuning requirement isn't really that bad.
>>107540342
Here >>107540344
>>107540355
Here >>107540190
>>
File: ZiMG_01403_.jpg (406 KB, 1728x1344)
406 KB
406 KB JPG
>>
>>107540376
to be fair, i'm eternally VRAM buckbroken and don't even have a finetuning requirements frame of reference. How high is flux finetuning?
>>
File: 66666666.png (1.59 MB, 1376x768)
1.59 MB
1.59 MB PNG
>>107540364
do i have potential?
>>
>>107540146
>soon
yeah see you in 2 years when wan 2.5 launches
>>
>>107540382
I don't recall it.
I can give you another frame of reference though.
BigASP guy tried to run a test finetune experiment on Wan 2.2 5B and he couldn't fit it into 96GB of VRAM.
So yeah.
>>
>>107540240
oh ok BDS is a contractor at Tongyi Lab it seems. (tylab.ai), explains why he knows both about WAN and Z image now in particular and is invited to modelscope conferences etc

>>107540315
>I just think that's prone to failure.
It one-shot Q8_0 GGUF quantization of Infinity-2B which is a lot more difficult than figuring out how to add a second timestep parameter to SD1.5's UNet. It's like 20 lines of code to do that. Dude I totally understand your concerns but again if it wasn't this simple to try then I wouldn't have

>Probably want bf16
...you're right, but I just want to see something even slightly coherent at 4 steps. This is all theoretical. I'm not spending the time trying to get a production-grade twinflow model out of a single 16gb card
>>
>>
>>107540396
>oh ok BDS is a contractor at Tongyi Lab it seems. (tylab.ai), explains why he knows both about WAN and Z image now in particular and is invited to modelscope conferences etc
I thought he was a journalist but it seems he's more than that, desu it's kinda surprising Alibaba let him post the training code PR lol
>>
File: 6161616.png (1.57 MB, 1376x768)
1.57 MB
1.57 MB PNG
>>107540364
>>
>>107540395
that doesn't sound right
>>
>>107540352
>>107540404
this is cool, do you do that with QiE?
>>
>>107540403
>kinda surprising Alibaba let him post the training code PR lol
not surprising if we assume they're about to release the model. they've kept him on an NDA leash regarding ANY details on base and other unreleased models. training i imagine is offlimits unless they gave him the OK.
>>
>>
File: godswill.png (1.41 MB, 1024x1024)
1.41 MB
1.41 MB PNG
>>
>>107540403
>I thought he was a journalist
No he writes code and trains loras and releases papers, has a HF account
https://huggingface.co/bdsqlsz

He made a detailed eyes lora for Z image turbo
https://huggingface.co/bdsqlsz/qinglong_DetailedEyes_Z-Image
which was then used by a dev on the WAN team in a huggingface space
https://huggingface.co/spaces/linoyts/Z-Image-Enhance
https://huggingface.co/linoyts
which is really random, which makes me think that this plus the tylab.ai link in his bio means he is meaningfully involved with Tongyi Lab. I also think that explains why he knew so much about Tongyi models specifically, and had less knowledge on other models (he probably heard stuff in those modelscope conferences and just through the grapevine in general)

his HF says hes on the Stepfun team too, not sure what Stepfun's relationship to Ali/Tongyi is though
which is really random,
>>
File: 6666666.png (1.25 MB, 1710x703)
1.25 MB
1.25 MB PNG
>>107540416
Its just gemini pro in comfyui
>>
File: long live china.png (1.09 MB, 832x1216)
1.09 MB
1.09 MB PNG
>>107540413
I am just forwarding what he said.
https://civitai.com/articles/22656/bigasp-30-progress-update-and-26
DM and ask him if you want, I dunno.
>>
File: 628714.jpg (27 KB, 737x573)
27 KB
27 KB JPG
>>107540355
Retards. It's just a placeholder because you should technically train on base.
>>
>>107540443
>Anon's anonymous chamber is on pepe's butt
oh man you discovered my place :(
>>
File: hk69.png (2.07 MB, 1080x1920)
2.07 MB
2.07 MB PNG
>>
>>
File: godswill2.png (1.49 MB, 1024x1024)
1.49 MB
1.49 MB PNG
>>107540426
>>
So can I just copy the training lora into musubi and use that or..?
>>
File: zimg_0090.png (2.17 MB, 1080x1440)
2.17 MB
2.17 MB PNG
deleted all my zimg turbo loras to prepare
>>
>>107540443
>API node
gross. don't pay cumfart for ruining the ui
>>
>>107540201
>filename
forced meme but keked nonetheless
>>
File: WE WON.png (1.43 MB, 1280x720)
1.43 MB
1.43 MB PNG
>>107540146
MOUAHAHAHAHAH
>>
>>107540429
>which was then used by a dev on the WAN team in a huggingface space
Linoy works for HF, she's ML for Art team and Diffusers
>>
BASE BROS
WE WON!!!!!!!!!
>>
>>107540493
prompt?
>>
File: l_.jpg (758 KB, 1376x1531)
758 KB
758 KB JPG
>>107540496
just testing some things, ill be back once my 10 dollars runs out
>>
>>
File: zimg_0063.png (1.8 MB, 1080x1440)
1.8 MB
1.8 MB PNG
>>107539495
>>
>>107540570
her hand is up there but the curtain is being held down there and she's well outside the tub
>>
>>107540240
who the hell is this nobody
>>
>>107540582
some guy that clings to relevance by going to AI conventions and parroting in very bad english
>>
>>107540582
>nobody
>>107540429
>>
>>107539240
what model is that?
>>
>>
>>107540582
this "nobody" was the chosen guy by Alibaba to release the training script of base lul
>>
>>107540594
The right wing is detached but cool.
>>
if the next bake has the tRan rentry I'm rebaking with the tRan greentext
>>
>>107540614
Thanks :)
>>
>>107540635
I am trying to bake without it.
Let me put together the collage first though.
>>
>>107540643
nice galko
>>
>>107540570
that's better. She is indeed wrapped in the curtain

Let's see
https://files.catbox.moe/d69piq.mp4
>>
How do you get anything above C cup (and natural) in Zimage?
>>
>>
>>107540635
>>107540644
neither of these posts have ever baked or will ever bake
>>
>>107540665
By accepting god's perfect breast size.
>>
>>107540665
prompt sugoi dekai
>>
>>107538584
What happened to him? Stroke? He only posted once..
>>
File: 113696833.jpg (465 KB, 3040x2080)
465 KB
465 KB JPG
>>
>>107540644
I baked

>>107540693
>>107540693
>>107540693
>>
>>107540701
>putting the BBC spiderman shitposter in the collage
kys
>>
File: ldg bake 2025 dec 14.jpg (3.3 MB, 3260x5340)
3.3 MB
3.3 MB JPG
>>107540721
Also API gen too.
At least ran's schizo babble is gone I suppose.
>>
>>107540677
I have actually baked before though
>>
waiting for the real bake
>>
>>107540659
>that's better
are you stupid?
>>
>>107540764
The choice of images for the OP sucks but I don't think it is worth splitting the thread over.
>>
>>107540794
he left out the ani rentry at the bottom, so he's probably ani
>>
Fuck off ran.
>>
>>107540808
you're right, but i dont thinks its still reason enough to split, I wish trani would just fuck off already
>>
>>107540764
>>107540808
>>107540876
the ran greentext I have saved? I'll add it to the next bake.
>>
>>107539859
nice
>>
>>107538886
I still prefer 14. 11 was overtuned for hags. 13 had really muted colors/lighting and results that differed the most from other versions. 14 felt like it rebased on v9, which is a good thing. 15 got worse again, though not as bad as 13.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.