/g/ - Technology
Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107526185

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>>107529321
I feel like she looked much better in her early days. Don't you agree? I'm not a fan of the pictures you used.
>>
>>107529397
a pajeet made this collage
souless
>>
File: go and get the model.png (1.99 MB, 1280x832)
>>
>>107529425
nice style
>>
File: zit_00006_.png (1.2 MB, 864x1152)
>>
File: zimg_0147.png (1.79 MB, 1080x1440)
>>107529424
i'm probably never going to use it again, i just went to a porn site sorted by popular and grabbed photos to do a test
>>
File: ZiMG_01319_.jpg (670 KB, 1728x1344)
>>
File: ZiMG_01331_.png (3.46 MB, 1344x1728)
>>107529474
>>
>>107529429
thanks, guess ill give the training run a shot then, with no tags this time.
>>
>>107529472
aaaah anjelika such a one of a kind creature.
>>
>>107529450
too jealous of ani to have taste
>>
question, why 640x640? figured there was no way you could train a lora that low res and have it work out well, but yours trained pretty good

>>107529531
>>
File: 00167-504378373.png (1.18 MB, 896x1152)
>>107529471
>>
What a nice thread.
>>
>>107529548
it had some of these even sooner, but also some of the bunny hairclips still are interpreted as hearts.

>>107529467
just zit and one attempt at the pippa training data that was in the last thread
>>
>>107529397
Can I Z Image Turbo in forge?
>>
i'm gonna pull (musubi)
need toolkit gone, though I doubt m'soobz will perform any differently
>>
>>107529795
forge classic has support
>>
>>107529795
>>107529809
forge classic neo branch... don't be retarded like me and try to figure out why my shit's not right for hours because i didn't get the neo branch
>>
File: 071517_00002.mp4 (3.47 MB, 1536x1536)
>>107529806
>he pulled?

>>107529834
neo is AIDS, classic itself does support z-image, but its focused on sdxl based models.

neo has awful memory management issues, itll fail to load models and when you frustratingly keep clicking start, itll keep re-loading the model and taking up more and more ram. Its fucking brutal.
>>
>>107529223
Too many steps
>>
>new cumfartorg announcement
>how to 3x3 grid for ads
sickening
>>
File: 00168-3912627219.png (995 KB, 896x1152)
>>
>>107529809
>>107529834
>There is now 5 versions of forge
I think it's time I stop being a retard and learn how to use comfy...
>>
>>107529930
comfy sucks fucking donkey dick for anything that isn't wan or maybe qwen edit
i use forge classic for sdxl models and comfy for wan/qweedit
most of those forge forks are fucking awful, including that memleak issue i mentioned.
shit comfy never bluescreened my pc kek
>>
File: file.png (142 KB, 640x477)
>>107529930
>>
>>107529949
Governments don't like 'cults'. They want to be the only cult around.
>>
>>107529744
>it had some of these even sooner, but also some of the bunny hairclips still are interpreted as hearts.
It does't have to be 1:1. The hyperspecific accessories are just gay to gen.
>>
i dont think i can train z-image on a 3060
>>
>>107530012
Train yourself on some new skills and get a job.
>>
>>107530012
Train z-image anyway and don't get a job
>>
>>107530012
12GB should work
>>
File: pippa.png (555 KB, 1024x1024)
>>107529999
waifu accessories are srs business, but basically this also indicates to me it needs more training or perhaps different settings

if the accessories need to be manipulated it's IMO better to just caption them in the dataset and maybe change the dataset so it can be (not) prompted later

>>107530012
actually you could probably train loras with that regardless?
>>
i found this https://github.com/ostris/ai-toolkit/issues/550
>>
File: ComfyUI_02670_.png (1.38 MB, 784x1440)
>>107529949
i used forge up until a month ago. forge is generally faster, you're able to see the gens a lot easier as they're happening, and it's just a lot easier to use. Comfy's layout is garbage. the Node design sucks, and you have to go through a bunch of different ones to find the one you like, or just accept one you don't. it's difficult to get shit just right, and to change even minute settings, and the usability compared to forge is shit. don't let anyone convince you otherwise.

with all that said, it is the only one that can do Z-Image, and its compatibility with new shit is unmatched. that's literally all it has going for it. if it wasn't for that, this shit wouldn't be installed on my computer at all.
>>
>>107530175
>4bit training
lol
>>
File: 1657133249553.png (81 KB, 260x283)
>>107530175
>4bit
>>
SamplerCustomAdvanced doesn't preview anymore on the new version of comfyui...
how do i fix?
>>
I just had an insane revelation. If you download a bunch of 1girl pictures you like and train a lora on them, any model suddenly produces pictures you like more. Crazy.
>>
>>107530241
>how do i fix?
drag and shot cumfart
>>
>>107530241
https://github.com/comfyanonymous/ComfyUI?tab=readme-ov-file#how-to-show-high-quality-previews

hopefully manager will fix it
>>
>default training settings like that anon said are actually working
>my lora 800 steps in ISN'T schizophrenic for once
holy
>>
Comfy should be dragged out on the street and shot
>>
>>107529844
Neo works fine on my machine. Are you retarded and didn't configure your .bat file? Did you do a stupid and set pin shared memory on a card with less than 16gb vram, despite the guy warning you multiple times to NOT set that option for cards with less than 16gb vram?
>>
>>107530358
this was an issue with forge in general until it was fixed, the idea it got un-fixed is not exactly hard to believe.
and of course i'm on 16gb of vram.
>>
>>107530372
I've never once had an OOM or ram issue with neo, and I use a 3060 12gb. The only issue I had was forge couple fucking up the neo install... Despite being made by the same fucking guy.
>>
File: bonggirl.png (24 KB, 208x254)
>>107530384
>The only issue I had was forge couple fucking up the neo install... Despite being made by the same fucking guy.

PFFT and you accused me of having the skill issue.
>>
>>107530425
It's not a skill issue when you know exactly what is breaking your install.
>install forge couple from menu
>neo immediately shits itself over a CUDA issue and insta-crashes
>install via direct git link
>neo immediately shits itself over a CUDA issue and insta-crashes
It's an issue specifically with couple and the latest version of Neo. It's not an issue that can be self-solved by not following the readme instructions.
>>
File: ldg.mp4 (1.28 MB, 752x960)
>>
File: Pathetic.png (975 KB, 1024x1024)
>>107530465
Animate this.
>>
>>107530161
this looks very good
>>
>>107530581
grok.com/imagine
>>
>>107530465
Nice vagina.
>>
Is julien still there? I have a question.
>>
1200 steps, its close, so fucking close...
>>
i cant stop genning girls with large feet and penises
>>
File: ComfyUI_temp_jdhkq_00003_.png (2.56 MB, 1272x2160)
>>
File: 1745865606630574.png (1.11 MB, 3325x1534)
pretty kewl
>>
>>107530854
Nice. now gen an image of miku and parappa on stage rappin' together.
>>
plz God no miku troon. spare us from your "tests"
>>
File: 1765473492940427.jpg (52 KB, 600x367)
>>
>>107530891
>6 minutes doing something by hand
>do that tens of thousands of time per year
I'm pretty sure in the long run it's longer than 6 hours
>>
>>107529472
Hey anon, in the last thread I think you mentioned you used the undistilled version that aitoolkit downloads in your training. What made you do that? Have you tried training with the adapter models instead? I ask because I've done some tests with both and it seemed like the undistilled model produced inferior results with the same data set (though there might be some variable I forgot to control for).
>>
>>107530891
>>107530908
this is high Chinese culture
>>
>>107530913
many are saying this. that and the v2 adapter is worse than the v1.
>>
File: ComfyUI_temp_vtsfk_00019_.png (2.9 MB, 1088x1856)
>>
File: 1755865544501750.png (1.49 MB, 2672x1548)
>>107530861
lul
>>
File: ComfyUI_temp_vtsfk_00023_.png (2.56 MB, 1088x1856)
>>
>>107530943
well, it had the right idea, it just didn't know what the fuck it was making i guess kek
>>
>>107530950
>>107530943
>>107530937
You are using a world class tool but the only thing that comes to your mind is a goddamn teal haired vocaloid and spamming the same shit over and over again. You don't deserve these tools.
>>
>>107530960
https://www.youtube.com/shorts/hTXBupA3k1o
>>
>>107530943
kick punch block!
>>
>>107530984
ITS ALL IN THE MIND!
>>
>>107530842
>no blue nails
It's over
>>
>>107531017
her nails are either purple or not even visible most of the time in my dataset
intentional, because sdxl model. its trained off a style accurate lora for noobai.
>>
File: ComfyUI_temp_vtsfk_00037_.png (2.56 MB, 1600x1280)
>>107530960
sucks to be you right now
>>
File: ZImg_00024_.png (2.09 MB, 1440x1152)
overdid the contrast a tad
musubi works great btw, I just imported my settings from qwen and it's gold
>>
File: z-image_00809_.png (2.59 MB, 2048x1152)
>>
File: ComfyUI_temp_vtsfk_00039_.png (2.7 MB, 1088x1792)
I have no idea how this lora turned out so good, it even gave it more details instead of killing the model
>>
what does it mean?
>>
>>107531086
sigma ba- wait, you were setting me up weren't you
>>
File: 13-1024x498.jpg (56 KB, 1024x498)
STOP USING UPSCALERS

they make images look like glossy cell-shaded molestations of the original images. STOP FUCKING USING THEM
>>
File: ComfyUI_temp_vtsfk_00045_.png (2.89 MB, 1088x1792)
>>
>>107530581
https://files.catbox.moe/5fb6ao.mp4
>>
>>107530960
U mad?
>>
File: ComfyUI_temp_vtsfk_00046_.png (2.76 MB, 1088x1792)
>>
>>107530960
I bet he's having more fun than you
>>
>>107531109
lmaooo
>>
>>107531086
>σ + σ_up-FAGS BLOWN THE FUCK OUT
LMAO
>>
File: ComfyUI_temp_vtsfk_00048_.png (2.44 MB, 1088x1792)
>>
>>107531102
>>107531120
>>107531135
>dayum frog nigga you funny
>>
>>107531109
hao?
>>
File: 2877.png (2.03 MB, 1504x1024)
>>
>>107531135
>>107531143
we wuzz based n' shieet
https://www.youtube.com/watch?v=l1dnqKGuezo
>>
File: ComfyUI_temp_vtsfk_00058_.png (2.45 MB, 1216x1664)
>>
File: ComfyUI_temp_vtsfk_00059_.png (2.42 MB, 1216x1664)
>>
prompted for blue fingienails, step 1600. i think ill get it to 2000 steps, but the anatomy is starting to fry, hands get busted or lose a finger
>>
File: 1758874865493685.png (1.47 MB, 2566x1481)
>>107530943
it is a way to cope before getting Z-image edit I guess kek
>>
>>107531109
Holy based.
>>
>>107531109
Ya bastard, that's revolting.
>>
File: ComfyUI_temp_vtsfk_00071_.png (2.54 MB, 1664x1216)
welp, thats it, bye chroma
>>
>>107531305
Is this a new version of your Chroma ZiT LoRA?
>>
File: ComfyUI_temp_vtsfk_00089_.png (2.21 MB, 1088x1792)
>>107531321
huh? I'm just testing old chroma prompts with zit and a lora I trained, what kills my buzz about Chroma is how sloooow it is to gen compared to the z-model, the only advantage that chroma has right now is its nsfw capabilities, but with z loras getting better and better each day, chroma days are numbered
>>
>>107531394
My bad. I mistook you for Chroma Asian footfag anon. He was training on Chroma images for ZiT a week or two ago or something. If you don't mind me asking, did you train on the adapters or that new de-distill?
>>
File: Z-image turbo.png (1.65 MB, 1280x720)
>>
>character but realistic
augh fuck that's rancid. phew. i think i'll just wait for base and the noob tune. but i think i can call it a "successful" train.
>>
>>107531476
Very nice
>>107531231
Nice work, I want to train a character lora too with a 3060. I noticed with other loras that ZiT is fucking inconsistent
>>107530161
Very cool style
>>
>>107531101
#NotAllUpscalers
>>
>>107531101
you must have used a really shitty upscaler to end up with such a horrible result lol
>>
File: 1751928626995030.png (789 KB, 2691x1459)
>>107531476
for some style transfer shit it's really not bad
>>
File: mikudayo.gif (827 KB, 165x176)
>>107530967
such erotic movements...
>>
File: 1750730187449825.png (1.68 MB, 1280x720)
>>107531678
MiguDayo is so precious
>>
>>107529940
>>107530182
Other than compatibility with new stuff, comfy only shines if your gen flows go beyond the basics.
If your usecases are linear not requiring any weird stitching, other UIs are faster.
>>
File: ComfyUI_00470_.png (1.7 MB, 1080x1920)
>ask qwedit to remove the crown
>it removes the crown
>and adds one
you're a cheeky little rice cunt aren't you m8

>>107531730
that's a much more eloquent and not retarded way of putting it, yeah.
>>
>end of year
>expected a gimped wan2.5 local version
>expected non janky long video
>received C U L T U R E instead

i accept
>>
>>107531305
>>107531394
I love to see others reuse prompts I shared for model testing.

Z is great, but I'm maining SPARK chroma now. The extra gen time is worth it because the variety, styles, NSFW, and realism are SOTA, and it fixed the anatomy problems.
>>
>>107531766
>it fixed the anatomy problems.
it definitely improved on chroma, it's also less slopped, but Z-image turbo still has godlike anatomy and details, I still can't believe it's a 6b model, that's black magic dude
>>
File: ComfyUI_temp_vtsfk_00086_.png (2.28 MB, 1088x1792)
>>107531766
what are you smoking, don't flatter yourself, I've never used "your prompts" lmao
>>
File: Z-image turbo.png (2.02 MB, 1536x808)
>high angle, fish-eye lens effect.A split-screen composite portrait of a full body view of a single man, with moustaceh, screaming, front view. The image is divided vertically down the exact center of her face. The left half is fantasy style fullbody armored man with hornet helmet, extended arm holding an axe, the right half is hyper-realistic photography in work clothes white shirt, tie and glasses, extended arm holding a smartphone,brown hair. The facial features align perfectly across the center line to form one continuous body. Seamless transition.background split perfectly aligned. Left side background is a smoky medieval battlefield, Right side background is a modern city street. The transition matches the character split.symmetrical pose, shoulder level aligned"
damn
>>
>>107528382
512px pippa trainings for Z-Image-Turbo:
https://litter.catbox.moe/1ihaoqgjnx28pzw2.safetensors
https://litter.catbox.moe/2gwcxp7m0a21ig4s.safetensors

>>107531558 >>107530590
Ty, it's anon's training data tho. I think it still needs a higher resolution attempt
>>
File: 1761863123798078.png (1.45 MB, 1280x720)
>>
File: file.png (752 KB, 1113x392)
>>107531777
>Z-image turbo still has godlike anatomy and details
>>
>>107531777
I predict both of these models will be irrelevant sometime next year, when homebaked models start proliferating. Z is going to trigger an optimization race to see who can outdo their perf per cost/size.

>>107531816
pic, not the blonde. and yeah, I'm smoking weed for your information.

actually, this is a prompt that SPARK still has some issues with. qwen and z handle it way more consistently.
>>
File: THIS.jpg (459 KB, 1250x1566)
>>107531935
>Z is going to trigger an optimization race to see who can outdo their perf per cost/size.
that's assuming their competitors know the secret sauce (it was just training your model on real data and not being a lazy fuck!)
>>
>>107531935
>Z is going to trigger an optimization race to see who can outdo their perf per cost/size.

Why would a company want their model to be optimized any more than absolutely necessary? Companies are able to charge what they can BECAUSE there is a hard upper limit to what a consumer can realistically afford for their computer. Meanwhile companies are able to eat the cost for hardware and sell back to users at scale.
Optimizing their models would ruin that business model because the extremely expensive hardware they went into debt for just became largely useless.
>>
Does anyone know where i might find a dataset of 1 million close up images of synthetic faces generated by an ai model at a resolution of 512x512?
>>
>>107532002
Sorry. I'm only aware of one with 800k.
>>
File: absolute kino.png (2.1 MB, 912x1622)
>>107532002
saar do not redeem my 1 million synthetic faces!
>>
File: 1762109212215924.png (1.6 MB, 1440x1120)
>>107531954
https://www.arxiv.org/pdf/2511.22699
>By systematically optimizing the entire model lifecycle – from a curated data infrastructure to a streamlined training curriculum – we complete the full training workflow in just 314K H800 GPU hours (approx. $630K)
>Inspired by the scaling success of decoder-only models, we adopt a Single-Stream Multi-Modal Diffusion Transformer (MM-DiT) paradigm [18]. In this setup, text, visual semantic tokens, and VAE image tokens are concatenated at the sequence level to serve as a unified input stream, maximizing parameter efficiency compared to dual-stream approaches
>For distributed training, we employed a hybrid parallelization strategy
>In addition to system-level optimizations, we addressed inefficiencies arising from mixed-resolution training
>etc
no doubt their dataset was pretty good, but the paper describes many different optimizations they did.
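
quick sanity check on the quoted numbers, as a rough sketch: dividing the paper's ~$630K by its 314K H800 GPU hours gives the implied hourly rate. the 1024-GPU cluster size below is a made-up figure purely for illustration, the quoted excerpt doesn't state how many GPUs were used.

```python
# Figures quoted from the paper excerpt above.
gpu_hours = 314_000        # total H800 GPU hours
total_cost_usd = 630_000   # approximate training cost in USD

# Implied price per GPU-hour.
implied_rate = total_cost_usd / gpu_hours
print(f"implied rate: ${implied_rate:.2f} per H800 GPU-hour")

# ASSUMPTION: hypothetical cluster size, not stated in the quoted text.
hypothetical_gpus = 1024
wall_clock_days = gpu_hours / hypothetical_gpus / 24
print(f"on {hypothetical_gpus} GPUs that would be about {wall_clock_days:.1f} days")
```

so the headline cost is just GPU hours times a flat rental-style rate of roughly $2/hour. it doesn't tell you anything about data or staffing costs.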
>>
>>107531989
>Optimizing their models would ruin that business model because the extremely expensive hardware they went into debt for just became largely useless.

Optimizing models if anything helps them, its not just hardware costs either. Right now current AI models are extremely inefficient despite their power. The lower the compute need the less resources they need to spend or could be allocated to other use.
>>
File: 1740836177202561.png (2.51 MB, 1120x1440)
>>107531989
to undercut the competitor and render their bloated model training investments worthless. the Judeo-Burger AI companies don't want to compete in efficiency, but chinese companies and indie devs do.
>>
>>107531902
Are these identical or what is the second one?
>>
>>107531989
This retarded mindset works exactly only until the bubble pop
>>
>>107532059
>undercut the competitor
They are in cahoots on this matter.

>Optimizing models if anything helps them
They won't start doing this until the consumer is completely priced out of personal computing.
>>
I ran the lora and it gen'd mustard gas
>>
>>107532002
Possibly with some research but to get the results you want, you gotta make your own shit.

>synthetic faces

You can do SD1.5/SDXL with various lighting and face loras for this at lightning speeds (use lightning loras or lcm loras). For even more variation, create a few thousand (yes a few thousand) highly unique faces in chroma, switch back to SD1.5/SDXL with IPAdapter face combine and go nuts.

Also make use of the Random Number node (WAS Node Suite for ComfyUI).
>>
>>107532116
also its a zip bomb that contains 2 terabytes of goatse
>>
>>107532130
thanks for your long reply but i was being a retard on purpose
>>
That sex offender is right. Admin has psychosis because they are chasing short term gains.
>>
>>107532162
>Admin has psychosis because they are chasing short term gains.
I have decided to concede and eat my own shit on this issue. Completely unregulated industry can sometimes be bad.
>>
>>107532178
I think these companies are protected because they are so friendly towards certain high ranking people. There is nothing more to it.
Google/Alphabet was split up and Microsoft had lawsuits in the past too. But these companies? None of them are legally challenged.
>>
File: 1734734119231141.png (2.99 MB, 1920x1088)
>>107532054
you don't get it, the Jews WANT the models to be big and expensive, requiring their expensive server hardware (licensed from a company with the Masonic All-Seeing Eye in its logo), because that's the only way they can extract rent from AI users. This promise of lucrative AI slavery and serfdom is the only thing holding up the AI bubble. This is why it's strategically-advantageous for Chinese labs and indie devs to make cheap models, it destroys the Jewish AI bubble, collapses the US economy, and ends the chip/wafer hoarding.

>>107532075
>They are in cahoots on this matter.
They have no choice. We will be able to make models as good as Z on local hardware soon.
>>
File: G5voW0KbAAAGfnO.jpg (234 KB, 1206x2012)
>>107532193
That can only last for as long as it doesnt start affecting other businesses/pops the bubble.
There's no shortage of RAM. These companies colluded to buyout each other's stock to kill the competition. The consumer isn't even the "customer" anymore.
>>107532193
Its a combination of speculation, money in politics and also a desire for growth. The erosion of any regulatory pressures has basically unleashed the full flood gates here.
>>
File: 00169-3524173799.png (920 KB, 896x1152)
>>107532237
you can't just print money bro
>>
>>107532247
I'm waiting for one of these companies to buy out an essential service and start trading it between each other.
>>
>>107531305
boing
>>
>>107531394
>>107531495
>>107531816


dat sum fine 1girl
>>
>>107532018
did he ever reply as to why the fuck anyone would want that?
>>
>>107532153
Based, miss chokola
>>
>>107532153
absolute madlad
>>
>>107532153
>>107532237
>>107532247
absolute madlad, go off king

>also gen one of the janny fucking miku just to piss those guys off too
>>
>https://civitai.com/models/2218365
>CyberRealistic Z-Image Turbo
>finetune
Looks like slop lora merged into the model
>>
>>107532316
Seems quite pointless but what do I know about anything.
>>
>>107532237
>They have no choice. We will be able to make models as good as Z on local hardware soon.
anon that local hardware will be an h200 that every mon/g/ol will buy for pennies after the ai bubble bursts.

you can gen your goons and power your own steam turbine all in one easy step. i'm gonna use mine to power a sauna with floor to ceiling goon screens. the future is looking bright baby.
>>
>>107532316
>>107532324
I like cyberrealistic's finetunes for other models. Some of the best realism models out there. Why you'd make a realism model for a model that's already the most realistic is beyond me.
But hey, if it's possible, somebody was gonna do it, no matter what.
>>
>>107532359
>Why you'd make a realism model for a model that's already the most realistic is beyond me.
at this point we got so used to slopped models it's a Pavlovian response, even if the model is really realistic you have the reflex to try and make it more realistic lool
>>
>>107532403
yeah, anyway I'm downloading it right now. the results seem kinda nice. Gonna give it a spin.
What the hell, right?
>>
>>107532430
If you gen Adolf Hitler and Anne Frank, then it's allowed.
>>
File: Untitled.png (2.7 MB, 1776x2224)
>>
>>107532445
I'll see what I can do. I don't have anon's superb hitler lora though.
>>
>>107532493
would

imagine the smell
>>
when my gens are so good i dont wanna share it because why would i feed you piggies
>>
brother... oats...
>>
File: 1747441956665959.png (1.52 MB, 1280x720)
>>
>>107532550
I am probably the most talented poster itt.
>>
>>107532630
Your prompts are weak and your workflow is basic.

jk i'm sure you're well above average
>>
i am the cheese
i am the best genner in the general
>>
File: ComfyUI_temp_rnjze_00011_.png (2.29 MB, 1408x1408)
>>
the longer we wait for z base the better it will be cooked
>>
>>107532791
>He thinks they're still cooking it.

Come on, retard.
>>
>>107532788
yucky hags
>>
>>107532788
tasty hags
>>
>>107532791
>>107532803
it'll be released but it's definitely finished, they're probably waiting for the right moment, like christmas or Flux 2 Klein release, I bet it felt good to dunk on those cuck freaks they want to do it again lol
>>
every time someone claims it's coming out soon, they're adding another epoch to the queue
>>
training twinflow on sd1.5 is much more reasonable. batch size of 4 lets you do 1 step a second on a 5070ti

pixart sigma is a ridiculously complicated architecture and i'm a big stupid for trying it first as my first training ever before SD1.5

sd1.5 in comparison is so simple it's almost kinda cute and embarrassingly lewd at this point. i am 100% confident i can train a twinflow for sd1.5 properly and make a rentry describing exactly what needed to be changed
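
rough arithmetic for the throughput figure above, as a minimal sketch: it assumes the quoted 1 step per second at batch size 4 holds steady for the whole run. the 5000-step run length and the function name are made up for illustration, they're not from any trainer.

```python
def training_throughput(batch_size, steps_per_second, total_steps):
    """Return (samples per second, wall-clock minutes) for a fixed-rate run."""
    samples_per_second = batch_size * steps_per_second
    wall_clock_minutes = total_steps / steps_per_second / 60
    return samples_per_second, wall_clock_minutes

# batch_size and steps_per_second come from the post; total_steps is hypothetical.
samples_per_s, minutes = training_throughput(
    batch_size=4, steps_per_second=1.0, total_steps=5000
)
print(f"{samples_per_s:.0f} samples/s, {minutes:.1f} min for 5000 steps")
```

real runs will drift from this once checkpoint saves, sampling, and dataloader stalls are counted.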
>>
>>107532788
make race queens
>>
>>107532237
if any mentally stable anons are interested in a non-schizo introduction to Cloud Capitalism extracting rent from their serfs living in their walled garden ecosystem digital fiefs, Technofeudalism is a great read. I've been posting my thoughts on it every few chapters on /cyb/
>>
>>107532827
Cool. So how does it alter the base 1.5 functionality?
>>
>>107532838
yes but how do the serfs make money?
>>
Is base out yet?
>>
>>107532846
>Cool. So how does it alter the base 1.5 functionality?
You have to make a twinflow version of the model, then finetune it to support the new second timestep parameter.

Here's some claude slop
https://rentry.org/uwsw9hwo

Treat everything in the above rentry as a hallucination and potentially subtly wrong. I'll make a full slop rentry fully explaining every single change once I'm confident I have actually figured it out.


>>107532871
>yes but how do the serfs make money?
that's the neat part, they don't. at least not in the digital fiefs. they CHOOSE to enter these walled garden ecosystems because of the dopamine the Cloud capitalists are offering, and the cloud capitalists extract "cloud rent" from these users in the form of their habits and personal information

That's why it's feudalism, not capitalism. They don't make money.
>>
>>107532882
Base is never going to be released.
>>
>>107532901
Is that your opinion or is it based on Chinese Cultcure?
>>
File: ComfyUI_temp_fyubl_00005_.png (1.94 MB, 1088x1792)
>>
>>107532964
*sniffs in asian*
>>
>>107532964
WRINKLY HAG
(i get no pussy irl)
>>
>>107532906
>based
>culture

zamn!
>>
File: ComfyUI_temp_fyubl_00010_.png (2.64 MB, 1088x1792)
how can I avoid the fucking bokeh, NAG doesnt work
>>
File: based cockroach.png (152 KB, 1451x765)
>>107532901
>Base is never going to be released.
Not so fast! If the turkish god is not giving up, we shouldn't either!
>>
File: sd15.png (438 KB, 512x512)
>>107531935
>pic, not the blonde. and yeah, I'm smoking weed for your information.

just acknowledging this casual anon-getting-BTFOed, dont mind me
>>
>>107533013
insist on it
>neg: "blur, background blur, bokeh"
at some point the model will finally understand
>>
probably too early to ask but is there a model or lora that can do chroma but with z speeds?

>>107533034
based Furkmanistan
>>
>>107533034
comfy and furk are both optimistic, and i'm still not bored of z turbo, so i am still not dooming

>>107533080
>lora that can do chroma but with z speeds?
i can't imagine that this is possible in any reasonable way without butchering chroma's improvements to the "base" model (which isn't even a base model, its a distilled model, which is why I can't imagine that this is possible)
>>
>>107533080
That guy sucks a mean dick
>>
furkmanistan will personally travel to china and have a stern word with those fucking chinkoids about what it means to honor your word. i trust the plan.


(post furk pics i can animate in wan for the memes tomorrow morning)
>>
>>107533057
I feel like we're not thinking with portals about negative prompts. If we do natural language for the positive shouldn't we do it for the negative as well? "The background is blurred. The image is very blurry. There is a lot of blur in the image" etc
>>
>>107530854
Share the workflow please?
>>
>>107533118
https://github.com/BigStationW/ComfyUI-Prompt-Manager/blob/main/workflow/workflow_Z-image_turbo.json
>>
>>107533112
that's a fair point, try with natural language on the negative and see if it improves anything
>>
>>107533094
>comfy and furk are both optimistic
comfy is disappointed because then he can't get more money out of the api. furk is on the side of good
>>
>loss already going up 3000 steps out of 5000 steps
i wish i knew what i was doing

>>107533132
>furk is on the side of good
I'm just excited for the humiliation ritual furk will have to go through every day using an Israeli model when (if) LTX2 comes out because he is pretty funny when he lolcows
>>
>>107533122
Thank you!
>>
>>107533155
I think he prefers grifting more than he hates (((them))), but yeah I'm really curious to see if he's gonna shill it or not
>>
File: file.png (311 KB, 721x545)
>>107533155
>I'm just excited for the humiliation ritual furk will have to go through every day using an Israeli model when (if) LTX2 comes out because he is pretty funny when he lolcows
kek
>>
>>107533110
>clash of the cultures

christmas came early!
>>
>>107533110
>furkmanistan will personally travel to china and have a stern word with those fucking chinkoids
Can a man be more based than that? I don't think so
>>
>>107533168
maybe he'll cognitive dissonance and separate Israel from the Israelis (again, I'm excited for the humiliation ritual)

if it's SOTA he will be unable to ignore it, and if it's not SOTA well who cares anyways
>>
>>107533171
kek, more based than xi and z, wtf
>>
>>107533171
>the best lora trainer in the world
>hates the idf
yeah, this guy rocks
>>
>the best lora trainer in the world
>>
>>107533243
He's like Bruce Willis in Armageddon - the best driller in the world.
>>
what's with the immune deficiency foundation hate? is this thread full of germs?
>>
>>107533164
No problem!
>>
what does anon think of the new ui for comfyui?
>>
>>107533343
dunno, never pulled
>>
all your base are belong to us
>>
>>
>>
>>107533488
>>
>>107533343
no one asked for it
its obvious fennec fag has lost control of his project
>>
>>107533500
This is like that movie, Alien: Earth.
>>
File: 1750483388371760.png (44 KB, 1166x257)
:(
>>
>>107533497
>>
>>107533507
>Hires someone for UI who realises they need to make UI changes in order to keep their job
>makes changes upon changes, increasingly stupid shit, users quickly get fed up
Yeah, rookie mistake
>>
>>107533546
All your base are belong to Alibaba
>>
>>107533546
They will obviously start on that after Base is done, how the fuck would he be able to say when it will be released?

They need to at least start training it before they can give any kind of estimate.
>>
File: 1735351044571977.png (2.77 MB, 1120x1440)
>>
>>107533562
What's stopping them from releasing base? wouldn't they need a base model to make the turbo?
>>
>>107533598
every time you mention it, it gets delayed by 1 hour. did you see the shit flinging tantrum ldg threw over wan 2.5? devs saw these threads and said "nah"
>>
>>107533507
it fixed the glitchy issues when I save the workflow on android
>>
>>107533609
Schizos of /ldg/ are driving the devs away. I'm sure this ZiT dev who is replying in their Discord server is probably an alcoholic by now
>>
>>107533609
>>107533635
we're not that important, they probably don't even know /ldg/ exists lmao
>>
File: z-image_00837_.png (2.08 MB, 1152x2048)
>>
>>107533609
I don't care about that, it was a genuine question.
>>
File: z-image_00839_.png (2.02 MB, 2048x1152)
>>
File: z-image_00840_.png (1.91 MB, 2048x1152)
>>
>>107533598
>wouldn't they need a base model to make the turbo?
No, that was not a base model made to train upon, it was just made to generate pretty images which were used to train Turbo.

The 'Base' everyone is waiting for is a new model made specifically to be easy / effective for further training, either lora or full finetune.
>>
>>107533696
How do you get dark images with zit?
>>
>>107533609
>devs saw
lol
>>
Ignore the failgen, but look at that dude that just appeared out of nowhere unprompted
>>
>>107529450
Agreed. Nothing, but soulless 3DPD garbage. OP should go back to fucking Faceberg and kill himself.
>>
>>107534094
>out of nowhere
It's AGI
>>
>>
File: ComfyUI_00029_.png (2.12 MB, 1520x1040)
>find the seemingly right prompt + loras + sampler/scheduler combination
>anatomy goes out of the window
And here I thought ZIT was different
>>
>>107534464
>tweaking the 1000s of parameters in the hopes of finding a configuration that makes the model do something no-one else expected it could do
I think 80% of my time is spent doing this, usually with 0 results
>>
>>107534094
this gen makes kfc blush
>>
Am I misremembering, or has SDXL started to run faster for anyone else too recently (should be since last month or so) under Comfy?
>>
which is easier to get good face retention with for image to video?
>>
>>107534526
>>107534464
my humiliation ritual is returning to euler simple
>>
>>107530465
>>107531109
how the fuck do I do this
>>
>>107534831
>my humiliation ritual is returning to euler simple
same
>>
>>107534859
>>107534831
i dont care if the results are good. using euler feels like bending over and spreading my ass cheeks
>>
>>
>>107528979

Can you share the wf/prompt for her? Really nice.
>>
File: newbie.png (27 KB, 730x458)
Do I have to pull again to try Newbie? I thought gemma support is already in.
>>
Why is it doing this? Why is it uninstalling the latest and installing an older one? I can't use comfy.
>>
>>107531826
That's insane, typos notwithstanding.
>>
>>107534900
Noice. The only thing I can nitpick is the position of her right foot.
>>
>>107532493
>>107532616
>>107533497
>>107534383
Nice gens
>>
>>107534900
Very good
>>
>>107534094
Thanks for sharing
>>
>>107533549
>>107535171
KYS no genners
>>
>>107534857
download and save as mp4
>https://files.catbox.moe/ccsd1r.mp4

load it in comfyui
follow instructions to install missing nodes
get missing models (Smooth Mix, vae, encoder etc)
use an image as first frame (i2v)

this image was obviously generated by ZIT
>https://i.4cdn.org/g/1765567645607030.jpg
>>
>>107535086
you might have to update the custom node itself
>>
man i fucking hate genning on comfyui. not fucking fun to prompt at all
>>
File: zitctrl.jpg (554 KB, 3012x2160)
>install controlnet shit for zit yesterday night but didn't get it to work
>start again today
>install is bricked
>2 hours spent troubleshooting to get back online
>finally gen
>the result is dogshit

Left original, right v2 ctrlnet.

Chinese culture.
>>
File: zit_00042_.png (1.01 MB, 720x1280)
>>107535368
>>
>>107535086
Is there a workflow template?
>>
>>107535382
Anon, I will tell her that you did not bookmark her post
>>
>>107534648
Comfy is very fast and has the best UI right now for model loading and model swapping

>>107535331
You can bloat it with Swarm if you're afraid of nodes or make it more artistic with Krita if you're feeling creative
>>
>>107535405
just slap the basic nodes together
>>
File: 113679313.jpg (200 KB, 832x1216)
>>
File: ComfyUI_02770_.png (1.36 MB, 784x1440)
>>
File: 113679233.jpg (94 KB, 832x1216)
>>
>>107535368
you've learned a valuable lesson anon, if no one talks about a product, it means it's ass
>>
File: 113667278.jpg (820 KB, 1920x2816)
>>
>>107531059
What's the base model?
>>
File: 113679250.jpg (159 KB, 832x1216)
>>
>>107532493
uhh what's the lora for that body type? looks like a mix of asanagi and alex ahad
>>
File: 1755133786840976.png (160 KB, 898x741)
Flux 2's vae is under the apache 2.0 licence, right? it looks like it's better than the first one, what if Z-image was using Flux 2's vae instead?
>>
>>107535755
You'd need to retrain the model. The flux2 vae has way more channels.
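
Rough sketch of why the channel count alone forces a retrain: the model's first projection layer has a weight shape that depends on the number of latent channels, so the trained weights no longer fit. All numbers below (16 vs 32 channels, patch size 2, hidden size 3840) are assumed for illustration, not read from either model's config:

```python
def patch_embed_shape(latent_channels, patch_size, hidden_dim):
    """Shape of the weight matrix that projects one flattened latent patch
    (channels * patch * patch values) into the transformer's hidden size."""
    return (hidden_dim, latent_channels * patch_size * patch_size)


# Assumed values, for illustration only.
OLD_VAE_CHANNELS = 16  # assumption: VAE the model was trained with
NEW_VAE_CHANNELS = 32  # assumption: a VAE with "way more channels"
PATCH_SIZE = 2         # hypothetical
HIDDEN_DIM = 3840      # hypothetical

trained = patch_embed_shape(OLD_VAE_CHANNELS, PATCH_SIZE, HIDDEN_DIM)
needed = patch_embed_shape(NEW_VAE_CHANNELS, PATCH_SIZE, HIDDEN_DIM)

print("trained weight shape:", trained)
print("shape the new vae needs:", needed)
print("weights reusable as-is:", trained == needed)
```

Even with matching channel counts, two VAEs learn different latent spaces, so the denoiser would still see inputs it was never trained on.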
>>
what is anon's acceptable speed for wan 2.2 i2v?
>>
>>107535535
>>107535641
>>107535633
>>107535563
nice
>>
>>107535817
Definitely not what I am getting with my 3060.
>>
People shouldn't be allowed to train loras.
>>
Ai-toolkit can't train from a merged safetensors file? Do I have to get the entire bloatshit from HF?
>>
>>107536021
loras were a mistake indeed
>>
Reminder that the ZiT conditioning noise nodes also work on qwen with the same effect
>>
File: comfypoop.png (56 KB, 943x433)
Why is he like this?

He always codes better than the rest
>>
>>107536127
>Why is he like this?
lmao, this nigga thinks it's the moment to throw shade at other people's code, he literally fucked up NAG and MultiGPU with his update bullshit
>>
>>107536075
>the ZiT conditioning noise nodes
the what?
>>
>>107535755
>what if Z-image was using Flux 2's vae instead?
I'd rather ask Alibaba to go for a pixel-only model, now that it's as fast as a vae one without the pixel compression, perfect for edit models
https://github.com/LTH14/JiT
https://xcancel.com/LodestoneRock/status/1998215045118112029#m
>>
>>107536127
I stand with Comfy
>>
File: wut.jpg (5 KB, 254x198)
Are my loras coming out deepfried because I have the github version of ai toolkit and not the patreon?
>>
>>107536165
yeah
>>
File: file.png (63 KB, 754x779)
>>107536127
lmao, there's no mistake, he just doesn't understand. it's not particularly difficult: yes, they run the control layers twice, once on just x, which is then used as hints to the noise_refiner, and again on unified, which is used as hints to the main transformer layers
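
Toy sketch of that two-pass flow, only to show the call order. Every function here is a hypothetical stand-in working on plain lists of integers, not the actual Z-Image controlnet code, and the image-then-text ordering of the unified sequence is an assumption:

```python
def control_layers(tokens):
    """Stand-in for the control branch: returns one hint per input token."""
    return [1 for _ in tokens]


def add_hints(tokens, hints):
    """Stand-in for injecting hints into a block's activations."""
    return [t + h for t, h in zip(tokens, hints)]


def noise_refiner(x, hints):
    return add_hints(x, hints)


def main_layers(unified, hints):
    return add_hints(unified, hints)


def forward(x, text_tokens):
    # Pass 1: control layers on the image tokens x only,
    # result is used as hints to the noise_refiner.
    x = noise_refiner(x, control_layers(x))

    # Assumed ordering: image tokens first, then text tokens.
    unified = x + text_tokens

    # Pass 2: control layers again, this time on the unified sequence,
    # result is used as hints to the main transformer layers.
    return main_layers(unified, control_layers(unified))


out = forward([1, 2], [10])
print(out)
```

In this toy version the image tokens pick up a hint in both passes and the text token only in the second, which is why running the control layers twice is not redundant.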
>>
>>107536165
there's a patreon version and it's better?!
>>
>>107536260
>>107536127
Cumfart really thought the Alibaba gods (they made Z-image turbo) could've made such a dumb mistake, sorry comfy, but those engineers are way more talented than your Ui jeets you hired kek
>>
>>
>>107536278
The inpaint part is so simple too, why is that missing
>>
>>
What is the use case for using a UI other than Comfy?
>>
fun
>>
>>107536339
comfyui is a jeetware shitheap. new uis are in c++ so no python niggardry. cumfart dropped the ball so hard people don't mind hitting the reset button
>>
>>107536165
Aitoolkit is barebones, overrated trash.
Jeets like it because of its ebin UI but it's direly lacking in functionality compared to its competitors.
Don't be a jeet.
>>
>>107536360
>new uis are in c++
like?
>>
>>107536367
Stability Matrix
>>
>>107536367
anistudio
>>
grrr I don't like how you trained the model, you have to change it and train it again >:(
>>
>>107536376
hi trani
>>
>>107536376
is there one that's used by more than one person?
>>
>>107536395
your mom?
>>
>>107536376
Doesn't make sense.
>>
>>107536415
>>107536415
>>107536415
move when ready
>>
>>107534464
how do you do fellow... oh fuckit i do a shit ton of tests with not many good and often inconclusive results too, not appropriate for this board but i do lurk and enjoy what's posted here
https://gofile.io/d/Iy753R


