/g/ - Technology



Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107455915

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
1g...NO...2girls
>>
File: ComfyUI_02819_.webm (1.85 MB, 624x1152)
I should train a wam lora...
>>
Comfy must be dragged into burger king and fed
>>
>>107460126
i got u senpai
>>
why does aitoolkit do that
>>
File: YunyunOfficeSex3.jpg (1.4 MB, 3072x3072)
>tags: bible black \(style\)
what other interesting artstyle are there for anime?
>>
>>107460145
that's 4girls doe...
>>
blessed thread of friendship
>>
>>107460156
asanagi
>>
>>107460156
go shopping on civit and try out the ones that look cool
>>
>>107460145
why does the girl in the stripes on the right look like she's ready for me to man up and settle down with me?
>>
File: zimg_0022.png (2.27 MB, 1024x1496)
>>107460166
eastern european dataset probably
>>
>>107460156
Doesn't look like Bible Black at all.
>>
>>107460222
which one has the biggest cock? 1girl is too boring for me now
>>
>>107460156
ask adt
>>
File: YunyunOfficeSex5.jpg (1.42 MB, 3072x3072)
>>107460224
it does, but to make it even closer i'll have to add the artist name
>>
>>107460134
KellyKelley?
>>
File: ZiMG_00740.jpg (724 KB, 1344x1728)
>>
File: zimg_0028.png (2.11 MB, 1024x1496)
>>107460247
>the last thing i see at last call
>>
File: ZiMG_00747.jpg (813 KB, 1344x1728)
>>107460306
>>107460222

hi psx anon
>>
ok
>>
>>107460361
Come on, compared to all the other shit loras, that's kino
>>
Do you guys keep several separate comfy installations for different purposes, like image or video generation? Or just one single installation?
>>
>>107460375
I keep 0 cuckfyui cause it's shit
>>
seems like i am getting much better results training on the de-distilled version than using the lora adapter
>>
>>107460375
Yes, one for WAN and one for images
>>
>>107460375
you have to have multiple in case an update fucks everything which seems to be every update at this point. python was a mistake
>>
File: romans.png (1.81 MB, 768x1344)
its so close to being awesome
>>
>>107460375
no, i go even further the opposite way and share one venv between comfy, ai-toolkit, and some other shit
>>
>>107460375
i keep one. never run into any problems as these retards who keep downloading sketchy workflows from indians on civitai
>>
Do you guys save your gens as pngs or jpgs?
>>
>>107460408
webp, best compression
>>
File: ZiMG_00753.jpg (672 KB, 1344x1728)
>>107460358
>>
>>107460404
Man I have so many venvs, I really should do this
>>
File: 1763526185063216.png (2.09 MB, 1280x1544)
How much RAM do you think next gen Nvidia cards will have?
>>
>>107460392
>>107460390
That's what I do as well.
Just wanted to check what others do, because this feels more "clunky" than "comfy".
>>107460404
That's a lot of bravery
>>
putting "with down syndrome" on my zim 1girls makes them look prettier, funny.
>>
>>107460114
seeing jennay pics in the op pic warms my soul. thank you.
>>
>>107460426
bout 3fiddy
>>
>>107460392
>in case an update fucks everything
Why update? if it's working dont touch it, that's how python works
>>
>>107460426
>he thinks they will make another consumer gen
>>
>>107460400
these are not romans
romans had no such dresses
>>
>>107460426
>next
>>
>>107460452
hello, yoland from comfyorg here. nodes 2.0 is great so it's worth updating for new quality of life features!
>>
>>107460458
maybe not dresses but did they have the big tiddies?
>>
>>107460375
No, but I keep a fresh ZIP several versions behind, and only when some bleeding edge shit becomes mandatory do I make an actual update
>>
File: zimg_0044.png (3.48 MB, 1024x1496)
>>107460358
hello anon
>>
don't forget to subscribe to comfy cloud to help support the best development in ai! comfy is for everybody!
>>
File: ZiMG_00756_.jpg (748 KB, 1344x1728)
>>107460471
likewise.

So this is what I managed to train today.

picrel
>>
>>107460463
yes also brown skin
>>
>>107460375
>>
>>107460484
Share pls?
>>
>>107460392
>in case an update fucks everything
I learned this the hard way. If you have a working comfy always make a backup/copy before updating.
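
A minimal sketch of the backup-before-update habit described above, using only the Python standard library. The function name `backup_install`, the `skip` parameter and the timestamped folder naming are all illustrative assumptions, not part of ComfyUI or any of its tooling.

```python
import shutil
import time
from pathlib import Path

def backup_install(install_dir, backup_root, skip=()):
    """Copy install_dir into backup_root under a timestamped name.

    skip is a tuple of glob patterns for names to leave out of the copy
    (hypothetical use: large model folders that an update does not touch).
    backup_root should live outside install_dir.
    """
    src = Path(install_dir)
    stamp = time.strftime("%Y%m%d-%H%M%S")
    dest = Path(backup_root) / f"{src.name}-{stamp}"
    # ignore_patterns() with no patterns ignores nothing, so skip=() copies everything.
    shutil.copytree(src, dest, ignore=shutil.ignore_patterns(*skip))
    return dest
```

Run it before pulling an update; if the update breaks something, the timestamped copy is still there to go back to.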
>>
>>107460499
No
>>
>>107460426
this >>107460455
there's not gonna be a 6090

the way things are going nvidia isn't gonna bother with consumer products
>>
>>107460523
Yes
>>
>>107460392
>comfy is so bloated you now have to double the space it takes up because now it's incredibly unstable
lol. new UI when?
>>
File: zimg_0048.png (1.79 MB, 1024x1496)
>>107460484
nice, looks like it came out well
>>
File: z-turbo_00017_.png (2.39 MB, 1280x1280)
>>
File: file.png (325 KB, 2560x1050)
>>107460462
It does look neat.
>>
File: eboncchles.jpg (7 KB, 275x183)
>>107460489
>also brown skin
>>
>>107460526
prime time for nvidia to do a "refresh" generation like they used to back in the day, where the new numbered series is the same chips with different numbering.
>>
>>107460540
it feels like a bag of dicks which is a problem
>>
>>107460540
>the stop button is back
uh? is it time to update now?
>>
>saturday
>still no release date
why did they decide to cancel it like that. it's really strange.
>>
>>107460573
>is it time to update now?
no. only when a new model drips
>>
>>107460484
I've only ever used Civit's trainer. Is it hard to train locally? I got a 5090 now.
>>
>>107460540
Visually perhaps but it is still retarded because
>left to right workflow with huge ass blocks
They should copy Houdini or Nuke because these have been industry standard workflows for god knows how many years by now.
>>
>>107460575
Last minute censoring
>>
File: ZiMG_00765.jpg (851 KB, 1344x1728)
>>107460499
Will put on my civit and link here

>>107460535
yes, thanks for the pointers!

>>107460578
I used Ostris's AI Toolkit. Very straightforward, was easy. on a 5080 btw
>>
>>107460552
every consumer gpu they make is resources that could be used on data center products

it already got to the point where it doesn't make sense for them to make consumer chips if they can use that fab capacity to make AI chips. even then it would still make sense to make consumer shit on a fab that can't make AI shit, but now with the memory shortage I'm not sure it makes sense anymore to use lesser fabs to make consumer products
>>
IT'S OUT
https://huggingface.co/Tongyi-MAI/Z-Image-1.0-Base
https://huggingface.co/Tongyi-MAI/Z-Image-1.0-Base
https://huggingface.co/Tongyi-MAI/Z-Image-1.0-Base
>>
>>107460631
eat shit and die
>>
>>107460575
Never going to be released.
>>
File: z-turbo_00025_.png (2.49 MB, 1280x1280)
>>107460631
real
>>
>>107460631
why is it still only 6B?
>>
>>107460631
I knew it was fake but clicked it anyway
>>
>>107460601
>Ostris's AI Toolkit
Thanks I'll take a look.

That style looks pretty cool.
>>
They're not going to release the base model btw. All we get is turbo.
>>
>>107460540
This is just bored UI faggots changing for the sake of change just so they can keep their job. Dragged and shot.
>>
File: zimg_0057.png (1.76 MB, 1024x1496)
>>
File: 2girl_00001_.mp4 (2.22 MB, 1120x1440)
>>107460145
>>
>>107460647
lol watch them create a NovelAI clone with the dataset they got from LAX
>>
>>107460649
This is the reason why every UI gets worse and worse with every update. Every website, every piece of software and every game. Fuck UI faggots.
>>
>>107460660
haram
>>
>>107460540
>76fps idle
holy shit lol
>>
>>107460645
>I knew it was fake but clicked it anyway
kek, same, it gave me some nerves desu lel
>>
File: file.png (177 KB, 1230x781)
I'm updating all the way to wan2.2 just because I want to use THIS lora that claims to create sprite animations for 2d games, but it seems the entire logic changed, there are video nodes builtin now instead of having to use FILM libraries or whatever.
Can a kind soul please point me to a basic updated workflow for i2v? I'm using a porn one here, but it seems to be a little too customized
>>107460620
I agree with that. Thinking in business only, Nvidia doesnt have much reason besides having a fallback to keep humoring the gaming market. But when the AI bubble pops (not that AI will become irrelevant, but when the profits dont make up for the 2008-tier leverage that big tech is doing), they will need the consumer market back.

What I THINK will happen, is that chinese companies will start making their own graphics cards. You dont need 7nm to make consumer-grade graphics cards for gaming, and for anything above 22nm China's fabs are just fine.
>>
File: zimg_0060.png (1.65 MB, 1024x1496)
>>107460660
nigga this gay as hell
>>
Would it be possible to just mod back the old node visuals? This overdesigned solid color nonsense is like running android on low tier phone
>>
>>107460682
I quite like the
>/ldg/anon's Wan2.2 lightx2v workflow
from here https://rentry.org/wan22ldgguide
>>
File: romans2.png (1.9 MB, 768x1344)
>>107460458
its hard to prompt it, i might need to make a lora from screen shots of hbo rome
>>
>>107460730
>i might need to make a lora from screen shots of hbo rome
that's a good idea actually
>>
surprised no one's made Twin Peaks, absolute kino visuals
>>
File: ComfyUI_00203_.mp4 (3.39 MB, 640x640)
>>
i remember those first few days where i actually thought the base model would be released when i woke up lmao
>>
File: 2girl_00002_.mp4 (1.48 MB, 992x1440)
>>107460696
idc
>>
>>107460730
what is this sd 1.4?
>>
File: ZiMG_00780_.jpg (816 KB, 1344x1728)
>>107460763
This style doesn't really translate well into videos
>>
>>107460788
these are great thanks anon
>>
>>107460779
its an sdxl 1 checkpoint
>>
>>107460763
>>107460788
can i get a link to this lora? this reminds me of the artist who drew frankenstein but i cant remember his name.
>>
>>107460666
kek. I was, and still am, generating a video though. It takes forever on my rtx 3060.

>>107460462
If you really are from comfy, here's some feedback for the new UI: when I click on a dropdown, I expect that I can instantly start typing to filter the options, not have to click on the search field for that.
>>107460724
I tried that one, but it uses a lot of custom nodes that I didnt have in my old comfy folder, and the node manager couldnt find them automatically to download.
I updated comfy, and it couldnt find these nodes either, and this stupid new UI doesnt have the node manager to auto-download.
A few months without updating and I'm already completely obsolete in using comfy UI lmao

I will try the porn workflow I got off civitai and see if it works. The low_noise and high_noise novelty also feels stupid to me.
>>
>>107460426
6070 - 12GB
6080 - 16GB

remember how for the longest time intel was holding computing back with 4c/8t CPUs for like a decade?
yeah, nvidia is doing the same thing now
>>
File: ZiMG_00783_.jpg (2.52 MB, 1344x1728)
>>107460837
Working on the civit page, gimme a few, will link it here
>>
>>107460730
>to make a lora from screen shots of hbo rome
That would be awesome
>>
>>107460875
>Working on the civit page, gimme a few, will link it here
thanks, i found the artist too. beautiful art
>>
File: YunyunOfficeSex6.jpg (2.63 MB, 3072x3072)
>semi realism, anime
>>
>>107460844
>If you really are from comfy
if he was it's your duty to tell the corpo cocksucker that ruined everything to fuck off. there is no vision at comfyorg other than trying to get people to use the API nodes/comfycloud
>>
>>107460900
>BernieWrightson
oooh nice! I didnt train it off his stuff tho
>>
>>107460753
This general respects David Lynch because he is a certified appreciator of youthful beauty, but also Lynch is more than just the visuals so there's not as much of a purpose to generate that

I think when video with sound comes out you'll see more Lynchian stuff

>>107460866
Manufacturing is holding back compute right now. The consumer segment doesn't matter. Nvidia would love to manufacture 3x more of every card 5090 and above but there is literally not enough DRAM or fabs on the planet
>>
>>107460912
>I didnt train it off his stuff tho
i know, but it was bothering me that i couldnt remember what your images were reminding me of. i love this type of hatching, it's great.
>>
>>107460844
>I tried that one, but it uses a lot of custom nodes that I didnt have in my old comfy folder, and the node manager couldnt find them automatically to download.
>I updated comfy, and it couldnt find these nodes either, and this stupid new UI doesnt have the node manager to auto-download.
Install the newest ComfyUI Manager and it should work fine. You might wanna do a fresh comfy install if yours was really old.
>>
dead thread
Z hype is completely gone
Base never
>>
BLOAT MORE. COPY EVERYTHING TWICE. YOUR ENTIRE DRIVE SHOULD BE FILLED WITH COMFYUI INSTALLS WITH THEIR OWN VENVS. DOWNLOAD UV AND CONDA. USE ELECTRON. NODES2.0 ARE GOOD FOR YOU. HAVE MULTIPLE PYTHON INSTALLS JUST IN CASE. WHAT DO YOU MEAN YOU AREN'T COMFORTABLE???!?!
>>
>>107460919
>Nvidia would love to manufacture 3x more of every card 5090 and above
Actually they like intentionally limiting supply to create artificial demand. They've done this with every launch since like 2000 series or possibly even longer. The supers for 5000 series got delayed because the old cards weren't selling fast enough and suddenly they had too much supply. Now with the memory happenings there's probably not gonna be supers.
>>
>>107460956
WAAAHHHHHHH WAAAAAAAAAH GUU GUU GAGA!!!
>>
>>107460956
>WHAT DO YOU MEAN YOU AREN'T COMFORTABLE???!?!
my sides are in orbit kek
>>
>>107460955
>>107460956
nogens malding
>>
>>107460956
my file organization is unironically pisspoor but at least i only have a single venv / trainer / engine install
>>
>>107460956
I just git pull and I'm comfy
except for the low framerate of the UI, that's annoying
>>
>>107460976
yoland from comfyorg here. it's actually more performant!
>>
>>107460995
wheres julien, yoland?
>>
if im doing 50 steps, do i do 25 hires steps or 50? im a boob, please help
>>
>>107460956
is actually easier to make multiple distrobox dockers to handle my cute anime wife image generators
>>
>>107460964
>Actually they like intentionally limiting supply to create artificial demand.
Read what you quoted. You're talking about poorfag shit that they make no money on. TSMC has no more capacity. The supers for the 5000 series got delayed because there is no more DRAM

And why is there memory happenings huh retard? Is that also intentionally limiting supply? What do you feed a goyim to emenate brainwaves in this manner wtf
>>
>>107461034
We need GRORIOUS CHINA to save us from the greed of westerners and flood the market with 65nm gpus that performs just good enough for games, and shit tons of ddr4 so we poorfags can generate 1girls at home
>>
File: images-2.jpg (14 KB, 576x324)
>>
>>107461034
It applies to the 5090. Even way before all the AI happenings they've been pretending they can only make a few of the top consumer cards so they need to be expensive. If they made a bunch of them the price would need to come down.

Now with the memory happenings they don't have much of a reason to make consumer gpu.
>>
>>107461094
and also jews the jews.
>>
File: 1763658567463847.jpg (1014 KB, 3283x1767)
Babe wake up, another method to improve Z-image turbo seed diversity just got released.
https://www.reddit.com/r/StableDiffusion/comments/1pg0vvv/improve_zimage_turbo_seed_diversity_with_this/
>>
File: z-image_00555_.png (2.43 MB, 1080x1920)
what is this expression trying to convey
>>
>>107461131
isnt it the same as injecting noise?
>>
File: z-image_00556_.png (2.39 MB, 1080x1920)
>>
>>107461155
you have 2 ways of injecting noise
- on the latent (image)
- on the conditioning (prompt)
that node injects noise on the conditioning, it gives seed variance while keeping the prompt adherence since it's still close to the desired prompt but with a bit of "prompt noise" at the beginning of the denoising
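
To make the "noise on the conditioning" idea concrete, here is a rough sketch in plain Python. It is not the code of the linked node: the function names, the list-of-lists embedding, and the exact meaning given to `threshold` (fraction of early steps that get the noisy prompt) and `strength` (noise scale) are assumptions made for illustration.

```python
import random

def noisy_conditioning(cond, strength, seed):
    """Return a copy of a prompt embedding with seeded Gaussian noise added.

    cond is a list of token vectors (lists of floats); the input is not modified.
    """
    rng = random.Random(seed)
    return [[x + strength * rng.gauss(0.0, 1.0) for x in token] for token in cond]

def conditioning_for_step(cond, step, total_steps, threshold, strength, seed):
    """Pick the conditioning to use at a given denoising step.

    Assumed behaviour: the noisy embedding is used only for the first
    `threshold` fraction of steps, then the clean prompt takes over.
    """
    if step < threshold * total_steps:
        return noisy_conditioning(cond, strength, seed)
    return cond
```

Under these assumptions, different seeds perturb the prompt differently in the early steps, which is where the composition gets decided, while the later steps still see the clean prompt. Latent noise injection would instead perturb the image tensor itself.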
>>
>>107461171
3 inches? thats it?
>>
>>107461181
So is there a "better" option?
On the latent or the conditioning?
>>
>>107461018
Depends, half/one third or even quarter is a good starting point but the amount of denoise sort of affects it too.
You need to eyeball it.
Not sure about zit upscaling that's more finicky.
>>
>>107461188
I'd say conditioning, feel free to test that out though, I got more cases where the model didn't listen to my prompt with latent noise injection
>>
>>107461131
Nigga you are just fishing for ideas here and making nodes
>>
>>107461199
>fishing for ideas here
wait we got this idea on /ldg/ first? where?
>>
>>107460730
have you tried stola or whats it called and avoiding cleavage
>>
File: z-image_00557_.png (2.49 MB, 1920x1080)
>>
Doesn't RES4LYF or some other schizo packs have some noise nodes?
>>
>>107461228
Schizo noise, yes.
>>
File: images-10.jpg (11 KB, 576x318)
Look, I've seen a lot of generals. And people come up to me, big strong anons, tears in their eyes, they say "Sir... sir, /ldg/ is the greatest general that has ever existed in the history of the boards, maybe ever."....
>>
File: z-image_00558_.png (2.35 MB, 1920x1080)
>>
>>107461125
>It applies to the 5090.
You might not be aware that the 5090 needs 3GB modules which are all going to datacenter GPUs. They're not pretending they can make only so few cards. I feel like you just don't know what you're talking about but thanks for not kvetching out about being called a shabbos goy

Don't try and out-conspiracy capitalism. Remember that your beliefs about the Jews controlling the world do not explain how china and Japan were allowed to become #1 and #4


>>107461094
China will not compete with even the 4090 until 2027 at the absolute earliest.

Welcome to the cyberpunk future. Buy a 5090 yesterday and regret it, or don't and regret it anyways. Learn how to make really fucking tasty mashed potatoes
>>
File: images-8.jpg (10 KB, 576x288)
>>107461239
...And you know what? They're right. We have FREEDOM here, folks. Tremendous freedom. You want anime? We got 'em. The best waifus. Photorealism so real you won't believe your eyes? We do that. 3D renders, abstract art that makes the so called "experts" cry, we have it ALL.....
>>
>>107461238
if you need schizo noise this thread's got plenty
>>
>>107460829
just give up. sdxl is really bad at historical clothing. i spent hours trying to generate medieval clothes. a complete disaster
>>
File: images-6.jpg (9 KB, 547x365)
>>107461256
Nobody gatekeeps here. This is /ldg/, and in /ldg/, ALL art is welcome at the table, beautiful art, the best art.
And the fake news will tell you "oh it's shitpost, oh nobody can agree on anything, oh the thread is moving too fast" WRONG.
>>
>>107461171
did not work

https://files.catbox.moe/d0cpcm.mp4
>>
>>107461254
>suddenly goes all schizobabble about da joos
???
>>
File: Zurbo_00034_.jpg (744 KB, 3328x1792)
Women and their trigger discipline, am I right?
>>
>>107461315
>more drama!
>I'm bored
>>
>>107461293
Other generals? They see a little drama, a little shitposting, they crumble. SAD! But /ldg/? We come out bigger, better, more powerful than ever before. We've got tech discussion, we've got schizos dropping creativity bombs, we've got the best based model trainers, it's a beautiful thing...
>>
File: 435345343.png (787 KB, 970x686)
>>107461131
Yeah, it's working alright.
>>
>>107461315
looks very hohol
>>
>>107461254
China doesn't have to compete with the 4090, they just have to pump masses of something that's on the level of the 3060 or even the 2060 and that's enough for all the games worth playing
>>
File: heh.png (10 KB, 517x58)
Kek, cool random find
>>
File: god-emperor-trump.gif (480 KB, 220x214)
>>107461342
They can't stop us, folks. They tried. They FAILED. /ldg/ ALWAYS wins.
Thank you, God bless, and God bless LOCAL DIFFUSION GENERAL!
>>
File: ComfyUI_00218_.png (1.44 MB, 1024x1024)
>>
>>107461348
what was your prompt?
>>
>>107461362
>they just have to pump masses of something that's on the level of the 3060 or even the 2060 and that's enough for all the games worth playing
This already exists, it's Intel's B580 or AMD's cards
>>
File: ZIT_00499_.png (2.6 MB, 1152x2048)
>>107460730
>>
File: output_00001.mp4 (393 KB, 320x416)
>t2i focus wan
>does videos anyway
>>
File: ZIT_00503_.png (2.34 MB, 1152x2048)
>>107461215
Nta but how's this? I googled stola and they also referred to a palla so this is with both.
>>
File: file.png (25 KB, 345x323)
should i just fucking kms myself at this point
>>
>>107461448
Gonna need more information. What are you running? What are you running on? Give us some specs.
>>
>>107461448
No just find a SFW fetish you can generate with sora and check back in on local when you're an adult with a job who can afford a 16gb gpu
>>
>>107461131
So is this good? Gonna need some proof from someone other than the guy who posted it.
>>
File: 1761837827829648.jpg (1.26 MB, 2016x1152)
>>
File: ZImage_00805_.png (1.4 MB, 896x1152)
i PVBLISHED my pixel art lora for z-image

https://civitai.com/models/685038?modelVersionId=2478063
>>
File: 1737153570104355.jpg (1.21 MB, 2016x1152)
>>
>>107461494
make a character with this and run the wan lora to get a walk cycle
>>
>>107461512
will try at some point, right now my gpu is too busy training, trying out different datasets and stuff
looks like training on the de-distilled actually fixes so much shit compared to the lora-adapter
>>
>>107461494
Nice. About that hard edge pixel art, is there any chance that'll be available for ZiT?
>>
File: hrs.png (31 KB, 568x333)
Is there a version of this shit that works with unet/gguf?
>>
File: 00007-4154932266.jpg (177 KB, 1248x1824)
>>
>>107461589
yeah probably, i'm trying out a training run at the moment, can't give you an ETA but expect it in the next few days, i won't release it unless i'm happy with the result
>>
File: ComfyUI_00004_.mp4 (172 KB, 384x576)
>>107461463
Wan 2.2. I want to try to use the animation lora to animate a sprite walk cycle.
No success yet.
>>107461512
It's not that simple
>>
File: Zurbo_00039_.jpg (1.19 MB, 3328x1792)
>>107461326
Bored girl summer.
>>
File: 2girl_00008__merged.mp4 (1.61 MB, 1440x832)
>>107461225
>>
File: 00013-2618432095.png (2.55 MB, 1248x1824)
>>
>>107461686
>10 seconds
noice, imo that's the sweet spot for some nice moments
>>
File: 00014-2567451022.png (2.61 MB, 1248x1824)
>>
>>107461686
>2girl
>only one girl
but then who was wine glass?
>>
>>107461713
>imo that's the sweet spot for some nice moments
I agree
>>
File: roman woman.jpg (146 KB, 768x1344)
>>107461215
it was the "cleavage" that was ruining it. this is just prompting
"woman wearing a roman stola and palla"
then i genned shopped cleavage onto it. looks like the model does have knowledge of this, but i would need to get creative to make it sexy
>>
>>107461666
I think you're gonna need a sprite that's well suited for getting a walk cycle. That one looks to be facing too much to the front, and the legs aren't well defined with the cape covering most of his body
>>
>>107461494
Nice!
>>
>>107461636
No worries, thanks.
>>
>>107461666
What specific version of wan2.2? What's your GPU? Are you using the lightx2v loras?
>>
File: 00032-1108772361.png (2.64 MB, 1248x1824)
>>
File: chroma nunchaku soon.png (1.01 MB, 896x1152)
Spooknik (the guy who nunchaku'd a bunch of flux tunes) has apparently made significant progress on chroma
https://huggingface.co/spooknik/Chroma-HD-SVDQ/tree/main
Yeah I know chroma has a lot of issues, but this can be useful until a NSFW Z-Image tune arrives.
The worst part of chroma is waiting so much through deformed limbs and seed lottery until you get a good image, making it run at a sane speed can make it usable perhaps.
It's still WIP, and its current outputs have bad quality it seems, but the fact that it works with an actual fused kernel speedup is promising.
Also nunchaku project is almost dead and no one is working on it right now so I am increasingly convinced that we are never getting Wan 2.2, so it would be very nice if we could get this one out at least.
>>
File: ComfyUI_00005_.mp4 (340 KB, 384x576)
New try.
>>107461806
Well in a game characters often wear robes and tunics.
Not being able to do that is kinda limiting
>>107461860
>wan2.2 14B fp8_scaled
>umt5_xxl_fp8_scaled
Last prompt was 384x576, length=53

I'm gonna try with a different character with more visible legs and maybe in a better angle
>>
>>107461860
>>107461928
Sorry forgot about my system specs:
RTX 3060 12GB VRAM
Core i5 12400
64GB RAM DDR4
SSD Nvme 2TB

But I am using two monitors and have some docker containers running, so that might be slowing down my pc
>>
File: 00036-391206154.png (2.75 MB, 1536x1536)
>>
>>107461726
>2girl

>>107460771
>>107460660
>>
>>107461928
(pixel art lora guy from above)
i have wanted to train a wan lora that adheres to a perfect pixel grid every frame for a while, but have had trouble sourcing datasets
part of me is wondering if i should just downscale+upscale random vids with nearest neighbor and try training on that
trouble with wan is my 3090 can't handle it, have to rent H100 or H200 every time and it gets expensive when experimenting
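
The downscale+upscale idea above can be sketched in a few lines. This is a toy version on nested lists, assuming each frame is a list of pixel rows; `snap_to_pixel_grid` and `block` are made-up names, and a real dataset pipeline would more likely use an image or video library's nearest-neighbor resize.

```python
def snap_to_pixel_grid(frame, block):
    """Downscale by `block` with nearest-neighbor sampling, then upscale back.

    frame is a list of rows, each row a list of pixel values.
    The result has the same size as the input, but every block x block
    cell holds a single value, so the frame sits on a fixed pixel grid.
    """
    height, width = len(frame), len(frame[0])
    # Downscale: keep one sample per cell (the top-left pixel here).
    small = [row[::block] for row in frame[::block]]
    out = []
    for row in small:
        # Upscale horizontally by repeating each sample, trimmed to the original width.
        wide = [px for px in row for _ in range(block)][:width]
        # Upscale vertically by repeating the widened row.
        out.extend(list(wide) for _ in range(block))
    return out[:height]
```

Applying the same `block` to every frame of a clip keeps the grid aligned across frames, which seems to be the property the lora would need to learn.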
>>
File: 00041-2247013851.png (2.37 MB, 1824x1248)
>>
>>107461913
use chroma cache and/or flash lora and stop using the memequants
>>
>>107461928
>>107461942
Low res gens shouldn't take that long on your pc. Are you maxing out VRAM or RAM while genning? Could be something wrong with your comfy setup as well.

That walk looks kinda passable. You should try scaling it down to some resolution that somewhat matches those fake pixels and see how it looks.
>>
>>107462023
Cache is a meme and I don't like the rigidity of distill loras
If this one also doesn't work well enough, I think I will just continue to not use chroma
>>
File: file.jpg (392 KB, 2119x1268)
>>107461483
I think it's working fine
>Photo of a couple in an Italian city, taken in the 1970s.
went for threshold = 0.15 and strength = 12 on that one
>>
i like how chill it is in these threads when the discussion dies down a bit, the quality goes up
>>
File: thinking_about_it.png (173 KB, 2688x2688)
Is there a way to introduce uncertainty into the seed/noise so that every gen no matter the seed is different?
>>107462073
Cache is literally free saved steps. Also with nunchaku chroma you'd be without loras.
>>
File: Zurbo_00041_.jpg (992 KB, 3328x1792)
Trash girl & Rat girl linked up! need it or keep it??!
Save us, frilly titty fairy.
>>
File: 00048-1766159665.png (2.65 MB, 1824x1248)
>>
File: succky.png (876 KB, 606x962)
>>107462010
I'm using this lora if you want to try or get some ideas for your own lora:
https://civitai.com/models/2172038/wan22-lora-walk-animation-side-view-sprite-animation-pixel-style?modelVersionId=2445934

Now I'm trying to animate this succubus gemini generated for me. Her legs are more visible so maybe the animation is better.

In that page some people got decent-ish results...

>>107462029
It seems I'm using the default workflow, probably made for GPUs better than mine.
It's a fresh install of the comfyui, I downloaded it today and installed in a new folder.

got prompt
Requested to load WanTEModel
Unloaded partially: 6508.44 MB freed, 742.65 MB remains loaded, 2632.54 MB buffer reserved, lowvram patches: 389
loaded completely; 8762.75 MB usable, 6419.48 MB loaded, full load: True
Requested to load WAN21
loaded partially; 8336.87 MB usable, 7357.85 MB loaded, 6271.23 MB offloaded, 975.01 MB buffer reserved, lowvram patches: 242
100%|10/10 [06:35<00:00, 39.51s/it]
Requested to load WAN21
loaded partially; 8336.87 MB usable, 7357.85 MB loaded, 6271.23 MB offloaded, 975.01 MB buffer reserved, lowvram patches: 242
100%|10/10 [06:41<00:00, 40.14s/it]
Requested to load WanVAE
Unloaded partially: 256.79 MB freed, 7101.06 MB remains loaded, 975.01 MB buffer reserved, lowvram patches: 252
loaded completely; 512.44 MB usable, 242.03 MB loaded, full load: True
Prompt executed in 00:15:27
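
For reference, a quick breakdown of where the time in that log goes, using only the numbers printed above. Attributing the remainder to model loading, text encoding and VAE decode is an assumption; the log does not time those stages separately.

```python
def to_seconds(clock):
    """Convert 'HH:MM:SS' or 'MM:SS' to a number of seconds."""
    total = 0
    for part in clock.split(":"):
        total = total * 60 + int(part)
    return total

# Two sampling passes of 10 steps each, at the s/it rates shown in the log.
sampling = 10 * 39.51 + 10 * 40.14
# Wall-clock time reported by "Prompt executed in".
total = to_seconds("00:15:27")
# Everything that is not sampling (assumed: loading, offloading, encode, decode).
overhead = total - sampling
print(f"sampling {sampling:.1f}s, other {overhead:.1f}s, share {sampling / total:.0%}")
```

So roughly 13 of the 15 minutes are the 20 sampling steps themselves at about 40 s each, which points at the partial loading and offloading during sampling rather than at the model load or the VAE.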
>>
>>107462087
>Is there a way to introduce uncertainty into the seed/noise so that every gen no matter the seed is different?
not really, with ZIT it's so distilled that random noise always converges to the same thing given the same prompt. you can hack around it like start_at_step 1 but that's just a hack, it's basically img2img going from 1 step genned on an empty text embedding to whatever you are trying to gen
>>
>>107462087
>Cache is literally free saved steps.
Is this cache something different than the block caching in teacache/easycache/etc? Because these aren't "free saved steps"
>Also with nunchaku chroma you'd be without loras.
No it has lora implementation.
Why do people keep spreading this BS lie about nunchaku? Flux has official lora implementation and Qwen has an unofficial one that works.
>>
File: 00049-1766159666.png (2.63 MB, 1824x1248)
>>
>>107462087
you could do something like this and then use a new noisy latent/inject noise node in between on a different seed at low strength
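A rough sketch of the "inject noise on a different seed at low strength" idea, in plain Python rather than as a ComfyUI node. The latent here is just a list of floats standing in for a partially denoised latent, and `inject_noise`, the seeds and the strength value are all made up for illustration. Real inject-noise nodes may scale the noise differently (for example by the current sigma).

```python
import random

def inject_noise(latent, seed, strength):
    """Add Gaussian noise drawn from `seed` to every latent value, scaled by `strength`."""
    rng = random.Random(seed)
    return [x + strength * rng.gauss(0.0, 1.0) for x in latent]

# Hypothetical stand-in for a latent after the first sampling stage.
base_rng = random.Random(0)
latent = [base_rng.gauss(0.0, 1.0) for _ in range(16)]

# Same latent, two different injection seeds: the second stage would start
# from slightly different points even though prompt and main seed match.
variant_a = inject_noise(latent, seed=1, strength=0.1)
variant_b = inject_noise(latent, seed=2, strength=0.1)
print(variant_a != variant_b)
```

Strength 0 leaves the latent untouched, so the knob goes smoothly from "same image every time" to "mostly noise".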
>>
>>107462131
the creator of chroma cache said it works the exact same as teacache, meaning it significantly reduces quality the more aggressive the caching is. It was very noticeable, even at low values, so I stopped using it.
>>
>>107461131
can z-image do nsfw?
>>
File: 00054-3478193880.png (2.73 MB, 1152x2016)
>>
>>107462164
it's not as censored as flux but you won't get much nsfw without a lora
>>
>>107462160
That's exactly what I expected it to be, thanks for confirming.
>>107462164
Out of the box it only does "okay" boobs.
>>
File: 00060-1513816774.png (2.77 MB, 1248x1824)
>>
>>107462164
you can generate plenty of sexy poses without loras. but the nudity is poorly rendered, with bizarre genitalia for both genders. the good news is that the adult loras are better than Flux or Qwen
>>
>>107462164
The loras kinda suck. It does alright on nipples on its own with most being almost passable while some looking kinda decent and some clearly fucked but not really body horror. Pussy comes out looking like taint or some fucked up sealed up tranny surgery scar. Dicks are nightmare tier flesh sausages.
>>
>>107462164
The only explicit thing it can do is breasts. It clearly knows genitalia because if you prompt penis, it knows the general shape and where to put it, but it's mutilated.

All the penis loras I tried only work for very specific poses and aren't flexible, but that's to be expected when training a lora from a distilled model. It doesn't matter how much data you throw at it, it'll never be good.
>>
>>107462164
it can do nipples, and the loras for vaginas, dicks and sex aren't very good
>>
So for anime porn, Illustrious models are still king.
A SDXL based model, an architecture from what, 3 years ago? Stable diffusion really cooked.
>>
base: never ever
>>
>>107462266
Even for realistic porn, illustrious is just better because it has good checkpoints and loras. Yeah the realism won't be as good as these newer models but the porn will be better.
>>
File: 2girl_00012_.mp4 (1.8 MB, 1440x800)
>>107462099
>>
File: Zurbo_00002_.jpg (1.08 MB, 3328x1792)
1girl, squatting, eating
The creativity knows no bounds.
>>107462299
God damn, that's feral. Kinda creepy.
>>
File: Z-image turbo.png (1.57 MB, 1280x720)
>>
>>107462318
>anon thinks a hobo/feral girl looks like an OF model that just woke up from bed with no makeup

So sheltered
>>
How do you just remove clothing on a regular picture?
>>
>>107462378
ask computer nicely
>>
>>107462378
remove clothing lora + qwen image edit 2509
'remove her clothes'
next
>>
File: Z-image turbo.png (1.16 MB, 1280x720)
>>
File: 2girl_00013_.mp4 (1.47 MB, 1440x800)
>>107462318
>>
File: 1759393014508632.jpg (186 KB, 909x732)
I feel sorry for ramlets. I upgraded to 128gb ddr5 a year ago for $230. it'd cost $1,758 ($879 x 2) for the same upgrade now, jesus fucking CHRIST.
>>
>>107462378
https://civitaiarchive.com/search?base_model=Qwen&platform_status=deleted&rating=explicit
>>
>>107462495
$460*(230x2)
>>
File: Z-image turbo.png (1000 KB, 1280x720)
>>
File: ComfyUI_00012_.mp4 (2.59 MB, 1126x2048)
>>107462495
I paid 479€ for my 64GB kit on black friday. I guess I should consider myself lucky to have gotten it before the prices went up even more.
>>
>>107462597
"low-poly" completely ruins the prompt and makes everything look like a minecraft/roblox ripoff
>>
>>107462618
I tried to put minecraft on the negative (NAG) but it didn't work yeah
>>
File: Nano Banana Pro.jpg (780 KB, 2752x1536)
>>107462597
Nano Banana Pro is in a league of its own lol
>>
>>107462639
>one gorbillion parameters and a dataset consisting of Everything on the internet
if it weren't itd be embarrassing
>>
>>107462495
I bought 64gb for £95 about a year ago, now it's somewhere around £250 for the same ones
>>
>>107462645
>a dataset consisting of Everything on the internet
what's preventing the others from doing the same and having some fucking balls
>>
>>107462639
That is a bit closer to accurate though there were basically no games that looked like the ps1 example back then. PS3, 4 and 5 look too good.
>>
Does a node exist that allows me to use global variables in prompts to do string formatting? IE,
{character} is having sex with a man. {character} has {hair_attributes}. I have a lot of character loras that I queue up when going to work, and it's becoming tedious to replace each character every time I queue up a new prompt
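Outside of any particular node, the `{character}` / `{hair_attributes}` substitution being asked about is plain string formatting. A minimal sketch in Python, assuming you keep one row of variables per character; the template text, character names and attributes below are placeholders, not anything from an actual node pack.

```python
# Template with named placeholders, same brace style as in the post.
TEMPLATE = "{character} is sitting in a cafe. {character} has {hair_attributes}."

# Hypothetical per-character variables; swap in your own lora characters.
CHARACTERS = [
    {"character": "alice", "hair_attributes": "long red hair"},
    {"character": "bob", "hair_attributes": "short black hair"},
]

def build_prompts(template, rows):
    """Fill the template once per row of variables."""
    return [template.format(**row) for row in rows]

prompts = build_prompts(TEMPLATE, CHARACTERS)
for prompt in prompts:
    print(prompt)
```

Each resulting string can then be queued as its own prompt, so only the table changes between batches.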
>>
>>107462495
>>107462603
I purchased the last RAM out of Saigon for my country which was a single 32gb Lenovo stick for $150 pushing me up to 64gb of ram

I probably wouldn't have upgraded if the apocalypse did not occur
>>
>>107462650
It's a paid service retard
>>
File: ComfyUI_00002.webm (3.35 MB, 1280x1280)
>have plenty of vram but only 32gb ram
>because of that, ssd always gets raped by offloading onto pagefile when switching from high noise to low noise wan models
>remember that RAMMap exists
>try manually cleaning up memory during a model switch
>it works, and my ssd doesn't get raped
>"cool, I bet there are custom nodes that automate the process"
>any "memory management" or "memory cleaning" nodes I could find either don't work or execute once when they get passed by in a workflow
>ask gemini to write a daemon that works throughout the whole workflow and executes whenever memory fills up past a certain threshold
>it fucking works
Why the fuck are there no custom nodes for this
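For anyone wondering what such a daemon might look like, here is a minimal sketch of the general pattern: a background thread polls memory usage and fires a cleanup callback once usage crosses a threshold. This is not the script from the post. `RamWatchdog`, `get_used_fraction` and `on_pressure` are made-up names, and both the memory probe and the cleanup action are passed in as callbacks because the post does not say what the real script calls.

```python
import threading

class RamWatchdog:
    """Poll a memory probe in the background and run cleanup above a threshold."""

    def __init__(self, get_used_fraction, on_pressure, threshold=0.9, interval=1.0):
        self.get_used_fraction = get_used_fraction  # returns a value in 0.0..1.0
        self.on_pressure = on_pressure              # cleanup action to run
        self.threshold = threshold
        self.interval = interval                    # seconds between polls
        self._stop = threading.Event()
        self._thread = threading.Thread(target=self._run, daemon=True)

    def check_once(self):
        """Run a single poll; return True if cleanup was triggered."""
        if self.get_used_fraction() >= self.threshold:
            self.on_pressure()
            return True
        return False

    def _run(self):
        # Event.wait returns False on timeout, so this loops until stop() is called.
        while not self._stop.wait(self.interval):
            self.check_once()

    def start(self):
        self._thread.start()

    def stop(self):
        self._stop.set()
        if self._thread.is_alive():
            self._thread.join()

# Demo with fake readings instead of a real memory probe.
readings = iter([0.5, 0.95])
events = []
dog = RamWatchdog(lambda: next(readings), lambda: events.append("cleanup"))
dog.check_once()  # 0.5 is below the threshold, nothing happens
dog.check_once()  # 0.95 is above the threshold, cleanup runs
print(events)
```

On a real machine the probe could be something like `psutil.virtual_memory().percent / 100` (psutil is a third-party package), with `start()` called once so the check keeps running for the whole workflow instead of only when a node is executed.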
>>
>>107462666
The only reason I even upgraded in February was because of WAN2.1's torch compile eating up all my ram. It's not really needed with WAN2.2 since it's MoE, but I'm fucking glad I did.
>>
>>107462639
>turns to slop after ps2
man NBP is smarter than i thought
>>
File: please.gif (64 KB, 200x186)
>>107462667
Yeah sure whatever, as if anyone gives a fuck about that.
>>
>>107460375
i think ill have the beta for my ui out next week, not sure tho still tinkering with things
>>
>>107461913
Based. Chroma HD is neat, but someone needs to tell the guy about Chroma HD flash as well. A nunchaku HD Flash version would be as fast as Z turbo and is already better than base/HD versions in terms of convergence.
>>
File: Nano Banana Pro.jpg (801 KB, 2752x1536)
>>107462659
>>107462639
ok it definitely cooked on that one
>>
File: file.png (710 KB, 887x726)
>z-image loras
>>
>>107462695
is it a comfy fork?
>>
>>107462703
z-image needs more oriental female loras.
>>
>>107462661
wildcards
>>
>>107462702
that ps2 one could maybe be an early ps2 game, FF12 looked more like the ps3 example and it was a ps2 game
>>
>>107462678
Well clearly now you can make a github for it and share
>>
>>107462713
I was trying to make fun of those weird eyes and the lora being called "beauty girl"

those eyes are fucking freaky
>>
File: zimg_0091.jpg (1.68 MB, 9152x1336)
i've been using noise injection with my workflows for quite some time now, mostly because it can add details when injected later in the diffusion steps. all this discussion got me curious so here's what happens when you inject a fair bit of noise at different points in the process
>>
>>107462703
So many Z-Image LoRAs just to do things that the base model can already do, but they're too lazy to figure it out via prompting, and their LoRA sucks so much that it degrades the outputs anyway.
>>
>>107462757
that applies to every model, there are a shitload of loras for the most basic of shit
>>
>>107462702
Local can do this too. What you need to do is get images from every single gaming generation (20 images from each) and train separate loras.
Then you need to generate an image for each gaming generation using the trained loras (that took about 10 hours total).
After that you have to open Krita or GIMP and paste them in order against a white background.
Oh yeah, don't forget to go on Google images to get every console logo so that you can paste it above each image you generated from the loras.
Then go on ldg and say local will catch up to NBP after the finetunes start rolling in.
>>
>>107462730
>github
Nah, too lazy for that. Best I can do is just share as is https://files.catbox.moe/ipmwwx.py
>inb4 pickle, malware
Just write your own. Anyways, the node is "RAM Watchdog", just place it somewhere in a workflow and connect a "display text/anything" node to it. If you see a special message then you're good
>>
>Positive Prompt: Screenshot in the style of 3D PS2 video game featuring a knight wielding a sword in his right hand and a shield in his left hand, as he traverses a dark fantasy dungeon with a glowing lantern. It is a third-person action RPG. The walls are made of stone and there is an orc behind him. The sky is visible, with stars and a crescent moon glowing bright. The textures are low resolution but vibrant.
>Negative Prompt: 2D, GUI, interface, UI, logo, Minecraft, Roblox, cartoon, Pixar, Disney, drawn
This gives some interesting results
>>
>>107462780
keek
>>
File: Nano Banana Pro.png (1.66 MB, 1376x768)
>>107462702
>replace Cloud with Kasane Teto
oh man, imagine if we had a model with this much knowledge...
>>
>>107462754
So the take away is that it's pointless to do it?
>>
>>107462650
Google has petabytes upon petabytes of premium quality low background steel depicting every subject from every angle in the form of their YouTube data.
>>107462645
>if it weren't itd be embarrassing
>>
File: 1759316209925596.jpg (332 KB, 1372x649)
>>107462703
saar may i interest you in dress your beauty girl in traditional saree sarr
>>
>>107462495
I have 64GB DDR4 which I paid $100 for. But it's not fair bros. If I had gone for DDR5 instead (and I could have) I would be able to sell my RAM for a 5090 (in exchange for cheaper DDR4 RAM). You'd be dumb not to do it now.
>>
>>107462815
>Google has petabytes upon petabytes of premium quality low background steel depicting every subject from every angle in the form of their YouTube data.
it's not just the amount of data that matters, but the quality of the captions too, they probably have half of india doing this shit manually
>>
>>107462816
>trained on 6 512x512 screenshots from a 25 year old vhs rip
>>
File: 1739607460174826.jpg (175 KB, 832x1216)
>>
>>107462829
how extremely jeet of them
>>
>>107462745
yeah it's pretty funny
>>
>>107462833
gross shit like that belongs in >>>/trash/
>>
>>107462829
yes saar very good quality
>>
File: zimg_0095.jpg (2.07 MB, 9152x1336)
>>107462814
if you don't see any difference or benefit then no, not at all

here's injecting a ton of noise as an example
>>
>>107462847
Are you trying to say you see a benefit?
>>
File: ComfyUI_00218_.mp4 (235 KB, 640x480)
>>
>>107461913

>Wan 2.2

lol true, we're NEVER getting wan anything for chaku. The developer said many months ago they'll do wan after qwen as seen here: https://www.reddit.com/r/StableDiffusion/comments/1mpceox/nunchaku_svdq_hype/

But that never happened...Anyway, just use woct0rdos radial attn instead, they fixed the weird resolution thing apparently
>>
File: 546456353.png (937 KB, 1113x780)
>>107462816
Z-Image can already do sarees natively. What is even the purpose. To con idiots who don't mess with the AI?
>>
i'm bored with z-image already. give us the base model or go away
>>
im not impressed
>>
File: ComfyUI_00222_.mp4 (225 KB, 640x480)
>>
File: Z-image turbo.png (1.7 MB, 1280x720)
>>
>>107462871
easy buzz if retards fall for it
>>
>>107462645
Don't think it's that big. Probably just an MoE.
>>
idk anything about ai, but are any of the generators in the op as good as google's nano banana or whatever it's called?
>>
>>107462824
>they probably have half of india doing this shit manually
to make the initial dataset, probably. I think they probably just used something like Gemini 2.5 pro for all captions though
>>
>>107462700
It supports loras so you should be able to load flash loras.
>>
>>107462958
>load flash loras

I don't think Flash LoRAs are the same as the HD Flash model we got.
>>
>>107462910
Wtf do you even do with buzz?
Why are the civitjeets constantly beg, spam dogshit loras and make random shitmixes for it?
>>
File: office-group-photo.jpg (3.03 MB, 2752x1536)
>https://ai.google.dev/gemini-api/docs/image-generation
Can Z-Image do this without loras or inpainting?
>>
>>107462938
It depends on what you're after. Local models are size limited to what we can realistically run on modern PCs but some of them are still pretty damn good for what they do. For example this new Z-Image Turbo is quite impressive and fast.
>>
>>107462970
I thought it was just one of them baked into the model.
Regardless, once every single aspect of the base chroma is figured out, he would probably make one for the flash too.
>>
>>107462808
Based free thinking, no ideology bs Nano Banana poster. We need people actually testing and comparing local with SaaS models critically and objectively. Local cult echo chambers are a meme
>>
>>107462987
you could do that with an edit model that can handle more than 5 image inputs. BFL said that Flux 2 dev could handle this, but man, every additional image input makes shit way slower: without an image input I wait 4 min, with 2 image inputs I wait 7 min... does anyone know if Z-image edit will be a single or a multiple image input model?
>>
>>107462983
they probably gen with buzz on civit since they're jeets and likely only have a shitty phone

buzz is also used to train models on civit

>train shitty lora on poverty buzz
>get retards using your lora and gain buzz
>train new lora with said buzz
>>
>>107462993
>depends on what you're after
Just from very simple cat and chibis pics, nothing big.
I'll check out z-image. Thanks, bro.
>>
>>107463005
>>>/sdg/
>>
File: ComfyUI_temp_ovkyi_00010_.png (3.81 MB, 1840x1200)
The fps meter in comfy is really fucking funny lately. The frontend is totally raped. I went from idle 999+ to ~80
>>
>>107463022
People here actually experiment. Your general is a discord shithole. Learn to improve yourselves first.
>>
>>107463058
based
>>
File: Wanimate_00113.mp4 (1.01 MB, 832x576)
>>107462443
Thoughts?
>>
File: future.png (10 KB, 816x85)
>>
File: ZIT_00564_.png (3.96 MB, 2048x1152)
>>107462987
Yeah.
>>
the forced depth of field in z image is awful
>>
>>107463058
what the actual fuck are you talking about?
>>
>>107463061
Sexo, woman can't compete.
>>
>>107462164
I got it to do everything from doggystyle, blowjobs, missionary, threesomes, group shit, the penis all look mangled but other than that in terms of the comp it gets it fine. My guess is they trained it with censored jav and shit so all it ever saw is mosaic'd genitals
>>
>>107463061
Give the tits some more jiggle
>>
>>107463006
>>107463068
>Both missed the point and are admitting it can't without saying it.
>>
>>107463068
z expressions are not bad at all. paid nano slop rekt
>>
>>107463071
/sdg/ is a tranny misery pit, the sooner you admit it, the better.
>>
>>107463067
>They might open source something soon
Sure. Another ultra cucked oss model to the list kek
>>
>>107463097
>didn't define the point
>HA YOU MISSED THE POINT
great job retard
>>
>Mention you think the base model probably isn't coming
>People here actually get offended.

Why are you like this?
>>
File: Zurbo_00010_.jpg (1.09 MB, 3328x1792)
>>107463061
That's a big ol' yikes from me chief.
>>
>>107463099
seriously.. im glad there is a diff thread for gens because i absolutely cannot stand the garbage spam in sdg.. the retard that thinks they're so clever spamming whatever their stupid [list of objects with girl in list of backgrounds] shit is so goddamned retarded and they've been doing it nonstop for at least a year now
>>
File: ComfyUI_00233_.mp4 (377 KB, 832x640)
>>
>>107462780
Ok, it can do those specific narrow styles, but what if there are other styles I want to do with Nano? What then, wait for google-sama to update, and if I wish vweeeery hard at a shooting star they may just include the one I wanted?
>>
>>107463110
This is a turbocope general, bud, what did you think would happen when free toys are denied?
>>
>>107463110
Because you don't know shit, and there's no point in you spamming your same retarded 'prediction' 100 times per thread

Get a fucking life
>>
File: zzzz.jpg (3.01 MB, 3736x2486)
how come all the seeds are the same and everything i gen has no character to it, it's almost always a flat perspective of the 1 girl. are you supposed to prompt paragraphs to describe everything? the workflow i downloaded used 4 steps with cfg 1 which i read means you can use a negative, so how would i go about getting rid of the blurry depth of field? can i use 5 cfg 40 steps like a normal model?
>>
>>107463135
Why are you getting upset at me when it's Tongyi who won't release it?
>>
>>107463147
>are you supposed to prompt paragraphs to describe everything
yes
>>
>>107463069
become a nagger
>>
>>107463153
He's trying to cope. Just let him be.
>>
>>107463153
It will release, you know it as well, so stop pretending

>OMG it's been 10 days since Z-Image Turbo released, a revolutionary model, why hasn't the Base model been released yet despite being a more complex model to train!!!!
>>
is there a way to feed context of movement or images from previous latent or images to the next sampler in wan? Basically to say

"Hey this is what it looked and moved like, now continue from here"
>>
>>107463163
I suggest you get more familiar with Chinese culture, because their communication since releasing turbo strongly points to the base not being released.

The turbo model (trained using the base) should indicate to you as much. Everything explaining that fact away is cope.
>>
>>107463147
>how come all the seeds are the same
The model has poor seed variance
>are you supposed to prompt paragraphs to describe everything?
Just write a sentence or two for your idea and give it to grok
>4 steps with cfg 1
You want 8-10 ideally
>which i read means you can use a negative,
You can't use negatives with cfg1, it has no effect
>so how would i go about getting rid of the blurry depth of field?
NAG but it needs more vram
>can i use 5 cfg 40 steps like a normal model?
No you would get fried garbage.
You need to wait for the base model for that.
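Why the negative does nothing at cfg 1: in the usual classifier-free guidance formula the final prediction is `uncond + cfg * (cond - uncond)`, and at cfg = 1 the negative term cancels out completely. A small sketch with toy numbers; `cfg_combine` and the vectors are illustrative only, and this is the textbook formula rather than any specific UI's sampler code.

```python
def cfg_combine(cond, uncond, cfg):
    """Standard classifier-free guidance mix: uncond + cfg * (cond - uncond)."""
    return [u + cfg * (c - u) for c, u in zip(cond, uncond)]

cond = [1.0, 2.0, 3.0]    # toy prediction for the positive prompt
neg_a = [0.0, 0.0, 0.0]   # toy prediction for one negative prompt
neg_b = [5.0, -4.0, 9.0]  # toy prediction for a very different negative prompt

# At cfg 1 both negatives give back the positive prediction unchanged.
print(cfg_combine(cond, neg_a, 1.0))
print(cfg_combine(cond, neg_b, 1.0))

# At cfg 5 the negative actually pushes the result around.
print(cfg_combine(cond, neg_b, 5.0))
```

Since the result at cfg 1 equals the positive prediction no matter what the negative is, an implementation can skip evaluating the negative entirely, which is also part of why cfg 1 runs faster.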
>>
File: Wanimate_00114.mp4 (1009 KB, 528x960)
>>107461416
Thoughts?
>>
>>107463183
how use NAG?
>>
>>107463163
>despite being a more complex model to train
you do realize that turbo was made from base right? why did they distill an unfinished base model in the first place?
>>
>>107463188
https://github.com/scottmudge/ComfyUI-NAG
>>
>>107463099
jesus
I was telling them to go there because I didn't want that shit here, retard
>>
>>107463188
>how use NAG?
https://www.reddit.com/r/StableDiffusion/comments/1pbrbrt/nag_normalized_attention_guidance_works_on_zimage/
>>
Man this damn hobby is filled with people who need to be spoonfed every fucking little thing...
>>
File: is_it_really_base.png (318 KB, 1615x565)
>>107463163
We might not even get the true base model. You want the pretrain, not the SFT which local might get.
>>
File: ZImage_00840_.png (1.05 MB, 1152x896)
>>
File: ComfyUI_00041_.png (3 MB, 2040x1152)
feed me daddy
>>
>>107463245
>>107463245
>>107463245
>>107463245
>>
>>107463232
Would this actually have a significant enough impact that would harm large scale finetuning?
>>
>>107463177
>I suggest you get more familiar with Chinese culture
lel like you know jack shit about Chinese culture

If they weren't releasing it they would just remove the 'to be released' from their official github page

Flux sister you can stop, Base will be released
>>
>>107463268
i eat cat livers and frog skulls so i think i know more about it than you do
>>
>>107463203
From 'a' base model, whose only reason for existing was to distill from, as in generate images for the distilled model to train on.

This to be released Base model is trained to be a foundation model, as in a model made for further finetuning.

Since you don't know anything about AI models you can just stop.
>>
File: zit_00003_.png (1.75 MB, 1504x1024)
>>
>>107460682
just use the default comfyui template
>>
>>107460620
sounds like an opportunity for AMD and Intel
>>
>>107463959
it is but they still need the memory to make them and I'm not sure if amd's next gen can beat the 5090 and intel is much less likely to get there
>>
>>107464066
just need the memory bandwidth. RAM will catch back up eventually.
>>
>>107464089
I mean that intel and amd still have to consider if it's worth investing in the consumer market if the other option is selling more enterprise level stuff if memory chips are the bottleneck
>>
>>107464106
If Nvidia boxes them out, they might have no choice but to go consumer. If the west was still white, it would be a no brainer. People would actually be doing things for themselves. Perhaps the chinks will drive the consumer market.
>>
>>107463061
would


