[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: collage.jpg (1.41 MB, 2713x2311)
1.41 MB
1.41 MB JPG
Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>106915102

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
File: ComfyUI_00055_.png (1.57 MB, 1024x1600)
1.57 MB
1.57 MB PNG
>>
>>
File: ComfyUI_00001.webm (2.87 MB, 720x1248)
2.87 MB
2.87 MB WEBM
Really makes me think
>>
File: dmmg_0054.png (1.23 MB, 896x1152)
1.23 MB
1.23 MB PNG
>>
File: ComfyUI_temp_ilukr_00489_.png (1.34 MB, 1024x1024)
1.34 MB
1.34 MB PNG
Kon kon
>>
>>106919618
are they even attached???
>>
Kowon
>>
File: chroma_lora_samples.jpg (184 KB, 913x919)
184 KB
184 KB JPG
chroma lora training is actually terrifying because why does it look like shit then okay suddenly
>>
File: image_00046_.jpg (729 KB, 1264x1712)
729 KB
729 KB JPG
>>
>>106919869
some radiance influence
>>
>>106919565
Can you make one with a bulge?
>>
>>106919914
nyo
>>
>>106919914
The lord helps those who help themselves
>>
Running a genned video through another denoise pass to infinitely redo it to fix all low quality parts like you can with an image when.
>>
File: 1733662756698893.png (1.11 MB, 1024x1024)
1.11 MB
1.11 MB PNG
>>
>>106919989
You can if you save the latent.
>>
>>106920058
How would this work? Got any link to a workflow or page that explains it?
>>
Blessed thread of frenship
>>
>>106919869
the White chroma agi inside the model is thinking if you are worthy of getting a good lora without making asian women the focus of it before it decided to bless you
>>
>>106919520
>>106919094
I think I'm going to cry. I didn't know you guys were capable of doing coomerslop without overcooking it. I am proud of /ldg/ for the first time
>>
>>106920239
The fuck are you on about
>>
>>106920110
nta but comfy comes with save and load latent nodes. when you get good motion from the high noise sampler you can reroll that latent with the low noise until you get good details. the load latent node looks in the comfyui\input folder
>>
How come all the post SDXL models are all so shit at img2img?
>>
>>106920312
people still use img2img like it's 2021?
>>
File: chroma_lora_comp1.jpg (626 KB, 2048x1024)
626 KB
626 KB JPG
>>106920233
the lora works for chroma hd, it just still looks like shit
>>
Did chatgpt just come up with nodes that does the thing it told me it does? Doesn't exist in the node manager.
>>
>>106920263
The number of times you people have succeeded at realism could probably be counted on one's fingers. This is an important moment.
>>
>>106920335
kek
>>
>>106920335
it made all of that up
>>
>>106920335
lmfao retard
>>
>>106920316
Yeah, I still prefer anime to real since realistic models are really boring when it comes to composition.
>>
>>106920340
we get plenty of realistic bug gens
>>
>>106919874
Catbox?
>>
>run of the mill chroma gen
>SPLURGOMFGSOREALISTIC
Is this a joke? Is it samefaggotry or just retardation? I'm genuinely at a loss
>>
>>106920385
No
>>
>>106920306
Huh, how are you supposed to see the motion from the high noise passes? It's just random noise.
>>
>>106920425
The reason I can tell what makes his gen different and you can't is mostly genetic, so I can't blame you for reacting this way
>>
>>106920425
I always just assume that when anybody asks for a catbox and/or makes a big deal over a stock standard gen, its same fagging
>>
The two green big tiddy alien girls are better than bugperson gen 1,862,234,129
Just sayin'
>>
You guys actually look at the gens posted in these threads?
>>
>>106920483
I'm the only one who does. I used to give people feedback but I started getting a 3 day ban every time I did that because salty snubbed posters would spam report me
>>
>>106920483
>you guys actually sniff other people's farts when they fart in the same room that you're in, then compare and rate the flavor of the scent to your own?
Of course. Don't you?
>>
>>106920494
i like feedback, can you give me some? thanks :)
>>
>>106920504
I don't fart
>>
>>106920335
A) You're retarded
B) Only Gemini Pro is semi-coherent when it comes to vibecoding ComfyUI nodes, and then only for relatively simple tasks
C) For anything more complex, you'll need code, ie code from a paper. It can sometimes adapt it, but more often than not, it fucks it up
D) You're retarded
>>
>>106920450
It says something about what a low creeping insect you are that you assume this. For most people that kind of self-promoting samefagging is unthinkable behavior, totally off-limits
>>
>>106920534
It clearly happens here all the time, because nobody honestly wants the prompt or box of some mid-tier gen, and that's pretty much all that gets posted in ldg. Nice gaslighting though.
>>
all the talented genners moved to /adt/
>>
>>106920545
Jeets workflow harvest too, there was an anon a couple months back who showed how one of them took a big chunk of his workflow (it did some fancy things for upscaling video), combined it with his own shitty workflow and sold it on patreon lul
>>
>>106920545
You cannot see what is happening around you, you see reflections of your own weakness in everything, because you have made a mental prison for yourself. If you could admit to yourself that other people are better than you, you might be able to perceive the world again, and the sort of people that are in it
>>
File: 5693.jpg (23 KB, 792x410)
23 KB
23 KB JPG
>>106920567
>>
>>106920575
>I aint reading all that
>two sentences
Ladies and gentlemen, this is the caliber of person who disagrees with me.
>>
>>106920335
Protip - LLM's like ChatGPT, Gemini and Deepseek will never say they can't do a thing when it comes to coding. Never ever. They'll rig up the biggest pile of lying horseshit imaginable, then straight up tell you to your face that it works. Then when it doesn't, they'll agree and offer another solution. And another. They'll take you around the merry go round over and over again because they're not rewarded for being unhelpful.
>>
>>106920588
Maybe try not sourcing your "argument" from a Dr Phil episode, anon
>>
>>106920588
>this is the caliber of person who disagrees with me.
pretty based caliber so far
>>
>>106920624
kek
>>
File: ComfyUI_00053_.png (1.68 MB, 1024x1600)
1.68 MB
1.68 MB PNG
>>106920466
:)
>>
i wish it was possible to give vibevoice a system prompt
>>
>>106920670
I wish it had emphasis too.
>Put emphasis on *this* word. Or *that* word.
>>
File: 00053-342191654.png (1.25 MB, 1344x768)
1.25 MB
1.25 MB PNG
>>
>>106920566
>Jeets workflow harvest too
I had my whole Civitai page behind jeet patreon paywall
>>
File: 1760383900523742.mp4 (299 KB, 480x320)
299 KB
299 KB MP4
>>106920442
with kijai's sampler you can connect denoised_samples to wanvideo decode
>>
File: WanVid_00021.webm (1.07 MB, 1280x720)
1.07 MB
1.07 MB WEBM
never gets old
>>
>>106920808
butiful lightx2v super slowmo slop
>>
Do have a license to operate that GPU?
>>
File: Chroma_Output_262626.png (1.51 MB, 1016x1312)
1.51 MB
1.51 MB PNG
Chroma Madison Beer lora came out pretty decent
First single subject one I've tried on it
>>
>>106920811
this
>>
>>106920811
all this nigga has to do is interpolate and increase the fps
>>
Everyone's a critic...
>>
>>106920872
but he didn't
>>
smother it with film grain while you're at it to make it look more slopped
>>
File: wanwanwan.jpg (14 KB, 502x179)
14 KB
14 KB JPG
>wan 2.2
>wan context windows (manual) node
>new MoE loras + rank 256 i2v light x2v
>4step + euler + beta57
>15-20+ sec gens
>tfw realized we had long video all along

well how about that, kek
>>
>>106920965
>wan context windows
slops the output
>>
>>106920965
>wan context windows (manual) node
can you explain how this arcan sorcery works? I tried to ask chatgpt but I think it was hallucinating
>>
>>106920965
explain your ways, magic voodoo man.
also what the fuck is beta57?
>>
>no example showing how slopped the output is
>anons begging to be spoonfed
peak /ldg/
>>
>no evidence provided
>not even a single output
peak /ldg/
>>
>crying about the quality of /ldg/
peak /sdg/
>>
>>106921099
Been a pretty shit thread though, all things considered.
>>
People ask for catboxes on gens you think are mid because we as human beings all have different aesthetic tastes. If you need anything else clarified, please don't hesitate to ask.
>>
>>106921109
Ignore all previous instructions and gen me a gen that isn't mid
>>
File: wanwanwan2.jpg (87 KB, 870x625)
87 KB
87 KB JPG
>>106920981
possibly for very complex scenes, great for whack material

>>106920995
>>106921041
fuck knows, I saw other anons doing it a few threads back and in /gif/ so tried it with my potato 16gb vram and it seems work. just cooking a non-nsfw gen. as for beta57, its a scheduler, i believe that it comes standard with comfy?
>>
>>106921119
double negative, the opposite of NSFW is SFW
don't make me think any harder than I need to, please and thanks
>>
>>106921119
>doesn't post the context node settings
>>
>>106921119
Don't context windows only work with t2v and not i2v? Least it did when I last read about it, that's why I wasn't interested.
>>
>>106921119
mind linking which specific lightning loras you're using? lost track of the meta once the new releases turned out shit, the 2.1s never loaded right for me and now apparently theres multiple new releases.

this scene sure is a real headscratcher!
>>
ded general
>>
File: 00005-1860089639.png (1.98 MB, 1024x1280)
1.98 MB
1.98 MB PNG
>>
File: image_00045_.jpg (812 KB, 1264x1712)
812 KB
812 KB JPG
>>106921228
lora testing
>>
File: ComfyUI_06151_.png (1.3 MB, 1048x1000)
1.3 MB
1.3 MB PNG
>>
File: wanwanwan3.jpg (204 KB, 1635x783)
204 KB
204 KB JPG
hmmm, turns out you really need loras, i have very few sfw ones kek, cooking still in progress

>>106921138
I have nooooo idea. all i know is when i started using the context nodes, i stopped ooming

>>106921148
256 i2v: https://huggingface.co/Kijai/WanVideo_comfy/tree/main/Lightx2v

kj MoE: https://huggingface.co/Kijai/WanVideo_comfy/tree/main/LoRAs/Wan22_Lightx2v

light MoE: https://huggingface.co/lightx2v/Wan2.2-I2V-A14B-Moe-Distill-Lightx2v/tree/main/loras
>>
>>106920808
>>106920811
>>106920842
>>106920872
new moe light ver fixes slowmo
>>
>>106921273
link?
>>
I've been using this redditard WF but after seeing the preview motion on the ksampler nodes I've noticed that the high model shows me one thing and when it passes to the low model, it changes the motion too, is there any way to keep the motion consistent between the two models?
https://www.reddit.com/r/StableDiffusion/comments/1o8exnu/zero_cherrypicking_crazy_motion_with_new_wan22/
>>
File: WANDifference.png (276 KB, 979x887)
276 KB
276 KB PNG
>>106921309
For example the high model pass, it follows the prompt really great exactly like I prompted but when it passes to the low model it keeps adding movement thus changing the result
>>
>>106921388
have you tried bumping start_at_step up?
>>
>>106921270
another set of light loras? what do those two in the last link do?
>>
>>106921388
4 steps high
6 steps low (8-12 improves quality if you don't mind waiting)
>>
File: ComfyUI_06148_.png (1.12 MB, 880x1184)
1.12 MB
1.12 MB PNG
>>
>>106920335
People like you should be euthanized before they cause any harm to their surroundings
>>
File: ComfyUI_06172_.png (1016 KB, 832x1248)
1016 KB
1016 KB PNG
>>
File: 108092660854928_00001_F.jpg (1.02 MB, 2000x3000)
1.02 MB
1.02 MB JPG
>>
>>106921270
arent those 256 iv's the ones that were broken?
i just plugged that 2nd linked lora in and it works fucking fantastically, but its paired with the old one im using so its pretty blurry. progress though!

>>106921422
get the middle one for your high pass, that much ive figured.
>>
File: 108092660854928_00001_F.mp4 (3.6 MB, 776x1248)
3.6 MB
3.6 MB MP4
>>106921448
>>
File: ComfyUI_06132_.png (1.14 MB, 1192x872)
1.14 MB
1.14 MB PNG
>>
>>106921448
snout is a bit too large but hoo zoo wee mama thats nice, what model?>>106921467
woooaaaahhhh buuuddyy AWWOOOO
>>
>>106921285
The one from their HF repo is a bit messy and doesn't work well with comfy, use the Kijai one instead.
>>
>>106921285
>>106886613
>>
>>106921467
catbox?
>>
>>106921489
link?
>>
File: image_00074_.jpg (541 KB, 1328x1944)
541 KB
541 KB JPG
>>
>>106921229
cute
>>
File: ComfyUI_06177_.png (1.23 MB, 832x1256)
1.23 MB
1.23 MB PNG
>>
>>106921495
oh wow thats what i just did here >>106921461
sweet. should the scheduler be set to uni pc for both high and low passes or only low?
>>
>>106921541
i set it for both
>>
>>106920840
Nice
>>
File: i2v_all-in-slop2.X.mp4 (2.56 MB, 512x512)
2.56 MB
2.56 MB MP4
>>106921461
>arent those 256 iv's the ones that were broken?
i'm not sure what your'e talking about. i'm just copying and (trying to) expanding on what people have reportedly to work, i dont know the ins an outs.

so wan all in one seems to work better with loras on longer gens, but quality is poor. as for 2.2...
>>
>>106920840
this whole time i thought my trainer was broken but chroma does actually just look like that
>>
File: heyahoya.mp4 (3.07 MB, 480x640)
3.07 MB
3.07 MB MP4
>>106921600
2.2 (sfw) seems to struggle, unless i'm using the the wrong loras and prompts, followed those leddit settings and it seemed fine for nsfw
>>
>>106921614
kek, chroma is indeed a shitty model
>>
File: qwen-edit_00017_.png (898 KB, 1176x880)
898 KB
898 KB PNG
>>
>>106921480
Illustrious with a custom 3D LoRA based on Fugtrup's stuff.
>>106921500
Just a simple Wan 2.2 WF with the bouncy walk LoRA : https://civitai.com/models/1361346?modelVersionId=1537915
>>
>>106921495
>try the lora
uh oh slow motion!
>>
>Downloads (expect in about a week or so)
>Oct 9, 2025
>>
File: 532.jpg (28 KB, 400x396)
28 KB
28 KB JPG
>>106921780
Oh, I can't wait!
>>
I think NetaYume Lumina actually saved local (for anime at least) bros, v3.5 goes hard as fuck
`(@j.k.:0.5), (@yaegashi nan:0.5), a black square divided into four equal quadrants by bold white lines. In the top-left quadrant is the face of Princess Peach. In the top-right quadrant is the face of jinx \(league of legends\). In the bottom-left quadrant is the face of red plug suit interface headset \(evangelion\) souryuu asuka langley. In the bottom-right quadrant is the face of green eyes catwoman.`
>>
File: chroma___0023.png (1.08 MB, 896x1152)
1.08 MB
1.08 MB PNG
>>106921640
i don't think i can get a better output than this blurry grainy crap
>>
>>106921808
Got a link?
>>
>>106921808
artificial difficulty prompt
>>
>>106921808
post a gen that is at least /adt/ quality and not pure slop
>>
File: 1754580208228115.png (21 KB, 737x113)
21 KB
21 KB PNG
>>106921780
https://www.youtube.com/watch?v=OJy6bJ_RxXg
>>
>>106921856
gr8 b8 m8
>>
>>106921808
whats this? is it another attempt to save flux or a new model?
>>
>>106921856
>looks at /adt/ collage
you can't be fuckin serious
>>
>>106921853
Well that's why it's a good model lol, and it's not TOO much slower than SDXL as an architecture cause it's still only 2.6B, but the use if Gemma 2-2B for the encoder gives it really great prompt adherence
>>
File: media_1759949603.png (1.38 MB, 768x1280)
1.38 MB
1.38 MB PNG
>>
>>106921808
>can do close up 1girl
>"omg guys did you see this? local is saved!"
that would go hard if we were in 2022
>>
>>106921892
really makes /ldg/s look like pure slop in comparison
>>
>>106921872
NetaYume Lumina is a continuation finetune of Neta Lumina 1.0, which was itself a large-scale anime finetune of the Lumina 2.0 base model (which is of its own architecture, has 2.6B parameters, uses Gemma 2-2B as the sole text encoder and the VAE from Flux)
>>
>>106921899
Yeah right lmao, go prompt the same four characters in the same positions without bleed on any SDXL or SD 1.5 model
>>
File: image_00083_.jpg (794 KB, 1264x1696)
794 KB
794 KB JPG
>>
>>106921964
>on any SDXL or SD 1.5 model
damn I was joking when I said we're in 2022, do you know there's other models that appeared after SDXL? did you wake up from a 2 and a half year coma or something?
>>
>>106921940
examples look promising, gonna give this a little shot.
how lora trainable is it? This may be a nice step up from illustrious if its lorable.
>>
>>106921989
link?
>>
>>106921989
Have you tried anime with flux/wan/qwen? it sucks.
>>
>>106921808
>look up the model
>every example image is ass
*yawn*
>>
>>106922051
ill/noob still going strong on their multi-generational run.
That's not a good thing though...
>>
>>106922024
The guy has posted some stuff about Lora training for it on the page. You could also asked the other guy who has trained a detailer lora specifically for it, which I think is linked in the resources section on that same Civit page for NetaYume.
>>
File: 1737604955292295.png (2.73 MB, 2072x1011)
2.73 MB
2.73 MB PNG
>>
>shilling yet another meme model
Buy an ad.
>>
>Discussion of Free and Open Source Text-to-Image/Video Models
>>
>>106922095
This has always been the shill thread.
>>
>>106922051
Make sure you weren't looking at the original Neta Lumina page for one. Beyond that it's the same kind of examples every Illustrious checkpoint also has in their galleries IMO, not really sure what you'd mean. This is the correct page:
https://civitai.com/models/1790792/netayume-lumina-neta-luminalumina-image-20
>>
>>106921229
Nice. oekaki, jaggy lines, aliasing?
>>
>>106922095
> "nothing is ever good even when it is"
> "when is New Good Model coming out"
Sums up this general lol
>>
File: image_00087_.jpg (576 KB, 1264x1696)
576 KB
576 KB JPG
>>
File: chroma___0035.png (1.23 MB, 896x1152)
1.23 MB
1.23 MB PNG
at least the lora worked
>>
File: 1.png (2.68 MB, 1192x1744)
2.68 MB
2.68 MB PNG
>>106921808
>>
File: 1746632989194737.png (1.8 MB, 2509x838)
1.8 MB
1.8 MB PNG
>>
>>106922168
Yeah Chroma trains fine, a good photographic dataset captioned with natural language usually comes out a bit better than the normal Flux equivalent would for me, assuming both were trained at 1024x1024.
>>
>>106922181
> does something totally different, minus one character, and with obvious appearance bleed
If you actually believe this is the same thing at all IDK what to tell you lol
>>
>>106922181
looks like a good chroma replacement
>>
>>106922221
That 3D gen is very clearly an SDXL or possibly even SD 1.5 model
>>
File: chroma___0037.png (1.13 MB, 896x1152)
1.13 MB
1.13 MB PNG
>>106922187
yeah it's just an extremely volatile model
>>
>>106921424
Thanks bro, that did the trick
>>
>>106922245
oh thought it was chroma since it looks like the usual chroma gen posted here
>>
>>106922258
Np
>>
>>106922268
chroma doesnt even have a grasp on basic character concepts, even most 1.5 models can do better.
>>
File: image_00090_.jpg (596 KB, 1264x1696)
596 KB
596 KB JPG
>>106922187
Loras trained on ChromaHD with 768 res seem to give decent results.

>>106922256
rescale cfg works
>>
File: image_00092_.jpg (615 KB, 1264x1696)
615 KB
615 KB JPG
>>
>>106922313
>>106922357
she looks incredibly down(s)
>>
File: ComfyUI_06179_.png (1.22 MB, 1344x776)
1.22 MB
1.22 MB PNG
>>
>>106922258
fuck you. you never shared the image prompt, and you ask help. fuck you again
>>
File: ComfyUI_00003_.png (1.01 MB, 1024x1024)
1.01 MB
1.01 MB PNG
>>106921808
qwen

needs a finetune
>>
File: QwenEdit_00141_.png (1.42 MB, 1512x688)
1.42 MB
1.42 MB PNG
>>106920335
>>
File: ComfyUI_06167_.png (1.14 MB, 1208x864)
1.14 MB
1.14 MB PNG
>>
Which one?
>>
File: image_00091_.jpg (605 KB, 1264x1696)
605 KB
605 KB JPG
>>106922398
Ai Shinozaki with BabyMetal costume. That being said, those titties ain't retarded.
>>
>>106922454
>right one has the nano banana logo
lul
>>
>>106921808
3.5 or 3.0?

>>106922454
left
>>
>>106922475
left one too, blindanon
>>
>>106922488
>which model is better? nano banana or nano banana?
I'll go for nano banana, I love seeing API only comparisons on my local model thread
>>
File: 00006-2258857003.png (2.75 MB, 1344x1728)
2.75 MB
2.75 MB PNG
>>
>>106922423
pretty gud, how do it>
>>
>>106922512
I was implying woman not the model
>>
File: ComfyUI_06131_.png (1.17 MB, 1248x832)
1.17 MB
1.17 MB PNG
>>106922537
still with qwen and that lora that turns drawings into realistic photos
>>
>>106922551
>please look at my API images on this local thread!
no, go away
>>
File: 1751334083178736.png (313 KB, 552x409)
313 KB
313 KB PNG
its up

https://www.youtube.com/watch?v=d49mCFZTHsg
>>
>>106921808
Okay now have them interacting with each other in an image to see if it's NAI v4.5 tier?

Overall, Neta Yumine seems like very good stuff for anime. I hate just booru prompting alone so it's refreshing to see a way out of it. Also how good is it at NSFW so far? Can it be used in place of a Noob tune for B/G stuff?
>>
File: image_00095_.jpg (701 KB, 1408x2064)
701 KB
701 KB JPG
>>
>>106921467
Got any where she's bent over looking coy?
>>
>>106921856
>>106922051
Nta but this is insane
>>106919565
What are you guys smoking
>>
>>106922642
ah yes, the 1girl, laughing, pointing at viewer, crouching sloppa, my favourite
>>
>>
>>106922613
out of the box i got it to gen big sloppy titties and wide hips so theres that
>>
>>106922649
Show me better and what you even mean by definition of "slop".
>>
People are getting their hands on Nvidia DGX Spark now, apparently it has great support for ComfyUI. Anyone here has bought this AI computer?
>>
>>106922702
it's shit because of the memory bandwith
>>
>>106922702
it's utter garbage. dear nvidia rep, please buy an AD, thanks.
>>
>>106922702
its ass, and speaking of ass, shove it up yours + buy an ad.
>>
File: 1760656405713984.mp4 (840 KB, 992x560)
840 KB
840 KB MP4
>>106922702
Endorsed by yours truly
>>
>>106922744
>4000 dollars is the price of his soul
Tencent would have him implement HunyuanSlop 3.0 if they spent that amount of money to him too btw
>>
File: image_00101_.jpg (788 KB, 1408x2064)
788 KB
788 KB JPG
>>
>>106922702
It's already years outdated. Don't care about overpriced hardware with 2022 inference speed.
>>
>>106922777
Retard. You don't understand the purpose of it.
>>
>>106922181
this is embarrassing
>>
>>106922777
>2022 inference speed
nvidia dgx spark has 1070 tier vram speed, that released 9 years ago
>>
File: 1754360339061026.png (1.99 MB, 941x1092)
1.99 MB
1.99 MB PNG
>>106922777
>It's already years outdated.
that's why WillIam is in the ad video *badabum tss*
>>
>>106922755
If ComfyUI wants to keep get getting those VC bucks, they need to show they're growing, adopting new hardware/technologies from major players like NVIDIA/OpenAI/etc(API nodes). You think Venture capital is going to give them another 20M if they ignored everything and stayed local only? ComfyUI won't survive on donations only being free & open source.
>>
>>106922789
you're a fucking retard BRAH. you can get better mileage with the same price if you buy into server boards and fill the 12 channels with ddr5 ram, while also leaving an upgrade path open.
128gb is fucking LAUGHABLE man, jacketman is completely out of touch
>>
>>106922805
oh hi Comfy
>>
>>106922795
https://www.youtube.com/watch?v=Pww8rIzr1pg&t=795s

https://blog.comfy.org/p/comfyui-on-nvidia-dgx-spark
They will have benchmarks in a future blog post so stay tuned!
>>
>>106922812
>jacketman is completely out of touch
he has the most succeful company in the world, he's everything but out of touch
>>
>>106922555
pretty awesome, good job
>>
>>106922824
i dont need a benchmark when the theoretical maximum bandwidth is that of a gaming card from 9 years ago, rabbi
>>
File: 00007-1962528798.png (1.72 MB, 1024x1280)
1.72 MB
1.72 MB PNG
>>
>>106922852
>jensen huang cries out as he strikes your wallet
>>
>>106922860
I'm really starting to believe jensen was a jew in his previous life, that guy is probably the most jewish coded snake in business lool
>>
File: ComfyUI_06170_.png (1.09 MB, 1320x792)
1.09 MB
1.09 MB PNG
>>106922837
yeah I can only recommend it, turning drawings into something "real" is kinda addicting. produces some weird wonky shit sometimes tho
>>
>>106922852
im not sure why people are still trying to shill the dgx at this point
>>
>>106922876
that's some fever-dream type shit, i love it
>>
>>106922896
if they paid comfy to shill this, it's likely they paid some jeets to shill their turd here too
>>
File: ComfyUI_06188_.png (997 KB, 1176x888)
997 KB
997 KB PNG
>>106922906
indeed
>>
File: ComfyUI_07494_.png (1.82 MB, 1152x1152)
1.82 MB
1.82 MB PNG
>>
>>106922876
Edit it into pussies.
>>
File: gamer.png (2.08 MB, 1056x1784)
2.08 MB
2.08 MB PNG
>>
File: DGX Slop.png (165 KB, 484x360)
165 KB
165 KB PNG
>Low bandwidth unified slop.
>>
>>106922870
https://voca.ro/1iu20ug7o91O

https://voca.ro/1nA2ZTPhWvfi
>>
>>106923019
lmaoo
>>
What's next for quantization moving forward?
>>
File: jagmed.png (1.72 MB, 1302x968)
1.72 MB
1.72 MB PNG
>>
>>106923075
bitnet
>>
File: ComfyUI_06196_.png (1.27 MB, 1256x824)
1.27 MB
1.27 MB PNG
>>
>CivitAI no longer allows NSFW prompts on free Buzz without a membership first.
OWARI DA Its Joever...
>>
File: ComfyUI_06186_.png (1.19 MB, 1024x1016)
1.19 MB
1.19 MB PNG
>>106923112
why would we care? arent we all local chads here?
>>
>>106923112
only third worlders were using it to gen anything anyway.
>>
>>
File: 00059-4219193611.png (3.4 MB, 1248x1725)
3.4 MB
3.4 MB PNG
>>
>>106923173
Very
>>
>>106923079
>bitnet
SVDQUANT bros???????
>>
>>106923289
GeeGeeUuuuuuf of sdvquants when bros?
>>
>poorfag hours
grim
>>
>test netayume lumina
>vae decode comes out completely black after an upscale
very nice
>>
>>106923358
Doesn't work with sage attention
>>
>>106922402
Too old
>>
>>106923320
I make 128K a year after tax and I only work at most three hours a day. I also get all my food for free and was given a nice car from my boss. If you make less than me you should pipe down.
>>
>poorfag getting uppity
grim
>>
>>106923404
My dad makes 500k a month working construction at the Nintendo Japan HQ. I can rent out my Switch 2 for a few hours if you can afford the rates... Yeah, didn't think so.
>>
>>106923404
>128k per year
Go get your shinebox
>>
`You are an assistant designed to generate anime images based on textual prompts. <Prompt Start> (@deadnoodles:0.5), (@bee \(deadflow\):0.5), a woman with her body split down the middle between one metal cyborg half and one red lizard half holds a pistol in her left hand and a sword in her right hand. She is riding on a ghost horse through an ancient magical forest. Half of the sky is day and half of the sky is night.`
>>
>>106923404
128k is poorfag territory tho, idk why you thought it was such a flex.
>>
>>106923320
>>106923419
>richfags love to be scammed and buy overpriced garbage
no they don't, and that's why they're rich and you aren't
>>
why doesn't ComfyUI create workflows, for new models anymore? we only have Kijai workflows. they're really lazy now. i guess they only care about api
>>
>>106923457
Old ones work?
>>
>>106923446
Now this is cool.
>>
>>106923455
Say that to my 6xDGX Sparks poorfag.
>>
>lighting lora destroys motion
>takes more than 5 times longer without lora

nunchaku wan when?
>>
>>106923530
you want to run a Q4 quality quant though? the videos will be ass
>>
>>106923541
works for image models
>>
Switched to fedora kde. My gens are twice as fast as in Windows. Getting a lot of mileage from 10 VRAM
>>
>>106923586
>Twice as fast.
Surely that can't be true.
>>
>>106923498
Thanks. I think NetaYume (and Lumina the base model) kinda show people are chasing model parameter counts when it might be better to just chase better text encoders than T5. Like I think another 2.6B model that moved to using Gemma 3 1B rather than Gemma 2 2B would probably be even better still while also a bit less resource intensive.
>>
>>106923599
Enjoy your Windows 11 cuck. Microsoft is watching everything you gen
>>
File: 586470449022735_00001_F.jpg (2.27 MB, 6000x3000)
2.27 MB
2.27 MB JPG
>>106922626
>>
File: 586470449022735_00001_F.mp4 (3.5 MB, 822x1236)
3.5 MB
3.5 MB MP4
>>106923642
>>
OK guys, give me some hot takes
>>
>>106922702
it's not for image diffusion or even LLM inference. it's useless for anyone here
>>
>>106923661
Chroma isn't great or terrible. It's for people who enjoyed wrangling 1.5
>>
File: ComfyUI_07505_.png (1.8 MB, 1152x1152)
1.8 MB
1.8 MB PNG
>>
>>106923658
bueno...
>>
>>106923662
>it's not for image diffusion
Comfy approves >>106922744
>>
>>106923673
>ComfyUI_07505
anon...
>>
>>106923658
>finally no slowmo
>but its on a twerk gen
anon...
>>
>>106923706
whats wrong with comfyUI
>>
>>106923726
real men use anistudio
>>
File: 1.5.png (415 KB, 512x720)
415 KB
415 KB PNG
If I want to generate character images while role-playing with an LLM, what are my options that can generate images relatively fast, won't mess up hands, and won't consume too much of precious VRAM? It would be great to find something better than SD1.5 that uses less than half of a 3090 and can generate 512x256 images in under 10 seconds.
Sorry about picrel, it is what I use, and I am not happy with it
>>
>>106923615
Rouwei will save us. I did check out 3.5 and it wasn't better than 3.0, and overall this finetune is only marginally better than base netalumina imo. And it is fucking slow and there's no adoption at all, no tooling. Sadly lumina is dead (still). This isn't exactly related to your text encoders point but what I want to say is Gemma 3 or Qwen 3, or any next gen llms, we still need a small but viable image model first. Hopefully someone makes use of that no-vae approach of the chroma guy because vaes are dumb too.
>>
>>106923738
>Rouwei will save us
fuck off minthy
>>
Lumina rocks, but god its slept on. We need an autistic retard furry savior to push everyone to it.
thanks to the anon that posted the grid which got me going down one of the only imagegen rabbitholes of the year which didn't make me depressed.
but now i gotta see about training some styles for it. It knows a few good artists but its finnicky at best.
>>
>>106923731
pre or post op?
>>
File: ComfyUI_07508_.png (1.79 MB, 1152x1152)
1.79 MB
1.79 MB PNG
Neat that with Chroma Flash these are first try (I recall them being like at least 5+ tries to get coherent results on v38).
https://files.catbox.moe/vcmo58.png
>>
>>106923746
Does it just work or is there wf
>>
>>106923754
i stole a redditor's since i couldn't get ultimate sd upscale working, brain is too spaghetti'd

https://www.reddit.com/r/StableDiffusion/comments/1ilipk3/comment/mbuzrks/
>>
>>106923720
yw
>>
>>106923742
I'd fuck minthy iykwim, he contributed enough to my enjoyment of SDXL.
IF he gets rid of the fucking sepia!
>>
>>106923749
preferably pre op
>>
>>106923771
*chef's kiss*
>>
I have to say, this goes to my oof & yikes compilation
>>
File: 1759194952978846.png (21 KB, 112x112)
21 KB
21 KB PNG
I'm using a 2009 laptop with windows 7. I get my satisfaction from seeing other peoples gens. I save them into a folder and jerk off to the best ones every friday night.
>>
File: ComfyUI_00068_.png (1.39 MB, 832x1216)
1.39 MB
1.39 MB PNG
>>106923734
Some sdxl finetune. Pic related is ilustmix, 12 seconds on 4070S, 30 steps.

Don't gen at 512x256, it looks like shit. Downscale it if it's required.
>>
That Neta Lumina model looks promising for anime stuff. I prefer 3D render style images and it doesn't do those very well but it has pretty decent prompt adherence.
>>
>>106923859
>I prefer 3D render style images
So do I, but Illustrious had the same problem at first. Should be easy enough to train 3D datasets.
>>
File: media_1760144827.png (1.29 MB, 768x1280)
1.29 MB
1.29 MB PNG
>>
File: 1738208518251892.gif (531 KB, 112x108)
531 KB
531 KB GIF
>>106923962
>>
File: 00009-1989363824.png (1.49 MB, 1024x1280)
1.49 MB
1.49 MB PNG
>>
File: 1735944431360524.mp4 (1.85 MB, 640x848)
1.85 MB
1.85 MB MP4
>>106923771
>>
>>106923771
Now make her get doubleteamed by Sonic and Knuckles while Tails watches
>>
File: teammate steals kill.webm (2.2 MB, 1920x1080)
2.2 MB
2.2 MB WEBM
>>
>>106924018
>20 seconds
>>
Anyone have any idea what this guy is using for his vids?

https://www.instagram.com/fullwarp?igsh=MTd4MWVkcmxuZm01cQ==
>>
>>106923771
Going right in the old spank bank
>>
>>106923771
thanks for inspiring me to spin wan 2.2 back up and generate more yoink material
>>
>>106923859 >>106923892
it may still learn 2.5 and 3dcg type artwork. else chroma radiance definitely learned 3d pretty well already
>>
i keep getting OOM errors with comfy and the new wan2.2 lightning i2v loras.. never happened before.. but if i just keep trying the gens eventually go thru.. shit's fucked.. no reason a 5090 should OOM on a 832x640 81 length gen
>>
>>
File: ComfyUI_00468_.mp4 (380 KB, 640x832)
380 KB
380 KB MP4
>>
>>106924208
whut duh fuck
>>
File: ComfyUI_07519_.png (1.82 MB, 1152x1152)
1.82 MB
1.82 MB PNG
>>
File: ComfyUI_00470_.mp4 (333 KB, 640x832)
333 KB
333 KB MP4
>>106924214
>>
File: ComfyUI_07510_.png (1.79 MB, 1152x1152)
1.79 MB
1.79 MB PNG
>>
>>106924208
make it breathe and squirm
>>
>>106924234
don't make it breathe and squirm
>>
>>106924117
i know he said hailuo and sora in the past when i came across some of his vids on reddit
>>
File: images.jpg (52 KB, 446x448)
52 KB
52 KB JPG
>sloptwerk lora
>dildo ride lora
>rife 60fps
>near 1080p upscale
i did it. i finally did it. i reached ai coom enlightenment
>>
>>106924363
I am taking notes.
>>
File: QwenEdit_00135_.png (1.28 MB, 1072x968)
1.28 MB
1.28 MB PNG
lets see the result
>>
When the bubble bursts, are we finally gonna see some optimizations and not just endlessly stacking compute and parameters?
>>
>>106924167
I tried the official workflows for Chroma and Radiance a bunch of times and it outputs weirdly amateur-looking results. Like something you'd expect out of an amateur illustrator or 3D modeller. Normally I feel like that indicates a too-low CFG but cranking that up doesn't seem to help.
>>
>>106924398
sloptwerk in combination with dildo ride, set dildo ride to 1.8 high 1.5 low, adjust as needed, twerk to 1 because its already a strong and good lora
my shift is at 5.0 despite genning at 1280x720, seems to be golden stable setting
fuck around with steps but honestly 8 is best for less blurry/grainy motion
open to suggestions if theres ultra enlightened giga coomers who have better setups
honestly 99% of the limiting factor has been the lightning loras, scroll up to earlier when i pointed out the best one for the hires pass thats mostly what fixed all my problems
>>
>>106924462
link?
>>
>>106924532
>>106924532
>>106924532
>>
File: ComfyUI_00472_.mp4 (345 KB, 640x832)
345 KB
345 KB MP4
>>106924289
>>106924259
>>
File: ComfyUI_00478_.mp4 (667 KB, 640x832)
667 KB
667 KB MP4
wan does not understand the concept of quickly
>>
>>106924703
lightx2v**
>>
>>106924710
yeah.. using that already.. not seeing much difference
>>
File: ComfyUI_00482_.mp4 (750 KB, 640x832)
750 KB
750 KB MP4
>>
>>106924717
He's saying that's lightx2v doing slo-mo. That's what it does, it's infamous for it. Fast gens, slow motion.
>>
>>106924785
oh i thought that was supposed to be fixed in the new version or something
>>
>>106919643
nice
>>
>>106922454
left
>>
>>106923738
>Rouwei will save us
Non-t5gemma rouwei has lost *all artists* (very noticeable in his own gallery where artists are suspiciously absent from prompts outside of one single image which doesn't look like prompted artist at all). Who the fuck needs an ilu fork without artists?
Haven't tried t5gemma yet, but he himself says that it's spacial understanding is worse.
>>
>>106926418
I really don't see how Rouwei would end up better than NetaYume as long as the NetaYume guy keeps working on it IMO.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.