/g/ - Technology




File: ComfyUI_00025_.mp4 (394 KB, 640x640)
394 KB
394 KB MP4
Love is Blind Edition

Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106514324

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://rentry.org/wan22ldgguide
https://github.com/Wan-Video
https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2122326
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
Blessed thread of frenship
>>
Wan = critique ok
Qwen = critique ok
SDXL models = critique ok
Flux = critique ok
Chroma = above criticism, lode of cunt fart furry stones is the second coming of Christ and will save us from the vae.
>>
>>106517598
>This level of mental illness portrayed by an obsessive-compulsive disorder over a model that you can't accept other people using
Seek help, you are too insane even by 4chan standards
>>
File: ComfyUI_00074_.mp4 (720 KB, 640x640)
720 KB
720 KB MP4
>>
Hopefully we can abandon this shit thread soon
>>
>>106517659
don't like then make your own lil bro
>>
File: ComfyUI_00033_.mp4 (633 KB, 640x640)
633 KB
633 KB MP4
>>106517659
post enough images to get to the bump limit
>>
>>106517642
NTA but nah this is about normal for 4chan
/g/ is one of the sanest boards
CAPTCHA: DP4A8
>>
File: ComfyUI_00082_.mp4 (653 KB, 640x640)
653 KB
653 KB MP4
so close
>>
File: ComfyUI_00085_.mp4 (1.55 MB, 640x640)
1.55 MB
1.55 MB MP4
Pretty amazing how far we've come since Jurassic 2B.
>>
File: ComfyUI_00086_.mp4 (1.22 MB, 640x640)
1.22 MB
1.22 MB MP4
>>
File: ComfyUI_00087_.mp4 (1.09 MB, 640x640)
1.09 MB
1.09 MB MP4
>>106517745
better gen
>>
File: ComfyUI_00089_.mp4 (381 KB, 640x640)
381 KB
381 KB MP4
This pic is legendary in /o/
>>
File: ComfyUI_00614_.png (541 KB, 1024x1024)
541 KB
541 KB PNG
>>106517536
The media wants her to be forgotten. I made sure her memory will live on forever with a Chroma LoRA
https://files.catbox.moe/fsvmpl.zip
>>
File: ComfyUI_00090_.mp4 (513 KB, 640x640)
513 KB
513 KB MP4
>your name is teenus
>>
File: ComfyUI_00096_.mp4 (562 KB, 640x640)
562 KB
562 KB MP4
>>
File: ComfyUI_00098_.mp4 (433 KB, 640x640)
433 KB
433 KB MP4
>>
File: ComfyUI_00072_.mp4 (713 KB, 640x640)
713 KB
713 KB MP4
oops i forgot the thread title
>>
>>106517536
>>
File: ComfyUItest_00025_.png (3.64 MB, 2560x1440)
3.64 MB
3.64 MB PNG
>>
bake so fucked up i thought anistudio was added to the OP text pasta again
>>
>>106517993
Why does Ani studio try and get their front end recognized here instead of doing the logical thing and feeding it to reddit so it trickles down here?
>>
File: ComfyUI_00100_.mp4 (415 KB, 640x640)
415 KB
415 KB MP4
>>106517993
I don't even have an excuse.
>>
File: 1735358774540.jpg (845 KB, 4160x3744)
845 KB
845 KB JPG
>>
so upgrading ram and gpu are the main physical ways of increasing output speeds right?
>>
File: ComfyUI_00099_.mp4 (637 KB, 640x640)
637 KB
637 KB MP4
>>106518032
neat style
>>
Usecase for vibevoice? It's good and that's it. How much use is there for a voice model that sounds like someone?

And making sexy moans is not a usecase.
>>
File: ComfyUI_00104_.mp4 (1.25 MB, 640x640)
1.25 MB
1.25 MB MP4
>>
>>106518071
how does it feel to have zero imagination?
>>
>>106518071
>And making sexy moans is not a usecase.
It is to me!
>>
>>106518071
Sorry I'm not giving you my business idea
>>
File: ComfyUI_00106_.mp4 (447 KB, 640x640)
447 KB
447 KB MP4
>>106518073
It works really well with old pictures.
>>
File: ComfyUI_00108_.mp4 (646 KB, 640x640)
646 KB
646 KB MP4
>>
File: ComfyUI_00110_.mp4 (688 KB, 640x640)
688 KB
688 KB MP4
>>
Let me translate these responses

>>106518084
I don't know either
>>106518085
GOON GOON
>>106518094
I was hoping you could tell me.
>>
File: ComfyUI_00113_.mp4 (750 KB, 640x640)
750 KB
750 KB MP4
>>
very inorganic
>>
>>106518174
very uninorganic
>>
>>106518187
everything will be ok ! <3
>>
File: ComfyUItest_00023_.png (3.96 MB, 2560x1440)
3.96 MB
3.96 MB PNG
>>
>>106518155
>GOON GOON
GOON GOON to you too brother, whuzzah
>>
>>106518071
I know a guy who trains on his own voice so he can pump out tutorial slop quicker. You're welcome, no need to reply and I won't give you any more hints.
>>
File: ComfyUI_00114_.mp4 (549 KB, 640x640)
549 KB
549 KB MP4
>>
>>106518048
Specifically GPU for processing speed, RAM for bigger projects or resolutions, I think.
>>
File: 1_00001_.mp4 (936 KB, 640x640)
936 KB
936 KB MP4
>>106518225
>>
File: 1_00002_.mp4 (1.18 MB, 640x640)
1.18 MB
1.18 MB MP4
>>106518225
>>106518242
>>
>>106518048
RAM doesn't matter as long as it's >= GPU VRAM
GPU VRAM is the single biggest limiting factor for both quality and speed
GPU speed matters too but VRAM is the most important thing
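A rough way to sanity-check the VRAM point is to estimate how much memory the weights alone take at a given precision. This is only a back-of-the-envelope sketch: the 14B parameter count and the 2 GiB headroom are illustrative assumptions, and it ignores activations, the VAE, the text encoder, and framework overhead.

```python
# Weight-only memory estimate. Real usage is higher: activations, VAE,
# text encoder, and framework overhead are not counted here.
BYTES_PER_PARAM = {"fp32": 4.0, "fp16": 2.0, "bf16": 2.0, "fp8": 1.0}


def weight_gib(params_billion: float, dtype: str) -> float:
    """Approximate size of the weights in GiB for a given parameter count."""
    return params_billion * 1e9 * BYTES_PER_PARAM[dtype] / 2**30


def fits_in_vram(params_billion: float, dtype: str, vram_gib: float,
                 headroom_gib: float = 2.0) -> bool:
    """True if the weights plus an assumed headroom fit in the given VRAM."""
    return weight_gib(params_billion, dtype) + headroom_gib <= vram_gib


if __name__ == "__main__":
    # Hypothetical 14B-parameter model on a 24 GiB card.
    for dtype in ("fp16", "fp8"):
        size = weight_gib(14, dtype)
        print(f"{dtype}: {size:.1f} GiB, fits in 24 GiB: {fits_in_vram(14, dtype, 24)}")
```

If the weights don't fit, the UI has to offload to system RAM or use a smaller quant, which is usually where the big slowdowns come from.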
>>
File: 1755629125875.jpg (398 KB, 4160x1248)
398 KB
398 KB JPG
>>106518050
thx still figuring out which epoch i like best
>>
File: ComfyUI_00120_.mp4 (1009 KB, 640x640)
1009 KB
1009 KB MP4
>>
>>106517598

Remember, if you don't like Chroma, you're a Nazi.

There is no such thing as being indifferent or ambivalent about Chroma. You either love it, use it daily, and defend it and Lodestones, or you hate it, in which case you're a literal Nazi and not welcome here.
>>
>>106518347
The only person who constantly talks about Chroma in these threads is you, the Chroma hater

Your gaslighting is pathetic, almost as pathetic as your samefagging
>>
>>106518370
Actually. I am the Chroma hater.
>>
File: ComfyUI_00123_.mp4 (1.01 MB, 640x640)
1.01 MB
1.01 MB MP4
>>
ashamed that im allowed in the same thread as the VRAM chad pumping out videos that quickly. give me my orders, senpai. i will not let you down.
>>
>>106518387
Ask him to generate at 720p and the pace should slow down.
>>
File: ComfyUI_00124_.mp4 (696 KB, 640x640)
696 KB
696 KB MP4
>>
>>106518399
not even in my wildest dreams could i attempt to wish that an anon such as that would heed my words. its akin to standing in a thunderstorm or hurricane. all one can do is watch. and pray.
>>
File: ComfyUI_00126_.mp4 (612 KB, 640x640)
612 KB
612 KB MP4
>>106518399
check you want 720x720? A lot of the ones I'm making end up larger than the file limit. I don't know why, the more detail the larger the files seems to be even at 640x640.
>>
File: ComfyUI_00127_.mp4 (548 KB, 640x640)
548 KB
548 KB MP4
>>
>>106517848
who dis and how she died?
>>
File: ComfyUI_00128_.mp4 (736 KB, 640x640)
736 KB
736 KB MP4
fucked up my tallgeese
>>
File: elf-hugger_00011_.png (1.2 MB, 752x1248)
1.2 MB
1.2 MB PNG
>>106518433
I believe she was the beneficiary of some urban cultural enrichment.
>>
File: ComfyUI_00130_.mp4 (1.32 MB, 640x640)
1.32 MB
1.32 MB MP4
>>
File: ComfyUI_00031_.mp4 (495 KB, 640x640)
495 KB
495 KB MP4
>>106518387
have fun
>>
>>106518433
https://xcancel.com/piersmorgan/status/1964725765095710837
https://xcancel.com/AshleyRindsberg/status/1964797187461972421

>>106518459
>beneficiary of some urban cultural enrichment
She tried to escape senseless violence
>>
File: 1745120861604.jpg (274 KB, 1416x2120)
274 KB
274 KB JPG
>>
File: ComfyUI_00135_.mp4 (475 KB, 640x640)
475 KB
475 KB MP4
>>
File: ComfyUI_00136_.mp4 (535 KB, 640x640)
535 KB
535 KB MP4
>>106518536
>>
File: ComfyUI_00137_.mp4 (684 KB, 640x640)
684 KB
684 KB MP4
>>
File: elf-hugger_00010_.png (1.19 MB, 752x1248)
1.19 MB
1.19 MB PNG
>>106518511
>She tried to escape senseless violence
In the end she fell victim to a totally different kind of drone.
>>
File: ComfyUI_00139_.mp4 (972 KB, 640x640)
972 KB
972 KB MP4
> I didn't need that foot anyway
Captcha:GAYVS
>>
File: 1744472007221.jpg (284 KB, 1416x2120)
284 KB
284 KB JPG
>>
File: ComfyUI_00140_.mp4 (1.11 MB, 640x640)
1.11 MB
1.11 MB MP4
>Secret weapons over Normandy
>>
File: ComfyUI_00141_.mp4 (457 KB, 640x640)
457 KB
457 KB MP4
>>106518526
>>106518592
I like this style.
>>
>>106518433
she relaxed
>>
File: ComfyUI_00142_.mp4 (495 KB, 640x640)
495 KB
495 KB MP4
>>106518613
I'm really impressed by this gen, it looks like the real launch.
>>
File: 1743789278052.jpg (220 KB, 1416x2120)
220 KB
220 KB JPG
>>
Surely you also queue 2k images overnight to go through the next day, Anon?
>>
File: ComfyUI_00144_.mp4 (361 KB, 640x640)
361 KB
361 KB MP4
>>
>>106518633
now do one that looks like the "real" moon landing
>>
File: 1752120763697.jpg (326 KB, 1416x2120)
326 KB
326 KB JPG
>>
File: ComfyUI_00146_.mp4 (553 KB, 640x640)
553 KB
553 KB MP4
>>
File: ComfyUI_00037_.mp4 (1.12 MB, 960x960)
1.12 MB
1.12 MB MP4
what
>>
>>106518688
Extremely plappable milf bod
>>
File: ComfyUI_00147_.mp4 (657 KB, 640x640)
657 KB
657 KB MP4
>>
File: ComfyUI_00148_.mp4 (1019 KB, 640x640)
1019 KB
1019 KB MP4
>>
File: ever more mo-gay.jpg (560 KB, 1536x2048)
560 KB
560 KB JPG
I can finally gen without a space heater torching my legs.
>>
>>106518725
What if your cats get into it?
>>
>>106518688
shrooms kicking in
>>
File: 1652812234920.gif (1.41 MB, 350x272)
1.41 MB
1.41 MB GIF
>>106517848
i don't understand why this thread even cares. she was clearly an woke girl, and probably blacked and shit. don't cry for all the blonde girls, kek
>>
>>106518725
Serial Experiments Lain
>>
>>106518747
It has a fan guard, just doing initial setup now.
>>
>>106517848
which version of chroma and is there a trigger word?
>>
File: ComfyUI_00152_.mp4 (1.79 MB, 640x640)
1.79 MB
1.79 MB MP4
>>
File: ComfyUI_00153_.mp4 (2.11 MB, 640x640)
2.11 MB
2.11 MB MP4
>>106518725
oh shit better check that thermal paste
>>
File: 1746193048942.jpg (893 KB, 4160x2496)
893 KB
893 KB JPG
>>
File: ComfyUI_00155_.mp4 (1.53 MB, 640x640)
1.53 MB
1.53 MB MP4
>>106518652
>>
File: its too late.mp4 (1.07 MB, 960x960)
1.07 MB
1.07 MB MP4
>>106518749
that wasnt a microdose
>>
File: ComfyUI_00156_.mp4 (1.2 MB, 640x640)
1.2 MB
1.2 MB MP4
>>106518828
>>106518652
>>
>>106518757
btw, don't ban me if I've been rude. i use chroma. I'm already grounded, lmao
>>
File: ComfyUI_00157_.mp4 (641 KB, 640x640)
641 KB
641 KB MP4
pillars of retardation
>>
>>106518795
kek
>>
>>106518848
desu, I think you should be banned for even using Chroma. Illegal model.
>>
File: 1526956362177.png (281 KB, 495x495)
281 KB
281 KB PNG
>>106518795
Another 12 volt high power connector....
>>
File: ComfyUI_00160_.mp4 (783 KB, 640x640)
783 KB
783 KB MP4
I'm out for the night, hopefully an Australian has some gens for us
>>
File: ComfyUI_00161_.mp4 (861 KB, 640x640)
861 KB
861 KB MP4
>>106518883
one more for the road
>>
>>106518905
I have both of mine doing some batch work so I can't respond with a gen right now. Also I just realized I can load wan 2.2 FP16 on my RTX6000 card....
>>
>>106518910
>I can't respond with a gen right now.
That's okay anon I still love you.
>>
>>106518917
Would you still love me if you knew I was genning fetish porn? Because that's what I'm doing.
>>
>>106518920
My love is unconditional.
>>
>>106518920
Aren’t we all really?
>>
File: ComfyUI_00162_.mp4 (649 KB, 640x640)
649 KB
649 KB MP4
>>106518910
jelly of your horsepower
>>
>>106518920
what kind tho
>>
File: ComfyUI_00163_.mp4 (423 KB, 640x640)
423 KB
423 KB MP4
>>106518931
alright i'm seriously off for the night
>>
File: 1743805803907.jpg (226 KB, 1416x2120)
226 KB
226 KB JPG
>>
how can I batch process multiple images with different prompts in comfy?
>>
>>106518985
Don't know if it's possible but just try duplicating the nodes that are needed to be duplicated in a single workflow like prompt and sampling nodes
>>
>>106518985
Truly, it must have some kind of wild cards or dynamic prompts node right? It’s pretty basic functionality.
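One option that doesn't need extra nodes is to script it against ComfyUI's HTTP API. The sketch below assumes the workflow was exported in API format, that the server is on the default local port 8188, and that node id "6" is the positive CLIPTextEncode node; that id is hypothetical and depends on your own workflow.

```python
import copy
import json
import urllib.request


def build_jobs(workflow: dict, prompt_node_id: str, prompts: list) -> list:
    """Make one API payload per prompt by swapping the text of one node."""
    jobs = []
    for text in prompts:
        wf = copy.deepcopy(workflow)  # never mutate the template workflow
        wf[prompt_node_id]["inputs"]["text"] = text
        jobs.append({"prompt": wf})
    return jobs


def queue_jobs(jobs: list, url: str = "http://127.0.0.1:8188/prompt") -> None:
    """POST each payload to a running ComfyUI instance (assumed local default)."""
    for job in jobs:
        req = urllib.request.Request(
            url,
            data=json.dumps(job).encode("utf-8"),
            headers={"Content-Type": "application/json"},
        )
        urllib.request.urlopen(req).close()


if __name__ == "__main__":
    # "workflow_api.json" is a placeholder name for your exported workflow.
    with open("workflow_api.json", encoding="utf-8") as f:
        template = json.load(f)
    queue_jobs(build_jobs(template, "6", ["a red fox", "a blue heron"]))
```

The same pattern works for swapping the input image node if each prompt is meant to go with a different source image.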
>>
https://voca.ro/1nqOQ4hIzcrU
>>
File: ComfyUI_00048_.mp4 (385 KB, 544x960)
385 KB
385 KB MP4
>>
File: 1744134730465.jpg (361 KB, 1416x2120)
361 KB
361 KB JPG
>>
>>106519030
kek
>>
>>
>>
>>106518688
Those are so massive they're causing time dilation
>>
File: 1745901256852941.png (3.82 MB, 1344x1728)
3.82 MB
3.82 MB PNG
>>
What's with the shitty op?
>>
>>106519030
this is pretty good
>>
File: 1729831385810475.mp4 (1.45 MB, 832x624)
1.45 MB
1.45 MB MP4
>first taste of i2v
I'm definitely getting a 5070 Ti Super 24GB once they're out
>>
>getting trolled by i2v loras 5+ times in a row
so fucking annoying
>>
>>106519259
klass tamarind mix and teasdale hominy
>>
>>106518071
My VN will be entirely voiced now :D
https://vocaroo.com/17GdWIJgJgNI
>>
File: ComfyUI_00520_.webm (3.79 MB, 608x896)
3.79 MB
3.79 MB WEBM
>>106518933
WAM. It's kind of niche so sometimes you gotta make your own.
>>
File: Chroma_00002_.jpg (282 KB, 992x1456)
282 KB
282 KB JPG
>>
>>106519335
fifi, that's not what i meant by cream pie
>>
>>
>>106519329
AI is basically in the right place now where you could make a pretty banger model with animations and voice acting and it wouldn't feel especially cheap if done right.
>>
Where are people getting VibeVoice 7B?
>>
>>106519329
is there a way to exaggerate the voices like in chatterbox or? vibevoice seems only good for basic mundane voice conversations
>>
>>106519364
why not both?
>>
File: 1740783690277968.png (290 KB, 768x432)
290 KB
290 KB PNG
>>106519030
kino
thank you for gem anon
>>
>>106519382
Nta but it imitates the reference sample as close as possible. So if you want anger, you would want to splice some angry lines in the reference.
>>
>>106519382
I generated that clip around 5 times, she sounded unreasonably angry in 3 of them and too vulnerable and sad in 1
The model seems to get real exaggerated and emotional at times but guiding it seems impossible
>>
>>106519396
yes sort of but it sometimes acts completely against the sample voice, I input a mostly neutral slightly worried voice but I often get an exaggerated ANGRY response, and another time I input a sad voice and it sometimes gave me cheery responses
>>
>>106519374
I took it from bill gate's pocket when he was napping.
>>
File: Chroma_00009_.jpg (704 KB, 992x1456)
704 KB
704 KB JPG
>>
>>106519396
but if i want to clone a specific voice sample if i add another angry voice it will probably fuck up the cloning? chatterbox seems much more controllable
>>
https://voca.ro/1c9C0qTnukzp
>>
>>106519467
lul
>>
>>106519461
I'd like to see a chatterbox vs vibevoice output. I mean, even from context alone, vibe voice seems to do a really good job of inferring what the tone should be.
>>
File: 1731422333311848.png (233 KB, 513x513)
233 KB
233 KB PNG
>>106519467
sar please send account details i do not want to die sar let me send money please sar
>>
>>106519414
Yeah it's still a bit rng of course but it seems to net me closer to the emotions I want doing that.
>>106519461
I mean more in the way of same person but angry assuming they have lines like that of course. Maybe I just sucked at using chatterbox but the results when you up the exaggeration just becomes high-pitched and glitchy.
>>
>>106519467
>Pakistani
Are they scammers just like Indians?
>>
>>106519498
No idea. I just know it's who they always blame when things go wrong.
>>
>>106519498
Darker the skin, darker the soul
>>
File: Chroma_00018_.jpg (443 KB, 992x1456)
443 KB
443 KB JPG
>>
File: 1749560759974018.png (335 KB, 452x742)
335 KB
335 KB PNG
https://voca.ro/18WgII5ok9fY
https://voca.ro/178eihFyNefB
https://vocaroo.com/1mUiPvzjiMnM
https://voca.ro/16yr1GhZzuuX
https://vocaroo.com/1ncWGn7j8pu4
https://voca.ro/1mcbmZbhyjZf
https://voca.ro/192SwXVWOwLl
>>
>>106519519
not gonna lie, they sound perfect for a ps1/ps2 game when the sound quality was a bit crunchy, you could definitely make another Deus Ex game with this model kek
>>
File: 1745751147027684.mp4 (639 KB, 800x736)
639 KB
639 KB MP4
>>106519259
>>
>>106519519
>I hope you installed a recoil mod

why did this in particular make me laugh, fuck this is really good
>>
>>106519467
redeemed
>>
File: 1748141799853295.mp4 (1.48 MB, 800x608)
1.48 MB
1.48 MB MP4
>>106519564
>>
>>106519580
Her face has big
>You spent all of last night drinking with your friends and didn't come home until 8AM in the morning energy.
>>
Which idiot made the OP?
>>
Where do you keep your prompts? Just a normal .txt?
>>
File: Chroma_00028_.jpg (335 KB, 992x1456)
335 KB
335 KB JPG
>>
File: Qwan_00004_.jpg (695 KB, 1984x2976)
695 KB
695 KB JPG
>>106519712
In the pictures, silly.
>>
>>106519347
>>106519511
>>106519716
Is there some Range Murata in there?
>>
File: ComfyUI_00051_.png (2.48 MB, 896x1600)
2.48 MB
2.48 MB PNG
Qwen doesn't understand body proportions, huh? I'm trying to make a shapely woman and failing.

What am I doing wrong?
>>
File: Chroma_00034_.jpg (447 KB, 992x1456)
447 KB
447 KB JPG
>>106519785
Tsukasa Jun
>>
File: ComfyUI_00077_.png (3.61 MB, 1080x1904)
3.61 MB
3.61 MB PNG
>>106519797
>>106519760
Hey it's you, so I'm trying a WF with Qwen itself running twice rather than wan, because it was too slow. it's better than my previous results, I feel.
>>
>>106519519
i love what this model enables us to make lmao
>>
>>106519760
catbox?
>>
File: 1740182222727570.png (252 KB, 400x485)
252 KB
252 KB PNG
400$ + tip per image paintpiggy seethe
https://files.catbox.moe/36w4mj.mp4

https://x.com/derekalia/status/1964484540359164023
>>
>>106518071
>Usecase for vibevoice?
Bank fraud
>>
File: ComfyUIq-img-_00014_.png (933 KB, 832x1152)
933 KB
933 KB PNG
testing qwen now
>>
>>106518071
r/gonewildaudio
>>
>>106519938
lmao wtf
>>
is nunchaku qwen worth using?
>>
>>106520023
its fast af

dono if it supports loras
>>
>>106519992
it's less slopped than flux but the skin is still ultra smooth, when will we be free from this shit? :'(
>>
>>106520031
are loras required for decent gens, havent used it myself yet
>>
>>106520037
to get rid of smoothskin. yes
>>
>>106520031
>dono if it supports loras
Not yet, I think they are working on Qwen-edit rn
>>
>>106520054
oh, thats good, hopefully it happens soon.
do their flux models support loras?
>>
>>106520059
Yeah
>>
>>106520062
nicee I gotta try it then. thought that didnt support so never bothered!
>>
File: 1730752346096913.mp4 (1.3 MB, 800x608)
1.3 MB
1.3 MB MP4
>>106519580
>>
>>106520090
are these I2V or T2I? curious how you made the base images
>>
>>106520132
Everyone knows the boomer shooter LoRA guy.
>>
File: Qwan_00017_.jpg (578 KB, 1984x2976)
578 KB
578 KB JPG
>>106519797
Gotta finagle the prompt a good bit and exaggerate whatever you want to see (a lot). It's going to try hard to default back to the usual body type you see.
>>106519834
Haven't thought about doing a second pass with Qwen itself, that's a neat idea. But I imagine skin detail will still be somewhat lacking at higher resolutions.
>>
https://voca.ro/15cRSYm0udpS
>>
>>106520211
DAMN dude peter griffin just fuckin owned chromasome dorks so hard, they cannot come back from this

please post the transcript
>>
>>106520220

Speaker 1: Yo, check the mic, one two...

Speaker 1: Call it Chroma, more like a coma for your GPU's aroma, burnin' VRAM, ain't no diploma for this mess.

Speaker 1: Prompt adherence? A wild guess. Ask for a cat, get a plastic mess, shiny skin, uncanny valley express.
Speaker 1: Style control? A nightmare scene, word it wrong, get something obscene.

Speaker 1: Legs are twisted, hands are a blur, "photorealistic" is a word it never heard.
Speaker 1: It's slow, so slow, a minute per frame, ain't nobody got time for that game.

Speaker 1: They say it's uncensored, a selling point, but the output's so busted, what's the point?
Speaker 1: A face like melted wax, a background of sludge, Chroma, my friend, I hold a grudge.

Speaker 1: You had the hype, the open-source creed, but all you planted was a mutant seed.
Speaker 1: I'm back to Flux, back to SDXL, leavin' Chroma in its low-res hell.
Speaker 1: Word.
>>
File: ComfyUI_WAN2.2_00068.mp4 (3.4 MB, 568x1008)
3.4 MB
3.4 MB MP4
>>106520197
Man fuck Qwen, too much fidgeting to get shit right, I can gen videos by the time I finish with Qwen. I want to like it, not letting me.
>>
>>106518757
You are a subhuman kike
>>
File: 1728726922441434.png (477 KB, 800x528)
477 KB
477 KB PNG
>>106520132
>>106520140
Yeah it's Flux with a couple lora's trained on GZDoom and Lands of Lore 2 screenshots.
>>
what new image model would you recommend if I have two 3060 (12 GB each)? preferably something that fits the entire transformer/unet in one, and TE on the other
>>
>>106520276
>lora's trained on GZDoom and Lands of Lore 2 screenshots
would you mind sharing?
>>
>>106520292
Hmm. Can I recommend SD 1.5? Or perhaps a pencil?
>>
is there a retard proof guide for vibevoice?
>>
I'm trying to use the chroma gguf on SwarmUI and get this
>No backends match the settings of the request given! Backends refused for the following reason(s):
>- Request requires model 'chroma-unlocked-v11-Q8_M.gguf' but the backend does not have that model

plugin to read gguf files is installed, wat do
>>
>>106520304
what are you struggling with
>>
>>106520310
just getting it all downloaded and set up
like i said... a retard proof guide.
>>
File: 1734815322592613.mp4 (1.11 MB, 800x608)
1.11 MB
1.11 MB MP4
>>106520298
Here
>>104447387
>>
>>106517848
I saw a /pol/ post showing her bedroom and it had a BLM poster in it, lmaoooooo
>>
>>106520299
nope. I ran that years ago. have used SDXL and Flux on these.
>>
File: ComfyUI-img-_00017_.png (1.47 MB, 968x1240)
1.47 MB
1.47 MB PNG
>>106518942
what model? most models often have perfect circle bimbo breasts. i hate it
>>
>>106520211
>>106520223
I need more funny scripts
https://vocaroo.com/1m2iPtANNeEO
>>
>>106520317
I'm pretty sure vibe voice is retard proof. I've never had a more simple input and go model.
>>
>>106520359
Not him but I can't find the 7B model anywhere
>>
>>106520364
https://huggingface.co/aoi-ot/VibeVoice-Large/tree/main
>>
>>106520366
How do you combine all 10 safetensors files?
>>
File: ComfyUI_WAN2.2_00070.mp4 (3.64 MB, 568x1008)
3.64 MB
3.64 MB MP4
>>106520238
>>
>>106520382
The model loader knows how to load the shards in the correct order. There is no reason to combine them.
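For the curious, this is roughly how sharded checkpoints get resolved. It assumes the repo follows the usual Hugging Face convention of shipping a `model.safetensors.index.json` whose `weight_map` maps each tensor name to the shard file that holds it; I haven't checked that this particular repo does, and the file names in the usage bit are placeholders.

```python
import json
from collections import defaultdict


def tensors_by_shard(index: dict) -> dict:
    """Group tensor names by the shard file that stores them."""
    groups = defaultdict(list)
    for tensor_name, shard_file in index["weight_map"].items():
        groups[shard_file].append(tensor_name)
    return dict(groups)


if __name__ == "__main__":
    # Placeholder path; point it at the index file inside the cloned repo.
    with open("model.safetensors.index.json", encoding="utf-8") as f:
        index = json.load(f)
    for shard, names in sorted(tensors_by_shard(index).items()):
        print(f"{shard}: {len(names)} tensors")
```

Since every tensor is looked up by name, the shards never need to be merged into one file.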
>>
>>106519810
already known by the model?
>>
>>106520382
make a folder in comfyui models folder called TTS, then another called VibeVoice. git clone that entire repo in there.
>>
https://voca.ro/140cqYgjIcmS

Vibe voice lets me have infinite star trek TNG radio dramas.
>>
>>106520387
catbox?
>>
I use AI for Goon and it bothers me that Chroma puts male hands, feet and hips on women in 3D, but also in 2D. Sorry, Chroma but I'm not going to use you.
>>
File: 93b.png (254 KB, 900x806)
254 KB
254 KB PNG
>>106520415
Chroma is trash, even 8gb SDXL models are better for most things, whats even the usecase for it?

>inb4 n-no you ju-
I ran the biggest models on a 5090, it's just bad, it only excels at placing text wherever I tell it to
>>
>>
File: 1755141784024647.mp4 (2.11 MB, 800x608)
2.11 MB
2.11 MB MP4
>>
>>106520410
LMAO
>>
>>106520431
it has zero aesthetic tuning or post-training, just a raw base model. It'll need another finetune to actually be useful.
>>
>>106520410
Part 2.
https://voca.ro/193mQFFkIlCs
>>
>>106520490
Goddamn these are so damn good. I thank the jeet that messed up and got us this model.
>>
>>106520410
>>106520490
Laughed so hard
>>
>>106520490
I appreciate how Piccard grows more and more annoyed as it goes on
>>
>>106520490
lmao he sounds genuinely angry about it
>>
File: 1729610183950381.mp4 (1.78 MB, 800x608)
1.78 MB
1.78 MB MP4
A bit lewder than intended

https://files.catbox.moe/lak71i.mp4
>>
>>106517879
Lol it really is /g/eet now.
>>
>>106520560
kek
>>
>>106520431
Chroma is shit and the anti chroma schizo was right.
>>
File: 1751355762410095.jpg (568 KB, 1284x1277)
568 KB
568 KB JPG
>>106520560
oh yeah /g/ janitors are all saars now
>>
File: 1733713554663451.png (99 KB, 808x268)
99 KB
99 KB PNG
>>106520560
>>106520637
tv too
>>
>>106520651
>your IP is: do not redeem saar
kek
>>
>>106520431
Chroma is structurally broken, Lodestones is struggling with what would be SDXL's mangled fingers and hands.
>>
File: 1727524263231464.png (984 KB, 1080x1009)
984 KB
984 KB PNG
>>106520630
>Chroma is shit and the anti chroma schizo was right.
this, if you're reading it anti chroma schizo, I apologize
>>
are there any new ForgeUI forks or is reforge still the GOAT?
>>
New chink stink model wen?
>>
>>106520703
https://github.com/Haoming02/sd-webui-forge-classic/tree/neo
>>
>>106520431
>whats even the usecase for it
i2i into wan
>>
>>106520842
>animated trany hands feet and hips
Never
>>
>>106520778
doesn't that have a memory leak issue?
>>
>>106520980
No, these are Comfy shills. Only /ldg/ uses Comfy; all the other diffusion threads use some Forge branch
>>
>>106520667
Probably pro and anti were the same schizo with the only objective of shilling chroma
>>
File: ComfyUI_00618_.png (627 KB, 1280x800)
627 KB
627 KB PNG
>>106518433
>who dis and how she died?
The legal system failed her. Justice is no longer color blind in the US.
>>106518757
>i don't understand why this thread even cares
It's okay to be white
>>106518773
Chroma HD and she should appear without trigger but Iryna Zarutska
>>106520323
>I saw a /pol/ post showing her bedroom and it had a BLM poster in it
Black lives do matter. It's also okay to be white.
>>
>>106520410
kek
>>
>>106520989
>No
I'll give it a download then, thanks. I think the memory leak thing was with Wan anyways, which I don't really care for.
>>
>>106521013
>be hohol shluha
>be immigratings to america
>be hanging blm poster in room like good american retard
>be stabbed by black
XAXAXAXAXA
>>
>>106520976
i said i2i retard
>>
>>106520660
SDXL's fingers and hands are way better than anything I got out of chroma
>>
>SDXL better in every way
Ok, then please spoonfeed this retard. What's the SDXL equivalent for photographic lewds?
>>
>>106521234
Well, I'd say they have their shortcommings, but a new one that I like is called FinalCut(SDXL) 13GB one, others like pornmaster pro can be good at times, I'd say they all struggle in generating cum, and will have something here or there you dont like, bu well, Loras are a thing.
>>
>>106521013
that's a dem voter kneeling in front of BFL, she got to experience the diversity in 4K, kek
>>
Why can't burgers ever shut up about their dogshit politics
>>
File: 00003-399027143.png (1.52 MB, 1344x768)
1.52 MB
1.52 MB PNG
>>
>>106521526
That's a very, uhh, interesting structure you have there
>>
File: ComfyUI_00633_.png (1.01 MB, 1024x1024)
1.01 MB
1.01 MB PNG
>>106521059
I blame the judge who let him out. The black people I live around are normal hard working people who say hello and use the sidewalk
>>106521438
>that's a dem voter kneeling in front of BFL, she got to experience the diversity in 4K, kek
A refugee dem voter? What makes so many anti-democrats have fever dreams of self-validation? That being said, it's a shame Elon didn't bolster the Libertarian party, instead of making his own
>>
>>106521585
>chroma lora
what a waste
>>
File: ComfyUI_00043_.png (1.41 MB, 768x1280)
1.41 MB
1.41 MB PNG
>>106521585
Why didn't you save her?
>>
>>106520404
lora
>>
>>106521562
>interesting structure
... and what do you see, Dr. Freud?
>>
>>106520410
holy kino. netflix is fuuuucked.
>>
>>106520507
>I thank the jeet that messed up and got us this model.
ahah yes, "messed up", totally accidental ahah
>>
>>106520490
>https://voca.ro/193mQFFkIlCs
>imagine spending 150k dollars just to endlessly clowned on and ending up as a funny meme on /ldg/
money well spent I guess kek
>>
organic...
>>
File: Chroma_00021_.jpg (891 KB, 1344x1728)
891 KB
891 KB JPG
>>106520410
>>106520490
damn great
>>
File: 00026-2108783866.jpg (1.42 MB, 2048x2480)
1.42 MB
1.42 MB JPG
So I came down from my cave to test chroma
>Needs a finetune for artist styles
>seems to know the same foundational "anchor artist for styles"
>Needs high steps to make good stuff to the point my 5090 takes 12-15 minutes once high res fix is used
I need to make some loras but the process annoys me and my eyes glaze over
>>
File: stares_in_Japanese.jpg (53 KB, 500x473)
53 KB
53 KB JPG
>>106522070
>my 5090 takes 12-15 minutes once high res fix is used
who is this model for exactly?
>>
>>106522073
With the lighting model I think it's fine. The problem is still the style swing: you can force a general style, but that's about it. The model is easy to train for loras, so I just need to stop being stubborn. One trainer would be great if it wasn't broken on forge, so now I need to learn easy lora trainer or whatever is in the OP. It listens well and I don't need regional prompt; it just lacks detailed information on characters and styles, which is expected for a base model. If you have the hardware this is the model to use once it gets improved.
>>
I want to use WAN2.2 I2V. I have an image of a person where their eyes are closed and their head is turned slightly, obscuring some key details about their facial features.

Is it possible to use an image as a reference and have the AI use that reference image to "fill in" the details of the face? Or is that simply only capable with WAN2.1 VACE? Can WAN2.2 Fun control do anything like that?
>>
>>106518536
>>106518552
Not to pollute the thread with my own obsessive mecha nonsense, but this slop just made me think: Why in the fuck don't they slap beam bayonets in Gundam rifles? You'd think it would be faster than pulling out a beam saber.
>>
>>106522070
What's your steps/sampler/scheduler?
>>
>>106522207
Euler a 100 steps DIMM
>>
File: ComfyUI_00640_.png (1.05 MB, 1024x1024)
1.05 MB
1.05 MB PNG
>>106521621
>chroma lora
>what a waste
I'm not doing SDXL, but do you have a model request?

>>106521647
>Why didn't you save her?
All I could save were her jpegs
>>
>>106519712
I built a few different massively overcomplicated dynamic nested prompt stacks that I can just click “generate forever” on and get good variety. Put them all in a text file and use that text file as a wildcard lol, then I just have to prompt “__prompts__” and the 1girl gacha goes vroom
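A minimal sketch of how that kind of nested wildcard expansion can work, in case anyone wants to roll their own. The `__name__` syntax matches the post; everything else (the dict of lists standing in for the text files, the depth limit, the function name) is made up for illustration and is not how any particular extension implements it.

```python
import random
import re

# Matches tokens like __prompts__; names here are limited to letters,
# digits, and hyphens so the double underscores stay unambiguous.
WILDCARD = re.compile(r"__([A-Za-z0-9-]+)__")


def expand(text: str, wildcards: dict, rng: random.Random, max_depth: int = 10) -> str:
    """Replace each __name__ with a random entry, repeating for nested wildcards."""
    for _ in range(max_depth):
        expanded = WILDCARD.sub(lambda m: rng.choice(wildcards[m.group(1)]), text)
        if expanded == text:  # nothing left to expand
            return expanded
        text = expanded
    return text  # depth limit hit; may still contain wildcards


if __name__ == "__main__":
    # Each list stands in for the lines of a wildcard .txt file.
    wildcards = {
        "prompts": ["1girl, __hair__ hair, __place__", "landscape, __place__"],
        "hair": ["red", "silver", "black"],
        "place": ["beach", "library", "rooftop at night"],
    }
    rng = random.Random()
    for _ in range(3):
        print(expand("__prompts__", wildcards, rng))
```

An unknown wildcard name raises a KeyError here, which is probably what you want while debugging a big prompt stack.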
>>
>>106520335
Just the stock i2v on comfy
>>
>>106522070
>5090 takes 12-15 minutes once high res fix is used
lol
>>
>>106522070
Post your workflow. Must be some abomination.
>>
>>106520980
So does comfy and people still seem to love it
>>
>>106522395
>workflow
I don't use comfy. High res fix is the main reason for the time loss, even at 30 steps using 4x ultra sharp 2, which you should do with this model since it improves quality drastically
>>
>>106522342
It’s fucking insane isn’t it? Here I am taking 25 seconds for my “highres fix” images and that feels unbearably slow.
>>
File: ComfyUI_00707_.png (2.01 MB, 1328x1328)
2.01 MB
2.01 MB PNG
>>
https://civitai.com/models/1938784
p good lora if anybody likes the style
>>
File: ComfyUI_temp_zupkv_00001_.png (3.91 MB, 1728x1152)
3.91 MB
3.91 MB PNG
The fuck is this vehicle lmao.
>>
>>106522622
does qwen nunchaku work with loras? also what is up with this "just the nips" lora on civit. what the fuck
>>
ooo-eee-ooo
>>
>>106522697
not sure i've never used nunchaku or the nips lora
>>
>>106522697
Someone posted that a few threads ago it’s amazingly bad lmao. Like most Lora’s on civit probably 50/50 of just shoddily done or that guys personal fetish
>>
File: Qwan_00021_.jpg (621 KB, 1984x2976)
621 KB
621 KB JPG
>>106522697
>does qwen nunchaku work with loras
Not yet, they're working on it.
>>
File: 1730636011402983.png (41 KB, 1098x382)
41 KB
41 KB PNG
>>106520410
i can't listen to it :(
>>
>>106522689
doesnt look weird to me but im not a vehicle guy
>>
>>106522860
same, I'm not a car guy
>>
>>106522718
What do you mean, anon?
>>
File: 00084-3389367029.png (1.5 MB, 1080x1576)
1.5 MB
1.5 MB PNG
>>106522718
>>106522901
Kuwabara kuwabara
>>
File: 1753975263558800.webm (3.65 MB, 720x938)
3.65 MB
3.65 MB WEBM
>>106517536
>Love you baby
This triggered me
>>
I feel Qwen too has a "sameface" problem
>>
>>106522070
>Needs high steps to make good stuff to the point my 5090 takes 12-15 minutes once high res fix is used
retard alert
>>
>>106523001
What model did you use to make this trvke its bretty gud
>>
>>106523102
Post your work (You won't)
>>
that voice shit is way too good keke
>>
>>106523114
I don't use chroma you retard. That shitter taking 12-15 minutes to gen and hiresfix is moronic.
>>
>>106523160
take your mental illness and go
>>
>>106523160
don't speak that way about ron
>>
>>106523107
It's from here ( >>>/pol/515074384 ), quite brutal thread
>>106520529
I love these retro aesthetics
>>
When ready

>>106523197
>>106523197
>>106523197
>>
>>106523161
>projecting
>>
>>106522689
ISIS rocketlauncher tank


