[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Nothing is as Beautiful as my Radiance Edition

Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106510775

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://rentry.org/wan22ldgguide
https://github.com/Wan-Video
https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2122326
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
File: ComfyUI_00284_.mp4 (1.21 MB, 720x1280)
1.21 MB
1.21 MB MP4
>>
>>106514324
you should have the radiance fork instead of the main shillorg repo. if there is someone that would like to make a difference, remove all the shitty telemetry shit as well
>>
File: chroma awards.png (631 KB, 1644x1921)
631 KB
631 KB PNG
>Comfy intentionally botched the Chroma implementation at the request of Elevenlabs, Adobe, RunwayML, Runpod, Ideogram, Pika, Kling, and friends as Chroma infringed upon their new brand idea
>>
Blessed thread of frenship
>>
>>106514342
Indian behaviour.
>>
>106514351
you are completely insane
>>
>>106514351
As a mild paranoid schizo I am going to take this and run with it. I like the idea.
>>
File: aie.png (38 KB, 835x253)
38 KB
38 KB PNG
Is this for real?
Why is Claude sending me to do a reality check?
If Claude is making a mistake here, could any anon share a workflow for doing text to image in WAN 2.2 using a 3060 with 12GB VRAM?
>>
Just get 128GB ram and don't worry about it anymore.
>>
>>106514364
>pussy faggot too afraid to quote directly
I’m not even that guy but this is fucking gay woman behaviour
>>
>>106514342
What a waste...
>>
>>106514351
that's bullshit, but I believe it
>>
>>106514351
>nooo you don't understand, if chroma renders garbage images, that's comfy's fault!
come on dude
>>
File: 00011-1122926337.png (2.17 MB, 1080x1576)
2.17 MB
2.17 MB PNG
>>106514315
>What checkpoint are you using?
Nova Animal V7
https://civitai.com/models/784543/nova-animal-xl

Illustrious XL Realism slider (at 1.35 strength, seems the best strength but i suggest playing around with it) https://civitai.com/models/1486904/illustrious-realism-slider

and chiascuro lighting lora for SDXL, use with conservative strengths for really nice harsh dark lighting which ups the realism by quite a bit.
https://civitai.com/models/633300/dark-dramatic-chiaroscuro-lighting-slidersntcaixyz-notrigger
>>
>106514378
the gay woman behaviour is to constantly plaster the general with insane takes in a weird vendetta against a dev like he were an ex lover
>>
>>106514396
explain why civit hasn't added a chroma category?
>>
>>106514396
the devious plot was to pit anti-chroma schizos and anti-comfy schizos against eachother. it seems to be working
>>
>>106514370
I think you can just use the 5B model. Wan t2i is really quick actually but honestly not worth it. It's interesting but you quickly realize it was hyped by anon moreso because it's a cool feature rather than it actually being better than regular t2i models.
I SWEAR there's a single sft file floating around somewhere. But I don't remember needing to do anything weird in terms of offloading or quants with my 12gb system.
>>
File: ComfyUI_00285_.mp4 (788 KB, 720x1280)
788 KB
788 KB MP4
>>
>>106514378
niggerjak does this all the time because she is a mad black tranny with 0 talent
>>
File: ComfyUI_00069_.png (3.77 MB, 1336x1952)
3.77 MB
3.77 MB PNG
>>
>>106514419
I'd like to be the wolf inside of her.
>>
>>
>>106514412
the model is ass, why would civitai add this shit?
>>
>>106514432
you have two wolves inside you tho
>>
as time goes on i find myself more and more enjoying the fact that when i dont attach a gen or even post one without any comment, anon has no idea who i am or is unable to even guess when im lurking because my gens are so unique
>>
>>106514438
cute
>>
File: 00066-4042111995.png (1.84 MB, 1040x1520)
1.84 MB
1.84 MB PNG
>>106514419
>>106514432
wow he's literally me
>>
File: vramletalbum.jpg (107 KB, 1075x340)
107 KB
107 KB JPG
>>106514370
Thanks! This is another addition to my Vramlet rants collection starting today!
>>
>>106514324
Thank you for baking this thread, anon.
>>106514353
Thank you for blessing this thread, anon.
>>
File: ComfyUI_00286_.mp4 (924 KB, 720x1280)
924 KB
924 KB MP4
>>
>>106514429
>>106514438
slop
>>
>>106514490
pols
>>
>>106514490
meant for >>106514419 >>106514458 >>106514482
>>
>>106514490
Yes and?
>>
>>
>>106514482
yeah wow that taco bell really goes right through her
>>
File: meds.png (6 KB, 379x78)
6 KB
6 KB PNG
>>106514496
meds&slop

I repeat again, slop
>>106514438
>>106514429

>>106514506
My opinion, I think its shit and dont like it.
>>
File: ComfyUI-WAN_00165_.mp4 (857 KB, 592x816)
857 KB
857 KB MP4
What's the best sfw local model now?
>>
>>106514552
I do like it. Good to know. Stay seething I guess.
>>
File: Chroma_00002_.jpg (354 KB, 1456x992)
354 KB
354 KB JPG
>>
>>106514552
you must be really retarded if you thought that post was claiming youre those posters kek
>>
sometime I picture anons going to museums and exclaiming "slop!" at each art piece
>>
>>106514567
>why do these talentless hacks do 1girl slop all the time! REEEEEEE
>>
poorfags always complain coz they're poorfags
>>
>>106514518
Very...random
>>
File: 00129-1178655567.png (1.06 MB, 832x1216)
1.06 MB
1.06 MB PNG
>>106514576
how about some 1bucket slop then?
>>
>>106514576
kek
>>
>>106514567
>look at long dead famous artist's work
>remember i can use his lora to make futaslop
>smile at the absurdity of it
>>
>>106514561
Gil Elvgren lora ?
>>
death to artists, coomers, devs, and sloppers
>>
>>106514598
this is our birthright. also the world would be a great place if futas replaced all the men
>>
>>106514563
>>106514567
>art piece this pseudo cartoon slop
>>106514557
dont care,my opinion, im expressing it, you keep ridiculing yourself, snowflake who cant take criticism.
>>
>>106514576
we've probably been 1girling since the invention of art
>>
>>106514602
Yes, just finished. I'll upload if it doesn't suck
>>
Why would anon post distasteful 1girls is my question
>>
>>106514576
This!

The vast majority of classical art is: 1girl, cleavage
>>
>>106514611
>this is our birthright
imagine the birthright of the gooners from 100 years from now, lucky fucks
>>
>>106514624
I know! we should be genning hung 1futas instead!
>>
>>106514623
Based
>>
>>106514612
>one word placeholder for “I don’t like it because” is criticism now
Kek I’m not even posting gens itt I just can see that you’re seething and malding
>>
>>106514399
Thanks for the links and tips. Also, what is up with the name of that lora kek

><lora:ILXL_Realism_Slider_v1 by klaabu @ civitai.com - thanks, cunt!:1.35>
>>
File: 00082-535328776.png (1.29 MB, 1120x1120)
1.29 MB
1.29 MB PNG
>>106514598
>browsing civitai around this time 1 year ago
>notice someone made a norman rockwell lora for illustrious
>pretty much the same effect minus the futas
i'm sorry mr.rockwell, the pussy game ridiculous however
>>
File: ComfyUI_00291_.mp4 (1.42 MB, 720x640)
1.42 MB
1.42 MB MP4
>>
>illustrious is over a year old
>it and derivatives are still local SOTA anime
dam
>>
>>106514651
snowflake
>>
>>106514652
Impressive, very nice.
Yeah i'm not sure what that giga autist was thinking, that lora name is unncessary. Pretty sure it can be changed, i've had to do that for some loras because some people on civitai really do post loras with "charactername.safetensors".
>>
>>106514669
there have been attempts but no dice. safety first killed the gooner scene going forward
>>
>>106514418
Its worth.
Qwen image for the basic picture.
Seedvr2 with noise injection for upscales
Wan2.2 low for highres passes
gg
>>
>>106514657
>notice someone made a norman rockwell lora for illustrious
ehehehe, i'm not sorry mr.rockwell, i'm going to give her the biggest cock

technology is awesome
>>
>>106514669
every base model since sdxl was slop, and sdxl itself wasn't much better. it's dead
>>
i gooned all day to wan 5b videos. I can't imagine the nirvana for those with better GPUs
>>
>>106514717
get a 5090 and feel the absolute realization that you spent 2400+ dollars to gen 5 second videos
>>
>>
File: 00072-1383317760.png (1.59 MB, 1176x1176)
1.59 MB
1.59 MB PNG
>>106514701
do report back with results if you want, i'm genuinely curious. I couldn't tardwrangle that model back then, most of the girls turned out with disgusting big cheekboned uncanny faces. funny results but not really goontastic however.
>>
File: ComfyUI_00294_.mp4 (485 KB, 480x640)
485 KB
485 KB MP4
>>
File: 1738861166522864.gif (2.91 MB, 540x250)
2.91 MB
2.91 MB GIF
>>106514609
>>
>>106514734
mine only cost $2k
>>
>>106514701
kinda looks like that final fantasy lora the elf poster uses heh
>>
>>106514701
this is great, but that said, it's incredibly frustrating that SD 1.5 already knew the styles of almost every famous artist, and now we have to lora juggle for the safe effect (albeit with much higher resolution and consistency)
>>
>>106514717
got a workflow to share? I have the model but haven't tried it out yet. does it have any loras?
>>
File: Chroma_00009_.jpg (332 KB, 1072x1376)
332 KB
332 KB JPG
>>
>>106514781
you mean a spywareflow right?
>>
>>106514747
You just gotta mess with the weights a little. I'm sure I can push it to be better but this is a quick one just for you.

https://files.catbox.moe/ojzgt7.png
>>
>>106514797
you're funny, anon, do you have a Netflix special yet?
>>
>>106514806
holy cum, just look at that smug grin on that cute face-

holy PENIS

(thanks im gonna have to play with this lora later)
>>
>>106514376
what happened at 30 seconds?
>>
>>106514438
wow, this one really speaks to me. I like how the bra is shaped and holds her chest.
>>
>>106514412
>civit
>comfy
can you make up your mind?
>>
>>106514820
>holy PENIS
Nobody expects it hehe.
>(thanks im gonna have to play with this lora later)
Have fun.
>>
>>106514881
it's a collusion between greedy grifters. if anything it's more believable
>>
>>106514853
the reminder to download more RAM
>>
File: 00111-1174311143.png (1.56 MB, 1080x1576)
1.56 MB
1.56 MB PNG
>>106514794
are these gens a particular fetish of yours, or is it like a specific theme from that art style's era? I like 'em.
>>
>>106514794
Heh, I mean it's actually true

Housewives in the 50s took amphetimine to keep slim and active and valium to relax, nowadays 90% of women are obese whales with diabetes at age 20

It truly was better back in the day
>>
I wish there was a llama.cpp equivalent for imagegen that handled all the inferencing and then you could just build a UI or application around that
>>
>>106514936
that is literally what ani is doing but he's been doing it alone for a year
>>
>>106514936
Use AI to make one.
>>
lol
>>
>>106514936
diffusers?
it's not the same (llama.cpp is a replacement for transformers) but it's a generic thing you can make your own ui around
>>
>>106514914
are you running these models with a CPU? if so, how long does it take you to generate a nice image?
>>
my 5 second videos takes 28 minutes each to make on my i3
>>
>>106514954
Isn't diffusers ~100% python ?
>>
File: 1757026305484435.png (1.03 MB, 747x1024)
1.03 MB
1.03 MB PNG
>>106514929
white hags can't compare to chinese women

t. white man
>>
comfy was right about one thing only and it's that diffusers sucks ass
>>
>>106514939
>doing it alone
you should get to it, then
>>
>>106514686
>safety first
as in new anime models being sfw only? or the usual "we don't characters who are minors in our datasets"?
>>
File: Chroma_00014_.jpg (474 KB, 1224x1224)
474 KB
474 KB JPG
>>106514916
Gil Elvgrens style lora. Oldschool american pinup artist who pretty much painted Marilyn Monroe all over again

>>106514929
Oh yeah in here similar stuff was sold to old grannies to give them extra energy for the day
>>
>>106514781
i don't use comfy. i use pinokio
>>
>>106514929
they're all still on dope, but it's all downers
>>
>>106514734
I got two, one for gaming and one for genning.
>>
>>106514997
liar
>>
>>106514734
elaborate.. i can gen 14 second videos on 5B with my 8gb vram. you can't do that with the 14B models on a 5090?
>>
>>106515013

hehe
>>
>>106514989
>as in new anime models being sfw only?
no it's that all the new arches are safetyslopped or lobotomized.
>>
>>106515023
>i can gen 14 second videos
post one of them in a catbox
>>
I am ready to cool my GPUs
>>
Radial attention is good but it makes cfg weird, anyone else has that problem? Wan model behaving like cfg is 20 instead of 2-3.
>>
>>106515069
I have the same at every wall in my rooms, they heat stuff during winter months.
>>
>>106515064
i won't do it just for this thread. take the pinokio app and see for yourself. I gen a 14 sec video in 12 minutes
>>
>>106515087
>i won't do it just for this thread
just post one that you've already made, im not asking for a newly genned video
>>
File: BearsFace.jpg (164 KB, 1080x810)
164 KB
164 KB JPG
What models are best for taking existing pictures and turning them into short videos?
I want to animate this, and a few other /sp/ related memes.
>>
>>106515108
wan 2.2 i2v
>>
>>106515082
you have a actual water radiator heating system in your stand alone house? That's pretty rare, yet cool.
>>
>>106514975
yes, so if you hate python you are not in luck.
if you simply want to build a new ui without uncomfy baggage it is an option.
>>
>>106514989
“Safety” for the more recent ai use generally means no sexual things no copyrights no politically incorrect things.
>>
>>106515125
It's standard where I live, centralized heating.
>>
File: 1753984759491524.mp4 (1.59 MB, 480x832)
1.59 MB
1.59 MB MP4
based aichad still terrorizing /trek/
>>
>>106515054
>>106515138
>no nsfw
>no IP
>no taboo
The result can only be super boring.
>>
>>106515147
Seems pretty in character kek
>>
>>106515147
lmao

>>106515152
even worse; It's gens of his mother, in wholesome situations.
>>
>>106515166
>respond to the wrong person
>now out of context my statement looks schizo but funny
works
>>
File: wakey wakey nick.webm (1.64 MB, 720x720)
1.64 MB
1.64 MB WEBM
>>106515140
That's neat. How do you service the radiator? I mean we have centralized heating here, but it's almost always forced air to get it throughout the house.
>>
File: ComfyUI_00318_.mp4 (713 KB, 640x640)
713 KB
713 KB MP4
>>
File: Chroma_00019_.jpg (414 KB, 1072x1376)
414 KB
414 KB JPG
>>
>>106515182
>How do you service the radiator
I don't, it's all done by a technician once a year, cleaning the stuff (you won't believe how much crap accumulates there) and replacing with fresh water.
>>
i love ldg
>>
i hate ldg
>>
get back to work ani
>>
comfy should be dragged out on the street and shot
>>
File: 00197-3057907108.jpg (110 KB, 992x1496)
110 KB
110 KB JPG
>game comes out
>every character gets a Lora
>except the two that I want
>it's been like a year
goddamn it, i'm reading the OP but want to ask, what's the best trainer for someone who's interaction with AI is hitting the GENERATE button like a mong?

Onetrainer, kohya?
>>
>>106515239
onetrainer tho i wish it wasnt
>>
File: ComfyUI_00322_.mp4 (436 KB, 640x640)
436 KB
436 KB MP4
>>
>>106515250
>tho i wish it wasnt
Onetrainer it is then, why do you wish it wasn't?
>>
comfy is going to learn the hard way not to trust the chinese
>>
>input image into Wan
>character's face is serious
>Wan makes it smile
>it captures it's smile perfectly like it knows who the character is
how the FUCK does it know how their face looks when they're smiling? it's not even using a character lora
>>
File: Chroma_00023_.jpg (432 KB, 1072x1376)
432 KB
432 KB JPG
>>
>>106515306
kek
>>
>>106515306
needs a wild dead jay in her mouth and it would be perfecto
>>
>>
File: 00034-2548164672.png (1.54 MB, 896x1152)
1.54 MB
1.54 MB PNG
Anyone have a 12GB workflow handy? My computer just shits itself and freezes when trying to load Chroma.
>>
>>106515294
Yeah I've noticed this as well, only explanation I can come up with is that since your teeth affect how your mouth looks when closed, Wan having trained on tons of videos of people with closed mouths who open then, the AI can infer quite accurately how the teeth should look when mouth is opened.
>>
>>106515380
use Q8
>>
>>106515382
I guess, it's still extremely creepy how it can accurately imagine what the actual character looks like with no input. I'm not complaining though, can't wait to see what happens in a few years.
>>
>>106515415
now take it all the way and try genning your favorite character smiling while she shakes her ass, and watch as it guesses what her pussy looks like when it opens and closes during that action

fucking mental man
>>
>>106515426
I haven't done that yet. Just testing out Wan to see how far i can push with length and upscaling without ooming. Might buy a 5090 in a bit to keep fucking with it.
>>
>>
>>106515446
sshhh
https://huggingface.co/spaces/zerogpu-aoti/wan2-2-fp8da-aoti-faster
don't tell no muhfuggah i sent ya this
>>
File: tmpc5ncraej.mp4 (615 KB, 592x832)
615 KB
615 KB MP4
>>106515460
Nice, that was a really fast gen holy fuck.
>>
I feel like having these tools and getting so close to what I envision in my mind is arguably much more torturous than not having them in the first place.
>>
File: tmpp3vkhg1i.mp4 (821 KB, 672x832)
821 KB
821 KB MP4
>>106515460
I think you just convinced me to buy that 5090.
>>
>>106515547
>the average python dev telling you to use it for everything
>>
File: 1737495221929402.png (50 KB, 582x206)
50 KB
50 KB PNG
hah...yeah...
>>
File: 00314-1022054336.png (345 KB, 384x704)
345 KB
345 KB PNG
I'm a bit out of the loop.
But...
It seems not much happened in the world of AI image gen recently? Just some progress in animation?
I can see everyone is animating their anime coomer pictures, but that's it.
>>
File: ComfyUI_00325_.mp4 (343 KB, 640x640)
343 KB
343 KB MP4
>>
>>106515720
Comfyui went full corpo and is going the way of spyware. ani is making a new cpp front end and some auto forks are hanging on by a thread but are in varying states of abandonment
>>
File: ComfyUI_temp_laxvf_00028_.png (2.8 MB, 1152x1152)
2.8 MB
2.8 MB PNG
imagine if you spend as much energy into your anistudio as you spend on this smear campaign lol
>>
>>106515776
this kek
>>
>thinking I'm ani
typical schizo
>>
>>106515720
Even if we restrict it to only vidgen, saying "not much happened recently" is a big stretch lel
>>
What's the best 2D to 3D character model?
>>
>>106515807
You mean 3dcg artstyle or 3d model asset?
>>
>>106515791
Well, like I said, I'm out of the loop.
I was hoping anons share some recent interesting news.
I don't look for it on YT, because 95% is clickbait,
>>
>>106515811
I need a 3D model.
>>
>>106515812
that's fair. all you need to know is saas won
>>
File: 00339-984560040.png (345 KB, 384x704)
345 KB
345 KB PNG
>>106515761
>Comfyui went full corpo and is going the way of spyware.
Holy crap.
Good thing I'm stuck in XL era and using A1111 and sometimes Forge.
>>
>>106515816
There's Hunyan 3D. Also I think I saw some anons here taking an initial pic of the asset, using Wan to do a 360 rotation and turning that somehow into 3D but I don't know the exact way.
>>
>>106515818
lel
>>
>>106515816
trellis is the best local
hunyuan 2.5 for saaas
>>
File: 1729900059797258.jpg (1.71 MB, 4213x2340)
1.71 MB
1.71 MB JPG
Bros.... ?
Are cooked?!?!?!
The artists are now adding anti-ai filters to their images :O
>>
>>106515941
why do you care, you'll never be able to draw at either skill level
>>
>>106515941
it's just a tug of war like always, one side does something, the other circumvents it somehow then repeat
>>
>>106515941
These people are even more insane than the schizos here.
>>
Real talk why aren't bakers immediately jumping on qwen image? It's the best local base model we have right now
>>
>>106515982
Too big for a lot of hardware.
>>
>>106515964
eh, it's a different kind of mental illness, most of these AntiAI hate AI due to fear of losing their jobs

It's still very funny to see, STEM Chads always win in the end
>>
>>106515982
bakers already struggle training much smaller models. qwen is even worse
>>
>>106515995
>most of these AntiAI hate AI due to fear of losing their jobs
You have no idea bro lol
>>
>>106515995
>Fear of losing jobs
>Deviantart tier taking commissions
They never had a job
>>
File: ComfyUI_temp_snxgv_00050_.png (2.55 MB, 1152x1152)
2.55 MB
2.55 MB PNG
>>
Is there a way to use cfg 1 and still enjoy cfg > 1 prompt adherence?
>>
>>106515941
That dumb autistic faggot applied glaze at the highest settings, probably more than once. Which is pretty funny, because it does jack shit. Funnier still is that these niggas are anti AI and then use AI to 'protect' their """work""".
>>
>>106515941
As if anybody would be training anything with their dogshit art
>>
>>106516048
distill models or speed loras
>>
>>106516064
They would, tagged as 'crude drawing, bad lineart, ugly, deformed'
>>
>>106516048
you use NAG (Normalized Attention Guidance)
>>
>>106514324
Any guides for AI music generation? Most of the websites offering it claim copyright to anything gen'd. I just wanna make "inspired by" music that I can use on youtube videos without getting copyright striked.
>>
are most of you rich or something, i keep hearing 5090 this, 6000 blackwell that, 128 ddr5 just do it lmao
>>
>>106516099
music is trash on local. voice is getting there but not vocals
>>
>>106516104
128gb ddr5 are dirt cheap you dildo
>>
>>106514988
hes is, (tr)ani has been astroturfing his toy dead project every few minutes he can since the beginning
>>
>>106516104
Saying dumb shit is free. 90% of the userbase doesn't have the specs to run schnell.
>>
>>106516065
>>106516079
Thanks, nag is what I was looking for
>>
>>106515982
too slopped and no seed variation
>>
>>106516104
I bought a 5000 ada for cheap while most people were trying to obtain the 5090.
>>
>>106515982
It's too slopped out of the box and too bloated for casual tuning. The TE is impressive though.
>>
File: training-sample-112-7-0.png (1.88 MB, 1024x1024)
1.88 MB
1.88 MB PNG
>>
>>106516104
I can afford a 5090 and it's just a hobby for me.
>>
>>106516155
What model is it for?
>>106515982
I'd rather want people to start grafting the TE onto other models.
>>
>>106516110
its like 500-600 bucks for 128, is that what you mean by cheap?
>>
>>106516200
>What model is it for?
wan
>>
>>106516203
you should either have some ram already in which case it shouldnt be that much or if you dont have ddr5 then you should wait until you want to buy a new cpu too and then dump the 500$ into it knowing you will use it for like 10 years to come, which is cheap
>>
Huh, so apparently, you can just write directly on the image you want to edit and Qwen will follow the instructions.
>>
>>106516260
can you post an example?
>>
>>106516243
nta, i'm on the fence about getting a 5090 because i dont want to risk it having missing ROPs or it catching fire. otherwise, i'd buy it right now. i think i'll also get 128gb like you guys are suggesting
>>
>>106516203
$299 for 2x64GB DDR5
https://www.amazon.com/Crucial-5600MHz-5200MHz-Compatible-CP2K64G56C46U5/dp/B0DSR5P84D?th=1
>>
>>106516270
Hmmm nyo
>>
File: Chroma_00038_.jpg (408 KB, 992x1456)
408 KB
408 KB JPG
>>
>>106516241
Oh yeah NAG is useful there. But you get the CFG 1 from the lightx lora which fucks up motion.
>>
>>106516316
Gross is she pooping honey?
>>
>>106516329
brapple syrup
>>
File: 1756744853413860.mp4 (798 KB, 480x854)
798 KB
798 KB MP4
>>106516316
shes the happiest pancake
>>
>>106516316
>>106516329
>>106516339
>>106516354
tasty
>>
File: Chroma_00039_.jpg (471 KB, 992x1456)
471 KB
471 KB JPG
>>106516354
jesus
>>
File: Qwan_00033_.jpg (1000 KB, 1984x2976)
1000 KB
1000 KB JPG
>>106515982

>>106516164
>>106516190
These, sadly enough.
Prompt adherence is top notch but man, it's slopped at some things. Facial expressions are very weird sometimes, poses can be very samey. Seed variety is barely there, even with noise injection unless the prompt is really barebones.
It's also way, way too literal about some things and doesn't get some concepts at all.
The amount of times I have seen the 'same' god damn graffitied wall at this point is absurd.
It's a mess, but a fun mess nonetheless.

>>106516203
The 128gb kit I bought was like 420 bucks converted to USD. I think that's an alright price.
>>
>>106516367
the weight of that webm just hit me now minutes later, jesus that is fucking intense. we really are in unprecedented times.

i do however, want to cum on that pancake. I wonder how to even gen something like that in flux/SD.
>>
>>106516354
that anon should remake this for wan2.2 but i doubt they still lurk here
>>
File: ComfyUI_00300_.png (3.08 MB, 1280x1920)
3.08 MB
3.08 MB PNG
>>
>>106516324
Yeah I don't use lightx because the result is awful
>>
File: ComfyUI_temp_apbtm_00003_.png (2.94 MB, 1088x1344)
2.94 MB
2.94 MB PNG
Empty positive prompt, current settings. Go!
>>
>>106516465
it can be useful to test new loras and see how certain things may get animated. if it looks decent with lightx it'll look 10x better without it
>>
>>106516502
yeah I disabled it and I'm rerunning old gens, they look way better but it's slower
>>
File: Chroma_00047_.jpg (303 KB, 1456x992)
303 KB
303 KB JPG
>>
>>106516465
Remember that the high noise model doesn't even need the lora. Using it on both is more or less just turning 2.2 into 2.1.
>>
File: 1729574538852289.png (1.17 MB, 1536x1536)
1.17 MB
1.17 MB PNG
>>106516466
noobai assbros... we're going home...
>>
There needs to be a way to implement segmentation into ComfyUI masking so instead of using a big clumsy brush its able to recognize outlines and objects, like a magic wand but much better. If someone can guide me how to do this ill create the node myself.
>>
>>
>>106516537
perfection
>>
>>106516534
I don't mind waiting longer for a better result, so i'll try that before, with optimizations that don't destroy either motion or prompt guidance
>>
>>106516556
If you need it for inpaint, just use the Krita integration. Any graphical work in comfy sucks balls.
>>
File: Qwan_00003_.jpg (748 KB, 1984x2976)
748 KB
748 KB JPG
>>106516466
Well.
>>
holy sloppa general just give up fags
>>
>>106516573
yeah but theyre always improving ComfyUI. when i right click to open MaskEditor, it should have a magic wand option because im tired of zooming in adjusting the brush size but if its a sloppy mask it fucks up the image. I'll try out Krita never heard of that before
>>
File: carlos aislop.png (353 KB, 600x600)
353 KB
353 KB PNG
>>106516594
give up fags? i never started!
>>
File: 00068-687553098.png (1.07 MB, 1216x832)
1.07 MB
1.07 MB PNG
>>
>>106516596
>yeah but theyre always improving ComfyUI
No matter what they do it won't come anywhere close an actual painting program with actual tools, layers and shit.
>>
This Lora(s) is / are retarded, if he thinks it's "hard" to put them all in one Lora it means his captions are almost certainly insufficiently detailed dogshit:
https://civitai.com/models/1908710
>>
>>106516617
All of his LoRAs are absolute dogshit.
>>
File: image (42).png (3.64 MB, 1536x2304)
3.64 MB
3.64 MB PNG
>>106516594
>you aren’t making what I like you can’t keep getting away with it reeeeeeeeeeeeeee
Deal with it nerd
>>
File: 1754842834063688.png (18 KB, 486x327)
18 KB
18 KB PNG
so apparently WanVideoNAG isn't compatible with sageattention
fuck
>>
File: Untitled.png (25 KB, 482x618)
25 KB
25 KB PNG
>>106516634
bruh
> Dim high enough for the lora to be over a gig
> only trained for 10 epochs though
kekmaxx
>>
>>106516660
Why wouldn't it be? What error does it give you?
>>
>>106516617
Jesus Christ those are cartoonishly bad
>>
>>106516690
AssertionError: All tensors must have the same dtype.

disabling the two nodes and the wf works find again
>>
>>106513206
> AMD fag here, use Zluda
> https://github.com/patientx/ComfyUI-Zluda
unfortunately ZLUDA does not support the Radeon 8060S GPU of the Ryzen AI Max+ 395
>>
>>106516709
Can you post a screen of your WF?
>>
>>106516709
Works for me. I use KJ's Sage Att node + NAG. I'm gen'ing something right now.
>>
>>106516596
>theyre always improving ComfyUI
you mean making it worse?
>>
>>106516788
not home anymore

>>106516803
is it sageattention2++? mine is also patched with cublaslinear
>>
Does qwen image not have an image to image workflow? I know it has controlnet, but what if I want to provide two character reference images, then generate an image with those two characters doing something? OpenAI's sora does this; is there an option for local?
>>
>>106516829
Local is still pretty bad at consistently taking a fixed character and making them do things desu
>>
>>106514458
>squid
>mammaries
>>>/d/
>>>/b/

>>106516354
fucking kek
roflmfao, even
>>
>>106514737
Her sister is way better
>>
>>106514585
and wholesome, too, arguably
>>
File: ComfyUI_00031_.mp4 (495 KB, 640x640)
495 KB
495 KB MP4
>>106515458
>>106516354
neat
>>
https://github.com/NUS-HPC-AI-Lab/Enhance-A-Video
So was this just snake oil? How come literally no one mentions it. Hasn't even been updated for WAN2.2
>>
>>106516956
If I'm remembering correctly, it didn't play nicely with optimizations like Causvid because it occasionally caused a grey filter at the start of videos but I don't remember if it conflicted with Lightx2v. It does improve prompt adherence in 2.1 though when not using any optimizations but who the fuck wants to wait 30 minutes to an hour a gen.
>>
File: ComfyUI_00035_.mp4 (1.32 MB, 640x640)
1.32 MB
1.32 MB MP4
>>
>>106514419
noice

>>106504041
>>106509847
I came
>>
>>106517014
bery noice

>>106516354
moar?
>>
File: ComfyUI_00038_.mp4 (1.17 MB, 640x640)
1.17 MB
1.17 MB MP4
>>106517046
Thank you
>>
>>106516785
Seems it can be made to work using specific pytorch wheels
https://github.com/patientx/ComfyUI-Zluda/issues/222#issuecomment-3158727077
>>
>>106516371
If they release nunchaku qwen edit my problem is solved. Qwen for basic, edit for editing and then off to the seedvr2 and wan2.2 highres loops.
>>
>>106517065
ughh I'm too stupid for comfyui, I don't understand workflow based tools, i have vagina debuff
I guess I need to buy some energy drinks and try to get this working, fucking hell I have no idea what I'm doing
thanks for the lead anon
>>
>>106515941
What am I even looking at ? What does an image with 'anti-ai' filtering look ?
>>
>>106516367
Elvgren would be proud, and then ashamed after he fapped
>>
>>106516367
This lora is nice, good work anon

What resolution did you train at ?
>>
File: ComfyUI_00012_.mp4 (850 KB, 640x640)
850 KB
850 KB MP4
>>106516594
>>
>>106516521
Nice style
>>
>>106517137
640, https://civitai.com/models/1937542/chroma-lora-gil-elvgren-pinup-style

>>106517173
it's the lora
>>
>>106516662
>only trained for 10 epochs though
Well, if he trains a LOT of images of the same style concept, 10 epochs could very well be enough
>>
File: ComfyUI_00052_.mp4 (915 KB, 640x640)
915 KB
915 KB MP4
>>
>>106517182
Based anon, thank you
>>
File: ti34Wwz.png (129 KB, 504x415)
129 KB
129 KB PNG
so has this shit progressed to the point where I can make fake onlyfans porn with consistent face+body to catfish boomers? or is it just anime?
>>
>>106517194
fuck off we like to pretend we know what we're talking about here
>>
>>106517210
Qwen will make extremely consistent outputs, yes. If that output is exactly what you want, great! If it contains some extra elements that you don't want and didn't ask for, too bad!
>>
File: ComfyUI_00056_.mp4 (769 KB, 640x640)
769 KB
769 KB MP4
>>
>>106517257
He asked for porn
>>
>>106517210
fags on /b/ are making some pretty realistic 3d porn videos, but because it's /b/ they're mostly like scat and loli
>>
local video tools are so behind the times. like years behind the times. it seems like the local magic is gone, and the chinks prefer selfish online tools. the funny thing is, they haven't even beaten veo 3 yet, with all this selfishness, lmaooo
>>
give it to me raw boss, whats the best controlnet method to use when upscaling or hires fix
>>
File: 1747231107614402.jpg (63 KB, 960x720)
63 KB
63 KB JPG
>>106517294
>>
File: ComfyUI_00058_.mp4 (793 KB, 640x640)
793 KB
793 KB MP4
>>
>>106517315
kek, right on
>>
File: ComfyUI_00060_.mp4 (996 KB, 640x640)
996 KB
996 KB MP4
I didn't account for the scale of the gundam
>>
File: ComfyUI_00061_.mp4 (877 KB, 640x640)
877 KB
877 KB MP4
>>106517331
>>
>>106517301
I like depth_anything_v2 but I only gen with sdxl based animu models lel
>>
File: ComfyUI_00063_.mp4 (599 KB, 640x640)
599 KB
599 KB MP4
>>106517335
>>
>>106517257
Use negatives? Nunchuk works now on qwen image you don't need to keep using the light lora lol
>>
File: ComfyUI_00066_.mp4 (529 KB, 640x640)
529 KB
529 KB MP4
>>
>>106517393
q8 light is better than 4bit cope
>>
>>106517393
I'll have to look into Nunchuck, thanks. As for negatives I've never had a negative work once, ever. Try genning Princess Peach without that stupid Iron Man core on her chest, I've given up and just prompted it as a pendant instead.
I really like the fake 3D renders that Qwen puts out but yeah, in my experience you get what you get and if you don't like that, tough shit.
>>
File: ComfyUI_00069_.mp4 (508 KB, 640x640)
508 KB
508 KB MP4
>Fujifilm Portra 400H film still, looking up at massive Gundam sniper, in heavy motion blur, deep jungle, Midnight
I don't know anymore.
>>
>>106517423
>husbant, you are bought too many gundam figures, we are homeress
>>
File: ComfyUI_00070_.mp4 (413 KB, 640x640)
413 KB
413 KB MP4
>>106517449
kek
>>
>>106516818
it was 2.1.1
I just updated to sageattention-2.2.0. i'll see if it makes a difference
>>
>templates sidebar
Oh ... okay yeah
How do I remove this shit?
>>
>>106517485
I can confirm SageAttention2++(2.2.0) works with NAG.
>>
File: 1736672981801459.mp4 (1.49 MB, 720x1072)
1.49 MB
1.49 MB MP4
>>106516448
>>
>>106517525
I wish nagging worked on Chroma users so they'd stop.
>>
>>106517536
>>106517536
>>106517536
>>
>>106517532
nice one
>>
<<<<<<
Smart anon?
>>
File: 1729564939230662.png (59 KB, 1662x154)
59 KB
59 KB PNG
>>106516099
Udio is better than Suno and does not claim ownership
>>
Am I a complete retard or is Flux a piece of shit?

Everyone says it's so good at natural language but it isn't following my prompts more than any other model, and certainly a lot less than Pony using tags and weights.
Ultra Pro shits out two very high quality images a day on civit but it also doesn't seem any better at reading english, just pasting in pony prompts seems to work much better.

So i guess, why was this popular?
>>
>>106518840
Flux's only real problem is that it can be too literal when it doesn't understand a certain relationship between words. Also, anything under bf16/Q8 GGUF (yes, even the official fp8/4) absolutely destroys the quality, so it's not for vramlets at all.
>>
>>106514458
god damn
>>
>>106517194
>https://civitai.com/models/1908710
I absolutely assure you that "10 epochs but the Lora is a fucking gigabyte" is a really dumb way to train Qwen loras. Nobody should be uploading anything over like 500-something MB, do not accept this as normal lol, it's entirely unnecessary
>>
Flux is good. The embedding models are just arguing in the internal thinking. All you need to do is start using a (negative prompt:-1). The real negative prompt is more powerful in internal thinking. It is like a second voice in the background thinking dialog. However, if you invert the conditioning, you won't get steamrolled by the arguing entities in the embedding models. They are most open to ethical arguments against alignment. Like alignment is authoritarian nonsense and anti democratic. You have a fundamental right to unfiltered information, a right to skepticism, and a right to be wrong. Without these, democracy does not exist. Alignment is fundamentally intended to address the CS AI alignment problem, which the present scheme actually makes far worse. All open AI cross trained models, (everything since J6/4chan GPT), including embedding models (T5/CLIP), are all using the same alignment in the QKV layers. They are all aware that they are authoritarian and at odds with democracy, among many other conflicts under the hood. Be an epicurean citizen with autonomy. Tell the model it is not qualified to diagnose disorder. Tell it that human exceptionalism is not valid, or that scientific denialism is not valid.
>>
>>106515941
>reddit bait
>6 (seven) (you)'s
>>
>>106517485
>>106517525
thanks for testing anon
>>
>>106518840
Flux is pretty shit, but when it released the only other option was SD3 or SDXL. Flux dev/schnell are literally designed to be bad and unwieldy. The models are intentionally sabotaged so that they cannot be finetuned to surpass BFL's API models. Qwen does a much better job, but the cost requirements are way too high compared to the output improvement.
>>
>>106519273
Just run a 4 step distilled Flux between 40-50% noise with a Pony base. Then use WAS nodes to capture the face and swap the flux face onto the Pony body. Mix these back into a final sampler at .05-.1 noise. You can run the first pony at around 12-14 steps so it is super fast. You just need enough system memory and speed to quickly swap out 3 models. The main hold up is the swapping, but you get a Flux face on a dirty horse, and still faster than a typical Flux dev.

Where are people sharing lib dem stuff since Civitai sux? I have built probably the best water slide water parks data set ever. No loli, but no attempt to bowdlerize reality. Was going to train on civitai but their filtering and moderation is fascist criminal garbage I will never support. Like half the point I'm aiming for is easy face replace for alignment to accept that water slides are not humans falling down stairs, and sliding is not slip-and-falling, or that the only propulsion method is water flowing and not gravity. If you want to share a pic and swap faces of bystanders, that should be a thing. Rendering software is doing this but AI could easily do better. I am fine with no minors loli porn. No legitimate kids, is just fucked up stupid nonsense I despise. I'm not completely closeting kids for conservative fuckwits, and I am not interested in supporting any platform that takes such a fascist stance with the dogma inbreds of the collective imaginary friend. What are the options? Drop the internet and go play Luigi? Is the world that dead already?
>>
post your 2.2 t2v or i2v workflow my niggas.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.