[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106523197

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://rentry.org/wan22ldgguide
https://github.com/Wan-Video
https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2122326
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
>>
File: ComfyUI_00164_.mp4 (1.79 MB, 640x640)
1.79 MB
1.79 MB MP4
>>
let's have sexi sex
>>
File: ComfyUI_00165_.mp4 (3.34 MB, 640x640)
3.34 MB
3.34 MB MP4
>>
File: Chroma_00061_.jpg (395 KB, 1576x1080)
395 KB
395 KB JPG
>>
>>106525822
good choice on the crap edit
>>
>>106525885
catpl0x
>>
>>106525791
If you expose comfy or any app really for that matter to be able to use it remotely, you have to make sure you lock your system down somehow or else it’s basically broadcasting publicly.
>>
>>106525891
>crap
*brap*
>>
Still no lora nunchaku for qwen and no nunchaku for wan.
>>
File: ComfyUI_00166_.mp4 (2.6 MB, 640x640)
2.6 MB
2.6 MB MP4
>>
>>106525880
I was ready to be really impressed if it got the integra rear correct.
>>
>>106525900
>If you expose comfy or any app really for that matter to be able to use it remotely
when/how does that happen?
>>
>>106525911
nice beast gens retard kek
>>
File: Chroma_00065_.jpg (652 KB, 1576x1080)
652 KB
652 KB JPG
>>
File: ComfyUI_00167_.mp4 (591 KB, 640x640)
591 KB
591 KB MP4
fake dashcam
>>
>>106525911
you would have to manually add the --listen flag and also have no firewall, or for some reason manually open the comfy port. As it turns out, some people are that dumb.
>>
>>106525938
it's pure suicide to open a port to a web application like that
>>
File: ComfyUI_00169_.mp4 (1.15 MB, 640x640)
1.15 MB
1.15 MB MP4
>>106525910
Some day.
>>
>>106525938
>you would have to manually add the --listen flag
kek, only retards would do something like that, desu they deserve it if they do it in the first place
>>
>>106525934
>mountain in the background stays the same shape even after it is hidden from sight by the building
This impresses me
>>
File: ComfyUI_00170_.mp4 (2.46 MB, 640x640)
2.46 MB
2.46 MB MP4
>>
>>106525984
I do it so I can gen on my phone while in bed but the comfy box has no outside internet access only lan kek
>>
>he thinks he's safe
>>
>>106525998
you can do that from anywhere, just vpn to your house, never expose this stuff online directly
>>
Blessed thread of frenship
>>
>>106526026
i'll break your fuckin' neck m8
>>
>>106526026
i'll massage your fuckin' neck m8
>>
>>
File: ComfyUI_00171_.mp4 (306 KB, 640x640)
306 KB
306 KB MP4
>>
File: Chroma_00071_.jpg (561 KB, 1080x1496)
561 KB
561 KB JPG
>>
File: ComfyUI_00991_.png (1.54 MB, 1328x1328)
1.54 MB
1.54 MB PNG
>>106525822
Iryna Zarutska qwen lora https://gofile.io/d/6IZUNy

>>106525998
cloudflare is free and faster than gigabit
>>
File: ComfyUI_00172_.mp4 (495 KB, 640x640)
495 KB
495 KB MP4
>>
File: ComfyUI_00173_.mp4 (621 KB, 640x640)
621 KB
621 KB MP4
>>
File: Chroma_00073_.jpg (631 KB, 1080x1496)
631 KB
631 KB JPG
>>
>>106526113
what model?

also, do you have other loras trained anon?
>>
File: ComfyUI_00174_.mp4 (961 KB, 640x640)
961 KB
961 KB MP4
>>
File: ComfyUI_00175_.mp4 (1.34 MB, 640x640)
1.34 MB
1.34 MB MP4
>>
Have they fixed qwen not working with sage yet?
>>
File: ComfyUI_00177_.mp4 (561 KB, 640x640)
561 KB
561 KB MP4
How harsh would the gravity be on a planet this size moving that fast?
>>
File: ComfyUI_00178_.mp4 (770 KB, 640x640)
770 KB
770 KB MP4
>>
File: ComfyUI_00180_.mp4 (566 KB, 640x640)
566 KB
566 KB MP4
>>
Surprisingly, the speed lora didn't brutalize the result outside of the fucking lizardmen on the fresca.
>>
>>106526189
Wouldn't it be less harsh? But assuming that's real time, Jupiter would be yeeting away its atmosphere like instantly.
>>
File: ComfyUI_00181_.mp4 (1.11 MB, 640x640)
1.11 MB
1.11 MB MP4
>>
>>106526218
What model is this? I recognize your gen when you tried some new Chroma-HD pth.
>>
File: ComfyUI_00182_.mp4 (818 KB, 640x640)
818 KB
818 KB MP4
>>
>>106526263
Qwen edit plugged into qwen t2i workflow.
>>
>>106525822
Any flux loras or custom nodes for warping, twisting, distorting and bending?
>>
File: 31.jpg (134 KB, 503x1008)
134 KB
134 KB JPG
what the fuuuuuck
>>
>>106526360
What the fuck is this snake oil node?
>>
>>106526376
Looks more like a prompt lego than snake oil
>>
>>106526102
I think I know which lora you used for this.
Please tell me you prompted that disco ball explicitly...
>>
File: Qwan_00026_.jpg (866 KB, 1984x2976)
866 KB
866 KB JPG
>>106526176
Works on my machine
>>
File: ComfyUI_00992_.png (1.92 MB, 1328x1328)
1.92 MB
1.92 MB PNG
>>106526113
>what model?
https://huggingface.co/Qwen/Qwen-Image
>also, do you have other loras trained anon?
Many, but this is what you get

>>106526176
>Have they fixed qwen not working with sage yet?
Not for me. Also --fast makes it output noise
>>
>>106526434 for >>106526147
>>
>>106526393
please enjoy the esl babble. Disco elysium lora for chroma was used
>art style of disco elysium. absurd surrealist painting of a town. there's on man dressed as a viking and holding a disco ball on his lap. orange houses, glowing building, river, cinematic, painterly, abstract background, strange psychedelic colors, surreal effects are distorting the image in artistic way, deep tones and fun colors, rusty car, brutalism architecture; concrete building. in the background there's a grey stone Statue of a skinny old balding man with hand raised, visible brushstrokes, stylized depiction. Stone road.
>>
>>106526360
Great for randomization instead of having a text wall of { option1 | option2 | ....... | option341 | option342 }.

>>106526376
Its only snake oil if 1girl is your only prompt
>>
>>106526492
>{ option1 | option2 | ....... | option341 | option342 }.
Does this even work outside of clip models? I completely forgot about prompt syntax since dumping sdxl.
>>
>>106524565
are you using the moe k sampler?
>>
>>106525916
You could put this in any art gallery and no one would suspect it's AI, good job anon

Also undoubtably you'd have some pretentious art fag be like 'that wrecked car, such deep symbolism...'
>>
>>106526434
I want a middle-finger phone
>>
>>106525994
There can only be one, with such a huge forehead
>>
>>106526083
Nazi cats are underappreciated
>>
>>106526202
Syd Mead! Nice!
>>
>>106525822
why is sd next in OP but not ani studio
>>
>>106526645
>why is sd next in OP but not ani studio
sdnext: 6.6k github stars
ani: 18 github stars
>>
>>106526565
With lora it's very impressive, learns styles so well. Even very specific brush strokes. I still need to see how/if inpainting works with it.
>>
please consider
>>
>>106526516
I have no clue but I just use ImpactWildcardProcessor from ComfyUI-Impact-Pack for all image models. Great for controlled "randomness"
>>
>>106526565
The hell are you talking about. That's obviously a Disco Elysium lora. This one I reckon:
https://civitai.com/models/1927225/chroma-lora-art-style-of-disco-elysium
>>
File: 56480570.mp4 (3.87 MB, 1120x800)
3.87 MB
3.87 MB MP4
>>
>>106526684
because we're not putting an obscure, unfinished ui in the op just to satisfy some faggot's ego
>>
>Warning: TAESD previews enabled, but could not find models/vae_approx/None
Why do I keep getting this warning when my TAESD previews already work just fine?
>>
File: ComfyUI_00184_.mp4 (811 KB, 640x640)
811 KB
811 KB MP4
>>
>>106526860
I only get that with Radiance, what model are you using?
>>
>>106526889
WAN. I guess it's because VideoHelperSuite already does previews and there's no taesd_decoder.pth/encoder to support wan previews natively?
>>
File: Chroma_00088_.jpg (714 KB, 1080x1496)
714 KB
714 KB JPG
>>
File: ComfyUI_01011_.png (3.34 MB, 1680x1680)
3.34 MB
3.34 MB PNG
>>106526113

>>106526579
>I want a middle-finger phone
It didn't work. I heard Qwen is kinda busted and this is like my 10th gen. Details and text are really sharp though
>>
File: 52574395.mp4 (3.97 MB, 720x720)
3.97 MB
3.97 MB MP4
>>
File: chroma.png (1.76 MB, 896x1152)
1.76 MB
1.76 MB PNG
I'm trying to get newspapers to raise alarm about Chroma, an inherently unsafe model (unlike Flux, Qwen, etc.). I thought about the "it generates CSAM" angle, but I'm not sure if it jibes with the ADL. So still brain storming... Pic? It generates propaganda?
>>
File: Chroma_00090_.jpg (717 KB, 1080x1496)
717 KB
717 KB JPG
>>
>106527053
lil bro is trying so hard
>>
Will I get in trouble for posting images of chibi Arab men throwing acid at chibi trani's avatar face?
>>
>>106526977
RIP, it's fucked up
>>
anistudio will save us!
>>
File: ComfyUI_01018_.png (2.29 MB, 1328x1328)
2.29 MB
2.29 MB PNG
>>106527131
RIP
>>
>>106527138
i dont need saving. i love comfyui it's nearly perfect.
>>
File: ComfyUI_01019_.png (1.94 MB, 1328x1328)
1.94 MB
1.94 MB PNG
>>
>>106527284
Nice composition
>>
>>106527284
>When she realizes that the city is shit. When she has not realized what how to find the brighter side of things.
>>
why vace sucks compared
>>
why isn't it possible to pause a generation and resume it later? for example, you're in the middle of a wan gen which will take 1 hour and want to pause it to go play a game with friends.

you can do it when training loras, so why not gens?
>>
File: 305b2mnbfzuc1[1].jpg (18 KB, 303x326)
18 KB
18 KB JPG
https://voca.ro/1oouzFbS5x6u
>>
File: ComfyUI_01023_.png (1.58 MB, 1328x1328)
1.58 MB
1.58 MB PNG
>>106527287
ty

>>106527302
>Burned in the melting pot

Qwen isn't so bad
>>
>>106527421
I dunno. If my pc goes to sleep in the middle of a wan gen it'll just resume when it wakes like nothing happened.
>>
you're a very bad doggo.
>>
>>106527463
Yeah but I don't want it to sleep. Just pause and resume later. Even better if you can pause, save the current state, close ComfyUI. Then load the previous state and resume where you left off. That would be fucking godsend.
>>
>>106525822
>>106526756
faggot bake
>>106525859
>>106526787
Pedophile bake
>>106526010
>>106526836
literal human garbage
>>106526026
>>106527271
samefag


>MIRROR PROTOCOL IS NOW ACTIVE
>>
>>106527428
kek
>>
disgusting general and threads
>>
>>106527421
You should be able to pause it by clicking pause break while in the comfy cmd window
>>
>>106527468
>the purifying fire.
>>
File: 1748308863994466.png (1.32 MB, 1440x1120)
1.32 MB
1.32 MB PNG
>>
its the only way he can learn
all shitposts are now returned to sender stay in your containment thread schizo

>>106527305
>N****
>>106527328
>ooo-eee-pfffft
>>
File: ComfyUI_01029_.png (2.55 MB, 1328x1328)
2.55 MB
2.55 MB PNG
>>106527570
>KampfyUI
kek
>>
>>106526137
>>106527498
>>
File: wan22_light21_00663.mp4 (769 KB, 480x480)
769 KB
769 KB MP4
>>106527520
I know, I was just implying that it was possible. wait, one hour for a wan gen? VRAMLET
>>
File: wut.png (20 KB, 577x134)
20 KB
20 KB PNG
>>106527616
?
>>
>>106527178
>>106527643
-
>>106526817
>>106527631
>>
>>106525822
>>106527656

GEEE EMM !!
B O T S T A T U S ??? ;3
>>
>CallHomeUI
>>
>>106527667
making threats is in violation of international \ united states law
>>
Well, time to fuck up /n*pt/ i guess if you're asking that much *shrugs*
>>
What the fuck is going on in this thread?
>>
>>106527706
fuck off retard
>>
File: AnimateDiff_00311.mp4 (1.44 MB, 480x480)
1.44 MB
1.44 MB MP4
ANT NOOOOOOOO
>>
i wonder how many times he has been banned kek (hundreds that im aware of)
>>
everytime it happens
all the troll-shitposts across multiple boards are miraculously gone as well

really activates the almonds doesnt it?
really jogs the fuggin nog so to speak
hmmmmmmmmm
>>
>>106526434
peak, breast size
>>
File: brunt.jpg (45 KB, 705x530)
45 KB
45 KB JPG
https://files.catbox.moe/go1pwb.flac
qwenners, your response?
>>
>>106527807
>he likes them neutered\spayed via chemicals in the tap water so they never develop breasts
bisgustin.

>>106527747
>>106527832
*\SDG\
>>
>>106527836
You had every opportunity to say "Make my lobes go boing boing."
>>
>>106527623
>BRAAAAAAAAAAAAAAAAAAAPPPPPPPP
>>
someone really needs to make a penis lora that looks accurate from any angle. im so tired of 20inch dicks that look like worms
>>
>>106527872
https://civitai.com/models/1874153/oral-insertion-wan-22
>>
any latest improvements to image genning besides DMD2?
>>
File: 1739569323921419.jpg (661 KB, 1700x2200)
661 KB
661 KB JPG
>>106527872
BACK IN MY DAY ALL WE HAD WAS COMPUTER-GENERATED WORM DICKS AND WE LIKED IT! WE HAD TO WALK BAREFOOT IN THE SNOW AND PLAYED WITH DIRT!
>>
>>106527836
Begun. The voice wars have.
>>
File: AnimateDiff_00314.mp4 (1.42 MB, 640x480)
1.42 MB
1.42 MB MP4
>>
>>106525822
I'm newish. What are some fun art styles to play around with, aside from "anime" and "realistic pictures".
>>
>>106527906
videos ;3
>>
>>106527906
drawn, think of any favorite artists like shadman for example
>>
>>106527885
i thought this only worked specifically for oral motions, but i guess i can see what happens when i want a male pov penis doing missionary.

>>106527896
i just pretend their tentacles and still fap but i kinda want real dicks now
>>
>https://rentry.org/wan22ldgguide

how outdated is the models and loras in this guide?
>>
File: 03271-3206284668.png (531 KB, 512x640)
531 KB
531 KB PNG
>>106527872
where we're going, we don't need dicks
>>
>>106527927
>rentry
KEK
>>
>>106527858
https://files.catbox.moe/f70m6c.flac
>>
>>106527906
Orientalism
>>
>>106527899
if this doesn't make the collage i will become violently ill\sick to my stomach and lose all faith\hope\goodwill towards all of the rest of humanity and the earth itself............
>>
>>106527836
https://voca.ro/1n0IJ52DQpJT
>>
>>106527906
Non troll response, here are some that I use when I want to get "artsy":
ukiyo-e
miniature (middle east)
pulp
troll globohomo art (Ok not a good style but can be very fun to make shitposts with)
realistic seinen styles
minimalism
art deco
>>
>>106527927
it's up to date but some of the guide is leftovers from the wan 2.1 guide and doesn't apply to 2.2. just use kijai's 2.2 workflow and avoid the all in one because that's pure shit.
>>
>>106527937
>>106527836
I'm actually very surprised the 1.5B model can zero shot clone these voices. It's actually worse if you give it a lot of audio, it starts to repeat it back to you. 10-15 seconds is good.
It's such a small model that I can't just go "oh, M$ trained on DS9". It's a wonder, really.
>>
can i catfish people with AI audio yet?
>>
>>106527941
wow beautiful. I love that you can see the Orion constellation above the girl's head.
Also I love girls with Aquiline noses. I know a girl like that. She was once a true love of mine. Looked a lot like this girl, only somewhat fairer. T-T
>>
>>106527981
As long as it's not real time. Yeah pretty much. Vibe voice most of the time sounds absolutely real.
>>
>>106528006
Shit, I didn't even notice Orion.

You're a man of culture, for sure. Aquiline noses on women are criminally underrated, and fair-skinned dark-haired med/middle-eastern girls are the absolute best.
>>
>>106527962
Lol
>>
>>
>>106528044
Good to meet a fellow man of culture. Love that art style. Really makes me feel nostalgic for a time I never knew.
>>
>>106527912
Ok I know it’s troll but what does it say about me if I actually like shdman’s style, outside of the scat and furry shit
>>
>>106528085
not a troll, i love shadman. his thing is definitely exaggeration of body features
>>
File: AnimateDiff_00315.mp4 (3.98 MB, 720x944)
3.98 MB
3.98 MB MP4
>>106527942
it won't if I bake ;^)
>>
>>106528107
I found another one on danbooru that reminds me of shaddy but a little more “graphic novel” style called afrobull. Western draw tag apparently but the models can do it well
>>
File: LDG.mp4 (742 KB, 720x720)
742 KB
742 KB MP4
>>106528123
RUDE.
>>
>>106527928
I'm not going
>>
File: 1731409122245569.jpg (830 KB, 2304x1792)
830 KB
830 KB JPG
>>106527906
>>106527964
all of the famous old oil painting styles like rococo, baroque, etc are fun. chroma is your best bet for these styles from pure prompting.
>>
>>106528195
you just know
>>
>>106528195
horrid even 2018 stable diffusion is better than this
>>
>have been running the same version of windows for months
>updates are blocked in registry and with firewall
>haven’t pulled updates to ui
>haven’t changed video drivers
>last few days nvidia driver decides it wants to start crashing
What the fuck lads why is windows so fucking abjectly horrible, there is no reason for it to now start crashing the driver. None. Nothing has changed. I hate bill gates so much you couldn’t even comprehend it.
>>
File: 03272-3953784081.png (525 KB, 512x640)
525 KB
525 KB PNG
>>106528191
Back to the Future Morty! Oh the fun we're gonna have Morty!
>>
>>106528210
RETVRN
>>
>>106528232
>nvidia driver starts crashing
>this must be a problem with windows
great deduction
>>
File: AnimateDiff_00316.mp4 (1.81 MB, 608x480)
1.81 MB
1.81 MB MP4
>>
>>106528107
I personally enjoy that noob/illustrious models recognize shadman, elastigirl, and violet well enough that I can basically recreate my own version of his greatest hits
>>
>>106528144
Cute
CUTE!
>>
>>106528289
helicopter head. kek.
>>
>>106528232
>>106528245
Drivers don't suddenly decide to start crashing for shits and giggles retard. It wouldn't be working stable before if the driver version was problematic.
Either some other change to his system OP didn't notice or mentioning or his GPU is fucked.
Alternative his CPU/memory isn't stable and sending corrupt data to his GPU, causing crashes. (Believe it or not, this is not unbelievably rare)
Is your CPU intel by any chance OP? Run some y-cruncher to see if system itself is stable first.
>>
>>106528323
It’s an ayyyymd and the 5070ti I just bought like two months ago and has been used pretty well daily. I mean you could be right that things are going bad but god I hope not for new parts that would be very gay. Nothing else has changed like I said I’ve got auto updates for everything blocked off and I haven’t installed anything. Oh dear time to stress test I guess.
>>
>>106528239
>Oh the fun we're gonna have Morty!
Without dicks...

X for doubt
>>
>>106528302
>>106528144
he became the board, many such cases
>>
>>
File: cm6zpufv02m21.jpg (480 KB, 3478x2526)
480 KB
480 KB JPG
Shouldn't voice cloning be a bigger deal in modding? Like you can voice your mod now! With the real voices! Is there some luddite regime in the "modding community" or something?
https://files.catbox.moe/yy3v4m.flac
>>
>>106528241
Brings you back
>>
>>106528290
no way, really?
>>
>>106528365
Basically any project can be voiced now assuming you find the right voice to work with.
>>
>>106528378
It doesn’t work great since they have so few works on danbooru but it’s enough for gacha. And I know your question was disingenuous but I answered earnestly anyway.
>>
>>106528404
nah i have autism, it was genuine
>>
>>
>>106528338
Yeah you gotta see if the CPU and memory config is stable first. Sit through a few hours of y-cruncher.
How long have you been on the current driver version? Were you trying anything new when the crash happened?
I have heard that 5000 drivers are shitty but if you have been on the same driver for a good while and wasn't trying anything novel I am afraid it might be a warranty thing.
Alternatively try changing driver version later and see if that changes anything with stability.
>>
So DS9 = Pro chroma and TNG = Anti chroma?
>>
>>106528418
Well if it is anything like that I hope you’re right and it’s just the memory. A ram stick is a cheaper replacement than a card. Nothing new other than my usual 1girl simulator, and same driver as when I first setup the card. Only other thing I think could possibly be is a shitty riser cable but it’s an MSI one so it should be decent. Hurray for hardware gremlins
>>
>>106528398
That makes no sense in the context.
Is it even worth reporting this piece of shit spamming these threads? A 90 IQ janitor looks at this and thinks "looks real to me" and nothing happens.
>>
>>106528450
huh?
>>
I love how Qwen's enhanced adherence makes it more possible to actually bring a specific vision into reality, but sometimes I just want to throw some concepts together and hit the gatcha button.
>>
>disable the lightning loras
>wan starts listening to me again
They need to redo that shit
>>
>>106528478
which lightx2v loras are you using?
>>
>>106528478
>he updated past 2.1 for a meme "speed increase"
>>
>>106528488
The 2.2 I2V one
>>
boop
>>
>>106528503
that's your problem. the 2.2 loras suck fat logs out of my ass
>>
>>106528455
https://files.catbox.moe/upef1g.flac
>>
>>106528076
this reminds me of the style the yumeneta guy is into https://civitai.com/images/94812323
>>
>>106528506
>he (((UPDATED))) aka broke all the nsfw 2.1 lora
>>
>>106528506
So just use the 2.1 one then? On low noise?
>>
>>106528467
>but sometimes I just want to throw some concepts together and hit the gatcha button.
i must be a creativelet because i feel the same way a lot of the time. rarely do i go "oh i have an idea for an image that involves one girl in this part of the image doing something, and then one boy in this other part of the image doing something else" etc
>>
>>106528518
It's the artist 'quasarcake'. It's a very distinctive style.
>>
>>106528533
I've almost always just done wildcards for that reason. I love generating random scenes or NPCs for some theoretical RPG. I never actually use them. It can be fun to try to make something specific but that can also take a very long time and I'm more interested in seeing cool images than I am in actually conceiving of them.
>>
>bumping the troll bake to keep it up until this thread is done so he can spam it as if it's the next thread
>>
>>106528526
I want you to do a comparison between 2.2 & 2.1 and post the results. i promise you'll barely see a difference.
>>
ive rediscovered the old in-prompt editing syntax from auto1111 and it's actually useful for getting a character that the data doesn't really show doing something, to do it by starting your gen with a different character that does do that thing. as an example, the model can make the animal crossing dog in a fighting pose but keeps her chibi, but if i seed it with something like "tifa lockhart" first then it creates the dog more humanoid and more dynamic fighting pose. it's a fun little prompt tool [a:b:x]. and before the sloppa complainer tags me, i know it aint great but it's a neat little thing to remind you all of if you still use 1111webui based apps. not sure if comfy can do something like it.
>>
>>106528526
on both high and low with 3.0 strength on high and 1.0 strength on low as outlined by kijai. that might not necessarily be the best settings but it's worked well enough for me and it's certainly worked better than the 2.2 lightx2v loras which are complete dogshit for some reason
>>
>>106528564
The difference isn't the issue, its that this shit doesn't work (ok it works, but its kinda shit). Why is it 4 steps anyway, like specifically 4, both times?
>>
>>106528586
> Why is it 4 steps anyway, like specifically 4, both times?
you don't have to use 4 steps. I personally use 10 steps at 0.8 strength for both high/low. The quality is excellent but obviously the motion is nuked to shit. It was the same with the 2.1 lora.
>>
File: 1743609482175885.png (1.96 MB, 832x1248)
1.96 MB
1.96 MB PNG
>>
>>106528592
Yeah i know, and you definitely should go over, but they specifically targeted 4 steps. Just want to know why
>>
>>106528576
Are you using the FP16 versions of the 2.2 loras? because they aren't dogshit for me.
>>
>>106528601
because the model was distilled to specifically 4 steps....
>>
File: screenshot.1757383276.jpg (427 KB, 666x809)
427 KB
427 KB JPG
>>106528601
fyi, most technical questions can be answered by AI.
>>
File: chroma_00008_.png (1.56 MB, 896x1152)
1.56 MB
1.56 MB PNG
>organizing my gens into folders so gwenview won't take so long to load
>over 500 gens in the jew folder
>>
>>106528564
this is wrong though
if you are using lora designed for 2.1 and try them on 2.2 you will have fucked up results
>>
>>106528653
not always, and i'm not the anon saying to use the 2.1 lightx lora. I personally never bothered with it myself im just too lazy to test
>>
File: 1609301122243.gif (964 KB, 466x264)
964 KB
964 KB GIF
>Takes less than 30 minutes to train a chroma lora on my 5090
Time to have some fun
>>
>>106527906
Art Nouveau
>>
>>106528646
Frightingly accurate
>>
>>106528668
What will your first one be?
>>
File: ComfyUI_temp_patpl_00001_.png (2.36 MB, 1280x1024)
2.36 MB
2.36 MB PNG
>>
>>106528576

You only need to use the Lora on the low noise model.
>>
>>106528669
>art nouveau
You are now back in early 2023 where everyone memorized the artist name Alphonse mucha
>>
File: 00221-4239959166.jpg (1.04 MB, 2048x2480)
1.04 MB
1.04 MB JPG
>>106528675
Amelie
>>
File: AnimateDiff_00317.mp4 (1.69 MB, 480x848)
1.69 MB
1.69 MB MP4
>>106528653
One would think that but that is simply not the case. The 2.1 lightx2v loras work better for wan 2.2 overall, at least from my experience.
>>106528714
For what it's worth the low noise model is concerned with detailing the image while the high noise model outlines the general motion
>>
File: ComfyUI_00186_.mp4 (1.47 MB, 640x640)
1.47 MB
1.47 MB MP4
>>
>>106528737
This is fucking shit. Way too fast. Half the frames feel skipped. Surely you have a better example?
>>
File: ComfyUI_00189_.mp4 (379 KB, 640x640)
379 KB
379 KB MP4
>>
File: 00020-625905396.png (845 KB, 1216x832)
845 KB
845 KB PNG
>>
>>106528737
Exactly. If you use it on both, the motion gets simplified to 2.1 levels.
>>
File: ComfyUI_00191_.mp4 (923 KB, 640x640)
923 KB
923 KB MP4
>>
chroma knows what a xenomorph is but it's refusing to give me ellen ripley. no one knows my pain.
>>
File: 1739751467528335.jpg (1.07 MB, 1416x2120)
1.07 MB
1.07 MB JPG
>>
>>106528764
Maybe sigourney weaver?
>>
File: 1_00005_.mp4 (336 KB, 640x640)
336 KB
336 KB MP4
>>
>>106528775
awesome
>>
>>106528775
Nice.
>>
>>106528774
total refusal on that one, that's why i was trying ripley instead. she might be in the model somewhere, i just have to get her out. i've gotten random youtubers out before.

what would an autistic kraut who doesn't want to sue tokenize a movie star as?
>>
File: 1726762046147375.jpg (2.06 MB, 2832x2120)
2.06 MB
2.06 MB JPG
>>
>>106528759
What did you make this image with?
>>
File: 1690797139188440.jpg (116 KB, 1200x684)
116 KB
116 KB JPG
>>106528794
that should read doesn't want to be sued. if i prompt xenomorph vs greatest foe it gives me a black haired woman every time. so it knows, it's just being coy.
>>
>>106528799
fluxdev
>>
File: 1_00011_.mp4 (639 KB, 640x640)
639 KB
639 KB MP4
lewd
>>
>>
File: 1_00012_.mp4 (614 KB, 640x640)
614 KB
614 KB MP4
>>106528876
look at the cuppage
>>
>>106528771
>>106528798
I really like this style and theme, reminds me a bit of Jet Set Radio
>>
>>106528876
>>106528895
Lmao looks like those fake ass inserts chicks wear
>>
Julien status?
>>
>>106528729
wow, very epic ran. thanks
>>
File: 1_00006_.mp4 (711 KB, 640x640)
711 KB
711 KB MP4
we went from ai not being able to make hands to getting fat hands correct, what a timeline.
>>
>>106528804
>that should read doesn't want to be sued
Although it is technically legal, it's seems stupid at this point to have celebrities captioned by name and thus reproducable in a model that does NSFW.

Better to leave that for loras.
>>
>>106528914
being more productive than (You)
>>
>>106528914
melting and coping itt
>>
File: 1_00016_.mp4 (1.32 MB, 768x1344)
1.32 MB
1.32 MB MP4
>>106528570
2 minutes to gen my sloppa
>>
>>106528917
we are already obsolete. and still no new local project. they are now focusing on online slop
>>
>>106528950
Kek the mushroom merchant hath returned. It’s pretty cool how it can coherently move my image ngl. I tried doing similar with chatgpt sora and it doesn’t work at all. Surprised local has the advantage over them here anyway
>>
File: 1_00018_.mp4 (3.08 MB, 640x640)
3.08 MB
3.08 MB MP4
>>106528901
>>
slave to gacha
>>
Anyone had any luck with using NAG with chroma?
>>
>>106529001
Good times
>>
File: 1726573016564668.jpg (847 KB, 2000x1496)
847 KB
847 KB JPG
>>106528901
i think my captions can be better but i think it captures the artists style decently
>>
Has comfy fixed the gguf memory leaks yet?

>>106529024
Why whats wrong?
>>
>>106529055
Who is the artist/artists ?
>>
>>106529064
>Has comfy fixed the gguf memory leaks yet?
Works on my machine

>whats wrong?
Doesn't seerm to do much except making the pic brighter. Using the default nag values.
>>
File: 00248-3278292974.jpg (855 KB, 2048x2480)
855 KB
855 KB JPG
>>
File: 00032-4057040956.jpg (227 KB, 1248x1824)
227 KB
227 KB JPG
>>
File: 1_00023_.mp4 (478 KB, 640x640)
478 KB
478 KB MP4
>>106528759
>>
>>106529087
I haven't had any issues lately. Ram properly dumps.
>>
File: 1737382885991317.jpg (314 KB, 1080x1350)
314 KB
314 KB JPG
>>106529069
umisida
picrel is one of the 35 images in the dataset
>>
File: 1748079069379490.jpg (696 KB, 1416x2120)
696 KB
696 KB JPG
could be better at generalization tho
>>
This feels powerful, I can't believe how fucking fast it is to make these
I just need to find a easy way to gather images so much to do
>>
File: kek.jpg (22 KB, 982x154)
22 KB
22 KB JPG
>>106529087
>Works on my machine
Hows video for you? Since 0.3.51 it freezes after a few vid gens, so reverted back a few versions and had 0 problems since. Seems people are still having issues too. As for chromadog, I just prompt and fuck with the CFG until it works, kek. Chroma NAG is pretty neat though
>>
>>106529124
Thanks, cool style, surprised me that it's by a japanese artist since it doesn't really 'feel' japanese in style, more like european to me
>>
>>106529157
It's unfortunate that many sites make it a pain to scape.
>>
File: 00252-1952406014.jpg (888 KB, 2048x2480)
888 KB
888 KB JPG
>>106529157
After high res fix
>>106529171
Yeah that is going to be a problem
>>
>>106529159
Most modern Japanese stuff that isn't animeshit is very European in flavor.
>>
>>106529064
Not gguf but I pulled a few days ago and I had to lower the res of my cnet image because Comfy would occasionally OOM where it previously did not.
>>
>>106529158
I haven't touched video in a while so idk. Are you using the basic gguf nodes or the multigpu?
>>
File: 1753119271250444.jpg (1.44 MB, 2832x2120)
1.44 MB
1.44 MB JPG
>>
File: 1729036407629748.jpg (492 KB, 1388x2832)
492 KB
492 KB JPG
>>106528923
well then i guess it's just a german that wants to be a fag then. i'll keep trying to shoot around it.
>>
>>106529159
>it doesn't really 'feel' japanese in style
i didnt really notice until you pointed it out actually
>>
>>106528107 >>106527912
the most famous artist of our time can now pretend he's just some random ai-using american president on the internet and you won't even know
>>
Since vibe voice is an LLM, why can't we just speed it up using optimizations like EXL2 or something?
>>
File: he pulled.jpg (18 KB, 331x325)
18 KB
18 KB JPG
>>106529187
That sucks. I'm sure they'll cater to the next big model release instead of letting old issues snowball

>>106529205
Just the regular ones
>>
>>106529277
Which model's which quant gguf do you refer to? Also which node? (UNET, CLIP, etc.)
When using shit like wan 2.2 on my 32gb ram system the memory use is fucking insane, I see 20-30 gigs of swap use.
However I haven't noticed any memory leaks (continuously increasing memory use). It's stable around that threshold.
>>
>>106529158
not same anon but gguf wan/chroma works ok for me

i seemed to have some issues with gwen image edit but it's not clear to me if it's a memory leak or something else
>>
which variants of illustrious and noob are considered the "base" versions at this point, at least as far as wide use?

and between noob and illustrious, which is more capable?
>>
>>106529349
I don't know too much about base illust, I haven't really seen too many people using it without finetunes.
For noob you are probably looking for the 1.0 v-pred. It's kinda ass about backgrounds but usable imo. Very good out of the box knowledge of characters, art styles, anatomy, fetishes etc.
>and between noob and illustrious, which is more capable?
Definitely noob.
>>
>>106528775
Time lapse gens are the best
>>106529349
NoobAI vPred 1.0
Illustrious 2.0 (though many use 1.0)
>which is more capable?
Between il2.0 and noobvpred it depends on what you value. noob was trained on e621 while il was not. il (2.0, not 1.0) can output larger images on a single pass than noob. noobs colors are more accurate. the list goes on.
I'm a bigger fan of noob desu.
>>
>>
File: 1659468845159371.jpg (58 KB, 500x500)
58 KB
58 KB JPG
after fucking around for a bit, some actors and actresses are in there, and some are not. curious.
>>
>>106529369
>>106529366
so a noob v1.1 (for example) is likely a regression or someone's mix?
>>
cozy bread
>>
>>106529349
I feel like noob is more of a “base model “ than illustrious, like you really need to prompt it well for good results, but it also takes Loras really well. Illustrious I feel is more tuned towards the generic anime look. Not as bad as the mixes and fine tunes that you see out there, but definitely where noob is like anime under the surface, illustrious brings it up to the surface a little more easily, if that makes sense.
>>
File: x.png (2.14 MB, 832x1488)
2.14 MB
2.14 MB PNG
>>106529349
the originals are the "base" versions. it is not that useful. welcome to the lora jungle - it either works or it doesn't.

what either noob or illustrious have managed to train is impressive but nowhere near perfect so both have their own set of flaws making neither clearly better in general

some newer models are what would be better in general but they are not finetuned to the same degree and their base is sometimes more censored
>>
>>106529422
1.1 is the epsilon version.
Supposedly have betters backgrounds (not tested it myself) but lacks the key advantage of v-pred version: amazing contrast that is much better than any other sdxl model.
>>
how do I preserve colors in ComfyUI? i have this image i made with Qwen that i want to upscale with SDXL but I can't reproduce the same colors
>>
until a modern anime model can generate as many styles as accurately as noob, it will remain supreme
>>
>>106529465
Visually what would a modern model give us over noob anyway? I know there would be advantages in composition and prompt adherence but for outputs and visual aesthetic, I can’t really picture them being much better
>>
File: noob vs illust.png (2.31 MB, 1664x1216)
2.31 MB
2.31 MB PNG
>>106529445
And if you need an example of what I mean by contrast and shit backgrounds.
Same prompt, seed, sampler, etc run on both noob v-pred 1.0 and hassaku v3 (illustrious merge).
>>
>>106529222
model?
>>
>>106529460
if you can simply upscale with some ESRGAN instead

otherwise there are so many methods like controlnet union, alternatives and variant on CFG and so on and afaik none just works for everything but some would probably work for your usage

it's relatively clear that image editing is going to be done on more powerful models of the likes of kontext, gwen or whatever else in the future tho
>>
>>106529482
16ch VAE = better details simpleas
>>
File: Chroma_00056_.jpg (369 KB, 1160x1496)
369 KB
369 KB JPG
>>106529460
>i have this image i made with Qwen that i want to upscale with SDXL but I can't reproduce the same colors
Upscale with controlnet tile + promax when using sdxl if you wanna retain likeness
>>
>>106529263
Yes but you would need to implement the model to get inference to work on those engines and that is an issue.
>>
>>106527521
>>106529323
GM!
>>
>>106529483
I can recognize why you say left's background is worse but I disagree on the basis that how much "worse" it may be is far outshines by how much "better" the rest of the image is. But again, I understand that at a certain point it's a matter of personal preference.
>>
>>106529121
>Gm!!!! ;3
>>
New thread: >>58227122 #
Migrate whenever
>>
>>106529521
>Tsukasa Jun
thumbs up!
>>
Is /a/ right? Has image gen literally gotten worse?

>>>/a/282129552
>It's gotten worse since 2020 lol, that's why everyone just uses finetunes of SD1. 5 before the model had the chance to eat so much of its own shit
>>
>>106528560
>>
Fresh

>>106529560
>>106529560
>>106529560
>>
>>106529549
>everyone just uses finetunes of SD1.
Your hourly vramlet retard who can't use X model FUDing because of his sour grapes because he can't use anything else.
>>
>>106529482
>>106529503
On top of these, other areas of improvement:
Natural language prompting
And with NLP, the ability to do multi subject prompts
And also with NLP, much better results with open ended prompts that rely on model's reasoning and knowledge for expected aesthetics (For example think of what sora does for x but y style requests versus the slot machine of different character and style weights we play until we luck out)
The ability to edit and inpaint near seamlessly, especially with NLP rather than setting up masks (Think of nano banana) Would save so much time and electricity instead of re-rolling seed
Adding on to other guy's vae comment, much better text generation as well.
>>106529537
Well it can generate normal backgrounds at times but it has a very strong tendencies to often fight you for empty or simple bokeh background in certain prompt combinations regardless of what you add to the prompt afterwards.
Like if you want detailed backgrounds you may consider getting a refiner model or other options.
But anyway try and see if it works out for you.
>>
>>106529499
yeah but SDXL is much faster on my 10 GB. and its sufficient for inpainting and upscale, does a fine job in many cases
>>
>>106529579
sure, use it where it works. almost all these other methods have some success.
>>
>>106529483
Right background is more detailed, and yet it is miles more shitty than the left because it can never escape the merge's trademark overfit 'ai' look everyone is tired of. Sometimes less is more.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.