/g/ - Technology


Thread archived.
You cannot reply anymore.




File: liquid dung garage.jpg (1.75 MB, 3264x3264)
Discussion of free and open source text-to-image models

Briefly baked bun: >>103122994

The West Knows Best Edition

>Beginner UI
Metastable: https://metastable.studio
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus

>Advanced UI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SD.Next: https://github.com/vladmandic/automatic
InvokeAI: https://github.com/invoke-ai/InvokeAI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://aitracker.art
https://huggingface.co
https://tensor.art/models
https://liblib.art
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>SD3.5L/M
https://huggingface.co/stabilityai/stable-diffusion-3.5-large
https://replicate.com/stability-ai/stable-diffusion-3.5-large
https://huggingface.co/stabilityai/stable-diffusion-3.5-medium
https://huggingface.co/spaces/stabilityai/stable-diffusion-3.5-medium

>Sana
https://github.com/NVlabs/Sana
https://sana-gen.mit.edu

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux
DeDistilled Quants: https://huggingface.co/TheYuriLover/flux-dev-de-distill-GGUF/tree/main

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Related boards
>>>/aco/sdg
>>>/aco/aivg
>>>/b/degen
>>>/c/kdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vt/vtai
>>
File: the.png (1.84 MB, 1056x1520)
reposting since it was end of bread and the only good gen i've done today
>>
Blessed thread of frenship
>>
>>103131582
>2034
>spaghettiui is on its 12th refactor, zero ux improvements, same issues persist
>illy and comfy are still arguing about who stole whos code
>any discussion about wishing things were better is immediately shut down
>yOu dOnT unDeRstAnD iTs eXpEriMenTaL!!!!
>>
>Alexa, where the hot singles in my area at?
>>
>>103132379
Now gen her eating a burger
>>
>>103132390
worst thing is, when i downloaded comfy again recently and tried dragging and dropping some old images from the sd 1.4/1.5 days and a bit later, half of them just didn't work at all, as in, the workflow didn't create itself at all, and none of them auto-added adetailer to the workflow, even when i had manually added it from that other repo...

god forbid i actually used obscure ui/s, workflows, software or models, i would need to wait for 10 years and AGI before any of the old workflows can be recreated
>>
>vanilla comfy
never ever
>>
>>103132365
>>
>>103132390
Feel free to look at what tech was like in 2014. You'll need to Google because you were likely 4 years old at the time.
>>
>>103132448
<3
>>
>>103132337
what model did anon use here, i need to know.. NOW!!
>>
File: 144148_00001_.png (1.55 MB, 1368x760)
>>103132458
>>
If you just started imggen and have only used Flux please refrain from doling out advice thank you
>>
>>103132486
sdxl + my lora.
>>
File: Cog_00009.webm (1.32 MB, 720x480)
>>103128999
Neat
>>
File: liz1.png (2.16 MB, 1056x1520)
it's so hard to prompt a robot that has forms but isn't a fucking android robot with a human head. I want more cartoonish looking robots but flux almost always spergs out...
>>
>>103132603
This one cooked up well!
>>
>>103132558
..like base sdxl?
>>
>>103132616
thanks anon, it's maybe 10% of gens that end up like this. I've tried for a few hours to find the right prompt but i think flux just doesn't like doing non-humanlike robots.
>>
>>103132634
yea.
I'd try pony but I don't know how to make lora out of it.
>>
File: Oregon Coast - 9.png (2.45 MB, 1152x1536)
oregon
>>
File: 9a.jpg (76 KB, 816x1200)
another but I inpainted load of shit
>>
>>103132655
it's the exact same process, use booru tags and you're done
>>
File: file.png (761 KB, 1254x1000)
https://xcancel.com/fabianstelzer/status/1855165428806435207
>>
>>103132597
>>103128999
can someone make hugging face demo? i can't run cogx
>>
>>103132655
that's pretty impressive, i didn't know you could get that far with base sdxl using only a lora. i haven't tried making a pony lora myself but just caption the images with booru tags and you should be good
>>
>>103132681
As amazing as this looks I just want an IP adapter trained specifically for Illustrious
Maybe one more for SD3.5 medium trained on anime.
>>
>>103132739
>IP adapter trained specifically for Illustrious
desu a better one for base XL. it sucks ass compared to the 1.5 ipadapter.
>>
File: tmp40kflvhu.png (1.38 MB, 1280x768)
>>
>>103132597
This shit is the reason that, after years of spending LOTS of spare time modeling and sculpting in Blender, I quit cold turkey when generative AI kicked off.

3d modelling/sculpting is just over, it's going to die faster than concept art.
>>
>Load webpage a billion animated videos start playing, instantly utilizing 30% CPU and 20% GPU
>Search for something
>Application error
>Search again
>Find what I want
>You must be logged in to download this
>Login
>Application error
>Logged out
It's incredible how much a piece of shit Civitai is as a site.
>>
File: rb.png (381 KB, 720x720)
ok here is the music video, i hope this high-effort shitpost keeps you entertained for 86 seconds

https://litter.catbox.moe/n5ev4h.mp4
>>
>>103133110
>>>/reddit/atbge probably
>>
>>103133110
why not AI gen the music too? would have been better than whatever this is
>>
>>103133110
Nice editing but that's possibly the worst song I've ever heard
>>
>>103133110
That music is pure rage fuel. I like animations tho.
>>
>>103132365
Collages are getting worse every thread, you should stop editing them, don't flip them or crop them, it just doesn't look good
>>
>>103133110
the video itself is quite nice, not exactly my thing but i enjoyed it. i did not like the music, it's a perfect example of why i hate zoomer music
>>
>>103133294
>don't crop them
sometimes it's unavoidable, trying to reasonably fit them all in, but I can try and avoid it next time
>>
>>103133329
you should start playing tetris with the collage
>>
>>103133337
already the case desu
>>
>>103133342
it would be funny if you rotated some of the gens to make your own tetris shapes
>>
>>103133110
Composition is good, but that music is not for me at all, out of all of history's musical genres you chose zoomer mumble drone techno, which alienates 95% of people with ears.
>>
bobot ballerina
>>
File: file.png (801 KB, 1024x768)
>>
>>103133110
>https://litter.catbox.moe/n5ev4h.mp4
Music is awful but it may pander to the youtube audience, you should upload it there and maybe start monetizing if you're into that, there is a handful of AI girl channels that are pretty awful (Dalle-3, tensorart tier gens animated with Kling/Minimax) but getting tons of views, normies don't even understand the type of gens people can do with local/newer tools
>>
>>103132603
Looks like Valkyr from Warframe. Was that intentional?
>>
>>
>>103133451
real image?
>>
>>103133558
SD3 Medium Finetune
>>
File: file.png (782 KB, 1024x768)
>>103133587
>>103133558
Meant to attach
>Heidi Klum | Heidi klum, Heidi klum victoria secret, Swimsuit models a photograph of a young woman posing on a beach. She is wearing a black bikini with thin straps that wrap around her body in a criss-cross pattern. The bikini top is strapless and the bottom is low-cut, accentuating her curves. The woman has long blonde hair that is styled in loose waves and is looking directly at the camera with a sultry expression. The background is a beautiful blue sky with white clouds and the ocean can be seen in the distance. --n watermark, stock image --w 1024 --h 768 --l 5.5 --s 30
>>
File: Mochi_00026.webm (1.24 MB, 848x480)
MochiHD waiting room
Mochi img2video waiting room
Cog1.5 waiting room
>>
>>103133617
https://github.com/kijai/ComfyUI-CogVideoXWrapper/tree/1.5_test
>>
text 2 armpit hair video model waiting room
>>
>>103133508
nope, just prompted a "lizard robot" because the head shapes are more creative
>>
>>103132696
Even an 8gb card can run cogx with kijai's nodes in comfy to do lots of offloading.
>>
>>
>>
>>103132890
I can sympathize. I didn't even sculpt for years, I just started to get serious with it too late, months before Stable Diffusion came out. Too bad.
>>
any tips on how to get a good dancing candy raver?
>>
>>103133876
>>103133886
why post basically the same picture
>>
File: 311388113569005577.webm (745 KB, 720x1072)
>>103132669
Damn this almost looks like a figurine, nice stuff

>>103132365
>>103132448
Can the proompter of the blue angel with vines please share a catbox
>>
>>103134040
ups the chance of getting in the collage, also colors
>>
>>103134077
that's not how it works
>>
File deleted.
>>103134085
ok
>>
>>103134097
k
>>
>>103134057
stop posting minimax shit here, it's not local
>>
>>103134117
fair point
>>
>>103134134
no worries
>>
>>103134085
collage maker is more concerned with his art than ranking images.
>>
stinky having another bad day? :(
>>
>>103134157
>art
>>
if the collage is the reason people spam shitty 1girl gens then baker anon could single handedly save this general by simply not picking low effort 1slops
>>
>>103134185
he hasn't picked many of my 1girls. I still post.
>>
>>103134185
1 girl
>>
1girl supremacy
>>
recommendation to baker anon: only put monster girl gens in the collage for the next 10 years
>>
>>103134185
anons would spam 1girlslop regardless, it's in our dna
>>
THREAD THEME: turn your 1girl into a monster girl
>>
>>103134207
>>
>monster 1girl hour
real shit?
>>
>>103134236
put armpit hair into the prompt
>>
File: 003291.png (1.54 MB, 832x1216)
unintentionally
>>
File: 1720057066475.jpg (638 KB, 1024x1024)
>>
>>103133251
>>103133260
>>103133273
>>103133312
>>103133376
>Nice editing
premiere 25 is insanely easy to use and get good shit out of. this was my first time ever using it and the whole thing took only 8 hours to make including watching tutorials
>but that's possibly the worst song I've ever heard
you will almost certainly enjoy the music in the next one more (it'll be AI generated)

>>103133481
>you should upload it to youtube and maybe start monetizing if you're into that
when i have like 3 things worth posting at once then i probably will. i need the ziggers on 2ch to wake up from their vodka comas and tell me how accurate the russian translation is since i did it with o1 and then this would be 1 of the 3
>>
>>103134243
lol
>>
File: ComfeeYouEye_00004_.png (1.39 MB, 1024x1408)
>>
File: liz14.png (1.75 MB, 1480x1184)
really need to train me a lora my prompt is unstable as fuck
only ever trained for pony (with good success) but flux dataset captioning gives me nightmares
>>
File: file.png (825 KB, 1024x768)
>>
>>103134419
Needs Japanese and Chinese translation too. Maybe some of the other major languages, just throw the lyrics at gpt and be done. It was fine for Grimes's "We appreciate power" video.
>>
>>103134097
>[File deleted]
But her nips are covered
>>
File: yippy fang.png (6 KB, 394x89)
>>103134530
>Needs Japanese and Chinese translation too
this is a good idea thank you anon

fun fact: Porsche in Chinese is Bǎoshíjié, Lolita is Luólìtà, not sure what foreskin is but i hope it's yǐpí fáng
>>
>>103134612
sovl
>>
File: 1714856292960498.jpg (195 KB, 1024x1024)
>>
>>103134671
i see a samurai about to attack a guy in a chair with a sword and rifle
>>
>>103134671
I see my math teacher scolding me for not having brought a ruler to geometry class (it broke in my backpack).
>>
File: 00017-4142513449.jpg (1.58 MB, 2304x1792)
>>103134230
el ogro de las americas
>>
>>103134671
i see headless infected pajeet sitting on a toilet communicating with mantis brahmin
>>
>>103134671
album cover
>>
>>103132448
ty for including my celebslop
>>
>>103134829
>art pop, chamber pop, post-minimalism
oh god i can smell the stench of a 3.65 for 3.000ish ratings on rym right there
>>
>>103134057
>Can the proompter of the blue angel with vines please share a catbox
I wish
>>
>>103131539
Nigga that's kawaii. Lora?
>>
File: delux_co_00029_.png (1.61 MB, 1536x1024)
>>
File: tmpq4or_s_y.png (1.2 MB, 1480x768)
>>
>pony realism generates hotter black women than white women with less tokens
what the fuck kind of bias is this
>>
>>103135188
based
>>
File: 003326.png (1.92 MB, 1040x1520)
>>
Question: just learned about PuLID (for SDXL) and decided to try it out, but it doesn't seem to do anything in Forge. Am I doing something wrong or is it not supported on Forge?
>>
File: ComfyUI-1830.png (1.65 MB, 832x1216)
>>103134057
>Damn this almost looks like a figurine, nice stuff
FYI there are wonderful actual figurine models, most notably those made by https://civitai.com/user/MIAOKA/models

You might want to give them a try.
>>
File: 00054-4095550461.jpg (888 KB, 1344x1728)
>>
>>103135480
>>
File: 00061-4095550460.jpg (708 KB, 1344x1728)
>>
>>103135167
Very nice
>>
File: file.png (816 KB, 832x1216)
>>103135595
>>
>>103135850
nice
>>
why does ani keep /sdg/ updated with the app and not us?
>>
>>103136274
forget this trash being
>>
>>103136293
no, we may actually get a comfy and automatic gui that runs the cpp repo. I'm sick of these shitty web apps
>>
>>103136322
why are web apps so popular in the AI space anyway? i don't have a server to run these with.
>>
File: 003354.png (2.4 MB, 1040x1520)
https://litter.catbox.moe/3u561a.jpeg
>>
>>103136364
cloud shit and all the corpos are spending billions on data centers for you to pay them
>>
>>103136413
it just kinda sucks that a lot of this shit is supposed to run as daemons in the background and takes up all my VRAM when it's idling
>>
>>103136426
that's why I have high hopes for ani's app. cpp inference manages memory better or should in theory. also assuming ani knows what he's doing too
>>
>>103136274
has that guy ever delivered anything he posts? I remember how he was going to revolutionize anime with his prompt scheduling animation bullshit and never delivered, he even managed to scam some japanese investors, the only thing he's kind of good at is generating degenerate shit
>>
>>103136454
yes he has in the past. ani actually tries hard to keep his word every time and his github frequency is approaching comfy numbers
>>
>>103136364
Because they're more device-independent in general, nearly all the GUI frameworks are crappy (particularly if you go cross-platform), AND they are better for IaaS/SaaS or kubernetes/docker cluster users in general. Many corners of AI actually have a bunch of these clusters.
>>
>>103136454
>has that guy ever delivered anything he posts?
nope
he's a lolcow
>>
>>103136426
Taking up all the VRAM when it's idling isn't a consequence of it being a webapp, it's probably just because whatever you're using keeps models or something loaded in VRAM. In a few cases there may be configuration to immediately unload them after every batch... with the obvious downside of having to load it again.
>>
>>103136522
i think comfy doesn't do it unless you get some plugins. ollama is the only local AI thing that i've used that unloads stuff when it idles. still, you're running a bunch of python shit in the background constantly that's also hosting a http server...
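For anyone curious what "unload when idle" amounts to, here is a minimal sketch in plain Python. It is not how ComfyUI or ollama actually implement it; `IdleUnloader`, `loader` and `idle_seconds` are made-up names for illustration, and dropping the reference only frees VRAM if nothing else holds the model and the framework actually releases its cached memory.

```python
import time


class IdleUnloader:
    """Hypothetical wrapper: keeps a model loaded, drops it after an idle period."""

    def __init__(self, loader, idle_seconds=300.0, clock=time.monotonic):
        self._loader = loader              # callable that (re)loads the model
        self._idle_seconds = idle_seconds  # how long to wait before unloading
        self._clock = clock                # injectable so it can be tested
        self._model = None
        self._last_used = None

    @property
    def loaded(self):
        return self._model is not None

    def get(self):
        # Load lazily; this is the reload cost you pay after an unload.
        if self._model is None:
            self._model = self._loader()
        self._last_used = self._clock()
        return self._model

    def maybe_unload(self):
        # Call periodically (timer, between queue items, etc.).
        if self._model is None:
            return False
        if self._clock() - self._last_used >= self._idle_seconds:
            self._model = None  # drop our reference so it can be freed
            return True
        return False
```

The tradeoff is the one already mentioned above: a short idle timeout frees memory sooner, but the next gen has to wait for the model to load again.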
>>
>>103136555
I'm so sick of the comfy bloat ecosystem. 80% of the shit I have to run around and get should just be in comfy
>>
>>103136555
Sure. But this complaint doesn't amount to much given how many generations of shitty GUI frameworks also hogged system RAM and caused maintenance headaches on this, that or all platforms before getting abandoned for another shitty GUI framework that also got abandoned.

And there is virtually nothing that justwerks across platforms and in people's compute cloud other than webapps.

Overall you can expect only more webapps unless you address not just this software stack around AI but the whole GUI framework situation. But I figure for now you better just pick choices that unload as much as they can from VRAM.
>>
>catbox
>no metadata
I see how it is.

>>103136274
Who's that? I'm a newfag.
>>
>>103136631
go back to your containment thread
>>
>>103133110
your generation was a mistake. you have been bricked by the internet, porn, black culture, and loneliness. it's total mindrot. you could create anything, absolutely anything, and this is your output. jesus christ anon
>>
>>103136640
But it's shit...
>>
File: 1715127529734186.jpg (220 KB, 1024x1024)
>>
>>103136631
>Who's that? I'm a newfag
an anon that was here for a very long time and made a lot of contributions for auto and comfy. I think he works for sai rn
>>
>
>>
>>103136745
That's pretty cool. Thanks.
>>
been out of the picture for a while, is Flux still king or is SD3.5L (or some other model I haven't heard about) better now?
mainly interested in realistic styles & accuracy over stylisation

also
>15 minute captcha despite posting here forever without prior issues
TJD
>>
>>
File: 00009-4069572339.png (1.22 MB, 896x1152)
>>103136876
I haven't bothered with SD 3.5 at all but from what I heard it seems to be just a less dogshit version of 3.
Obviously wait until someone who actually used it responds but I haven't seen anything too impressive.
>>
>>103136725
what's his name?
>>
>>103136995
XIRP-101
>>
>>103134651
Thanks, realism models are underrated for sovlposting

>>103135480
Nice!

>>103136876
I think SDXL is still king for realism, but for accuracy probably still Flux. Haven't played with 3.5 too much though so maybe I'm wrong
>>
>>103137025
3.5 is a waste of time.
>>
>c'mon you guys are just being boomers there's no way the song is that bad-- holy jesus christ
russianon i'm sorry but you have no fucking taste
>>
>>103136876
Flux is still king no doubt about it.
just the accuracy for including text in the image puts it far above everyone else.
also imo Flux is less of Gacha shit like the other models where the output really feels like gambling, with Flux I feel like its far better at following prompts.
its also easier to train.
If only Flux just wasnt so goddamn fucking slow.
>>
File: 00014-3443282387.png (1014 KB, 1024x1024)
Back in SD days we were supposed to prefer resolutions that are multiples of 64 for best results.
I know that flux can go nuts in many different resolutions as long as there is less than 2M pixels but does it still help to prefer multiples of 64?
For example, using 1024x1152 instead of 1000x1150?
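If you want to stick to the old rule anyway, snapping a size to the nearest multiple of 64 is trivial. A minimal sketch: the function names are made up, the 2M pixel budget is just the figure mentioned above, and whether multiples of 64 actually help Flux is not something this thread settles.

```python
def snap_to_multiple(value: int, multiple: int = 64) -> int:
    """Round value to the nearest multiple, never going below one multiple."""
    snapped = (value + multiple // 2) // multiple * multiple
    return max(multiple, snapped)


def snap_resolution(width: int, height: int, multiple: int = 64,
                    max_pixels: int = 2_000_000):
    """Return (width, height, within_budget) after snapping both sides."""
    w = snap_to_multiple(width, multiple)
    h = snap_to_multiple(height, multiple)
    return w, h, w * h <= max_pixels


print(snap_resolution(1000, 1150))  # (1024, 1152, True)
```

So 1000x1150 lands on 1024x1152, which is about 1.18M pixels and well under the assumed budget.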
>>
>>103136676
>>103136835
>>103136971
ani gave us some details on what to expect
>>
Release it or shut up
>>
>>103137081
who are you talking to?
>>
>>103137069
I've seen the extensions for comfy made recently (for 3.5) to enforce the 64 rule, but haven't tested myself.
>>
File: ComfyUI_122228_.png (2.09 MB, 1280x1280)
Kolors with photo lora I trained
>>
>>103137170
keep in mind SD 3.5 Large only does 1MP basically, whereas Medium has a span from 0.25 up to 2MP. I personally prefer Medium for this reason
>>
File: 00082-377847871.png (1.96 MB, 896x1152)
>>
>>103137025
>>
>>103137194
>Buttchin
>>
>>103137067
>imo Flux is less of Gacha shit like the other models where the output really feels like gambling
sounds like your guidance is too high
>>
File: IMG_0569.jpg (413 KB, 1484x1788)
I saw this real pic posted on twitter and now I know where those fucked up ragged lookin dresses in my gens come from.
>>
I like generating this grainy art style that makes it look like the image was a scan from an old magazine or something
>>
File: 003406.png (2 MB, 800x1920)
>>
>>103137332
>>
File: 00015-3477225717.png (1.75 MB, 1024x1024)
Let's see if there is any interest in this:
I am just prompting album names to flux and posting first thing it spits out.
Try to guess the album.
>>
>>103137341
>>
>>103137202
>Medium up to 2MP
Now this I have tested and it's false, 1.4MP is the limit. Unless you are using some finetune.
Still, does it work better to immediately gen at the higher resolution compared to 1MP and then upscaling?
>>
File: 00022-230386508.png (1.83 MB, 1024x1024)
>>103137350
This should be guessable with some thought
>>
File: 00094-2549582755.png (2.08 MB, 896x1152)
>>103137332
same, I love that look
>>
>>103137439
cozy
>>
>>103137211
Mermaid and harpy done
>>
File: 00030-2467336251.png (1.45 MB, 1024x1024)
>>103137429
Since no one cared, here is an easy one on my way out.
Welp, it didn't work, have a nice day anons.
>>
>>103137350
>>103137429
>>103137560
sorry anon, i'm no /mu/ head. I can't guess any of these
>>
>>103137429
this one's easy, it's obviously Metallica's famous album A Bunch of Colored Stone Spheres Between Two Castle Walls
>>
>>103137350
Hall of the mountain king
>>
>>103137585
Correct genre but nope obviously.
>>103137573
Lol
>>
File: 003411.jpg (367 KB, 1800x4320)
>>
>>103137526
>>
>>103137689
it gave her Kesha's face
>>
>>103132365
>3 of my images in the collage
feels good man
>>
>>103137689
>>
bigASP v2.0 isn't half bad for nudity, thanks whoever mentioned it last week-ish. Better genitals and naked bodies than the last juggernaut with some random nsfw enhancing loras.
>>
File: ComfyUI_temp_faxbh_00002_.png (3.27 MB, 1280x1600)
>>
File: 00139-3747386391.png (1.8 MB, 896x1152)
>>
File: 00001-3775522569.png (1.82 MB, 1224x1224)
Flux.
SD Forge.
What causes the leftmost edge to be deformed?
Not the first time this happened.
Also this is upscaled at 0.47 denoising and the source image DOES NOT have it. Which makes it extra weird.
>>
File: ComfyUI_temp_lmith_00001_.png (3.45 MB, 1280x1600)
>>
Unfortunately don't think this one is advertiser friendly

https://files.catbox.moe/fxmyhv.png

>>103138034
How do you think it compares to Lustify?

>>103137992
>>103137526
>>103137211
some examples
>>
>>103138178
Tensor core fragmentation is causing a latent washback.
>>
have any of y'all folx been able to get those svd quants working with comfy?
https://huggingface.co/mit-han-lab/svdquant-models
>>
/sdg/ is so much better thread
>>
>>103136986
What SD 3.5 is good at is being trained
>>
>>103138340
the constant arguing over the collage made the good posters leave
>>
0/10
>>
>>103138237
Ehh I might have tried the oldest version but I might have not. Not really a big fan of 3d porn, and tasteful nudity most XL models can do okay. Can't comment, decided to try out bigasp only due to anon's praise and some recently freed space.
>V2 ≥ V4 > V1 > V3
Doesn't make me believe in it. If I have to swap models, not loras, to get better something, that means SDXL as a whole can't hold on to the concepts and must be ditched asap. Or the modelmaker can't make the model.
>>
>>103138365
is it me or have threads on 4chan, across all the boards, gotten slower over the last year?
>>
File: ComfyUI_03496_.png (1.32 MB, 1152x896)
>>
File: 00156-2261631470.png (1.95 MB, 896x1152)
>>
File: ComfyUI_03474_.png (1.59 MB, 1152x896)
>>
File: 00274-3377713573.png (1.52 MB, 1536x1536)
>>
File: 00015-3736771908.png (974 KB, 1632x1152)
>>
File: ComfyUI_03453_.png (1.14 MB, 1152x896)
>>
>>103138393
That's because of moderation.
>>
>>103138487
what, like people getting banned multiple times and ultimately leaving for some other platform?
>>
kino gens
>>
>>103138178
did she get irreversible'd?
>>
>>103138491
Vast majority of posts and threads were made by a small percentage of anons. Moderators also participating in shitposting and trolling. Ban on memes and words. The prolific shitposters gave up.
>>
>>103136576
having no way to gracefully shut it down from the UI is something i really don't like too. what if i like to launch comfy from a desktop entry or something? i could turn it into some PWA or something.
>>
File: RA_NB1_00016_.jpg (1.1 MB, 1920x2808)
>>
>>103138306
So... assuming I am not getting trolled what do I do about it?
12GB card.
>>103138561
She is just a used up hooker now.
>>
>>
>>
File: 00001-3620153572.png (1.94 MB, 896x1152)
>>
File: RA_NB1_00018_.jpg (643 KB, 1920x2808)
>>
man do I really have to spend $800 for a used 3090 in current year
I want to play games too
>>
>>
>>103138769
I would wait for the 5000 series. Nothing too special going on atm
>>
>>
>>103138794
pretty sure they won't sell 24-32gb card unless it's 5090
>>
File: RA_NB1_00021_.jpg (996 KB, 1920x2808)
>>
>be anon
>have a tool with nearly limitless creative potential
>gen and post basically the same picture, so on and so forth
many such cases
>>
>>103138816
Straight prompt?
>>
>>103138870
Gay prompt, they both sport tits, didn't you notice?
>>
>>103138870
That one was just luck. It's a very chaotic prompt and I wasn't even going for multiple figures, I just happened to get something cool.
>>
File: RA_NB1_00023_.jpg (1.11 MB, 1920x2808)
>>
What's the difference between Epsilon-Pred and V-Pred for NoobAI? I finally got Epsilon spitting out some good stuff after messing with sampler and cfg, tried V-Pred and it's not as good, but maybe I just need to change samplers/cfg again
>>
I've become lazy about following different UIs lately. Is A1111 still viable or would it be smart to change to something else? I tried ComfyUI at one point, but it felt like it took more time to play with nodes than to actually do any genning so I switched back.
>>
>>103138887
Is it still your March30Mix? Either way really good.
>>
File: grid-0025.jpg (995 KB, 2688x3456)
>>103138921
Try euler
>>
i still don't know what different samplers actually do. sometimes some of them work, sometimes they don't.
>>
File: ComfyUI_03544_.png (1017 KB, 1152x896)
>>
>>103138928
>would it be smart to change to something else?
Forge. If you liked or got used to a1111, then it's basically a faster upgrade that also allows you to use newer/different models like Flux.
>>
>>103138950
>https://civitai.com/articles/3693/stable-diffusione-a1111-using-xyz-plot-script
The best way to know is to test them all. No universal good settings for every checkpoint.
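"Test them all" boils down to a grid over settings, which is what an XYZ plot does for you. A rough sketch of just the enumeration part: `build_grid` is a made-up helper, the sampler names and values are placeholders, and this does not call any UI's API. You would feed each combination into whatever frontend you use, with a fixed seed so only the settings change.

```python
from itertools import product


def build_grid(samplers, cfg_scales, step_counts):
    """Return every sampler/CFG/steps combination as a list of dicts."""
    return [
        {"sampler": sampler, "cfg": cfg, "steps": steps}
        for sampler, cfg, steps in product(samplers, cfg_scales, step_counts)
    ]


grid = build_grid(
    samplers=["Euler a", "DPM++ 2M Karras", "UniPC"],  # placeholder names
    cfg_scales=[3.5, 5.5, 7.0],
    step_counts=[20, 30],
)
print(len(grid))  # 18 combinations to render with the same seed
```

The grid grows multiplicatively, so it is usually worth sweeping two axes at a time instead of everything at once.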
>>
>>103138950
>sometimes some of them work, sometimes they don't
Since they're kind of like different methods of calculation, some of them yield better results under different circumstances, be they steps or CFG.
Linkrel is a very comprehensive, if a bit dated dive into the matter:
https://stable-diffusion-art.com/samplers/
>>
File: RA_NB1_00026_.jpg (1.4 MB, 1920x2808)
>>103138935
No I mostly moved onto using NoobAI along with a few loras to try to enhance things a bit.
>>
>>103138979
>DPM++ 2M Karras
seems like that's the right one. i always use this. i get weird results with the 3M variants.
>>
File: ComfyUI_122111_.png (3.07 MB, 1920x1088)
>>103137386
it's not false for at least a good number of types of things, this is a raw 3.5 Medium gen for example. SDXL or SD 3.5 Large would lose coherency of composition long before this res
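For reference, the megapixel math for the attached 1920x1088 image, checked against the two limits claimed in this exchange. A minimal sketch; `megapixels` is a made-up helper and the limits are anons' claims, not official figures.

```python
def megapixels(width: int, height: int) -> float:
    """Pixel count expressed in millions of pixels."""
    return width * height / 1_000_000


claimed_limits = {"1.4MP limit": 1.4, "2MP upper bound": 2.0}  # from posts above

mp = megapixels(1920, 1088)
print(f"1920x1088 = {mp:.2f} MP")
for label, limit in claimed_limits.items():
    status = "exceeded" if mp > limit else "within"
    print(f"{label}: {status}")
```

This only says how big the image is, not whether it holds together; coherency at that size still has to be judged by eye.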
>>
>>103139058
3M SDE should be used with the Exponential scheduler
>>
>>103139058
A good choice I'd say. Usually hop between it and good old Euler A myself. Back when I benchmarked samplers, I also grew fond of UniPC.
>>
>>103138034
there'll never be anything better probably, nobody else is gonna do "Pony except it's real photographic porn" at that scale most likely
>>
>>103137215
not really? regardless Kolors doesn't have buttchin inherently even if you want to argue this particular gen does. I'm not going to explain for the 1000th time that Flux looks like it does because it's distilled and not any other reason
>>
File: RA_NB1_00028_.jpg (826 KB, 1920x2808)
>>
>>103139100
very nice, very good
>>
File: ComfyUI_00657_.png (1.41 MB, 1024x1024)
>>103139090
>not really?
>>
File: RA_NB1_00029_.jpg (825 KB, 1920x2808)
>>103139103
Thanks,
>>
is bigasp2 just trained on shitty professional porn? why do all the milfs i generate with it look so bogged?
>>
>>103139080
I've done it on Pixart and the results weren't bad but the model feels creatively limited but it made decent genitals. I'm currently doing a moderate sized finetune on SD 3.5M but I'm mostly just waiting for Sana. You'll see more of these models when 5090s and Titan AI GPUs hit because the problem ultimately is it costs $5000+ in cloud compute to do something like Big Asp.
>>
File: RA_NB1_00030_.jpg (1.12 MB, 1920x2808)
>>
>>103139282
what's the advantage of bigasp2 over flux? Prompting bigasp2 is a massive pain in the ass.

Obviously, flux cannot do convincing genitalia most of the time, so I was thinking of just using bigasp2 to inpaint my flux gens. Also, bigasp2 looks really burned-out regardless of the settings I use
>>
switched to the Q4_K_S gguf quant of flux dev, finally i can run loras without oom, never going back
>>
>>103139486
I've just hopped onto 4 quants in LLMs, looks like they are indeed a sweet spot.
>>
File: 003494.png (1.75 MB, 1040x1520)
>>
>>103138955
cool style
>>
>>103139058
after having tested all different combinations, i only use dpm++ 2s a with exponential scheduler for pony, always gives the best results especially when doing double or triple pass.
>>
>>103139902
Oh I absolutely adored 2s for img2img. Might be the best sampler I've seen for refining details without changing their structure. Bit too slow for base gens though, since quality seemed on par with some of the arguably best. Never tried it in exponential though.
>>
File: gen_00027_.png (1.67 MB, 1120x1440)
>tfw robotpreggers gf is still decades away
grim
>>
Cozy
>>
File: ComfyUI_00029_.png (1.31 MB, 1024x768)
>>
>>103140069
only sexy gen itt
>>
>>103138393
it's not just you. Some boards have really slowed to a crawl compared to what they used to be like.
>>
>>103138393
the new timer filters phoneposters and ban evaders
>>
My guess is oldfags are busy irl, and newfags migrated over to tiktok and discord.
>>
>>103137560
>no one cared
I spent a good 30 seconds looking at each of them. I suspect we just don't listen to the same kind of music. For the last one I could only think of a song (not an album), which would be Heavy Metal Heart by Sky Ferreira.
>>
>>103137350
imma guess In the Wake of Poseidon by King Crimson
>>
File: tmpr43yrfy8.png (2.1 MB, 1480x1248)
>>
Starting from scratch, how do I actually get good at making AI images? I have an RTX 3060 with only 12 GB of VRAM, if that matters
>>
File: ComfyUI_03580_.png (1.2 MB, 1152x896)
>>
File: ComfyUI_03411_.png (1.55 MB, 1152x896)
>>103141091
>how do I actually get good at making AI images
you need a 4090
>>
File: 160830_00001.webm (706 KB, 848x480)
>>103141091
>get good
There's your first imaginary hurdle: there are no "good" images except to a subset of viewers whose aesthetic ideals match yours.
Understanding how the various models interpret your prompt is a good first task for today. Go and ask gpt how this works, it will save you time trawling through 4 years of internet garbage geared to "sloppa" production.
Find some models on civitai for whatever version of SD you're going to use, again ask gpt, tell it your specs and ask for a recommendation for 1.5, SD, Flux, 3.5 and so on.
>>
What model is best for anime?
>>
File: CogVideo1.5_00001.webm (767 KB, 768x768)
Cog 1.5 is pretty nice, I like how flexible it is with resolutions. First frame always seems to be garbage though
>>
>>103141219
you try the i2v on it?
>>
File: CogVideo1.5_00008.webm (395 KB, 768x768)
>>103141452
A little bit, I can't seem to get much from it besides slight camera movement. I've not used it much though, could just be bad seeds
>>
File: Cog_00008.webm (504 KB, 720x480)
>>103141797
Same prompt with the original Cog i2v
>>
>>103141797
>>103141804
https://github.com/kijai/ComfyUI-CogVideoXWrapper/issues/214#issuecomment-2466746960
if you haven't already, try a higher resolution and see if that fixes it.
>>
File: 0.jpg (270 KB, 1024x1024)
>>
>>103140633
>>103140642
Nope but thanks for the input
>>
File: ComfyUI_temp_llezi_00240_.png (3.82 MB, 1536x1536)
initial retraining on noob vpred 0.5 looks good
>>
>>103137350
Emigrant's song?
>>
File: smilejiggle2.webm (2.31 MB, 720x1280)
>>
The problem of art and the artist?
>>
>>103142030
What's the watermark on the lower right you've blanked out?
Definitely a non-local gen.
>>
>>103141927
they went back to 0.5? nice
>>
>>103142087
no they soldiered on, and this is the half-done vpred conversion
>>
File: beagoodboy.png (1.91 MB, 1088x1600)
>>103142064
It's Runway
>>
>>103139484
Use score tags and PAG
>>
>>103142120
Why are you posting non local gens in a thread for local gens?
Are you a retard or just a shit stirring fag?
>>
File: shhh.png (3.17 MB, 1328x1944)
>>103142191
The initial image is local, retard
>>
File: ComfyUI_temp_ipydk_00003_.png (3.9 MB, 1536x1920)
>>
>>103142228
This
>>103142030
Is not local you absolute brain dead junkie
>>
How would I go about merging an Illustrious and a Pony model in ComfyUI?
>>
>>103142370
Civitai should have some guides. I just don't think it will work too well
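For what a basic merge does under the hood, here is a minimal sketch of a weighted-average merge, assuming both checkpoints share the same architecture and key names (as far as I know Illustrious and Pony are both SDXL-based, so that should hold). The toy dicts of float lists stand in for real tensors and `merge_weights` is a made-up name. In ComfyUI itself the built-in ModelMergeSimple node, with its ratio input, should do this kind of blend for you.

```python
def merge_weights(model_a, model_b, ratio=0.5):
    """Blend two weight dicts: ratio * A + (1 - ratio) * B for every key."""
    if model_a.keys() != model_b.keys():
        raise ValueError("key mismatch: the models do not share an architecture")
    merged = {}
    for key in model_a:
        a, b = model_a[key], model_b[key]
        if len(a) != len(b):
            raise ValueError(f"shape mismatch for {key}")
        merged[key] = [ratio * x + (1 - ratio) * y for x, y in zip(a, b)]
    return merged

# Toy stand-ins for two checkpoints (hypothetical keys and values).
illustrious = {"unet.block0.weight": [1.0, 3.0]}
pony = {"unet.block0.weight": [3.0, 5.0]}

print(merge_weights(illustrious, pony, ratio=0.5))
```

A straight average like this keeps the file loadable, but the two models were trained on different tagging schemes, so whether the result is any good is a separate question.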
>>
>>103142370
take your computer and merge it with a bullet
>>
File: catbox_zkywyj.png (1.4 MB, 1344x1728)
>>
>one day ago
>>
>>
File: 808211064.jpg (560 KB, 1728x1344)
>>one day ago
>>
A lot of good gens today. Good going guys.
>>
File: 00002-458083898.jpg (286 KB, 1360x2048)
evenin lads
>>
>>103143040
all in due time, steady she goes
>>
File: 1623104946106.jpg (2.18 MB, 2688x1536)
>>103139060
Oh wait, oh no... I've been attempting to gen 4MP all along. 2x2 is 4! Sorry.
>>
>>103143380
sniff
>>
>>103143544
1.4 is the multiplier for the 1MP dimensions, not the resolution.
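The arithmetic behind that, as a quick sketch: a multiplier applied to each side grows the pixel count by the multiplier squared, so 2x per side is about 4MP while roughly 1.4x per side is about 2MP. The 1024x1024 base below is an assumed example, not something stated in the thread, and the function names are made up.

```python
import math

def scale_resolution(width, height, multiplier):
    """Scale each side by `multiplier`; returns (new_w, new_h, megapixels)."""
    new_w = round(width * multiplier)
    new_h = round(height * multiplier)
    return new_w, new_h, (new_w * new_h) / 1_000_000

def multiplier_for_megapixels(width, height, target_mp):
    """Per-side multiplier needed to land on roughly `target_mp` megapixels."""
    return math.sqrt(target_mp * 1_000_000 / (width * height))

base_w, base_h = 1024, 1024  # assumed ~1MP starting size

for m in (1.4, 2.0):
    w, h, mp = scale_resolution(base_w, base_h, m)
    print(f"x{m}: {w}x{h} = {mp:.2f} MP")

print(f"multiplier for 2 MP: {multiplier_for_megapixels(base_w, base_h, 2):.3f}")
```

In practice you would also snap the sides to whatever multiple the model or UI expects; that step is left out here.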
>>
>>103133110
Don't listen to the hate
>>
>1slop
>>
>>103133110
Well done, soldier
>>
File: 1726213245774521.png (259 KB, 1768x682)
what's the deal with sd3.5 training?
>>
>>103143708
My guess would be either folks ain't interested in it due to it being a lackluster base, or there's no reliable infrastructure to finetune it with.
>>
>>103141219
it's local, right? And what is its context window? My problem with animatediff is the context is too short, like why can't they just figure this shit out? context overlap works but it's very flickery
>>
>>103143708
It's a shit model
>>
>>103141804
try the words shoot and not firing weapon...
>>
>>103142228
not local is not welcome here, we don't want that shit here, it's commercial ffs, fuck all these cunts that want to stop us having nice things. I can go fucking bankrupt.
>>
Rekneaded the dough over at:
>>103143810
>>103143810
>>103143810




All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.