/g/ - Technology


Thread archived.
You cannot reply anymore.




File: collage_1761915745_1.jpg (1.52 MB, 2589x1870)
Spooky Edition

Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107058480

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Neta Yume (Lumina 2)
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd
https://gumgum10.github.io/gumgum.github.io/
https://neta-lumina-style.tz03.xyz/
https://huggingface.co/neta-art/Neta-Lumina

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
TY baker
>>
File: 00008-1625409147.png (189 KB, 640x512)
>>
It's peculiar how wan t2v makes better anatomy in low res low steps videos than t2i with higher res and steps. And in less time.
>>
>>107062315
>>FIBO is moderately good at NSFW
can someone confirm this and add a catbox, this retard hasn't proven his claims
>>
Blessed thread of frenship
>>
>>107062470
Are you a bottom?
>>
total spaghetti death
>>
Do people still use kohya to train loras or is there something better now?
>>
>>107062588
yes for illu
>>
File: 1761913097055066.mp4 (1.89 MB, 830x920)
How can I do this?
I want to make millions streaming video games as a cute girl with big tiddies.
>>
does anyone aside from ai-toolkit use the accuracy recovery lora to enable training a good quality lora even on q3 qwen image for example?
>>
>>107062659
I don't think I could handle thousands of desperate males simping for me, only faggots would enjoy this shit
>>
>>107062659
Maybe try asking this on twitter.
>>
>>107062659
imaging also transforming body and jerking off of futa version of yourself
>>
File: Untitled.png (808 KB, 852x726)
>>
>>107062725
this was the cringiest shitties gen Ive ever seen posted, kys
>>
>>107062659
Is that real time or pre rendered?
>>
>>107062659
>this will kill the bitch/Pokimaine streamer industry
WTF I LOVE AI NOW
>>
>>107062659
how do I wan realtime like the post claims? surely they don't use prerecordings? he said right there in the post it happens in realtime
>>
>>107062896
4xH100 in the next room
>>
>>107062963
even that wouldn't be enough I think
>>
Ran should be online soon. Can't wait for his blogpost rambling about imaginary bots and ****.
>>
>>107062990
>Ran derangement syndrome
>>
File: 886288905.png (1.43 MB, 832x1216)
Is there any generic negative prompt for Chroma? Seems to me every time I prefer the result without any negative prompt.
>>
>>107062990
Why are you so jealous?
>>
>fanboys are attacking the truth poster
>>
File: 1754536561147556.png (352 KB, 640x426)
>>107063028
>calling himself the "truth poster"
>>
>>107062588
I use onetrainer
>>
>>107062990
if there's no bots here who is the one spamming random posts from older threads, and for what purpose exactly
>>
>>107063137
Please read the OP and you'll understand why
>>
>>107063144
Love your obsession with imaginary nemesis. At least I didn't pay $3k to generate 1girl slop and calling it skillful.
>>
>>107063164
Nobody cares leave us alone
>>
File: 001_00_image.png (1.33 MB, 1552x656)
Emu3.5 works out of the box on 4x4090. It's a Transformers framework LLM, so it automatically does model parallelism across all your GPUs.

The version they released is autoregressive only, meaning it has to predict the visual tokens one-by-one instead of diffusing them all at once. So it's quite slow, 12 minutes to generate this image. The model seems to choose the aspect ratio and resolution. There's no config to specify them. I'm running another gen and asking the model to generate 512x512 specifically, let's see if it knows how to do that.
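
To put a rough number on the "one-by-one" cost described above, here is a minimal sketch that counts sequential forward passes. The tokenizer stride of 16 and the 30-step diffusion schedule are illustrative assumptions, not Emu3.5's actual configuration, and the function names are made up for this example.

```python
def autoregressive_passes(width: int, height: int, downscale: int = 16) -> int:
    """One forward pass per visual token; `downscale` is an assumed tokenizer stride."""
    return (width // downscale) * (height // downscale)


def diffusion_passes(steps: int = 30) -> int:
    """One forward pass per sampling step, regardless of how many tokens there are."""
    return steps


ar = autoregressive_passes(1552, 656)  # resolution of the image attached to this post
dm = diffusion_passes()
print(f"autoregressive: {ar} passes, diffusion: {dm} passes, ratio ~{ar // dm}x")
```

This only counts sequential steps. With KV caching a single autoregressive pass is much cheaper than a full diffusion step, so the ratio is not a wall-clock estimate.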
>>
File: 1736231229877701.png (370 KB, 736x407)
>>107063164
>imaginary
pack it up boys, those random posts spam never existed after all, nothing to see here
>>
>>107063170
What do you mean?
>>
What's so bad about ponyv7? I saw people shittalking it but at least sizewise it looks like the intermediary model between XL and fuckhueg new models so it might be the model to succeed XL because you don't need a supercomputer cluster to finetune it.
>>
>>107063185
Cool, let me just whip out my four 4090s. What even is this sorcery?
>>
>>107063204
There's nothing wrong with it, it's just people needlessly shitting on it.
>>
File: 00002-1669949994.png (3.09 MB, 1744x2248)
>>
>>107063185
>out of the box on 4x4090
Yeah I'm sure it was a struggle with 96 fucking gigs of vram
>>
>>107063204
maybe get an eye check bro?
>>
File: 00024-2727793699.png (1.64 MB, 864x1208)
>>
>>107063283
looks okay for a base model, remember what XL looked like when it came out?
>>
File: 3591736750.png (1.26 MB, 832x1216)
>>
>>107063295
Anime diffusion thread is that way ->
>>
>>107063294
yeah XL was 20 years ago tho, there is no justification for a 'BASE' model to look like this, especially since it butchered the artist tagging too.
>>
File: 00045-3141342541.jpg (1.12 MB, 2480x3088)
>>107063204
Pony offers nothing and has the same tag issues as chroma while doing everything worse
>>
trick or treat?
>>
File: 626479426.png (1.15 MB, 768x1344)
>>107063307
>>
>>107063326
Ranjeet is delusional just like his gens are tasteless.
>>
res2s or res3m?
>>
>>107063383
coin flip
>>
File: Untitled.png (1.48 MB, 864x1208)
>>
File: 1101785152.png (1.07 MB, 832x1216)
>>
Ran offers nothing and has the same mental issues as debo while doing everything worse
>>
File: 780215980.png (1.03 MB, 832x1216)
>>
Thinking of shopping for some DDR4 for my 3090 24 GB, how much should I go for to maximize AI value for money? Don't want to break the bank, but 32 feels insufficient.
>>
>>107063426
>32 feels insufficient.
for wan 2.2, it's better to go for 64gb of ram
>>
>>107063426
128 or bust.
>>
>>107063426
bare minimum 2x32gb, good spot: 2x48, 2x64 some kind of future proofing at the price of lower mt/s.
dont fill all your channels, it's diminishing returns/slower
>>
File: 2773822557.png (1.44 MB, 832x1216)
>>107063426
I'm running 32, but I hear 48 is a sweet spot for price/performance.
>>
>>107063426
I'm using a gguf of chroma1 hd flash and comfy is constantly giving me
lora key not loaded: lora_transformer_distilled_guidance_layer_layers_0_linear_1.alpha
>>
would you give candy?
>>
>>107063451
>>107063445
>>107063437
>>107063436
>check prices
>64 goes for over 150 bucks
wtf I thought ram was supposed to be dirt cheap
>>
>>107063479
there was a recent price hike. unknown if it's the sign of a BEAR NAND MARKET or not.
as always, if you need it now, buy it now.
>>
>>107063451
>>107063445
You sure do have some weird technical concepts about ram. Jesus...
>>
>>107063426
I hit 95% RAM usage with 64GB with my 3090 with WAN2.2 sometimes.
>>
>>107063479
It's been increasing, if you are in yuro land you could order via aliexpress it's still 50% cheaper than local or so.
>>
>>107063489
when juggling models/loras you can offload everything to RAM and put it in VRAM only when needed, makes it nice to switch workflow on the fly. also the extra ram can be used to actually do other stuff (like working?????)
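
The offloading pattern described in this post can be sketched as a toy cache: a small number of models live in "VRAM", and everything else stays parked in "RAM" instead of being reloaded from disk. The class and model names below are hypothetical and no real GPU is touched; in a real PyTorch setup the two marked spots are where `model.to("cuda")` and `model.to("cpu")` would go.

```python
from collections import OrderedDict


class ModelCache:
    """Toy LRU cache: at most `vram_slots` models on the 'GPU', the rest parked in 'RAM'."""

    def __init__(self, vram_slots: int = 1):
        self.vram_slots = vram_slots
        self.vram = OrderedDict()  # name -> model, least recently used first
        self.ram = {}              # name -> model, offloaded but still loaded

    def add(self, name, model):
        """Register a freshly loaded model; it starts out offloaded in RAM."""
        self.ram[name] = model

    def use(self, name):
        """Return the model, moving it to VRAM and evicting the oldest one if full."""
        if name in self.vram:
            self.vram.move_to_end(name)
            return self.vram[name]
        model = self.ram.pop(name)  # KeyError if the model was never added
        while len(self.vram) >= self.vram_slots:
            old_name, old_model = self.vram.popitem(last=False)
            self.ram[old_name] = old_model  # offload (real code: move to CPU)
        self.vram[name] = model             # real code: move to GPU
        return model


cache = ModelCache(vram_slots=1)
cache.add("wan_high_noise", object())
cache.add("wan_low_noise", object())
cache.use("wan_high_noise")
cache.use("wan_low_noise")  # evicts wan_high_noise back to RAM
print(list(cache.vram), list(cache.ram))
```

The point of the design is that an evicted model costs a RAM-to-VRAM copy on its next use rather than a full reload from disk, which is why extra system RAM helps when switching workflows.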
>>
>>107063426
I maxed out my 64gb constantly with my 5090. Upgraded to 192gb, filling that up to about 60%
If you want to use a lot of loras, you should go for 2x 48gb minimum.
>>
>>107063511
What the hell are you trying to explain here? You clearly missed the point.
>>
>>107063504
which vendors are trustworthy?
>>
>>107063523
>Upgraded to 192gb, filling that up to about 60%
So it's using 115gb of ram? that is insane, why is Wan 2.2 so hungry for memory??
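
A quick check of the arithmetic, plus the usual rule of thumb that weight size is roughly parameter count times bytes per parameter. The 14B figure is only an illustrative assumption for the loop, and the helper name is made up for this example.

```python
def weights_gb(params_billion: float, bytes_per_param: float) -> float:
    """Approximate size of model weights in GB (1 GB = 1e9 bytes)."""
    return params_billion * bytes_per_param  # (1e9 params * bytes) / 1e9


used_gb = 0.60 * 192  # "about 60%" of 192 GB
print(f"RAM in use: ~{used_gb:.0f} GB")

# Hypothetical example: a 14B-parameter model at 16-bit vs 8-bit precision.
for label, nbytes in [("fp16/bf16", 2), ("fp8", 1)]:
    print(f"14B @ {label}: ~{weights_gb(14, nbytes):.0f} GB")
```

This covers weights only. Text encoders, the VAE, LoRAs, latents for long frame counts, and whatever else is cached in RAM all add on top.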
>>
>>107063531
Read the comment section and go from there. Use your brain. eg if it's too good to be true it's not true
>>
>>107063544
I do a lot of frames and try not to use batching.
And a lot of loras.
>>
>>107063565
>Use your brain
But anon I...
>>
>>
File: 3578691091.png (1.49 MB, 832x1216)
>>107063489
It's straightforward, more=better.
>>
>>107063504
>if you are in yuro land you could order via aliexpress it's still 50% cheaper than local or so.
Thanks, will take a looksie
>>
File: 100617621.png (723 KB, 832x1216)
>>
uhmmm put the candies in the bag, now!
>>
>>107063631
holy... what happened here? ToT
>>
File: 00053-1169833307.jpg (1.29 MB, 2480x3088)
This model is pretty good, lots of potential even in its current state; 60-100 steps is the sweet spot
>>
>>107063686
Fugly gen
>>
>>107063686
which one, Neta?
>>
>>107063706
yup, the more I play around with it the more potential I see
>>
>>107063699
yeah, the almost fusing fingers on the right hand, the saliva being colored/shaded differently, the floating candy, the glowy horns, the teeth... these are the immediate issues, it's not a shitty gen, there's way worse, but yeah
>>
>>107063709
alright, thanks for letting me know what to avoid.
>>
>please reply to me
>>
wanna come to gensokyo, mister?
>>
>>107063718
no, it's ugly you blind fuck. this is why have zero artistic capability because you are only counting fingers
>>
>>107063763
>uhrr ugly
I also don't like the gen overall subject/style/composition, but that's more subjective
>>
>meanwhile in /sdg/
>>
Can someone explain this autism of hating on every new model coming out? When a new model comes out I always hope it takes off and gets finetuned etc and it's fun to try out new models. I genuinely don't understand this behavior. Or is it sour grapes because they can't run it or what
>>
>>107063763
how about you make a better gen then?
>>
>>107063786
It's a small scorned group trying to rage bait, it's the only thing they have left because they got exiled from this thread. There will always be a problem and they will always bitch and complain about something. They wanted to be popular avatarfags and got promptly booted for faggotry
>>
File: 2304695834.png (1020 KB, 832x1216)
>>
I will gen your waifu in a halloween costume of my choice while a smelly old man rapes her with his hands and mouth
>>
ITODDLER BTFO, thats it for today, happy memeween bros!
>>
>>107063789
see
>>107063784
They have a containment thread for a reason
>>
>>107063786
but pony legit SUCKS. neta is fine
>>
>>107063786
>Can someone explain this autism of hating on every new model coming out?
if the model sucks, it sucks, it is actually autism to lie to yourself and pretend that "newest = good"
>>
>>107063846
which model doesn't suck
>>
>>107063831
They need to cut (your) SNAP benefits
>>
>>107063862
noob/illu,
neta,
chroma,
qwen

everything else right now is super cope dungshit
>>
>>107063874
wan is cope but not chroma? come on dog
>>
>>107063886
wan is also good, but I thought we were discussing exclusively image models, while wan can do both (mainly video)
>>
>>
the VSR is pretty good
>>
>>107063784
Who?
>>
File: 00060-686739003.jpg (1.31 MB, 2480x3088)
Some people are asshurt today we have crying and retard pedoshit today.
>>
>>107063927
hmm that sounds gay as fuck.
>>
>>107063943
>pedoshit
why are normieshits like you here? FUCK OFF
>>
>>107063950
Take your fucking meds, better yet I'm sure /sdg/ would love those post more. I already know you won't post there because that's your home thread
>>
File: 985348037.png (1.07 MB, 832x1216)
>>
>>107063958
>home thread
you're retarded, /adt/ and /sdg/ are cesspools of avatarfags/namefags 1upping each other, wish theyd just move to discord to do their gay shit.
>>
>>107063975
You're not fooling anyone tranny
fuck off
>>
Happy Halloween LDG I wonder what last Halloween was like ITT
>>
>>107063991
hmm she got her cheek and arm dirty, cant she pay attention? stupid brat
>>
>>107063686
>UTTER garbage
Let me guess... netayume?
>>
>ranfaggot avatarspams the thread again
many such cases
>>
>Has to ban evade
>has to use the ani cope
>has been doing this same bit for years
We get it you got diddled don't mean you have to be a pest
>>
File: dmmg_0075.png (1.21 MB, 832x1216)
>>107062588
i use ai toolkit
>>
how2train neta lora plox
>>
>107064012
>We
Only a tard refers to himself in 3rd person.
>>
>>107064012
concession accepted. thanks for not dragging out your seethe
>>
>>107063451
model?
>>
File: 1738492999637210.mp4 (2.17 MB, 720x1072)
>>107063657
>>
File: file.png (12 KB, 691x123)
>tfw running out of space
bROS how do you cope about lack of real estate? my storage is 2x4tb nvme 4.0 in raid 0 btw, but I'm kinda regretting this choice (as the abstraction layer disables directstorage), and I have already filled 3 nvme slots on my mobo WHY IS SPACE SUCH A PREMIUM WHY WHY WHY
>>
File: dmmg_0033.png (1.34 MB, 832x1216)
>flux doesn't know what a sugar cube is

>>107064184
this is digital hoarding anon
>>
File: 2060402274.png (1.12 MB, 832x1216)
>>107064158
Chroma
>>
>>107064209
based i kneel
>>
File: 00046-2918778916.png (1.8 MB, 896x1152)
>>
When doing I2V, how much of the image prompt should be included in the video prompt? More, so that the video model has greater context on what it's looking at and things that could come into frame, or less so that it doesn't get distracted from the job of producing motion?
>>
File: really nigga?.png (29 KB, 528x247)
>>107064129
>We
>3rd person
>>
I finally figured out that snakebite2.1 XL model I struggled with some threads ago. Prompt adherence is really good for an XL model, but the output looks kinda slopped.
>>
>>107064229
just focus on describing what's going to happen in the video
>>
>>107063943
Yume is bad bro, get over it, everything made in it has this fake plastic robot feel
>>
>>107064300
Perhaps it's time for you to get back to school.
>>
>>107064385
>he's doubling down instead of finally noticing that "we" is actually a "first person plural"
the greatest sign of retards like him btw
>>
ranfaggot has ruined these threads
>>
anyone tested that new emu3.5 model? i just saw a jewtube video on it
>>
>>107064430
>anyone tested that new emu3.5 model?
it has no gguf and no comfyui so no
>>
>>107064422
What do you mean?
>>
>>107064430
Yeah man, only took 40 minutes to gen a 1MP image on my 4090, using offloading. Best model ever.
>>
>>107064439
>What do you mean?
That perhaps it's time for you to get back to school.
>>
>>107064450
I still don't understand. Why are you so mean all the time?
>>
>>107064448
show the image kek
>>
File: this.png (124 KB, 1825x431)
>>107064458
>I still don't understand.
>>
>>107064477
>twitter and politics out of nowhere
>>
>>107064494
I don't understand
>>
>>107064494
What do you mean?
>>
>>107064477
just stop replying to him, point it out when he's spamming then move on, do not engage
>>
File: 1747646629368787.png (819 KB, 1280x720)
>>107064494
>>
y lilbro so mad
>>
>WE
>>
>>107062659
>How can I do this?
Look Asian so the model doesn't have to do much guesswork what a female version of you looks like.
>>
File: image_00004_.jpg (546 KB, 1336x1768)
>>
>>107062659
This hurts women and simps (aka faggots), not me. A return to the paranoid "there are no women on the internet" days is a good thing for me. The invention of woman streamers was a huge step in the wrong direction for the human species and it's finally being corrected.
>>
>>107064581
model?
>>
File: 1759484824522095.jpg (712 KB, 1416x1888)
>>
File: image_00007_.jpg (684 KB, 1336x1768)
>>107064639
Chroma
>>
>>107064612
pretty based take desu
>>
>>107063784
Yes, and?
>>
hmm where are the scary 1girls?????????????
>>
File: 00079-4168928525.jpg (1.37 MB, 2480x3088)
You got to feel bad for these people
>>
>>107064740
right index finger is fucked, otherwise acceptable gen, better than your usual cgi onsen gens
>>
Gimm vfi or rife vfi?
>>
>107064768
He's very needy today
>>
>>107064796
Film vfi.
>>
>>107064820
spoiler that shit nigga, I got scared out of my boxers!!!! ToT
>>
>>107064820
kill yourself, pedo
>>
File: 00077-4168928524.jpg (1.21 MB, 2480x3088)
>>107064835
He's really desperate today but he will keep ban evading.
>>
>>107064820
just go back to your containment trooncord, mentally ill fattoid
>>
>>107064860
>im an homosexual haglover
we could tell by your cowtits gens
>>
File: image_00011_.jpg (521 KB, 1336x1768)
>>
>>107064874
I can smell you from here
Again you have been seething for hours and ban evading because you feel slighted by the existence of this thread
>>
how do I make chroma gens even more real?
tips?
>>
File: 1738350390873069.png (1.75 MB, 768x1344)
>>
>>107064891
just add a grain filter, that's what passes as KINO ANALOG REALISTIC!!! here
>>
I enjoy when you can feel the butthurt behind anons posts
>>
Why can't we rangeban india already so we can skip all the schizos and pedos.
>>
>tfw chroma fp16 + fp16 encoder runs well on 5060ti 16gb + 32gb of ram
Feels good bros
>>
>>107064933
>we
>>
>>107064898
that's reddit tier advice tho
>>
File: image_00012_.jpg (725 KB, 1336x1768)
>>
File: 1757034018646948.jpg (760 KB, 1416x1888)
>>
>>107065025
Very nice, what did you use?
>>
File: file.png (277 KB, 1127x607)
vroom
>>
https://youtu.be/qw4fDU18RcU?t=420
localsissies, pewdiepie says that AI images are slopped as fuck, how do we respond?
>>
>>107065095
>stop using things I don't like
I imagine people who used typewriters acted the same way when people started typing on computers.
>>
You need to be "aesthetic_1, aesthetic_2, aesthetic_3"maxxing. Very important.

>>107063784
Didn't need the update thanks
>>
>>107065095
He put a period after stop. So technically he is advising on how to use ai right now.
>>
File: 1733231599184409.jpg (718 KB, 1416x1888)
>>107065063
https://civitai.com/models/1790792?modelVersionId=2298660
>>
>>107065117
Since they keep posting here they need to see the mirror, sorry anon
>>
>>107065119
Sweet.
>>
>>107065025
>>107065119
>>107064860
>>107064740
>>107064652
>>107063943
Megaslop
>>
File: image_00017_.jpg (831 KB, 1336x1768)
>>
File: dmmg_0119.png (1.31 MB, 832x1216)
>>107065095
pewdiepie has been shitting up the internet for years, i do not care
>>
>>107065240
wheres your gen?
>>
why is ranfaggot such a needy little attention whore?
>>
>>107065112
>>107065118
>>107065258
he's advocating for local models in that video, you've been bamboozled
>>
File: dmmg_0031.png (1.43 MB, 832x1216)
>>107065279
you have watched 100% more pewdiepie videos than i have. i am not the one who has been bamboozled.
>>
>>107065260
If you look inside /sdg/ you'll understand why
>>
>>107065279
I don't watch reverse clickbait videos from aged out youtubers.
>>
File: IMG_1487.jpg (1.61 MB, 1808x3216)
>>107062482
>>
>>107065350
Go back nigbo and leave koff alone
>>
>>107065350
what does this accomplish rannigger? showing off your boyfriend?
>>
>>107065350
I got jumpscared fu
>>
>>107065350
You're an asshole anon
>>
>>107063243
>Yeah I'm sure it was a struggle with 96 fucking gigs of vram

->
>12 minutes to generate this image
Well it was.
>>
>>107065350
stop Julien!
>>
>>107063383
res3m is faster and looks marginally better
>>
The poet would conjure images in the mind by the careful selection of words; the prompter also makes images from words, real images visible to the eye, but the machine mind for which we choose the words is autistic and retarded. Prompting is autistic and retarded poetry.

And just as in poetry it was not the image in the mind but the satisfaction of well-placed words capable to make such an image which gave its hearer that unique pleasure, so too the unique pleasure of prompting is the prompt and not the image; the image is merely the proof which gives the prompt its savour
>>
>>107065384
still don't know who this is supposed to be
>>
Stop stealing intellectual property and use safe, certified models.
>>
File: ComfyUI_00420_.png (2.94 MB, 1248x1848)
>>
File: dmmg_0159.png (1.41 MB, 832x1216)
>ldg
>replying to camera image
guys
>>
>>107065117
>>107065453
kinosovl
>>
>>107065549
cute
>>
>>107063918
VSR?
>>
>svi implemented into comfy
>kijai workflow

goddamnit, it's going to take forever to set up a native version

https://github.com/vita-epfl/Stable-Video-Infinity/tree/main/comfyui_workflow
>>
File: 1745974565054318.jpg (1.46 MB, 1248x1824)
>>
File: dmmg_0171.png (1.34 MB, 832x1216)
>mfw face detailer detects skull faces
>>
File: ComfyUI_08107_.png (1.57 MB, 1152x1152)
>>
>>107065662
I can smell that room
>>
File: ComfyUI_08124_.png (2.19 MB, 1152x1152)
>>
File: 1753997786818772.png (3.81 MB, 1416x1888)
>>
>>107065662
It's crazy how you can instantly tell it's a chroma gen because it always has this fried melted wax texture over everything. Personally, I fucking hate it and think that even a generic, airbrushed, slopped AI look is better. But that's just like, my opinion man.
>>
>>107065698
It's just leftover diffusion noise. The model is kinda fucked in this regard.
>>
>>
File: ComfyUI_08125_.png (1.8 MB, 1152x1152)
>>107065674
>Fucked up hands

Kek how didn't I notice that, but she's not human anyways so it's fitting.
>>
NIGGA WHAT? >>107065472
>>
>>107065789
License to surf.
>>
>>107065674
keto friendly
>>
Is flow matching implemented in onetrainer already?
>>
>>
File: 00036-2850245809.png (1.67 MB, 864x1208)
>>
File: chroma___0223.png (1.47 MB, 832x1216)
>>107065698
it also doesn't have to look like that
>>
>>107065823
Love skellington
>>
https://github.com/tencent-ailab/SongBloom

new tencent song model dropped
>>
>>107065879
https://github.com/fredconex/ComfyUI-SongBloom
>>
File: ComfyUI_08130_.png (2.07 MB, 1152x1152)
>nogen

Anti-Chroma troll is back at it again.
>>
I guess you're posting kids because of the conditions of your release not allowing you to be near any I guess
>>
>>
>>107065915
her left hand has some serious skill issue
>>
File: 00043-2583006679.png (1.76 MB, 1216x856)
>>
File: 1759967058517909.png (2.37 MB, 1152x1152)
>>
File: dmmg_0185.png (1.6 MB, 832x1216)
>>
File: 00116-565773828.jpg (1.37 MB, 2480x3088)
>>
>>107065915
>oversaturation, the model
>>
stable diffusion and wan have made me truly appreciate the beauty of the human body
>>
File: GE4WhUTaoAAmMDM.jpg (65 KB, 1217x831)
Someone make a poll to see how many people agree that netayume is a bad model
>>
especially after trying chroma!
>>
>>107066090
kek
>>
>stable diffusion and wan have made me truly appreciate the beauty of the human body
i wouldnt like feet as much if it werent for wan

oh and i cant post webms or mp4 in this thread. says corrupted file or unsupported file type. curious
>>
>>107066123
catbox them
>>
>>107066131
it's not about that, it's about the fact that this is a new crackdown i havent seen before
but sure
https://files.catbox.moe/gd5l98.mp4
because this is worth sharing and keeping. what a pretty pink halloween princess
>>
>>107066089
why do you care if others use it
>>
>>107066144
nevermind this was related to me selecting "change hash". site admin continues to be tech retarded nonbinary antis and everything is fine
>>
>>107066145
cause the main netayume poster in here thinks its just one schizo that finds it ugly
>>
>>107066145
He has nothing else going on in life and is a /sdg/ poster who's afraid to post here. It doesn't matter who 90% of them are low skill losers
>>
File: 00022-4123541055.jpg (1.28 MB, 1616x2368)
>>
File: 1.png (130 KB, 405x496)
I found the spookyposters account!!
>>
>>107066231
>looks exactly like that movie
>turns out to be that movie
Well no surprise, but also disappointing. I thought this was all prompted vanilla.
>>
local has no style and relies on loras to cope. you cannot extract ‘style’ from any local base.
>>
>>107066282
Should I take a diarrhea shit in your containment general?
>>
File: 00014-3592924147.png (1.29 MB, 864x1152)
>>107066282
At least some illustrious retrains have pretty good styles, and have very good artist detection, I guess
>>
File: 00019-3728708034.png (2.35 MB, 1248x1824)
>>
File: 00246-1011651664.png (1.13 MB, 864x1152)
>>107066316
>masterpiece, ultra-HD, high detail, best quality, 8k, best quality, ergonomic, depth of field, (anime coloring, anime screencap:1.5), official art. refined detailed, 1beautiful woman, thick lips, saturated, crazy smile, looking at viewer, narrowed eye. high ponytail. large breasts, cleavage. wearing maid uniform and lace headdress. standing looking sweet, holding a spiked bat over her shoulder. deep pink neon background, high contrast, (shoot from side), upper body shot, dynamic composition, dynamic angle. glitch effect,
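
For anyone unsure how the `(text:1.5)` chunks in a prompt like this are read, here is a minimal sketch of a parser for the explicit-weight form only. It is an illustration, not the parser any particular UI actually uses; bare `(text)` groups, which A1111-style UIs conventionally treat as a 1.1x boost, are deliberately left as plain text here, and nested parentheses are not handled.

```python
import re

# Matches "(some text:1.5)"; bare "(text)" and nested parentheses are not handled.
WEIGHTED = re.compile(r"\(([^():]+):([0-9]*\.?[0-9]+)\)")


def parse_weights(prompt: str) -> list[tuple[str, float]]:
    """Split a prompt into (text, weight) chunks; unweighted text gets 1.0."""
    chunks, pos = [], 0
    for m in WEIGHTED.finditer(prompt):
        if m.start() > pos:
            chunks.append((prompt[pos:m.start()], 1.0))
        chunks.append((m.group(1), float(m.group(2))))
        pos = m.end()
    if pos < len(prompt):
        chunks.append((prompt[pos:], 1.0))
    return chunks


example = "best quality, (anime coloring, anime screencap:1.5), official art, (shoot from side)"
for text, weight in parse_weights(example):
    print(f"{weight:.1f}  {text!r}")
```

Note that the weight applies to everything inside the parentheses, so both `anime coloring` and `anime screencap` get 1.5 in the prompt above.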
>>
SDXL wins again.
>>
>>107066344
trvke...
>>
https://www.youtube.com/watch?v=LjU89rZa8HQ
a man can dream
>>
>>107066411
desu I don't see the point of having such a powerful machine, the only big shit we have is hunyuanSlop 3.0 and it's a terrible model
>>
>>107066427
Train your own model or make your own Noob style tune?
>>
>>107066427
oh thought i was in lmg
>>
File: 00025-214057692.png (1.36 MB, 1000x1248)
>>
File: ComfyUI_06730_.png (1.61 MB, 1200x896)
>>
File: QWEN_00031_.png (2.11 MB, 1080x1352)
happy halloween nigs
>>
File: ComfyUI_06617_.png (1.83 MB, 1200x896)
>>
>>107065258
catbox pls sirs
>>
File: ComfyUI_08135_.png (1.67 MB, 1152x1152)
>>107065879
Is this good? How does it compare to previous SongBloom?
>>
File: 1756648598822147.jpg (1.64 MB, 1248x1824)
>>
>>107066634
Damn bruh
>>
>>107066427
you could do crazy batch sizes with it
>>
File: 00035-708266146.png (2.03 MB, 1248x1824)
>>
>>107066634
Cruelty Squad ahh gen
>>
File: ComfyUI_06743_.png (1.76 MB, 1200x896)
>>
>>107066144
cute and funny gen, appreciated
>>
File: 00119-796787808.png (593 KB, 512x640)
>>
File: dmmg_0137.png (1.34 MB, 832x1216)
>>107066635
>https://files.catbox.moe/yi18r7.png
i save a thumbnail with the workflow because i delete my images frequently
>>
>>107065879
>240s dropped
>not in HF
am I blind or?
anyway the DPO model was pretty much MEH, the fact you cant also prompt for genre/vocals gender/mood/style was kinda garbaj
>>
File: ComfyUI_06748_.png (1.71 MB, 1200x896)
>>107066762
>>107066648
>>
>>107066867
ew nigga
>>
>>107066867
moar
>>
File: ComfyUI_06751_.png (1.71 MB, 1200x896)
>>107066913
>>107066871
>>
>>107065571
Tyvm for kind expression of support
>>
File: dmmg_0187.png (1.68 MB, 832x1216)
>>107066922
double ew
>>
>>107063011
i put futa and diaper in the negatives mostly
>>
File: ComfyUI_08147_.png (1.83 MB, 1152x1152)
>>107066861
Yep, our only hope is ACE Step 1.5 after it's gone through SFT, and the model Qwen is working on. Though it's interesting that Tencent made SongBloom, and they also made whatever this is https://github.com/tencent-ailab/SongGeneration
(which, while it also kinda sucks, seems much closer to Suno than SongBloom and looks like they're using a richer dataset).
>>
File: file.png (2.13 MB, 1536x1024)
>>
File: 00120-300276448.png (487 KB, 640x512)
>>
Hopefully the Qwen music model is not slopped and benchmaxed like all of their other models
>>
cozy
>>
>>107066867
Pizza cheese
>>
>>107066993
Did you write the text yourself after the gen? Why does it have no depth or shading? This cant be chroma.
>>
>>107067149
its netayume, text done with QIE2509. should've probably told it to CARVE the text instead of just bloody cursive but MEH, good enough
>>
>>107067160
>netayume
EWWWWWW
>>
File: 00052-1903891252.png (3.37 MB, 2064x1408)
>>
Fresh

>>107067190
>>107067190
>>107067190
>>107067190

Fresh
>>
>>107067182
retard
>>
>>107062675
What if all your viewers all pretend to be girls too?




All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.