/g/ - Technology



Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107178630

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://rentry.org/wan22ldgguide
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>Neta Yume (Lumina 2)
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd
https://gumgum10.github.io/gumgum.github.io/
https://neta-lumina-style.tz03.xyz/
https://huggingface.co/neta-art/Neta-Lumina

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
File: 00042-1470519578.png (2.87 MB, 1920x1080)
so, that fella managed to split TWO threads in one day huh? interesting power he has.
>>
Blessed thread of frenship
>>
what is going on? why is the schizo acting out like a crazed giga faggot recently? he keeps troll baking other generals too. so sick of the shitty moderation on this board fucking hell
>>
File: Katanas.webm (2.49 MB, 928x1376)
>>
>>107187330
ty 4 bake
>>
this thread is for localchads only
(your mom will die in her sleep tonight if you post api gens)
>>
>>107187437
>thanking yourself in third person again
You are such a loser hahaha
>>
File: yin yang.png (1.07 MB, 1216x1536)
>>107187449
What about local models that I run on API?
>>
>>107187347
Uhh buttbox please?
>>
File: 00049-3387951538.png (2.15 MB, 1920x1080)
>>107187506
sure, though it's mostly just this guy's setup with a different model >>107187207
https://files.catbox.moe/ff26tp.png

i have a feeling if i post any of the other ones i just did i'll get owned with a 3-day..
>>
>>107187466
you used the old collage I made in some demented 5d chess maneuver to have tranistudio in the op and get attention. fuck off
>>
>>107187449
they still dont have a response to this
>>
File: ComfyUI_temp_gpprv_00001_.jpg (415 KB, 1344x1088)
>>107187486
anon...your mom...., i'm sorry for your loss.
>>
File: hog-real.png (1.22 MB, 1024x1024)
>>
>>107187523
>i'll get owned with a 3-day
so long as no nips or vageene the jannies should be merciful. unless you make it really art-ful then you can get away with a lot...
>>
>>107187665
Wtf
>>
>>107187665
rememba da toim
>>
When is Nano Banana 2 expected to release? I have $20 worth of ComfyCredits remaining but I might buy more depending on how good it is
>>
>>107187719
lol faggot >>107187449
>>
File: 1695270990250.png (803 KB, 1152x896)
>>107187523
Thanks, and good point since the roboasses are coming out too juicy.
>>
>>107187330
FUCK YOU ANI
>>
>>107187736
take your /adt/ drama to your containment thread
>>
>>107187751
EVERYONE MUST KNOW ABOUT ANI AND HOW HE'S SCHIZO
>>
>>107187819
dumb retarded nigger, quit this site instantly
>>
>>107187643
beautiful gilf. which model?
>>
>>107187892
chroma base, gradient_estimation/bong_tangent, cfg 3.5, 25 steps.
>>
File: comfyui_0026.mp4 (2.98 MB, 960x960)
>>107187719
>>
saars we can redeem the bobs and vagene
https://civitai.com/models/1648982?modelVersionId=2391828
https://litter.catbox.moe/lmn8a7zazkj2vjss.mp4
>>
File: 20220306_033359.jpg (73 KB, 1170x1153)
alright, i'm not asking anymore.

tell me which video diffusion models i should use on 10gb vram, now!
>>
>>107188099
none of them. 10gb is suffering
>>
>>107188099
Kling, Veo, Sora
>>
>>107188099
>tripfag
>futafag
>posting a random nigga for some reason
k
>>
>>107187999
maybe the lora can be used as a baseline "nude"
>>
there's two ldg's, one with debo and one without. which to use
>>
>>107187999
Actually pretty good lora, I wonder how much time until it gets banned for whatever retarded reason.
>>
>>107188099
>blintrovert
my nigga
>>
>>107187999
how badly does this lora influence the person? any concept lora that changes the appearance of my person lora is garbage
>>
>>107187980
cute
>>
>>107188134
i was using wan 5b a little bit, and sort of had success with a few linux crashes. though it was definitely low quality most of the time and janky.

>>107188139
i have something different in mind for my renders
>>107188210
yo
>>
i love local diffusion
>>
>>107188247
The examples don't seem like they destroy the body type or face.
I am genning to see how good it is, I'm tired of shitty loras changing the faces. Though the face morph stuff tends to happen with blowjob loras.
I don't know why so many bj loras are so bad with that.
>>
>>107188329
>I don't know why so many bj loras are so bad with that.
because the concept the model often learns is "face+dick in mouth", but with not enough face variety so they all converge to a specific face doing the exact same move, changing the face of every image you put into it
sometime it's even "dick sprouts out of mouth" and you get body horror
wish more experienced anons tried training them, oral loras are on the harder side to not mess up
>>
>>107187980
glad he didn't get slapped by a fennec
>>
>>107188247
i dunno try it out
(nsfw)
https://litter.catbox.moe/ahb38tudazy3bdke.mp4
https://litter.catbox.moe/45b6vke63qqlnas2.mp4
https://litter.catbox.moe/1pe4xjodgc4g275b.mp4
>>
File: 1754093615344344.jpg (77 KB, 626x602)
Sorry if this is a retarded question, but why do my Comfy gens never seem to come out as nice as the CivitAi generator's? I get that the seeds and LoRA weights might not translate 1:1, but I swear they must do some tricks in the preprocessing that I'm not following
>>
>>107188541
I can't. i'm away from home + training a lora in the background, so currently can't remote use comfy.
>>
File: 1754759251039087.jpg (217 KB, 1259x866)
i am a very busy man with many loras to train!
>>
>>107188621
Which trainer is this?
>>
>>107188653
https://github.com/ostris/ai-toolkit
>>
>>107188659
I'm curious about how people train LoRAs, not to do it myself or anything, just the process itself. Do you just save a bunch of images of x and feed it into the trainer or something?
>>
File: 1736135566550546.jpg (247 KB, 1168x880)
>yesterday they split the anime thread
>today they split this thread
Why are the hoes so incredibly mad at these threads?
I simply do not understand.
>>
>>107188792
You run bunch of images+caption pairs over and over until the model hopefully picks up what you wanted it to learn.
>>
File: 00123-1635479909.png (3.25 MB, 1536x1536)
>>107188811
your gen got featured in today's /adt/ collage. very cool.
>>
>>107188621
is there a trick to training or am i just stupid? i'm using mostly default settings on a 4090 and 128gb of ram but i just get an OOM
>>
>>107188792
you could have skipped through a million yt tutorials by the time you typed out that
>>
why does qwen image edit refuse to adjust character body proportions? is it stupid?

how can i make a fattie skinny and a skinny fattie
>>
>>107188792
grab a bunch of images. if it's a person you want to train, grab a bunch of varied images of them. caption them (descriptive text). captioning depends on the model: SDXL models use tags, flux/wan/chroma/qwen/etc use natural language. some people use ai models like joy caption/florence/etc to caption images.

concept loras require a bit more involvement. you need reg data or very good training sets so your training images/videos dont override the concept. ie, a blowjob lora should never change the appearance of the subject, or at the very least have minimal impact.

the training dataset is the most important aspect of a lora. everything else is mostly related to training speed. more steps = better quality up to a certain point. higher rank = better quality, but at the expense of flexibility.
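
To put rough numbers on the rank point, here is a minimal sketch (not taken from any trainer's code) of how many trainable parameters a lora adds to one linear layer. It assumes the usual low-rank form, where a frozen d_out x d_in weight gets an update B @ A with B of shape d_out x rank and A of shape rank x d_in. The layer width 3072 and the helper names are illustrative assumptions only.

```python
def lora_param_count(d_in: int, d_out: int, rank: int) -> int:
    """Trainable parameters a rank-`rank` LoRA adds to one d_out x d_in linear layer."""
    # A is (rank x d_in) and B is (d_out x rank); the base weight stays frozen.
    return rank * d_in + d_out * rank


def full_param_count(d_in: int, d_out: int) -> int:
    """Parameters in the frozen base weight, for comparison."""
    return d_in * d_out


if __name__ == "__main__":
    d = 3072  # hypothetical layer width, chosen only for illustration
    for rank in (8, 32, 128):
        added = lora_param_count(d, d, rank)
        share = added / full_param_count(d, d)
        print(f"rank {rank:>3}: {added:>7} params ({share:.1%} of the full layer)")
```

The count grows linearly with rank, so a higher rank gives the lora more capacity to memorize the dataset, which fits the quality vs flexibility tradeoff described above.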
>>
are there any checkpoints that generate male characters well with style loras?
>>
cozy bread
>>
File: sleepingbeauty.png (2.73 MB, 1824x1248)
>>
File: ComfyUI_wan__00001_.mp4 (1.34 MB, 832x480)
>>
>>107189289
Damn nice
>>
>>107189289
giwtwm
>>
File: 00145-9363595.png (2.52 MB, 1536x1536)
>when you try to fuck her silly but end up fucking her to down syndrome
>>
still no way in wan2.2 to reuse the last x frames as the starting frames of a next generation? (and no i don't wanna use vace)
>>
>>107189425
would
>>
Where did Lain go?
>>
>>107188826
Shut the fuck up, thread schizo.
>>
File: media_1762993886.png (1.45 MB, 768x1280)
>>
File: 1737033226252457.jpg (144 KB, 1024x1024)
>>107189525
I had to edit the text in then I realized her hands were fucked up
Now I'm in a gacha loop trying to roll them perfectly
>>
>julien butthurt because of the strawpoll
>>
>>
>>107189619
maybe if you kill yourself you can suck his penis for an eternity in the afterlife <3
>>
>>107189558
Thank you for posting Lain.
>>
>>107189465
Reusing the last frames usually gives pretty bad results, be it wan 2.1 or 2.2, in my experience (of course if you want to prompt travel, then that's the way to go, but I've always found last-frame anything to be jerky and inconsistent, with noticeable changes). I'm just waiting on native/2.2 versions of longcat https://github.com/meituan-longcat/LongCat-Video/ and svi https://github.com/vita-epfl/Stable-Video-Infinity , at the moment these two work best with 2.1 or kj nodes...

The best I've found is the context nodes "context windows (manual)" or "wan context windows (manual)". The only downside is that you need a lot of ram/vram. However, if you have a 16gb card and 32gb of system ram, you can do around 300 - 400 frames, but it will be slow. There's also wan block swap, which allows for even more frames, but it's pretty slow: https://github.com/orssorbit/ComfyUI-wanBlockswap
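
For anyone wondering what a context window schedule looks like, here is a rough sketch of the general idea: split a long frame range into fixed-size windows that overlap, so neighbouring windows share frames. This is not the code of the ComfyUI nodes named above. The function name and the window/overlap values are assumptions for illustration, and the real nodes may schedule and blend windows differently.

```python
def context_windows(total_frames: int, window: int, overlap: int) -> list[tuple[int, int]]:
    """Return (start, end) frame ranges, end exclusive, that cover total_frames."""
    if window <= 0 or not 0 <= overlap < window:
        raise ValueError("need window > 0 and 0 <= overlap < window")
    if total_frames <= window:
        return [(0, total_frames)]
    stride = window - overlap
    windows = []
    start = 0
    while start + window < total_frames:
        windows.append((start, start + window))
        start += stride
    # Pin the final window to the end so every frame is covered.
    windows.append((total_frames - window, total_frames))
    return windows


if __name__ == "__main__":
    # Hypothetical values: 321 frames, 81-frame windows, 16 frames of overlap.
    for start, end in context_windows(321, 81, 16):
        print(start, end)
```

Each window is denoised with only its own frames in view, which is why peak memory depends on the window size rather than the full clip length in this simplified picture.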

Also, I recommend using woct0rdho's radial attention for long gens: https://github.com/woct0rdho/ComfyUI-RadialAttn It gives a speed boost for 10 second gens and beyond. The downside is that you have to use fixed resolutions https://github.com/woct0rdho/ComfyUI-RadialAttn/issues/9 (I use 640 x 480). If you do decide to use radial attention, be sure to use woct0rdho's repos for:

triton: https://github.com/woct0rdho/triton-windows
sage: https://github.com/woct0rdho/SageAttention
sparge: https://github.com/woct0rdho/SpargeAttn
radial: https://github.com/woct0rdho/ComfyUI-RadialAttn

Otherwise, you WILL run into errors. Thanks for reading my blog.
>>
>>107189643
was this a real finetune or just lora merge?

>>107189875
interesting
>>
>>107189875
thanks for the long explanation anon
>>
File: yamato flowers.png (2.64 MB, 936x1368)
i love yamato!
>>
File: ComfyUI_wan__00007_.mp4 (364 KB, 832x480)
>>
>>107189908
It's not a full finetune yet, they released a preview. But they fixed chroma's consistency and anatomy issues, idk how.
>>
>>107190085
could you link where you got this?
also would
>>
localoptimabros... we got competition

https://civitai.com/models/2118407?modelVersionId=2401448

https://files.catbox.moe/6nrey6.mp4
>>
>tfw you dont have enough vram to run wan effectively

Just kill me now tbqhwyf
>>
File: ComfyUI_wan__00009_.mp4 (743 KB, 832x832)
>>
>>107190118
buy ram and give it 20/40 blocks
>>
File: alex jones approval.gif (2.12 MB, 177x210)
>>107190124
bursted out laughing 1 second in, holy fuck
>>
>>107190118
i do okay with 8gb
>>
>>107190134
i have a 4070 super (12gb) and it either takes forever to gen or i get oom, think i did a 5 sec video last night and it took over 35 mins
>>
File: ComfyUI_wan__00011_.mp4 (550 KB, 832x640)
>>
>>107190150
something wrong on your end then, takes 5 minutes for a 832x480 vid for me after the first gen though the latest comfy updates made it slower. your oom is probably from the pinned memory option, try launching with --disable-pinned-memory
>>
>>107190177
this is what i'm using atm, might need to just drop to Q3
>>
>>107190103
Hnnnng seems uber based. Downloading before it gets nuked. Hopefully not too many deformed gens though.
>>
>>107190237
d i s t o r c h 2
>>
>>107190237
no bro ive used q8 t2v and q6 i2v, i can probably use q8 i2v but too lazy to download because i started with q6 when i was new. the initial load is like 25 minutes it sucks but after that the vids gen in 5 minutes at 6 steps
>>
File: ComfyUI_wan__00019_.mp4 (615 KB, 480x832)
>>
Comfy's official fp8 weights for WAN2.2 are not supported under ROCm 6, so the GGUF loaders are indispensable on AMD, at least as far as WAN is concerned.
>>
>>107190335
In another universe where comfy is based the software would issue a warning explaining to you how shit fp8 is and require you to type "yes, I am a retard" before running it.
>>
>>107190262
>>107190263
can yall drop a link to a workflow so i can mess around with it and get it working?
>>
>>107190335
i mean the fp8's are pretty shitty compared to the Q8 so. eh.
>>
>>107190362
https://files.catbox.moe/p1n5b2.json
>>
>>107190376
thanks, appreciate it
>>
File: Gilf.webm (1.63 MB, 928x1376)
TFW no big titty GILF GF
>>
>>107190345
lol
>>
>>107190345
lmao would unironically love this it would've saved me a lot of time early on
>>
>>107190417
that loose paperthin skin, oh my
>>
>>107190162
>>107190272
based
>>
this thread is the reason i realized i wanted to fuck animate inanimates like pancake girls, pancake syrup girls, robo girls, and now fucking porcelain teacup girls.
then again, i always wanted to fuck statues of women so that about adds up.
>>
>>107190417
What's with you faggots and putting crosses in everything?
>>
>>107190562
What's with you faggots being triggered by crosses all the time?
>>
File: 1736922875332308.png (1.23 MB, 896x1152)
>>107190536
filthy statuefucker
>>
File: 1755526787708515.mp4 (3.71 MB, 832x512)
>>107190536
>>
File: 00118-865259477.png (3.15 MB, 1536x1536)
>>107190572
What's with you faggots?

>>107190596
>>107190599
hoollyy awwooobbaaaabooba *eyes bug out like a cartoon character*
>>
>>107190417
fuckin wood
>>
>>107190572
It is not hard to imagine why Christians are annoyed by seeing crosses inserted in porn. You can disagree, but you can hardly act surprised.

It is a little hard to understand why someone might feel a pathological need to insert crosses in his porn gens. Is it a "raised Catholic" thing? It seems like the cross itself gives a certain kind of guy a hard-on. Normally that sort of guy is a homosexual, but I guess not always.
>>
>>107190599
>bejeweled milkers, adversary of cocks!
>>
>>107190376
Does this work if you only have a single GPU setup?
>>
schizo lost everything
>>
>>107190675
yes
>>
>>107190677
yes
>>
>>107190699
yes
>>
File: Video_00152.mp4 (2.12 MB, 544x960)
>>107190608
tit men are literal children
>>
>>107190729
excuse me nigger my sunburnt queen there clearly had (wide hips:1.5) and (huge ass) in the positives. i like both.
>>
>>107190562
IDK it's fitting for a GILF I guess
>>
Is there a list of celebs known by wan 2.2?
>>
>>107190760
I have it here:


hope this helps
>>
>>
>>107190770
It didn't as it is empty.
Would appreciate an actual real useful reply next time.
>>
>>107190737
For a grandmother? No, it's not. Maybe for an abuelita or a nonna or an Irish nana.
>>
File: wan22___0029.png (1.68 MB, 832x1216)
>>107190736
grow up
>>
>>107190770
fucking kek
>>
>>107190729
The "tit man" vs "ass man" dichotomy is xbox vs playstation.. you're both children
>>
>tits
>ass
for me it's hips and thighs
>>
File: 1758215518376805.png (1.08 MB, 1024x1024)
le foot man has arrived
>>
File: flux___8.png (1.19 MB, 832x1216)
>>107190838
>>
>>107190851
only anime ones or perfect realistic ones (aka almost none)
>>
>>107190872
>nipple transmogrification
In some ways everything is still basically SD1.5
>>
I should have known it was only a matter of time before I downloaded foot LoRa
>>
File: 00059-2523556449.png (3.54 MB, 2048x2048)
Hirefuse ningen. Washi no na wa POWAAAA
>>
>>107190944

washino cabriolet
>>
I recently got back into this stuff and came upon the civitai archive. WTF is happening with the deletions? I've noticed there are some good trainers with hundreds and hundreds of loras getting wiped out daily without any back-ups
>>
File: 00002-327207064.png (3.69 MB, 2048x2048)
>>
>>107190991
>without any back-ups
civitarchive keeps snapshots but it might be where people go at all at this point
>>
File: 1731563675792843.png (2.3 MB, 1728x1344)
>>
>>107190103
>>107190244
Just like any "beat NSFW concept into SFW model" lora out there, it is rough.
Some seeds don't work, and cranking the strength high changes faces, so that's out of the question.
Sadly there's some body horror. Not the worst NSFW wan lora in that regard, but it's common.
Ran around 30 gens. Got a few good enough results from them. Thanks for posting.
>>
File: slop.mp4 (1.51 MB, 640x640)
>>107190729
what was the ass lora used for this? JIGGLE? or was it i2v because ive never gotten cellulite that good in t2v nor have i ever gotten that "wide hips with hip dips" type of white girl ass before

>>107190838
>for me it's hips and thighs
nice legs usually imply a nice ass


and of course, face is the most important. you can fix small breasts with impregnation and architecture, you can fix small ass with squats and architecture, you can't fix a butterface
>>
>>107190991
apparently some guy got banned because his body-slider lora "promoted anorexia". yep
>>
will be fun one day going through all the snakeoil papers and analyzing them with ai to see why they didnt amount to anything or if they were onto something good
>>
File: Video_00153.mp4 (1.88 MB, 544x960)
>>107191025
>psxCheeks on civit
i had the same issue so i made my own lora, it's t2v
>>
>>107189643
looking good
>>
>>
>>107189643
would you say it's currently better than HD?
>>
>>107191185
way better. slightly less style flexibility, but it's worth it.
>>
File: 1736576858359836.png (1.1 MB, 896x1152)
>>
File: 1741815191016988.png (2.49 MB, 1120x1440)
>>107191229
>>
>>107191229
>>107191249
based
>>
File: 1731812060961079.png (2.36 MB, 1440x1120)
thanks
>>
>>107190774
nice
>>
>>107191229
>>107191249
>>107191312
I like my /ldg/ with a hint of /mwg/
>>
v2 facefuck lora from a week ago

https://civitai.com/models/2023407?modelVersionId=2383114

https://files.catbox.moe/5ho52d.mp4
>>
>>107191409
/mwg/?
>>
>>107191392
Thanks
>>
File: 1744445930825722.png (2.48 MB, 1800x1800)
>>107191412
They were /pol/ threads where we generated the most offensive memes possible
Sadly the archivist disappeared and it died out due to prompt censorship
but we kept it going through 2024
https://rentry.org/mwarchive
>>
>>107191439
Three of those pixeldrain archive links are dead, do you have them saved?

2023

October
Part 3 - 1.6 GB - 8,500 images
pixeldrain com/u/Kp2Vjb91
Part 4 - 1.7 GB - 9,200 images
pixeldrain com/u/P113xCT7

December
Part 2 - 1.5 GB - 6,200 images
pixeldrain com/u/Cqgfskcs
>>
File: 1754790646201425.jpg (170 KB, 1024x1024)
>>107191496
Sadly I do not have Oct part 4 or Dec part 2 but I will download the rest now, if that's the case
And I'll ask my baker fren if maybe he has those so we can put them back up
Thanks for checking!
>>
File: 1733424924456037.png (150 KB, 1080x1071)
>>107191543
>I'll ask my baker fren
oof nevermind, looks like he got b&
>>
>>107191557
In future mwg tell people to just create a torrent in qbittorrent and port forward their ports, seeding forever instead. Also mirror everything on gofile.io
>>
>>107191614
Sadly /pol/ isn't the same as it used to be, plus Bing killed the ease of generating we had by censoring everything.
Alas, the /ldg/ VRAM gods are all we really have left to carry the torch.
>this is probably why there are inexplicable bread splitters and divisionfag schizos periodically trying to piss on our parade
On the bright side though I do actually have Oct week 4 saved, and there's plenty of kek worthy material in the rest of the archives if any img2vid chads are looking for funny material to work with.
I will keep that in mind though.
>>
File: 2215498653.webm (798 KB, 960x720)
I for one welcome our clanker overlords
>>
File: WAN2.2_00532.mp4 (3.83 MB, 544x960)
What books are we reading today??
>>
>>
>trying to run qwen edit 2509
>use default comfy workflow template for fp8
>comfy crashes when it gets to sampler
>32 GB RAM 24 GB VRAM
wtf
>>
>>107192010
>uhghhh it crashes
are you able to read the error? if you're unable just throw it at chatgpt
if you somehow still cant figure out why it crashes, maybe it's better you stick to api services
>>
>>107191660
>Oct week 4 saved
Upload to gofile and post the link
>>
File: image_00004_.jpg (619 KB, 1264x984)
>>
>>107192060
fuck you
>>
>>
>>107192019
there is no error, probably an oom but I should have enough memory...
>>
File: WAN2.2_00543.mp4 (3.79 MB, 544x960)
>>107192060
>>
File: 1636147928226.png (934 KB, 513x1117)
I'm still so confused why kijai fills the vram up fully while default only fills 60%.
>>
>>107192217
comfy native has better memory management
>>
>>107192107
when it OOMS it tells you, allocation error or someshit like that. check the console
>>
File: sgsdgsdgsfff.jpg (15 KB, 358x109)
Such a chill 33c gpu in the morning. Not for long.
>>
File: lora_00016_.jpg (827 KB, 1264x1656)
>>107192092
Have apple instead

>>107192158
Pretty good
>>
>>107191427
noice
>>
File: lora_00019_.jpg (1020 KB, 1264x1656)
>>
>>
>>107191741
Another anon has accepted the superiority of the 960x720 resolution

>>107191748
>What books are we reading today??
I'm halfway through the Invisible Man and also I'm going through Mona Lisa Overdrive. I like the invisible man a lot it's like proto-sci-fi before electricity which is a very interesting time period and the chapters are relatively short and it's a short book in general so it's a pleasant pick up and put down read. MLO is more "work" for me to pick up but I can enter a flow state with Gibson

I am on track to finish reading 3 books this year, which is the average number of books an American reads apparently
>>
>>107191062
>psxCheeks on civit
>i had the same issue so i made my own lora, it's t2v
I guess brown butts and gold chains will return shortly. Thanks and double thanks for sharing the Lora you made. Shoulders of giants
>>
File: lora_00023_.jpg (1.33 MB, 1264x1656)
>>
>>107191054
>will be fun one day going through all the snakeoil papers and analyzing them with ai to see why they didnt amount to anything or if they were onto something good
If InfinityStar is indicative of a trend (which it may be since it's a ByteDance release), then diffusion is over so that snakeoil might be irrelevant for the autoregressive text to image/video models that are coming

InfinityStar claimed something like 10x faster 5s 720p video generation than with diffusion. I spent today trying to GGUF their smallest image model Infinity-2B but their code is shit and they use flan-t5-xl but reference t5-v1_1-xxl so I wasted a bunch of time on the wrong text encoder

If no one beats me to the punch, I might have a working Q8_0 of InfinityStar and ComfyUI-GGUF support for it too by next week. Strong might
>>
>>
>>
>>107192461
comfy is not the majority shareholder. the grift chink is. anything comfy says about company direction is not in his control
>>
>>107192440
my man post your chroma wf pls
>>
>>107193110
Let's spread a rumor that comfyui is antisemitic. It won't sell :^)
>>
>>107192440
>>107193110
maybe on your civit page

>>107193124
fuckoff
>>
>>107193194
I can barely gen on Q5 :(
>>
File: WAN2.2_00553.mp4 (3.59 MB, 1280x720)
>>107192158
>>
>>107193223
Yes, it's based on official comfy workflow for adding 2 models, but I unfortunately have never been able to merge the 2, which is why I seldom use that workflow in particular. Though merging them properly is probably not too hard by looking at the code.
>>
>>107193198
hey at least they're good gens
chroma doesnt work well for me
>>
>>107193236
i'm finding chroma DC2k to be better than chroma-HD, but that's just me. For training loras, definitely use chroma-HD though.
>>
>>107191660
>Sadly /pol/ isn't the same as it used to be, plus Bing killed the ease of generating we had by censoring everything.
>Alas, the /ldg/ VRAM gods are all we really have left to carry the torch
this just shows /pol/ is full of retards, they are APIkeks who couldn't even gen 1girl locally if they tried
>>
File: 00014-1527940325.png (2.21 MB, 1824x1248)
>>
File: 00020-1225976970.png (2.17 MB, 1248x1824)
>>
>>107193647
I lost interest in private trackers/torrenting community in around 2014ish, so I don't perma-seed anymore.
>>
>>107192252
comfy a year ago had better memory management
>>
>>107193740
I have the weights but im not sharing them. Too many browns itt
>>
>>107192461
>ComfyUI
just throw it in the trash and make your own ui
>>
>>107192217
different offload implementations/settings
>>
File: 1665.jpg (21 KB, 460x460)
Is there a way in Forge to download directly from civitai with all the metadata? It's easy in swarmui but it's giving me a headache here.
>>
>>107193833
woah what a screenshot
>>
I haven't done local image gen since August and that's basically the first time I ever tried it seriously and not treat it as a toy, so I don't really know how diffusion models develop over time. Is there a reason to get newer checkpoints or do people keep making new ones for mostly aesthetic purposes? Like do they have additional functionalities older models can't do (for me specifically older models means from August)?

On that note, what are the latest developments, and things to look forward to?
>>
File: 00036-3452992540.png (2.42 MB, 1824x1248)
>>
light 2 has good clarity but crappy motion
light new has good motion but crappy clarity
what if I used both?
>>
>>107193905
Not much is going on. Civitai -> arrange trained base models by date
>>
File: WAN2.2_00560.mp4 (3.55 MB, 872x592)
>>107193929
>>
>>107193953
reminder: sam will give us the 18+ update in december
>>
>>107193969
i had a very annoying time trying to get 360 orbit to work with autistic boomer prompting. Did you use a lora for your gen.
>>
Has anyone tried the other wan models? How worthwhile is funcamera and wan animate?
>>
>>107193834
Apparently it's just that Forge did an update that broke all the metadata, so I'd stick with swarmUI, but it doesn't allow using ADetailer since pickletensors

God the open source scene is such garbage
>>
>>107194038
Yeah, but this space has become commercialized. I'm noticing inorganic behavioral patterns that regular hobbyists don't exhibit. Don't you find it suspicious when someone posts 6-8 hours daily, every single day, but only posts images of Neta Lumina via filename? And that's their only form of engagement with the community?
>>
>>107194039
Couldn't you pre-calculate different conditioning vectors and then simply mix them together in a kind of wildcard system using Conditioning Concat?
So instead of calculating each sentence individually, you could plan atomic sentence parts, so to speak, and then concat these vectors as you wish, like different hair colors, etc.
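
A minimal sketch of what that could look like, assuming a conditioning is just a tokens x channels array and that concat means joining along the token axis. `ConditioningCache` and `toy_encode` are made-up names for illustration. The toy encoder stands in for a real text encoder and is not any ComfyUI API.

```python
from typing import Callable

Embedding = list[list[float]]  # tokens x channels


class ConditioningCache:
    """Encode each prompt fragment once, then reuse it in any combination."""

    def __init__(self, encode: Callable[[str], Embedding]):
        self._encode = encode
        self._cache: dict[str, Embedding] = {}
        self.encoder_calls = 0

    def get(self, fragment: str) -> Embedding:
        if fragment not in self._cache:
            self._cache[fragment] = self._encode(fragment)
            self.encoder_calls += 1
        return self._cache[fragment]

    def concat(self, fragments: list[str]) -> Embedding:
        # Join the cached fragments along the token axis.
        out: Embedding = []
        for fragment in fragments:
            out.extend(self.get(fragment))
        return out


def toy_encode(text: str) -> Embedding:
    # Hypothetical stand-in for a text encoder: one 2-dim vector per word.
    return [[float(len(word)), float(i)] for i, word in enumerate(text.split())]


if __name__ == "__main__":
    cache = ConditioningCache(toy_encode)
    cache.concat(["red hair", "standing in rain"])
    cache.concat(["blue hair", "standing in rain"])
    print(cache.encoder_calls)  # the shared fragment is only encoded once
```

One caveat with this approach: each fragment is encoded on its own, so the encoder never sees the other fragments as context, and the result will likely differ from encoding the whole sentence in one pass.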
>>
>>107187330
Is there a proper guide for getting set up on win/AMD gpu or is it really just that one halfassed github tutorial that's written by someone who's simultaneously got millennial cringe humor and zoomer brainrot?
>>
>>107194045
I'm not pausing my current gen and creating something for (You) just to appease your mental illness. You're free to think you won all you want. I hope you get the help you need.
>>
>>107194087
it would look attractive with those tits or pillow lips wrapped around your cock
>>
>>107194097
Y-you ummm... what?
>>
>>107194105
I'm legit disgusted some guys find this shit attractive, what the fuck.
>>
>>107194108
About three fiddy.
>>
>>107194115
There's NAG for Chroma, though I don't need negs most of the time so I haven't tried it.
>>
>>107194038
yeah, used a lora. prompting will only get me so far. never got a 360 with prompts
>>
>>107193754
no one is going to use your shitty imgui ui, retard
>>
>>107194120
I think he wanted to do Chroma or Wan 5B next
>>
File: lora_00032_.jpg (1.07 MB, 1264x1656)
>>107193110
Food Photography on civitai
>>
>>107194124
vascularity drops when you elevate the angle of your arm against gravity
>>
>>107194130
literally my cat looking at the fish tank
heh
>>
>>107194081
retard
>>
>the bot is spamming again
MODS PLEASE
>>
>>107194140
I assume it's largely a matter of how Neta Lumina was captioned, and then how NetaYume Lumina was captioned while being trained over base Lumina 2.0. Combined with the better text encoders and it not being distilled, that led to it not "erasing" quite as much of the original Lumina 2.0 realism knowledge.
>>
>>107194145
my question is, how the fuck is he (or multiple people?) able to destroy two threads at once?
>>
>>107194145
I don't think you should be recommending CFG numbers devoid of context. It's very much dependent on prompt and image dimensions. E.g. 3.0 is way too high for my current workflow, in which I'm using 1.9
>>
>>107194153
What scheduler is it meant to be used with? Fails to denoise correctly with DDIM uniform (shows large influence of input image with 1.0 denoising)
>>
Ani is still doing this spam nonsense?

He really can't get over being bullied hey. He really should just take his Ritalin.
>>
>>107194153
its that demented tranistudio retard, just ignore, report and move on
>>
>>107194166
its qwen with the lora to turn drawings into cosplay
>>
>>107194168
maybe as a big breast lover I'm just that unsophisticated but I like women with R-cups, wouldn't you want her to look more like a rare occurance?
>>
ani debo julien comfy

all must be shot in the streets
>>
>>107194175
I know, I'm into AI because I want to make surreal porn.
>>
File: 00006-3674195429.png (2.82 MB, 1080x1920)
>>107194166
or estrogen kek

>>107194175
sounds like a plan, even if cumfart gets caught in the crossfire.
>>
>>107194177
gargle my balls cumstain
>>
Oh shit, the painteri2v node got an update for kijai nodes AND with last frame. Massive.
>>
>>107194183
Enjoy your slomo, reduced prompt adherence and deadened motion then, I guess.
>>
>>107194184
The only "fix" is to disable the lightx2v lora for the high noise phase. 6 steps high noise, 3.5 cfg. 4 steps low noise, lightx2v, 1 cfg. No slomo.
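Those settings written out as a small sketch, in case it helps with wiring two samplers. The Stage class and step_ranges helper are hypothetical names, and treating both phases as slices of one shared 10-step schedule is an assumption on my part (the post only gives per-phase step counts).

```python
from dataclasses import dataclass


@dataclass
class Stage:
    name: str
    steps: int
    cfg: float
    use_speed_lora: bool  # whether the lightx2v lora is loaded for this phase


def step_ranges(stages):
    """Return (name, start_step, end_step) for each stage on one shared schedule."""
    ranges, start = [], 0
    for stage in stages:
        ranges.append((stage.name, start, start + stage.steps))
        start += stage.steps
    return ranges


stages = [
    Stage("high_noise", steps=6, cfg=3.5, use_speed_lora=False),
    Stage("low_noise", steps=4, cfg=1.0, use_speed_lora=True),
]
total_steps = sum(stage.steps for stage in stages)

for name, start, end in step_ranges(stages):
    print(f"{name}: start_at_step={start}, end_at_step={end}, total={total_steps}")
```

The start/end numbers are what you would feed into the two advanced sampler nodes if your workflow splits the schedule that way.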
>>
>>107194185
nope. same shit with the new one
>>
>>107194145
Just keep reporting all the posts for "This post appears to be an automated spambot".

Eventually the /g/ jannies will notice a spam-like pattern and actually realize it's ani being a schizo troll, reposting old posts from the archives.

Also make sure to report this post I'm making. It might help the janitors understand what's going on.
>>
>>107194185
WHERE IS T2V?!! DONT IGNORE THE OGs
>>
>>107194185
Cool. I hope there's decent workflow examples. Wan wrangling isn't fun.
>>
>>107194193
I've tried synthetic-looking, it seems a bit better..
>>
>>107194194
srry i dont have my sets handy but godspeed anon and i look forward to reading your updates along the way
ill have to wrangle aitoolkit to work on my 50 series later and follow in your footsteps
>>
>>107194193
they'll just warn you and maybe ban you
>>
at this rate /sdg/ seems sane
>>
>>107194196
Yeah not captioning means it just does whatever it wants in a way you cannot control at all beyond adjusting strength, I've never understood why people think it's useful to train that way. Also cropping was never something that was necessary in any trainer with proper autobucketing either.
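A minimal sketch of what aspect-ratio bucketing does, to show why cropping isn't needed just to fit a resolution. This is not any particular trainer's implementation; make_buckets, pick_bucket and the default limits (about 1024x1024 area, sides in multiples of 64) are assumptions for illustration.

```python
import math


def make_buckets(max_area=1024 * 1024, step=64, min_side=512, max_side=2048):
    """Build (width, height) buckets whose area stays at or under max_area."""
    buckets = set()
    for w in range(min_side, max_side + 1, step):
        # Largest height (multiple of `step`) that keeps w*h within the area budget.
        h = min(max_area // w // step * step, max_side)
        if h >= min_side:
            buckets.add((w, h))
            buckets.add((h, w))
    return sorted(buckets)


def pick_bucket(width, height, buckets):
    """Pick the bucket whose aspect ratio is closest (in log space) to the image's."""
    target = math.log(width / height)
    return min(buckets, key=lambda b: abs(math.log(b[0] / b[1]) - target))


buckets = make_buckets()
for size in [(1264, 1656), (1920, 1080), (1000, 1000)]:
    print(size, "->", pick_bucket(*size, buckets))
```

Each image gets resized into the nearest bucket and batches are drawn per bucket, so only a small edge trim happens at most. Cropping to control what the lora learns is a separate decision from this.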
>>
>>107194200
0/10
>>
>>107194199
the skin under the fishnet gives me the heebie-jeebies
>>
>>107194200
i've never tried without captions, but i've been curious. also if i want to capture a specific hairstyle, outfit or makeup it's good to be able to refer to them with something. however cropping to include what you want it to learn is obviously super important and you wont convince me otherwise.
>>
how did this clown mf escape containment?
>>
>>107194207
>gen black spiderman
>it doesn't work
woah who could've seen that coming!
>>
>>107194219
i don't even crop or caption my images for lora training anymore
>>
well this bake is ruined. bred is ded
>>
>>107194224
im not knocking your efforts, your pics are nice, but still, the more you look at any chroma pic, the more it falls apart
>>
>>107194199
Not necessarily true. They are aware of this type of shitposting and have had to deal with it before. If you get banned you can just appeal it on the irc (don't bother pm'ing yournamehere though, he's an antagonistic dickhead).

I don't know if fission is still around but if you ever see him in the IRC, he will always be happy to check what's going on.
>>
>>107194226
I agree
>>
>>107187449
Will GPU rental services cuck you from running models with nsfw prompts?
>>
Why are random insta gens better than whatever you guys have
>>
Ani started getting bullied on /adt/ again so he's spamming /ldg/ to cope. He blames /ldg/ for everything wrong in his life.
>>
File: 00051-2508068311.png (2.62 MB, 1248x1824)
>>107194120
link please?
>>
>>107194245
picrel sux btw. just lettin u know that your gen sucks monkey ass
>>
>>107194266
I've been using a little node I made with a multiline string input and a conditioning input that evals the string to poke around in a typical conditioning object. Looks like the structure is a list with one entry, which is a list with a ~2mb tensor and a dict which looks like {"pooled_output": None} lol

So I guess saving and loading an object like this would be pretty trivial
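A rough sketch of that save/load idea, assuming the structure really is what's described above (a list of [tensor, dict] pairs). The function names are made up, and a plain list of floats stands in for the tensor so this runs without torch; torch tensors pickle the same way, or you could swap in torch.save / torch.load.

```python
import pickle


def check_conditioning(cond):
    """True if cond looks like [[tensor, {options}], ...] as described above."""
    if not isinstance(cond, list):
        return False
    return all(
        isinstance(entry, (list, tuple))
        and len(entry) == 2
        and isinstance(entry[1], dict)
        for entry in cond
    )


def save_conditioning(cond, path):
    if not check_conditioning(cond):
        raise ValueError("unexpected conditioning structure")
    with open(path, "wb") as f:
        pickle.dump(cond, f)


def load_conditioning(path):
    # pickle can run arbitrary code on load: only open files you wrote yourself.
    with open(path, "rb") as f:
        cond = pickle.load(f)
    if not check_conditioning(cond):
        raise ValueError("file does not contain a conditioning object")
    return cond


# Stand-in for a real conditioning: a list of floats in place of the tensor.
example = [[[0.1, 0.2, 0.3], {"pooled_output": None}]]
```

Keep in mind a saved conditioning only makes sense with the same text encoder and model family it came from.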
>>
>>
>>107194277
nice toes lmao
>>
>>107194245
>>107194266
>>107194277
Another example
>>
>>107194293
Ask LLMs to write your own, clip output is a tiny tensor.
>>
>>107194282
>>
>>107194334
damn, its good that i cant post the others without getting banned then
>>
File: lora_00043_.jpg (696 KB, 1264x1656)
>>
>>107194357
chroma proves that its good at nothing in particular, maybe understanding prompts and text insertion, but thats it
>>
File: psxntk_00002_.png (2.53 MB, 1792x1152)
>another day, another lora

>>107192440
i see you grindin' broh

>>107192433
you're welcome anon.
>>
>>107194374
my brain had an automatic reaction just seeing the thumbnail
>>
>>10719433
>>
>>107194385
least retarded chroma pic
>>
>>107194374
>another day, another lora
what are you cooking?
>>
File: powershot_s40_1.png (357 KB, 512x512)
>tfw burnt the lora
>>
File: powershot_s40_2.png (1.18 MB, 832x1216)
>>107194431
rebuilt my dataset and training to do this powershot s40 lora but it's overcooked this time
>>
File: 00025-2640786003.png (3.67 MB, 1544x1536)
>>107194432
looks like your average flux gen to be quite honest
>>
>>107194385
>>107194334
>>
File: 00058-1146111525.png (2.29 MB, 1248x1824)
>>
>>107194464
Tch.. bratty little bitch is gonna need some serious correction.
>>
>>107194464
Why is she so serious, I bet she didn't eat her evening raw mince.
>>
File: lora_00050_.jpg (888 KB, 1264x1656)
>>107194441
qwen lora?
>>
File: 00069-3595365989.png (2.94 MB, 1824x1248)
>>107194538
:)
>>
Updates no one cares about on making Infinity-2B work as a GGUF

I had to patch multiple parts of their code, including the part that handles linear layers.

Right now I'm on the part where I make actual image generation work (fixing dequantization logic for layers and weights of the new autoregressive architecture)

It's nice to see a model using t5-xl instead of t5-xxl. Or maybe it's not. We'll see when I actually get inference running on it
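For anyone wondering what the dequantization step involves, here is a minimal pure-Python sketch of Q8_0 as laid out in ggml: blocks of 32 int8 values sharing one fp16 scale. This is not the patched Infinity code, just the block format; dequantize_q8_0 is a made-up name and a real loader would do this vectorized on tensors.

```python
import struct

QK8_0 = 32                                # quantized values per block
BLOCK_FMT = "<e32b"                       # little-endian fp16 scale + 32 int8 quants
BLOCK_SIZE = struct.calcsize(BLOCK_FMT)   # 2 + 32 = 34 bytes


def dequantize_q8_0(raw: bytes):
    """Turn packed Q8_0 blocks back into floats: value = scale * quant."""
    if len(raw) % BLOCK_SIZE:
        raise ValueError("buffer length is not a multiple of the Q8_0 block size")
    out = []
    for offset in range(0, len(raw), BLOCK_SIZE):
        scale, *quants = struct.unpack_from(BLOCK_FMT, raw, offset)
        out.extend(scale * q for q in quants)
    return out


# Hand-built block: scale 0.5, quants -16..15.
block = struct.pack(BLOCK_FMT, 0.5, *range(-16, 16))
values = dequantize_q8_0(block)
print(len(values), values[:4])
```

The layer-specific part (which tensors are quantized and what shape they get reshaped to afterwards) depends on the model, which is presumably where the patching comes in.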
>>
File: 0000.png (1.92 MB, 1344x896)
This idea looked better in my head than the final result.
>>
>>107194613
aaaaawwwww she's adowable could a nyugguh get a catbox pwease?
>>
>>107194619
boo fuckin hoo nigbo
>>
File: lora_00040_.jpg (1.04 MB, 1264x1656)
>>
File: WAN2.2_00565.mp4 (3.9 MB, 544x960)
>>
Hello /g/aylords.
I have two RTX2070 supers that have served me well to even generate SDXL pretty damn quickly when using a multi-GPU setup on Comfy.

I’m wondering what the best video model for my hopeful use case is, maybe you can shine some light onto the possibilities.

I want a general-purpose looping video generator that can do basically anything so I can create looping animated album covers for my local music files

Since album covers can come in a variety of art styles it shouldn’t be highly specialized, and it should also be able to do nsfw since some album covers are fairly nsfw, but doesn’t necessarily need to be able to do full on porn. Any advice for a wayward ni/g//g/er?


I don’t have any fancy gens to share right now because I’m not very good at them and I’m also at my job, where I cannot post pictures of half naked anime girls. Browsing here is a risk as it is
>>
File: infinity-2b-gguf.png (793 KB, 1023x510)
cool infinity-2b works as a q8_0 gguf

It took 5 seconds to make a 1024x1024 image with an unquanted vae and Q8_0 text encoder

Now we can move on to the video model
>>
>>107194761
>/g/aylords
i like your tone, nigga.
no, rtx 2070 supers are not going to happen with wan. best you're gonna do is keep on with sdxl.
>>
>>107194761
beggars can't be choosers, if ltx doesn't work you're fucked
>>
>>107194775
>>107194779
Not even with a split model? Rip

I mean I thought as much. How good is video generation with stable diffusion? I imagine pretty terrible since it’s not specifically trained to be as dynamic
>>
>>107194801
there are some good examples in this thread my man
>>
>>107194801
there is no video generation with stable diffusion my /d/own syndrome /c/ompanion, just save up your good boy shekles for a 5000 series.
>>
File: 1742126922202909.jpg (199 KB, 1024x1024)
>>107192042
https://gofile.io/d/22kM9z
>>
>>107194762
>cool infinity-2b works as a q8_0 gguf
please share my dude
>>
>>107194801
SVD is garbo, even just for adding ambient motion to a still image it's awful
>>
File: 1743701445281257.png (713 KB, 1080x1045)
I currently have an AMD RX 6700 XT gpu, I tried setting up wan2.1 and generating some image to video but got some error. I'm assuming my gpu isn't meeting minimum requirements or something. Anyone know?
>>
>>107194812
>>107194815
>>107194825
Thanks retards. Looks like I gotta bite the bullet. To think all the way back in the days of the novel ai leak I was thinking to myself “I wonder when I will have to upgrade my setup.” Here it is kek, the day in question
>>
>>107194821
>please share my dude
I need to actually package it first as a ComfyUI node or something. Just giving you the python scripts won't work either because I made some edits to their own inference script in order to support dequanting and handling normal attention when you don't have flash attention installed

If you can wait until the evening I'll make something worth sharing after I pass out

>>107194870
>To think all the way back in the days of the novel ai leak I was thinking to myself “I wonder when I will have to upgrade my setup.” Here it is kek, the day in question
Buy an RTX Pro 6000. 96gb of vram. You will be good for the next 5 years of AI, maybe even the next 10 if hyperscalers keep buying all the memory
>>
File: gigachad pureblood.gif (2.66 MB, 361x498)
>>107194870
hell 2 years ago people in /lmg/ were saying we'd never get gpt turbo at home, 2 years ago i was marveling at what 1.5 could do. now i can gen literally almost anything i want out of every single A.I (image,vid,tts,text) anytime i want with just my new 5060 ti 16gb and have it turn out pretty fucking good.
just a 2 year difference. mind blowing.
>>
File: 00102-4237625426.png (2.74 MB, 1824x1248)
>>107194625
here you go
https://files.catbox.moe/2ycp0t.png
https://files.catbox.moe/t3sc8u.png
https://files.catbox.moe/i5511a.png
here's lora: https://civitai.com/models/945982?modelVersionId=1493037
>>
>>107194918
That's fun but are you really using that model?
I should try but I've been postponing the fact that I need to update my venv for cumui because I have a new linux installation.
>>
>>107194918
very kino, thank you. i will now attempt to hold back the urge to gen her pissing over a squat toilet.
>>
>this gen is considered R rated by civitai's new standards
what a queer little website
>>
>>107194883
>If you can wait until the evening I'll make something worth sharing after I pass out
There's no hurry, just curious to try it
>>
Does stacking a bunch of loras together create inconsistent results because the loras conflict with one another or is there an inherent limitation of the architecture that makes adding more loras a bad idea?
>>
File: 1740404097428150.png (736 KB, 1024x1024)
>>107194994
love me fangs
love me :D face
simple as
>>
>>107195044
general limit is like 4 before you start seeing proper problems, but lora mixing is never a bad idea. it can have all kinds of neat effects. style mixing is one thing. its really mostly up to how the checkpoint handles loras, and how the loras were trained.
>>
>>107195044
Loras will always make the model more stupid. Combining them will make the results more and more stiff.
It's not bad per se but you would want to test it out.
>>
>>107195060
More stupid is a good way of describing what I mean.
>>107195054
>it can have all kinds of neat effects
Yeah I've tried mixing all kinds of loras and even found using negative weights help with the aesthetic I want, but it seems like the more I add the less of an effect each one has and the more unpredictable the results of adjusting the weights of each lora. I guess the lora acts kind of like a sledgehammer to the model.
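A toy sketch of why stacking gets unpredictable: each lora is a low-rank delta (B times A) added onto the same base weights, so the deltas simply sum and can reinforce or cancel each other. Names are made up, and this ignores the alpha/rank scaling that real trainers fold into the strength.

```python
def matmul(a, b):
    """Plain nested-list matrix multiply."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def apply_loras(weight, loras):
    """weight: d_out x d_in matrix. loras: list of (B, A, strength) with
    B of shape d_out x r and A of shape r x d_in. Returns W + sum(strength * B @ A)."""
    merged = [row[:] for row in weight]
    for B, A, strength in loras:
        delta = matmul(B, A)
        for i, row in enumerate(delta):
            for j, value in enumerate(row):
                merged[i][j] += strength * value
    return merged


base = [[1.0, 0.0], [0.0, 1.0]]
lora_a = ([[1.0], [0.0]], [[0.0, 1.0]])    # pushes one weight up
lora_b = ([[1.0], [0.0]], [[0.0, -1.0]])   # pushes the same weight down

print(apply_loras(base, [(*lora_a, 1.0)]))
print(apply_loras(base, [(*lora_a, 1.0), (*lora_b, 1.0)]))   # the two cancel out
print(apply_loras(base, [(*lora_a, 1.0), (*lora_b, -0.5)]))  # negative strength flips lora_b
```

With real loras the overlap is partial rather than exact, so each extra lora shifts what the others appear to do, which would fit the "less effect each, more unpredictable" experience.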
>>
>>107195060
>>107195099
I've seen it described as loras are literally raping the checkpoint you apply it to. So yeah, it can be seen as making it more stupid. Though i think a well trained lora (which is less complicated than it sounds) really lessens that effect.
>>
>>107195107
>>107195099
Yeah I meant it depends what you want of course.
I'm no way experienced in using them for artistic purposes like some here are.
>>
>>107195107
>loras are literally raping the checkpoint
gen it
>>
File: flux_ntk_v2_0001_.jpg (2.49 MB, 3644x4668)
>>107194569
it's not yet but i can run it and see. i've only attempted qwen loras once with mixed results

>a woman sits in an empty fast food restaurant, casual photo, social media photography, psxntk
>>
image gen is the ultimate gacha
>most generations are N rank and completely disposable
>sometimes you get an R rank that is decent
>every 100 gens or so you get that perfect SR rank
>if you use a paid service it's literally gacha
>>
>>107195145
just git to practicing. thats all there is to it really. and obviously learning how it works under the hood to an extent. but even indians can do half of that.

>>107195147
that oekaki fella probably could.
>>
File: 00122-3824233318.png (2.7 MB, 2016x1152)
>>107194994
i remember when the Sameko Saba vtuber came out. Mods on civitai were on high alert for sameko loras and auto restricted all of them to the pg13 category.
>>
>>107195176
No, I'm experienced, it's just that I don't care and prefer to just use plain. Maybe it's because I'm not a fan of characters.
>>
what caused ran to crash out?
>>
anyway, this is generally why you'd wanna train a lora for a character in sdxl based models, even if the character is trained, rng is still not on your side.
>and no this is not an andy warhol piece
https://files.catbox.moe/a81hw4.png
>>
>>107195313
he is in his discord
>>
>>107195044
One thing that can make them work together worse is if they were captioned in significantly different ways, or just with significantly different levels of accuracy.
>>
>>107195168
>psxntk
This kind of shit does absolutely nothing useful on models like Qwen and Flux where you almost certainly aren't training the text encoder at all while training a lora
>>
baking
>>
>>107195434
use case for training the text encoder on flux and qwen?
>>
>>107195443
>>107195443
>>
File: ComfyUI_00096_.png (2.01 MB, 1104x1472)
>>107194569
actually i'm retarded i think my qwen lora actually came out, i just had no idea how to use qwen
>>
>>107195170
Skill issue
>>
>>107195927
Inpainting is a skill, yes. For generation alone, he's somewhat right.




All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.