/g/ - Technology



Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106503402

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://rentry.org/wan22ldgguide
https://github.com/Wan-Video
https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2122326
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
File: postcard.png (288 KB, 868x321)
>>106507309
>>
>>106507313
>>106507155
>>
File: 1757200396295745.jpg (22 KB, 708x279)
Don't you see, people, that you are using a failed and unstable UI under the pretext that "it's the best" but deep down it's the only one that you can choose?
Every day you struggle, sweating the cold sweat of fear that something unexpected will happen when Comfy launches.
Ignore me, I admit I'm a hater, but think about yourselves
Do you really deserve to be treated this way, by this unstableUI?
>>
glazing cumfartui is now ILLEGAL
>>
>>106507328
As a certified non-ComfyUI user (but not hater) and a forge classic enjoyer, it also has weird glitches and typical python POS project weirdness aplenty. There is no “great” solution. Just install them all, hard drive space is cheap.
>>
I will create a collection of all the screenshots of people asking for help here when Comfy stops working.
>>
>>106507348
I use Dream Studio as god intended, anon.
>>
Why is there so much hate?
>>
>>106507313
>>106507300
>>
>>106507309
Is there a model yet that is:
1) At least as good as flux dev
2) is actually capable of understanding explicit descriptions, i.e. not just “uncensored” but the base model knows sexual positions and understands things like the difference between sex and rape.
>>
>>106507354
>>106507363
We genning comfy users now? Kek
>>
seems like sdg is raiding again sadly
>>106507362
>>
>>106507348
stuff like this is exactly why ani left to make a C app instead. I really hope it will raise the bar for what is acceptable software
>>
>>106507328
you dumb bastard. you understand that the error in that image is an issue i caused on purpose to bait the ilk like you? the reason that error happens is because i purposefully put in the wrong resolution. if you have a res not div'able by 64 then sparg attn throws a shitfit

now please just go away, you're shitting up the brand new thread ffs.

>>106507341
seems like it. time to walk away for a few hours until the psychopath fills his diaper
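
For anyone who hits the same error by accident: a minimal sketch of snapping a resolution to the nearest multiple of 64 before queueing, going by the divisible-by-64 point above. The helper names are made up for illustration and are not part of ComfyUI or any attention library.

```python
def snap_to_multiple(value: int, multiple: int = 64) -> int:
    """Round value to the nearest multiple (ties round up), with a floor of one multiple."""
    snapped = (value + multiple // 2) // multiple * multiple
    return max(multiple, snapped)


def snap_resolution(width: int, height: int, multiple: int = 64) -> tuple[int, int]:
    """Snap both dimensions independently; aspect ratio may shift slightly."""
    return snap_to_multiple(width, multiple), snap_to_multiple(height, multiple)


if __name__ == "__main__":
    # Hypothetical input resolution, chosen only to show the rounding.
    print(snap_resolution(1000, 600))
```

With these helpers, 1000x600 becomes 1024x576.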
>>
https://huggingface.co/silveroxides/Chroma-LoRA-Experiments/tree/main/hyper-low-step
Apparently the bottom one serves as a stabilizer. (It's not actually low step tho. Use a full run)
>>
>rocketgurlp***
>>
>>106507375
rent free
>>
>I was only pretending to be retarded
>>
>>106507372
Unironically things like 4o and Gemini seem to be capable just fine but they are hard filtered. I’ve seen anons in other threads post how it lets some slip out sometimes but that’s not local I guess lol. You didn’t explicitly specify.
>>
>>106507372
3) can also gen them without horrible anatomy or women with man hands/shoulders and feet?
>>
>>106507391
not in a good way imo. trying to use that to make a realistic cartoon (think graphic novel etc) becomes impossible as it rapes the style into that shitty corpo cartoon style.
>>
>>106507395
can you explain schizo? from my understanding pytorch is just a wrapper over libtorch which is c++/cuda code
>>
>>106507413
>>106507388
>>106507363
>>106507354
high quality stuff maybe check out sdg?
>>
>>106507413
more like this please
>>
>>106507406
I've got decent results with anime and even some flesh. Tried 0.5 strength?
>>106507408
Chroma can do billionaire parties on a certain island in the carribean, if you know what I mean.
>>
Qwen seems to handle "combo pack" character Loras quite well, this is a mid-epoch sample from one I'm training right now where half the dataset is solo Tifa and half the dataset is solo Red XIII
>>
File: ComfyUI_temp_acghq_00003_.png (3.14 MB, 1664x1264)
My last post shilling samplers today, but ancestral works quite well with Chroma: re-evaluating at every step does wonders with these undertrained models.
>>
>he still doesn't know python can't into scaling
>>
>>106507413
Ah, deepdream my old friend, you’re in the wrong thread.
>>
>>106507427
Sorry about the speech bubbles, that's my lora's fault.
>>
>>106507427
yeah all Ancestral samplers work with all Flow-matching models in Comfy just because of a built-in "hack" it has, they otherwise wouldn't work though
>>
>>106507431
nice irl comfy fennec girl. really brings out the spirit of comfyui
>>
>ldg is worse than sdg
what the fuck is going on?
>>
File: IMG_2957.jpg (102 KB, 700x525)
>>106507446
Pic reminded me of deepdream images which were saas not local lol.
>>
>he doesn't know why users hate python
>>
>>106507451
Happens when /sdg/ is raiding.
>>
InOrganic
>>
>>106507460
then why are all these python devs scaling shit like crazy? it's annoying as fuck
>>
>>106507472
>>106507469
gluten free :D
>>
>deleted the monkey one
the only funny one removed, typical
why is it i have to check desu to view these threads properly these days?
>>
>>106507494
pokes bear, cries
>>
>>106507453
this is actually kinda neat anony
>>
> radial attn on
> output is k, bit rough

> radial attn on
> output is far better

ah geez rick, i-i'm. i don't feel so good
>>
>>106507527
>he watches the normie-cattle-sloppa
oooofff, n g f m i
>>
>>106507527
i mean off.
time to off myself

>>106507540
>he understands the reference
you're the loser in this scenario
>>
I'm really struggling here, could someone tell me where i'm messing up here? The upscale comes up super blurry.
>>
>>106507309
I have no idea what half the things are on this. Can someone give me a really basic run down of what each thing is? like what is the ui for? where are the models? My goal is to make videos like in OP's. What do i need for that?
>>
>>106507559
anistudio for all three
>>
>>106507559
wow if only there were some kind of getting started guide in the post

piss off. i hope your tensors fuck your wife.
>>
>>106507547
I think you’re missing a second vae decode but I’ve only used comfy once or twice so other anons can better say. I stayed on forge after similar difficulty as you with no “simple” hiresfix or sd upscale type functionality.
>>
File: Qwan_00013_.jpg (807 KB, 1984x2976)
A very liberal interpretation of 'dust motes'.
>>
>>106507570
>anistudio
>18 stars on github
>main branch hasn't seen an update in 4 months
Why'd you mislead a tourist like that?
>>
>>106507602
kek
>>
can you relax with the literal death threats?
that'd be great
>>
>your daily use limit for comfyui has been reached
>please purchase comfycoins to continue
>>
>>106507641
worth every penny teehee ;3
>>
>>106507573
tried to read it and didn't understand shit, but im an ultra consumer who doesn't have money so i don't want to understand it i just want to know the bare minimum to make it happen.
>>
>>106507646
leave me out of whatever nonsense is going on (which seems to be constant here)
>>
File: ComfyUI_01077_.png (1.63 MB, 1024x1536)
>>
how do you make nsfw content with the wan 2.2? I do image to video and prompted it to undress and it doesn't do anything except girl moving around
>>
>ultra consumer
>doesn't have money
what is this fresh new cope?
>>
>>106507570
i don't want anime style though? and do you have a link for it?
>>
>>106507675
im taking my secrets to the grave ;3
>>
>>106507680
We aren’t interested in helping you make Epstein island style images you nonce
>>
>>106507679
it's called being retarded buddy hope you understand you're hurting the feelings of a retard. I'm gonna have to go take it out on my student on Monday now because of you
>>
>>106507635
are you ok anon?
>>
>>106507683
see >>106507602
>>
>>106507687
not what i want didn't even indicate that, very evil of you to even say something like that. how do i convince you to tell me what to do
>>
are you load British or something? go to sleep it's like 3 am for you
>>
Qwen is a good base model
Chroma is not a good base model

End of story.
>>
>>106507557
>learn to post the catbox first instead of a screenshot
Sorry, i figured out how to export it, here you go.

https://files.catbox.moe/4tdumu.json
>>106507574
Yeah, I have forge but I wanted to figure it out here so I don't have to switch between them.
>>106507580
>Your second sampler isn't hooked up. Go slow anon, take your time.
I'm trying man, but I think i'm just making it worse, I connected the second sampler but it's still blurry.
>>
>all his posts are nuked from \vp\ instantly
why cant you just be a good doggy?
>>
maybe it's about the message not the post
>>
File: 00073-3398516116.png (972 KB, 1216x832)
>>
>>106507745
>sdxl model
>cfg 1.0
>lcm/normal
>no negative
anon please try harder
>>
>>106507309
Has anybody else gotten this segfault using this (https://github.com/Enemyx-net/VibeVoice-ComfyUI) for VibeVoice?

FETCH ComfyRegistry Data: 5/96
got prompt
WARNING: LoadAudio.IS_CHANGED() missing 1 required positional argument: 'audio'
APEX FusedRMSNorm not available, using native implementation
[VibeVoice] Using embedded VibeVoice from /home/plof/AI/ComfyUI/custom_nodes/VibeVoice-ComfyUI/vvembed
[VibeVoice] Using auto attention implementation selection
Loading checkpoint shards: 67%| | 2/3 [00:02<00:01, 1.22s/it]FETCH ComfyRegistry Data: 10/96
Loading checkpoint shards: 100%|| 3/3 [00:03<00:00, 1.15s/it]
No preprocessor_config.json found at microsoft/VibeVoice-1.5B, using defaults
Loading tokenizer from Qwen/Qwen2.5-1.5B
[VibeVoice] Starting audio generation with 20 diffusion steps...
[VibeVoice] Generating audio with 20 diffusion steps...
[VibeVoice] Note: Progress bar shows max possible tokens, not actual needed (~30.0 estimated)
[VibeVoice] The generation will stop automatically when audio is complete
Generating (active: 1/1): 0%| | 0/132 [00:00<?, ?it/s]FETCH ComfyRegistry Data: 15/96
run.sh: line 2: 3186442 Segmentation fault (core dumped) python main.py --listen
>>
>>106507756
not a mind reader but since you clearly aren't used to not getting what you want i'll comply

sunset complete
its people like you that 'ruined' 4chan
>>
>>106507772
>anon please try harder
I'm using dmd2 as a lora, not sure if you're implying i'm trolling but i'm not. Just asking for help.
>>
>>106507775
I don't use that repo. But taking a shot in the dark, when was the last time you did
pip install transformers --upgrade
?
>>
>>106507799
>denoise .01
That's what I use on Forge and it works fine.

>fwiw
i had to look that up, i'll try latent if i can get this to work
>>
>>106507745
Why do you have a vae node when the checkpoint (and I assume the model) already has a vae output? Speaking of vae, the vae encoder isn't hooked up either. The reason shit looks blurry is because you're saving the image at the wrong node, you need to connect it to a 2nd vae decode node that is connected to the 2nd ksampler latent output. Also, increase the denoise on the 2nd ksampler, try at least a value of 0.30.
>>
>>106507784
bye bitch
>>
>>106507784
>>106507840
we can threaten you with impunity!
>>
stop interacting with the brainlet.
there are more than enough example workflows for him to figure this shit out. you telling him what to do will just lead to more retards never learning
>>
File: ComfyUItest_00004_.png (1.73 MB, 1024x1024)
What's up with all the ugly gens?
>>
>>106507784
absolutely never ever come back please. you are the person who "ruins" 4chan by being pathetic all over the place.
remove the name and partake in conversations like normal people do. nobodly likes a namefag
>>
>>106507850
pic related?
>>
>>106507850
sdg got fed up again probably
>>
>>106507850
It's just a hater, ignore the lowlife.
>>
>>106507869
given he wont stop being retarded removing the name he will just be impossible to filter out then, so not a good idea, he should just stop posting altogether
>>
869
so you can larp again? as him?
>>
you dont want me to post here (fine)
but why go out of your way to shit up a general that has nothing to do with you?
>>
>>106507892
why make death threats?
>>
>>106507833
>>106507843
I think i'm getting somewhere now after your reply. It's no longer blurry and the result looks good. Thanks.
>>106507849
Why does it upset you that i'm trying to learn? Just ignore me if it bothers you that much.
>>
>>106507901
>>106507892
why make literal death threats because you dont like something on 4chan? why try to micromanage\control 4chan?
why be so compulsive\neurotic to the point of mental illness?
do you need a padded room?
how many times do the jannies have to ban\prune you?

its illegal to make death threats in the united states edgelord or not.
>>
>>106507877
i have a pretty thick skin and can enjoy banter from time to time; and even dont mind being the butt of mean-spirited jokes...
but death threats are taking things WAY too far
>>
>>106507919
can you please stop forcing your existence into these threads? you are unwell. stop trying to make yourself a "personality" here. it's unsightly.
>>
Hey guys I've decided to switch from smoking Hercules to smoking Labubu.
>>
>>106507929
can you stop menstruating pls
>>
>>106507916
even if I give you the benefit of the doubt and assume you are not trolling (you are), what you are doing here is really really dumb. If you set out to design a comfyUI workflow to make people angry, you have succeeded, congrats. If you are actually trying to learn then delete your current workflow and start with an sdxl template.
>>
lil'bro needs help
>>
File: ComfyUI_03297_.png (2.72 MB, 1344x1728)
>>
SICKENING honestly
this is what happens when redditors visit the boards too often kek
>>
File: zuizhz1rr5761.png (1.01 MB, 1395x746)
>>106507936
can you do this one real quick
>>
>>106507936
why couldn't it be smoking hot teens in my area :(
>>
>>106507963
I already did that one a few weeks ago, but I can't find it in my output folder for the life of me.
>>
>>106507954
>assume you are not trolling (you are)
I'm not trolling. Look, this is what i'm doing right now after I got that reply about the VAE not being connected. I got an image to output and upscale. Still messing with it though.

>If you set out to design a comfyUI workflow to make people angry, you have succeeded, congrats
I don't understand why i'm making you angry. I'm just trying to learn Comfy.
>>
>>106507933
only came here and made posts because im getting death threats on other unrelated boards from here
>>
>>106507994
so you want to spread drama of other boards here? fuck off
>>
Why are you meanie heads bullying the cute Tripfag? :c

BE NICE! c:
>>
>>
>>106508005
literal DEATH THREATS anon & we all know who it is because of his vidgens\resolution
unprovoked mind you
>>
>>106507984
try it again
>>
>>106508023
>we all know who it is because of his vidgens\resolution
can you try to formulate a clear thought? you talk like you're high honestly
>>
>>106508034
that sick fuck ani
>>
>>106508043
KEKKED
>>
File: 1757204876626457.webm (2.69 MB, 1440x1440)
see ya around
fucking literal psycho shit
>>
>>106508061
comfortable gen
>>
>>106508041
what hrt + unresolved mental issues does to a motherfucker.
daily reminder to be thankful your brain doesn't try to pull shit on you. imagine having to make yourself the center of attention everywhere you go, just so you can feel a modicum of social interaction so you don't kill yourself.

thank you brain, i love you.
>>
>>106508078
same here, can't wait for all these mental ill trannies to be hung
>>
File: WanVideo2_2_I2V_00333.webm (2.04 MB, 1248x720)
>>
>>106508087
>>106508078
t-t-the jannies are conspiring against ME!
you truly sadden me anon
praying for ya
>>
>>106507991
you seem earnest but i dont use uncomfyui, sorry
>>
>>106508061
Snowflake
>>
>>106507933
He thinks this is /sdg/ where avatarfags rule and post shit like “how are you today sweetie” to each other, it’s disgusting
>>
File: ComfyUItest_00019_.png (1.36 MB, 1024x1024)
>>
>>106508017
I absolutely love bulbous bimbo balloons and I’m not afraid to admit it
>>
>>106508230
yeah well that's just like your opinion man
maybe youll be happier on reddit where everything is the same
>>
>>106508078
projection
>>
>>106508078
imagine attacking things consistently and constantly because you don't like them like a psychopath, sad.
see
>>106508147
>>106508305
>>
File: ComfyUI_00002_.mp4 (886 KB, 640x640)
>>
File: ComfyUI_00003_.mp4 (1.22 MB, 640x640)
The video gen seems like magic.
>>
File: AnimateDiff_00308.mp4 (3.91 MB, 720x1040)
>>
>>106507916
>imagining doing all this for a hiresfix +lora
Lol
>>
File: ComfyUI_00004_.mp4 (930 KB, 640x640)
I'm impressed by the dust.
>>
i have been conducting illegal gardevoir crossbreeding experiments. not sure why it keeps coming out dark/hazy tho lol
>>
>>106507916
I hope you aren't a VRAMlet and doing all this for sad prompting with SDXL because it gives me deep pity and sadness
>>
File: ComfyUI_00005_.mp4 (606 KB, 640x640)
I am a promptlet
>>
File: ComfyUI_00006_.mp4 (608 KB, 640x640)
>>106508406
I don't know what happened here.
>>
>>106508391
Hassaku x Illustrious = Pepsi x Coke
Both are different flavors of the same garbage. You're basically choosing between two types of cancer, they're both inbred frankenstein merges of existing checkpoints.
>>
>>106508431
hassaku is a tune of illustrious buddy
>>
>>106508468
>tune
the word is 'slopmix' sir. Noob is a finetune, the rest are lora merge slopmixes.
>>
File: ComfyUI_00009_.mp4 (575 KB, 640x640)
>mushroom cloud
>>
>>106508382
I know, right? I got it to work so it's all good.
>>106508397
>I hope you aren't a VRAMlet
How would you feel if I was doing this on a 5090?
>>
For ComfyUI Cumfartniggers having problems with (v)ram management since the very recent updates, don't pass "--fast" without any arguments anymore, this update fucked it
https://github.com/comfyanonymous/ComfyUI/commit/e2d1e5dad98dbbcf505703ea8663f20101e6570a

With no arguments all "--fast" optimizations are enabled, which are:

>Fp16Accumulation = "fp16_accumulation"
>Fp8MatrixMultiplication = "fp8_matrix_mult"
>CublasOps = "cublas_ops"
And the new trash one
>AutoTune = "autotune"

So now pass the above strings as arguments for --fast, or don't use --fast at all. In my experience the 20% faster fp16 accumulation is not worth the quality degradation, for example.
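
A rough sketch of the flag behaviour described above, assuming --fast takes zero or more feature names and that an empty list means "everything". This is a standalone argparse model for illustration, not ComfyUI's actual code; the enum names and strings are the ones quoted in the post, and `enabled_features` is a hypothetical helper.

```python
import argparse
import enum


class PerformanceFeature(enum.Enum):
    # Names and values as quoted in the post above.
    Fp16Accumulation = "fp16_accumulation"
    Fp8MatrixMultiplication = "fp8_matrix_mult"
    CublasOps = "cublas_ops"
    AutoTune = "autotune"


def enabled_features(argv):
    """Return the set of features a given command line would switch on (assumed semantics)."""
    parser = argparse.ArgumentParser()
    # nargs="*" lets the flag appear alone or be followed by any number of names.
    parser.add_argument("--fast", nargs="*", type=PerformanceFeature, default=None)
    args = parser.parse_args(argv)
    if args.fast is None:
        # Flag not given at all: no optimizations.
        return set()
    if not args.fast:
        # Bare "--fast": everything on, including autotune.
        return set(PerformanceFeature)
    # Explicit names: only those.
    return set(args.fast)


if __name__ == "__main__":
    print(enabled_features(["--fast"]))
    print(enabled_features(["--fast", "fp16_accumulation", "cublas_ops"]))
```

On the command line that would correspond to something like `python main.py --fast fp16_accumulation cublas_ops` to pick features explicitly and leave autotune out.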
>>
>>106508431
Yeh I just downloaded a bunch of the most popular models on Civitai and switch around every so often, still not sure if there’s any one I like more than the others yet
>>
File: ComfyUI_temp_fvumt_00005_.png (3.83 MB, 1400x1800)
>>
>>106508522
people swear by wainfsw, i use v14. there's something about v15 that doesn't convince me to change but i cant tell what it is
>>
>>106508524
I can't even see their asses! :c
>>
File: ComfyUI_00013_.mp4 (320 KB, 640x640)
Having some fun
>>
>>106508524
>plump belly
This is shortcut to CUM
>>
File: ComfyUI_00016_.mp4 (714 KB, 640x640)
>>
File: QvenEeemage_Output_123151.png (1.73 MB, 1472x1024)
she would, I've seen the vidyas bros
>>
File: ComfyUI_00021_.mp4 (725 KB, 640x640)
so close yet so far
>>
>>106508413
That's kind of awesome, sneaky cactus be like 'let's go!'
>>
File: ldgtest3.mp4 (511 KB, 960x192)
>>106508679
>tfw nobody makes ldg logos anymore they just argue about shit
>>
>>106508413
You used fp8 scaled, that's what.
>>
File: ComfyUI_00023_.mp4 (637 KB, 640x640)
>>
>>106508413
this is legit a scene from lba2
>>
File: EOM8ivLUUAEz.jpg (36 KB, 384x516)
Do you need the flux controlnet to upscale with chroma or just the upscaling model?
>>
>>106508695
I stopped making them once the Baker refused to put them in the collages
>>
>>106508724
he has derangement syndrome
>>
>>106508728
and is v into leashes/dog roleplay
& has fat/scat fetish+ hates trump
>>
>>106508714
>he still thinks a real live person is choosing which slop goes in
>>
>>106508734
accurate
>>
>>106508734
don't forget the poop and toilet posting
>>
>>106508750
janitor privileges
>>
>>106508524
catbox?
>>
this two sampler bullshit is so fuckin SLOW
>>
>>106508505
thanks for posting, i was wondering why with the new update vae encode/decode was fucked up
>>
>>106508061
catbox?
>>
File: 1265758173541.png (29 KB, 130x190)
>>106508505
>he pulled
>>
>>106508532
tried it, it does seem to produce more vibrant and "cleaner" outputs, but i think it's also what anons here mean when they say "slopped", it readily wants to give everything the same big eyes and similar face shape no matter the seed. might have to play with it more though i don't hate it. also a hornier model (its in the name i suppose), it just gives everything cameltoe unprompted lol
>>
>>106508023
>worrying about death threats from randos on the internet
u got blue hair and pronouns in bio or something?
>>
>>106508697
There it is!
>>
File: ComfyUI_00155_.png (3.71 MB, 1336x1952)
>>
was training code ever released for VibeVoice?
>>
>>106508854
no. fucking jeets man. then again we would have never got the weights uncensored if it weren't for their incompetence. double edge sword...
>>
>>106508871
can it do anything lewd or is it slop
>>
File: 1744868720330028.png (1.91 MB, 1080x1909)
>>
>>106508874
it can do lewd sounds, just in a podcast format which is funny. just grab the samples you need from it using audio editing tools. comfy is terrible at that
>>
>>106508874
Couple anons were posting some stuff earlier today got it to do some moans and stuff but apparently it’s a dice roll, you can’t directly prompt them.
>>
>>106508874
Yes it can do lewd there's examples in the last two threads.
>>
>>106508887
>you can’t directly prompt them
Wait so how do you control the model?
>>
>>106508815
the illegal crossbreeding experiments continue. trying to create the ultimate goonermon, gardporeon. kek. forgot how useful the simple little prompt alternating syntax is in the 1111 derived UIs.
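
For anyone who hasn't used it: as far as I understand, the 1111-style alternating syntax is written `[a|b]` and swaps the active word on every sampling step. A toy sketch of just that scheduling idea follows; the two subject names are my guess from the portmanteau, and `parse_alternation` / `schedule` are hypothetical helpers, not functions from any UI.

```python
def parse_alternation(text: str) -> list[str]:
    """Split a single '[a|b|...]' group into its options (no nesting handled)."""
    inner = text.strip()
    if not (inner.startswith("[") and inner.endswith("]")):
        raise ValueError("expected a single [a|b] group")
    return [option.strip() for option in inner[1:-1].split("|")]


def schedule(options: list[str], num_steps: int) -> list[str]:
    """Which option is active at each sampling step, cycling in order."""
    return [options[step % len(options)] for step in range(num_steps)]


if __name__ == "__main__":
    opts = parse_alternation("[gardevoir|vaporeon]")
    print(schedule(opts, 4))
```

Because the conditioning flips every step, the sampler ends up blending both subjects rather than picking one.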
>>
>>106508917
why did she bake a wicker basket?
>>
>>106508903
words. it reads words but it won't "read" non words like moans and sounds and such, if i understand right.
>>
>>106508918
maybe its full of some delicious jelly donuts!
>>
>>106508917
the controlnet image i'm using has a round birthday cake, but i didn't prompt anything about what she's holding so the model is just dreaming something up i guess. different seed and now it's a pot of rocks lol.
>>
File: 1750718450798549.png (34 KB, 579x316)
i'm training a chroma lora using diffusion-pipe. previously, i've trained using a single resolution on all images. for example, a lora i did recently had 100 images, and i used a micro batch size of 4, meaning i had 25 steps per epoch. it's simple and it made sense.

however, i decided to test making a lora with several different resolutions, so i resized my training data into 4 different resolutions, and set those as my bucket sizes. but now when i train my lora, around 70 images total, i am now getting 7 steps per epoch. and at a batch size of 4, that tells me it's only training on 28 images or so.

i've made sure that there are enough images in each bucket so that they dont get skipped. i've watched the output in my terminal and seen that all images have been cached before training started. i've put my output into various chat bots and i dont really trust their response.

is this expected when using multiple buckets, or is something terribly wrong? to me it seems like it's skipping more than half the training images for some reason.
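
A back-of-envelope sketch of the arithmetic in question, assuming batches are only formed from images inside the same bucket. Whether diffusion-pipe keeps or drops partial batches is not stated here, so both cases are shown; the per-bucket split of the ~70 images is hypothetical and `steps_per_epoch` is an illustrative helper, not a trainer function.

```python
import math


def steps_per_epoch(bucket_counts, micro_batch_size, drop_last=False):
    """Count batches per epoch when each bucket is batched separately (assumed model)."""
    total = 0
    for count in bucket_counts:
        if drop_last:
            # Leftover images that don't fill a batch are skipped.
            total += count // micro_batch_size
        else:
            # Leftover images form one smaller batch.
            total += math.ceil(count / micro_batch_size)
    return total


if __name__ == "__main__":
    # Single resolution, 100 images, batch size 4 (the earlier lora).
    print(steps_per_epoch([100], 4))
    # Hypothetical split of 70 images over four buckets.
    print(steps_per_epoch([18, 18, 17, 17], 4))
    print(steps_per_epoch([18, 18, 17, 17], 4, drop_last=True))
```

In this simplified model the 70-image case lands somewhere between 16 and 20 steps depending on partial-batch handling, so a figure of 7 would have to come from something the sketch does not model.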
>>
>>106508524
>>106508585
>>106508555
>>106508759
nta but
https://files.catbox.moe/dje8xn.mp4
>>
A question about forgeui controlnet segmentation:
is there a way to change what color should prompt what?
>>
>>106508967
For open pose type skeletons you mean? You can actually get those to work reliably? I gave up on them and just use depth_anything/v2 I find it MUCH more reliable
>>
File: 1741744953344443.png (10 KB, 329x171)
So this is all one schizo or? Why were they both nuked when there were no arguments itt
>>
>>106508984
lazy jannies often hammer multiple people or any post that was reported
>>
>>106508917
>forgot how useful the simple little prompt alternating syntax is in the 1111 derived UIs.
I miss that a lot actually.
>>
>>106508984
Made me realize we haven’t had the comfy should be dragged blessing itt yet, wow.
>>
>>106508984
>>106508989
this though sometimes it's funny. caught a ban once for just saying i'm about to summon debo
>>
>>106508976
oh no, just the regular segmentation controlnet option. it separates objects correctly, but I was wondering if there was a way to define what segmented pieces should be (in img2img). I didn't see any way to do so, and there even exists a color coding list, but it seems to be static
>>
Blessed thread of frenship
>>
>>106509013
Ah gotcha, I never use that one, don’t hate me for Reddit post here but it seems like it’s what you’re looking for https://www.reddit.com/r/StableDiffusion/comments/11cvkmp/comment/kzok1yo/?utm_source=share&utm_medium=mweb3x&utm_name=mweb3xcss&utm_term=1&utm_content=share_button
>>
>wainsfw
>>106508123
>>
>>106509040
>go to the pederast thread
No, I don’t think I will thank you very much.
>>
File: 00380-2385048658.png (3.37 MB, 1248x1824)
>>106508984
I just realized this thread was not properly blessed. We are all heathens that have lost the path.
>>
>>106509045
now we can be pedos without any remorse
>>
>>106509036
Scrounged a little more based off that and it seems like I'm just gonna have to edit the outputted segmented image by changing the colors manually. a little annoying but doable. Thanks!
>>
>>106509060
NP anon, if you haven’t tried depth anything v2, I highly recommend it, it takes practically anything I throw at it on sdxl based models
>>
>>106508924
How does it differ from a classic TTS then?
>>
>>106509071
It's very good tts
>>
>>106509071
it's podcast format for one so you can have two voices talking to each other
>>
i have great fun using 3dpd pornostars as controlnet inputs.
>>
>>106509117
>no lisa ann lora
pein
>>
>>106509145
Pornolab has a huge collection torrent which I had on my nas before it died. Go out there and be the change you want to see, lad.
>>
>>106509157
>which I had on my nas before it died
you didn't have a nas for your nas? heh
>>
>>106509040
what are some /ldg/ approved checkpoints?
>>
>>106509163
RAID0 is a hell of a drug.
>>
>>106509176
>RAID0
as expected of a Gardevoir enjoyer
>>
>>106509170
Just go to Civitai and search for most downloaded of all time. The masses couldn’t be wrong, surely!
>>
File: cin16 - Copy.jpg (167 KB, 768x768)
>>106508984

All that autistic hyperfixation energy wasted on shitposting. Very sad.
>>
>>106509214
Got it, i'll do as you said and sort by "Newest" then download the first one.
>>
>>
>>106509241
Careful anon I saw the other day some gigachads are still uploading 1.4 mixes kek. Might end up travelling through time
>>
>>106509045
Bless this BWC, bitch
https://files.catbox.moe/i6946s.mp4
>>
>>106509288
christcucks are more into BBC anon
>>
>>106509314
I don't see what buck breaking cock has to do with anything here anon
>>
>>106509347
no clue man but stastics show christcucks like their blessed BBC for some reason
>>
i don't want to turn into one of those post all of my sloppa gens types, but you faygles aren't posting anything. so i tried the gardporeon prompt on one of those hyper slopped "noob illustrius realism" type mixes meant for doing nudes and faceswaps and shit (which it sucks at like most sdxl based models), but it creates a very pleasing 3dcg effect with non-human subjects that i quite like. reminds me of like late 2000s cg art like the old digitalblasphemy stuff. might be a neat alternative if you're looking for that kind of effect.
>>
>>106509533
looks like a cosplay photo, neat
>>
>>106509533
there doesnt have to be constant activity. nothing new is out.
>>
>>106509533

How does it interpret something fluffier? Maybe lopunny?
>>
>>106509539
Yeah it’s interesting sometimes it gives that body suit effect and sometimes it goes full shiny hard candy surfaces for everything. Like I said I found it kinda shit for real 3dpd but this is a fun alternate use
>>
>>106509573
same seed and settings just changed to lopunny and uh...interesting lol. still a unique look compared to the anime focused models
>>
>>106509601
Very balloon-looking which I'm sure some people are into. I get what it's trying to do though.
>>
>>106509573
>>106509601
so basically a realistic fursuit generator lmao.
>>
What is that thing called when you use some sort of 3d-modeling style skeletal animation to drive the AI video's motion? I remember seeing it posted but I didn't bookmark it.
>>
>>106509601
>>106509618
furfaggotry belongs in >>>/b/
>>
>>106509625
Sounds like controlnet for video?
>>
>>106509634
Homeboy just asked what it looked like no more I promise soweyyy
>>
>>106509625
Openpose controlnet.
>>
>>106509618
dick twitched, not happy about that but here I am.
>>
File: 2kvshd.jpg (1.51 MB, 2304x1792)
https://huggingface.co/silveroxides/Chroma-Misc-Models/tree/main/2k-test/2025-09-05_07-22-30
Interesting snakeoil version
>>
>>106509655
Fug I didn't save the tags. Left is the 2K, right is HD
>>
File: raptor1 - Copy.jpg (186 KB, 768x768)
>>106509634
Trying too hard lil bro
>>
>>106509675
That's a chicken, nigga
>>
>>106509655
Any explanation what this is?
>>
>>106509682
It's called 2k test so I'm guessing it's for more high res pics, but who knows.
>>
https://files.catbox.moe/81c0l9.flac
>>
File: chicken1 - Copy.jpg (185 KB, 768x768)
>>106509678
You're right, sorry.
>>
>>106509700
>chefs kiss
>>
>>106509701
Don't let me catch you slipping again lil bro
>>
File: ComfyUI_00279_.mp4 (1.04 MB, 720x720)
>>
>>106509722
>feathers
NOT MY DINOSAURS!
>>
>>106509700
amazing
>>
File: ComfyUI_00280_.mp4 (1.57 MB, 720x720)
>>
File: ComfyUI_temp_qufsb_00001_.jpg (748 KB, 1792x1152)
>>
File: 1686646577004591.jpg (198 KB, 1080x1080)
>>106507309
holy shit, get me that chunli webm holy fuck please anons
>>
>>106509756
the collage always contains gens from the previous thread, so search there.
>>
>>106509773
sounds like a lot of work when you could just link me to it
>>
>>106509687
I love this model but every facet of it feels like black magic.
>>
>>106509773
Found it thanks
>>106509778
fuck off falsefagger
>>
>>106509722
>>106509741
I knew dinosaurs were gay
>>
>>106505476
Literally using sage2 with all sdxl based models in forge rn, wym?
>>
File: 75656885667.jpg (167 KB, 768x768)
>>106509712
You got it bro
>>
>>106509799
Does it actually speed it up tho?
>>
>>106509799
take anything you read here lightly as the majority of posters are quite retarded
>>
File: ComfyUI_temp_qufsb_00005_.jpg (656 KB, 1536x1280)
Seems like 2Mpx is the limit. Going above started duplicating the vehicles.
>>
holy fuck bros, happy to live in this timeline, can generate anything now, we have image, video and now voice; generating a dataset and training a lora has never been easier, it's fast too, can put out characters every 1-2 hours, local wins as always
>>
File: ComfyUI_00060_.png (3.12 MB, 1280x1920)
>>
File: 1736465084189571.mp4 (3.62 MB, 480x832)
>>106509361
>(((statistics))) show


