/g/ - Technology




26b Edition

Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106553794

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2122326
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
>>106556603
Weak highlights today
>>
>>106556614
which one was yours?
>>
>>106556614
>Weak highlights today
HunyuanImage and SPRO turned out to be nothingburgers, so there's nothing to showcase recently
>>
>>106556648
SPRO is a good technique. too bad they used it on a distilled model
>>
>>106556655
they'll use it on Qwen Image soon, we'll see if the method is really the ultimate unslopper
>>
I have an obscure problem

I'm trying to gen women with black skin, like actual black skin not african brown skin. And whenever my workflow goes to the next sampler/pass they go from black to brown. I've tried different prompts but no changes have worked. One interesting thing about the problem is even though the sample preview looks like black skin on my high pass, the first frame is brown skin.
>>
>>106556603
>instagirl grifter slop lora reddit images in the highlights

you can't go any lower than that
>>
So, now that the dust has settled. just how censored is Wan 2.2?
>>
localchads we will rise again
>>
>>106556694
it's too censored...
https://files.catbox.moe/xzc8j0.mp4
>>
Worst collage EVER
>>
>>106556708
now run this without the lora
>>
hi anon,
is qwen inpainting model out?
>>
>>106556747
no
>>
>>106556751
than you sir
>>
Blessed thread of frenship
>>
>>106556737
>Oh you can nail down nails with a hammer easily? Now do it with only your hands, heh.
Smartest brown
>>
>>106556737
why do you not want to use a lora? if it works it works
>>
>>106556776
you know what he meant in the first place when he first said it, you disengenuous fucking retard.
>>
>>106556729
which one was yours?
>>
>Use my civitslop lora SIR its trained off best henti SEX and GOODEST CAPTIONS
>>
>>106556673
ugliest collage set in memory, op really must be that guy
>>
>>106556792
If someone can train a lora on a basic local gpu thats just porn and the model just gets everything because it wasnt lobotomized like a lot of other models then it's not censored.
>>
>>106556801
You forgot to attach your own image showcasing true art?
>>
File: Untitledgwgwg.mp4 (3.68 MB, 612x1200)
Kek, I wanted to see how this turned out. Taking the last frame of the gen and just keep on going.
Even with a reference image it shits itself extremely quickly.
>>
>>106556841
def a problem somewhere in the workflow to have that big color shift
>>
okay what's the catch local gen? Is my pc a ticking bomb?
>>
File: 1756288374336460.png (88 KB, 1039x879)
Anyone using the clown samplers with wan 2.2?
Picrel is what I use, is it what you'd recommend?
(I don't use lightx2v)

It's quite slow compared to what I was using before (unipc)
>>
>>106556861
Oh that was just a shitty mediaplayer screenshot I shoved back in, I just wanted to test the seamless part. I'm sure I can do much better if I try.

Saw someone automating the FFLF and having new prompts for each one stacked up.
>>
What one anon brought up before is why wan 2.2 uses so much RAM. Isn't it just supposed to load the model into VRAM? Is it really so demanding that it instantly overflows the GPU and loads into RAM?
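Rough napkin math on why it spills, as a sketch only. Assumptions not from this thread: the 14B variant of wan 2.2 ships as two separate ~14B-parameter experts (high noise and low noise), and whatever doesn't fit in VRAM gets parked in system RAM. The text encoder and VAE come on top of these numbers.

```python
def weights_gb(params: float, bytes_per_param: float) -> float:
    """Size of the raw weights in GB (1 GB = 1e9 bytes), ignoring activations."""
    return params * bytes_per_param / 1e9

# Assumption: each wan 2.2 14B expert has about 14 billion parameters.
EXPERT_PARAMS = 14e9

for name, bytes_per_param in [("fp16/bf16", 2), ("fp8", 1)]:
    one_expert = weights_gb(EXPERT_PARAMS, bytes_per_param)
    # Two experts are used per video, so both have to live somewhere.
    print(f"{name}: one expert ~{one_expert:.0f} GB, both ~{2 * one_expert:.0f} GB")
```

So even at fp8 the two experts together are close to the VRAM of most consumer cards before latents and activations are counted, which would explain the RAM use.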
>>
>>106556877
>what's the catch local gen?
If you want good outputs you need to have the skill
>>
>>106557053
oh yeah true that. but having a lot of fun now
>>
File: ComfyUI_WAN2.2_00102.mp4 (3.52 MB, 648x808)
2B cant spell for shit
>>
>>106557053
>you need to have the skill
the skill that allows me to buy a 5090?
>>
>>106557075
>he thinks better card = better gens automatically
oof...
>>
>>106556894
unipc sux

how is your ui different
>>
>>106557084
faster prompting with a 5090 allows me to polish my dick while i read the thesaurus. checkmate
>>
>>106556894
I think clownsharkbatwing whateverthefuck the author goes by is kinda schizo. I stopped using his nodes months ago.
>>
would i need to add a lora to make boobs bigger than what is being genned? or add it to the prompt?
>>
how big of a dataset do you guys go for?
>>
File: 00089-3411299868.jpg (776 KB, 2048x2480)
For chroma, 30 steps is the safest conservative count for me for consistent coherency and composition, and 10 steps on the high res pass seems to be fine. Even with a style lora to keep everything stable, this model is still a bully on hardware, even a 5090
>>106557075
Honestly you needed foresight to get that thing, I knew recent events but if I wasn't so tuned in my instincts would have told me to wait. If you're a burger things are fucked

Posting comparisons for first and second pass now
>>
>>106557131
first pass
>>
File: file.png (10 KB, 1199x167)
>>106557088
>how is your ui different
>>
>>106557131
Just 10 steps? Are you using only a second ksampler or some other nodes?
>>
>>106557131
>>106557139
>>106557210
can you please stop seeking attention? you post the most boring slop in the general, blog post about shit nobody cares about and publicly pat yourself on the back for validation. you are so much worse than ani, comfy and even debo. just stfu already
>>
>>106557210
>Nodes
No just the webui high res fix
>>106557229
Meds?
>>
>>106557233
>Meds?
what, you can't find them? sad, you really need to take them
>>
>>106557131
that reminds me about the other day when an anon was saying how he was doing 100 steps for a chroma gen with another 30 on top for a hires fix and then he was complaining about the 10+ minute gen times. kek
>>
>>106557250
you are actually replying to that very retard lmao
>>
>>106556673
how do we cope
>>
File: 00117-3279622274.jpg (1.04 MB, 2048x2480)
>>106557248
4 u
>>106557250
I was that anon, the problem is without a strong style anchor you do need higher steps to reduce style swing. Also the lower the step the more volatile the composition
>>
>>106557269
your ugly hag found your meds. it's bed time faggot
>>
Oh the loser crew is seething because they have nothing going on in the containment zone.
>>
Video gen on a 5090:
- using fp16_accumulation -> 19min
- not using fp16_accumulation -> 19min
wtf
>>
File: ComfyUI_00065_.png (3.44 MB, 1336x1773)
>>
>>106557112
literally just add weights, start with (huge breasts:1.5) and try changing the number

>>106557269
this is upscaled? something is horribly wrong with your workflow
>>
>>106557322
He's getting desperate because him and his crew kept getting rejected during the time I was gone
>>
>>106557322
>(huge breasts:1.5)
thanks man will try! is there anywhere to find the tags that work best with gens?
>>
so does going over 5 secs shoot up gen time exponentially? it doesn't increase linearly? 5 secs was 4mins and then i went to 8 and its now 12minutes to gen
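For a ballpark: attention cost grows roughly with the square of the token count, and token count grows roughly with clip length, so gen time going up faster than linear is expected. It's closer to quadratic than truly exponential. A quick sketch using the numbers above; the pure power-law model is a simplification and ignores things like spilling into RAM:

```python
import math

def estimate_minutes(base_minutes: float, base_seconds: float,
                     new_seconds: float, exponent: float = 2.0) -> float:
    """Scale a known gen time to a new clip length with a power law."""
    return base_minutes * (new_seconds / base_seconds) ** exponent

def implied_exponent(base_minutes: float, base_seconds: float,
                     new_minutes: float, new_seconds: float) -> float:
    """Which power law would explain two observed (length, time) pairs."""
    return math.log(new_minutes / base_minutes) / math.log(new_seconds / base_seconds)

# 5 s took 4 min; a purely quadratic model predicts roughly 10.2 min for 8 s.
print(estimate_minutes(4, 5, 8))
# The observed 12 min corresponds to an exponent of roughly 2.3.
print(implied_exponent(4, 5, 12, 8))
```

The extra on top of quadratic could be offloading or anything else in the workflow; the sketch can't tell you which.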
>>
>>106557333
danbooru or any doujin hoster. test each tag individually or in isolation to ensure prompt adherence
>>
>>106557346
yes, u just answered your own question
>>
--use-sage-attention

thoughts?
>>
>>106557361
yes
>>
>>106557233
based, how is chroma on Neo?
>>
>>106557361
why is it a fucking fag flag?
>>
Are the NetaLumima derivatives good yet
>>
>>106557358
Just wanted to make sure I wasn't fucking up somewhere, today i went from 5 hour gen nightmare fuel to getting help from a kind anon and 5 minute gens.
>>
File: 1721367340755262.gif (792 KB, 294x233)
>>106557256
>>
>>106557371
would you like to reiterate your question so you don't sound like a third worlder?
>>
Man VAE really is cancer. One more decode in the WF and it deleted all the warmth the color had
>>
>>106557424
orig pre-upscale
>>
Hello, I hate Comfy
>>
>>106557452
Hello I hate comfy too, let's get married!
>>
>>106557452
Same. i cant install neo sadly
>>
>>106557413
why is it a flag in the flag append of the program that is used to launch comfyui with bat files in the windows?
>>
>>106557424
>Man VAE really is cancer.
yes, that's why I want lodestone to succeed
>>
>>106557413
why is it a homosexual tranny command flag instead of a node I can use at runtime?
>>
>>106557475
It needs some speed-ups first. Using it as a daily driver is madness. Might be decent as an upscaler tho.
>>
>>106557371
>why is it a fucking fag flag?
you can use kj's node to activate it instead >>106555496
>>
>>106557489
this breaks qwen you stupid nigger. pay attention
>>
>>106557467
why, let me help you
>>
>>106557505
it doesn't you retarded monkey, you have 4 options and that one doesn't give the black image, you're so fucking dumb, think before opening your trash mouth
>>
How do i make the dude dark-skinned?
>>
>>106557505
nope, works on my machine
>>
>>106557520
"dark skinned male"?
are you retarded?
>>
>>106557511
retard. it doesn't actually work. kijais sage attn node does NOTHING.
try it. turn the flag off then generate an image with kijai set to auto then to disabled. no difference.
now please uninstall your OS and drown your computer.
>>
>>106557531
>it doesn't actually work.
absolute skill issue, it works fine on my machine
>>
I FUCKING HATE COMFY DEVS SO MUCH IT'S UNREAL. THIS FUCKING SHIT IS MORE A DEBUGGING SIMULATOR THAN A FUCKING IMAGE GENERATOR. I'M SO FUCKING SICK OF IT
>>
>>106557539
haha yeah i know right?
so, did you catch that game last night?
>>
>>106557526
I tried that and then it consistently does this. a dude humping at the side.
https://files.catbox.moe/pe9axb.mp4
>>
>>106557269
it's a fake catjak or he started doing H
>>
>>106557545
your lora or prompts are fucked
>>
>>106557539
Haha, imagine working with that UI every day!.
The more time passes, the more convinced I am that Comfy is more for AI hobbyists than for people who work with AI.
>>
>>106557545
what lora? that means the lora isn't trained in black dudes
>>
File: file.png (18 KB, 1253x193)
>>106557563
>>106557569
this is the prompt and the lora is https://civitai.com/models/1923528/sex-fov-slider-wan-22
Ohhh so I'd have to find an actual dark-skinned one. rip. this is what i put in the prompt
>>
>>106557585
Write actual descriptive sentences, wan is about natural language, it doesn't understand the (xxx:1.5) syntax.
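If you have old tag prompts full of weights, a minimal sketch for stripping the (text:1.5) emphasis before rewriting them as sentences. The regex and function name are made up for illustration and only handle the simple non-nested form:

```python
import re

# Matches simple emphasis like "(dark skinned male:1.5)"; nested parentheses are not handled.
WEIGHTED = re.compile(r"\(([^():]+):\s*[0-9]*\.?[0-9]+\)")

def strip_weights(prompt: str) -> str:
    """Replace each (text:weight) group with just the text."""
    return WEIGHTED.sub(lambda m: m.group(1).strip(), prompt)

print(strip_weights("(dark skinned male:1.5), beach, (sunset:0.8)"))
```

This only removes the syntax. You still have to turn the leftover tags into actual descriptive sentences yourself.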
>>
>>106557585
what >>106557593 wrote, but also use synonyms, repetitions, etc in the prompt you want
>>
>>106557585
if you're doing video gen, weights don't work as other anon said, you have to describe it instead. i thought you were doing SDXL with my original response
>>
>>106557593
>>106557615
Oh okay I'll try that.
>>106557618
Ah I should have specified, Yeah I'm doing video gen on wan 2.2
>>
>>106557626
cucked by brown penis again frfr ong
>>
What too much denoise does to a mf
>>
>>106557074
are you post-processing these to add the noise and aliasing?
>>
>>106557585
you should give up. right now :D
ur brain too smol
>>
>>106557678
grrrr it is!! .___. but still have to try.
>>
>>106557539
You can always use Diffusers
>>
>>106557703
fuck python in general desu
>>
I HATE SNAKES
SNAKES IN MY WALL
SNAKES IN MY PIPES
>>
#comfy killed the hype
>>
File: 1740601582851.png (1.8 MB, 1080x792)
why hasnt anon posted this level of kinosoul with seedream
https://xcancel.com/fofrAI/status/1966142589289329015
>>
>>106557893
because this is the local thread. get out shill
>>
>>106556332
>I still have yet to see an image that is better than what we can achieve locally.
there you go >>106557893
>>
File: 1757488525753312.jpg (73 KB, 600x592)
For those using a 5090 on linux: do not upgrade from 575.57.08 (or below) to 575.64.03. At least in my case, the vram usage has gone up. I used to be able to fit wan2.2 fp8 entirely in vram, and now I also have to send parts to ram for it to work.
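A small sketch for checking where your installed driver sits relative to the two versions above. The helper names are made up, the version verdicts are just this one report, and anything newer than 575.64.03 is simply unknown here:

```python
import subprocess

KNOWN_OK_MAX = "575.57.08"   # per the post: this version or below behaved
REPORTED_BAD = "575.64.03"   # per the post: VRAM usage went up on this one

def parse_version(version: str) -> tuple:
    """Turn '575.57.08' into (575, 57, 8) so versions compare numerically."""
    return tuple(int(part) for part in version.strip().split("."))

def driver_status(installed: str) -> str:
    v = parse_version(installed)
    if v <= parse_version(KNOWN_OK_MAX):
        return "ok per the report"
    if v == parse_version(REPORTED_BAD):
        return "reported bad"
    return "unknown"

def installed_driver() -> str:
    """Ask nvidia-smi for the driver version (requires an NVIDIA driver install)."""
    result = subprocess.run(
        ["nvidia-smi", "--query-gpu=driver_version", "--format=csv,noheader"],
        capture_output=True, text=True, check=True,
    )
    return result.stdout.splitlines()[0].strip()
```

Usage would be `print(driver_status(installed_driver()))` on the machine in question.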
>>
File: file.jpg (2.99 MB, 4096x3072)
>>106557893
it looks like the Wall of Fayth from FFX, beautiful
https://www.youtube.com/watch?v=WBjbY1dwO_Q
>>
>>106557893
>>>/g/adt
>>
File: ComfyUI_WAN2.2_00104.mp4 (3.6 MB, 536x962)
>>106557074
>>106557506
I think it's because there's a lot of conflicts with Easy Comfy installation.
>>
>>106557936
Is that why I'm getting fedora shutting down my konsole webui session?
I thought it was because of the model but I never had that problem with flux and now I'm using 64GB of actual ram during generation after 8 or so hours
>>
>>106557969
No idea, it just got OOMs after OOMs for me.
>>
File: G0kkGctXAAAJio9.jpg (3.74 MB, 4096x3072)
>>106557893
>>106557938
Bigger version.
>>
>>106557982
this is actually fucking impressive, the anatomy is on point
>>
File: 1271698796.png (1.71 MB, 1024x1024)
>>106557893
This does look really good, but I think you can make similar stuff on local. Most people just aren't interested in that kind of art.
>>
>>106558000
>I think you can make similar stuff on local
prove it
>>
File: 1728775135441849.png (20 KB, 481x361)
Anyone use that? What parameters do you use?
>>
>>106557909
my point was why werent the cloud shills posting outputs at that level when they were deep in their shill campaign in this thread
>>
File: ComfyUI_WAN2.2_00105.mp4 (3.78 MB, 536x960)
>>106557965
>>106557673
Missed this. Yes, I have a sharpen and noise nodes.
>>
File: 00119-1712924411.jpg (907 KB, 2048x2480)
>>106558037
It's a bitter circle of anons that have been doing this for years, it could be anything they will try to lower the thread. Look at the threads they come from and you will understand why
>>
>>106558069
> dom female
> wearing leash

o-oh my
>>
>>106558009
Not gonna lie, I got no idea how to prompt it. You got any prompts?
>>
File: fucking.png (194 KB, 587x655)
Are there any nodes that just extract the raw prompt text and not add a bunch of json mess and shit? I can't link these to the prompt window. The SDprompt reader didn't work either.
>>
>>106558069
try it with 1000 steps. it should be so much better
>>
>>106558116
yes
https://github.com/BigStationW/ComfyUi-Load-Image-And-Display-Prompt-Metadata
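If you'd rather do it outside comfy, a rough sketch of pulling only the prompt text out of a gen. Assumptions, not taken from that repo: the PNG carries the API-format graph as JSON in a text chunk named "prompt", and the prompt strings sit in the "text" input of CLIPTextEncode nodes. Workflows that use other text nodes won't be picked up.

```python
import json

def extract_prompt_texts(prompt_json: str) -> list:
    """Return the literal prompt strings from a ComfyUI API-format graph."""
    graph = json.loads(prompt_json)
    texts = []
    for node in graph.values():
        if not isinstance(node, dict) or node.get("class_type") != "CLIPTextEncode":
            continue
        text = node.get("inputs", {}).get("text")
        # A list here is a link to another node's output, not a literal string.
        if isinstance(text, str):
            texts.append(text)
    return texts

def read_prompt_texts(png_path: str) -> list:
    """Read the assumed 'prompt' text chunk from a PNG and extract its prompts."""
    from PIL import Image  # imported lazily so the parser works without Pillow
    with Image.open(png_path) as img:
        raw = img.info.get("prompt")
    return extract_prompt_texts(raw) if raw else []
```

`read_prompt_texts("some_gen.png")` would then give back just the strings, positive and negative, in graph order.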
>>
>>106558120
You don't have a rig that can run chroma so you sit here and seethe lol
>>
>>106558153
flawless seethe logic anon. well done
>>
>>106558124
Fucking finally. I hate the raw metadata shit everything else has.
>>
is there any way to use flux kontext or qwen edit to inpaint a lewd feature into a photo?
>>
>>106558063
cool, it works well for these, especially the misato earlier
>>
>>106558196
can you even use loras on the edit models?
>>
>>106558218
yes
>>
File: Whisk_3883598166.jpg (471 KB, 896x1280)
To clarify, can I crudely photoshop a lewd feature over features and ask qwen or kontext to blend it seamlessly?
>>
>>106558253
>qwen or kontext
for sure no for kontext, they explicitly do everything in their power to fight anything nsfw
>>
File: Chroma_00062_.jpg (709 KB, 1248x1728)
>>106558253
>To clarify, can I crudely photoshop a lewd feature over features and ask qwen or kontext to blend it seamlessly?
You can use inpaint with any uncensored checkpoint to do that, sdxl finetunes work

>>106558268
>for sure no for kontext, they explicitly do everything in their power to fight anything nsfw
Doesn't it require lora trained with image pairs? massive pain in the ass
>>
>>106558301
I'm going to need a cat box for this. just, wow
>>
File: ComfyUI_06800_.png (1.69 MB, 1152x1152)
CPU got fried Chroma bros. I'm back.
https://files.catbox.moe/muhruw.png

Btw these are basically my first try with Chroma HD Flash. Truly is a blessed model.
>>
>>106558318
You couldn't make that image even if he gave it to you
>>
>>106558301
Thanks, haven't had much luck w/ sdxl inpainting.
>>
>>106558301
>Doesn't it require lora trained with image pairs? massive pain in the ass
It's not really clear yet. Training qwen loras in general is a pain, so there's still a lot of testing to do. Image pairs are the proper method, but it also looks like training the concept alone can sometimes be enough to teach it the necessary information. It's almost impossible to get A-B pairs for many contexts, so hopefully that will work out.
>>
Why do people recommend using the wan 2.1 lightx2v lora on wan 2.2? All it does is harm fidelity
>>
Civitai added Chroma tag

>>106558318
I'll upload the lora
>>
>>106557074
alright buddy
nice try
now have kaine jizz "hussy" over the pages of weiss...
>>
finally blessed
>>
File: ComfyUI_06801_.png (2.18 MB, 1152x1152)
>>
File: 1583441205198.jpg (72 KB, 1250x1246)
Holy fuck chroma base is so much better than HD for upscaling. How tf can a so called HD model suck at fine details so much?
>>
>>106558380
So you can post a reaction image and not a gen displaying that?
Suspicious
>>
>>106558319
In case you have a VR headset: 3DSVR-0438
>>
>>106558390
I am currently genning only diaper porn so I can't post it here.
>>
>>106557945
>implying pedowaifu slop is soulful
Chair and rope
>>
File: ComfyUI_06802_.png (1.96 MB, 1152x1152)
>>106558380
>How tf can a so called HD model suck at fine details so much?

Try Chroma HD Flash. It handles 2k just fine.
>>
What do I use to start captioning videos? Does local even have an option for that?
>>
>>106558427
Type lazy captions by hand.
>>
>>106558419
>obese vomit hag lover being toxic to randos
please just take your meds
>>
>>106558253
>that question
>that image
This nigga trynna add penises to the women, isn’t he.
>>
>>106558426
>flash
Distill doesn't have the same outputs as base. You pay for the speed somehow.
>>
>>106556603
Oh wow he did a release https://huggingface.co/lodestones/Chroma1-Radiance
>>
>>106558369
>kaine jizz "hussy" over the pages of weiss
tf you mean?
>>
>>106558431
This is an AI general bruh why would I do that?
>>
>>106558455
Garbage in garbage out.
>>
holy shit seedream is insane. china absolutely destroyed local
>>
File: Jenny Surf's Up!.webm (3.92 MB, 852x1280)
>>106558019
>PerpNegAdaptiveGuider
>CFG = 3.6
>cfg_start_pct = 0.25
It's unlikely to help, but that's what I settled on.
>>
>>106558446
Kainé right, is called a hussy by Weiss in the events prior to the webm.
Kainé loses control, grabs the stupid book throws his ass to the god damn ground, whips it out, strokes and releases all over his leather bound cover, his "face" so to speak and spells out the word hussy in jizz.
It's symbolic, it represents the trauma that Weiss feels and internalises with humour and the fine line of anger that kainé walks.

idk what emil does, that dude's a gay little skeleton, he probably plays wth his boner.
>>
>>106558433
You keep exposing yourself by thinking you are talking to the same person retard, by the way sei shoujo is not western you disabled retard.
I was going to tell you that in the other thread but all I asked for is for anons to make chroma loras.
>>
What image-to-video models can I run on 4060 with 8GB vram?
>>
File: ComfyUI_06804_.png (1.41 MB, 1152x1152)
>>106558394
>3DSVR-0438
Kino

>>106558437
In case of the Flash experiment? I don't know how, but it somehow is very close to convergence. It's like a completely fixed v48. Though the default one messes with prompt following (which I'm currently using). There's a way to fix that though, the delta weight mixed with the HD weight is pretty strong at 2k and still preserves prompt following of original.
>>
>>106558318
https://civitai.com/models/1948914/chroma-lora-tsukasa-jun-style
>>
does anyone know of a fag Discord group that works collectively on short films (2 minutes or longer)?
im really keen to do something together
>>
>>106558538
Ask reddit
>>
Can anyone help a guy with a shitty 2060 mobile make some videos? I already have comfyUI installed.
>>
>>106558553
You're priced out of this subsection unless you're packing 16 or more vram and a modern card
>>
>prompt for penis sniffing
>keep getting fellatio
have to get a lora for fuckin' everything man
>>
File: ComfyUI_06809_.png (1.89 MB, 1152x1152)
>>106558492
Basically what would potentially take a bunch of tries/seeds on regular Chroma versions you get on the first or second try with Flash.
>>
>>106558562
Shit. Thanks anon. What about normal image generation?
>>
>>106558569
struggle bus but you might be able to do light XL?
>>
>>106558492
I'm gonna give flash a try. Do you mind sharing your prompt?
>>
>>106558507
tyvm anonie!
>>
>>106558466
Thanks!
I'm not sure I'll use it, it completely fucks my outputs for some reason.
>>
>>106558568
yummy ol like teacher, nice
>>
>>106558507
TY man
>>
>>106558573
is there any json ready to use for that?
>>
>>106558492
please flash her panties.
>>
File: ComfyUI_06814_.png (1.8 MB, 1152x1152)
>>106558578
>Amateur photograph of a beautiful Japanese female idol woman sits on a stage chair, performing with an acoustic guitar. She is wearing a white off-the-shoulder top and a short, vibrant yellow miniskirt with her panties slightly visible. With a focused expression, she looks down at her guitar while a microphone stands ready in front of her, suggesting she is singing as well. The surrounding stage equipment indicates she is at a live outdoor concert or festival.

>>106558426
Same but without mention of panties
>>
>>106558553
Not a chance. Consider renting time from a cloud GPU provider, once you get ComfyUI set up you can basically one-click deploy and have a top of the line GPU for a whole day for like $10 or something. Might be even less.
>>
>>106558732
Can you suggest me a non-scam provider?
>>
>>106558744
if you are asking these types of questions you are in for a world of pain trying to figure this shit out. I've used RunPod a few times for training loras, it was fine. You still need to install comfy, download the models/loras (through jupyter), then you can get to fucking around with comfy. good luck lol
>>
>>106558744
runpod, vast.ai, tensordock off the top of my head. just remember that you'll be running on other people's hardware, so don't be uploading/genning stuff that would get you in any trouble
>>
>>106558773
Thanks for the heads up, I guess I'll save up some to buy a nice GPU in the future.
>>
>>106558477
ahh ok
>>
local lost
https://blog.comfy.org/p/seedream-40-now-available-in-comfyui
>>
>>106558842
hey buddy, we also got qwen-edit controlnet masks today, we still in it
>>
>>106558855
>qwen-edit controlnet masks
*yawn*
>>
>chroma: 512x512
>seedream: 4096x4096
sigh…
>>
>Qwen-edit
>controlnet
do people really?
>>
File: ComfyUI_06824_.png (1.45 MB, 1152x1152)
>>106558694
>>
>nano
>banana
hehe my penis is bigger
>>
Norway will be the salvation
>>
Chrome?
Pooma more like
>>
File: 1751732611264707.mp4 (697 KB, 1280x720)
Babe wake up, a new video model got released
https://huggingface.co/bytedance-research/HuMo
https://phantom-video.github.io/HuMo/
>>
>>106558842
SAAS shills really working overtime lately, /ldg/ threads are like anudda shoah
>>
>>106556670
you might want to find some images of fully black skinned characters and give them to Qwen or gemini and see how they caption the image

prompting for "dark skin" or "black skin" often results in the same issues as prompting for "young" does where it can mean a lot of different things in different contexts so the model doesn't really know what to actually do with the token
>>
>>106558930
No it's just a small group praying for us all to fall. We have so many eyes on this thread when we shouldn't
>>
>>106558427
local has an option technically because you can use Qwen, but you really should use Gemini Pro to make captions for videos/images, that's what the chinese do lol
>>
>>106558920
>HuMo-17B
>VideoGen from Text-Image - Customize character appearance, clothing, makeup, props, and scenes using text prompts combined with reference images.
>VideoGen from Text-Audio - Generate audio-synchronized videos solely from text and audio inputs, removing the need for image references and enabling greater creative freedom.
>VideoGen from Text-Image-Audio - Achieve the higher level of customization and control by combining text, image, and audio guidance.
>The model is trained on 97-frame videos at 25 FPS. Generating video longer than 97 frames may degrade the performance. We will provide a new checkpoint for longer generation.
Hum.
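For scale, the quoted training length works out to just under 4 seconds of video. A trivial sketch; the 81 frames at 16 fps figure for Wan is an assumption from common defaults, not from the quote:

```python
def clip_seconds(frames: int, fps: float) -> float:
    """Duration of a clip given its frame count and playback rate."""
    return frames / fps

humo = clip_seconds(97, 25)   # numbers from the quoted model card
wan = clip_seconds(81, 16)    # assumption: typical Wan default settings

print(f"HuMo training clips: {humo:.2f} s, typical Wan clip: {wan:.2f} s")
```

So more frames per second, but a shorter clip per gen if you stay inside what it was trained on.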
>>
>>106558976
>No it's just a small group praying for us all to fall. We have so many eyes on this thread when we shouldn't
anyone who supports uncensored video/image diffusion supports a pedo bar. no normalfag on the planet thinks you should be able to ai generate literally anything. you have to be a radical libertarian/cypherpunk to think that's ok
>>
>>106559005
That's a odd post to say when those types moved to the anime thread. We don't want them either and have always wanted them out. They can stay there and act like that
>>
>>106558920
>still locked to 5 second

i'll pass
>>
>>106559004
this is a bytedance release so there's no way its going to be good. it's nice to see that we're almost in the post-character LoRA era where I just need one yearbook photo to blackmail the fathers of schoolchildren systemically

>>106559009
>We don't want them either
you do not make the rules on what types of local diffusion are allowed here
>>
>>106559025
Of course and they eventually left because they were mocked and reported for 3 years
>>
>>106559020
we don't actually know if its "locked" the same way wan is or not until we can test it with more than 97 frames.
also don't discount the framerate increase. there's no need to interpolate anymore. if a similar self-forcing and lora ecosystem shows up, or if its better for porn/lewds or if it somehow inferences faster then it will have its place.

>>106559029
well, the anime pedophiles did maybe. i still think that thread is just one schizo samefagging though
>>
>>106558699
Thanks a ton anon.
>>
File: NO.jpg (29 KB, 941x404)
if it weren't for drooling tards like this, wan 2.1 nunchaku would of been here by now
>>
>>106559005
Pure thought police nonsense argument

Ai generated images / video of 'uncensored' nature not shared with anyone can be nothing more than 'thought crimes'
>>
>>106559077
>wan 2.1 nunchaku
wan 2.2 exists you know? kek
>>
>>106558920
This is a wan 2.2 tune you poopoo head.
>>
>>106559093
>poopoo head
you must be 18 to post here
>>
>>106559077
>would of

>>106559082
>Ai generated images / video of 'uncensored' nature not shared with anyone can be nothing more than 'thought crimes'
agreed, but discussion about them fundamentally doesn't matter because they're not "real". never understood why people ever discussed the legality/unlegality of simple possession, possession of literally anything on the planet is legal if you're not retarded. the only time it becomes relevant is when its "shared" anyways e.g. when the police find it or you "shared" knowledge of the existence of your possession of the item in the process of obtaining it

>>106559093
i'd refer to Pony as a "new model" even though its a tune of SDXL
>>
>>106559093
>this 17b model (HuMo) is a finetune of a 14b model (Wan)
you're so fucking retarded
>>
>>106559055
https://github.com/Phantom-video/HuMo
>huggingface-cli download Wan-AI/Wan2.1-T2V-1.3B

It's based off wan, however they did say..

>The model is trained on 97-frame videos at 25 FPS. Generating video longer than 97 frames may degrade the performance. We will provide a new checkpoint for longer generation.

Many said this before and we still dont have proper long gen apart from tricks, wouldnt hold your breath
>>
File: file.png (51 KB, 2867x151)
Does this error mean anything?
the Gradio thing. I'm using wan2gp 2.2 I2V, the gen is working but i keep seeing that error on each new video. do i just ignore it?
>>
>>106559117
You are the absolute retard moron nigger down syndrome mongoloid that cannot even read the model's project before sucking his own nigger dick in forchink.
>>
>>106558920
>wan2.1
LOL
>>
>>106559130
>debo is so mad
nice
>>
>>106559119
we can do more tricks with more framerate though. no need for a 4fps lora, now we can do 6fps which is 50% more information. also the fact that they fixed framerate with a tune at all sounds very very impressive to me but i dont know the technicals that well

>>106559130
damn dude at least i dont write posts like this yet
>>
>>106559130
What kind of mental illness is this?
>>
File: Wetting.png (298 KB, 744x480)
>>106556603
What image-to-text tools are people using these days to generate prompts from images? Having a hard time finding one that ... actually works.
>>
>>106559093
shut up doodoo head
>>
>>106559130
gonna run a t2v with this prompt one sec
>>
File: lmfao.jpg (67 KB, 1169x823)
>>106559085
Just because 2.2 exists doesnt mean the new shiny object should get priority. 2.1 is still great. See another picrel, this is the 4th time a new model got in the way of wan so you'll be lucky to see any 2.2 advancements, nevermind 2.1
>>
>>106559150
Gemini, joycaption
>>
ooo-eee-ooo
>>
>>106559160
Honestly it's just because these people are so fucking scatter brained.
We should have gotten wan with lora support, and same with qwen.
>>
what did he mean by this
>>
>>106559150
i would use gemini or qwen for sfw, joycaption for nsfw, and modifiying outputs from the aformentioned ones or just writing it yourself if you're doing something illegal
>>
>>106559203
Grok 4 can caption nsfw fine, no need for jailbreaking etc...
>>
>>106559234
Does it have a local version?
>>
>>
>>106559239
No but just like Gemini it can be used via API
>>
>>106559248
Ok, but I'm not sending my freak porn pics to Elon
>>
File: kekoroonie.jpg (39 KB, 658x657)
>>106559173
Save this image when they decide to do all of these new models for reference, kek
>>
>>106559262
how do u englihrs?
anyway I fucking love I can gen in 8s THANKS CHINKAMAN, I wonder on the 5000 series how faster it is compared to 4000
>>
>>106559253
thats the trade offer
you get captions, he gets your freak porn pics
nothing in this world is free. you literally don't matter btw so unless this is literally child rape (not even real people you know, or real kids, i mean full on nudity or rape) stop making your life more difficult
>>
File: ComfyUI_00195_.mp4 (516 KB, 640x640)
>>106558492
she is cute, her song 01001100 01101111 01110110 01100101 00100000 01000010 01100001 01110100 00100000 01010011 01101111 01110101 01110000 is a banger.
>>
>>106559262
sad it's not out yet
>>
>>106558485
Check previous thread for anon's wan2gp workflow catbox
>>
File: 1611764255023.png (1.92 MB, 2847x1412)
>>106558920
>open source
>from bytedance
>>
>>106559378
I think Alibaba is pushing the envelope on these guys. Hunyuan even released a non-distilled version of their model because of them, when they always release distilled ones.
>>
File: 00187-3838723413.jpg (1.88 MB, 2480x3072)
The high-res upscale is killing me on this model: 10 steps take 5 minutes on my 5090
>>
File: ComfyUI_00200_.mp4 (507 KB, 640x640)
>>106559298
Her headliner
>>
i had 0 expectations but am still disappointed
>>
File: Tongue.png (217 KB, 350x332)
>>106559442
I recognize that face.
>>
Can the guy with the iryna zarutska lora share it please?
>>
File: blodeuwedd-1.jpg (1.1 MB, 1345x2268)
Is Chroma1-Flash similar to the old "turbo" SDXL models?
>>
>>106559467
how did he even train a lora for that? don't you need at least 6 images as an absolute minimum?
>>
>>106559439
>heun
>40 steps
>10cfg
not sure if shitposting
>>
>>106559483
i believe it because 40 steps used to be a popular flux number
>>
File: 00608-2750414837.jpg (747 KB, 2048x2688)
>>106559483
You don't explore new models?
>>
>>106559480
1 image is the minimum
>>
>>106559500
Heun is a lowstep sampler tho. It's actually one of the slowest overall. And how 10cfg doesn't fry his pics is beyond me.
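For context on the speed point: Heun's method is a second-order solver, so each step evaluates the model roughly twice, where Euler evaluates it once. A minimal sketch with a toy ODE standing in for the denoiser (the names `CountingModel` and `integrate` are made up for illustration, and this is not any UI's actual sampler code):

```python
import math

def euler_step(f, t, y, h):
    """First-order step: one call to f per step."""
    return y + h * f(t, y)

def heun_step(f, t, y, h):
    """Second-order step: Euler prediction, then correction with the averaged slope."""
    k1 = f(t, y)                 # slope at the start of the step
    k2 = f(t + h, y + h * k1)    # slope at the Euler-predicted end point
    return y + 0.5 * h * (k1 + k2)

class CountingModel:
    """Toy stand-in for the denoiser (assumption): dy/dt = -y, counting every call."""
    def __init__(self):
        self.calls = 0

    def __call__(self, t, y):
        self.calls += 1
        return -y

def integrate(step, f, y0=1.0, t0=0.0, t1=1.0, n_steps=40):
    """Run n_steps fixed-size steps of the given stepper from t0 to t1."""
    h = (t1 - t0) / n_steps
    t, y = t0, y0
    for _ in range(n_steps):
        y = step(f, t, y, h)
        t += h
    return y

if __name__ == "__main__":
    exact = math.exp(-1.0)
    for name, step in (("euler", euler_step), ("heun", heun_step)):
        model = CountingModel()
        y = integrate(step, model)
        print(name, "model calls:", model.calls, "abs error:", abs(y - exact))
```

In this toy setup 40 Heun steps cost 80 model calls against 40 for Euler, which is why Heun is normally paired with lower step counts.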
>>
>>106559378
bytedance only open sources slop so the localpiggies have something to eat. Their actual good video model is seedance
>>
>>106559508
but won't 1 image just create that one image whenever you use it

>>106559511
>Heun is a lowstep sampler tho. It's actually one of the slowest overall.
oh right samplers can have different speeds i completely forgot about that because i have been using euler for the last 3 years and have literally never found a need or desire to switch from it, even on video

nevermind i now think it was a shitpost
>>
>>106559511
I use dynamic thresholding, why wouldn't you?
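Rough idea of why thresholding helps at high CFG, as a sketch only. This is the percentile clamp as described in the Imagen paper, not the code of whatever UI extension is in use (those expose their own settings and may work differently). The function name and the flat-list input are assumptions for illustration:

```python
def dynamic_threshold(x0, percentile=0.995):
    """Clamp a predicted sample to [-s, s] and rescale by s.

    s is the given percentile of the absolute values, floored at 1.0,
    so predictions that are already in range pass through unchanged.
    """
    mags = sorted(abs(v) for v in x0)
    idx = min(len(mags) - 1, int(percentile * len(mags)))
    s = max(mags[idx], 1.0)
    return [max(-s, min(s, v)) / s for v in x0]

if __name__ == "__main__":
    # High guidance tends to push values far outside [-1, 1]; the clamp pulls them back.
    print(dynamic_threshold([0.5, -0.2, 3.0, -4.0], percentile=0.5))
```

The output always lands in [-1, 1], which is the part that keeps high guidance from saturating the image.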
>>
>>106559480
Just a quick image search shows at least 8 different images of her in various poses from her social media postings, you could probably find at least 20+ if you really wanted to
>>
>>106559519
A huge corp like that does not have much to gain from closed source models. In fact they could profit much more from open sourcing them, like Alibaba (providing infrastructure for inference). A shame.
>>
>>106559470
It's made for low steps and overbakes easily, so pretty much.
>>
>>106559418
>I think Alibaba is pushing the envelope on these guys
Alibaba is in the unique position where they are a national champion with ZERO skin in the social media game, unlike Tencent/ByteDance, so they don't care about conflicts of interest releasing a local image/video model. Since they're the Chinese Amazon with AliCloud they also probably have a fundamental interest in releasing the best models so chinese people/researchers try them out on their cloud
>>
Qwen has the potential to beat bytedance but they need to sort out their datasets. hunyuan is complete slop thoughever
>>
File: ComfyUI_00635_.png (782 KB, 1024x1024)
>>106559467
Iryna Zarutska qwen LoRA https://gofile.io/d/6IZUNy
>>
>>106559549
I forgot how easy it is to find images of anyone on the internet because I deliberately checked out from social media as soon as I graduated highschool with no regrets


I'm excited for 15 years in the future when the AI automatically creates loras for all faces of the cute girls I pass by on my morning commute from the cameras in my smart glasses while I sleep and injects them into my dreams

oh wait my brain already does that for me
>>
>>106559566
>they also probably have a fundamental interest in releasing the best models so chinese people/researchers try them out on their cloud
Then why don't Amazon and Google do the same?

Your argument makes no sense.
>>
File: shieldmaiden.jpg (1.06 MB, 1800x2320)
>>106559563
Seems kinda slopped but I'll keep experimenting with the settings. Really need to be able to run chroma faster to make it useful.
>>
>>106559592
this would be a nice moment to share the training data as well since it's a relatively simple lora and would be educationally useful
>>
>>106559611
Lovely
>>
File: dyn.png (34 KB, 729x366)
>>106559537
ok what does any of this mean?
>>
>>106559619
I don't use comfy UI
>>
>>106559203
>gemini or qwen for sfw, joycaption for nsfw,
I wouldn't really know what to do to be illegal. Does that make it more likely to happen?

Anyway. Does Joycaption intentionally go out of its way to skew things to NSFW? Because overall I like using this with more range and options. But if I feed it pictures of wizards I don't want it constantly telling me they're wearing dicks for hats.
>>
Stop being fucking poor holy shit
>>
>>106559603
Amazon completely missed AI somehow while focusing on Alexa so they just sell pickaxes, and Google DOES do that shit. Show me one of their open source models as good as their cloud models. And Google plugs their Colab and Compute Cloud every time they mention using and running inference on Gemma

>>106559624
>Does Joycaption intentionally go out of its way to skew things to NSFW?
Just try it out. You can kind of direct how you want your images captioned by prompting it
https://huggingface.co/spaces/FiditeNemini/joy-caption-beta-one
>>
File: ComfyUI_06829_.png (2.2 MB, 1152x1152)
>>
>>106559449
Is that John Carmack
>>
Anything new in the i2v world besides what's in this guide?
https://rentry.org/wan22ldgguide
>>
Is there a setting for network/neuron dropout in OneTrainer?
>>
I see Radiance got its own repo. Did he give up on that?
>>
>>106559640
>Amazon completely missed AI somehow
They're raking in money from their AI cloud, if your argument held any water they'd be releasing SOTA models to use
>and Google DOES do that shit
They do practically nothing compared to Alibaba etc, practically all their AI stuff is proprietary, western big tech in general are keeping their AI stuff proprietary

Even the White House has called this out and said they need to release open models else the US will lose influence

You keep bending yourself into a pretzel to defend western companies' proprietary AI strategy, it's pathetic

The western big tech SHOULD be the ones providing open free models, instead you need to turn to China for that
>>
File: AnimateDiff_00282.mp4 (974 KB, 1280x720)
>>
please I need someone to generate a compilation video of different Star Wars characters that are turned indian
so SAAR WARS
pew pew thanks
>>
>>106559720
>They're raking in money from their AI cloud,
yeah that's what i said

>They do practically nothing compared to Alibaba etc, practically all their AI stuff is proprietary, western big tech in general are keeping their AI stuff proprietary
china is more locked down in general at the cultural level. since neither of us live in china or work in the chinese tech space, neither of us are authorities to claim one way or the other

the rest of your points i agree with and are just limitations of capitalism. when the only metric that matters is profitability these are the examples of priorities you get. nothing you can do other than nationalizing the company and running into state capitalism issues. you don't get to have your free market cake and eat the state capitalist one too (but actually WE do get the second cake, because china is using AI as part of an asymmetric [dis]information campaign against the West)
>>
>>106558069
>>106559604
>>106559501
>>106559439
I can do this in SDXL in 40 seconds, wtf is this?
also
>>>/g/adt/
>>
>>106559720
nta, western companies are fairly hell-bent on making money by any means necessary. I can't see the proposal to release sota models for free going over well in an investor meeting. China also has the added backing/blessing from the government to release these to undermine american tech industry influence. I assume if the AI bubble ever does pop in america, china will probably close up too. Not defending american corpos (fuck them), just giving perspective.
>>
File: ComfyUI_00021_.png (2.35 MB, 1152x1536)
>>106558319
ty for pointing that out. All the schizos negged me into not trying it somehow

>>106559467
Iryna Chroma1-HD https://files.catbox.moe/fsvmpl.zip

>>106559608
>a nice moment to share the training data as well
You're welcome for the weights anon
>>
>>106559771
>asymmetric [dis]information campaign
? Explain the disinformation here
>>
File: ComfyUI_00215_.mp4 (739 KB, 720x720)
old guy just chillen
>>
China is not releasing open models as part of some "asymmetric warfare campaign". This is a meme. They are doing it, mostly, because no one would pay attention to their models otherwise, even in China. In this regard they are behaving a bit more wisely than American companies, in that there is no reason to be secretive about the weights if there's no money to be made off of said weights. Deepseek is an exception. They really are believers in open source. Whether they are allowed to remain as such will be interesting to see.
>>
>>106559723
is that cringe-acle
>>
>>106559262
Hey hey let's go! Kenka suru
Taisetsu na mono wo protect my balls. Let's qwenimage love!
>>
>>106559821
>believers in open source
If they were believers the models would be uncensored
>>
>>106559608
you can easily find that anywhere if you need examples
>>
>>106559821
Are chatgpt and stuff even allowed in china? If not I don't see why they would even care about being noticed, they rule the market there. Not like releasing free models for the rest of the world is helping them earn money lol
>>
Fresh when ready

>>106559851
>>106559851
>>106559851
>>106559851
>>
>>106559775
You're not a bright one I get it
>>
>>106559784
>to release these to undermine american tech industry influence
This I fully agree with, it's not altruism that fuels the China open model releases, but it doesn't matter, what matters is that the western big tech want to lock down AI as a proprietary service, for monetary and control reasons, and China is the one that is propelling open models forward at an impressive pace.

We'll have to see if this can drag western big tech kicking and screaming into releasing SOTA open models (well, not 'OpenAI', they never will), the best outcome would be an open model prestige race between them and China, here's hoping. Sadly it's more likely western big tech will lean on lawmakers to have Chinese open models banned.
>>
>>106559854
I know claude is banned (by Anthropic, not China). Not sure about chatgpt. A lot of Chinese use claude anyway, getting around the block in various ways. Chinese companies would like their models to be used by the rest of the world, just like they want any product they make to be broadly popular everywhere if they can help it. They can't really make any money yet so they just release it for free instead as they build their capabilities.
>>
File: ComfyUI_06830_.png (1.9 MB, 1152x1152)
>>106559786
Np anon. Remember there will always be anti-Chroma schizos here and there, they do not see Chroma for what it is. While not perfect, it's a model with a lot of potential. Further finetuning could fix remaining imperfections like faces in background in pic rel.
>>
>>106558492
>the delta weight mixed with the HD weight is pretty strong at 2k and still preserves prompt following of original.


how do you mix those models? or do you just add a second sampler?
>>
>>106559822
yes
>>
>>106559604
Yeah, it plastics up the image easily. Some anons have said it's better for anime stuff, but I haven't tried that.
>>
Is this the technical/training thread?
>>
>>106560156
this is now the previous one but yes >>106559855
>>
>>106558886
thx


