Discussion of Free and Open Source Text-to-Image/Video ModelsPrev: >>107273460https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/sd-scriptshttps://github.com/tdrussell/diffusion-pipe>WanXhttps://rentry.org/wan22ldgguidehttps://comfyanonymous.github.io/ComfyUI_examples/wan22/>NetaYumehttps://civitai.com/models/1790792?modelVersionId=2298660https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQdhttps://gumgum10.github.io/gumgum.github.io/https://huggingface.co/neta-art/Neta-Lumina>Chromahttps://huggingface.co/lodestones/Chroma1-BaseTraining: https://rentry.org/mvu52t46>Illustrious1girl and Beyond: https://rentry.org/comfyui_guide_1girlTag Explorer: https://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/b/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debo
I claim this thread's virginity
I call sloppy seconds!
Anyone have any experience making wan LoRAs? I was going to use ai toolkit for it then rent a gpu. But I wanted to train wan on other videos not just images.
>>107279438my experience training wan on video is very limited. i do think the first things to try are musubi-tuner and ai-toolkit tho.
>>107278964Anyone get this working? i pulled latest comfy, downloading the models now. but don't have a workflow.
>>107237888>>107237888>>107237888fill old threads first you dumb retard
>>107279471im downloading the i2v model, will post results once I hack the old hyunan i2v to work with this
>>107279478stop bumping your trollkbake retard
>>107278964anyone got a basic workflow? i don't even have the comfy templates installed rn for pythonic reasons
>>107279478everyone will bump my bakes ani. I am just more popular than you
>>107278964>muh prompt enhancers!
>>107279478Those are clearly trollbakes. Ani has never, NOT ONCE, asked to be included in the OP. Only schizos are pushing for it. Please respect Ani's wishes. He himself has said he doesn't want any drama and to just not put AniStudio in the OP.
>>107279505that's right! ani is a talentless hack compared to me. he would always bake right after me because he is a worthless tranny
abort thread, leave the trolls here and we can get back to it because they will necrobump the old threads like they have for days
does the new hunyuan video have lowstep inference?
I think it's working. preview isn't looking too hot though
>>107279521theres a distill and lightx2v, but im downloading whatever is on comfy's hf repo for now
I wonder if taking estrogen as a man makes you schizo or if it just exacerbates the schizo behavior even further. We need digital ID to delete trannies from /g/ and /ldg/
>>107279519no they should stay in my perfectly fine bake
>>107279530what system you rocking and hows the speed
I almost have a wf
>>107279530>slopstyle comfy example picc'mon man, have some dignity
>>107279530wrong clip type, for starters
>>107279541he should use one of my perfect gens
what the actual fuck is wrong with you niggers?
>>107279558why do you keep making and bumping trollbakes faggot?
>>107279567I DON'T GIVE A FLYING FUCK ABOUT EXCUSES. FINISH YOUR FUCKING THREADS YOU SUBHUMAN RETARDS
>>107279567Some troll is trying to get /ldg/ kill
my hunyuan wf. where did I fuck up?https://files.catbox.moe/x6156l.webp
>>107279571I don't give a fuck about your melty
>>107279572brb, baking
>>107279572Although, HunyuanVid 1.5, some unknown Chinese model, Flux.2, a possible decent music gen model right around the horizon... It makes sense. Actual paid API shills are afraid so they cause discord at the perfect time.
>>107279580this garbage is why we don't use comfy gens to make vids
>>107279580How fast is this thing as in IT/s compared to the Wan models?
>>107279580>shitty qwen gen can't make fennec ears>hyvid makes absolute slopwe are regressing
>>107279542yeah noticed that just before you posted>>107279573dunno but mine above seems to work. probably need to add tiled decode though
>>107279580really disappointed so I'm going to bed. probably better at realistic
Does it use the same VL as Qwen?
>>107279643the VL is Qwen
so let me get this straight, niggerjak has been making bakes with and without anistudio and pretending to be ani to make everyone hate ani and trash the board in the aftermath of the melty?
https://github.com/Comfy-Org/workflow_templates/pull/288/official templates
>>107279667yes, he showed his hand last thread. I hate that the baker is a tranny drama faggot but I don't herd cats. if the thread gets banned because of it we can just go back to /sdg/
>>107279667>>107279679FUCK OFF
>>107279683I don't give a fuck about your melty
>>107279668well that just saved me from generating a whole lot of shit overnight>>107279689nigger
>>107279667>>107279668>>107279679>>107279679>>107279683>>107279689this is all just the samefag schizofaggotry again
>>107279406best model for generating 2d pixel art characters?
>>107237888>>107237888>>107237888this schizo drama won't end so I'm directing anons to the oldest thread. at least get these old threads to bump limit so we can stop hearing from complaining fags>>107237888>>107237888>>107237888
>>107279760nah, keep bumping it by yourself
>>107279667yep. ani himself said all the shilling is by the schizo and he even asked not to put anistudio in the op. ignore all the schizoposting and keep using this thread
>>107279668Uh, not looking good. 20 steps. I think the easycache snake oil fucked it. Says it skipped 11 of the 20 steps wtf
>>107279806yeah dont use that garbage, wait for the distills/lighting loras instead.CFG distill should give us an easy 2x
>>107279806Always test without caches and torch. That shit just doesn't work.
>>107279806Yeah forget Comfy I'd stick to the official HF implementation which looks amazing from the preview/demo before drawing judgements about quality of the model. Also curious if it's uncensored from the get go.
>using a workflow from the comfyui wiki guy >the comfyui wiki guy who barely understands comfyui >
https://github.com/comfyanonymous/ComfyUI/commit/943b3b615d40542ea19bc8ff8ad2950c0a094605ok I have a question, why did comfy implement that model but not the russian model?
>>107279858do they seriously want us to believe that their 8b model is better than a 28b MoE model?
>>107279872>>107279858Kandinsky is based but I don't think anyone can run it. Hunyuan is made for weak GPU inference so it should benefit more of us, especially since inferencing Wan is quite prohibitive (and the speed LoRAs are trash to be frank).
>>107279891>+12% win rate?
>>107279858>why did comfy implement that model but not the russian model?I guess he was too focused working on Tencent to implement HunyuanVideo 1.5, its time will come
>>107279910>>107279920
>>107279926don't mind me I'm fucking retarded kek
>>107279858https://huggingface.co/lightx2v/Hy1.5-Quantized-Models/tree/mainthere's already a lightning version btw
>>107279858sr seems to be for "super resolution" but like is it for t2v or i2v? or both?
>>107279940>fp8 distilledi want fp16, hunyan is providing the fp32
>>107279957>i want fp16this, the model is "only" 16 gb big at fp 16, so it can fit nicely without having to use cope quants
>>107279971comfy just uploaded the distilled:https://huggingface.co/Comfy-Org/HunyuanVideo_1.5_repackaged/tree/main/split_files/diffusion_models
>>107279994the distilled (made by tencent and is only about cfg 1 and not cfg 1 + 4 steps) models aren't the lightvx models
>finally get stomach bulges to work good>wake up to one of the loras being updatedI don't know if I should be angry or happy.
>>107279998I dont think lightx2v has posted the LIGHTNING 4step version yet tho, they did the same for wan, they have copies of the basemodel with lightx2v written in the name but theyre just normal models
>>107279858https://huggingface.co/tencent/HunyuanVideo-1.5>Additionally, the innovative SSTA (Selective and Sliding Tile Attention) mechanism prunes redundant spatiotemporal kv blocks, significantly reduces computational overhead for long video sequences but they never showed any examples of long videos though?
I've made it into top 100 for reporting TOS stuff on civitai purely from reporting jeets.
>>107280026>he does it for free
>latent upscale hunyaninterdasting
>>107280036Giving jeets a bad time, for free? That's a blessing.
>>107279858What I like about Tencent is that they always use a big ass vae (2.5gb) and it's a good thing since it helps getting details and the humans still look good at far away distances
>>107279792when did ani say that catjak? was it last thread when you schizo false flagged using his old gen?
>>107280013Wow, very nice. Do you like Aria?
>>107280103Ani is a good person so surely he doesn't endorse the shilling and trollbakes. Right?
not bad lmao, ill try the distill now
>>107279858>the russian model?speaking about that, they also released an image model, has anyone tried it yet?https://huggingface.co/kandinskylab/Kandinsky-5.0-T2I-Lite-sft-Diffusers
>>107280131I2V 480p? how many steps you went for? that looks good yeah
>>107280143yup, 20 steps. using the official workflow basically, I added fp16acc and sage attn tho.1st gen was... 380s~ on 16gb vram (partially offloaded to ram) which is good with CFG, with the CFG distill I should be doing around 200s~, which beat my WAN genning time (without 4 step lora, just cfg distill). changed the prompt a little too, ill post here the distill results
>read the desc on a new wan 2.2 lora>he suggests almost 200 stepsWhat are these people smoking?
>>107280118>Do you like Aria?What's that?
>>107280025>SSTA (Selective and Sliding Tile Attention)that shit is really efficient on the vram, when I loaded the model I was at 17.5gb (fp 16) and then when I'm running it at 720x720x121g it only goes up to 20.1gb, really nice
>>107280150tried tiled decode, it produces BAD artifcats, completely unusable.experimenting with camera controls
>>107279668thanks anon
>>107280218that's because you have to increase the temportal size to be higher than your total number of frames
https://github.com/princepainter/Comfyui-PainterFLF2VIs someone able to clone this? Im prompted to sign into github, and when I do, the repository isn't found.
>>107280244>wan 2.2it's obsolete my boi >>107280131
>>107280243well I can decode without problems without tiling, I'll keep this in mind if I get OOM during vae decode
>>107280136wait it also does image editing, maybe it's better than Qwen Image Edit?
bros... WE ACHIEVED AGI
>>107280165A manga/anime about Venice in space, with cute girls.
>>107280131>>107280298I like the motion it doesn't wobble like shit like on other models, and it's only 480p it looks promissing
>>107280317camera controls are sadly... a step down from wan, but I only tried them on the distill, maybe it behaves better on the non distill.btw on the cfg1 distill (fp16) I gen in 180s~once we get 4-8step loras, this is going to go SO fucking fast.
BROSI JUST REALIZED WHAT THE GREY DOT ON THE UPPER LEFT SIDE OF A NODE DOES
>>107280337congrats! welcome to the organizers club
>>107279858>Hatsune Miku skateboarding in New York, anime styleit doesn't know migu are you fucking kidding me???
>>107280364>same 5second window context failureIt's a wash.
>>107280307she opens her eyes and makes a kiss gesture with her hands and mouth to the viewer>>107280364uh oh
>>107280374that one is worse, the hands movements are ass, did you change something?
>>107280362Oh I've been organizing a lot, but through right click.
>>107280364>it doesn't know migubruh
>>107280384yeah it sucks, picrel is same seed/prompt but non-distilled.it's night and day.this took 250s, I think prompt needs improvement, but I cant be arsed to hook up a prompt improver to it
>>107280409it's pretty solid desu, let's hope the lightning lora won't destroy that
>>107279069fuck you mean advertise sonic- oh shit?!
>>107280536I mean, they advertised with avatar last year so...https://www.reddit.com/r/StableDiffusion/comments/1h5nbft/experience_avatar_again_with_the_new_hunyuanvideo/
>>107280487
>>107280569me on the right
>>107280364finally got migu but it's inconsistent
>>107280242its jannies' work to remove trani threadsone schizo is trying to shill his garbage ui, its not our fault
>>107280605damn t2v looks so bad
so I was looking into hookin up my qwen3vl model to this, and this is the sysprompt they use:https://github.com/Tencent-Hunyuan/HunyuanVideo-1.5/blob/main/hyvideo/utils/rewrite/i2v_prompt.pychang bros...
>>107280642maybe 20 steps isn't enough Idk but I don't want to wait for more, it's already long enough, I'll wait for the lightning loras to see if it's viable or not
>>107280605https://xcancel.com/TencentHunyuan/status/1991777221799997626#mit seems like it can do a lot of character so I don't know why it would struggle so much for migu (warning, the sound is very loud lool)
>>107280645>https://github.com/Tencent-Hunyuan/HunyuanVideo-1.5/blob/main/hyvideo/utils/rewrite/i2v_prompt.pysorry but I don't speak ching chong, what seems to be the issue?
the hunyuan model handles breast physics out of the box
>>107280012if what you said is true then it's weird that tencent says that it has already a lightning support
>>107280013>>107280136These look great
Nano Banana pro is fucking amazing wtf, local can barely do 2 characters and it looks plastic
>>107280949>1 white dude>1 (((italian)))>1 jew>2 jeets>1 chingthat's what I call diversity!
>>107280605>Long take, fixed camera, the video begins with a close-up of a man, then Hatsune Miku appears on stage to slap the man.this shit is so ass... I should stop having any hope towards Tenslop, they just suck it is what it is
>>107279858>a 480p and a 720p modelwhy? Wan can do all resolutions so why can't it do it?
i'm trying out wan2gp with wan2.2 i2v but i get very annoying results with portraits as start images because the generation just pans the camera down and cuts the face out of frame. i tried messing with flow shift, image_mask, image_ref_relative_size, resolutions, but still nothing.
>>107281241it's incredible how much schizos are in this general, there's the blacked schizo, the "I make 4 ldg threads" schizo, debo...
>>107281257he also shits up lmg for what it counts. also just report and ignore, youre feeding them by replying
>>107281257how is debo "schizo", explain please
>>107281241>>107281257>>107281278hello tranihang yourself
>>107281241can't you do proper bbc gens instead of hillchurls? this isn't rage-inducing enough
>>107281290>how is debo "schizo"https://rentry.org/debo
>>107281257>it's incredible how much schizos are in this generalThere are attention seeking schizos in nearly every thread. If you don't use auto filters, then you are your own worst enemy.
>>107281330Filtering them is no different from tolerating themAnd we don't tolerate subhuman avatartroons
>>107281337>Filtering them is no different from tolerating themIt's better than tolerating them because you don't have to see their crap.
>>107281391fantasy niggers are just not as good as the bbc material. you can do some savage jungle african barely sentient niggers
Any Applefags here? What's your setup? I'd like to train a LoRA around a specific character, any advice on which tool runs best on Apple Silicon?
>come here to discuss cool ai releases>thread hijacked again with bbckek you guys are worse than /gif/ and dare i say, /b/
>>107281406Nope, it leads to new posters thinking subhuman behavior is acceptable
>started a roleplay with furry anthro bunny girl sniper>haha funny bunny lewds>ended up spending 4 hours crafting a story with rescue and freeing them from humans, finding an ancient space sport in the contaminated zone, and escaping with them to a fresh new world>story just ended and i feel sad that thats not my realitydamn this shit is addicting.
>>107281466hey this isn't half as bad as the /gif/ spammer jeet
>>107281628>furrykys
>>107281516>Nope, it leads to new posters thinking subhuman behavior is acceptableYou can't report the mass trolling of threads anymore, and Hero won't implement IDs or filter those specific terms globally, even though some have. Type in 'ch*ck*en sh*t' correctly and automatically get banned for two days. We could have that system, but filters is all that is given to us, so deal with it.
>>107281824nope, sorryi won't surrender to shitsposters and trannies
so how is Hunyuan i2v for anime, compared to out-of-the-box Wan2.2? no examples in the release page does not seem good
>schizo still pooping and shitting himself about aniget a life fucking loser
i hope the trannies are cute when they finally take overalso hunyuan 1.5 can do some nice futas
>>107281873it seems fairly competitive with wan on t2v but it's also actually hard to compare, there are so many things one has trained well and the other hasn't
>Concept: AI model that generates waifus in VR/XR space in real timewhere do i apply for funding and how many more years?
>>107281924shit usecaseI want something wired directly into my brain so I can gen just by thinking, no need for text enconder since my brain is creating the prompt and the model has been directly trained on brain inputs.inference is also seamless
So can hun you wan actually do 10 seconds, or is that yet to release? Or hardware limitations?
succesfully hooked up the I2V prompt rewriter, give me some kino to gen
>>107281968>comfyno thanks
>>107281877Go back to your empty threads, tranny faggot
>>107280136https://arxiv.org/pdf/2511.14993first time that I see the finetune actually unslop the pretraining model lol
>actual comfy shills ittgrim
>>107282014Ani I think everyone would like you more if you posted more shota.
>>107282030schizo never posted any proof about ani having shota collection
>>107282046>no shotaOh, well fuck him then.
>>1072819681girl, asian, big breasts, masterpiece
>>107282014>>107282046Again, you can fuck off to your empty threads, faggot
>>107282046>schizo never posted any proof about ani having shota collection/ldg/ is (one of many) /ss/GOD friendly boards, so that's one more reason why (tr)ani isn't welcome here.
>>107282107ss is the most redditor fetish imaginable grow some balls fucking sissy
>>107282111If you didn't cut off your balls, you wouldn't be so low T and mentally retarded, trani.
>>107282124go cry to your mommy little fag
release the shota files
>>107281316>https://rentry.org/debohow can one man be so based
>>107282181>how can one man be so based
>>107282194he deserves it desu
>>107281873I know it's not technically anime but>the girl drinks the sake from the small cup in her hand. she closes her eyes and collapses falling backwards onto the floor. she lays on her back on the floor and spreads her legs>3090, 30 steps, 716 secondsgonna need to try that prompt enhancer later
what's the best image upscaler out right now? I'm looking for something that doesn't leave the image looking like a smudgey mess.
>>107282443the t2v is p nice, i missed how good hunyuans gore was compared to wans.
>>107282475Depends what model you use, i2i controlnet is great for sdxl
>>107282153https://desuarchive.org/g/thread/101130602/#101134653Wasn't too hard to find
>>107282519I'm trying to use it for photos.
>>10728244312 minutes for that?
>>107282548I still find it weird that nobody bothered to look at metadata before me for several months
>>107282548i can't fap to this
kino
>>107282548>>107282596its not ani schizo
>>107282555Topaz photo
>having issue with comfu ui>update it and dependencies to see if it gets fixed>it now wont start up and gives no error
>>107282719why would you even use comfy?
>>107282726what do your reccomend for wan image to video and vibe voice?
>>107282745anistudio will be able to handle those soon. wan works right now
Any anons follow YouTube channels that try to stay up to date on breaking NSFW news/releases? Aitrepeneur used to be good but he hardly posts shit anymore
>>107282745sd.cpp will support those soon. The wrappers are crinkling
>>107282745Ignore him, comfy is by far the best currently, and helps you learn. Try rebooting your machine btw. I've had that happen when done package wasn't released properly or the GPU hook can't be made for whatever reason. Otherwise just reinstall comfy and nice your models to the new path. I personally like to keep a separate comfy install for each tool (one for wan, one for voice, one for SD/image). If you've got a ton of custom nodes all in one install then things can get messy, take forever to load, have conflicts that could cause crashes, etc.
>>107282684lolcow
>>107282857we should support leejet
>>107282863Sorry for retard words I'm on mobile and getting molested by autocorrect
>>107282877dont worry, still better than the incessant any shilling spam by this fucking retard
>>107282684how do you think heani feels about a random schizo on 4chan ruining his only chance at making it by being a retarded shill?
>>107282896>ruining his only chance33 stars.. ngmi
SDXL is so fucking ass I need a 5090. I can't handle it anymore.
>>107282896>how do you think heani feels about a random schizo on 4chan ruining his only chance at making itactual schizo anon herei literally goon to my success regarding thisfuck trani
>>107282719>4090>cu126I think that might be the problem
I was kinda expecting hyvid to be ready when I woke upslowly losing faith in that fox fucker
>>107282963you're just another debo wannabe i'm the true schizo successor
>comfy is the best>error: fuck you
>>107282981nah i told him we will regret what he has done (irl)also it won't get any better ever for him
>>107282588yeah... there's no light loras yet obviously
>>107282997*he
>>107282997lol
So tired of Forge Neo failing to install for some reason; surely there's a better webui for a 2060?
>official hunyuan 1.5 comfy workflow returns black screenwhat am i doing wrong
>>107283141sage attention status?
Any news on SD.cpp?
>>107282974what do you mean? cuda versions arent compatible with certain graphics cards? when i tried to force re install sage attention i do see nvcc.EXE failed with exit code 4294967295, but it doesnt start up with sage attention off either. i might try a fresh insteall
>>107282911Are you saying you need a 5090 to run SDXL?
>>107279406nice I made it into the collage
>>107283185No I need a 5090 to run better models than fucking SDXL.
>>107280949>looks plasticGet your eyes checked before you post bair.
>it doesn't understand feeti'm tired, anon
>>107282911maybe you just need more time and more offloading to RAM?
>>107283146Was working fine with sage for me>>107283231what model?>>107283091unironically maybe look at one of the sd.cpp wrappers
of all the useless nodes comfy crams into its defaults, why the fuck is GGUF support still missing?
>>107283312you got a recommendation for an sd.cpp wrapper?
>>107283314why would anyone maintain more code if other niggers will do it for free while at the same time you get to freely handwave dismiss anyone who has any problems with your project but is using those external nodes (everyone)?
>>107283241Oh. SDXL is like what, 6gb? You can run chroma ggufs mang https://huggingface.co/silveroxides/Chroma1-Base-GGUF/tree/main
>>107283257>FluxUse Chroma.
>>107283312>what model?your favorite, Chroma.
is there an online resource that shows you all the versions of each python library or do i have to google them myself? for java i just go to maven
>>107283493https://pypi.org/ or on the CLI via uv pip or pip or conda I thinkas always, it's not absolutlely all tho. someone can publish on github releases or elsewhere.
>>107283493>for java i just go to mavenPoor child, you've been wronged.>is there an online resource that shows you all the versions of each python librarypypi has each release and you can go to the project's github repo to find more
>>107283257
>>107283314who cares about comfy
>hes still bumping the 20+ hour old troll threads
>>107283558>troll threadsthey were made days before this one schizoprove they are troll threads
>skim the threadSo is there no reason to download hunyuan when I already have wan?Can it still do blowjobs out of the box?
>>107283521>>107283525thanks
>>107283393>>107283554flux is fine with feet straight on, it's when the pose is complex it starts to go off. is chroma any better?>a photograph of a woman in a red bikini, sitting crosslegged in a bright green tent, on black and white buffalo check blanket, camping supplies in the tent behind her