Discussion of Free and Open Source Diffusion ModelsPrev: >107792305https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/musubi-tunerhttps://github.com/kohya-ss/sd-scriptshttps://github.com/tdrussell/diffusion-pipe>Z Image Turbohttps://huggingface.co/Tongyi-MAI/Z-Image-Turbo>WanXhttps://github.com/Wan-Video/Wan2.2https://comfyanonymous.github.io/ComfyUI_examples/wan22/>NetaYumehttps://civitai.com/models/1790792?modelVersionId=2485296https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>Illustrioushttps://rentry.org/comfyui_guide_1girlhttps://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe|https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/r/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debohttps://rentry.org/animanon
>>107794550is saw somewhere that the issue is im using main instead of the v4 branch, is that it?
>>107794563yeah v4 branch doesnt have the memory freeing buttons sadly
>>>107794552>>Prev: >107792305>Prev: >>107792305
>>107794583fuck
>>107794552>107792305DING DING DINGRETARD ALERTALL HANDS MAN YOUR BATTLESTATIONSWE'VE GOT A RETARDED FAGGOT HERE
Man, did Ran really snuck in the troll links again? It's so fucking tiresome
>>107794552>Maintain Thread QualityThis shit needs to stop. Take your fucking off topic schizo drama to discord where it belongs
>>107794599im sorry bro no one will use your UI
>>107794607on topic and related to thread health. it has been there for far more threads than it was without. if you are not satisfied with the thread culture, migrate somewhere else, there are plenty of AI threads on this board
>>107794552Total 1girl supremacy
>>107794609Take your meds, faggot. We just had one of the best threads ldg ever had and now you bring this garbage here again. Do you really hate this general so much?
>>107794629>it has been there for far more threads than it was withoutMaybe this logic works in your shithole of a country, but appeal to tradition is one of the worst arguments you could possibly have, ever. It's not "thread culture", this is your personal vendetta against a dev.
>>107794629>give attention to attention whores who get off on said attention, especially when it's negative>continue giving that attention despite the fact the attention whores in question are still herewinning move right there, good job anon
>>107794632last thread it was basically ltx shitty videos... barely any image. wish we had more 1 girls posted or images in general
>>107794639I don't browse this general often and thanks to this discussion I saw those links and read them and will now spread the word
>>107794652dude get the memo: he's replying to himself
so tired of ran's antics
ltxv is amazinghttps://files.catbox.moe/ydn07o.mp4https://files.catbox.moe/g9dzvj.mp4https://files.catbox.moe/upb2vk.mp4
how long does it take half-decent loras to be made for new models, i was using hunyuan1.5 before and there weren't many but ltx2 is a gamechanger
>>107794718we just need porn loras, people get to work
>>107794762https://files.catbox.moe/uetngb.mp4Forgot the video. Fuck this catbox shit, why can't we get sound on here or an AI board.
>>107794677soon bro i downloaded some creepshot loras yesterday
Blessed thread of truth friendship and justice for all
>>107794768dont downscale / use the upscaler, they made it ugly, at least atm, it seems broken
>>107794599>>107794607>>107794639>barely 5 hours of sleepLolYour dogshit frontend is kilking you
>>107794802But anon... that WAS with no upscaler.
>>107794834then why did yours look so much worse>>107794718
>>107794851I don't know. But I assure you. No upscaling or downscaling.
>>107794818He doesn't even work on it lol
>>107794607It's actually very important to keep the attention whores and avatartroons in checkAlso "anistudio" is malware and newfrens need to be warned
>>Maintain Thread Quality>https://rentry.org/debo>https://rentry.org/animanonyep. this is the real thread. carry on.
>>107794983Why did you repeat my post?
>schizoid trying to cover for his own schizo mistakelol.
Good Model for pic nswf edit? Flux kontext dev is censored
>>107795052wondering this too 3bhqwen edit worked for clothes but there's nothing as simple as "give her big bouncy breasts"
even a jewish team can't make an optimized ltx2 day one. religions kek
>be trani>piss against the wind>"stop pissing on me ranfaggot!"lolcow
>prompt, input image and audio file combo keep giving me a powerpoint presentation zoom in with no motion>nothing spicy about it at allMOTHER FUCKER
>>107795203sadge
how difficult is wan or other video generations on vram? whats the minimum acceptable without genning it for an hour?
>>107795320bro its been 3 days, change subjects
>>107795298bout three fiddy
>>10779529816 but you need lighting loras or ggufs
>>107795343i should change subjects
LTX-2 gguf when?
>>107795418is this ai?
>>107795345>>>/wsg/6067316so much quality
>>107795421Yes, but plottwist: grok was trained on real pictures of Elon in a swastika bikini.
>>107795379please broh
>>107795320>>>/wsg/6067319
>>107795467>>107795567:|
>>107795581>:)
>>107795345
someone tried gemma 3 abliterated instead of the original one to test if it works or changes anything?
>>107795655yeah
>>107795641
>>107795715
>>107795764Consider me spooked anon
thanks for beta testing guys
>>107795796my dick thanks you
>>107795800proof?
ltx is too powerful
whats comfy-kitchen?
so far only things ltx2 is able to do are memes about floyd, hitler, trump, from what i see anything sexy is out of the question
>>107795866Butr anon tohse are sexy
>>107795830they made a home for all the 1girls
another day, another lora
>>107795918Looking good. Every day is failbake day for me.
>>107795932whatcha making
>>107795918what loss values do you typically start and end with? do you just look at the samples to decide whether the checkpoint is good or too far gone?
To make a style lora, can I just grab random varied screenshots from a tv show etc and it'll work for any sort of character/environment gen, as long as I tag them properly?
>>>/wsg/6067150>>>/wsg/6067150>>>/wsg/6067150Migrate.
>>107795948Should work, yeah.
https://files.catbox.moe/djvo4z.mp4https://files.catbox.moe/l67j93.mp4https://files.catbox.moe/0wvsmj.mp4https://files.catbox.moe/03noxe.mp4
>>107795933Trying to port some old IL loras based on some strong styles but characters end up looking fucked up. I'm starting to really appreciate how easy it is to train loras for IL.
>>107795938in general with ZiT i see a very fast drop to around .04 but I can't really seem to get it to converge completely. so i train to about 1500 steps and see if any samples are acceptable then gen about 50 images. if it's overcooked i'll try earlier checkpoints, if it's undercooked i'll run another 1k steps then rinse and repeat. i think until we get base this is definitely more of an art than a science.
>>107795988damn, ltx got real boring real fast
>>107796015nigger ive done nothing else since it came out, and this is before loras. Once loras come out it will be crazy. Uncensored veo 3 at home is crazy
So actually looking at some of the examples some that look decent, people already training loras and it (apparently) takes around 10 - 30 secs to generate a video, wondering if its worth getting in to ltx2https://www.reddit.com/r/StableDiffusion/comments/1q6j2v7/trained_my_first_ltx2_lora_for_clair_obscur/
>>107796007the fact that this isnt base is the main reason why im put off baking new loraswho knows how well the lora baking process will roll over from turbo to base
>>107796029already? that is awesome
>>107796007.04 or 0.4?all of my onetrainer trainings look more or less like this, no matter how many images i have, caption or no caption, rank 16 or 32, lr 1e-4 or 3e-4, etc etcvalidation will go up at some point, which is where i usually stop training.but i honestly dont understand why it all looks the same
Can you block swap ltx2? Is so, how?
>>107796044two more weeks
>>107795990i had to start from scratch when i switched over from flux. i just use the z defaults and my old datasets with tags. i had all kinds of problems trying to use settings from flux training.>>107796044fair enough. it's definitely sinking time into a dead end.>>107796055yeah down to .4. i wouldn't focus so much on the numbers if you're happy with the outputs, especially with z-image training since it's all a hack anyway
>>107796057its automatic. Just use --reserve-vram 2 or so so it knows how much vram to keep for windows
>>107796023yeah uncensored veo 3 at home would be crazyltx looks god fucking awful though
>>107796076can you stop with this cope? it runs on 12GB cards now, just disable the downscaling and upscaling
>>107796074I'd rather do it myself.
>>107796071soon
ltx2 better than wan 2.2?I'm literally returning to the thread after being busy with other things for almost the entire year. Last time I left Hunyuan was still top dog in video generation.
>>107796090i can run wan just fine, nobody's copingyou're just blind on hype and failing to see how fucking awful this looks. you'll be ashamed of shit you posted in a few days.
how to avoid static jpg output with ltx2?
has anyone else noticed that people have generally started to accept that the release of the z-image base model was cancelled?
>>107796108nigger post me a wan gen that looks anywhere near as good as >>107794718
>>107796029training this model looks like its gonna be crazy easy then, awesome. Lets see if first nsfw lora drops today or tomorrow
>>107795777So this is the power of ltx, such quality I kneelhttps://files.catbox.moe/8ikajr.mp4
>Frame count must be divisible by 8 + 1.LTX2 frame rate seems to be 25 in the comfy workflow, so why is that 25fps instead of 24 or 16fps if we want divisible by 8?So to get 5 seconds of video, we'd need 121 frames, not 126.
ltxv 2k res looks GOODhttps://files.catbox.moe/uju19s.mp4
>>107795988i don't remember wan2.2 being this mushy
>>107796147lmao did you chain 10s gens? poor miku looking plasticer by the second
>>107796105Maybe in terms of speed and producing sound. Suppose it comes down to what you want to generate.>>107796114Good, the 'where base' spam was obnoxious.
>>107796164yes it decided to make the purple girl exist out of thin air and steal migu's spot
reminder you can only get a good LTX outcome if you rent it out and generate on a b100
>>10779615924GB vramlets need not apply, this is a 5090 chad only model
>>107796186based fuck vramlets I hate them
>>107796180this took 3 mins on 5090, just stop being poor>>107796159
Proompt me this1girl, standing, very beautiful
>>107796164
>>107796158the requirement of 8+1 is correct so it should be 24fps, I don't know why the workflow goes with 25
>>107796201post workflow or you are lying
>>107796212grim
>>107796201i dont see how 5090 is a flex. you can rent that for cents on runpod lol
>>107796209here saar
>>107796147>>107796164Ah, the issue that plagues every long chained video generation, the sudden jank. Wonder if they'll make an SVI versi...https://github.com/vita-epfl/Stable-Video-Infinity/issues/67
>>107796226he supported nvidia and you didn't. they deserve our money for providing this incredible tech
>>107794552>>Maintain Thread Quality>https://rentry.org/debo>https://rentry.org/animanonwhy is this still in the OP? tran is supposed to be in a witness protection program and move to africa
>guys ltx quality is amazing look at this barely moving scene with no organics>oh i also wiped out metadata so you can't even verify if i lied about timingslmao
>>107796232thank you saar very beautiful saar darker skin would look beautifer
>>107796218its official inference code, not comfy
>>107796201>cloud genlmaoing my ass off
>>107796114i never expected it to be released in the first place. i understand Chinese Culture
>>107796266>cheating
What is it for?
>>107796268we just making shit up now? nothing in that points to anything being cloud based? and are you so poor that someone owning a 5090 is that hard to believe?
>>107796248kek. ngl tho genning on runpod from my mac while cozied up in a blanket is comfy af
>>107796280if it was local there'd be a comment
>>107796280Stop lying bastard fuck you
>>107796291i'll probably gonna be confined to my laptop for a few months soon. why runpod instead of something like paperspace?
>>107796262>darker skin would look beautiferhere u go sar please be of upvote for good looks bless ganesha for fortune
>>107796297what would it say? Im on arch linuix with strengthened kernel so im not on some shitty windows spyware if that is something windows does
all I fucking get are jpg with sounds
>Im on arch linuix with strengthened kernel so im not on some shitty windows spyware if that is something windows doessevere mental illness
>>107796308try it yourself retardhttps://github.com/Lightricks/LTX-2/blob/main/packages/ltx-pipelines/src/ltx_pipelines/ti2vid_two_stages.py
>>107796212kek
>>107796317comfy adds spyware to everything
>>107796280>>107796291>1 minute apartsee nigga i knew you were full of shit. it's way too easy to tell a cloud gen from a local ltx gen
>>107796335I stopped getting complete still when I reduced the strength on the LTXVImgToVideoInPlace node. Still not getting a lot of movement though which is irritating.
>>107796396thanks I'll try that, for now half of what I get is OOM, the other is these jpg stills
are people retards, ltxv looks great if you gen at high res without the down scale bs. What is this strange coping about it looking better than wan? Are people being paid by alibaba to shit on anything else?
oh and actually use this https://huggingface.co/Lightricks/LTX-2-19b-IC-LoRA-Detailer/blob/main/ltx-2-19b-ic-lora-detailer.safetensors
>>107796463post comparison
the lack of 1girls from ltx gens got me worried
What is the best way to enhance breasts and genitals on anime characters in local gen? I know for video gen you use loras; are there similar loras in image gen specific for translating simplified boobs to more appealing boobs? Something that works like the automatic detailer for hands and faces from the rentry guide would be perfect.
lol now I'm getting oom no matter the number of frames
>>107796502here is ltxv, now you do a wan onehttps://files.catbox.moe/3udh6d.mp4
So how is LTX-2 for t2i?
>>107796571stop reposting crap from the bandoco discord, tranny
Why does the guide in the OP not include LTX-2?
>>107796605I accept your concession
>>107796613its doa
>>107796613Too close to the character limit.And no we have to mention the meme anime model that 3 autists use and those discord drama rentries MUST remain there, please understand.
>>107796571wheres the workflow?
>>107796637>Too close to the character limit.remove the off-topic shit from the OP then maybe?
>>107796105Wan 2.2 is still better overall, the video quality is better. However ltx2 had some clear advantages, lip syncing is pretty good, and it's 25fps natively, and it's faster. You could conceivably use each in different situations. Wan 2.2 for scenes with higher motion and ltx 2 for more static scenes with dialog.
>>107796648ltxv one, remove the downscale and upscale parts, use higher res with full non distill model at 30+ steps and 4 cfg
testicles
>>107796750benchod
you can do proper quality ltxv video with less vram, you just need enough ram https://files.catbox.moe/secyev.mp4
>>107796663>i won't sharefuck off then
>>107796637Stuff never usually gets added that quickly DESU. I guess ZImage did but it's an exception. Neta earned its spot over time too lol, IDK why you think it's the problem
>>107796637...Just rip the band-aid off, they won't miss it!
>>107796764share your wf anon
>>107796797everything is here, it was done on a 3060 12GBhttps://www.reddit.com/r/StableDiffusion/comments/1q6k2a3/definition_of_insanity_ltx_20_experience/
>>107796562
>>107796787why are you obssessed with this weird faced woman
>>107796825I don't think he included it, the video file doesn't have it
>>107796860>The workflow is the I2V Comfyui template one, including the models, the only change is VAE decode is LTXV Spatio Temporal Tiled Vae Decode and Sage Attention node.even has the full prompt
>>107796825>The video took 16:21 minutes on a RTX 3060 12GB>16:21 minutesThats a deal breaker sadly. Despite my 4070tis, I'll just wait for the speed boosts
>>107796868yeah I was looking into someone making their own without the 2 sampler stages overcomplicating things
>>107796396Oh, i'll also be trying that.>>107796417nta but trypython3.12 main.py --disable-pinned-memory --disable-smart-memory --lowvram --reserve-vram 1.0that is what got it working on my piece of junk 3060 12GB, my issue was not OOM it was the way comfyui is so aggressive with memory swapping causing my system to just freeze up in the vae decode stage or some other stage. its worth doing this also if you use linux and are hitting a lot into swap file/partitionsudo sysctl vm.swappiness=10add it to /etc/sysctl.d/99-swappiness.conf so its changed at bootvm.swappiness = 10
>>107796905Does comfy contact the mothership?
are loras trained from dedistilled ZIT better? There's a noticeable increase of defects when using loras trained on the normal bf16
Is there something better than Euler + Linear Quadratic 16+ steps for ZIT?
The anime woman eats soup
>>107796936Not sure. I trained a few on the adapter before ostris made the dedistill, since then I've only trained on the dedistill and they work fine, but not sure if they're better.
>>107796931after many anons who bothered to check, yes it does and multiple popular custom nodes do as well
Where the FUCK is the Will Smith spaghetti video
>>107797009>multiple popular custom nodes do as wellfake
>>107796936Some say the dedistill is better, some say the adapter is better. See which one works better for you.Dedistill worked better for me but both were shit ultimately and it wasn't a direct comparison (lots of other variables changed) and I am waiting for Base.
>>1077968704070 ti would prob be at least twice as fast if not three times
>>107797020man this is so good, i love ltx
>>107797034>Some say the dedistill is bettersome are retarded then. Obviously full model is way better just a lot slower
Any news on lodestone's Chroma Z suicide mission?
>>107796931yeah some custom nodes send telemetry /home/anon/.config/Ultralytics/settings.jsonchange"sync": true,to"sync": false,fucking cheek bastards it should be fucking opt-in
>>107796787
>>107797057sync does not send any data other than your version
>>107797021did you set the flag in ultralytics so it doesn't phone home yet?
>>107797044what the fuck are you talking about retard that looks like absolute shit
>>107797064says who?
You didn't download the newest Mossad spyware, right anon?
>>107797077the fucking code
don't worry goyim they're just sending the version nothing else
>>107797076
>>107797056It started training a few days ago and in the unlikely event that it doesn't turn out to be dogshit, it won't be in a usable state until a few weeks at minimum.
>>107797109qrd on chroma? why bother with it
>>107797056>hey Lodestone you should wait for Z Base and just finetune it with the Chroma dataset>"nope I need to train now">okay well then, you should merge the dedistillation adapter into the model and then finetune it normally>"absolutely not, I'm a genius and I can invent better ways to train it">alright, what's the plan then?>"I'm gonna take the Turbo model, swap out the VAE, force the model to learn a whole new latent space, and make other architecture modifications, this way it will be even better">...>"that'll be another $150k pls donate to my ko-fi"it's gonna fucking suck
>>107797127you voted for this
it wrokshttps://files.catbox.moe/hrb25s.mp4https://files.catbox.moe/p9zyxu.mp4
>The female teacher dances and hums a happy song.>30 steps>euler simpleThis works so well indeed.https://files.catbox.moe/6nclzf.mp4
>telemetryvery cringe. blocked python from comming with the net a long time ago
LTX2 is dogshit."a beer can is thrown in from out of the frame and the monkey catches the beer can with his hand and then opens the beer can open with his other hand and then begins to drink from the beer can and then lowers his hand holding the beer can down and cheers to the camera."wan 2.2 can do this amazingly.
>>107797152prove it?
>>107797117>qrd on chromaFlux but dedistilled, can do higher CFG and negative prompts, knows NSFW, furfag shit and a lot of other stuff.But unfortunately trained by an ADHD furfag autist so it's completely schizo, unstable and unreliable. (The cope is that it's just a base model and it will be amazing once someone finetunes it. Coincidentally there is also another cope that it needs massive amount of data to make a proper finetune, how strange...)It's also slow as shit.Shilled sporadically by its cult on plebbit and there is one lesser schizo who occasionally goes crazy about it here.>why bother with itIf you have an OP GPU like 5090 and don't mind playing a lot of seed lottery, looking at deformed slop, you can get good images from it, eventually.High potential, low average returns.
I'll be honest I think ltx is doa
>local ltx is unusable>cloud ltx is a 4/10>grok is free and does everything ltx does lightyears ahead
>>107797176and wan can do this
"the camera pans out of the military humvee car window and reveals the humvee car driving in a city very fast to the right side as the camera follows the car and it drives through a crowd of rabbi jews and their bodies bounce off the car as the car keeps driving through the large crowd of rabbi jews with extreme violence and force."Dogshit.
>>107797194no it isnt
isn't it cool how we got this model which can do over 10s of video while wan 2.5 is API only?also, 24fps base too, no more slow mo.https://files.catbox.moe/rz0nck.mp4
>>107797155I never got chroma to work properly. No mater what I tried the output was dogshit with artifacts, glitches and chromatic abberation and whatnot, I eventually gave up on it
>>107797176grok cannot do porn
>>107797194>humveelol
for the anon using RES4LYF, how did you do it, I keep getting this error with ltx2
>>107797218who did it better?
bros we need to pretend ltx2 is good so alibaba is pressured to openweights wan 2.6
>>107797224neither can ltx retard
>>107792671Not that anon, but I did this and still getting the "no CLIP/text encoder weights in checkpoint, the text encoder model will not be loaded." error
They done LTX 2 on 8GB vramhttps://www.reddit.com/r/comfyui/comments/1q5vxky/ltx2_on_rtx_3070_mobile_8gb_vram_amazing/Guessing it was after all just a skill issue.
>>107797219Congrats, you did the correct thing anon.You haven't wasted a lot of time trying to tard wrangle it or try comically overcomplicated furfag discord workflows like me, only to arrive at the same conclusion eventually.I decided that I will need overwhelming evidence that it isn't shit before bothering with any lodestone model in the future.
>>107797117A Flux schnell [and for radiance a pixnerd pixel space model] variant NSFW finetune done by a single user.Can do quite many nsfw/anime/1girl/clothes and furry things other models can't but it is also a fairly messy model, so you could have endless debates if it's good enough for some use this or that person had in mind.
>start onetrainer and continue last training from backup like usual>it's redownloading ZiT from scratchI do not like this and wonder why it's doing this
>>107797262there is nothing it can do that illustrious doesn't do better and faster
"the tiny frogman is leaping around in the puddle repeatedly multiple times during the rain very happily as the water splashes with each time he lands in the puddle and then a hunter in a similar style to the frog peaks out in the trees in the background with a rifle and aims at the tiny frogman still leaping around in the water and follows the frogmans movements with the rifle and then the hunter shoots the frogman as the frogman slumps over like a ragdoll in the water face down and the water around the frogman turns red gradually. "
>>107797219>>107797257sorry you're missing on the fun
>>107797275why not use the transformer override? just tell it your local file path
>>107797275On Win11 and edited enviromental variables? It sometimes shits itself
>>107797278>Slightly better text (if you are lucky with seed)>Complex multisubject prompts that are impossible to do without regional conditioning and controlnets in illustrious (Need even more luck with seed for that.)But yes I agree that it isn't worth it.>>107797300Z does that image a few times faster and would nail it first try.
wan2gp chads and niggas, we eating good today :)
>>107797246get those files and the same gemma_3 fileotherwise update comfy in the folder and GUI if not updated
>>107797308I'm a retard and never got this to work, I just have to drop the bf16 and text encoder into the same folder and point it to it right? For some reason it kept refusing to load.
>>107797322I love z-image too but it can't, also chroma (for now) has the same the tools that flux has (redux, controlnet, unet modifiers, etc), chroma has a more lewd/nsfw dataset so you can generate anything you like really
>>107797322>>Complex multisubject prompts that are impossible to do without regional conditioning and controlnets in illustrious (Need even more luck with seed for that.)nta but proof?
>>107797329I did. Cloned the entire https://huggingface.co/google/gemma-3-12b-it-qat-q4_0-unquantized/tree/main and also tried the 23GB gemma_3_12B_it.safetensors file along with the other small files.Downloaded the latest v0.8.0 but nothing is fixing this damned error.
>>107797326qrd
>>107797300I just got dogshit output with astronomical gen times and foudn no point in bothering, the output wasn't good enough to justify the wating time
>>107797347oh wait a second, you're saying it completely redownloads the "Tongyi-MAI/Z-Image-Turbo" folder? the override is specifically only for the transformer as far as i know, so it still looks for TE etc in the Tongyi-MAI/Z-Image-Turbo even if you use a safetensor from somewhere else
>>107797246i get that error also, hmm... Right i know that you can't clone them you have to download them individually. put the safetensor file into its own sub-directory/folder inside of the text encoder directory ComfyUI/models/text_encoders/gemmaput it all in there and restart comfyui and then load the clip from that in the clip loader.
fuck
Finally a good collage, thank you anon >>107792352
>>107797382I feel your pain anon, I was there too, chroma is a fucking schizo model and it doesnt help that its discord community shares crazy workflows too, there are too many chroma versions out there, but I have been using it since the weekly releases and I can tell you that the fp8 is good combined with the latest comfy, is just as fast as flux.1 now
>>107797370I get messages like that when it loads stuff too, i'm also using the fp8 distil model, try that instead of full if you are using it maybethis is my setup:
>>107797395>put the safetensor file into its own sub-directory/folder inside of the text encoder directory>ComfyUI/models/text_encoders/gemma>put it all in there and restart comfyui and then load the clip from that in the clip loader.Yeah I downloaded each one. Also have it in a subfolder. Really frustrating that this isn't working correctly. I still get an output but I have to assume it isn't using the prompt.
>>107797392yep downloading everything from scratch for some reason
i cut myself everytime one of my images isnt on the collage
>>107797428forgot workflow:https://files.catbox.moe/7gvw08.json
>>107797369The proofs I have are fetishes I won't admit. (Not pedo stuff, for the glowies among us)As much as I can say: Imagine two completely unrelated fetishes that never get paired on the booru. One takes place on the left side and the other on the right.I have never seen an SDXL booru tune pull it off without slopping both together, even with regional conditioning.Chroma nails it, with varying degrees of deformed anatomy, but they are kept separate and not slopped together.
"the woman is squatting down as a man leaps in very fast from the side and dropkicks the woman in her back as she is violently shoved out of frame as the man leans down to the camera and shouts "no gooks allowed!!!""
>>107797456DOA
>>107797437>i cut myself*another slice of cheesecake
>>107797426What changed? I am not seeing any commit regarding chroma inference speed in comfy last few weeks.
>>107797326Buy an ad.
>>107797428>fp8 distil model, try that instead of full if you are using it maybeYeah, I'm using that but still getting the "no CLIP/text encoder weights in checkpoint"
>>107797430oh man i think its because we've both downloaded some other safetensor file from another location, i have to go track that down.
>>107796029>civitai still did not create a section for ltx2
>>107797437I just let out a little autistic "nnnnNNNNNg" and then exhale sharply like I'm an angry bull. Then I pick up my foam mousepad and whip my desk with it, it makes a wicked loud noise but doesn't damage anything. Then I use Claude to gen a Haitian voodoo curse against OP and I read it aloud.
>>107797430>I still get an output but I have to assume it isn't using the prompt.yeah its not because none of the videos i genned have motion, then i saw you post, then i looked back and carefully looked at my comfyui output in the terminal and its not even loading the clip model.
>>107797485another user had a similar issue:>If I remember correctly, this message:No CLIP/text encoder weights in checkpoint; the text encoder model will not be loaded.only occurs when the model is loaded from the node, but you are supposed to load the CLIP model with a separate node. This message is normal.try a diff node or workflow for the encoder
>>107797498>i have to go track that down.I would appreciate it. I also tried gemma_3_12B_it_fp8_e4m3fn.safetensors but that didn't work either.
>>107797357
>>107797326Does wangp work on amd+linux? I need to gen some goon clips
>>107797518>you are supposed to load the CLIP model with a separate nodeWhat other node should be used beyond " Gemma 3 Model Loader"?
>>107797476dunno, but I updated comfy a few days ago and its handling fp8 speeds really better now, I even droppped all the ggufs models I was using, I think it was because LTX-2 was close to being released and comfy was working closely with nvidia to get better speeds with fp8-fp4 models, I havn't even updated to comfy-kitchen yet also https://huggingface.co/silveroxides/Chroma1-HD-fp8-scaled/tree/main got released a few days ago too, its only 9.1 GB, so its not a heavy model anymore
>>107797430wait are you trying to use the fp8 version that was posted on reddit? i think that might be the issue because that is one i've been trying to use just now and its giving me those errors.
>>107797278chroma has often more details related to clothes, jewelry, backgrounds and stuff, but with the tradeoffs (stuff it can't do) and the speed difference - yea, illustrious/noob are overall better for most people.
>>107797537all my workflow has at the left is these nodes, try a diff workflow cause that node is probably causing an issue
>>107797520bingo, i've found our problem. i'll try and see if i can find the hugging page where i got that from last night.
>>107797537>What other node should be used beyond " Gemma 3 Model Loader"?And I should note that I tried "LTXV Audio Text Encoder Loader" but that gave an error and I'm not doing anything with audio besides passing it through to the output video node.>>107797549>wait are you trying to use the fp8 version that was posted on reddit?No. Tried all that were on the official LTX2 repo; ltx-2-19b-dev, ltx-2-19b-dev-fp8, ltx-2-19b-distilled, and ltx-2-19b-distilled-fp8.
>>107797587now do 2 girls, yeah you cant ltx can
>>107797538>I currently recommend using "small_rev3".>small_rev3 inside do_not_use folderIt feels like a humiliation ritual to go through this autistic furtroon shit.I am curious about what this fp8 speed up is about, it seems architecturally interesting even if the quality is still bad (and I like won't benefit from it on my RTX 3000).Do you know which paper this experiment stemmed from?
>>107797373its a gradio webui for multiple ai video models and large image model released after sdxl. Got 51 seconds on my rtx5090+64gb
>>107797622but comfyui
>>107797527it should work, check the discord. the dev is active at the moment and the community support is very friendly.
>>107797278Try two custom subjects with consistent details in illustrious, i fucking dare you.
>>107797640Thank youLet the gooning begin
>>107797573>try a diff workflow cause that node is probably causing an issueWhat other node can load the Gemma 3? I'm using the LTX2 V2V_Detailer workflow. https://github.com/Lightricks/ComfyUI-LTXVideo/blob/master/example_workflows/LTX-2_V2V_Detailer.json
>>107797598well i got the same problem, i'm trying to figure it out once I fix it I'll let you know what I did.
>>107797369I dare you to make two OCs in illustrious interacting without comfy couple. You will get insane detail bleed, or the subjects won't be recognizable whatsoever.Although i think neta yume is even better if you want anime AND several OCs.
>>107797633soulless corpos now. just going to die like invoke
>>107797621yeah I know, just use fp8-mixedfinal, about the fp8 speed up my experience has been anecdotal so far, but reading about comfy-kitchen seems they did something with nvidiahttps://github.com/Comfy-Org/comfy-kitchenI havnt migrated yet
>>107797667...and here's neta yume
>>107797674Just as a reference what GPU are you on anon?I am not exactly sold, but I will give it a try, just to see if there is any speed difference if not anything else.Probably Saturday or Sunday, too busy right now to try it properly.
They've added separate models as noted herehttps://huggingface.co/Lightricks/LTX-2/discussions/11that should help.
>>107797642show it in chroma
>>107797653use the workflow I linked and try i2v.
>>107797701>>107797667can either do goon?
>>107797701>>107797667*yawn*
>>107797764theres a literal watermark bro, lmao
>>107797518yes that is what google gemini told me, now I'm trying to find what node can load gemma...
>>107797829>making shit upi accept your concession
>>107797846not even him but its right there
>>107797864>ai cant reproduce watermarks because i said sololmao
>>107797864>>107797829you lost
>>107797872https://www.dreamstime.com/romantic-connection-anime-young-couple-head-held-gently-sweet-romance-romantic-connection-anime-young-couple-image411446948
>>107797888illustrious can goon you can't with your chroma and the other one
>>107797840I linked a workflow, save it and load it in comfy.
>>107797674Hmm well with v0.8.0 on a 5060ti I got 1.9it/s with svdq-fp4_r128-z-image-turbo vs. 1.5it/s with z_image_turbo_bf16 before. And its size is 4GB vs original 12GB, so.
>>107797749Yes, both of them can do NSFW. Chroma is far better but has issues with anatomy, Neta tries but has very limited knowledge of poses.>>107797764This is not impressive at all, and i bet the dude's painted nails are leaking from the girl's prompt.
>>107797922>Neta tries but has very limited knowledgecant lora fix that?>has issues with anatomycan't lora fix that?
>>107797905is this real?
>>107797922>dude's painted nails are leaking from the girl's promptbigot
>>107797934They can but barely anyone makes neta loras. Certainly easier than coping with noob/illust.
>>107797901I got it I think, just going to see if using the basic comfy checkpoint loader would actually work after dragging the file into the checkpoints folder.
>>107797922You can spin the gen gacha so quickly compared to using either of those models that it doesn't matter, you can just gen enough to get the right gen
install this i guess https://github.com/Lightricks/ComfyUI-LTXVideobut i thought it was comfy core thing now ugh ffs, what a fucking mess.
>>107797219>>107797257>>107797322I actually managed to get good results from Chroma, even trained a few loras and it captures the style better than ZIT, but it's so much slower that it's hard to justify using it.
>>107797972It abso fucking lutely matters, you can roll illustrious gacha 100 times, and it will be worse than a single chroma prompt when it comes to character consistency with two custom subjects. This is an area where SDXL shows its age the most.
>>107797009So comfy is not really "local"?
>>107798117is any webslop that uses npm and pip really local?
>>107798058I don't mind the slow speed, but the stripes that just sometimes appear for no god damn reason annoy me the most
>>107798140Holly sexo which model!
>>107798134>is any webslop that uses npm and pip really local?Which governments are monitoring you?
Daily reminder that you don't need more than 1girl.
Any updates to the no clip/text encoder shit?
>>107798168Chroma1-HD
8k is really not that much if you think about it, RTX 6000 pro might actually be worth it.
Bros... I think LTX2 might be bad...
>>107798274>24fps>10s+ gens>emotive>can use audio + image or i2vit's great.
>>107798274Only problem with it is how heavily censored it is, but other than that it's better than WAN 2.2 by every metric.
>>107798307it's censored, right?
New thread:>>107798332New thread:>>107798332New thread:>>107798332
>>107798335>but other than that it's better than WAN 2.2 by every metricyeah I'm not seeing that with the comfy workflow. can't get the workflow from ltx to run at all
>>107798347Nice
>>107798243Not yet, its still giving me problems despite using that other anons workflow method of loading.
>>107797382from time to time it hits well but for most part what you say is true.speed is major issue.has face gen issues as well,but it has good prompt understanding.overall nothing comes close to z-image right now.