Discussion of Free and Open Source Text-to-Image/Video ModelsPrev: >>107526185https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/musubi-tunerhttps://github.com/kohya-ss/sd-scriptshttps://github.com/tdrussell/diffusion-pipe>Z Image Turbohttps://huggingface.co/Tongyi-MAI/Z-Image-Turbo>WanXhttps://github.com/Wan-Video/Wan2.2>NetaYumehttps://civitai.com/models/1790792?modelVersionId=2298660https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>Illustrioushttps://rentry.org/comfyui_guide_1girlhttps://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe|https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/r/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debohttps://rentry.org/animanon
>>107529321I feel like she looked much better in her early days. Don't you agree? I'm not a fan of the pictures you used.
>>107529397a pajeet made this collage souless
>>107529425nice style
>>107529424i'm probably never going to use it again, i just went to a porn site sorted by popular and grabbed photos to do a test
>>107529474
>>107529429thanks, guess ill give the training run a shot then, with no tags this time.
>>107529472aaaah anjelika such a one of a kind creature.
>>107529450too jealous of ani to have taste
question, why 640x640? figured there was no way you could train a lora that low res and have it work out well, but yours trained pretty good>>107529531
>>107529471
What a nice thread.
>>107529548it had some of these even sooner, but also some of the bunny hairclips still are interpreted as hearts. >>107529467just zit and one attempt at the pippa training data that was in the last thread
>>107529397Can I Z Image Turbo in forge?
i'm gonna pull (musubi)need toolkit gone, though I doubt m'soobz will perform any differently
>>107529795forge classic has support
>>107529795>>107529809forge classic neo branch... don't be retarded like me and try to figure out why my shit's not right for hours because i didn't get the neo branch
>>107529806>he pulled?>>107529834neo is AIDS, classic itself does support z-image, but its focused on sdxl based models.neo has awful memory management issues, itll fail to load models and when you frustratingly keep clicking start, itll keep re-loading the model and taking up more and more ram. Its fucking brutal.
>>107529223Too many steps
>new cumfartorg announcement >how to 3x3 grid for adssickening
>>107529809>>107529834>There is now 5 versions of forgeI think it's time I stop being a retard and learn how to use comfy...
>>107529930comfy sucks fucking donkey dick for anything that isn't wan or maybe qwen editi use forge classic for sdxl models and comfy for wan/qweeditmost of those forge forks are fucking awful, including that memleak issue i mentioned.shit comfy never bluescreened my pc kek
>>107529930
>>107529949Governments don't like 'cults'. They want to be the only cult around.
>>107529744>it had some of these even sooner, but also some of the bunny hairclips still are interpreted as hearts.It does't have to be 1:1. The hyperspecific accessories are just gay to gen.
i dont think i can train z-image on a 3060
>>107530012Train yourself on some new skills and get a job.
>>107530012Train z-image anyway and don't get a job
>>10753001212GB should work
>>107529999waifu accessories are srs business, but basically this also indicates to me it needs more training or perhaps different settingsif t he accessories need to be manipulated it's IMO better to just caption them in the dataset and maybe change the dataset so it can be (not) prompted later>>107530012actually you could probably train loras with that regardless?
i found this https://github.com/ostris/ai-toolkit/issues/550
>>107529949i used forge up until a month ago. forge is generally faster, you're able to see the gens a lot easier as they're happening, and it's just a lot easier to use. Comfy's layout is garbage. the Node design sucks, and you have to go through a bunch of different ones to find the one you like, or just accept one you don't. it's difficult to get shit just right, and to change even minute settings, and the usability compared to forge shit. don't let anyone convince you otherwise.with all that said, it is the only one that can do Z-Image, and its compatibility with new shit is unmatched. that's literally all it has going for it. if it wasn't for that, this shit wouldn't be installed on my computer at all.
>>107530175>4bit traininglol
>>107530175>4bit
SamplerCustomAdvanced doesn't preview anymore on the new version of comfyui...how do i fix?
I just had an insane revelation. If you download a bunch of 1girl pictures you like and train a lora on them, any model suddenly produces pictures you like more. Crazy.
>>107530241>how do i fix?drag and shot cumfart
>>107530241https://github.com/comfyanonymous/ComfyUI?tab=readme-ov-file#how-to-show-high-quality-previewshopefully manager will fix it
>default training settings like that anon said are actually working>my lora 800 steps in ISN'T schizophrenic for onceholy
Comfy should be dragged out on the street and shot
>>107529844Neo works fine on my machine. Are you retarded and didn't configure your .bat file? Did you do a stupid and set pin shared memory on a card with less than 16gb vram, despite the guy warning you multiple times to NOT set that option for cards with less than 16gb vram?
>>107530358this was an issue with forge in general until it was fixed, the idea it got un-fixed is not exactly hard to believe.and of course i'm on 16gb of vram.
>>107530372I've never once had an OOM or ram issue with neo, and I use a 3060 12gb. The only issue I had was forge couple fucking up the neo install... Despite being made by the same fucking guy.
>>107530384>The only issue I had was forge couple fucking up the neo install... Despite being made by the same fucking guy.PFFT and you accused me of having the skill issue.
>>107530425It's not a skill issue when you know exactly what is breaking your install.>install forge couple from menu>neo immediately shits itself over a CUDA issue and insta-crashes>install via direct git link>neo immediately shits itself over a CUDA issue and insta-crashesIt's an issue specifically with couple and the latest version of Neo. It's not an issue that can be self-solved by not following the readme instructions.
>>107530465Animate this.
>>107530161this looks very good
>>107530581grok.com/imagine
>>107530465Nice vagina.
Is julien still there? I have a question.
1200 steps, its close, so fucking close...
i cant stop genning girls with large feet and penises
pretty kewl
>>107530854Nice. now gen an image of miku and parappa on stage rappin' together.
plz God no miku troon. spare us from your "tests"
>>107530891>6 minutes doing something by hand>do that tens of thousands of time per yearI'm pretty sure in the long run it's longer than 6 hours
>>107529472Hey anon, in the last thread I think you mentioned you used the undistilled version that aitoolkit downloads in your training. What made you do that? Have you tried training with the adapter models instead? I ask because I've done some tests with both and it seemed like the undistilled model produced inferior results with the same data set (though there might be some variable I forgot to control for).
>>107530891>>107530908this is high Chinese culture
>>107530913many are saying this. that and the v2 adapter is worse than the v1.
>>107530861lul
>>107530943well, it had the right idea, it just didn't know what the fuck it was making i guess kek
>>107530950>>107530943>>107530937You are using a world class tool but the only thing what comes to your mind is a goddamn teal haired vocaloid and spamming the same shit over and over again. You don't deserve these tools.
>>107530960https://www.youtube.com/shorts/hTXBupA3k1o
>>107530943kick punch block!
>>107530984ITS ALL IN THE MIND!
>>107530842>no blue nailsIt's over
>>107531017her nails are either purple or not even visible most of the time in my datasetintentional, because sdxl model. its trained off a style accurate lora for noobai.
>>107530960sucks to be you right now
overdid the contrast a tadmusubi works great btw, I just imported my settings from qwen and it's gold
I have no idea how this lora turned out so good, it even gave it more details instead of killing the model
what does it mean?
>>107531086sigma ba- wait, you were setting me up weren't you
STOP USING UPSCALERSthey make images look like glossy cell-shaded molestations of the original images. STOP FUCKING USING THEM
>>107530581https://files.catbox.moe/5fb6ao.mp4
>>107530960U mad?
>>107530960I bet he's having more fun than you
>>107531109lmaooo
>>107531086>σ + σ_up-FAGS BLOWN THE FUCK OUTLMAO
>>107531102>>107531120>>107531135>dayum frog nigga you funny
>>107531109hao?
>>107531135>>107531143we wuzz based n' shieethttps://www.youtube.com/watch?v=l1dnqKGuezo
prompted for blue fingienails, step 1600. i think ill get it to 2000 steps, but the anatomy is starting to fry, hands get busted or lose a finger
>>107530943it is a way to cope before getting Z-image edit I guess kek
>>107531109Holy based.
>>107531109Ya bastard, that's revolting.
welp, thats it, bye chroma
>>107531305Is this a new version of your Chroma ZiT LoRA?
>>107531321huh¨? I'm just testing old chroma prompts with zit and a lora I trained, what kills my buzz about Chroma is how sloooow it is to gen compared to the z-model, the only advantage that chroma has right now is its nsfw capabilities, but with z loras getting better and better each day, chroma days are numbered
>>107531394My bad. I mistook you for Chroma Asian footfag anon. He was training on Chroma images for ZiT a week or two ago or something. If you don't mind me asking, did you train on the adapters or that new de-distill?
>character but realisticaugh fuck that's rancid. phew. i think i'll just wait for base and the noob tune. but i think i can call it a "successful" train.
>>107531476Very nice>>107531231Nice work, I want to train a character lora too with a 3060. I noticed with other loras that ZiT is fucking inconsistent>>107530161Very cool style
>>107531101#NotAllUpscalers
>>107531101you must have used a really shitty upscaler to end up with such a horrible result lol
>>107531476for some style transfer shit it's really not bad
>>107530967such erotic movements...
>>107531678MiguDayo is so precious
>>107529940>>107530182Other than compatibility with new stuff, comfy only shines if your gen flows go beyond the basics.If your usecases are linear not requiring any weird stitching, other UIs are faster.
>ask qwedit to remove the crown>it removes the crown>and adds oneyou're a cheeky little rice cunt aren't you m8>>107531730that's a much more eloquent and not retarded way of putting it, yeah.
>end of year>expected a gimped wan2.5 local version>expected non janky long video>received C U L T U R E insteadi accept
>>107531305>>107531394I love to see others reuse prompts I shared for model testing.Z is great, but I'm maining SPARK chroma now. The extra gen time is worth it because the variety, styles, NSFW, and realism are SOTA, and it fixed the anatomy problems.
>>107531766>it fixed the anatomy problems.it definitely improved on chroma, it's also less slopped, but Z-image turbo still has godlike anatomy and details, I still can't believe it's a 6b model, that's black magic dude
>>107531766what are you smoking, don't flatter yourself, I've never used "your prompts" lmao
>high angle, fish-eye lens effect.A split-screen composite portrait of a full body view of a single man, with moustaceh, screaming, front view. The image is divided vertically down the exact center of her face. The left half is fantasy style fullbody armored man with hornet helmet, extended arm holding an axe, the right half is hyper-realistic photography in work clothes white shirt, tie and glasses, extended arm holding a smartphone,brown hair. The facial features align perfectly across the center line to form one continuous body. Seamless transition.background split perfectly aligned. Left side background is a smoky medieval battlefield, Right side background is a modern city street. The transition matches the character split.symmetrical pose, shoulder level aligned"damn
>>107528382512px pippa trainings for Z-Image-Turbo:https://litter.catbox.moe/1ihaoqgjnx28pzw2.safetensorshttps://litter.catbox.moe/2gwcxp7m0a21ig4s.safetensors>>107531558 >>107530590Ty, it's anon's training data tho. I think it still needs a higher resolution attempt
>>107531777>Z-image turbo still has godlike anatomy and details
>>107531777I predict both of these models will be irrelevant sometime next year, when homebaked models start proliferating. Z is going to trigger an optimization race to see who can outdo their perf per cost/size.>>107531816pic, not the blonde. and yeah, I'm smoking weed for your information.actually, this is a prompt that SPARK still has some issues with. qwen and z handle it way more consistently.
>>107531935>Z is going to trigger an optimization race to see who can outdo their perf per cost/size.that's assuming their competitors know the secret sauce (it was just training your model on real data and not being a lazy fuck!)
>>107531935>Z is going to trigger an optimization race to see who can outdo their perf per cost/size.Why would a company want their model to be optimized any more than absolutely necessary? Companies are able to charge what they can BECAUSE there is a hard upper limit to what a consumer can realistically afford for their computer. Meanwhile companies are able to eat the cost for hardware and sell back to users at scale. Optimizing their models would ruin that business model because the extremely expensive hardware they went into debt for just became largely useless.
Does anyone know where i might find a dataset of 1 million close up images of synthetic faces generated by an ai model at a resolution of 512x512?
>>107532002Sorry. I'm only aware of one with 800k.
>>107532002saar do not redeem my 1 million synthetic faces!
>>107531954https://www.arxiv.org/pdf/2511.22699>By systematicallyoptimizing the entire model lifecycle – from a curated data infrastructure to a streamlinedtraining curriculum – we complete the full training workflow in just 314K H800 GPU hours(approx. $630K)>Inspired by the scaling success of decoder-onlymodels, we adopt a Single-Stream Multi-Modal Diffusion Transformer (MM-DiT) paradigm [ 18]. In thissetup, text, visual semantic tokens, and VAE image tokens are concatenated at the sequence level to serveas a unified input stream, maximizing parameter efficiency compared to dual-stream approaches>For distributed training, we employed a hybrid parallelization strategy>In addition to system-level optimizations, we addressed inefficiencies arising from mixed-resolutiontraining>etcno doubt their dataset was pretty good, but the paper describes many different optimizations they did.
>>107531989>Optimizing their models would ruin that business model because the extremely expensive hardware they went into debt for just became largely useless.Optimizing models if anything helps them, its not just hardware costs either. Right now current AI models are extremely inefficient despite their power. The lower the compute need the less resources they need to spend or could be allocated to other use.
>>107531989to undercut the competitor and render their bloated model training investments worthless. the Judeo-Burger AI companies don't want to compete in efficiency, but chinese companies and indie devs do.
>>107531902Are these identical or what is the second one?
>>107531989This retarded mindset works exactly only until the bubble pop
>>107532059>undercut the competitorThey are in cahoots on this matter. >Optimizing models if anything helps themThey won't start doing this until the consumer is completely price out of personal computing.
I ran the lora and it gen'd mustard gas
>>107532002Possibly with some research but to get the results you want, you gotta make your own shit.>synthetic facesYou can do SD1.5/SDXL with various lighting and face loras for this at lighting speeds (use lighting loras or lcm loras). For even more variation, create a few thousand (yes a few thousand) highly unique faces in chroma, switch back to SD1.5/SDXL with IPadapter face combine and go nuts.Also make use of the Random Number node (was node suite comfyui).
>>107532116also its a zip bomb that contains 2 terabytes of goatse
>>107532130thanks for your long reply but i was being a retard on purpose
>>107532059>>107532075Semiconductor corps have been a cartel since the late 70's and its just now they've reached critical mass with the blatant monopolistic/anti competitive tactics.Also pricing out the consumer would be retarded because RAM doesn't just affect home PCs but quite literally everything (even non-computers, micro controllers etc) that has an "IF" statement in its function.What needs to happen is for the regulators/gov to pop the cock out of their mouths and really bitch slap these corps for such blatant anti-trust violations/ anti-free market tactics. But that won't happen with this current admin.Also>still no /ai/ board
That sex offender is right. Admin has psychosis because they are chasing sort term gains.
>>107532162>Admin has psychosis because they are chasing sort term gains.I have decided to concede and eat my own shit on this issue. Completely unregulated industry can sometimes be bad.
>>107532178I think these companies are protected because they are so friendly towards certain high ranking people. There is nothing more to it. Google/Alphabet was split up and Microsoft had lawsuits in the past too. But these companies? None of them are legally challenged.
>>107532054you don't get it, the Jews WANT the models to be big and expensive, requiring their expensive server hardware (licensed from a company with the Masonic All-Seeing Eye in its logo), because that's the only way they can extract rent from AI users. This promise of lucrative AI slavery and serfdom is the only thing holding up the AI bubble. This is why it's strategically-advantageous for Chinese labs and indie devs to make cheap models, it destroys the Jewish AI bubble, collapses the US economy, and ends the chip/wafer hoarding.>>107532075>They are in cahoots on this matter. They have no choice. We will be able to make models as good as Z on local hardware soon.
>>107532193That can only last for as long as it doesnt start affecting other businesses/pops the bubble.There's no shortage of RAM. These companies colluded to buyout each other's stock to kill the competition. The consumer is even the "customer" anymore.>>107532193Its a combination of speculation, money in politics and also a desire for growth. The erosion of any regulatory pressures has basically unleashed the full flood gates here.
>>107532237you can't just print money bro
>>107532247I'm waiting for one of these companies to buy out an essential service and start trading it between each other.
>>107531305boing
>>107531394>>107531495>>107531816dat sum fine 1girl
>>107532018did he ever reply as to why the fuck anyone would want that?
>>107532153Based, miss chokola
>>107532153absolute madlad
>>107532153>>107532237>>107532247absolute madlad, go off king>also gen one of the janny fucking miku just to piss those guys off too
>https://civitai.com/models/2218365>CyberRealistic Z-Image Turbo>finetuneLooks like slop lora merged into the model
>>107532316Seems quite pointless but what do I know about anything.
>>107532237>They have no choice. We will be able to make models as good as Z on local hardware soon.anon that local hardware will be an h200 that every mon/g/ol will buy for pennies after the ai bubble bursts. you can gen your goons and power your own steam turbine all in one easy step. i'm gonna use mine to power a sauna with floor to ceiling goon screens. the future is looking bright baby.
>>107532316>>107532324I like cyberrealistic's finetunes for other models. Some of the best realism models out there. Why you'd make a realism model for a model that's already the most realistic is beyond me.But hey, if it's possible, somebody was gonna do it, no matter what.