Discussion of Free and Open Source Text-to-Image/Video ModelsPrev: >>107570316https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/musubi-tunerhttps://github.com/kohya-ss/sd-scriptshttps://github.com/tdrussell/diffusion-pipe>Z Image Turbohttps://huggingface.co/Tongyi-MAI/Z-Image-Turbo>WanXhttps://github.com/Wan-Video/Wan2.2>NetaYumehttps://civitai.com/models/1790792?modelVersionId=2485296https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>Illustrioushttps://rentry.org/comfyui_guide_1girlhttps://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe|https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/r/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debohttps://rentry.org/animanon
For those who missed it, DFloat11 is making absolutely identical images as BF16, down to the pixel lmaohttps://huggingface.co/mingyi456/Z-Image-Turbo-DF11-ComfyUIhttps://github.com/BigStationW/ComfyUI-DFloat11-Extendedhttps://imgsli.com/NDM1MDE2
Blessed thread of blessed thread of blessed thread of blessed thread of blessed thread of blessed thread of blessed thread of blessed thread of blessed thread of b
>>107576396why did you pick your own slop images with the shitty lora that fucks up hands?
>>107576418GPT please translate me in vramlet language
im gunna traaaaaaaaaaiiiinnnnnnn
>>107576428you already know why https://rentry.org/ranfaggotmaintain the thread quality we have another tranny bake
>>107576396can we get a rebake? this has a bunch of off topic drama links in it
>>107576396>>107576437are you still going on with this flamewar shit? both of you should kill yourselves already
/ldg/- Limp Dick General
okay now post again but this time try not to sound like youre seething and malding
>>107576477sure
>>107576418Shouldn't this be physically impossible? Did the Chinese use some black magic?
>>107576487>Did the Chinese use some black magic?I have no idea how they did that but they did lmao, my guess is that models don't care if the precision is 16 bits or 11>t. didn't read the paper but should since it's really cool that they managed to pull this shit off
I don't understand why OneTrainer won't develop a web UI. Some people rent GPUs for training and can't see the window. I guess I could use the CLI mode but at that point I may as well just switch to musubi tuner.
>>107576437>https://rentry.org/ranfaggotlmfao ran is probably seething about this
>local keeps falling behindkek, saas wins again
>>107576476Oh no!
>>107576513>Flux 2 [max]what?>GPT Image 1.5 1stWHAT??? what is this shit? I feel I'm missing some episodes there
>>107576418does it work with loras tho
>>107576513>>107576523https://youtu.be/DPBtd57p5Mg?t=3lmao it changes the image during edit, even Kontext dev doesn't do that, NOOB
>>107576430Here's the chart.>>107576487Just read the paper, it's been out since April.
>>107576533no ;-;, we need some coding wizard to improve that node
>>107576497>>107576546nice
Oh no
>>107576418>>107576487https://github.com/LeanModels/DFloat11>How It Works>DFloat11 compresses model weights using Huffman coding of BFloat16 exponent bits, combined with hardware-aware algorithmic designs that enable efficient on-the-fly decompression directly on the GPU. During inference, the weights remain compressed in GPU memory and are decompressed just before matrix multiplications, then immediately discarded after use to minimize memory footprint.>Key benefits:>No CPU decompression or host-device data transfer: all operations are handled entirely on the GPU.>Decompression overhead is constant per forward pass and independent of batch size, making DFloat11 increasingly efficient at larger batch sizes.>DFloat11 is much faster than CPU-offloading approaches, enabling practical deployment in memory-constrained environments.>At batch size = 1, inference is approximately 2× slower than the original BF16 model, but the performance gap narrows significantly with larger batches.>The compression is fully lossless, guaranteeing that the model’s outputs are bit-for-bit identical to those of the original model.Note the second-last point. Might still be nifty for VRAM-constrained setups.
Thoughts on Reddit for learning about open source models?
>>107576513In what world is the new GPTslop better than NBP? I swear every single time a new corpo model gets released it rises to the top regardless of the quality of its outputs. These benchmarks are nothing but a huge meme.
>>107576570His ass is surprisingly shapelyAlso>retarded captcha is broken and locks you out of posting if you don't need verificationWhat the fuck is this
>>107576501basically on bf16 you let the model chose between -123 and +123 (2^8), but in reality they never go for such high numbers, they actually never reach the max of 8 (2^3), so basically if you tell them to go for a max value of 8 nothing will change and you basically won 5 bits, 16 bits -> 11 bits
>>107576640>In what world is the new GPTslop better than NBP?I have no idea dude, NBP has almost completly solved realism and this new OpenAI isn't even close to that
>>107576640They're just lying now because it's all a game to appease boomer shareholders
>>107576641>>retarded captcha is broken and locks you out of posting if you don't need verificationyou have to update 4chanX, they just fixed that, and yeah it was annoying as fuck
This is gold.
>>107576640Is this your first time realizing leaderboards are literal memes and should not be taken seriously
>>107576696looks like a baked potato to me anon...
>>107576702nah, that leaderboard used to be pretty accurate, like if I were to make my own rankings of image models it would be pretty close to that one... until today I guess
>>107576551vibe code it
>>107576396
Where comfy node for treillis 2?
holy fuck the new captchas are a nightmare.
>>107576868I like them desu, they tickle my brain.
>>107576868if you cant solve them in under 5 seconds it means you are most likely nonwhite and low IQ
shitty fingers were a part of the artists dataset
>>107576632do not redeem
chinese culture
WHERE IS TRELLIS 2 COMFYUI?Can someone give me comfy's phone number so I can ask him directly?
>>107577065May I have a cat box good sir?
Can I load two different wan models at the same time?Using SCAIL alongside the wan 2.2 low for example.
>>107577129
>>107577230i have not, willing to try.>Still no /ai/ board.
>>107577247https://files.catbox.moe/pezxmg.safetensorshttps://files.catbox.moe/hv28nx.json
>>107577294Based
Bruh, I'm browsing the thread about kuroba app being fucked over by the captcha.Then I see a familiar pattern.This fucking schizo might also be the dev for the chance app.
>>107577665It's definitely the same schizo. Surely this can be triangulated to find some more info that results in a permanent removal.
>>107577665>>107577689Did you mean to post this to another thread?
Neat, this breast slider lora works nicely.>>107577752No, it's about whatever schizo that's been plaguing us the past week.
>>107576513when z-image base releases, everything will be alright
tried out wan 2.6 and it can do a good blowjob. neat. praying this shit is announced open source as a surprise please.
Even at .75 weight, the v2 controlnet for zit obliterates the quality..>>107578110We're getting two chink models based on grok, this month? What a time to be alive.
What's the best UI to get into if you just want to make meme/porn videos and images?Hopefully easy to use. I have 16 vram if that's a factor.
>>107578342comfyui.whether it's easy or medium hard is a matter of personal perspective. it's not a smartphone app for the lowest common denominator.
I hate to install a whole node pack to use just one node and not have enough information of how to use many of them.Comfy is so fucking awful to use, wish we had a better UI for generating images
>>107578453https://github.com/comfyanonymous/ComfyUI?tab=readme-ov-file#comfyui-manager
>>107578453there's already good solutions for the cumfart poothon dependency hell problemhttps://github.com/FizzleDorf/AniStudioone of the anons is creating a c ui fully independent from poothon node hell, it's very solid. the only reason it's not in the op is because comfy schizo is pooping her pants because of its existence
>>107578479I wasn't asking you, schizo
>>107578482t. schizoldg will be free from poothon hell
>>107578488>t. schizot. guy who made 70 posts in one thread and got all of them nuked
>>107578491not your imaginary "schizo", schizoeverybody is fed up with comfy and tRan niggerbakes
>>107578500https://desuarchive.org/g/search/tnum/107570316/deleted/deleted/yeah that's totally not you haha you got me there
>>107578511yeah its different anons that got spam reported by ran (so meaning you troonjak schizo)
>>107578500>everybody is fed up with comfy and tRan niggerbakesStill don't get whats the purpose of those schizo rentries. should remove them already, they are not related to the thread topic
I love blonde foxesI'm going to use comfyuiit's that simple
>>107578581gen them on interfaces that are not complete piece of shit. comfy is trash
>>107578587Give me a reason to, and I willSo far, these other outdated/abandoned/schizo options haven't given me a reason to switch.It's really that simple.
>>107578601>Give me a reason to, and I willhe's too lazy for thathe'd rather delusionally hope that by spamming threads enough times, he will eventually get free labor and sell commercial licenses
>>107578612>he'd rather delusionally hope that by spamming threads enough times, he will eventually get free labor and sell commercial licensesi'm tranjak schizo and i make up shit by the way
>>107578617yeah anon, the 70-message spam that was nuked in that thread and 80 nuked messages in two other threads were totally not you, hahaha!
Guys we should support the violent african gangs that probably roam Trani's home city, and pay them to beat him up
>>107578652aren't posts like that against 4chan rules? sounds like instigation
>>107578659Nah, for it to be instigation I'd need to be talking about a human being
>>107578659I thought you were okay with that kind of stuff though? >>107559634
>>107578672retard>>107578674not me
>>107578680yeah haha the 150 nuked posts were definitely not you haha that's right you tell em
>>107578680We don't sign our posts here
>>107578110>praying this shit is announced open source as a surprise please.they won't even release the deprecated wan 2.5 lol
Why are Anistudio """fans""" so mentally ill?
it's sad that the schizo tranjak is so deranged that ani even stopped posting here. he used to be so happy sharing his progress but ldg is not what it used to be. he's on discord now
>>107578708Cool, you should fuck off and stuff your mouth with trani's cock as much as you wish
I think the schizo actually cried when the first nuke hit the thread. It's heartwarming.
>>107578717i disagree, we should be more united against ran's schizophrenia. she's very damaging to ldg with her niggerbakes
>>107578724Why?The links warning about some avatarfaggot reatards make your axewound ache THAT hard?
So... local diffusion, huh?
>>107578798we should use lmg until things calm down and niggerjak takes her meds finally
>>107578806Please keep making retarded postsMakes tge moment you get jannied even funnier
>>107578825based. ldg will overcome schizo tranjak meltdowns and will be better than ever
>>107578260v2 cnet is broken, use v1 while we wait for the new one
keep seething tRan. ldg will get through this
STOPREPLYINGTOIT
>>107578863niggerjak is crying
Where the fuck is the base model unironically?
>>107578875ranfaggot will just samefag if you don't reply to his threads as usual. just ignore the drama spitebakes
>>107578863>>107578875>>107578930Why are you replying to your own posts?
>>107578920post lora
>>107578941only one of those posts is mine, don't know about others
>>107578967Sure, I'll remember to act surprised when they all get deleted at the exact same time
>>107576396>Maintain Thread Quality>https://rentry.org/debo>https://rentry.org/animanonlocal diffusion?
"a woman is looking at herself in a mirror"Impressive.
>>107578981yes, local diffusion
>>107578986second one is hot
For zimage turbo is FlowMatchEulerDiscrete the best? FlowMatchEulerDiscrete and clownshark sampler some people sayIn my experience euler simple just works, there was one more similar combo that seemed like 3% more clear on details that I found but didn't want to change to in case it makes some other seeds or concepts worse.
get it before it's deletedhttps://civitai.com/models/1696517/ig-hottie?modelVersionId=2511469
>>107579031>3 comically bad example gens>trained on pony outputshmmm... nyo
https://apple.github.io/ml-sharp/>We present SHARP, an approach to photorealistic view synthesis from a single image. Given a single photograph, SHARP regresses the parameters of a 3D Gaussian representation of the depicted scene. This is done in less than a second on a standard GPU via a single feedforward pass through a neural network. The 3D Gaussian representation produced by SHARP can then be rendered in real time, yielding high-resolution photorealistic images for nearby views. The representation is metric, with absolute scale, supporting metric camera movements. Experimental results demonstrate that SHARP delivers robust zero-shot generalization across datasets. It sets a new state of the art on multiple datasets, reducing LPIPS by 25–34% and DISTS by 21–43% versus the best prior model, while lowering the synthesis time by three orders of magnitude.
>>107576396BOOBAAlso sick ass RoboCop art.
>>107579036Why do Indians refuse to understand the concept of GIGO?If you train a better model with shit images, it won't "uplift" the shit data, you would only ruin the better model.But I suppose their thoughts aren't more complex than bob and vagen.I also think this shit got worse last few months. Civit always had a sea of garbage loras, but I feel like I am not imagining the uptick of 512p, slop dataset loras.
>>107576513nice to see openai topping the charts
>>107579065ENHANCE
>>107579031actual garbage
Qwen with 8 step lora is about as fast as Z and is better with some prompts. But it also has sloppier visuals. What's your favorite Qwen realism lora?
>>107579113qwen cant do realism, it looks like plastic no matter what
>>107579031this is the worst thing ive ever seen in my life
wan 2.6 livehttps://www.youtube.com/watch?v=Dp-CMOo-kOc
>>107579135>implying i have that much vram
is 12gb vram enough for video generation or will I be severely limited?16 is entry level right?
Can't wait for base and exploring artists.
Does ComfyUI work well on Cachy+AMD GPU?
im training a lora of a building for the first time. wish me luck
wat do? 0 results on the internet.
>>107579135holyfuck! Does anyone speak Chinese? Will it be open-sourced? What are the requirements?
>>107579220For wan 2.2 24gb (as well as 64gb of sys mem) is needed for sane speeds.16gb will be better but don't expect it to run fast.12gb is doable I suppose if you can wait 10 minutes for a video but I find it not worth bothering.>>107579236>CachyThat's just Arch under the hood, I run it fine on Arch so yes>AMD GPUAyyymd sucks for AI>>107579237GL anon, unusual concept for a lora.
>>107579270what UI should you be using if you are running 16vram 32memram?can this do 1080 or 720p?
>>107579135>api
what if all these Chinese models are intended as a weapon to turn generations of Western men into useless gooners?
>>107579292it is what it is
>>107579292when we reach realtime 100% convincing generations of anything of any length civilisation will be over.
how can i do this shit in image into video shit in comfyui without looking like ass
>>107579340someone will just make an AI that identifies AI gensor someone will make a law that requires AI gens to have a watermark symbol or something
>>107579340>when we reach realtime 100% convincing generationsgive it a year or two>>107579374>someone will just make an AI that identifies AI gensthen someone will just make an ai that can avoid being identified, the computational cost of this cat and mouse game will not be worth itin the end, the internet will be viewed as a casino where everything is assumed to be fake and if you get tricked by an AI then thats just part of the experience
>>107579270>Ayyymd sucks for AIFuck, sucks that's still the case. Hoped they would have something going for it by now.
>>107579292i hope that's true because then they would have a final solution to the random access memory problem
>>107579292They've already achieved that with tiktok.
has anyone trained a style lora for z-image? how many images do you generally need? i was thinking of making one of this swedish comic book illustrator i like
>>107579574Animate it and it's basically a mobile game ad