Discussion and Development of Local Image and Video Models

Previous: >>108645344

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
Blessed thread of frenship
https://xcancel.com/sama/status/2046598595869331894#m
watch him releasing something that destroys the competition and then deletes it 6 months later like sora keek
bosy creb
>>108652860local bros.... how do we get good text?
>>108652860It's now hosted on Pentagon datacenters
>>108652877we type it in
>>108652877
I still can't fathom how good it is at text, it's so close it can almost do fucking poster movies with text that's just too small for anyone to bother to read kek
https://xcancel.com/harufit333/status/2046603596746436965#m
>>108652894whats the style?
Local diffusion?
>>108652908is dead
>>108652897that's fucking impressive, but that doesn't look like michael jackson at all in 1983 keek
why is anima so ugly
wonder if its mainly jeets here who are desperately sucking corporate dick. Really weird
>>108652928
>2021 year belongs to Biden
what did SOTA mean by that?
>>108652932You've done him
>>108652928
>not a single Trump
curious
>>108652932poorfags who have given up and submitted to the rental society
>>108652932>sucking corporate dick.you're talking about Alibaba? BFL? (((Lightricks)))?
>>108652928
>HOLLYWOODLAND
damn I didn't know that it was its name until 1943
>>108652902classic art lora, havent uploaded yet
>>108652928what is 2014 supposed to mean?
>>108652952I'm talking about a hostile ideological antic, that you only continued to perform with this retarded rhetorical question
>>108652979>a hostile ideological anticyou mean the Chinese Culture? (we still don't have Z-image edit btw)
How is klein KV at upscaling? Any better than regular 9b? Regular always warps and distorts parts of the image for me when upscaling.
>>108652972ebola
>>108652989oh yeah you're right, nice catch
>>108652979i wouldn't mind it if api models were actually good.watching someone shill nano banana as the second coming of christ gets old real quick.
>>108652986>"we dont have this one model, duh"
>>108652996>someone shill nano bananait's gpt image 2 though?
>>108652928
>1996
>"Atlanta 1999"
kek
will there ever be a proper anima? all i have seen so far is literal shit
images v2 is superior!
>>108652928
>1973
>The Mummy at a press conference???
>>108653025is it good at realism though? can't wait for ernie v2 to distill gpt image 2 instead of NBP next time kek
>>108652564>>108652621So apifags are horny seeing a sex scene worthy of a TV movie rated 13+ recommended by the mother-in-law...They have really no clue about local nsfw...
>>108653025i saw reddit posts of gbt-4 with same text capabilities. Not saying images 2 isn't even better, but some people seem to never have seen good api text before?
>>108653032>So apifags are horny seeing a sex scene worthy of a TV movie rated 13+ recommended by the mother-in-law...they can do actual porn anonhttps://www.reddit.com/r/Grok_Porn/
>>108653036>but some people seem to never have seen good api text before?in this place? probably not, they foam in the mouth so hard when they see API gens, they prefer to stay in their bubble and pretend that plastic skin and 1 line of text is the best humanity can do right now
>>108652860Their "useless" models are lovely. Claude is way ahead of their chatbot, but Sora 2 and GPT Image were revolutionary for the time they came out.
>>108653036
it does long text SO MUCH BETTER than Nano Banana Pro and any other model, especially dense, text-packed images
Prompt:
>i am thinking of making a terminal user interface, but for browing 4chan, generate an aesthetic, riced, beautiful high info packed terminal user interface image of such software
SAARS
>>108653020Learn to use cumfart trany... Anima is SOTA in 2026.
>>108653055wait this isn't a real screenshot???
>>108652894>>108652968Noice
How does a single general make some anons so upset? I don't get it.
https://files.catbox.moe/pe9hp8.png
>>108653065>Learn to use cumfart tranyeven anifart is shilling ComfyUi, times sure have changed
>>108653055>4curseskek
>>108653067
nope it's really that good
but they might nerf this model after some days as they always do
https://files.catbox.moe/pfb5in.png
>>108653063You are literally brown. Stop the larp
>>108653063lmaooo, local would never make such kino
>>108653055Why don't these sorts of gens get posted in the cloud threads? All the gens I see in those threads are complete ass. Like only the good API gens get posted here while the cloud threads have only slop kek.
>>108653055>>108653087It's very impressive indeed. Too bad they don't seem to care about aesthetics though.
>>108653025what in the world was the prompt for this
https://files.catbox.moe/rkwcl4.png
>>108653115"gem"
>>108653115>>108653125Prompt: a video game called "Hollow Knight: Basedsong" (which is a parody of hollow knight silksong) where the player is a basedjak, other npcs are wojaks, basedjaks, chudjaks, pepe frog, gigachad etc. set in a meme world)
>>108653125I believe it
>>108653055its over. desktop threads are dead
https://files.catbox.moe/uwz5hh.png
>>108653106
>>108653087how expensive is it per image?
https://files.catbox.moe/me3icm.png
>>108653150Check out hardware prices, then you know
yawn
>>108653163SAAR I NEED SPAM GPT PICTERS
>>108653150
they haven't released it in the API yet. you can try it in chatgpt if you have a plus or higher sub
>>108653150It only costs your dignity
>>108653169this marble looks shit desu
>>108653150
gpt-image-1.5 costs 8 bucks per 1 million input tokens and 32 bucks per 1 million output tokens
https://files.catbox.moe/8qs2w0.png
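To turn those per-token prices into a per-image number you need the token counts for a request, which the thread doesn't give. A minimal sketch, assuming the two rates quoted above and a purely hypothetical request size (the token counts are made up for illustration, not measured):

```python
# Prices quoted in the post above (USD per 1 million tokens).
INPUT_USD_PER_MILLION = 8.0
OUTPUT_USD_PER_MILLION = 32.0


def request_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Return the cost in USD of one request at the quoted per-token rates."""
    input_cost = input_tokens * INPUT_USD_PER_MILLION / 1_000_000
    output_cost = output_tokens * OUTPUT_USD_PER_MILLION / 1_000_000
    return input_cost + output_cost


# Hypothetical request: 1,000 prompt tokens in, 4,000 image tokens out.
example_cost = request_cost_usd(1_000, 4_000)
print(f"${example_cost:.3f} per image")
```

With those made-up counts it comes out to roughly 14 cents; swap in real token counts from your own usage to get an actual figure.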
All right, that's it, get out!
>>108653190
>>108653190
>>108653190
>BEATH NOTEYk-kawaii
>>108653169ok... guess i will wait... stuck with poopy old nano banana...
>>108653169
this checkpoint of images v2 that openai released is bad
apparently this is the worst, compute-efficient version, so it makes shittier results compared to the other checkpoints
the other checkpoints (tested on arena ai) had much better results (unfortunately i did not save those samples to compare)
https://files.catbox.moe/aajk9f.png
There are too many different families of models now. Could you please add a simple summary? I mean the pros and cons of each, and which one to use for what. Beginners will be totally lost, especially with the newer and lesser-known models like Anima, Chroma, Wan...

Something like:

SDXL
+ rather good for photorealism and painting
- often fucks up details and body parts
- Not so good for 2D

IllustriousXL
+ Good for manga and cartoon
+ millions of LoRAs available
- struggles with multiple characters

PonyXL
+ Rather good results and many LoRAs available
- if you don't use some very specific words in the prompt the result looks like shit

Flux
An improved SDXL but a lot heavier and slower to generate.

Flux Kontext
+ Very powerful to rework a picture, it understands what you ask it to do.
- Takes very long to generate

Z-Image Turbo
+ very good pictures and rather good consistency with the prompt
- very poor diversity, for the same prompt all pictures will look the same
>>108653055
i expect this model to get nerfed very hard after 2-3 weeks, when all the benchmark scores are up and subscriptions increase. shame, typical pattern and cycle with all these western frontier models.
>>108653038Not anymore, all the Reddit is old gen. Even spicy paid users having multiple refusal... Cloud is dead.
rofl which one of you set him off today
>>108653194pipe dream to expect street shitters to only shit on their own streets
https://files.catbox.moe/675o1o.png
>>108653216The local models meta rentry link in OP is outdated but gives you an entry point at least
the only good thing regarding openai releasing images v2 is that google will release their next nano banana pro version, which would be handy for making open source training datasets so that chinese models can catch up to images v2
>>108653216
I use Z-Image Base for my everyday needs, I use Spark.Chroma for the porn and sometimes Wan 2.2 when I want a "photojournalism style" photo
https://files.catbox.moe/8rfj0k.png
I love AI
>>108653235true, they've definitely been sitting on their next model
>>108653235>>108653258please discuss about API shit here instead >>108653194
>>108653194This is the way
>the only hope for localcucks is for china to release models trained on api outputsgrim
>>108653216
you can take out the first three desu. Z killed SDXL realism and Anima killed SDXL anime, officially
>>108653265no thanks
>>108652860
I mean it's probably not worth wasting bandwidth and compute on a pseudo-social media for images and videos, but they probably want to have an image model around. Their competitors Grok and Gemini have them.
Expect tight limits on free (if it even comes there) and non-pro tiers though as this is no longer a priority for them.
>>108653269Did comfy make the official nodes work with ace step xl yet
>>108653276do you really want to be "off topic" that bad huh?
edgy af https://files.catbox.moe/wgio3h.png
That troll would make sense if tongyi didn't prove that not using synthetic training data is the way to go
https://files.catbox.moe/3991wy.png
>>108653269no one cares what the base models are trained on. the whole point of open weights is to train them with whatever you want.
>>108653025kek
>>108653235using too much nanobanana pro images for training data is a very bad idea. just look how slopped imagine art pro 1.5 and v2 gens look.
>>108653265Kudo for the API thread!! Can't support cloudfags anymore
>>108653335It's inexcusable that google can't generate normal-looking average faces and body types.
>>108653348Catbox? Ni LoRA anon?
>>108653320
>>108653025
What is up with the grain/blotches it insists on adding to the image? I can see how in a photo prompt it'd help sell the realism, but here it just plain don't make sense
https://files.catbox.moe/ajvza1.png
>>108653348Great example of the mistakeface. All ai does this. ai doesn't know how to make "pretty, for a small town" or "probably in the top 5 in the motorcycle gang"
>>108653065>Anima is SOTA in 2026.grim
>>108653359is this slop supposed to be funny anon?
>>108653359this would crush on reddit, you should post there exclusively
Are there any Z Image turbo loras yet where the output isn't gigaslopped
>>108653384Learn to use tools instead of crying. Anima gens look awesome.
https://files.catbox.moe/5ropyv.png
>>108652928
I know we can cherry pick stuff and it's not perfect but this still mogs anything local so hard it's not even funny.
Can't wait for it to be censored to uselessness regardless in ClosedAI fashion.
Fuck you cloudfagsFuck you API cucksFuck you nigboFuck you julienFuck you rainy-girl poster
>>108653401>Anima gens look awesome.proof? would love to see these awesome gens you guys keep talking about
>>108653400Install LoRA optimizer, it's already month olds anon.
>>108653386>>108653396
>>108653420We're not your daddy anon, many discord are full of anima gens
>>108653434epic... a golden manbaby for you good sir
>>108653434Needs a Ben Garrison signature
>>108653409Amen
how long till openkikes nerf this
>>108652928>swastikaahahahah oh man
>>108653377it's just a matter of captioning data as such, anon. And you can do this locally.
>>108653422what's that
>>108653434Return to /adg/ and kys anon
>>108653369
it's all lora
>>108653377
>Great example of the mistakeface
lora alters face
>>108653460>>108653434>>108653359>>108653235Where are you running this?
>>108653488first time I heard of adg
catbox and litter are being a dildo
>>108653499I saw real people yesterday.
>>108653434give it a few soijak variants and ask it to make an original one
>>108653512What a coincidence.
>>108653512>>108653533it died after the 4chan hack
>>108653509Go ask in /adg/ cloud fag
>talks about ltx2.3
>boooooooooo get out of here chud, seedance is better
>talks about gpt image
>NOOOOOOOOO THAT ISNT LOCAL CHUD
>>108653509
in chatgpt web, some accounts have access to it
>>108653526
>>108653606
ltx 2.3 is as good as sora
and 4chan anons contradicting each other? Must be a day that ends with a "y"
https://files.catbox.moe/a1yc7c.png
>>108653634>cameltoebased beyond beliefhttps://files.catbox.moe/wbvhuh.png
>>108653653what style is this?
gpt-image-2 is great at terminals
"show me a screenshot of a mac desktop, large terminal window visible of the earth map in ASCII"
>>108653670>>108653697what do you not understand in "local"?-> >>108653190
>>108653697come on, make a original soijak
>>108653653please, make her do paizuri, I beg
>>108653662mix of base anima artists
>>108653730already nerfed from what I can see, sorry to tell you this, but every OpenAI image and video model are only good for memes and nothing else
>>108653700Yoji Shinkawa?
>>108653730
Her eyes are lighter.
I am convinced that these models are instructed to make vaguely resembling versions of real people, with some details changed, when prompted to draw them.
>>108653743anon, be more specific or post meta
>>108653730
+1 for loving ldg
-1 for looking nothing like sweeney
>>108653764>>108653747looks like they nerfed this model with public figures/celebs already
>>108653793what if you edit her with black skin but tell ai to make her skin white
>>108653777>>108653764She also doesn't have a flux chin
>>108653764
>I am convinced that these models are instructed to make vaguely resembling but some details are changed versions of real people when prompted to draw them.
if only local was better at deepfaking
>muh loras
too much drift and looks like shit
>muh flux klein
yea, if you dont mind plastic reptile skin
the local deepfaking scene hasnt evolved since 2020
>>108653793This has been a thing since GPT-Image 1. Is there any evidence it initially allowed celebrity likeness? More likely it launched with similar filters in place already.
>>108653803das rayciss
>>108653817silly post
>>108653730
>>108653764
Same with Qwen or Klein. The more you change the image via prompt, the more they converge towards sameface.
If you put her in different outfit or make her naked, you pretty much retain face structure, but as soon as you change her pose it diverges away from the original character.
I think that's just an artifact of edit models.
>All this crying about API vs local
>Meanwhile API is just a glorified text and screenshot generator, nothing practical can come from the model they're releasing because it's caged
>APIcucks have zero control over what they generate. Like a style or gen? No guarantee it stays or that you can even use the model at all
>>108654006How do I purchase this
>>108654020you'd have to get drafted
>>108654036He looks too young to be drafted.
...guys I don't even know what is going on or what is needed for these python scripts. I am that dumb. I'm not able to just run the scripts as posted in my Python shell, I'm like not even smart enough to ask the right questions. Just this whole git/pip thing eludes me. Trying to run WAN locally, have used StableDiffusion locally fine
>>108654069aint no fortunate son
>>108653754Yes indeed
>>108654078
Desu just let the chatbot of your choice guide you through the installation of ComfyUI: tell it you are tech illiterate and need things explained verbosely.
Once you succeed at that, just open the template for Wan 2.2 text to video or image to video and roll with the defaults.
I don't know what else to say if you are at this level.
>>108654078
>install comfyUI which is as easy as extracting shit out of a folder
>install comfyUI manager (also simple)
>open a wan workflow
>download the missing files
>???
>PROFIT
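For anyone stuck on the git/pip side rather than the portable zip, here is a rough sketch of what the manual route boils down to, written as a Python script so each step is spelled out. The ComfyUI repo URL is the one in the OP; the ComfyUI-Manager URL is an assumption (it is not given anywhere in the thread), and the helper names are made up for illustration. It also assumes git is installed and that a PyTorch build for your GPU is already in the same Python environment, which the ComfyUI README covers.

```python
import subprocess
import sys
from pathlib import Path

COMFYUI_REPO = "https://github.com/comfyanonymous/ComfyUI"  # from the OP
# Assumption: this repo URL is not given anywhere in the thread.
MANAGER_REPO = "https://github.com/ltdrdata/ComfyUI-Manager"


def install_commands(parent_dir: Path) -> list[tuple[list[str], Path]]:
    """Return (command, working_directory) pairs for a manual install."""
    comfy_dir = parent_dir / "ComfyUI"
    return [
        # 1. Download the ComfyUI source into parent_dir/ComfyUI.
        (["git", "clone", COMFYUI_REPO], parent_dir),
        # 2. Install its Python dependencies with the current interpreter's pip.
        ([sys.executable, "-m", "pip", "install", "-r", "requirements.txt"], comfy_dir),
        # 3. Custom nodes (like the manager) are cloned into custom_nodes.
        (["git", "clone", MANAGER_REPO], comfy_dir / "custom_nodes"),
        # 4. Start the server; it prints a local URL to open in the browser.
        ([sys.executable, "main.py"], comfy_dir),
    ]


def run_all(parent_dir: Path) -> None:
    """Run every step in order, stopping at the first one that fails."""
    for command, cwd in install_commands(parent_dir):
        print("running:", " ".join(command), "in", cwd)
        subprocess.run(command, cwd=cwd, check=True)
```

Nothing runs until you call `run_all(Path("some/folder"))` yourself. Once the server is up, opening a Wan workflow and letting the manager fetch the missing files is the same as in the steps above.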
so you can blur a compressed image and then let the diffusion model recreate it to clean it up, but are there any models that let me do this with videos?
>>108654124Why are you putting hidden cameras in the women's batroom?
>>108654078
>have used StableDiffusion locally fine
Try Forge-Neo. It's basically the same as old A1111 sd-webui but with modern model support.
>>108654134Where else would you put hidden cameras?
>>108653730>skipping anistudiohow rude of Sam OpenAI
>>108654114>>108654116I have ComfyUI, I must have handled this before. I think I'm figuring it out, thanks
>>108654036>>108654089Model?
HAPPASEXO
>>108654152I am not allowed to answer that question outside of /pol/.
>>108654190klein
>>108654134no i want to try using it as a method to smooth out the distortions in my generated videos
any uh good boorus for realism ai pornography?
>>108653972
>screenshot generator
Entire swaths of artists (concept, advertisement, UI design) have been obliterated completely with v2 lol
You can keep like 1 art director guy who will do everything now
>new api model over performs
>ldg spergs out
>api model gets cucked and nerfed within a week
>ldg spergs out
you guys fall for the same bait every time
>>108654268Why? Because ai told you so "this is good"? Based on what metrics? When it was tasked to generate an "beautiful ui" for 4chan - where was the beauty? It was a literal bios ui.
>>108654319Based on me having functioning eyeballs
>>108654323Are you seriously suggesting an image can replace concept of an user interface which needs to be built as being user friendly? How? You never have created anything that people actually LIKE to use in your entire life. Using implies interaction. And again where is this "beautiful" as instructed by the prompt? >>108653055 Glad I dont have your eyes.
>>108654340
You can tell it exactly what you want, you fucking idiot, and tell it to edit what you dislike.
Anon demonstrated the most basic, most braindead prompt.
so anima is pretty much replacing illustrious and stable diffusion right? this model fuckin' shits all over every other local model I've used with the exception of flux MAYBE because I have no idea if flux was really good or not considering how slow it was I never used it.
>>108654353klein has its uses, unsure about flux2 dev, I have the q4_k_m of it but... nyo
>>108654352Go on, create something. You must be truly a powerhouse of imagination and taste
It’s almost been 10 days, why do they keep bumping that thread? They don’t have collage anymore, they don’t have Anchor, scraps of a glorious past. I’m saying this as a former anon who was there during its golden days.
>>108654372I don't have a chatgpt subscription lol
>>108654353it's small, fast, trains well, has great prompt adherence, and can do low/medium res realism already. it is going to replace a lot of models that people have been stubbornly clinging to.
>>108654190yes its flux klein edit
>>108654353Anima is replacing Illustrious, just like /ldg/ is replacing /adt/ for anime.
>>108654392Yea because you're talking out of your ass
>>108653063kek
does ldg stand for lodestonesGODS
>>108654389I don't think about them at all
>>108654414but you should
>>108653063>can't generate porndoesn't matter how good those models are they're missing an entire leg to stand on.
ldg is the best ai gen thread on the chanz no bullshit just
>>108654389They have to move to /jp/ or /c/, i told them long ago but they didn't hear me.
>>108654418why?
>>108654389Faith
>>108654433indeed we iz
cozy
Is it possible to run ComfyUI Desktop from one drive, but draw its models/checkpoints/vae from another drive?
>>108654690
>desktop
Idk if it's the same but portable (which you should be using anyway) has extra_model_paths.yaml
Edit it like this, just add your own folders
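The screenshot attached to that post isn't reproduced here, so as a rough sketch: the file follows the layout of the extra_model_paths.yaml.example that ships with ComfyUI. The section name and every path below are placeholders (assumptions, not from the thread); point base_path at the folder on the other drive and list only the subfolders you actually have.

```yaml
# Sketch only: section name and paths are placeholders, adjust to your drives.
comfyui:
    base_path: D:/ai-models/
    checkpoints: models/checkpoints/
    vae: models/vae/
    loras: models/loras/
```

Each entry is resolved relative to base_path. Restart ComfyUI after editing so it picks the folders up.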
its up
https://www.youtube.com/watch?v=sWkGomJ3TLI
>>108654690just do some symbolic links
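The symlink route can be sketched like this: the models folder inside the ComfyUI install becomes a link pointing at the real folder on the other drive. The demo runs entirely inside a temp directory and every folder name in it is hypothetical; on Windows, creating symlinks may need Developer Mode or an elevated prompt.

```python
import os
import tempfile
from pathlib import Path


def link_model_folder(real_folder: Path, link_path: Path) -> None:
    """Create link_path as a directory symlink pointing at real_folder."""
    if link_path.is_symlink() or link_path.exists():
        raise FileExistsError(f"{link_path} already exists; move it out of the way first")
    os.symlink(real_folder, link_path, target_is_directory=True)


# Self-contained demo in a temp directory; both folder names are hypothetical.
with tempfile.TemporaryDirectory() as tmp:
    # Stand-in for the folder on the other drive that really holds the files.
    other_drive = Path(tmp) / "other_drive" / "checkpoints"
    other_drive.mkdir(parents=True)
    (other_drive / "model.safetensors").write_bytes(b"")

    # Stand-in for the models folder inside the ComfyUI install.
    comfy_models = Path(tmp) / "ComfyUI" / "models"
    comfy_models.mkdir(parents=True)
    link_model_folder(other_drive, comfy_models / "checkpoints")

    # Listing the link shows the files that live on the "other drive".
    linked_files = sorted(p.name for p in (comfy_models / "checkpoints").iterdir())
    print(linked_files)
```

For a real install you would move or rename the existing models/checkpoints folder first, then call `link_model_folder` with the real paths.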
>>108654736bigma anon? is that you?
>>108654748Yes and a new model architecture. That's just 1.2 million steps (about 2 weeks) on a 5090. Qwen text encoder + contrastive flow match, block skipping and residual learning. 1B model.
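For a sense of scale, the two numbers in that post imply a training throughput of roughly one step per second. A quick back-of-the-envelope check, treating "about 2 weeks" as exactly 14 days (an assumption; the post only gives the rough figure):

```python
TOTAL_STEPS = 1_200_000          # "1.2 million steps" from the post
TRAINING_DAYS = 14               # "about 2 weeks" from the post, taken as exactly 14 days

SECONDS_PER_DAY = 24 * 60 * 60
total_seconds = TRAINING_DAYS * SECONDS_PER_DAY

steps_per_second = TOTAL_STEPS / total_seconds
seconds_per_step = total_seconds / TOTAL_STEPS

print(f"{steps_per_second:.2f} steps/s, {seconds_per_step:.2f} s/step")
```

Batch size isn't stated, so this says nothing about images seen per second, only optimizer steps.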
>>108654732>entire team is east asianamerica is healing
>>108654770based. what VAE?
>>108654770have you checked out nanosaur
>>108653055I mean there's no way it's just one model though, this thing has to be some kind of multistep pipeline that doesn't even have a real analog in terms of open models
i always believed in bigma
>>108654842Yea there are dozens of control instances for sure.
>>108654828
e2e-qwenimage-vae
>>108654838
Haven't heard of it, doesn't seem to show up on Google and the Reddit thread that may be it is deleted
>>108654859you only believe in bigmacs
https://old.reddit.com/r/StableDiffusion/comments/1srrj72/unpopular_opinion_but_the_amount_of_low_effort_ai/
unironically why do you care so much about what plebbitors post