The US will Ban that Card Edition

Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106790544

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Maybe it would be a good idea to take real, very high-res photos of human skin with a lot of detail, run them through some 0.1-0.3 denoise of qwen image to destroy a lot of the detail and get the plastic skin, and then train a lora where you teach a model to basically take the plastic versions of images and try to "add realistic detail" where you use the original images as the goal?
>>106793018 >qwen image *qwen image edit
>>106793018 it sounds possible but perhaps not easy. maybe you can also use regularization images, haven't used that on qwen trainings yet tho.
>>106793018 >https://github.com/MIGHTYEZ/Inversion-DPO This is possible but most people can't afford it. It's called DPO, and you need to have the model in memory twice to use it: one copy with frozen weights and one unfrozen copy that is being trained.
>>106793085 How do people train qwen image edit loras then?
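Rough numbers on the "model in memory twice" point, as a sketch only. It assumes a 20B-parameter model (the size quoted for qwen elsewhere in this thread) and counts weights alone; optimizer state, activations, and the text encoder are ignored, so real usage would be higher. The function names are made up for illustration.

```python
# Bytes needed to store one parameter at a few common precisions.
BYTES_PER_PARAM = {"fp32": 4.0, "bf16": 2.0, "fp8": 1.0}


def weight_gb(params_billions: float, dtype: str = "bf16") -> float:
    """Memory for the weights alone, in GB (1 GB = 1e9 bytes)."""
    return params_billions * BYTES_PER_PARAM[dtype]


def dpo_weight_gb(params_billions: float, dtype: str = "bf16") -> float:
    """DPO keeps a frozen reference copy next to the copy being trained."""
    return 2 * weight_gb(params_billions, dtype)


if __name__ == "__main__":
    for dtype in ("bf16", "fp8"):
        single = weight_gb(20, dtype)
        double = dpo_weight_gb(20, dtype)
        print(f"{dtype}: one copy {single:.0f} GB, policy + reference {double:.0f} GB")
```

Under those assumptions the two copies alone already exceed any single consumer card at bf16, which is consistent with the "most people can't afford it" remark.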
>>106793018That's more or less how anon trained the clothes remover LoRA, so yes.
any way to speed up qwen edit when using multi image input? shit is like 7-9x slower
>>106793018 It should be done with a wide range of images instead of strictly skin. You would have a "desloppa" LoRA. I'd be surprised if that's not a thing already.
>>106793127 ur spilling into ram, look at your vram usage
>>106793131 deslopper would be huge
>>106793127 just use the 8 step qwen image 2.0 lightning lora, when I dont have image2 bypassed it's still around the same speed
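A minimal sketch of the bookkeeping side of that idea: pairing each degraded ("plastic") image with the original it came from, so the degraded one is the input and the original is the goal. It assumes the degraded copies keep the same file stem as the originals; the `control`/`target` key names are hypothetical and would need to match whatever the trainer you use expects. The degradation pass itself (low-denoise img2img) is not shown.

```python
from pathlib import PurePath


def pair_by_stem(originals, degraded):
    """Match degraded images to their originals by file stem.

    originals: paths of the real, detailed photos (training targets)
    degraded:  paths of the low-denoise img2img outputs (training inputs)
    Originals without a degraded counterpart are skipped.
    """
    by_stem = {PurePath(p).stem: p for p in degraded}
    pairs = []
    for target in sorted(originals):
        control = by_stem.get(PurePath(target).stem)
        if control is not None:
            pairs.append({"control": control, "target": target})
    return pairs


if __name__ == "__main__":
    real = ["real/skin_001.png", "real/skin_002.png", "real/street_003.jpg"]
    plastic = ["plastic/skin_001.png", "plastic/street_003.png"]
    for pair in pair_by_stem(real, plastic):
        print(pair["control"], "->", pair["target"])
```

Matching on the stem rather than the full name lets the degraded set use a different extension than the originals.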
>>106793139100% gpu and 22gb ram in python kekW I'd say you're right>>106793150I am using it, but I must be right on the limit with 1 image. Or there's some rocm faggotry failing me.
>>106793093???what he's saying and what you're thinking are not the same thing lil bro
>>106793141we can only hope kek
>>106793186I assume to train qwen image edit loras to for example remove clothes you train it in a similar way to what I initially described?
Do any Qwen Edit loras even exist?I'm talking about Loras to make this or that look better
>>106793054no surprise, chroma has very attractive girls (typical booru / internet standards) and i did prompt sexy clothes> large breasts, aqua eyes, very long hair, black hair, hime cut, black latex jacket, cropped jacket, black micro bikini top only, black micro shorts, thong, highleg panties, fishnet pantyhose, fishnet legwear, makeup, black choker, jewelry, sunglasses on head
>comfyui first model load and inference still overfills vram and only on second gen onwards does it allocate the proper amount
>>106793258I love that style of bikini and material too. The shiny wetlook is the best.
>>106793227I've been told qwen image loras may work with qwen edit
>>106793336ty, anon
vibevoice + a snip of JC audiohttps://voca.ro/1n9rSVHWiD5b
>>106793398I disagree with the message. Literally, nobody is interested in controlling your miserable life. For (You)
What were silveroxide's chroma lora experiments? Were they literally just flux models that he converted to work with Chroma?
>>106793445openAI is absolutely doing social engineering
>>106793003blestest flanzone of threads! ;D>wan 2.1>lora: wan hip swing twistsauce: https://tensor.art/models/906123123449986583byeeee<33333
>>106793398anyone who is stupid enough to believe any capitalist is doing anything "for good" deserves what they get
>>106793488wait, requesting:catbox\prompt\models\etc for >>106791094;_; so cute
>>106793398based
>>106793498MUH CAPITALISMtell me you have a tv watcher\brainwashed take without actually telling me kekoogie boogie money haunting you??financials are the DIRECT reason people actively try to innovate\participate in a freemarket economy it is the literal driving forceyour qualms are not w\ the forces of capitalism themselves (intentional) but w\ crony corporatists rigging the game for themselves via corporate\gov welfare state tldr;stop posting libtard <3your are uninformed (not libertarian) at bestor basically dumb\stupid at worst
>>106793498how much for sex?
>>106793518REKT
JC Denton on Sam Altman taking money for Sora 2 then censoring after taking $200 from people:https://voca.ro/14gLLirZYX5o
>>106793488>>106793518I never thought I could like a namefag tripfag
>>106793467use deepseek insteadhttps://files.catbox.moe/7yoy86.png
>>106793518woops ultra triggered the lib.. who hilariously enough calls other people libs
>>106793003 >>106793028 >yea the "release" chromas are chroma 1 base and 2k, what he is working on is radiance (model architecture change without VAE) ok, googled it... >Release branch: >Chroma1-Base: This is the core 512x512 model. It's a solid, all-around foundation for pretty much any creative project. You might want to use this one if you’re planning to fine-tune it for longer and then only train high res at the end of the epochs to make it converge faster. >Chroma1-HD: This is the high-res fine-tune of the Chroma1-Base at a 1024x1024 resolution. If you're looking to do a quick fine-tune or LoRA for high-res, this is your starting point. >Research Branch: >Chroma1-Flash: A fine-tuned version of the Chroma1-Base I made to find the best way to make these flow matching models faster. This is technically an experimental result to figure out how to train a fast model without utilizing any GAN-based training. The delta weights can be applied to any Chroma version to make it faster (just make sure to adjust the strength). >Chroma1-Radiance [WIP]: A radical tuned version of the Chroma1-Base where the model is now a pixel space model which technically should not suffer from the VAE compression artifacts. So Chroma1-Base is somewhat different from Chroma 50?
>>106793562 base is v48
>>106793549libertarians aren't calling for the gov to control money supply AKA STATISTS\commiesif you weren't chinese you would understand
>>106793562 49 and 50 were meh versions that turned into HD. 50 Annealed is an abomination.
>>106793518The father of capitalism, Adam Smith, would say you're an idiot and our system sucks.If you don't like books, ask AI.
>>106793518surprisingly based
>>106793529isn't it open source?https://www.youtube.com/watch?v=2SvPfkXs3Nkhttps://files.catbox.moe/mg1ze6.png
Does the newer Qwen Edit model need its own lightning lora or does the old one work?
>>106793586libertarians =/= libtardsalso, almost non-existent
>>106793609 works for me
>>106793623 also using this one
>>106793609 the regular 8 step qwen image 2.0 lightning one works better than the 1.0 edit one, at least in my experience. use that, 8 steps.
>>106793623 The edit and base loras are interchangeable?
>>106793591>sucksyet all the spoils\benefits directly produced by it are how you are able to whinge\complain online keknice logicno system is perfectbut historically the safest\smartest solution has always been to let the MARKET decide what is acceptablevote with your walletyou actually vote every single day>dont like A, B, or C>don't buy from them, dont support themeven walmart bends\noodles at the 10% margin loss mark, hence, all the surge of 'made in usa' items 'sweatshop free' items ect during that specific trend years agoit doesn't take much for people to make real-world changesbut people would rather seethe and act like demoncraps that obey their (((tv))) programming overloads you know im right
the asian girl is sitting on a beach chair in a white bikini. to her right is a chibi plush doll of Hatsune Miku.what a cool model.
>>106793613>almost non-existentonly if you believe the blatantly throttled viewcount numbers on 'YOU' tubeeven elon tested this theory and laughed >example: ron paul viewcounts on YT are in the 3-4 figures no matter what>one shitpost show on twitter\x = viewcount in the hundreds of millions overnightin response, fb, insta, google\youtube say:'w-w-wait! it was the FEDS who MADE US do all that!!!"the consolidation of the open-internet webtraffic was a mistake and will bring about its end
How come nobody posts videos anymore?
>>106793709 I'm a vramlet amdlet and I assume this would take me 30 min to gen so I don't bother
By the way, when I use seedvr2 as an image upscaler with cfg 0.5 and 3 steps instead of 1, I get significantly better results, at the expense of time, of course. I'm still surprised that it works with more steps.
>>106793636 idk, I'm on Edit now https://files.catbox.moe/bklho3.png
>>106793726 What repo are you using that lets you adjust the cfg and steps?
>>106793709it takes substantially more time for me to render\edit\crop\salvage a wan2.1 vid than to show my illustrious waifus that i can doink out in 21 seconds per 1080p image hehe
>>106793736lora?
>>106793709 the kino sora era may be over but its effects are everlasting
>>106793744this >>106779689
>>106793761<333333333333333333333
>>106793762SEXOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOO
>>106793738 numz repo. cfg had to be adjusted a few lines to make it accessible via node, and the step value is hardcoded deeper in the gen script.
>>106793762Lora or is this built in?
I obtain an erection every time I find a new artist to train on.
>>106793398this is what i made https://vocaroo.com/18yqEZuy1epV
>>106793790>Find artist to train on.>Almost done building dataset>Coom>Lose interest in trainingevery time. On the rare occasions I do go to the point of training. I maintain a semi for like the entire training period.
>>106793782https://civitai.com/models/1822984/instagirl-wan-22
>>106793774 Ah, okay. I can see "steps" in generation.py. I can just hardcode that to 3 and it should work, yeah? I can also change cfg to 0.5 in the shared values?
>>106793809I've seen this before but I still don't quite understand "what" it is.
>>106793726any examples/comparisons? curious
can you guess what game this voice is from? https://vocaroo.com/1nqDuCh3fWUi
>>106793827 it's a wan lora, you can use it for t2v or use it to generate a starting image and animate. it comes with an example workflow
>>106793832vibe, can tell from the sting at the start
>>106793832I really have no idea but it gives me another hand touches the beacon vibes.
>>106793844switch sage attention off
>>106793762too skinny
>>106793844one billion dollars please mr softbank.
>>106793844Tile de VAE faggot
>>106793709>How come nobody posts videos anymore?TrAnies in the thread can't gen videos and basedGODS are generating i2v sex of themselves and hot women without wanting to self dox
>>106793811 Just in case anyone was curious: yes, this worked.
>>106793762>women with big eyes roubd facdand chubbyYuck, they are too gay
>>106793762I fucking love chubby women so much and I unironically hate Ozempic because I've seen a noticeable and sharp dropoff of fat women "retiring" and reemerging with saggy ozempic bodies..
>>106793586libs as in liberals, no one gives a flying fuck about libertarians. libertarians are as child-brained as anarchistsus government is a neoliberal government, an ideology it tries to foist upon the rest of the world, but only the vassals have fully accepted it and are getting ass pounded by it simping for capitalists is simping for your owners like the good little wage slave you arebut i forget, 4chan is basically cia/mossad/mi6 whatever anyway so of course you'd simp for your masters
>>106793916>wanting sound fiscal responsibility+ a properly maintained currency is child-brainedyou simply do not know what you are talking about>gov does bad thingsstop giving them so much authority
>>106793930k you've got to be at least 18 to be here kiddo
who cares
local diffusion?
sure is glow in here
>>106793831maybe later, still exp
>>106793961you cant even see your own nose on your faceyour are literally under direct control of corporate interests if you want more gov\ hate capitalism\ etcyou are a useful idiot, sad.you can only double-downyou can only see the splinter in your brothers eyes, while ignoring the log in your own eyesa fool who cannot be taught
rocketgulp needs another ban
>>106793978nta, but I just checked and I can see my nose. What kind of fake news is this?
debo can you please just return to sdg and stop bringing up your libtard tirades?? you always do this
>>106793978read a book dumb fuck
>988can't argue with \ kill an idea? BAN HIM! ;D>>>>/leddit/ (idiotville)>>106793990my favorite fake news was when they tried to say that the corn-syrup was better than regular sugar in the coke hahahow can these fucks even sleep at night??>>106793958yes ma'am! my poor gpu will catch on fire any day now hehe(too low of stepcount here)
>>106793709People always eventually lose interest in videos as it takes too long and you can only do so much with a few seconds of video. People so inclined are making porn with it but you can't post it here.
qwen edit is literal magic. it's a meme maker, can make characters interact from diff images, can repose a character, change their clothes, remove elements, make them nude, make them thin or fat, etc.
>>106794025skittles sexo
>>106793709I don't play with video that much. It's silly fun though.
>>106793709sora 2 revealed how much of a toy wan is
is there a qwen edit for video?
>>106794042>censora 2
Why didst thou leave, Debo? I simply seek answers, my friend."
>>106794044vace I guess, but it's not quite the same.
>>106794053>unable to trick the machine
>>106794063we're going to have to go more jewish on this one
>>106794063>dudes finger is literally on fire
>>106794069back in the oven
>rocketgurlp*** and n*gbo are fightinglove 2 see it
>>106794061>amazing anon, today's trick is tomorrow new censorship!
>>106794061post nsfw sora stuff then
>>106794096He meant tricking it in showing... gasp!... ankles!Perhaps even the nape of the neck!
>>106794106kling will allow nipples to slip through if youre sneaky enough
>>106794120Yeah I was there when the cross trick worked, anything you got since they patched that is kind of sad.
>>106794061>training the censorship>paying for this privilege
>>106794120pls saar, just a sliver of the bob saar
>>106794155Literally worse than a jannie
>>106794158blody bastards, no bobs, no vagane, no sexi sex
>>106793529>after taking $200 from people:Was it really $200 to get in? Fucking kek
>>106794175The best part is there was absolutely no way that wasn't their intent from the start. They knew fully well people would and could make copyrighted characters with it and chose to let it fester for a day or two before pulling the rug. Sometimes I hate the AI industry because of how much overlap it has with crypto.
>>106794175it's freethey're certainly paying with their data, however
>>106794175no, but 200 is their priciest tier for gpt in general (outside of api stuff)
>>106794158Kek. Yeah, it's always fun to see how far people will go to push the boundaries.That being said, /b/ exists to subvert.
>>106794199Literally haven't been on /b/ since like 2010.
>>106794084>fightingpostcard seems a bit spicy but generally nice
https://vocaroo.com/1e6ioSLSpBbv
it's truly a shame vibevoice never got training code
>>106794270training for what? it takes what ever the sample is and gives it back to you. if you want a girl yelling you just use a sample of her yelling. thats why video game wavs are so good cause you can grab expressive characters with different emotions in the same voice
With the AI boom, I no longer look forward to the weekend, for I know there won't be any model releases during it... May monday come as soon as possible and may we get real time video gen by the end of next year.
>>106794281i want to feed it a bunch of shit from DLSITE and make it more robust for nsfw lol
>>106793321 you're in luck since apart from chroma and the chroma radiance snapshot that I am posting, wan has good support. as do various sdxl derivative checkpoints >>106793562 as the other anon said it's v48 but officialized
>>106793003 >want to upgrade to using SSD for local >need an AM5 motherboard >need to replace 32GB of old ram
>>106794293 >as do various sdxl derivative checkpoints oh I know, some of the pony offshoots do it reaaaaalllllyyy well.
Reminder: a huge speedup is on the table with TensorRT, which for some reason died. https://github.com/comfyanonymous/ComfyUI_TensorRT
>>106794294 What does ssd have to do with am5?
>>106794314 >ComfyUI TensorRT engines are not yet compatible with ControlNets or LoRAs.
>>106794341 The catch with TensorRT is that you have to spend a couple hours or whatever it was to get the optimized version of the model, but you can easily merge loras into the model and then optimize that to support any lora.
>>106794294 idk what limitations you have but maybe you could just use a pcie slot for the ssd?
>>106794294 >need an AM5 motherboard for SSD ??????????????
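For anyone wondering what "merge loras into the model" amounts to: each LoRA is a low-rank pair of matrices whose product gets added onto a base weight, scaled by a strength. A toy sketch in plain Python follows, using tiny lists instead of real tensors. It assumes the usual W + strength * (B @ A) form, with any alpha/rank scaling already folded into `strength`; it is not the code of any particular UI or of ComfyUI_TensorRT.

```python
def matmul(a, b):
    """Plain-Python matrix product of two lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def merge_loras(weight, loras):
    """Bake LoRAs into a base weight matrix.

    weight: base matrix, shape (out, in)
    loras:  iterable of (B, A, strength), with B of shape (out, rank)
            and A of shape (rank, in)
    Returns a new matrix W + sum(strength * B @ A); the input is not modified.
    """
    merged = [row[:] for row in weight]
    for B, A, strength in loras:
        delta = matmul(B, A)
        for i, row in enumerate(delta):
            for j, value in enumerate(row):
                merged[i][j] += strength * value
    return merged


if __name__ == "__main__":
    base = [[1.0, 0.0], [0.0, 1.0]]
    style = ([[1.0], [2.0]], [[3.0, 4.0]], 0.5)  # rank-1 LoRA at strength 0.5
    print(merge_loras(base, [style]))
```

Once merged, the result is an ordinary weight matrix, which is why a merged checkpoint can go through an engine build that has no LoRA support. The same sum also shows why stacking several LoRAs at full strength can fight each other: every delta lands on the same weights.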
So I assume ovi was shit and a nothingburger?
>>106794354 I don't mind waiting a few hours, but the lack of flexibility of being forced to merge loras is a huge no unless I was doing the same thing for thousands of gens.
>>106793208nice
Speaking of LoRAs. How does everyone here go about using multiple LoRAs for wan? I find if you add more than two the quality shits itself.
>>106794390 If you dont mind waiting then let it optimize the model for each lora that you will be using a lot when you're not genning and thats it
>>106793709 I don't want to play with Wan anymore, I tasted the forbidden fruit (Sora 2) and I don't want to go back to mid
>>106794374 seems to have video capabilities/aesthetics somewhat worse than wan and seems to have basic tts (far from one of the better open sauce tts) but I don't think anyone here tested it very extensively yet. personally I don't see it as a high priority for myself.
https://www.reddit.com/r/StableDiffusion/comments/1ny8971/its_not_perfect_but_neither_is_my_system_12gb/ not bad at all
>>106794374 only 5090s can run it at the moment
>>106794401 mostly just loading them in the rgthree stacker but i loaded them normally before as sequential nodes too. it did not seem any worse than loading multiple loras on other models, but of course certain combinations work poorly as always
>>106794374 i briefly tested it with their official inference code. it's wan 5b, but worse, with mediocre TTS strapped on. it was 100% released to cash in on the sora 2 hype
>>106794374 >a wan 2.2 5b finetune is a nothingburger? of course
>>106794430 Yeah I find I have to walter white test tube meme the perfect weight at the right step to make sure the LoRAs don't cancel each other out and make body horror.
corr
>>106793562 >where the model is now a pixel space model which technically should not suffer from the VAE compression artifacts does this mean perfect eyes, hands, fishnets, distant objects?
>>106794431 from the samples people posted i did notice that perhaps it gives more control over facial expressions. maybe that's of some use to someone
>>106793498>anyone who is stupid enough to believe any capitalist is doing anything "for good" deserves what they getThe problem is Capitalism, not the people. The majority genuinely believe they are doing great things. Unironically. You know you shitlibs think you're doing right, don't you?
>>106794454would>>106794478oh yes baby go ahead & donate 8^)
does qwen image have prompt adherence as good as qwen edit?
>>106794481shitlibs are the idiots who believe in capitalism dummy.. they're the brunch crew and the pearl clutchers and the cheerleaders for the whole rotten system.. they only cry when the chickens finally come home to roost
>>106794505They think they are doing what is best.
>>106793877thisi wish i could train wan loras
https://www.reddit.com/r/StableDiffusion/comments/1ny9h3f/samsungcam_ultrareal_qwenimage_lora/ this looks really good
Chroma1-Base or Chroma1-HD or Chroma1-Flash or Chroma1-Radiance? I feel like this is a trick question.
>>106794527 i do think the eyes are pretty decent. fishnets usually have flaws but it's not like they're completely odd. hands certainly not always correct, distant objects... idk, what do you expect from them?
>>106794510sure, everyone thinks that, but most people are ignorant as fuck when it comes to politics and that's by design.. they're the morons who cheer for the system that actively fucks not just themselves, but virtually everybody else too
>>106794186What’s strange is that sora1 seemed pretty free rein up until now and they’ve clamped it all down. Wonder why they didn’t care before
>>106794523 Base is a decent go-to. Radiance is the current experimental model. For that one my personal opinion is that you might as well take the current snapshot if you're using something that experimental.
>>106794530democracy is stupid. women are retards.
>>106794523start with hd
>>106794546democracy is completely at odds with capitalism
>>106794540 Why does it have banding?
>>106794538 >Wonder why they didn’t care before because no one wanted to play with sora 1, it's a bad model, OpenAI "only" got insane hype for GPT-4, 4o imagegen and Sora 2
>>106794551hardly. capitalism typically captures whatever is around it, if it's weak.
>>106794527 >fishnets usually have flaws but it's not like they're completely odd this kind of pattern has always been the chroma tell. knowing that chroma itself can't produce these fine details makes radiance kind of useless as a proof of concept for pixel space vs vae.
ok
>>106794555Call me crazy but sora1 was the best image Gen model I’ve used locally or saas. Won’t do lewd of course but it always seemed to understand whatever half baked idea I threw at it. Since I have ChatGPT plus for work I figured I may as well mess around with it and was pretty pleased actually. But local is still king I can’t gen big titty 1girls getting anally spit roasted on saas
>>106794527insufficient style prompting
>>106794557capitalism can only move in one direction and that is toward complete monopoly which is entirely at odds with anything but fascism
>>106794565local is king for racism.repost from previous thread. (other gen going on)
>>106794565 >always seemed to understand whatever half baked idea I threw at it All OAI image/video models interpret and rewrite your prompts. That's why they feel so "intuitive". Try doing the same with wan.
>>106794582 not only that, but 4o imagegen is an autoregressive model, so it knows how to "think" and expand your simple prompt in its own layers without any rewrite, as a real human would actually
so much newfaggotry recently
>>106794588always happens with any big model launch, saas or not
>>106794552 I don't really see it. perhaps it's just the influence of 2d art >>106794559 even if it simply learns more quickly it'll be fine for many users
>>106794538>Wonder why they didn’t care beforeThey did. They just wanted to hook people in before they built up any real liability.
>>106794607classic bait and switch, and the APIkeks fall for it again and again, when will they learn?
>>106794608Are those guys packing cocaine?
>>106794401 lower the strength of the loras on the low noise model side to something like 0.4.
>>106794602 Well, when there's a vae, we'd say "it's the vae". But actually, I think it's the stripe the model follows. could be wrong.
>>106794401 >I find if you add more than two the quality shits itself. that's the problem of loras, you can't really stack them, that's why I'm angry that the modern base models know so little concepts, you can't go far with just loraMaxxing
horror no ai can generate. checkmate ai.
>>106794608change the guy walking down the row to big smoke
>>106794629 You can stack them if you basically do some final inpaint passes where you inpaint each character on its own pass. Style loras generally stack well. That being said, has comfy ever introduced a decent inpaint node where you can paint directly on the image or do you still have to make masks in a third party app?
>>106794355 >>106794332 >>106794370 My current mobo is MSI B450 Gaming Plus Max. Apparently I will only get half the speed from this ssd.
>>106794658It don't matter. None of 'dis matters.
>>106794658 bro half speed is like 3GB/s lol, that ain't an issue
How are the new Wan2.2 lightning LoRAs? Better than the 2.1 version?
>>106794688 The NEW new ones? They definitely fix the motion issues, but it's also t2v only right now. They also increase the saturation like crazy if you don't tone them down. Like the acid just started kicking in or something.
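Quick check on the "half speed" figure, as a sketch. The assumption here is that the drive is a PCIe 4.0 x4 NVMe sitting in a PCIe 3.0 x4 M.2 slot, which is what "half the speed" on a B450 board would suggest. The numbers are theoretical link ceilings from the per-lane transfer rate and 128b/130b encoding, not measured drive throughput.

```python
# Per-lane raw transfer rate in GT/s for PCIe generations that use 128b/130b encoding.
GT_PER_S = {3: 8.0, 4: 16.0, 5: 32.0}


def pcie_gb_per_s(gen: int, lanes: int = 4) -> float:
    """Theoretical one-direction link bandwidth in GB/s.

    GT/s * lanes gives raw gigabits per second; 128/130 removes the
    encoding overhead; dividing by 8 converts bits to bytes.
    """
    return GT_PER_S[gen] * lanes * (128 / 130) / 8


if __name__ == "__main__":
    for gen in (3, 4):
        print(f"PCIe {gen}.0 x4: {pcie_gb_per_s(gen):.2f} GB/s")
```

So the older slot caps out a bit under 4 GB/s, which is still far faster than a SATA drive for loading checkpoints.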
the woman laughs. she drinks beer from the mugthe beer is floating in the air but it works
>>106794737good, almost there but still slopped sadly
>>106794744neat
>>106794621Which is why the multi day trolling could only be done by a disabled retard
why has nobody done for qwen what pony and its derivatives have done for sdxl? i love qwen man
>>106794828 big model
>>106794828it's a 20b model anon, look at chroma, it's a 8.9b model and that furry fag had to wait 6 months to """finish""" that finetune
>>106794744butiful lightx2v slow motion wanslop
>>106794840>>106794852are we talking like crypto rich nigga, ben mallah, or elon musk?
>>106794853can't wait to get the improved i2v version, they nailed their latest t2v lora
bros i have yet to try qwen image / edit, are there any lightning loras or anything like that? gonna need all the cope i can get to run it
>>106794868 it'll probably cost millions to make a full scale finetune of a 20b model. for chroma we have this information: >8.9b model >5 million images >48 epochs at 512x512 + 2 epochs at 1024x1024 >150 000 dollars
>>106794737 >the beer is floating in the air but it works the beer was "floating" in the initial pic bro
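Back-of-envelope on the Chroma figures quoted above, purely as arithmetic on the numbers in that post (none of this is an official breakdown). The "512-equivalent" count is an added assumption: it weights the 1024x1024 epochs by their 4x pixel count.

```python
# Figures as quoted in the post above.
IMAGES = 5_000_000
EPOCHS_512 = 48
EPOCHS_1024 = 2
COST_USD = 150_000


def image_passes() -> int:
    """Total number of times an image was seen across all epochs."""
    return IMAGES * (EPOCHS_512 + EPOCHS_1024)


def passes_512_equivalent() -> int:
    """Same count, with each 1024x1024 pass weighted as four 512x512 passes."""
    pixel_ratio = (1024 * 1024) // (512 * 512)  # 4
    return IMAGES * (EPOCHS_512 + EPOCHS_1024 * pixel_ratio)


def usd_per_pass() -> float:
    """Average dollars spent per image pass."""
    return COST_USD / image_passes()


if __name__ == "__main__":
    print(f"image passes:          {image_passes():,}")
    print(f"512-equivalent passes: {passes_512_equivalent():,}")
    print(f"cost per image pass:   ${usd_per_pass():.4f}")
```

That works out to 250 million image passes at roughly $0.0006 each. How that scales to a 20b model depends on dataset size and resolution as much as on parameter count, so treat any extrapolation as a guess.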
yeah, I'm thinking pytorch is cooked
>>106794872 https://huggingface.co/lightx2v/Qwen-Image-Lightning/tree/main use 8 steps v2
>>106794887yea sure it """runs""" but how good will it actually be
sora 2 killed the party. even the other online tools, are less impressive now. local is still great for nsfw content. but for sfw... lol
>>106794628 IIRC it's 16x16 patches. IDK if anything could encourage banding. >>106794658 I think you'll be ok with that speed. YMMV about the size.
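For scale, here is what 16x16 patches would mean in token terms, as a sketch. It takes the recalled 16x16 patch size at face value and assumes one token per non-overlapping patch; neither is confirmed anywhere in this thread.

```python
def patch_count(width: int, height: int, patch: int = 16) -> int:
    """Number of non-overlapping patch x patch tiles covering the image."""
    if width % patch or height % patch:
        raise ValueError("image dimensions must be multiples of the patch size")
    return (width // patch) * (height // patch)


if __name__ == "__main__":
    for side in (512, 1024):
        print(f"{side}x{side}: {patch_count(side, side)} patches")
```

A 1024x1024 image comes out to a 64x64 grid of patches, so any artifact that lines up with patch borders would repeat every 16 pixels.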
>>106794887>1 bitholy lobotomy
>>106794897you forgot sora giving itself a lobotomy for the sake of safety + not wanting the biggest class action lawsuit in history against them
>>106794892but it's for the old version of qwen edit right?
>>106794887finally, a model that can function like myself (just one brain cell)
>>106794911it works with 2509
>>106794910>not wanting the biggest class action lawsuit in history against themthis is why SaaS will always lose
>>106794897 >>106794910 >you forgot sora giving itself a lobotomy it's like witnessing the iPhone release a second time and then Steve Jobs stops producing those phones 2 days later, like we've seen the future but the company chickened out because it was deemed too dangerous, now we'll have to wait for a company with more balls to release the equivalent to the iPhone
>>106794887 people need to train their model to be 1bit as u cant just quant it down to it and have it work
is there any good db of openpose / controlnet skeletons?
>>106794925 desu, if they managed to train a model at 4bit while keeping the quality of fp16, we would be able to run bigger models and compete with APIkeks
>>106794887 >I'm thinking pytorch is cooked >April 2025 https://xcancel.com/LiorOnAI/status/1913664684705874030
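To put the bit-width talk in numbers, here is a sketch of weight storage only. The 20B and 80B sizes are the ones mentioned in this thread for the qwen and hunyuan image models. It ignores activations, the text encoder, and any per-block scale factors a real quantization format carries, so actual files would be somewhat larger.

```python
def weight_gb(params_billions: float, bits_per_weight: float) -> float:
    """Storage for the weights alone, in GB (1 GB = 1e9 bytes)."""
    return params_billions * bits_per_weight / 8


if __name__ == "__main__":
    for params in (20, 80):
        sizes = ", ".join(
            f"{bits}-bit: {weight_gb(params, bits):g} GB" for bits in (16, 8, 4, 1)
        )
        print(f"{params}B -> {sizes}")
```

On these numbers a 20B model at 4 bits fits in roughly the space an fp16 5B model takes, which is the appeal of training natively at low precision instead of quantizing afterwards.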
>>106794933nvm im retarded
reminder, magic is real with qwen edit.with remove clothes qwen edit 2509 + the 8 step lightning lora: "remove the yellow banner over the breasts of the naked anime girls."https://files.catbox.moe/dmjhva.png
>>106794979i love you for shilling this anon
>>106794991well no one has to pay me, I shill it cause it's a really neat tool for manipulating and editing all images. now all we need is wan 2.5.
I only hope China can drop some insane uncensored models before they grow too indulgent and start caring about nonsense like women's rights and copyright
>>106794997>insane uncensored modelsYeah but from who? Wan is censored.
>>106795004I don't know. But there is only one nation which can rival Altman's legions of Ugandans. CHYNA.
>>106794997>before they grow too indulgent and start caring about nonsense like women's rights and copyrightthey seem to care a lot about copyright though, Wan and Hunyuan don't know shit in pop culture
>>106794979also, if you feed the 1024x1024 emptysd3 latent image node to ksampler latent image (override the default size of image1)you get this: "show the nude anime girls."it's not wide enough for all of them but you get the idea.https://files.catbox.moe/1iqs3d.png
>>106794871hopefully it's good, lightning 2.1 is not as consistent
>>106795019>care a lot about copyrightthis could've just been the autocaptioner not including character names or game names
>>106794923>dangerousthat's a funny way of saying expensive
>>106795020got a link to that lora?
>>106795030Anthropic got raped in court, why do you think OpenAI is immune to the copyright mafia?
>>106794997it's too late. they already started to act like americans. it wasn't even 5 years ago children had aspirations of being astronauts and scientists. now they all want to be brainrot influencers
>>106795039ill upload it, some anon posted it earlier in the week, give it a bit
>>106795041>Anthropic got raped in courtis this why they're going in an anti-slop direction with their "Keep Thinking" brand
>a local model consistently passing my "2d cat with 3d tail" test that some SOTA models struggle with oh shit- >model too fucking fat for home use :( we're so fucking close man. hunyuan 3 would've been salvageable if smaller but at least this gives me hope for the future of image gen
the anime girl is wearing a black bikini.steel bikini run?
does qwen image work best with hyperslopped llm rewritten prompts or can i just type shit in like with wan
>>106795027 they should use Gemini to caption image desu, this shit seems to know characters
>>106795060 it understands language better than most models, no need for llm scripts
>>106795064 is this pro or flash
>>106795070 gemini 2.5 pro
>>106795074 that'd be expensive as shit to run over tens of millions of images
>>106795059better glove:
>>106795079well yeah that's the price to pay for a quality dataset, we're talking about multi billion dollar companies (Tencent and Alibaba), I'm more angry at Tencent though, instead of spending millions on a giant 80b model they should've used that money on Gemini pro and a normal sized model
>>106795090they dont give a fuck about the dataset as long as it's merely okay. it's a means to an end so they can move on to the next project
>>106795095
>they dont give a fuck about the dataset as long as it's merely okay.
that's the problem, they don't give a fuck and at the same time expect us to give a fuck
For the sdxl-based 1girl sloppers out there, I’m messing around with the new epsilon scaling thing hao ported to forge classic and it does seem to produce nicer outputs. I can’t compare directly as pretty much all the gens I’ve got stored are from vpred noobxl and derivatives, no eps models, but even for the bored models it seems to improve output. However it also changes the output, unlike what it is supposed to do for eps models. Fun to mess around with if you guys haven’t pulled recently. It’s also in comfy, he ported it from there
>>106795039k it finished uploadinghttps://gofile.io/d/sfoxubqwen clothes remove 2509 lora anons
any word if Celestial will be any closer to tensor?
>143s/it qwen edit really nigga
>>106795144use the template workflow, use the 8 step qwen image v2 lightning lora
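For scale, the 143 s/it figure quoted above turns into wall-clock time like this. The 8-step count comes from the lightning lora suggestion; the 20-step count is only an assumed non-lightning baseline for comparison.

```python
def total_seconds(sec_per_it: float, steps: int) -> float:
    """Wall-clock sampling time, ignoring model load and VAE decode."""
    return sec_per_it * steps


def as_minutes(seconds: float) -> float:
    """Convert seconds to minutes."""
    return seconds / 60


lightning = total_seconds(143, 8)    # 1144 s, roughly 19 minutes
baseline = total_seconds(143, 20)    # 2860 s, roughly 48 minutes (assumed step count)
```

Cutting steps helps, but at that s/it the setup itself is the real problem.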
>>106795058not bad but also it doesn't indicate sufficient 1girl power even if
>>106795110ill give it a shot, thx anon
>>106795160I am :|
>>106795058
>we're so fucking close man
far from it, the outputs are ultra slopped, it doesn't know characters or celebrities, it's just qwen image with a slightly better prompt comprehension since it's an autoregressive model
>>106795058for all those parameters it better be able to do that
https://huggingface.co/drbaph/Qwen-Image-Edit-Mannequin-Clipper-LoRA
Is this trolling?
>>106795163which gpu?
the anime girl Hatsune Miku is on a giant billboard on the side of a building, in Akihabara Japan. The billboard extends from the street to the ceiling.
bit redundant since the image is of an anime girl, but it still works.
>>106795144you using a 1050 ti or some shit? i'm a vramlet and im getting acceptable speeds
>>106795175
>>106795181
7900 GRE. But I'm noticing there may be significant updates I can make to rocm. I have to check this because I agree something is wrong.
>>106795185
>amd
that explains it.
>>106795095
>it's a means to an end so they can move on to the next project
so they're not aiming for something usable for themselves?
>>106795162
Go to “settings in ui” and add “scaling_factor” to get the slider with your txt2img settings, rather than switching to the settings tab each time to fiddle with it.
If you have noobai eps or models based off it, it should work as a “refiner” of sorts if I understand right.
>>106795187We will see. I'm leaning more towards my incompetence in setting it up with the latest updates. Someone says that every time and it always just ends up being my incompetence. AMD deserves bullying but it's not completely useless.
>>106795201is he still working on that shit? do we know when it's gonna be done?
fucking jeets, now we'll never get sovl models because souless shit like HunyuanImage 3.0 managed to be 1st on that memeboardhttps://xcancel.com/TencentHunyuan/status/1974522542858911935#m
>change seed on qwen
>image changes slightly
not a very creative model i see
>>106795180
>in Akihabara Japan
akiba doesn't have the giant screens. it's all static billboards and none that big. you are thinking of shibuya
https://xcancel.com/LodestoneRock/status/1974487600225546733#m
is this another snake oil?
>>106795205
> is he still working on that shit?
yes, and also it has only been like a month
> do we know when it's gonna be done?
no clue
the girl is sitting in a chair in a library. keep her expression the same.
neat, from a swimming photo even.
>>106795218how much slower is it compared to the regular VAE process on chroma? and does it eat more vram?
>>106795194
>>106795162
>>106795110
I'm on comfy, I pulled but I can't seem to find the node, lol
>>106795238switch to nightly
>>106795242whats the node called
>>106795226
>the girl
Can you feed it a highly specific person and have it work? Like does it work with this person?
I seriously doubt I can get qwen working with my gpu, gotta upgrade.
>>106795216>the pseudo intellectual furry retard that ruined his model to try meme experiments wants to experiment with memory directlygee I wonder
>>106795250sure, use "keep their expression the same" to retain their facial details though, in general.
>>106795252
does it work with the library thing? Like can you make him (he's just a random person) sit in the library? Or does it look basically like a photoshop?
Because my suspicion is that the model detects Lara Croft, and then just gens her. I don't think it has an internal sense of the look of a random new person.
>>106795261it works with any image, real person or anime. you can make them point a gun if you want, whatever. it's like gen AI but can manipulate stuff with prompts.
>>106795227i'd say about a third to half the speed but it is hard to compare. as for the RAM that might need comparative testing across a bunch of resolutions but IDK if anything much is optimized so not sure there's much of a point.
>>106795247It’s called “epsilon scaling”. I apologize for linking preddit but here https://old.reddit.com/r/StableDiffusion/comments/1nwmj4m/epsilon_scaling_a_real_improvement_for_epspred/
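For anons wondering what the slider actually does: as far as I understand the idea, epsilon scaling divides the model's predicted noise by a factor slightly above 1 before the sampler uses it. The sketch below is a toy illustration under that assumption, not the Forge Classic or ComfyUI code. `scale_epsilon` and `ddim_step` are hypothetical names, the `scaling_factor` name just mirrors the setting mentioned in the thread, and the default 1.005 is illustrative only.

```python
from typing import List


def scale_epsilon(eps_pred: List[float], scaling_factor: float = 1.005) -> List[float]:
    """Shrink the predicted noise by dividing by scaling_factor (assumed > 0)."""
    if scaling_factor <= 0:
        raise ValueError("scaling_factor must be positive")
    return [e / scaling_factor for e in eps_pred]


def ddim_step(x_t: List[float],
              eps_pred: List[float],
              alpha_bar_t: float,
              alpha_bar_prev: float) -> List[float]:
    """One deterministic DDIM update (eta = 0) for an eps-prediction model."""
    sqrt_ab_t = alpha_bar_t ** 0.5
    sqrt_one_minus_ab_t = (1.0 - alpha_bar_t) ** 0.5
    # Estimate the clean sample from the current noisy sample and the noise prediction.
    x0 = [(x - sqrt_one_minus_ab_t * e) / sqrt_ab_t for x, e in zip(x_t, eps_pred)]
    sqrt_ab_prev = alpha_bar_prev ** 0.5
    sqrt_one_minus_ab_prev = (1.0 - alpha_bar_prev) ** 0.5
    # Re-noise the estimate to the previous (less noisy) timestep.
    return [sqrt_ab_prev * x0_i + sqrt_one_minus_ab_prev * e
            for x0_i, e in zip(x0, eps_pred)]


# Toy usage: the "model output" is a made-up list standing in for a real network.
x_t = [0.8, -0.3]
raw_eps = [0.5, -0.2]
plain = ddim_step(x_t, raw_eps, alpha_bar_t=0.5, alpha_bar_prev=0.7)
scaled = ddim_step(x_t, scale_epsilon(raw_eps), alpha_bar_t=0.5, alpha_bar_prev=0.7)
```

A factor of 1.0 leaves the prediction untouched, so the slider can be A/B tested against a fixed seed.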
change the background to a movie studio with large green screens and a couple of movie cameras. A white piece of paper taped to the wall says "we'll do it, some day".
>>106795268>>106795268
>>106795261
like the wan orbital camera movement videos, it works with basically any character including random chroma gens or w/e where there's no way it's an actual known character as such
shit is pretty magical
>>106795261like this for example.the man with glasses is pointing a silver pistol at his head.the model doesn't know who the fuck they are. but it knows how to edit/swap stuff.
>>106795216
>"We solved VRAM bro, Nvidia is no more!"
>By the creator of Chroma!
kek, you know it's bullshit, c'mon
>>106795279
Yeah, but does it know how to read/understand faces enough to rotate them in 3d space? Like generating actual new views with a likeness intact?
if so, imo that would be the first ai with documented likeness capability.
>>106795285it can do transforms, like "view from the back" or side, pretty neat how it works desu
>>106795278
>like the wan orbital camera movement videos
It's more magical to do a large movement than basically slight changes in a row.
Does the likeness drift if you have the head return to the original position?
>>106795196
using --use-split-cross-attention improved it considerably
>>106795250
pic related
>>1067952913/4ths view, does it look like the same person? or at least usually?
>>106795296it depends, if no face is visible the model has to guess, but you can prompt details (eye color, ethnicity, shape, etc).
What's the nomenclature for Qwen edit with multiple images? Insert image1 into image2? Can it work up to image3?
>>106795307the default 2509 edit layout that comes with comfyui has 3 load image nodes so I assume yes. And yes to the image1 image2 thing.
>>106795294not bad. Seems close enough.
>>106795317>>106795294added the samsung insta lora
>>106795292
have you ever actually done anything like that? if you do slight changes the trivial way you'll lose coherence and the backgrounds won't make sense and so on. ai techniques are pretty fucking magical now.
>>106795330That jiggle is so unrealistic, but also like so good. Real women never stood a chance
best model at generating HELL?
>>106795330i'd play that walking simulator
>>106795218
is cum covered considered sfw? I could name the file "slime girl" or something.
>>106795350
features more jiggle than base wan, surely still to be perfected.
>>106795391
might not be long in the future - multiple different ai techniques could lead there
>>106795398
nice
>>106795417
>features more jiggle than base wan, surely still to be perfected.
are you just using a lora on wan or something? what do you mean
>wan 2.2 smoothmix is great at motion but ignores the last frame for a loop
Fuck off.
>>106795214who cares
It's spooky gen season
>>106793488>>106793501>>106793518>>106793638Kill yourself>>106793535That was before you transitioned and suffered traumatic head injuries, i assume?
https://files.catbox.moe/fnh3wo.pngnsfw tittiesif only qwen didnt fuck up the teak
>>106796532
>>106796532>>106796596any tips on unslopping the skin? possible?
qwen_image_edit_2509 is so bad at prompt following
>>106796617I think the model is really just trained for "putting clothes in image 2 on person in image 1" and anything other than that is a throw of the dice
>>106796617>>106796625I don't know if this is some kind of elaborate samefag ruse but there has been an anon posting gens over the last few generals which has proven that this isn't true at all.
>>106796636
I assume you mean e.g. >>106791124
I tried that with the native comfy workflow, i.e. https://docs.comfy.org/tutorials/image/qwen/qwen-image-edit
1/5 of the time it works nicely, the rest of the time it spits out the input image unmodified. Tried different prompts, no luck.
>>106796688
dunno what to tell you
I've had that before a few times, it was because my prompt was super vague or needed adjustment
first take
the girl is seated in the tokyo metro reading a newspaper in a populated subway car
>/ldg/
>/sdg/
>/adt/
all identical please get your shit together