Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106418741

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
AniStudio: https://github.com/FizzleDorf/AniStudio/tree/dev

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://rentry.org/wan22ldgguide
https://github.com/Wan-Video
https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
reminder that i used nunchaku before all of youseethe
Comfy ..... shot
>>106426685is nunchaku good for you?
Blessed thread of frenship
>>106426692
yes, but i can't use it for qwen image yet because i'm a 12gb vramlet and it doesn't support cpu offloading
https://github.com/nunchaku-tech/nunchaku/pull/624
>>106426715god damn i would absolutely rape that little slut and make her never smile again
why does batch processing work so much better for images?
it seems like comfy is making waaay better use of system resources despite apparent increased load.
>>106426724anon! no! bad anon! anon no raping!!!
>>106426727it was the same shit for auto1111. it isn't some space magic, it's just pytorch. comfy has been doing a great job at fucking memory as hard as possible because of diaper leakies
Anyone ever done i2v of their poop?
>>106426724no rape allowed
>>106426734I feel like I'm standing on top of a very tall tower made out of rotted 2x4s and rusty nails assembled by Pedro and the gang
>>106426745this one is too uncanny to be deserving of my rape
>>106426774would want to be reverse raped by
>>106426782Reverse rape is just consensual sex.
Why is everyone shitting themselves over nano banana when qwen edit is basically just as good?
>>106426793The smile should be less authentic and more sardonic.Otherwise kino gen.
>>106426793this theme used to be funny. but now I doubt they're the only big villain
>>106426800Is everybody? after they pissed off pretty much everyone with the flood marketing they did before release. I don't see it anywhere anymore (unless it's on xitter, don't waste my time there)
this might be my favorite gen of the year.
>>106426697>>106426689>>106426685THE DUALiTY OF MAN!BWAHAGHAHAH
>>106426774>>106426861rocketnon this is just sad
>>106426863Why are you scared of confrontation?
I used comfy example workflow for qwen edit and it creates blank image with error
ComfyUI\nodes.py:1590: RuntimeWarning: invalid value encountered in cast
  img = Image.fromarray(np.clip(i, 0, 255).astype(np.uint8))
tf going on? did i fuck up updating?
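For anyone hitting the same warning, here is a minimal sketch of what it usually means, assuming the decoded image array contains NaN or Inf values (the post itself doesn't confirm the root cause). `np.clip` passes NaN straight through, so the `uint8` cast is the step that complains. `count_bad_values` and `to_uint8_image` are made-up helper names for illustration, not ComfyUI functions.

```python
import numpy as np

def count_bad_values(arr):
    """Return how many entries are NaN or +/-Inf."""
    return int((~np.isfinite(arr)).sum())

def to_uint8_image(arr):
    """Convert a float image in the 0..255 range to uint8 without the cast warning.

    NaN becomes 0 (black), +Inf becomes 255, -Inf becomes 0.
    """
    cleaned = np.nan_to_num(arr, nan=0.0, posinf=255.0, neginf=0.0)
    return np.clip(cleaned, 0, 255).astype(np.uint8)

# Hypothetical 2x2 "image" with one NaN and one Inf, standing in for a broken sampler output.
broken = np.array([[12.5, np.nan], [np.inf, 300.0]], dtype=np.float32)
print(count_bad_values(broken))  # number of non-finite values
print(to_uint8_image(broken))
```

Cleaning the array only silences the warning. If the whole frame is NaN you still get a black image, so the real fix is upstream in whatever produced the NaNs.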
>>106426898why are you afraid to have sex?
>>106426800>everyoneSure, Jan
*yawn*
>>106426901>did i fuck up updating?good software wouldn't allow that
>>106426800
both qwen models desperately need finetunes and more lora support from the community.
>>106426861it's pretty nice, i'll give you that. middle finger is a bit long though, def looks odd.
>>106426906i forgot that you are underagebut i also don't know how people your age communicateso why do you always hide behind this?
>>106426917"its not that serious mate"
https://files.catbox.moe/p8xa4s.mp4
redeem >>>/g/sdg/>>106426861that is a very nice gen
>>106426793actually, the shadow got fucked up a bit in that one>>106426800it's not organic. it's a massive, obnoxious indian operation hired by google. blatantly spamming threads and subreddits where it's against the rules/topic.https://files.cat .. box.moe/5if355.png>>106426838they're not the only problem, but I'm still sick of them.
>>106426926so you ARE just scared of confrontationwell i hope you didn't leave enough breadcrumbs for someone to do things postcard
what do you mean anon?
Is there a fp16 qwen image edit?
someone needs their meds, NOW.
every time i see s/w get too many replies i get curious and then eventually i click on one of the videos, because i know what they look likethen my heart starts HURTING (i am not exaggerating) out of cringe
>>106426901
https://github.com/comfyanonymous/ComfyUI/issues/9265
https://github.com/thu-ml/SageAttention/issues/162
https://github.com/comfyanonymous/ComfyUI/issues/8689#issuecomment-3177486707
sageattention issue, sounds like they're still working on it. tho there seems to be a temp fix you can manually do for now.
using sageattention by itself i haven't had any issues so far.
what's wrong bwos...
>>106426993Because you touch yourself at night
>>106426989poor thing! have mommy cover your eyes!
>>106426993Update the gguf node
>>106426993Using third party nodes, Comfy will always break those
>>106426989sounds like we need a s\w only collage next thread ;3
>>106427008wow what an asshole
>GOOD MORNING!!!!B O T . S T A T U S ?
>>106427012i dont mind thumbnails tho
>>106426993this is the correct answer>>106427004
>>106426931>>106427018>horribleDEF GOIN IN THE COLLAGE!!!
>>106426993NOODLES
>>106427018this reminds me of a scene from https://en.wikipedia.org/wiki/The_Poughkeepsie_Tapeswhich I wish I hadn't watched
>make comfyui workflow a total shitload of fuck
>complain that its 'shit' and 'doesn't work'
>become threadly anti-comfy schizo
why does this keep happening?
>>106426992
>https://docs.comfy.org/tutorials/image/qwen/qwen-image-edit
can you try the first qwen edit example with --use-sage-attention on?
I removed it and it ran fine. so yea this may be it
>>106427038well someone is definitely trying since months to discredit comfyui so who could it be ...
REALLY ACTIVATES THE ALMONDS
>>106427050users
>>106427038
>make comfyui workflow a total shitload of fuck
>whack it against claude until it works
>anons give me helpful suggestions which speed up my workflow considerably
why does this keep happening?
>>106427050a certain shota enjoyer?
>>106426945>post violates united states law
>>106427050girls with penis chad?
imagine being so psychotic that you make direct calls for violence\dox
>>106427067mens with tits
Legitimately don't know how people do anything with their images outside of comfyui. They just press a big stupid orange generate button.
>>106427076its blue tho?
imagine how shitty a general has to be to complain about software and schizos all day with barely any engagement of on-topic discussion
>>106427076some people prefer simplicity over customization
did anyone update the wan 2.2 ldg guide yet?
>>106427039
i don't have the edit model downloaded, somebody else might try
with qwen image, using --use-sage-attention runs fine for me
>>106427084I see gradio has upped its game since I last used it.
>>106427061>>106427070>thinking swedes care about mutt laws
>>106427106just emailed expect my attorney to get involved>dmca, tida, etcringing any bells?
>>106427061whiny bitch kek
>>106427087cant even post a gen, laughable
>>106427145>says this>is a nogennerpottery
>>106427146>>106426931do you HAVE to post this garbage? its making our general look bad!
>>106427137see >>106427138
>>106427088most people do actually. it's why complaints are more common
when will i be able to experience the pleasure of inseminating my cute petite large breasts wide hips 1girl
>>106427197>>106427160BEAGHAHAHAH
>>106427215You may have to leave the house to achieve this....
before i even bother diving into this, can my mbp m4 laptop handle wan2
i want to make vids from pics
>>106427232>cant run windoze
>>106427092
the info that needs updating to get people up and running could easily be copy-pasted from the git pages, or just linked.
that or just link to one of the many updated auto installs i've seen on reddit if you're lazy.
>>106426800Qwen Edit is not just as good lmao, Nano Banana is a model with Imagen 4 Ultra (or a bit better) realism capability, Qwen Edit is a model with Qwen realism capability, it simply cannot retain sufficient detail or likeness if given e.g. an actual photograph.
gm
>>106427242you're right, i was able to get it up and running on my own, but wanted to see if its updated for other anons
>>106427215it'll happen sooner if you never say inseminating again
I pulled, for the first time since the 22nd, and not only did it not crash anything, but I actually got a 5% performance increase
Is Comfy getting it together or should I go out and buy a lottery ticket due to my current good fortune?
>>106427160Can it, Spergules.
>>106427197>>106427243need her
what's the point of talking about Nano Banana here? we already know that online slop tools are better. except for the nsfw part kek
>>106427324Can't believe people have forgotten how Disney used beloved characters like Hercules to influence kids into smoking
>>106427346this. /ldg/ is a gentleman's club of gooners
This must be what a gambling addiction feels like. You randomly get a really nice gen with a whole bunch of good extra details that you didn't even ask for, then you try to add those extra details to the prompt and it just goes lol no and ignores half of it and fucks up the other half.
>>106427232assuming you have 32GB+ ram, yes, but it will be pretty slow
>>106427324this ones pretty good
>>106427365Learning to inpaint and photobash a bit will ameliorate to some extent dependency on the RNG god.
>>106427346
>online slop tools are better
If a much larger model than anything you can run on consumer hardware wasn't better, it would be insane.
Actually it is insane how little gap there is between Wan and SAAS commercial alternatives, which in turn are heavily censored and impossible to augment with extra training like loras.
Local is such a pure win.
>>106427346>Needing NSFW images to goonyou are weak
use case for comfyui needing to run at 1000 FPS?
>>106427364*losers
>>106427387snazzier snap-tos
>>106427346it's not even significantly better than qwen, and that's before qwen LORAs and finetunes really take off. Xi achieved a pre-emptive strike with that model.>>106427365unlike gambling, you can re-engineer prompts, workflows, etc to improve your chances. yeah it's addictive as fuck though. it's really like one of those cyber drugs from cyberpunk fiction.
>>106427387Kelvan Empire Occupiers want gens.
>>106427397*cutting edge ai researchers
>>106427403holy sloppa(probably going in the collage)
>>106427416
I dunno, the catgirl I made from the same workflow didn't get into it, though the chroma image I worked on did.
>>106427416I dunno, I like the messy shading lines, makes it a lot less 'AI'
>>106427365https://youtu.be/OVAkL2YbisE?list=RDOVAkL2YbisE
>>106427387it's a bunch of fucking divs!!! why the fuck should it cost more to render than 4chan???
bro is SO upset that hes not in the faggollage
>>106427406
There is absolutely no way to train a Qwen single-subject likeness lora on photographs of an actual person and inference with it in a way that looks nearly as good as a Flux Krea lora trained on the same dataset (I've tried), the model just isn't realistic enough inherently, even with schizo negatives it still likes to veer into Pony Realism esque sloppa real quick.
This isn't to say the likenesses are super off or that the coherency is bad, in those regards the Loras I've tried came out fine, it's just it's not a good platform aesthetically for anything non-illustrative at the moment.
>>106427493still no fast krea?
>>106426800
Not as good as everyone here has you think
https://lmarena.ai/leaderboard/image-edit
My findings are pretty much the same. Qwen edit still doesn't hold a candle to whatever BFL is hosting behind their API. And nano banana is basically just a hypothetical Krea edit, so it's not really that far off from Kontext Pro/Max.
>>106427493Raw output or no LoRA? The output is comparable to Chroma.
Man, I really fucking need my local music gen fix, fast.Every single open music gen model is garbage.
>>106427568I've found Qwen edit does basically as good a job and I can train it to do exactly what I want.
>>106427564nunchaku
>>106427575Ace step wasn't too bad. Definitely a step in the right direction. I think it's getting a big update soon too.
>>106427564
Not sure what you're referring to exactly, it got a Nunchaku already at least though:
https://huggingface.co/nunchaku-tech/nunchaku-flux.1-krea-dev
>>106427230so what?
>>106427584When stacked against other models though you can easily pick apart the slopped Qwen edit images. That's why it's not performing well in LMArena.
>>106427573I legit can't tell if you're asking a question or stating something here lol, I don't know what you meant by this.
>>106427535
>There is absolutely no way to train a Qwen single-subject likeness lora on photographs of an actual person and inference with it in a way that looks nearly as good as a Flux Krea lora trained on the same dataset
The model has barely been out a month. We don't really know how to train it yet. Flux was out for a whole year before we finally got some decent de-slopping of it like Krea and Chroma. I'll reserve judgement since some LORAs seem to do an OK job at de-slopping, way better than Flux was at this age.
That said, if we do get a Qwen finetune, by the time it drops we'll have a new completely different SOTA lmao
>>106427592>>106427597so nothing for vanilla comfyui?
>>106427594I hope they at very least improve the alignment related to the genre tags, and make it work with niche genres (eg Eurobeat), or follow the reference audio of said genres well
>vanilla comfyui
>>10642761615 rupees were deposited on your shit account saar
>>106427648the thing is, everything is for vanilla comfyui
>>106427631I'm asking whether you used a LoRA or not.
when are we going to see 1080p genning?
>>106427742Now if you have infinite amounts of time.
>>106427568
>And nano banana is basically just a hypothetical Krea edit
not really, Imagen (which Nano is clearly related to) has its own aesthetic. obviously it's not a complete comparison of everything either model can do here or really even a proper one to one comparison at all, but this is Imagen 4 Ultra at 2K straight from the API on the left and the Krea on the right, same prompt.
At least for this particular type of gen / prompt, Imagen is more like, "a model that's aesthetically tuned but with really high detail and not distilled at all", Krea sort of leans more raw (perhaps a little too much so, it kind of falls into the trope of representing modern phones as having rather worse camera quality than they actually do)
>>106427317
>Is Comfy getting it together or should I go out and buy a lottery ticket due to my current good fortune ?
You pulled when he just put out a release
https://github.com/comfyanonymous/ComfyUI/releases/tag/v0.3.55
So I guess the lottery ticket?
>>106427638no it trains fine, the only people struggling are the sort of people who ALWAYS used super weird training configs with retardedly low learning rates and the constant scheduler even on older models, there's definitely absolutely nothing technically "wrong" with the Loras themselves, trust me
>>106427648I guess not? I dunno what that would even be in terms of equivalents on any model that's come before though
>>106427735oh my bad, yeah I did, one that was actually trained on Krea, not regular Flux
>>106427861
>constant scheduler
Nothing wrong with the constant scheduler.
Every large model is trained using constant scheduler, including all released as open for local, from SD15, SDXL, Flux, Wan, Chroma, Qwen. I have no doubt that holds true for all local finetunes as well, but they seldom release papers / detailed info on how they train their models.
>>106427841might be the first time ive seen the mirror effect in a gen. thats sweet.
If it's finished, why does he keep updating it?
>>106427450Hey Ani, we post your shitty UI in /adt/'s OP, go there and do some clowning around at least as a way of saying thanks!
>>106427983>anibeahafahahah
>>106427898
you're a gorillion times more likely to overtrain or undertrain with constant on a Lora than with Cosine or Cosine with Restarts, in my experience.
these settings (epoch count might change depending on the dataset or whatever obviously, I never count in "steps" or use more than one repeat though, and the text encoder learning rate there wasn't relevant cause it's obviously not actually being trained) have been working well for me on TensorArt, anyways.
I don't know what exactly their backend is but Dim seems to always be in Kohya-equivalent scale. I'll note also, for Qwen Dim 16 gives about a 260 MB Lora, Dim 32 is 500 something, Dim 64 would presumably be well over 1 GB.
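The file sizes quoted above line up with LoRA size growing roughly linearly with Dim (rank). A rough sketch of that arithmetic, assuming the file is dominated by the low-rank matrices and that the set of target layers and the dtype stay the same between runs. `lora_params` and `estimate_lora_mb` are illustrative names, and the 260 MB reference is just the figure from the post.

```python
def lora_params(d_in, d_out, rank):
    """Parameters in one LoRA pair: down matrix (rank x d_in) plus up matrix (d_out x rank)."""
    return rank * (d_in + d_out)

def estimate_lora_mb(dim, ref_dim=16, ref_mb=260.0):
    """Scale a known file size linearly with rank (assumption: same layers, same dtype)."""
    return ref_mb * dim / ref_dim

for dim in (16, 32, 64):
    print(dim, estimate_lora_mb(dim))
```

Doubling the rank doubles `lora_params` for every targeted layer, which is why the estimate is a straight line through the Dim 16 data point.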
>>106427669
https://files.catbox.moe/v8ysdr.jpg
Left: Kontext Pro, right: Qwen Edit. Bottom: original.
You are free to draw your own conclusions.
>>106427841
Imagen 4's aesthetic is not really in line with Gemini 2.5 Flash though. The photorealism is a lot closer to the Flux side.
>>106427982he is uploading the radiance progress. chroma but in pixel space.
>Blows out the colorHuh. Any Qwen Image Edit Anons got some pro tips?
>>106428125use nanobannana instead
>>106428143kys jewgle shill
Alright, had the resize to be divisible by 16 instead of 32. Way less blown out.
>>106428143
I'd rather be drowned in a vat of boiling shit.
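On the divisible-by-16 resize mentioned above: a small sketch of snapping an input size to the nearest multiple before it goes into the edit workflow. Why 16 behaves better than 32 for Qwen Image Edit isn't established in this thread, it's only what the anon reported. `snap_to_multiple` and `snapped_size` are hypothetical helpers, not nodes from any UI.

```python
def snap_to_multiple(value, multiple=16):
    """Round value to the nearest multiple (ties round up), never below one multiple."""
    snapped = (value + multiple // 2) // multiple * multiple
    return max(multiple, snapped)

def snapped_size(width, height, multiple=16):
    """Return (width, height) with both sides snapped to the given multiple."""
    return snap_to_multiple(width, multiple), snap_to_multiple(height, multiple)

print(snapped_size(1000, 1500, 16))
print(snapped_size(1000, 1500, 32))
```

Snapping to 16 moves each side by at most 8 pixels, versus up to 16 pixels for a multiple of 32, so the input gets resized less.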
>>106428084I'd say Nano is very very similar to be honest, they both have somewhat broad output diversity even on the same prompt (if the prompt isn't super detailed at least) so it can be a bit hard to compare them (and also Nano doesn't have straight text-to-image functionality for anything but 1MP 1:1 aspect ratio yet unlike what it has for image-to-image).This is them on a much longer more detailed prompt than the one I used before though, at 1024x1024 it's pretty clear there's no way they aren't closely related IMO.
>>106428084Can you provide the original image and prompt? That would be cool.
is it possible to set a desired output size, for qwen image edit? or is it always based on the image input size
>>106427416I discard your allegations of slop, they mean nothing to me, for I have seen what you gen for your own satisfaction.
>>106428027
>gorillion times more likely to overtrain or undertrain with constant on a Lora than with Cosine or Cosine with Restarts
No I must disagree, it's actually a LOT easier to avoid overtraining or undertraining with constant than with Cosine.
With cosine the amount of guessing increases, will X epochs be good or will it fall too sharply or too slow? And restarting a cosine training often makes the model go haywire.
With constant and adamw you will have a small dropoff, so it's not actually constant, but it is VERY gradual, so as long as you check your samples / eval / loss curve to see when it overtrains, you will have great results.
With an adaptive parameter free optimizer like Prodigy, here I agree that cosine is good since it makes it work like adamw dropoff, very gradual, and also fights back Prodigy's eagerness to raise LR again after having settled, and often too high.
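For anyone following the constant vs cosine argument, here is a sketch of what the three schedules do to the learning rate over a run. It uses the standard cosine annealing shape and is not the exact scheduler code of Kohya, OneTrainer or TensorArt. The step counts and the 1e-4 base rate are placeholder values.

```python
import math

def constant_lr(step, total_steps, base_lr):
    """Same learning rate for the whole run."""
    return base_lr

def cosine_lr(step, total_steps, base_lr, min_lr=0.0):
    """Decay from base_lr to min_lr along half a cosine wave."""
    progress = min(step, total_steps) / total_steps
    return min_lr + 0.5 * (base_lr - min_lr) * (1.0 + math.cos(math.pi * progress))

def cosine_restarts_lr(step, total_steps, base_lr, cycles=2, min_lr=0.0):
    """Same cosine decay, but reset to base_lr at the start of each cycle."""
    cycle_len = total_steps / cycles
    progress = (step % cycle_len) / cycle_len
    return min_lr + 0.5 * (base_lr - min_lr) * (1.0 + math.cos(math.pi * progress))

for step in (0, 25, 50, 75, 100):
    print(step,
          constant_lr(step, 100, 1e-4),
          cosine_lr(step, 100, 1e-4),
          cosine_restarts_lr(step, 100, 1e-4))
```

This shows the trade-off both anons are describing: with cosine the total step count is baked into the curve, so picking the epoch count matters more, while constant leaves the stopping point to whoever is watching the samples.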
>>106428094Slop
>>106426678Well, maybe the NetaYume guy isn't dumb after all doing reckless things at the behest of randos and popular demand, but I dunno how much I trust that to last given his prior finetune results. Really sucks we're down to this kind of cope hope for anime models.
>>106428225>https://files.catbox.moe/v8ysdr.jpgOriginal image is at the bottom. The prompt was to make her lay at the beach.
after the eternal chroma vs qwen, now krea vs banana...
>things that didn't happen
I made my wife talk>trigger warning: hooveshttps://files.catbox.moe/jxse60.webm
>>106428317I was really hoping for tom green there.
>>106428323>LOOK IM A FARMER!Society is such a joke that he actually became a farmer.
Mcdonalds, fix your menu or else.
>>106428337Do you have a lora for him? him being violent towards fast food has a lot of potential.
>>106428337>McDonaidskek
>>106428341no need, qwen edit can make anyone do anything more or lesssource image:prompt: A brown bag with the McDonalds logo on it is resting on a marble counter in a kitchen, beside some McDonalds cheeseburgers. the man is looking at the camera with a serious expression, and is holding a large butcher knife.with the light2x lora it only takes 8 steps, qwen is otherwise slower than flux.https://huggingface.co/lightx2v/Qwen-Image-Lightning/blob/main/Qwen-Image-Edit-Lightning-8steps-V1.0.safetensors
>schizoid theory: statler is rocketnon
>>106428362Holy crap. TY anon. AI has so many buzzwords its impossible to research.
https://huggingface.co/starsfriday/Qwen-Image-Edit-Remove-Clotheshave not tried this yet, does it work like kontextand dont do it to reviewbrah
>>106428280his first version was honestly a fine attempt given it was a tune of the BETA version of the original Neta IMO. Once he moved over to tuning against the main 1.0 release, in his 2.0 / 2.0 Plus release, it was basically immediately better than the base 1.0, I'd say.
>>106428370yes
>>106428370no
>>106428304no one was arguing Krea vs Banana lol, I merely pointed out that Banana (and Imagen) are not actually really that aesthetically similar to any other non-Google model
>>106428370maybe
>>106428377downloading now b4 nuked
>>106428370can you repeat the question?
>>106428370only one of them posts good gens so...
>>106428397with google you can find all these without much issue, they just cant host it on civitai or whatever.
The man is wearing a black suit and is standing outside a McDonalds restaurant on a summer day. He is pointing at the McDonalds sign above the restaurant and is smiling. He is wearing brown dress shoes.it didn't make him a hobbit, based qwen
>>106428415>not holding an assault riflePlease do him>outside with assault rifle>inside angry, scaring register staff>inside the back kitchen>shooting hamburgers
>>106428415>it didn't make him a hobbitKontext be seething
Is there a way to apply a LoRA with variable strength across timesteps in Wan video? As in strength 1.0 at step 1 and then strength 0.5 at step 3. Controlnet has something like that.
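Not an answer on whether any Wan workflow exposes this, but the schedule itself is simple to write down. A sketch of a per-step strength lookup with linear interpolation between keyframes. `strength_at` is a hypothetical function, not a ComfyUI or Wan2GP API, and how the value would be fed to a sampler is left open.

```python
def strength_at(step, keyframes):
    """Linearly interpolate LoRA strength between (step, strength) keyframes.

    Steps before the first keyframe or after the last one hold the nearest value.
    """
    points = sorted(keyframes)
    if step <= points[0][0]:
        return points[0][1]
    for (s0, v0), (s1, v1) in zip(points, points[1:]):
        if step <= s1:
            t = (step - s0) / (s1 - s0)
            return v0 + t * (v1 - v0)
    return points[-1][1]

# The example from the post: 1.0 at step 1, 0.5 at step 3.
schedule = [(1, 1.0), (3, 0.5)]
for step in range(1, 6):
    print(step, strength_at(step, schedule))
```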
>>106428280Kek.NetaLumina is Chroma story arc but for anime models.
The man is wearing a black suit and is sitting in a booth inside a McDonalds restaurant on a summer day. beside him is a brown bag with the McDonalds logo, a bottle of champagne in a bucket of ice, and a plate and silverware. A McDonalds cheeseburger is on the plate. fine dining.
>>106428280
no need for cope, wai v14 or hassaku + controlnet union + adetailer is all you need, the base checkpoints know 99% of characters even before loras (which there are millions of).
use this extension in forge/reforge for anime models, it's so easy to use tags just by typing:
https://github.com/DominikDoom/a1111-sd-webui-tagcomplete
>>106428084Yeah, I'd say kontext pro certainly wins that one. Fuck SaaS models, though, still.
>>106428394
https://files.catbox.moe/i8tbj8.png
Prompt: A woman is cosplaying at comiket.
Left: Krea, Middle: Imagen 4, Right: Nano banana.
They're kind of similar.
>>106428084>>106428517This shit is really pandora's box.
>>106428406>>106428397>>106428377initial test results:>works instantly with simple "remove her clothes" prompt as advertised>nipples a bit weird still, seems to like adding bush which is nice>does a much better job of preserving body shape/anatomy. I was able to get nudes out of qwen with no LORA and also with that NSFW LORA, but they tend to modify body shape a lot moreit's a step forward but not great
>>106428524(I mean Krea Dev on left of course)
https://huggingface.co/starsfriday/Qwen-Image-Edit-Remove-Clothesso uhh, I guess it works."remove all the clothes of the figure in the picture", like the huggingface prompt examplesbikini be gone, AI magic:https://files.catbox.moe/7khb2e.png
>>106428574also in the event of any issues you can just inpaint to fix with any noob/illu model for realism or even anime models will fix it.
>>106428517SaaS-crap is not necessarily better. Just out of the box Qwen is not that good at realism. It needs some kind of photorealism tune, then it's on par with those.
>>106428574blue board GODAI is better for funny memes, because all the small details make most porn uncanny. If I'm a tit man I'm not going to jack off to some weird AI nipples.
>>106428524were you using a Lora there? I've never seen Krea natively have that kind of random text watermark, ever
another example, can easily retouch with inpaint but the core functionality (lora to remove clothes) works in qwen edit."remove all the clothes of the figure in the picture. her skin is light."had to add a skin prompt cause the shadow was making the right side appear tanned, lolhere's an a -> b example for the lora. retouching/inpainting would take like 20 seconds, the core functionality works great.https://files.catbox.moe/rl4em1.png
>>106428600still sloppa
>>106428635No LoRA.>>106428636Careful anon, are those real pics?
>>106428524this. if you are not familiar with AI tools, you can easily think that the same tool created them
>>106428670it's a test image for some random gravure photoset, not rando girls.
>>106428649different guy but yes it is a remake of that first one lol
>>106428476
nobody wants to use outdated sdxl 4ch vae sloppa. it's 2+ years old now...
also, even if it was trained on realistic photos, it works for anime too. should have prompted black blindfold to keep it on but...you get the idea.got a nice 2b racing stripe:https://files.catbox.moe/zve1dd.png
>>106428672The same tool can create them all depending on seeds and prompts imo. Maybe by default you will see a more realistic output and background on the Google models, and the outfits and props are more involved, but aside from that, with good prompt engineering it should be possible on Krea.
>>106428713"remove all the clothes of the figures in the picture."yes, I know it's incredibly easy to generate noob/illu anime lewds. the point is testing the lora on what is by default a model that doesn't allow nsfw. I think it works better than the kontext remover, imo.https://files.catbox.moe/pjynov.png
kek, it even worked on the misato picture from eva.source is this:result is this:https://files.catbox.moe/pvgkg3.png
>>106428734Not surprising. Flux is not a strong anime base.
>>106428748it did a decent job of preserving the style there
>>106428590That would be great, for sure.Still going to mess around with it some more, but the lighting on the model is real annoying, it's so damn bright. Might fuck around with model shift tomorrow, gotta sleep now though.
>>106428758yeah, for the anime images ive tried it has preserved color/shading style which is neat. kontext is pretty good but qwen edit is even better imo at preserving style.
>everyone forgetting it's a blue boarduhh based?
>>106428748the purple hair anime woman in the picture is standing on a sunny beach, waving hello to the camera.this time just a reg prompt with the 8 step lightx2 lora (for speed purposes)neat. I didnt even prompt to keep her expression the same or style the same or whatever.
>>106428789got the blues and totally bored. so?
>>106428799this time with "keep her expression the same" added:
>>106428789I'm colorblind
>>106428835thought i was in /sdg/ for a second
>>106428803the purple hair anime woman in the picture is sitting in a business office typing at a computer, with a large white CRT monitor and white tower computer. she is typing. keep her expression the same.nice retro pc.
>>106428802blue boards are stupid. adult website. not nearly as much grooming as roblox or discord
>>106428845
>>106428636seems like undertrained, the breasts size are totally different in the output image, same as the kontext nudify lora, it would give women giant boobs
>>106428855last misato pc.
>>106428094cute, borrowing that gen
>>106427243
>>106428872Neet. I removed the garbage at the end.
>>106428718literally no one was even saying Imagen was "better" specifically, my original point (which I maintain) was just that most of the time if given a very short prompt, it's really not all that stylistically similar to Flux Krea at all (both for realism and non, really). The other guy's cosplay example is a pretty rare exception IMO.
>>106428872rocketnon this is horrible we can all tell which gens are yours
>>106428872extremely good clarity. wan 2.2?
>>106428901this one is dope
>>106428907cry harder no gen
>>106428906For every image I gen with nano banana, I feel like I'm genning with Flux Krea. This behavior is consistent across every photoreal prompt I tested.
>>106428872>>106428901really nice, thanks
>>106428907rent free
>>106428939he is a THREAD SHITTER HE IS WALDORF CANT YOU SEE THAT?
>>106428962In fact, it wouldn't surprise me if Google just picked up Krea weights, did some tuning to make it multimodal and then called it a day.
Having a picture of a girl and prompting her to remove her clothes feels like a form of mindcontrol and is very hot.
>replying to himself
>my order is wrong, and my day is ruined.
>>106429003>everyone i dont like is all one human
>>106429012okay now we have a wan candidate. no violence just shooting a vending machine or something.
>>106428996It's a bit too good at anime for it to be Krea, it's just probably once you get photoreal you can't really get more photoreal lol>>106429000Then that strip LoRA for Wan may just ruin you lol
>>106428600
>>106429012>>106429022From the size and fit of his suits, I didn't think he was that big...
>>106429036he's a big guy.
>>106429022nice chair bro
Gimme your Labor Day Special!
>>106428969Slop
>>106429023he has to be trolling lol, there's no way he actually thinks there's any chance that Nano Banana (which like I said earlier is VERY CLEARLY related to Imagen) is a Krea finetune
>>106429025nice
Disturbing lack of gens in this thread.
the man holding a plate with a hamburger is upset and throws the plate at the counter behind him. the people behind him squat down to avoid the plate.upset brah
>>106429172I'm not genning anything interesting... just fiddling with my LLM sys prompt tonight.>>106429179His expression (or lack thereof) had me giggling pretty hard. All hunched over like that.
>>106429179I don't know how McDonalds taste in America, but every time they tried to start a franchise where I live they always shut down a few months later because of how terrible they taste.
>>106429172
Does Wan2GP have a proper NSFW model?
better toss:
>>106429133
It can be. Outputs are very similar and we don't know how Google trains their models. This could be a Flux-based model, such as Kontext, or a tune.
>>106429023
That's what a tune would do, improve it at anime etc.
>>106429199amerifat here. last time i ate mcdonalds it was honestly the worst fast food i've ever eaten. i was upset
>>106429206cliprel. he needs an uzi, and to shoot hamburgers.https://www.youtube.com/watch?v=XkwQ6EjLdMQ
day: ruined
https://huggingface.co/tencent/HunyuanVideo-Foley/tree/main
How does this fare in comparison to the Wan audio model? Anyone tested yet?
>>106428027this is what a Lauren Boebert lora looks like by just epoch 10 with these settings (this doesn't mean it's done though, Qwen has a gorillion parameters and isn't distilled so it's VERY hard to overtrain a Lora on it, the only thing you can do to combat just the way Qwen looks in general and add more detail to the likeness is to keep going for basically as many epochs as you can afford timewise or moneywise or creditswise or whatever)
holy shit, well this was unexpected. the man holding a plate with a hamburger runs towards the man on the right and hits him with the plate. The man on the right falls down. technically...it worked?
>>106429253and yes this was reply-to-self if anyone was confused >>106429211no it can't, the overall prompt adherence of Imagen 4 and Nano Banana has zero chance of being from a T5-based model, they're definitely on a much larger, way more advanced Google LLM with basically infinite context as opposed to the 512 token limit of T5.
>>106429270I need to specify wearing a tie to make it more obvious who is the one doing the action. still funny though.
the same guy did it. I even said man with the tie.
>still no chroma category on civit why
>>106429253Here, I got you a present: https://civitai.com/models/1910170/woman-feet?modelVersionId=2162006
Do any of you guys know anything about good printers? I'd like to make little wallet-size print-outs of my sexiest 1girls to carry around with me, but I'd want the best possible color depth and DPI and so on, so I can really enjoy looking at it up close. Has anyone done this?
>>106429297staff too busy trying to keep the site from imploding
>>106429298why would I want this
okay, now brah is dealing the pain.
>>106429275>512 token limit of T5. When I'm describing an image with gemini or joycaption, can I specify the token count and will it understand it?
>>106429306i thought they figured out the payment stuff and got a new processor?
>suddenly firefox is using 50% cpu and eating up 15gb ram >returns to normal when I close all my civitai tabs civitai sometimes goes fucking crazy, are they mining on my cpu or what the fuck
>>106429314nah hommie, asuka don't cry
okay. this is what I wanted. get em reviewbrah.
>>106429324Gemini can kinda do it but not with extreme accuracy, so it helps to sort of leave headroom with how many you tell it to cap at.
>>106429275Haven't seen how long their context limits are. Though LLMs show it's possible to extend that dynamically even with existing architectures: https://kaiokendev.github.io/context Plus a prompt adherence improving paper came out that even helps SD 1.5 with no changes to the code. Google does have talented engineers, and there's no doubt they probably have a better model from a higher quality dataset, but what they're doing is not necessarily more architecturally groundbreaking than what we have, at least not yet. It seems like it's just a scaled up version of what we already have, and even then, look at the far right image https://files.catbox.moe/i8tbj8.png The faces in the background... That is not perfect. I contend that many other models can do better. Again here is Krea dev https://files.catbox.moe/qmbyb5.jpg You guys are really overhyping this Google model.
>>106429365If lode gets it working, Chroma will move on from t5 to qwen 2.5
in theory we aren't far away from full AI reviews but you cant replace reviewbrah
>>106427085i stitched together gens with no prompt but to what end
success the anime girl stands up, and picks up the white CRT monitor and throws it out the window to the right.
>>106429365Nvm, seems Gemini is autoregressive like 4o. So it understands prompts like "show me a room with no elephants in it, make sure to annotate the image to show me why there are no possible elephants" which is pretty cool. Not impressive aesthetic wise, but still impressive architecture/prompt following.
I think nano banana is better. Qwen is my new watermark remover now I guess.
the anime girl is typing on her computer, and the site 4chan appears on the screen, along with the text "/LDG/". She gives the thumbs up. not quite 4chan but still a good gesture
>>106429460Gemini result https://files.catbox.moe/18zeg3.png
make sure to remove tranistudio from the next bake
In wan, is it better to get a sampler that converges fast or something slower so the refiner low noise model can do something?
>>106429434lol i do that sometimes too, shits fun just to see what it does. https://www.reddit.com/r/comfyui/comments/1n3qm5c/ this dude did it with s2v, got a cool one with no image/prompt input.
>>106429545>>106429545>>106429545
>>106429460I think at this point they probably train that in since it's such an overdone benchmark lol. Try man without a beard. Will give you a man with a beard
>>106429440lmao epic gen
>>106429480based??
any tips for achieving extreme angles with SDXL? like from directly below?
>>106429336Slop