Obese Nerds Using My Images EditionDiscussion of Free and Open Source Text-to-Image/Video ModelsPrev: >>107017112https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/sd-scripts/tree/sd3https://github.com/derrian-distro/LoRA_Easy_Training_Scriptshttps://github.com/tdrussell/diffusion-pipe>WanXhttps://comfyanonymous.github.io/ComfyUI_examples/wan22/https://github.com/Wan-Video>Neta Yume (Lumina 2)https://civitai.com/models/1790792?modelVersionId=2298660https://gumgum10.github.io/gumgum.github.io/https://neta-lumina-style.tz03.xyz/https://huggingface.co/neta-art/Neta-Lumina>Chromahttps://huggingface.co/lodestones/Chroma1-BaseTraining: https://rentry.org/mvu52t46>Illustrious1girl and Beyond: https://rentry.org/comfyui_guide_1girlTag Explorer: https://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/b/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debo
btw is there some good method people use to mix up qwen's low seed variety other than having a whole LLM create prompt variants instead?
blessed thread of chroma :-)
SONGBLOOM IS THE NEW KING OF LDG
Has anything surpassed Illustrious yet
>when u SHAFT look too hard
this thread should be all about asses
>>107019722in regards to what
>>107019711>is there some good method people use to mix up qwen's low seed variety other than having a whole LLM create prompt variants instead?Let me answer your question with another question: if a model has low seed variety, how else could you mix it up if not for varying the content/structure of the prompt?Completely random thought I had: I wonder what It would take for diffusion models and their text encoders to be smart enough to understand Pig Latin. Is it really that much harder than understanding a non English language? You'd just need enough examples in the training data
More and more people are realizing chroma is a shitty failbake, and it's glorious to see
>>107019722in many aspects yes, but unfortunately not all of these aspects in one practical modele.g. qwen utterly wrecks illustrious for multiple characters or text or such, but it has some aesthetic failings (includin with anime) and it can't do most anime characters.you might say aesthetics are subjective but I don't thin most people particularly like qwen's
>>107019737it's more like we were convinced that chroma would end up being good, but it didn't improve much between v25ish and v50, it even got worse
>>107019569Chroma does black ladies very, also try different camera/lighting loras for different flavors>>107019729Yes
The disillusionment I am under after realizing the repercussions of it being a wrapper cannot be exaggerated. What the fuck.
>>107019733> if a model has low seed variety, how else could you mix it up if not for varying the content/structure of the prompt?for example by messing with the initial noise? add some blotches with patterns or something.or some additional thing to the model that does <something> at 50% of the steps... idk, segment and just move the discernible elements aroundthese may not be extremely clever ideas, but that's why I'm asking of some had a good idea
>>107019722noob. though it's basically just a variant of illustrious.neta yume is about to beat noob though if they can train some more artist knowledge into it.
>>107019768I know understand why corsets were so popular in the 19th century, damn that looks good to my monke brain
>>107019729>this thread should be all about assesI am on a genning hiatus to try and improve my life (and also I am burnt out and waiting for videos with sound) but one of my benchmarks for video models now is if it can actually generate an accurate body from behind for the age and height. Wan is really bad at this in terms of both prompt adherence and just composition in general once you try to do anything that isn't an Instagram model in her 20s-30sOh and you're thinking too much about cunny, I'm talking equally about being able to generate anatomically accurate 70 year old granny cheeks in thongs too. Both a 70 year old and a 07 year old are most likely not in the training data which means that it would have an actual understanding of human anatomy across the entirety of the aging process at this pointI hope that's enough ass discussion for you>>107019755>Chroma does black ladies very, also try different camera/lighting loras for different flavorsI think anon wanted anime/2.5d black women while keeping their facial features like a 90s anime or something like that, not photorealism
me on the right
>>107019775>I now understand Is this the first time you have learned about the hip-waist ratio? That's what corsets exaggerate, there's nothing inherently attractive about them. If anything it should be considered the original "shapewear" and is actually frauding and lying about the woman's bodies, but that sentiment doesn't exist because when corsets were a thing only rich people wore them unlike fat poor women at the gym wearing shapewearSometimes it surprises me how little people introspect especially about the things they're uniquely passionate for like their own sexuality to the point where a 1girl slop makes you realize something about yourself and about the world
>>107019791nice extra fingers, sd 1.5?
>>107019790It do be like that mr mice
>>107019799it's not that deep bro, women with corsets make my pp harder than without, end of the story
>>107019800Oops. Fixed. Maybe.
>>107019800>extra fingersthere's 5 fingers on each of her hands, what are you talking about?
>>107019807>it's not that deep broYeah it is, retarded mysticist. There is an explanation for everything. Thanks for self reporting that my paragraph about your lack of sexual introspection struck a nerve, though
>>107019819you must be blind retard>>107019818
>>107019821>struck a nerve>retarded mysticistOh how I love irony.
>>107019821why do you need introspection on your preferences? you can't control what you like
>>107019827>my pp gets hard>*insert autistic rambling how its because u monkey who lack introspection [LMAO]*>yes im such a deep and intelligent individual unlike a mongrel like udont even bother replaying to that retard
>>107019821>>107019799Usecase for sexual introspection?
Local Diffusion?
>>107019840Thanks for self reporting that my post struck a nerve, though.
>>107019775agreed. we'll certainly gen a lot of girls with corsets over time.and they were quite popular in modern times too (goth girls, now social media)
>>107019799>Is this the first time you have learned about the hip-waist ratioI know about the ratio obviously, I love the 8 shape, but I've mostly seen corsets from history books, and they kind of looked gross with only illustrations with stomachs so constrained it looked more alien than anything hot.
>>107019955>stomachs so constrained it looked more alien than anything hotthere is a reason it's a ratio and not "zero stomach", like anything if you go too far it goes quickly from hot to horrible
>>107019677>https://www.reddit.com/r/StableDiffusion/comments/1ogx7j4/chroma_radiance_mid_training_but_the_most/Radiance absolutely is looking insane and like it will be the endgame once Chroma is done.
>>107019721https://vocaroo.com/186PM8O3j8W2
>>107020025the doll has a skin texture less plastic than the actual woman lool
>>107019790Oh look it's assholegolden!
>>107020047Bonus points if you can guess the style mp3
>>107020056Ahah so true, I must add perchance that orange man is bad! Take my upvote kind sir!
Is it possible to block swap native wan gguf (any wan) to load more frames?
LOL
>>107019684>Open thread>No feet sloppa>Leave>:/
>>107020052The woman is Korean so the plastic is realistic, notice it's only her face anyways.
>>107020072>The woman is Korean so the plastic is realistica sweaty korean woman that just went out from a sauna then lol
>>107020025lodestone should have given up on it as soon as qwen came out and switched.
>>107020068yes, the redditor is also shilling the model here, take a look at this>>107018294https://www.reddit.com/r/StableDiffusion/comments/1ogx7j4/comment/nlk1foz/?utm_source=share&utm_medium=web3x&utm_name=web3xcss&utm_term=1&utm_content=share_button
>order ram>its not compatible afterall>order another pack>gets delayed>get new date>get email that its out of stock>find a new pack miraculously>arrives today to postalbox at my apartment building>get message that the box is full and the package will be redirected to a 10min drive awayFUCKING CURSED UPGRADE REEEEE
>>107020086this, if he finetunes Qwen Image Edit and makes it VAE-less then he'll be the true hero
for me, it's all about the SongBloom.
>>107020091I'd feel bad for you but there are zero local models right now worth upgrading your ram forUnless you went 16 -> 32 and I wouldn't even consider that an upgrade but more of actually buying the entry ticket to the amusement park>>107020101your name is so fucking gross bro I hate you
>>107020068hobby is full of those expressing and exploring their aesthetic sensibilities for the very first time
>>107020118Actually I think burn-out where you can no longer see the slop is the bigger issue. Because now that I've stepped away from genning and I'm in the waiting phase of the cycle sdxl models are unpalatable to me. Actually they were equally unpalatable at the start of this year but still
>>107020153>where you can no longer see the slopsome simply cannot see it no matter how often they look, is the thing >>107020153>sdxl models are unpalatable to me. Actually they were equally unpalatable at the start of this year but stilli agree in that the only worthwhile models are the massive finetunes and not the barrage of cancermixes you can really stretch your enjoyment finding new artists to train little loras on desu, short of something truly new and exciting
>>107020107that's some chroma-looking mesh there
I spend all evening genning 1girls with the largest possible breasts and then I go to sleep dreaming about the girl I'm in love with irl who hardly even has breasts really. I see the contradiction, but I don't feel it.
>>107020198here it is with "fishnets, mesh" in the prompt as they were not before https://files.catbox.moe/w9wh1l.pngbut i agree netayume's details could be a bit tighter
>>107020241dont fret it, fishnets are hard to do for a lot of models (sadly). Chroma is quite terrible at it too. Didn't actually try Qwen.
>>107020153I went on Artstation this weekend and was stunned by the quality of the art there. I've been staring at slop for so long...
>>107020025It is a very cool model! So Reddit picked up on it and actually made nice gens - cool.>>107020086It takes ~3x+ as much to train. it seems wise to train radiance first actually unless pony or w/e provides the resources. >>107020093Qwen Image Edit also is a somewhat different training dataset than for Qwen Image.Also not sure VAE-less will make it better, Qwen does seem to have a nice VAE. I guess you'd have to try?
>>107020294>Also not sure VAE-less will make it betterbecause it won't compress the image if you edit it, it'll just change the area you wanted to change, that's really important for professional work, you start with a png you don't want to end up with a low quality jpg because you did the edit iteration too many times
>>107020254qwen and hunyuan can both do them pretty well
>>107020294>it seems wise to train radiance first actually unless pony or w/e provides the resources.I think that continuing to train the flux base, especially having already fed it a bunch of 512x images that negate the benefit of no vae, is a waste of time. Sunk cost
>>107020334loool
>>107020334Poorfags should not be allowed into this hobby, and quants/speed loras were the worst thing that ever happened to local.
>>107020025>Radiance absolutely is looking insaneit's true that it's so shiny it's radiant
>>107020239>I spend all evening genning 1girls with the largest possible breasts and then I go to sleep dreaming about the girl I'm in love with irl who hardly even has breasts really. I see the contradiction, but I don't feel it.I don't see any contradiction. I am interested in women with J cup breasts for 10 minutes at a time a couple of times a week at most. I am entirely disinterested in dealing with a large breasted woman beyond masturbating to her. I don't want to deal with the personality, society, upkeep etc required to have and keep one in real life. I am not interested in destroying the physical chemical pair bonding I have with my real life wife who I love by cheating on her with a IRL prostitute either. These are just a few reasons off the top of my head why AI generating a big titty woman is preferable for everyone
>>107020364>Poorfags should not be allowed into this hobbyI hope you can run HunyuanImage 3.0 anon, or else I will unfortunately have to call you a poorfag
>>107020364Poorfags are still not let into this hobby. If they were then these threads would be more active from 60 series shitters and laptop GPU zoomersBy the time "poorfags are let into this hobby", kids are gonna be running around with VR glasses creepshotting their mother's friends under the kitchen table
>>107020370I've had crushes on a lot of women in my life, but none of them ever had big tits. That's the part I don't understand. The women in my family all have DD's (and not from being fat), so you'd think I'd be genetically drawn to that, maybe I am, but as it happens I've never felt that kind of strong desire for anyone larger than a B-cup.Other than my sister of course.
>>107020086>Qwen20B model, so overbaked it lacks variety, realism LoRAs are at best on par with Flux despite lack of distillation (realism tunes completely breaks anatomy among other things if you ask them to do anything novel outside of the training data).It's good as edit model because BFL never gave us a proper Kontext Pro (which equivalent of Dev), but tuning it would pretty much yield worse or similar results to Chroma at a bigger cost.
Reminder the 5000s series cards are still in poorfag tier
>>107020453Severe cope. Base qwen is already much more flexible than flux was when it was released (flux couldn't even do basic animu), and it's not an intentionally crippled distill like flux either.
>>107020453a lot of what you're talking about is solely the result of retards refusing to accept that they HAVE to caption Qwen with proper natural language or it won't work properly.That said last time I checked translating my prompts into Chinese while running one of my loras (which was only captioned in English obviously) produced nearly identical results, so I don't think the multilingual support even has anything to do with captions. Meaning even if you captioned it in proper natural language Chinese it would probably still work.
>>107020475>(flux couldn't even do basic animu),To be clear it could but only in a very narrow range of style.
>>107020254>Chroma is quite terrible at it tooIt's all about the prompt you give it. Don't have a skill issue and it won't suck.
>more boring asian woman slop
>>107020606>complainer is an eternal nogen
>>107020328i think as long as AI is good about inserting a fitting thing into whatever area you change, it'll be ok either way?>>107020348slower training AND THEN 4x or 8x larger images? it's just getting harder to do for not very much benefit most of the time.no, 512px images don't really "negate" the advantages of the pixnerd method? they may not be the most optimal thing but you'll find it difficult to train on 100mp images perfectly sampled from 400mp+ photos
We are grateful that chromajeet attaches his same shitty gens to every post, otherwise i'd think there was more than one retard in this thread delusional enough to believe chroma is a anything more than a failbake
Is there a single place left on the internet that isn't overrun with newfags who cannot figure out how to read text in command line, pajeets spamming pure sloppa or coomers that are satisfied with 1girl standing?I miss 2022
>>107020823>2022>>107020651>We
I wish we could go back to the days of Dall-E Mini
>>107020610They are nogens for a reason.
>>107019768>>107019711I'm messing with this now, because, surprise, I dig qwen prompt adherence. And it's so very fast with lighting lora I can iterate almost instantly.Results are inconclusive. Had images previewed instead of saved in Comfy, so nothing to cherrypick from.All in all, the attack vector for now is splitting sigmas at first step and dropping first sigma to about 0.85-0.95 in a heavily front-loaded scheduler (say, beta with alpha 3: beta 20) so that first step denoise defines low frequencies. And then follow up with 3 steps of a well-behaved sampler which solidifies the intial composition (which is *occasionally* out-of-distribution) into the typical qwen look. Results are a mixed bag of typical qwen gens and worst imaginable murky brown chroma gens (but with qwen prompt following), with very little middle ground. This is understandable when one considers the nature of sigmas, but still annoying. I'm also trying to keep total gen time under 15 seconds for 512px on 3090 because I ain't got all day.Adding external noise at the start is a compromise between qwen's RL'd "mean" image and low sigma brown bog, but rn it introduces trypophobic patterns I couldn't manage to contain... yet, or maybe never. I've seen it's source, though, it's the edges of DiT patches in the low contrast areas.The next attack vector would be getting rid of the scheduler entirely and passing noise in hand-crafted dosages. Why bother scheduling when you're denoising just 4 steps? Not sure how different that would be.
>>107020591this is the only example of it from chroma that I've seen that wasn't severely artifacted, so I'm curious about what you did
>>107019711>>107019768Mind spoonfeeding me a catbox or two for qwen? I'd like to start messing with it too, but lately have no time to keep up with various models and their quirks
When will Chroma become good?
>>107021193when pooma 2 is out
Retard here, is there any way to run stable diffusion with vulkan, like koboldcpp vulkan support for llms?
>>107021193when he decides to stop this fluxenstein shit and finetune qwen instead
>>107021304>stop this shit and finetune worse shit instead
>>107021118that's zoomed in, try with a full body shot or with a fitter fishnet, models spaz out
>>107021149unlike the other anon i don't really have anything interesting. it's basically the template from comfyui but loading a gguf with the comfyui-multigpu loader + loading loras with the rgthree power loader for convenience. i could share it later but really you might as well use the template.
>>107021368yeah, it looks like a very low frequency pattern picked to be too large to exhibit the issue.
>>107021374Sure, but what about prompting? Does it need huge paragraphs? How do you get anything stylistic or drawn? Those are mainly my issues with post-illust models.
>>107021220nice I'd like to see the worklow
i this still how you upscale shit or should i be using a more up to date method?
>>107021382you don't need huge paragraphs, but you can use them and get more consistent results. it's easy to get generic "styles" like "anime" or "renaissance painting" or "pencil drawing" but you'll struggle to get more specific than that. if you request a subject associated with a particular style, it might override whatever you requested.
>>107021478Can you get consistent original characters and styles across different gens?
>>107021496sort of, it doesn't like to deviate too much so it will generally give you the same result for the same detailed prompt. you may alternatively want to generate the character in a neutral pose and then use qwen image edit to place it in different scenarios.
>chroma + lanpaintholy shit why didn't anon tell me this based combo
>>107021681please elaborate
>>107021698no
>>107021698you can inpaint realistic pussy for example
nigga i think 4chan barely stays afloat as it is
>>107021738catbox please
᠏ ᠏
>>107020067yes, with multigpu nodes. I only discovered this recently. been gimped by kijai this whole time AAAAAAAAAH
>>107021382>Does it need huge paragraphs? I just describe things in sentences until it's about the thing you wanted or until I find out qwen can't do it. Most prompts give nearly identical images so you will have to wildcard and vary prompts a lot to get actually different images. I haven't figured out a particularly good way yet.SDXL or Chroma or Neta Yume or Wan or others vary vastly more with just a different seed.> How do you get anything stylistic or drawn? 2d digital drawing, pencil sketch, watercolor painting. cubist acrylic painting. statue, ... a lot of that type of thing works. some classical art styles work too.
>>107019845liquid deathlucid dreamslethal dose
>>107022014add qwen and lumina to this.I think lumina uses the flux vae maybe? i dont rememebr
stop d*bo
>>107022059Are restarts a meme? I usually train with cosine with restarts and Adamw8 but I heard they are useless with adaptive optimizers. Anyone knows if this is true?
it's funny how Qwen Image Edit doesn't give a single fuck about the original image's style, it'll add the new element like you'd add something on paint and call it a day, that's soo lazyyy
using a seed that was made from lightx2 wont work good. are seeds even still relevant for trying to get the same output with wan?
that is indeed the issue
>>107022081they don't, the only other super-high-image-count finetunes ever done (starting from a literal base model as in SDXL) are like Animagine and BigASP. Starting from something like Chroma which has heavy concept knowledge already though is going to be a totally different story even if Lodestones insists it's itself a "base model". A high-diversity high-quality dataset could make it way more coherent with nowhere remotely close to millions of images.
>>107021900doesn't wan do this by default anyway?
>>107022069that's why I'm still using kontext
>>107022087I am skeptical of the claim but would be great if you can publish the lora that can do that.
>>107022089If you can't tell what model an output comes from you should just give up honestly
hunyuan 3 is supposed to be like that, like an LLM that can generate images (why it's so big)
>>107022090i mean that for example the native workflow already block swaps to ram and paging file without the kijai block swapping node or the distorch node, no lora needed
>>107022066nta, Adaptive optimizers set their own internal learning rate. Using anything other than a fixed lr with them is a bad idea all around
>>107022116I see what you mean.I actually haven't tried specific styles with chroma, mainly just realism stuff.Shame that he also did this stupid BS.
>>107022117>notice how he never posts gensthat sentence is /sdg/ coded btw
>>107022118If you haven't noticed Chorma can't do artist tags despite having a dataset that includes various artist and styles, this is a conscious effort done by the creator and has historically lead to massive issues with models as we saw with SD3 being dumbed down and the latest victim being pony V7. It always seems to do massive damage to models because concepts and other things that are often put into separate tags or related to some tags become completely fucked and you get stuff like inconsistent styles or broken anatomy like SD3 not being able to produce an image of a woman lying in the grass because they decided to obfuscate anything close to a sex act.
>>107022119Show me a model that does as well as consistently as chroma at as many styles and is not qwen which is style locked
so the shitter is just reposting posts from the last threadreport>>107022129>>107022100>>107022096>>107022074
>>107022126>Flux creators give training guidelines>decide to ignore them when the goal was to make a flux model without restrictionsAlso since you're pointing out single gens it's clear to me you're not at a level for this discussion, if a model is not consistent then it's worthless especially when you have to put so much time investment into creating chroma gens compared to other models even on top end hardware.I think we're done here because you fail to realize that 1 out of 5 gens is not acceptable when you're trying to maintain a look or style especially at the dog shit speeds chroma runs at
Hopefully they start fully supporting python 3.14 my generations are 2x to 3x faster with it but it's extremely unstable.
>>107022140Or deliberately p-hacked BS that is useless in practice.So many fucking papers on arxiv that does not yield promised results when trying the experiment on different data/params.
>>107022116nice try diddy
>>107022145Hypothetically if I were to train one where would I even upload it at this point?Can't register to that broken piece of shit seaart at all and tensorart asks for glowing email.
>>107022148no, pixnerd doesn't improve the training speed and works on the pixel space, that one works on something different than pixel space and latent space
melty
Use a non realism lora and post 5 back to back seeds using the same prompt, I'm not fucking spoon feeding you
id rather have 99% perfect details with crazy good aesthetics and prompt following AND style flexibility and then just fix small details with adetailer over having nothing but the same image with better small details due to being overfit to death on a single styleIts one or the other, at least until the vae issue is fixed
What is going on
>>107022182Him getting in Loadstone's ear to make a equally shitty model. The irony is this piece of shit is worse than chroma and instead of following THE FUCKING DOCUMENTATION he decided to listen to pony fag to make a stillborn model that can't even work with loras.Fuck both of them for burning so much time and money
>>107022182see>>107022140
>>107022185only low iq browns cant extract the 95% of good from something and focus on the fixable 5% bad
Hey guys, I need a little bit of help.Instead of generating videos locally I wanna use an API that uses wan or other models. I'm trying Pollo API, i can manage to make it work but the generated videos are midly NSFW.Is there a way to directly use an existing API (could be another one than Pollo) to generate full NSFW (porn) content using image to video?
>>107022191show me a image remotely similar both style wise and quality wise from qwen or sdxl or illustrious at the same res, also that is the point of chroma radience which is training, all models have that issue with tiny details
>>107021440there are native comfy nodes. what is even that monstrosity? WAS?with ESRGAN or related upscalers (like yours), you DONT really decide the upscale ratio, it's baked in the model. how does the upscale_by even work here? these kitchen sink nodes are fucking garbage.There are other ways to do upscaling, one is controlnet tile upscale (model dependant), another is LATENT upscaling, and finally we have the new entry on the block, seedvr2 upscaler model.DESU I mostly do seedvr2 upscaling nowadays (I've got the vram, it's fucking HUNGRY), for the simple reason that with CNET tile and LATENT upscale you need to do the upscaling the moment you're generating if you have a complex workflow.With esrgan or seedvr2, you don't care, so you can just handpick any image you have and shove it in there.
>>107022217NTA and I have no experience but as far as I understand the most you can do is keeping separate parts of the model (TE, UNET, VAE) on different GPUs. This can provide a moderate speed up as opposed to constant loading and unloading (especially on lower VRAM systems) but it's not even close to what you think can be done.
Is it true you can stack multiple cards or is vid gen more nuanced~?
i have trained pony loras in the past, is the process the same for illustrious? can i follow the same guide kohya guide i used for pony?
JANNIES
>>107022226I mean I can run it on 12gb VRAM but it runs like shit.
>>107022219I'm not talking about VRAM limits or how to offload what to where>I have no experience>it's not even close to what you think can be donegetting mixed signals here. I've just given you a list of different upscaling methodologies, yes it's not exhaustive but I'm not sure what you're asking for then.
>>107022230Where's your songbloom, bitch?(shoves you)
>>107022238Isn't there a node that lets you offload vram to another card? So instead of going to system ram, it goes to the 2nd gpu. This wouldn't speed up video gens, but atleast you'd be able to use higher parameter models without worrying about OOM'ing. Also great for higher res videos
>>107022217> how does the upscale_by even work here?it downscales, keep or upscales upscaler output, is it so hard to assume?
>>107022248You can, but probably not with comfyui. Most newer models support multi-gpu in their reference implementation.
>>107022248how is it downscaling? area? lanczos?
>>107022256sdcpp can but pootorch is just jank
When I see your pictures I'm comming to the conclusion that online models are better than local. PixAI is superior.
I mean, on /gif/ I can see full-blown NSFW content that I can’t seem to generate using the Pollo API.I’ve tried with WANx models, Kling, and the Pollo model itself.Is it a matter of the model training, or does the Pollo API apply some kind of moderation/filtering?
>>107022217> with CNET tile and LATENT upscale you need to do the upscaling the moment you're generating if you have a complex workflowelaborate
>>107022261>posts a terrible plastic slop
>>107022261not sure what you mean because of how you are asking the question but character loras work just fine with wan 2.2, you would need to train them for wan 2.2 and then use them as you use loras normally for any mode
>>107022242yes there are multiple (KJ and MultiGPU) but I never pointed out OOM issues in my postwait are we fucking getting the thread botted? what the fuck is happening?
>>107022263this is one of the saddest posts I've ever seen on 4chan, why are you so passionate about defending this model?
>>107022264As someone who is only interested in text to video (and is burnt out of that and waiting for a local sora 2 so I can generate teens acting bratty with sound before going back to genning),The fact that local models are in such a state that you even wrote a three-paragraph persuasive essay is top grim. I don't think it could get any grimmer than this. At least time will heal all wounds and it's just a matter of surviving and waiting for things to get better.
>>107022265>another wall of textthis guy is seriously mentaly ill
>>107022226> i have trained pony loras in the past, is the process the same for illustrious? can i follow the same guide kohya guide i used for pony?yesit's scatPICKLER is spamming the thread for sure
>>107022268I used diffusion pipe's defaults. LR 1-e4, 68 epochs, over 500 images, took me about 3 days
>>107022282that sounds great. i tried ace step and it was crap it couldnt even follow a consistent beat.
>>107022270rank 64 seems kinda high DESU, at least if you mean that in Kohya scaling where it's gonna thus be > 500 MBwhat training settings? was it trained at 1024x1024? If not that's your problem lol
>>107022256> how is it downscaling? area? lanczos?look at that screenshot again
>>107022269I have now idea what you're saying buddy. I just want to generate NSFW content lol, i'm trying to find the best solution, could be pollo or anything else I dont care
>>107022273it's funny how Qwen Image Edit doesn't give a single fuck about the original image's style, it'll add the new element like you'd add something on paint and call it a day, that's soo lazyyy
>>107022293can you fuck off nigger
>>107022292Same trend I am seeing as I go down my list of VAEs and eyeballing the error image brightness. Flux and Hunyuan are best for the random 3 anime images I tested.
>>107022300What does it do anyway? It's compression?
how the fuck do you search while banning tags in civitai?
>>107022395Don't ask me to find logic in that.
>>107022297born to coom, forced to troll
>>107022413now this looks like chorma
>>107022425i don't get
>>107022447now this looks like shityou understand now?
>>107022425
>>107022447nigga claimed previous goth thots gens were chroma, but they were qwen or wan
>>107022457based fellow satanic sitcom coomsumer
Why does /ldg/ have the most abhorrent taste in women?
>>107022467we are different anons
>>107022478post your 1girl. I'll wait
and this >>107022472>>107022482 is wan or most likely qwen
>>107022478you'd think I'd marry my gens or something? what does it matter
>>107022478I expect it to be banned the same way nude loras for qwen image edit systematically get banned.
>>107022467This is what chroma radient is upposed to avoid.
>>107022502Hunyuan 2.1 has a 32ch VAEActually, I don't remember if it uses it for the full process or only the refiner
>>107022489Yep.Despite being still 4 channel the SDXL vae was a lot better than the turbo garbage in SD 1.5 (for which you wouldn't even need comparison node to see how much it destroys the image)Hence why it was still a noticeable improvement over its predecessors in term of fine detail.
>>107022413ramtorch will save us unironically
>>107022454it's VRAM usage. the vae is for converting to and from latents. openai already did this for years because of the compute they have access to. it's faster but the entire point is sd1.x having a vae in the first place was to fit on consumer gpus
>>107022529I have now idea what you're saying buddy. I just want to generate NSFW content lol, i'm trying to find the best solution, could be pollo or anything else I dont care
>>107022489its chroma lol, I tried qwen before but it seemed to me like it gave me the same image over and over but when I see images like >>107022502 it makes me wanna try it more, but I feel that qwen is stiff AF when it comes to image generation
>>107022551My bad. I didn't realize more modest SDXL finetunes below pony/illust level still trained on millions.
>>107022551try lowering the shift, might get duds but loses that porcellain look
>>107022566posted wrong image at first:pony is better than originally thought (still not good) but it has major issues, he did not do uncond dropoutTo somewhat fix it spam tons of random nonsense tokens into the negativeIt is also somewhat locked to long prompts = good image
>>107022478because most are reddit tourists and dont belong on 4ch
>>107022574I think a lot of the initial tests disregarded the prompting instructions and tried to a-b test models with exactly the same prompt, getting terrible results.pony 7 still doesn't look great, but not as bad as the first mutagenic horrors people initally posted.
Why aren't jannies doing anything? This spam is actually really fucking disruptive.
>>107022582No it's the same faggot that griefs this thread daily, sora is now irrelevant and the api bullshit gets no traction so he's cycling through his autistic bullshit.I do think chroma is good for some memes if you can cope with the art constantly being inconsistent between seeds
>>107022551i think it might actually be chroma lol
Reporting you for trash content and furry pedophilia, and then blocking you.
>>107022607Is that the first model? So can we expect a distilled/smaller model in the future? I want to interact it like a LLM model with LM Studio. Kinda annoying trying to install billion different dependencies to generate image with python for every new model.
>>107022609You're too stupid for this conversation please go back to your containment thread, no other model besides chroma is unable to maintain the same consistent art style even with the aid of loras
>>107022566Does it fix the composition stiffness? or I was just being worked by the 1girl qwen (probably overtrained) lora, because it always gave me the similar photo over and over, for example I generated this selfie but no matter if I changed the prompt, If I had mirror selfie in the prompt I would get this exact type of image, the little image variation is what killed my interest for qwen
>>107022623shoot me a prompt, I'll see
>>107022623I see what you mean.I actually haven't tried specific styles with chroma, mainly just realism stuff.Shame that he also did this stupid BS.
>>107022642Now post 5 images with the same prompt within the same batch so I can laugh at you
>>107022642If you haven't noticed Chorma can't do artist tags despite having a dataset that includes various artist and styles, this is a conscious effort done by the creator and has historically lead to massive issues with models as we saw with SD3 being dumbed down and the latest victim being pony V7. It always seems to do massive damage to models because concepts and other things that are often put into separate tags or related to some tags become completely fucked and you get stuff like inconsistent styles or broken anatomy like SD3 not being able to produce an image of a woman lying in the grass because they decided to obfuscate anything close to a sex act.
ignore the anti chroma troll, notice how he never posts gens
>>107022665i'm anti-chroma, here's a gen
>>107022674Show me a model that does as well as consistently as chroma at as many styles and is not qwen which is style locked
>>107022674actually this survey cannot be reproduced
>>107022651>>107022659Idiots, we are talking about Qwen here, not Chroma. Chroma doesn't have image variations issues like Qwen does
>>107022683It's a known issue and one of the reason details like hands or feet look bad.
>>107022683It's a bot.
>>107022687Create a torrent, port forward the client port, and post the magnet link on the civarchive page too as another mirror
>>107022683there's a spambot bombarding the thread if you havent noticed, jannies are apparently asleep as I'm doing my due diligence.
how do i specify i want the female wearing sandals and not the male with XL?
>>107022691thats on the usual initial toy model size, seeing if it actually scales is what kills papers
>>107022675i'm not interested in your debate. use whatever model you like. i do not like chroma though.
>>107022695again, I tried this bullshit and the output was slopped as fuck and the background was grainy. 480p high, latent upscaled 1.5x to 720
>>107022698I'm a local fag but bored with muted videos. just paid for a month at a discount test the backlog of 1girl images i have on my output folder.
>>107022577>>107022700this responses have nothing to do with what im talking about. can a janny get rid of this bot already?
>>107022709And what are you gonna do different from the 6 gorrillian upscale models that already exist
damn so the dead internet theory is real and I've replying to bots this whole damn time
>>107022721I know I need to, but I'm lazy... What you got nigga?
>>107022721The schizo is trying everything in their power to kill /ldg/. It's actually funny.
>>107022730> you are fucking degenerate learn how comfyui works and treats workflows in particular first mr i know how's better
the Any Switch nodes in rgthree already support dynamically typed inputs and outputs, you're a fucking retard. You can dynamically type the output depending on the input you're connecting it to, likewise for inputs.kys nocoder retard
>>107022730it's baffling as this is the fastest moving thread on /g/, but I guess jannies are too busy erping in IRC to do anything about it.
>>107022747because it generalized to understand that the only purpoes for a digital anus is for it to be fucked, equating ti to a vagina
>>107022747polydactyl
>>107022775you are fucking degenerate learn how comfyui works and treats workflows in particular first mr i know how's better
>thinking you can out-spam 1girl
>>107022747it's only the fastest moving because of the schizo, but then again i guess that every fast moving thread on 4chan has schizos bloating them.
>>107022788I think it's great the average normie is so clueless, so the anti-AI mob calms down a bit while open source flourishes. Though Qwen definitely has a bit of yellow tint (Alibaba trains on synthetic data unfortunately).
>>107022790another retard that doesnt understand how python works.you can have outputs be dynamically typed (or having a dropdown for casting to your desired type), but everything is mathable together in python, makes absolutely no sense to have 09593421906234 nodes all to do casting and implementing math operations for each fucking type, there's absolutely no fucking need retard. Go learn python fucking monkey retard, kys
>>107022790it's actually always fastest moving, along with /aicg/ followed by /lmg/
Bros, I'm home.
>>107022805All this extra shit just for latent upscale and you're stuck with 1.5x, nothing else works or you get the mismatch error.
>>107022808Top right and bottom right are very much my shit.Hnnnng
>>107022808>38 CL>probably 4 sticks with 2 IMC cpu*vomits*
>>107022816don't be mean anon,
>>107022808you don't even need more than 64GB ram. you did with WAN2.1 because torch compile and such, but WAN2.2 doesn't need those optimizations. I haven't gone over 96GB ram usage in months
Hello, this is the Chroma anon from last thread that made the rank 64 lora.Please test it and report back. I would like to get feedback on what are the most glaring issues Chroma still has.https://huggingface.co/chromanon/goldenchroma
>>107022818it's a shame because for 'general' speed it's going to be a big downgrade to regular CL30 6000mhz ddr5 ram (which you can get in consumer platforms at 96gb currently, im not sure if there are any 128gb 2x kits capable of doing this).This particularly matters for LLMs, not so much videogen.
>>107022293>PixaiThis is a local general, why you contribute nothing but a closed service?
>>107022815wat>>107022816>>107022832I still have my 64gb 6000mhz lying around if I need it for something in particular.>>107022825You very much do, I can't run loras when doing 720p. And now I can keep doing other stuff while genning and not ooming.
Just imagine how pathetic his life is THO kek
>>107022829I think it increases gen times, i've noticed when i forgot to disable them, i had only genned 81 frames but i had the context on and set to 41 frame context size. And yeah they gen i'm doing is taking way too long, its been stuck at 33% for a while now... and it just jumped to 67% thank fuck, i thought it was having a fit. but yeah its effecting the gen time a lot, so only use the context window when you need it.
>>107022854how bad is your autism? they look pretty close
>>107022928scarlett johansson looking rough these days
>>107022947>(scarjo x margot x kbell)keen eye anon
>>107022857I have a feeling we are gonna be eating so good in the new year or Christmas coming.
>>107022976never got into wan and alike.so this does not work well on faces?it is clearly different when it gets animated
how do you know if your lora is shit due to the images you used, the captions, or the training settings themselves?
>>107023058kind of, yeah... Its actually meant for doing longer videos but it stop's oom's and it works very good. Its really underrated or just over looked desu because its a new beta node guess.
>>107023058Context Window is actually a frame batch node
107023073107023078wat
>>107023096Describe what you want
>>107023095I wish there was more feet sloppa...
>>107023096bot is shitting up the thread by reposting old posts
>posting through the pain>jannies wru?
>>107023102The example is a single prompt for everything, but I'm not sure if it will do well with a sequence of different events. It looks like it should be possible to prompt each segment differently, but it might get weird when it fuses them together.
>>107023104I got you anon just just tweaking to hit the right spot and i'm getting close to the sweet spot. this i think was only 4 steps using old wan2.1 lora rank 64 iv2 in high at 5.00 strength and old wan2.2 low in low at 1.0 strength. But I'm using unconventional sampler settings that some anons here think is retarded because it uses cfg but honestly the gen time is not all that much more time and you get high quality with prompt adherence so its balance I seek and not just the fastest speed.
might as well cool on a new bread anyway since all those posts will get wiped
>>107023150the car doesn't have any windows anon, pay more attention
>>107023150>since all those posts will get wipedyeah in two more weeks at this rate
most if not all flux loras work in chroma so try them.
>>107023171you don't understand when you install requirements.txt it will check and re download most probably.
>>107023172speed up loras are for losers
spooky door>>107023058Bad dataset is the usual reason for bad results. Use AdamW or something else that just works.
>>107023058What model architecture are we talking about here?
>>107023199can you go back to posting women?
>>107022829I was the guy you were talking to, I'll have to see how this compares to my own Chroma experiments
>>107022791Qwen is ridiculously 4o-maxxed specifically at 1024x1024 yeah. Not really at 1328x1328 or the other weird standard Qwen resolutions though
>literally change nothing about my workflow>regression into sparkly bullshitcmon dawg
>>107022691I've seen this kind of shit before, they seem to just randomly repeat recent comments that other people already made, nothing they say is ever original. Not really sure what the intent is by whoever is deploying them, maybe just trolling
>>107022829Better stick to one type of dataset if you want to improve anatomy with lora. I would remove cgi stuff and just stick to photos
>>107023309A lot of things are redundant old versions that won't be needed anymore or from nodes that were deleted after testing
>>107023267why would hair fly in the car?
Is the 5b model of wan2.2 useless, or am I doing something terribly wrong?
>>107023340why? Is Chroma against rules too? Did Flux krauts do this?
>>107023343There's several good loras for Chroma just civitai are cucks and take them down almost as immediately as they get uploaded.
>>107023340your question is useless
>>107023354Most of its community still thinks Flux is SOTA. And then you have all the Chinks who have successfully shilled Qwen.
Discord trannies having a melty
>>107023363I'm talking Wan btw
>>107023345>>107023354>>107023357I'm not comparing wan to anything, just asking if the 5b model is supposed to look like the will smith spaghetti gif because of it's size, or if I fucked up the setup
>>107023379I agree that Chroma has its issues but to be fair to Chroma that's not the correct resolution for 1.75 ratio in a 1024p model.
>>107023382Sdg will never be popular :[
Pathetic
>>107023404its not just here its everywhere in ai
>>107023405and i will make eat you shit you just posted within a few minutes once i adjust the cfg from 4 to 3.5 and load the q8 and up the resolution to full 720p then you can eat shit and die
>>107023451>Its pretty good once you figure it out how to use it>posts a gen with the long torso issue
>>107023490>letting your dog lick chocolate syrupfucking retarded kid
I will now proceed to post 4 more variations of my gen
>>107023497Kind of annoying how the lightx2v ruins videos with end frames. It distorts right at the ending but without the lora it works fine
>>107023503>>107023503
>>107023340it depends on subject (and arguably your expectations)sure enough, the bigger wan is noticeably more competent, so generally you want to try that if 5b doesn't work [well enough]