Discussion of Free and Open Source Text-to-Image/Video ModelsPrev: >>107040459https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/sd-scriptshttps://github.com/tdrussell/diffusion-pipe>WanXhttps://comfyanonymous.github.io/ComfyUI_examples/wan22/https://github.com/Wan-Video>Neta Yume (Lumina 2)https://civitai.com/models/1790792?modelVersionId=2298660https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQdhttps://gumgum10.github.io/gumgum.github.io/https://neta-lumina-style.tz03.xyz/https://huggingface.co/neta-art/Neta-Lumina>Chromahttps://huggingface.co/lodestones/Chroma1-BaseTraining: https://rentry.org/mvu52t46>Illustrious1girl and Beyond: https://rentry.org/comfyui_guide_1girlTag Explorer: https://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/b/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debo
Cookies
FIBO (new image gen model, json-based) is out:https://huggingface.co/spaces/briaai/FIBOhttps://huggingface.co/briaai/FIBOFrom my tests it's a bit less aesthetically slopped than Flux and Qwen, but can't do anything "copyrighted"
>>107045555>generate a beautiful representation of the local diffusion general thread from the 4chan image board
>>107045562Considering it is using an LLM under the hood, it looks incoherent garbage
Cursed thread of anti-comfy samefag
>>107045562Suspicious lack of cunny in this representation desu
>>107045599maybe but you aren't human if you don't hate on comfy
>>107045551This isn't true just because you say it is lmao, there's tons of nodes that do all sorts of different shit that I can just run stuff through as a batch in a very specific order. The only programs that can actually do the same shit I'm talking about are like e.g. ChaiNNer which literally also uses a node based system.
I HATE SPAGHETTI
I noticed some models (variations of noobai) will do expressive/creative pics without overcooking the prompt, but look like shit aesthetics wise, or a lot of malformed shit, and on the other side of the spectrum (illustrious variations) they do a very clean and polished artstyle, but come out with dead expressions and basic angles and such.Why is AI so dead bros?
>>107045555IT'SSHIT
stoopid frog poaster
>>107045665I love Apu posters so much bros
>>107045555the json thing seems like a good fit for danbooru style tags so you could hotswap parts of the composition etc, too bad the general tags are not labeled in any way aside from wiki groups
>>107045628so just ai stuff? Gimp can batch edit shit faster and easier than noodling some garbage together. this also sounds like custom nodes, comfy alone is barren when it comes to doing anything else
>>107045672run it locally, then?
>>107045720I have 16gb (4080) and am trying not to OOM but we'll see
>>107045533Imagine how much better the world would be today if the top left corner of pic related was accurate. Shit would get done.
>>107045555Wow it's great
>>107045732but how would they suck dick
>>107045746the deer should burst into flames upon handling
>>107045555Seems OK I guess, I don't think it's quite as detailed and crisp as Flux Krea though. The JSON thing seems a bit cumbersome also.
>>107045555Does it run locally in anything other then Diffusers? (Their Comfy node is an API node)
>>107045562>trained exclusively on long structred captions.Cum on.
>>107045672>>107045746Can you guys cut it out? Feeling pretty unsafe here...
>>107045794>thats not a failgenthank you for the encouragement.
Some rare seed just have faster motion
>>107045533Is it safe to use local diffusion on my local/only computer or could the power required for it fuck my computer up?
>>107045665Because it's a tool of interpolating between known images.
>>107045819>Their Comfy node is an API nodeit's over
>>107045750there's other holes.
>>107045200i did this before and it came out crap. i want to give it another shot. did you use screencaps from a blu-ray rip or old vhs caps?
A few threads back some anons were asking for AceStep 1.5 gens (from the current training run), there goes:https://vocaroo.com/16OrwGV5Wqi5https://vocaroo.com/1b2iSwhkZN3yI strongly believe it will match or even surpass Suno once they finish the later training stages
>>107045901They did the same thing with Bria 3.2 when the released the weights, provided local Diffusers code but a Comfy node that just called the API on their commercial website, not sure why
>>107045890have a good quality psu and nothing will blow up
>>107045899Sometimes I do an img2img with one of the creative pics I like to try and improve the style, and the following model completely kills the soul in it.For example illunoob 2.0 > diving illustrious
>>107045941>Buy etc 5090 for $2400>Sell my rtx 3080ti for $450>Use a "wellness" credit from work worth 2k t that I get to spend on anything that makes me happy (tax free too)>5090 is free>Get 2% cash back on the credit card purchase>Anon seethes because I'm not buying his SaaS bullshit
>>107045941> one of the creative pics Generated?
>>107046046amazing what someone can do when they're not an attention-seeking, tech illiterate trannoid
>>107045914I used blu-ray screencaps. The best possible quality for dataset, no matter what
>>107045899Source?
>all the piss and golden shower loras on civitai are goneit's over
>>107046032>>107046065Wtf are these two niggas talking about.>>107046046Yeah, lets put as example, "asanagi bukkake", noobai ones will tend to do more expressive and dynamic pics than illustrious, yet are messy and not so good artstyle wise as diving illustrious
>>107046085> yet are messy and not so good artstyle wiseSoul.
>>107046076the girl on the right isn't even looking at her own phone, this isn't close to reality at all.
what's with random comments from older threads being spammed here, is this one of them schizos I keep hearing about?
>>107046095Its souly but its chroma-esque in how theres always something entirely too fucked up, like hands, I'd wish to get "diving illustrious flat anime" levels of polish without sacrificing on creativity. I'd posts some examples here but blue board.
>"clean and polished" style = good I hate that I share this earth with you.
>>107046123Then kill yourself and stop sharing it, duh :P
>>107046109spambot that comes and goes
>>107046068thx, maybe that's where i messed up trying to do too much at once.
>>107046168Take your time with dataset. Gather around 50-60 good images first, crop them by hand to 1:1 and then caption whatever way you want. Start training and don't stop until it works. It's much better when you have success under your belt, it will demystify the whole thing
ive been making videos exclusively of 1girl or 2girl but more and more i am liking the idea of ugly fat bastard with girl or pathetic nerd with girl for the self insert
>>107046483>1girl>picture of ugly pathetic bastard nerd (You)>stitch them side by side>????>profit
>>107046314nigger just give up with this shit
It's not him most of it is his disabled attack dog. When he's actually active his bitching and moaning is more targeted and he seethes out another anon's name especially when he's drunk
>>107045890When I first got my particular 4090 I had no idea it could draw 700W or that Stable Diffusion (even back in the simpler 1.5 days) would suck as much power as it could out of the system during inference and it repeatedly tripped the breaker because my PSU wasn't up to snuff. Upgraded to a 1.3kW from an 850 and haven't had a problem since.So like >>107045928 said, just make sure you have a quality PSU.
>>107046498hmm
Why does Stable Video Infinity exist? Examples I've seen of people using it still have the stutter between clips. Zero consistency with motion throughout the video. Its far worse than SkyReel.
>>107046582Wait for the proper comfyui integration. The loras we have now only kinda work with kijai's wrappers
>>107046631why isn't it in yet? why is it always API nodes first priority for these fucks
>>107046653why do you care, you don't gen?
>>107046562visible square grid
>>107046709>elbows too pointy
>illustrious still king after more than a yearwhat happened to all the progress?
>>107046298Good tips
>>107046841all of the new models are huge and most people are still vramlets so the majority sticks to XL based models
>>107046841NetaYume is slower than SDXL but basically as close as you can get size-wise and speed wise while actually still having a worthwhile arch. A lot of it is due to the better text encoder and VAE too.
>>107046884No, dont act like theres an alternative, chroma IS GARBAGE FOR ANIME AND YOU KNOW IT, qwen I never tried but it cant be much bettert. 24gb VRAM still sticking to 6gb illustrious models
>>107046946This. There's little reason to use Illustrious anymore.
>>107047015Can it do proper artist styles like asanagi? Im gonna download it and if its worse than illunoob 3.0/diving illustrious im gonna shit on it till the end of days.
>>107046946I hope there's a chance of Netayume realistic finetunes, realistic XL models all kinda suck
>>107047046With 1.6k images on Danbooru I assume yes. But now I have a feeling it'll be too difficult a model for you to handle so please ignore my post and do not use it.
>>107046841because it has sovl. technological progress alone can't create models with sovl and can cause regressions too>>107045555this actually looks really good IF you are using it for corpo or government work where you need to abide by copyright laws. The JSON captions sound awesome if you're building an API around this model.
>>107047075>its not bad you just cant use itC'mon now. Anyway I'll give it a chance, its still downloading.
>>107047075>too difficult a model for you to handleI think "shit" is the word you're looking for. The model is shit.
>>107047046it can do asanagi. could do better though, this model needs more training.
Are there "long video" tweaks but for wan2.2?Everything released seems to be for wan2.1 and it's a bit annoying.
the man stands up and runs down a sunny beach on the left.2.2 MoE kijai lora high, 2.2 lightning low, 1 str each, new high lora seems to work well. shift is 8 (from 5)
I'm trying out an XL model(snakebite 2.1) and all outputs I get with the suggested settings look like flourescence microscopy, any idea what coudl cause that? Obviously al lother XL models and XL based ones I used so far worked.
>>107047101Many were/are filtered by Noob for this reason, especially in the beginning, even though now it's not controversial to say it's the local anime SOTA.Anyway, good luck and make sure to check out the links in OP for the Yume prompt book and what not.
>>107047067it would make more sense to just do a realistic tune of the Lumina 2.0 base model without the significant amount of anime-specific fine tune on top of it I think>>107047121some artists work better at different resolutions than others TBQH, it depends on how their work was originally uploaded. NetaYume works best at a bit higher than typical SDXL resolutions though overall, I like 968x1322, 1024x1536, and 1280x1536 for portrait at least. Generally it's coherent up to at least 1536x1536 for one-shot gens, sometimes higher.
>>107047159this image does not look better or worse than the last one you posted really. Also you can't just post coom here, you're gonna get three-dayed lol
New optimization seems broken.Can someone try to gen anything using : --fast pinned_memory with latest comfy nightly?
How the shit do I do video with neoforge?
>>107047110nah it's good, and not really hard to use, just leave the boilerplate prompts where they are in the pos and neg and don't forget to put `@` in front of your artist tags. Pretty simple.
oh brother. not another one
>>107047219what if its its just glue?
did I get banni
>>107047227Clearly you haven't read the documentation.
>>107047227how is she standing looking at viewer hands on own hips in one and lying on stomach in the other if it's the same prompt?
>>107046709Yeah, it's the VAE I use. While it enhances my LoRA that was trained with an EQ VAE, it "sharpens" the output just enough to pull that grid out of SRPO/other LoRAs. The guy suggests not using his EQ VAE for inference, but the color is much more vibrant and my LoRA holds up much better (even when the face is very small!) that I prefer it to the default even with the extra noise.
>>107047261hollero im gonna cook an actual nsfw comparison
>>107047267sfw*
>>107047267we all already know that both of these models can do NSFW properly when you prompt them in a not retarded way, we don't need a comparison lol
>asanagi, blonde, milf, frontal shot, white tank top, jeans, upper body in frame
>>107047289He's gunna do it anyway and then whine because his horrible prompt and settings don't look good. Such is the way of imagen.
>>107047315>>107047289please stop shilling this netayume garbage
>>107047312oof nice tits on the rightdownloading it right now
>>107047312read https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd
>>107047344not even the pics they have in that site are good lmao
>>107047312these are both plausibly asanagi TBQH, I dunno what your point here is really.if that was the actual prompt you're still not really prompting NetaYume quite right though. Just go look at the user-upload gallery on Civit for V3.5.>>107047327>won't say why or how exactly it's garbage
>>107047312damn yume obviously has far superior prompt comprehension in that comparison. you should listen to this guy >>107047344 though.
>>107047386>Just go look at the user-upload gallery on Civit for V3.5.It's all garbage, all of it. I think this model might just be useless.
>>107047386>>won't say why or how exactly it's garbagebecause when he types "1girl big boobs" into his shitmix it looks sooooooo good but when he does it in yume it suckss!!!!! >:(
>>107047344i wouldn't say EVERYTHING in this guide totally applies to NetaYume's most recent versions honestly but it's a good starting point. The model is more flexible prompt-wise then they imply I think, overall most important things IMO are just don't delete the Gemma prompt boilerplates, and do use either Res Multistep Linear Quadratic or DPM++ 2S Ancestral Linear Quadratic at around CFG 4.5 - 5.5.
https://huggingface.co/morphic/Wan2.2-frames-to-video
>>107047399the guy who actually posted the comparison literally posted images that could believably have been from the same model they were so aesthetically similar though, I don't really get it
>>107047413I agree though I use a different sampler and scheduler. Once you get the hang of it you can really do whatever you want promptwise just like other models.
>>107046946Still fucks up things such as hands though right? What's the point.
>>107047474Not as much with a good prompt. But to imply XL does good hands more often without detailers is funny.
>>107047496I mean, why should I stop using IL if the other one is just as bad? It's like picking up a fancy SDXL finetune.
>>107047515If you feel it's bad then you shouldn't use it, I don't care. But the only thing IL has on Yume is that it's been out for longer and thus more people have raped it. In all other respects, Yume is superior. Your same argument was made when IL first released and look at what everyone's using now.
>>107047424>the guy who actually posted the comparison literally posted images that could believably have been from the same model they were so aesthetically similar though, I don't really get itBro what? WHAT? That is absolutely, positively not what im seeing here
>>107047312>>107047560This comparison means nothing unless you use regular Illustrious 0.1 or even 2.0. You're comparing a mix to a bare finetune. It's not apples to apples.
>>107047590hmm, fair enough
>>107047560Also I hope you're using 3.5 and not 2 like your filename implies. Regardless, all Lumina models not just Yume don't look as good unless you use the initial boilerplate prompt because it's using an LLM as the text encoder and not T5 like XL.
>>107046578heheh
if this is still asanagi have you actually looked through his 79 or so pages of shit on Danbooru? His style isn't THAT consistent, these images all fall within stuff he's uploaded at various times to my eye
>>107047142Post catbox.
>>107047515>why should I stop using IL if the other one is just as bad? It's like picking up a fancy SDXL finetune.IL is just a fancy XL finetune fucking KEK
>>107047142based on the CivitAI page the person who made this model doesn't seem to understand that BigASP V2.5 did not train the text encoders AT ALL (whereas BigASP V2.0 did, a lot). Beyond that he's merged a flow-matching SDXL model with a not-flow-matching-one in a weird way so there could be lots of things wrong.
Did you ever get "what the fuck am I doing with my life" feeling while slopping?Happened to me today.I fear wasting my life on garbage.Like even when you can gen a genuinely good looking image no one gives too much shit since it is slop.No amount of wf/prompt optimization changes the underlying fact that it is a slot machine.And for coom gens, same principle why gooners are mocked applies. I keep getting a strong urge to delete everything AI related on my computer and do NNN.
>>107047678The point is that unless I see a tangible difference, swapping over my workflow feels like too much work. I don't want to swap unless the images I get from swapping are pristine quality, but that's not what's gonna happen. They will be equal, and maybe they might be better, but there's a speed cost. A good model should need as little supervision as possible from me, I shouldn't have to fix its mistakes.
>>107047708And yes sorry for the faggy blogpost but I needed that out.
Been trying all day to get comfyui to work on my >gaming laptoplmao takes 30s to render a shitty 480x720 clip after using an all in one work flow with no adjustable loras.Fml is there no way to generate good shit as fast as grok with my setup?
>>107047726>tangible difference I mean, even the fact that its vae is VASTLY superior to XL's... how is that not a tangible difference? Not even mentioning Gemma kicks T5's ass or the fact that it can do higher res outright. >swapping over my workflowI use the exact same workflow as I did for Illust. The only difference is the prompt, it was painless.
>>107047708no, I gooned my way into 6 figure pay and I hope you get the same opportunity at some point.
>>107047764I meant 30 MINUTES lol
>>107047764genning videos on a laptop is like tryna fuck an obese bitch with your micropenis; pointless
>>107047708do it
someone got tired of waiting for nunchuku devs to implement wan2.2https://github.com/Disty0/sdnq?tab=readme-ov-filehttps://huggingface.co/Disty0/Wan2.2-I2V-A14B-SDNQ-uint4-svd-r32
>>107047708its fun to mess around with high creativity settings and just spin the wheel for a fun goon, but if you're putting in excessive amounts of effort into it, you gotta rethink your life
ani and comfy are basically examples of when you win the jackpot
>>107047708>I fear wasting my life on garbage.no need to fear anon you are actually wasting your life on garbage.
>>107047764>as fast as grok with my setupthat's a hard no
Oh shit this guy is based kekhttps://huggingface.co/Disty0/Chroma1-HD-SDNQ-uint4-svd-r32Chroma bros we are so back!
I'll wait until FurkGOD weighs in on the issue.
>>107047838chroma cannot afford to be dumbed down any further
>>107047726NetaYume positive:`You are an assistant designed to generate anime images based on textual prompts. <Prompt Start> @asanagi, a cartoon of an overweight nerdy man wearing a fedora and sweating and shaking as he sits at a desk in his darkened bedroom and stares at his computer monitor with a deranged expression on his face. The computer monitor is facing away from the viewer and seen from behind. There is a pepe the frog poster on the wall behind the man. A speech bubble coming directly from the man's mouth reads "OH FUCK, I'M COOOMING!"`NetaYume negative:`You are an assistant designed to generate low-quality images based on textual prompts. <Prompt Start> worst quality, lowres, sketch, greyscale, monochrome`IlluNoob positive / negative: same as above but with Gemma boilerplate removed and no @ sign for the artist tagIlluNoob didn't even succeed at NOT making the image black and white here lol
>>107047838I don't get
>>107047795Maybe the gayming laptop guy has a chance after all
>>107047881near lossless 4bit quants for a bunch of modelshttps://huggingface.co/collections/Disty0/sdnq
>>107047795>>107047838These are not SVD (A4W4) quants though.It says SVD is implemented on the github but I am not seeing it among the two examples you posted.
>>107047838Someone now just needs to tell the guy about HD Flash, or maybe we could do it ourselves? https://github.com/vladmandic/sdnext/wiki/SDNQ-Quantization>>107047855Oh yes it can. HD Flash -> nunchaku for SDXL speeds.
>>107047903https://github.com/vladmandic/sdnext/wiki/SDNQ-Quantization
Here are some benchmarkshttps://github.com/vladmandic/sdnext/wiki/Quantization#benchmarksnot as fast as nunchuku but still a 2-3x speed up
>>107046946Last time I tried 3.5 it couldn't gen sub 500 pic dataset gachas characters
>>107047943like?
>>107047943>sub 500 picdid you also describe the characteristics or did you only prompt the name but i also wouldnt be surprised if some random gacha characters werent included in his dataset considering the way he describes it
>>107047838how is this any better than other chroma projects?
>>107047997read nigga read>>107047914>>107047930
>>107047943How reliably can you gen sub 500 characters on base il 0.1 tho
>>107047890>XL quants holy poorfag i couldnt imagine needing that
>>107046946It needs a lightning lora asap.
>>107047708>Did you ever get "what the fuck am I doing with my life" feeling while slopping?No, it's fun tinkering. Anon, there are people who enjoy making legos or solving random puzzles, I just consider what i'm doing as a hobby.>Like even when you can gen a genuinely good looking image no one gives too much shit since it is slop.Do you make images/videos for your hobby enjoyment or to get a dopamine hit of people congratulating you?>And for coom gens, same principle why gooners are mocked applies.Again, who gives a shit about "mocking" if you do something you enjoy.>do NNN.Ah yes, the latest protestant culture ramadan.
>>107047890>HunyuanImage3 still too bigreee!
>>107047874NovelAI 4.5 didn't do as well as I expected on this one. More accurate Pepe (but in the wrong place), that's about it
>>107048123novelai still uses shitty T5. They will prob switch to gemma / another VLM for 5, I assume that is why they only went from 4 to 4.5
>>107047621Stop coping, illustrious just works, netayume doesnt look good even when you throw a whole bible of prompts onto it, how exactly is the text encoder in SDXL worse when it just works the way its intended?
>>107047874If you wanted to do this type of non nsfw stuff you'd use chroma, not netayume or illustrious
>refresh civitai>ball punching lora and ball kicking lora for wan i2vcan someone explain why someone would want their balls punched or endure any kind of physical abuse targeted towards their balls in any capacity?
>>107047874Soul vs souless
>>107046841The two big models chroma and pony were trained with yolo retard methods based off of vibes instead of following the documentation. Neither model should have the issues that they are having but they both decided to obscure tokens and do random shit. Pony v7 is a fucking embarrassment and the horse fucker faggot scammed the retards that supported him
>>107048227The real cope is thinking T5 is as good as Gemma. >how exactly is the text encoder in SDXL worse when it just works the way its intended?So does Gemma. It seems as if you don't really understand the discussion.
>>107048289I agree with pony, you are wrong about chroma, chroma is great and we had no idea a better model that was not distilled that is qwen image was going to release after
https://huggingface.co/valiantcat/Qwen-Image-Edit-MeiTu
>>107048290You've been hammering down your point of netayume being better for nsfw/anime but its just not better, like not even close to illustrious.
>>107048296>can't stick to a single style because it's slopped after talking and following pony fag>lied about artist tags being present when he obfuscated them>fails at 2D due to consistencyChroma failed at doing what it set out to do >>107048306You're comparing a base model to a finetuned model with way more iterations of training, the only real flaw is the text encoder which should be better
newfag here. Where exactly do you start with turning images into videos with prompts like grok imagine? Is there like a popular model everyone likes to use now or something
>>107048306Cool comparison you posted to illustrate your point.
>>107048324cool furk
>>107048340kek'd
>>107048241does chroma now manage to pull off consistent-across-seeds anime styles?
>>107048328>can't stick to a single styleThat is the point of a 'base' model, if it preferred a style that would be a huge issue>lied about artist tags being present when he obfuscated themyou are thinking about pony, not chroma>fails at 2D due to consistencywha?
>>107048331what GPU do you have? if its not a 5090/4090/3090 you can forget about it.
>>107048367>this fucking retard againWe're not doing this, we already had this conversation. You already got told and you fucked off when asked to post 5 back to back images of the same seed and you shat your pants after another anon proved you wrong.
>>107048227no one is coping lmao, the only person other than me who actually did a comparison just posted several very simple side-by-side examples where both images looked like something the 1000+ upload artist in question could have actually done
>>107048379>You already got toldlol rewriting history did we? You could not show me a single better gen from any other model that is not a style finetune
>that reddit gen I hope he doesn't seriously believe that a model being able to generate a consistent artist style is the same as it being overtuned to whatever is the authors preference kek
>>107048332>post comparisons>noo not like that thats just better cause its fine tuned you see that model had more iterations and fine tunes and this and that-
>>107048241now THIS is coping kek>it's actually bad that the model has excellent prompt adherence, which could never be relevant for NSFW surely!
>>107048328Not this shit again>You're comparing a base model to -No one is going to finetune this shit, just like how no one finetuned chroma. Bonus failure points since this was supposed to be "great" at weebshit and it isn't.
>>107048399Dude theres not even a single pic in civitai where netayume looks better than illustrious, and when I did nsfw it was UGLY, do you not understand what that means? If I want to make meme shit then ill just get a 22gb chroma model and it will be infinitely better than netayume, but STILL INFERIOR FOR HENTAI TO ILLUSTRIOUS
>>107048394Which legit comparison proved that base illustrious is better? The rest of your post further illustrates that you don't understand the discussion if you are the same anon kek
>>107048394the only actual comparisons in this thread were by me (who did "i'm cooming" guy, and isn't the same anon you just replied to) and the anon who did booba lady earlier.
>>107048373How bad is it on a 3070...
>>107048289If it's so easy to deslop Flux or train an 8B model why don't you do it anon?
>>107048413>No one is going to finetune this shitSo you don't realize that Yume is already a finetune of Neta which is a finetune of Lumina? There's no reason for anon to take your post seriously.
>Faggot is recycling his old bit to derail the threadYou really need a new fucking hobby dude you recycle the same bait every fucking day
its just one guy attacking literally every new model, they are just trying to discourage new models period, ignore the troll
>>107048418are you the same guy who actually seemed to believe that his asanagi 1girls showed any kind of noteworthy difference at all? If so there's your problem, if not I don't really understand what you're talking about. If you give a "known good results to you on XYZ Illu model" NSFW prompt I'll try it on Neta though.
>>107048437Fact of the matter is, last model we got worth a damn for artist styles was SD 3.5 Medium, but it was a broken weight. Still, if community figured out way to work around it we'd be eating good.
>>107048435>8gb>slowOH IT'S BAD. better get a job boy
>>107048423107048433These two: >>107047560>>107047312Stop coping, I dont need to "understand" anything, you just kept saying "nyooo use netayume its better than illustrious" but then I ran a bunch of pics and it sucks.
>>107048435its not going to be fun for you.
>>107047708>I fear wasting my life on garbage.same it needs to be at least twice as fast so I have life left over
>>107047289>when you prompt them in a not retarded way>>107047315>He's gunna do it anyway and then whine because his horrible prompt and settings don't look good. Such is the way of imagen.>>107048477>Stop coping, I dont need to "understand" anything, you just kept saying "nyooo use netayume its better than illustrious" but then I ran a bunch of pics and it sucks.jej
>>107048477(again I'm not the same person you just replied to) as I said before you don't know what you're talking about if you really believe that ANY of the four images you posted are actually particularly "bad" or "good" outputs for a very simplistic `asanagi` tag positive prompt, kek
>>107048497migu :(
If you put in the work, you can make netayume look like a quantized version of low step illustrious pic, believe in your dreams and never give up!!
>>107048514>heh youre just not smart enough to use my modelYeah, I think ill stick to chroma for the retarded shit you do, and illustrious for quality anime, those seem to work just fine on my tiny brain
>>107048597STILL not the same guy as the guy you just replied to, you ARE dumb though if you literally refuse to pay any attention to the most basic recommendations on how to use a given model, even down to sampler choices.>>107048581I dare you to box this
Neta yume is not goodThere, I said it
I'm torn between linear quadratic and bong tangent desu
I wish your caretaker would pull the plug on your internet
>>1070486491schizo, trolling at reader, vramlet psyop
>>107048655for Yume? I think res2s bong tangent was actually even better for fingers and text than DPM++ 2S Ancestral Linear Quadratic when I tried it once. Slower still though. And both of those are slower (but better) than Res Multistep Linear Quadratic.
>>107048639>I dare you to box thisAnon, ifs from civitai, they all look like low step quants of better models
Who knew skill issues could manifest in such a way
>>107048743true, whoever made netayume has a severe skill issue
>>107048743Ran cannot post anything positive but he needs to snicker and criticize others.
>>107048639he scrolled down past all the good gens to the bottom of the civitai page to find it kek
>>107048303why are they comparing against fp8, that's not a fair comparison at all
>>107048804they're comparing prompt adherence not general quality
>>107048815the prompt adherence gets worse with worse quants though, that's the fucking problem
>>107048782>netayume>good gensBro? Where is it? This is like 2gb illustrious tier
>>107048671What do you keep your sampling shift at or do you even use it at all?
>>107048671>>107048835nta but normal sampling shift doesn't do anything to res/bongmath samplers, you have to use "ModelSamplingAdvancedResolution" node
>>107048829I dare you to box this one too
>>107048871I dare you to find one pic on civitai that doesnt suck
>>107048671>res2sres3m should be faster for an equivalent result
>>107045794tried a few more, it's coherent enough. I sincerely doubt it has any NSFW knowledge at all though.
use case for flux krea?
>>107048891Just use it
>>107045555lol
>>107048876I mean like every other similar troll you're never going to actually show a specific example of what YOU supposedly think a "good gen" is
>>107048828Nice. It's refreshing to see a Radiance gen that doesn't look like it's only half converged.
>>107048906>HiDream superior to flux devthat's how you know it's a mememark
>>107048870>"ModelSamplingAdvancedResolution" node> cannot access local variable 'sampling_base' where it is not associated with a value Dang it
cannot access local variable 'sampling_base' where it is not associated with a value
>>107048906Now compare them all to SDXL
>>107048303is it a finetune of the original QIE or the 2509 version?
>>107048891better prompt adherence than normal Flux, relatively better understanding of styles, MUCH better out of the box realism (in most cases it's kinda like what Asian Waifu Chroma Schizo claims Chroma is for gens that aren't NSFW enough to be disallowed on /ldg/, except actually and without negative prompts existing at all)
>>107048946can you say that again in English?
>>107048931the Full was better than Flux Dev but not in a way that really justified it since it was still extremely similar looking to Flux Dev
>>107048941what the hell, works out of the box for me
>>107048945Original.
New, I have a general question so apologies if this was already answered fifteen-thousand times before.I'm trying to generate Img2Vid stuff (furry slop lmao) and after days of fucking about with Comfy I came to this from a youtube video.https://files.catbox.moe/x5uk9r.jsonI think it might work but I keep getting torch out of VRAM errors trying to run it. I know the 4070ti and 64GB RAM I'm rocking isn't as powerful as a 4080 or 4090, but I figured 12gig of VRAM was enough if I did some scraping. Should I download a Q4 version of Wan, would that help? Anything else I could do to get this thing working?
have you guys seen this?https://huggingface.co/lightx2v/Autoencoders
>>107048907Right right, im the troll, its not you who is shilling netayume like you got paid for it, despite there being 0 instances where its good
>mentions ran for no reason at allLike I said he always reveals himself, he's upset that his general is dead much. He will always engage in circular time wasting arguments because his life has no meaning.
>>107048983Don't care
>>107048983you have to use this custom node to use ithttps://github.com/ModelTC/ComfyUI-LightVAE
>>107048983you won't see anywhere near good performance unless you have a H100.
>>107048995if you weren't an extremely generic, mediocre troll you would lead in with an actual "positive" baseline for comparison
>>107048974even after updating it still doesnt work and theres no related github issues lole fuck me dang it
>>107049027Stop arguing with him, he does this every fucking day. He moved from the api spam to do this. Read the rentry in OP for Christ sake. He literally did this not even a week ago and tried his chroma troll with the same exact images only to gear shift after anon called him out. He's a low functioning autist
>>107048983>encode speed>decode speedthe encode speed is when it's making the video and the decode is when it's converting the latent to pixels right?
>>107048983Wouldn't anything be fast with over 3TB/s of memory bandwidth?
>>107049027Diving illustrious flat paradigm shift > netayumeI have to go to sleep, tomorrow I'll shit on it a bit more, so you can cope about prompts and training some more, hope youre being paid well, shill.
>>107045555Introducing... the sloppifier!
>>107048471>A beautiful queen wearing an ornate silver crown with diamonds and fleur d'lis. The queen is crying with her head in her hand, side view, tears running down her cheeks. She is lit by a candle with a castle wall in the background. A hint of Gothic in an Art Nouveau drawing.SD3.5L.Remember what they took from you.
>>107049073>cute skunk in a wildflower field, whimsical style with small details and organic patterns, stylized, hand-drawn, vibrant, warm colors, #handcrafted #handmadeActual sovl not possible with any of these modern models. It's so rare/impossible to see.
>>107049073>artist styles >a hint of Gothic in an Art Nouveau drawing erm... anon thats not a specific artist
>>107048982this looks way too complicated, just use the default i2v model and add picrel node with offloading half the blocks to ram (blocks to swap value of 20), use two of them, one for high one for lowstart with that and see if it works, then add whatever lora you want
>>107049053You never posted a gen because you know you would get exposed disabo
>>107048982yea if you correctly use block swap/vram management and unload models at the right time it should presumably work?but i don't remember the details of a kj wrapper workflow, I use non-kj wan
Is telling anon you'll be back tomorrow supposed to be a diss
>>107049041share your wf, I can take a look
>>107049092He's too disabled to know how to troll and gets mad when called outIgnore him and read OP
>>107049050There is an infinite amount of newfags or just people who can't stop themselves from taking the bait.
>>107049080>Portrait, collage of black block letters on a white background, letters of different sizes draw a dream portrait, light and shadow, low angle, fog, monochrome, kinetic art, Victor Vasarely, hyper detailed>>107049081Not my gens, saw them in SD3 post that I searched for due to nostalgiahttps://wiki.monai.art/en/models/SD3_5_Large_and_MediumSo sad how far local has fallen. There used to be a time when local devs seriously cared about catching up with Midjourney, but they stopped caring.
>moved to the next troll>still unable to post his gensWe see your gens in /sdg/ it's obvious you don't have a pot to piss in
>>107049084nta but is this a new node and assuming I have to update something? I dont see it in my nodes list. Hopefully I can add more thand 300 frames with this
>>107049093https://files.catbox.moe/bn2v7i.json
>>107049008>LightVAE nodes depend on WanVideoWrapper for main model supportpass
>>107048983yes ive seen many asian women specifically in this thread even
>>107049127it's a custom node : https://github.com/orssorbit/ComfyUI-wanBlockswap
>>107047890>>107048045Hunyuan 3 at 4 bit is very bad, don't bother. It's not like it's easy to get good results even at 16, but at 4 it's a lot of low-detail mutants.
>localpajeets coping with speedslop to make their already outdated models look even worseKek!
>>107049008lEmaohttps://github.com/ModelTC/ComfyUI-LightVAE/issues/5
>>107049186Thanks, been looking for something like this. Not sure why comfy has implemented a native version by now, not a fan of kijai's wan.
>>107048982install multigpu nodeshttps://files.catbox.moe/p1n5b2.json
>>107049103SD 3.5 Medium could have been a good base but it was tricky to train, very tricky, so people got fed up quickly I think.
>>107045555you neglected to post this sick pic from their huggingface >>107048885for 8B i dont think that looks too bad? idrk
>>107045555>jsonwhy? what's wrong with plain text?
Stress test prompt on FIBO.>Detailed photograph RAW of seven smiling friends of different races that are at a nightclub concert with dim lighting that is shining on their faces, behind them is a crowd of people dancing while fighting with large swords, everyone is holding a sword in their left hand and an intricate beer glass with differently colored beer in the right hand. Far behind them above the DJ there is a sign which has "Minimum drinKing age 021!" written on it in stylized cursive letters.Not too bad.
>>107049217it works with native workflows
>>107049239damn this shit is slopped, and lmao at that
>>107049239
>>107049103Krea output
>>107049252
>>107047265jenny yum
>>107049263>>107049073
>>107049251it doesn't seem like it's trained on synthetic data at all from my tests DESU
>>107049239Qwen Image with the same prompt
>>107049263is this supposed to be bad? or are you a different anon. I think this one is cool TBQH
>>107049275
>>107048982>>107049084here a catbox of a very simple 2.2 i2v workflow based on the example one for you to start with, you can enhance it down the line with more stuff if you want : https://files.catbox.moe/on2e3y.json
>>107049284>>107049284>>107049284>>107049284
>>107049239>Not too bad.Compared to most other models, that is, but still not great
>>107049252>>107049273>>107048885Great, we have caught up to Dalle 3 with this model, except there's no nudes or copyrighted content in the dataset so it isn't Dalle 3 tier...
>>107049214based