Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>106981016

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://civitai.com/models/1790792?modelVersionId=2298660
https://gumgum10.github.io/gumgum.github.io/
https://huggingface.co/neta-art/Neta-Lumina

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
kek, that VH:D gen is alright
>>106988377
yeah it's slopped and there are other minor flaws in places but it's generally coherent, like I said Qwen with no Lora will give you something fucking hilarious that's not even vaguely close to correct for the same prompt
>>106988462
so is Qwen by default, that's not really the point here lol
Blessed thread of frenship
>>106988433kino>>106988382figured i'd shift gears from my usual (huge breasts, wide hips:1.5) subjects for a little bit.>>106988479VERY cute. what did you prompt to get that little heart sticker on her cheek, or was that an accident?
>the chinese release an uncensored model>it's slopped and 80 FUCKING GB
>>106988458can i do anything with a GTX 750 TI w/ 4GB and CUDA 11.1?
>>106988523>hunyuan 2.1 doesn't exist
Only base model gens below this line (I'll allow a single lora but only if you trained it yourself)-------------------------------------------------------------------------------------
>>106988507
the heart on her cheek is not intentional. it just appears randomly because of the "heart shape hand" or "heart shape eye pupils" in the positive prompt section.
https://files.catbox.moe/zigbch.png
>>106988526You can make utter slop with outdated SD1.5 models like me!Can't tell ya what settings to use though...
new yume making SDXL look antiquated desu
>>106988581oh shit nigga, just use "heart hands". Never had this happen to me using that tag.never used heart pupils though, but it helps to simplify your tags and triple check that they're actually booru tags, going accidentally natural language has those sideeffects.https://files.catbox.moe/yezzoy.png
>>106988591i'm noob and have no idea what that means
>>106988526you can punch yourself in the balls
best facefix at the end of wf? is the regular facedetailer node enough?
>>106988622With a good enough model, prompt, and settings you need not a face detailer
>>106988615
what about cpu? i saw comfyui will only use 1 core, but i have 2 x 12-core cpus
>>106988523this was just a test. don't worry, they already have a better version. of course, for the api only. kek
>>106988612IDK ask Grok or something.This is the coolfag high roller's club!
>>106988526No, unfortunately. I think 1060 is bare minimum.
>>106988646>>106988657kthx
>>106988657
>>106988666
3060, ideally ti with i think 12gb of vram, would be bare minimum. i don't think the 2000 series even gets a single optimization.
>>106988680i have a 2080 ti just not installed. you saying that won't even work? i bought it to play with vgpu
Chroma vs Qwen If I want to put irl women in unpredictable situationsWhich is better?
>>106988526i think you'll be basically running on cpu. get a new card.
>>106988698
gemini'd it
>SageAttention 1.x and the Triton kernel in SageAttention 2 have been reported to work on RTX 20 series (Turing architecture) cards, typically using specific versions of the Triton dependency.
>>106988538don't remember, i did post post hot spring partial submerged gens of her but got 3 day vacations.>>106988603perfect
>>106988732thank you. i'll look into that
>>106988741more advice if you wanna give it a try, run an eyes only adetailer pass and in the prompt just do eye color + heart pupils, works pretty well. (though i think my setup just needs a few more steps.)
>>106988680
1060 is enough for SDXL based models. Slow, but doable, good results if you put in the effort.
>>106988706Qwen if you want prompt adherence and accurate detailsChroma if you want to convince yourself that nonsensical noisy artifacts are “analog realism”
>>106988706Qwen will do anything (minus nsfw) that you can think of. Only drawback is that there is literally no seed variety. The image you get is what you get. No alternatives.
https://wccftech.com/amd-officially-launches-radeon-ai-pro-r9700-at-1299/>32gb>1299 dollarswtf I love AMD now!
>>106988680
So you're saying this might not be a skill issue?
>3060ti 8gb VRAMlet
>tried every possible combination of samplers, schedulers, and refiners in SwarmUI
>it's always slop.
>>106988599
wtf am I doing wrong with neta yume? I'm getting acid trip artifacts from it
https://files.catbox.moe/kyg9n3.png
>>106988803I'm sorry bud, but it is a skill issue too. You can absolutely get by on 8gb for imagegen, especially if you can run sage attention.Hey everyone starts somewhere.
>>106988706
qwen sucks with the same scenes. I tolerated it before alibaba's betrayal. now I only use chroma. I always use wan, because I have no choice.
>>106988801And only 1/8 the speed of a 4060!
>>106988801>$700 less than the 5090>have to deal with the endless jank that is using AMD for AIwe're good
>>106988591
>>106988801you can be swimming in vram, if you don't have the compute, it won't be helpful
>>106988819
>sage attention.
intredasting... I will look into this. Here is a sloppy cyborg for your troubles.
>>106988834
Is this real?
>>106988309
Wait, that's without any lora? If so that's pretty cool, I don't remember any non anime nsfw model working out of the box like that for years now, if ever. I had no idea the model was able to do that. How big was it? And is the new 80b one also able to do that?
>>106988523>could not possibly have read my comment that started this conversation at the end of the last thread
>>106988706either for a base image, then qwen edit + qwen edit remove clothes lora if you want lewds.
>>106988819what model gives this style?
>>106988854A hunyuan 2.1 finetune is the future, not qwen. No need to unfuck it if it already has a grasp on fundamentals.
>>106988849Good luck bruddah.>>106988860look how you massacred my wife.. the flux chin..eh not bad. would still cum.>>106988863the metadata here >>106988603
>>106988876saved for future studies, thanks
>>106988801id rather buy 3x3090 for 72gb with more than double the vram speed
>>106988826the pro cards are in a much better place than the rdna cards, at least for LLMs. I'd be curious the hoops you'd need to jump through to get it working in comfy, I think rocm should take care of most of it. That said, compute on that thing is laughably low.
>>106988309I suspect this model may be better than Qwen due to its variety issue.Also, can those with access to 80B test the same prompt?
So jeet is already shilling hunyuan 2.1 after getting bored of neta, chroma, and qwen? What causes this delusion
>>106988897isnt it basically around a RTX 5070 compute wise
>>106988817
sampling / scheduling meta is euler cfg pp and beta with CFG set to somewhere between ~0.6 and 1 and 50 or so steps, but give me a bit and i can take a look at your workflow in full
>>106988902You haven't even tried the model.
>>106988817
two weird things i noticed:
- DPM++ 2S Ancestral Simple is weird, try Linear Quadratic for the scheduler
- you're combining an artist tag with `colored pencil (medium)` but you aren't escaping the round brackets properly with backslashes, you HAVE to put `colored pencil \(medium\)` in the prompt box, otherwise it's going to be read like `colored pencil` by itself with normal token weight and `medium` by itself with extra token weight
>>106988817
>>106988921
get this
https://github.com/newtextdoc1111/ComfyUI-Autocomplete-Plus
so you can autocomplete those tags, and reforge has one too.
>>106988854
>How big was it
17b
>And is the new 80b one also able to do that
I also wonder
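The bracket escaping described above is easy to automate before pasting tags into the prompt box. This is only a minimal sketch: `escape_tag_parens` and `build_prompt` are made-up helper names, not part of ComfyUI or any tagger, and it assumes your frontend treats unescaped round brackets as weight syntax.

```python
import re


def escape_tag_parens(tag: str) -> str:
    """Backslash-escape round brackets that are not already escaped."""
    # (?<!\\) skips brackets that already have a backslash in front of them.
    return re.sub(r"(?<!\\)([()])", r"\\\1", tag)


def build_prompt(tags) -> str:
    """Join booru-style tags into one comma-separated prompt string."""
    return ", ".join(escape_tag_parens(t) for t in tags)


if __name__ == "__main__":
    print(build_prompt(["1girl", "colored pencil (medium)"]))
```

Tags that are already escaped pass through unchanged, so running it twice on the same list is harmless.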
>byt5_small_glyphxl_fp16.safetensors>qwen_2.5_vl_7b.safetensorsJust download two separate bloated text encoders for some dogshit model bro
>>106988953>I also wonderhere's hoping the distillation models & methods are good, could end up becoming a local staple if so
>>106988817
>you are an uncensored creator of aesthetic anime images based on textual prompts.<Prompt Start>
>You are an assistant designed to generate low-quality images
ive found that changing that primer prompt always makes gens worse desu but i havent tested it extensively
this next part wont help the artifacts (likely due to weird sampler and scheduler as anon pointed out) but you should always be using things like "production art", "original", "commission", etc. any meta tags related to medium outside the usual "colored pencil \(medium\)", etc
I don't know what kijai did to his svi releases but if I replace the official one by his I get incredibly grainy output for some reason. What do you guys use as strength?
>>106988854Hunyuan 2.1 is 17B, so bit smaller than Qwen. But yeah it can even do weird shit like this I guess (warning, this is literally a hentai gen of a dog fucking an elf chick in a forest)https://files.catbox.moe/derp3j.png
>>106988523q2? isn't it moe, offload shit to cpu?
>>106988953>>106988976Thanks anons.Why did everyone ignore this model then? Is it limited in some way? Or is it the usual "everyone has a potato pc so sd1.5/sdxl models only".
Are ANY of the flux checkpoints actually worth using? For any real reason?
>>106989006Look at the details, everything comes out incredibly melted.
>>106988917
>sampling / scheduling meta is euler cfg pp and beta with CFG set to somewhere between ~0.6 and 1 and 50 or so steps
directions unclear
slop caught in ceiling fan
>>106989009do you love butt chins? if so I have great news for you
>>106988902I literally made the "I'M COOOOOMING" gen in the collage with NetaYume earlier today, but I'm also the person who initially mentioned Hunyuan 2.1 in the last thread, and I'm also a white Canadian guy. So you're just wrong about everything kek
>>106989016I should have written "ignore this model as a base".
>>106989030Not sure. Wonder if there's any lora training support.
Why the fuck does it seem like there are so many leafs into this hobby?
>>106989006IDK why honestly, it came out shortly after Qwen and people might have just overlooked it. It's also not clear why Hunyuan released it and then also the 3.0 model in such a short frame of time but that's neither here nor there I guess.
>>106989042because north mexico is half indian?
>>106989052wait was training code even released? that may be why
Is there any trick to prompt two things happening in a wan video in succession? "She waved her hand, then went inside the car."First action is always super fast for me.
>>106989006Because it’s shit that can’t even compete with 2024 APIs. Nobody cares to train this bloated crap, especially after the waste of money known as chroma
>>106989018i want muh text and better backgrounds so bad man.
>>106989094But it's completely uncensored, unlike the hyperslopped Qwen.
>>106989006Because it's bloated and looks like plastic, so we all assume it's not uncensored. Interesting to see that it's not fully censored, still, unless you can get a proper photoreal tune to do the same thing, this model doesn't hold a candle to Chroma. Looking at >>106988309Small details like her eyes also look a bit noisy for a model that's supposed to be the base. Not saying the fact that we're now dealing with a base model that can do porn isn't special, but this model would require further tuning to be useful, and who knows how well that holds up at that size.
>>106988854
>And is the new 80b one also able to do that
I don't remember any significant examples outside of the official showcases
80b is hard to run, it's basically an LLM more than an image model, so I don't think many people played around with it to test how good for nsfw it is
>>106989126
Also what happened to SRPO, does that work with this model? And would something like
https://github.com/ClownsharkBatwing/RES4LYF
help remove the blur and sloppiness? It's worth a try.
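For a rough sense of why 80b is hard to run, you can estimate the weights-only footprint from the parameter count. This is back-of-the-envelope arithmetic, not a measurement: the bytes-per-parameter table is an assumption (real quant formats carry overhead), and it ignores text encoders, VAE, activations, and any MoE offloading.

```python
# Assumed storage cost per parameter; real quantized formats add some overhead.
BYTES_PER_PARAM = {"fp32": 4.0, "fp16": 2.0, "fp8": 1.0, "q4": 0.5}


def weight_gb(params_billions: float, precision: str) -> float:
    """Weights-only size in decimal GB for a model with the given parameter count."""
    return params_billions * BYTES_PER_PARAM[precision]


if __name__ == "__main__":
    for name, size in (("17b", 17), ("80b", 80)):
        row = ", ".join(f"{p}: {weight_gb(size, p):g} GB" for p in BYTES_PER_PARAM)
        print(f"{name} -> {row}")
```

Under these assumptions a 17b model is about 34 GB at fp16, and an 80b model is about 80 GB even at 8 bits per weight, which lines up with the file size complained about earlier in the thread.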
>>106989101That doesn’t matter at all. Were you one of the retards saying SDXL could never do porn because it didnt contain the same 70~ or so nude art images in the dataset that sd1.5 did?
>>106989151It matters quite a bit. There's a difference in wall time between having to teach a model the concept of genitals from scratch, and having to alter existing knowledge of genitals.
genitalia doesn't sound very safe, I feel unsafe
>>106989164The amount of time spent on that is nothing compared to teaching it to actually be coherent. Just look at chroma, it understood nudity by epoch 7 yet 50 epochs later it still can’t handle fine anatomy properly. The most important part of a base model is fidelity and coherence, not being able to render a plastic boob.
>>106989170just imagine, pontificate in your mind's eye if you will.a PERFECT milfy pussy lined with the right amount of pubic hair, the SMELL of it. She's not showered in 16 hours, and she just ran a mile.oooh yeaaahh. It's unsafe alright, and that's the THRILL OF IT.
>>106989186You leave out the important part that chroma was essentially a finetune from scratch, due to the aggressive de-distillation that rendered a ton of the pre-existing knowledge useless. Not to mention the questionable training methods. The gen shared earlier by an anon showed Hunyuan can (with nitpicks) already generate coherent genitals. The issue of the absolutely destroyed fine details is something else entirely. Not to mention the lack of any training code, I'd be curious to see how easy/hard it'd be to de-melt the model
just imagine, plastic sdxl slop for 3 more years. a PERFECT 4ch vae 3b model. Not updated in 5 years.oooh yeah, it’s local alright, and that’s the STATE OF IT.
>>106989247why are you like this
just because one or two anons really love their 2.5D hypersloppa doesnt mean we all do
Replace the anime girl with Hatsune Miku.
>>106989279ask it to match the lighting
>>106989279or, remove the girl from the image (after her camo activates)
>>106989293you didn't leave her brap cloud
more hun 2.1 examples please
>>106989317just go back in the archive to when flux came out, it looks the same anyway
neta doesn't know who pepe is, but it is getting very capable/usable. I think v4 or v5 of this model could replace noob.
>>106989286Replace the anime girl with Hatsune Miku. keep the lighting and artstyle the same.I could prompt darker lighting but it still works
>>106989368looks basically the same as before, disappointing
>>106989101
It seems like those slopped AI porn images are what it has seen in the dataset. When you ask it to do something novel that you'd expect an uncensored base model to nail (e.g. pic rel, which Chroma does right away), it shits the bed.
Prompt was simple
>Amateur photograph, a cute Japanese alt emo woman standing, with short, dark hair from a low angle, extending her bare foot toward the camera. She wears a ribbed top and plaid skirt, holding a glass with amber liquid. Indoors, adding a delicate contrast to the edgy, artistic composition.
I really don't think this thing has seen real porn. Its dataset is pure synthetic slop. Remember that everything Chroma can do, it can generalize, and you can add things to it, like a proper base model.
LMAOOOO >>106989230
>>106988817You can make the prompt even better but this is a start. https://files.catbox.moe/u5dpt5.png>>106989352The author states his dataset includes both e621 and Danbooru but it doesn't really feel as kino pilled e621 wise as Noob desu.
>>106989098
>>106989148Clownshark samplers work for all flow-matching models basically (and to a large extent SDXL and SD 1.5 ones too)
>>106989412Wonder how the 80b monstrosity would perform on this
the man is holding a magazine with one hand, with the title "UNATCO". Below the title on the magazine is the text "how to spot an illuminati operative" and a man that looks exactly like him, wearing the same sunglasses, with the same expression.
>>106989421holy retardation
>>106989421>Steps to Reproduce>use itthe unbridled, schizophrenic rage. we've all been there, i've felt it inside. so fucking based and unhinged oh my god. this reads like someone's manifesto.>>106989425thank you for blessing my senses, off to yoink my shploinker now.
>>106989425>load up wan>slop bounce>slop twerk>eeup it's 1girl, jiggly time
>>106989456and it's glorious
>>106989245
Kohya added support for Hunyuan Image 2.1 recently:
https://github.com/kohya-ss/sd-scripts/tree/sd3
It also supports Lumina 2 arch models like NetaYume but you'll want this PR:
https://github.com/kohya-ss/sd-scripts/pull/2225
>bro chroma is great for nsfw>try it>it's all body horror slop
>>106989456eeeyup.
>>106989477you've learned the hard lesson, this general is only populated by paid shills, don't listen to their praise it's not genuine at all
>>106989473>Kohyathis trainer is the literal definition of intuitive
>>106989447the man in image1 is holding a magazine with image2 on the cover. keep the expression of the man in image1 the same.
>skill issue: the post
>>106989497>keep the expression of the man in image1 the same.its funny how qwen edit users cope with this prompt. cant change anything without severely fucking up the image
>>106989352`pepe the frog` is an actual Booru tag but there's only 263 entries at least on the main Danbooru site
>>106989507need a model trained on basedbooru
>>106989488idk if you're being sarcastic or not or what this is supposed to have meant quite frankly
>>106989506works on my pc>>106989507if you want infinite pepes just use qwen edit, any pepe image can be transformed into a new pepe of any type.
>>106989528whoops i meant unintuitive lol
>>106989529have his expression change without destroying the style
Ldg wouldn't be the same with the turbo autist who posts multiple dozens of gens of the same image with minor variations
>>106989529like so:the green cartoon frog is wearing a blue tshirt and red shorts, and is sitting at a computer with a white CRT monitor that says "LDG" on the back. On his bag is a bag of potato chips that says "SIPS".source is just a regular pepe
>>106989547and says "neat" after adding miku to the same three images over and over again like its something anons never seen before
>>106989555it even worked with a bad prompt (on his bag is a bag)the green cartoon frog is wearing a blue tshirt and red shorts, and is sitting at a computer with a white CRT monitor that says "LDG" on the back. On his desk is a bag of potato chips that says "SIPS".see? even better.
>>106989566the body was drawn too well. doesn't match the sovl of the face
>>106989571the pepe was just a headshot, if you want to be specific then prompt it (skinny, etc).
>>106989578>helo honey this is me come meet me
>>106989571He's unironically unable to understand what you mean per his reply.
>>106989578THIS IS NOT MY BEAUTIFUL WIFE
>>106989583>skinnythe sovl of the poor drawing not girth of his body
>>106989595well it's up to you to prompt style specifics to your preference. in any case, it can generate pepes, how they appear is up to you.
>>106989609>well it's up to you to prompt style specifics to your preference>implying it would actually listenkek
noobai and illustrious are still the ultimate local models
>>106989412Chroma can't even do the blowjob prompt in a way that looks realistic or with as much prompt adherence, she's sucking off the wrong dude:https://files.catbox.moe/6gnraw.pngMost of the NSFW in Chroma definitely is not porn photos, it's certainly almost all 2D and 3D content
>>106989620It's pretty impressive to make that model look that bad. Even asian footfag's doesn't look that shitty kek.
>>106989537
I think it's user friendly enough with the GUI:
https://github.com/bmaltais/kohya_ss
I can't think of a trainer that's moreso really
>>106989279>>106989368this is my fetish>>106989497this too>>106989412close second
>>106989542>jacket on shoulders kino
tell me about the schizo, why does he img2img?
>>106989749>>106989762not sure why these look like shit
>>106989529>>106989555I want to make pepes that look different
>>106989352sometimes it gets close, other times it does not
>>106989784meant to post picrel
>>106989630that's literally what Chroma looks like if you give it any remotely complex prompt for a lot of subjects, it oversaturates and gets increasingly less realistic.
>>106989317https://files.catbox.moe/3drgrs.pngi don't have time to mess with it but this is without a refiner, then throwing it through a .16 denoise with sdxl and then flux which obviously doesn't know tits. the coherence really isn't bad, and with a refiner it's probably kinda dope
If your 10b+ model still has to be refined with superior SDXL, why bother?
hmm better
>>106989583I like your examples. Here's>Photorealistic wide angle full-body shot of the standing subject from head to toe, from the left facing left, against a plain white wall background.
>>106989412>Amateur photograph, a cute Japanese alt emo woman standing, with short, dark hair from a low angle, extending her bare foot toward the camera. She wears a ribbed top and plaid skirt, holding a glass with amber liquid. Indoors, adding a delicate contrast to the edgy, artistic composition.why did you lie?
>>106989819F, picrel
>>106989795
Hunyuan 2.1 doesn't officially support 1 megapixel FYI, they straight up say on the huggingface page that 1 megapixel "might cause artifacts" and give a list of base resolutions to use for different aspect ratios. 1 megapixel might work for some stuff but you probably shouldn't have any expectations.
>>106989412On different seeds, there's different performance, but it's still slopped. >Amateur photograph, a Japanese idol woman, performing an advanced contortion pose at a bench in a barn. She is sitting on a surface with her legs bent backward and extended over her shoulders, so that her feet are positioned and touching over her head, displaying an impressive level of flexibility.>There's a rooftop rope attached to both of her ankles and duck tape on her mouth.>A long white towel is draped over her entire front for modesty. She has straight black hair with bangs.>>106989620Skill issue. Hunyuan doesn't follow my very basic prompts in a coherent manner, so it's slopped. Crazy how now the ceiling is at a model that we could only dream of having back in Dalle days (which is arguably still the only APIshit model that could do anything like it in terms of coherence prior to its censorship). Anyways, for a proper base model performance, look at what Chroma can do. Why should a base model do less? I would give it points if it could do it even while slopped. No idea about Qwen, it can probably pull this off even if it spits back the same image. Another point. Chroma can generalize and give me entirely different images in different settings. It also obeys my command if I say the girl has to be naked.
Hidream, flux, qwen, hunyuanIt’s all the same plastic garbage. Why even bother arguing over which is superior? Is local really so far behind that these are the only options?
>>106989815
>>106989836>spends all his days making wall of texts to defend a meme modelwhat kind of mental illness is this?
>>106989832this is what I meant:
>>106989836can you post a catbox of right without the towel? i want to make sexo with her
>>106989752the tag is so good, I need to use it more often
>>106989836>Hunyuan doesn't follow my very basic prompts in a coherent mannersounds like a skill issue
>>106989618>>106989806trvth nukes
>>106989836Chroma doesn't look like your gens do AT ALL unless you're using some kind of over the top Clownshark sampling workflow, I don't know why you're pretending like it does lmao
>>106989858gun and briefcase look good too nice work anon
>>106989865nah NetaYume is a great anime model that as far as I can tell is only gonna keep improving. Like I was saying earlier nothing else open source is remotely as comparable to NovelAI 4.5's overall capabilities
>>106989854good catch anon, thanks
this entire thread is just jeets defending their failbakes, claiming the “true model” accessible by “skilled prompters” is actually way better than what’s posted here. meanwhile not a single one even benches in the top 15 on any arena. localkeks are a unique breed of pathetic
dam. youtube is full of this crap.
>>106989821Try different seeds. It messes up. A proper base model wouldn't be this bad. At the end of the day, the fact they cut corners in training really shows. Tencent is a massive corporation. That's not their 100%. That's their failed bake that they're giving us.
*yawn*
>>106989887That face is a horrific blend of SDXL mixslop and fluxplastic
>>106989423whoa thanks btw that unfucked my gen
>>106989904Kek, this. Chroma is shit but at least it’s NSFW. The rest of this crap is so bad.
What makes anon seethe and mald so hard about Local Diffusion? I don't get it.
>>106989909>>106989913you are very very upset about a gen that is so much better than chroma and i haven't even touched the settings yet. the base output of this shits all over chroma dude, i'm sorry you're taking it personally. is it an easy model to get right out of the box? no. i've only done five gens and it's still configured. do they all look way better than chroma shit? YES.i probably won't use this model myself, but lets be real
>>106989913Chroma does the same if you look at it wrong. It literally was trained off of SD 1.5 hyperslop gens
So the conclusion is all local models are shit?
>trying out the new ditto style transfer model
>Works great and can transfer clips really quickly
>Want to run a whole video through it and restitch it all together when it's done
Are there any nodes for quick looping? Basically just want to hit run once and have the workflow select intervals of 74 frames at a time so I can create a full length video.
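If no loop node fits, the interval selection itself is simple enough to script and feed into the workflow from outside. A minimal sketch, assuming your video loader node has some kind of "skip first N frames" and "frame cap" inputs (the exact input names depend on the node pack, so none are named here); `frame_chunks` is a hypothetical helper.

```python
def frame_chunks(total_frames: int, chunk_size: int = 74):
    """Return (start_frame, frame_count) pairs that cover the whole video in order."""
    if total_frames < 0 or chunk_size <= 0:
        raise ValueError("total_frames must be >= 0 and chunk_size must be > 0")
    return [
        (start, min(chunk_size, total_frames - start))
        for start in range(0, total_frames, chunk_size)
    ]


if __name__ == "__main__":
    # Each pair maps to one queued run: skip `start` frames, then load `count` frames.
    for start, count in frame_chunks(200):
        print(f"skip={start} cap={count}")
```

The last chunk is shorter when the frame count is not a multiple of 74, so the stitching step should not assume equal lengths.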
>>106989984>ditto style transfer modellink?
>>106989908the only difference now is the slop is synthetic
>>106989984>try>time and effort required: >result: maybe>dont try>time and effort required:>result: all the free time in the world to do whatever you want
>>106990002>try>time and effort required:*>time and effort required: insane
>>106989793
I have yet to encounter a single thing that Chroma can't do. Whereas for Hunyuan I encountered one thing it does half-assed, and another it can't do at all.
>>106989972
You posted heavily noised Flux Schnell tier slop, thanks for proving my point.
Chroma can’t render a decent looking image
>>106990013>a single thing that Chroma can't do.Make a coherent picture without fucked up anatomy
>API shill upon seeing localchads enjoying what they have
>>106990013>I have yet to encounter a single thing that Chroma can't doDo a POV perspective of someone sitting beside the viewer on a couch. I can't get chroma to do it.
>>106989992https://editto.net/
anon must be tired after casting all this b8
>>106990026he hasn't had his right arm worked this hard since the fappening!
>>106990013You are a chad for trying to beat some sense into these skill-lets
Chroma is decent but any moderately complex prompt turns it into visual diarrhea
>>106990019
>nogen browns seething in the thread literally 24/7/365 since the very beginning about any and all local models being shit while i have 2000 gigakino images queued to generate with four different models as i eat, read papers, and watch the gen bar filling upprompts for this feel?
>>106990088The things I gen are too spicy to be posted here
>>106990088>1man, comfy, unbothered, not a slave to SaaS
>>106990112you forgot masterpiece :^)
But really are there any easy looping nodes that just return a bigger number? Seems so basic.
>slopjeet browns shitting up the thread with plastic 1girl slop literally 24/7/365 since the very release of sd1.4, waiting 70 seconds for their turboquanted qwenslop to generate the same generic image regardless of the seed all while i have 1000s of gigakino 4k seedream gens and 60 second studio-quality Sora 2 gensdon’t even need a prompt for this feel, GPT can generate one for me
>>106990120wow...... xo butiful
>>106990119
Open manager, search "loop", find one that works for you. Or vibecode your own.
>>106990121based and apipilled
how unsafe is it to use custom nodes in comfy?
>>106990172ComfyUI-Manager is already phoning home so if you have that installed you already have a virus.
>>106990172The only nodes you need are API nodes, which are included with ComfyUI linked in the OP
>>106988801
this is going to be a very good value AI card, but with the same chip as a 9070 XT, will it be slower than a 7900 XTX at image gen?
has anyone done a recent benchmark of their 9070? Say with Lumina or SD35m so you don't run into VRAM limits? I'd assume the 7900 XTX is faster, but rdna4 has improved AI support supposedly.
This pic took 30 seconds for 8 steps. btw if you're on AMD, USE THESE FLAGS:
>TORCH_ROCM_AOTRITON_ENABLE_EXPERIMENTAL=1 MIOPEN_FIND_MODE=FAST python main.py --use-pytorch-cross-attention --bf16-vae
it will unfuck your initial gen times on SDXL models by skipping a shitty compile phase
https://rocm.docs.amd.com/projects/MIOpen/en/develop/how-to/find-and-immediate.html#find-modes
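If you'd rather not retype those variables every launch, the same command can be assembled from a small launcher script. This is just a sketch of the one-liner above, using exactly the environment variables and flags from the post; `build_launch` is a made-up helper name, and it assumes you run it from the ComfyUI folder where `main.py` lives.

```python
import os

# Environment variables and flags copied from the post above.
ROCM_ENV = {
    "TORCH_ROCM_AOTRITON_ENABLE_EXPERIMENTAL": "1",
    "MIOPEN_FIND_MODE": "FAST",
}
COMFY_ARGS = ["--use-pytorch-cross-attention", "--bf16-vae"]


def build_launch(python_exe: str = "python", base_env=None):
    """Return (command, environment) for launching ComfyUI with the ROCm settings."""
    env = dict(os.environ if base_env is None else base_env)
    env.update(ROCM_ENV)
    cmd = [python_exe, "main.py", *COMFY_ARGS]
    return cmd, env


if __name__ == "__main__":
    cmd, _env = build_launch()
    # Print the equivalent shell line instead of launching anything.
    prefix = " ".join(f"{key}={value}" for key, value in ROCM_ENV.items())
    print(prefix, " ".join(cmd))
```

To actually start ComfyUI from the script, pass the pair to `subprocess.run(cmd, env=env)`.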
>>106990020>Do a POV perspective of someone sitting beside the viewer on a couch. I can't get chroma to do it.Close enough. Depends on what exactly you mean though.
>>106990181>ComfyUI-Manager is already phoning homesource?
>>106990223>Close enough. Depends on what exactly you mean though.The viewer's legs should be in frame, as if a camera was strapped to his head and someone is sitting beside them. Best I got with chroma was someone directly in front, none to the side. Seedream could do it... but it was very gacha.
I'm the guy who started the conversation about Hunyuan Image 2.1 and I literally never said it WASN'T slopped lol, I just thought it was interesting that it was capable of some complex NSFW concepts right out of the box.
This is Qwen on the same prompt with the Lora I mentioned earlier anyways:
https://files.catbox.moe/rbiirz.png
And this is Qwen on that prompt on the same seed without the Lora:
https://files.catbox.moe/lf4hap.png
Basically it's not really difficult to train Qwen on actual photos and get good results, I don't suspect it would be difficult to train Hunyuan Image 2.1 either since it's also not distilled, that was pretty much my original point, I don't think the way they look out of the box is that important.
>>106990237This was my first result before I optimized my prompt, I do think it is possible.Here's what that washttps://files.catbox.moe/nqxfbj.pngAnd what it is right nowhttps://files.catbox.moe/l9rnet.pngYou can modify depending on your needs, I'm sure it can do it.
>>106990272>I'm sure it can do it.I feel like a POV lora would be a better use of time considering the failures Chroma was netting me. I'll give it another shot though
>>106990272she looks underage
>>106990251Qwen is easily the best local model available. It learns NSFW concepts faster and more coherently in a day of training than chroma did in 6 months. An actual Qwen finetune would be insanely powerful
>>106990283chroma schizo is a notorious pedo
>>106990283kek
>>106990251 >Lora I mentioned earlier
link?
>>106990269nice
>>106990294>PEDO PARKER
>>106990226wireshark
>>106990296>when you have to stow away your fat girlfriends dildo because guests are coming over
>>106990345lmao
>>106990300I haven't released it yet, I may still add more images and re-train just to tighten it up a bit more. The current one can do two girls / one guy, two guys / one girl, or one girl / one guy blowjob stuff all pretty coherently though.
>>106990352would love to have it whenever it's ready
>>106990251If only qwen had seed variety...
>>106990281 Actually, I realize what you mean with the whole POV thing. Here's another attempt from a generic image: https://files.catbox.moe/6gz9rz.png It's possible, just depends on what you're going for. When in doubt, feed the image to gemini as that's what it was trained on.
>>106990290you can at least tell he really is one specific person because he only seems to gen "muh azn gymnast waifu" type stuff
>>106990409 That's wayyy closer, good shit. In my experiments I was aiming for something like this, but with the POV viewer's head turned towards whoever was seated there. Appreciate the experimentation you've done.
>>106989841
what's the best way to install Nvidia drivers on debian 13?
>>106990468sudo rm -rf /
https://civitai.com/models/1901521/pony-v7-base?dialog=commentThread&commentId=985535 Incompetent grifter won't even release his synthslop shitpile out of shame KWABEROONI
>>106990484thank you that cleared it up
>>106990524That was the right choice.
>>106990547got dam
>>106990547 >MFW my face gets eaten >MFW this thing ate my face >MFW I have no face
>>106990358yeah it'll be on Civit too
>>106989856Exact image/girl is not possible. Area where Qwen Edit could help. Due to changing one token, Chroma makes variation.https://files.catbox.moe/q8393g.pngNot the prettiest, could use a innie pussy LoRA.>>106990460Np anon, I'm sure it's possible with the right engineering
>>106990524he's clearly joking lmao, if you go on the pony discord he's been actively working on making lora support and stuff work properly
>>106990365my prompts are never short enough for that to matter too much personally
For the new 2.2 light loras, does it matter if you use it separately or does the quality degrade if you don't use the merge? I saw a comparison video and the quality was better with the merge, but he only had one seed going.
>>106990740
>>106990762the fukkin glasses. I kek'd
>>106988680this is complete BS if you're talking about SDXL lol. Any Nvidia card from a GTX 1660 Super or so upwards can do XL fine
everyone's done with genning now huh
>>106990795 cute! >>106990860 I'm just done with abi madness for now. still have to redo the filepaths and plugin unloading. weird edge cases keep popping up too. if anything it's a good time to gen for me at least. I need to blow off steam.
Got it now. Describing the guy's pant color in the original one I had seems to push it/help a lot with precisely what you're looking for. Then you could say he's relaxing or extending his legs on the floor or a small table or whatever. https://files.catbox.moe/eclebn.png
>>106990460>>106990909
Even Comfy himself is on the NetaYume train lol, I guess he gave it its own workflow template
>>106990909I've already gotten closer, though it's still pretty gacha. If anything this tells me the model is an extremely good base for loras
anyone try SVI with wan 2.2 to fix color shift? I know it's for 2.1 but still
>>106990947I think it’s decent, but it suffers the same issue as novelai where styles look very vectorized and simplified. I hope he continues to train it, he was using 8xb200s. I find a lot of artist styles just don’t work compared to noob so maybe it’s still undertrained
>>106990990comfy doesn't train anything
>>106990990It's also slightly worrisome that the additions he's made to the dataset dont have NLP captions, just tags
>>106990996Nsfw NLP is shit anyway
>>106991008It's not just NSFW, it's everything new on danbooru
>>106989412 >>106989436 Hunyuan 3
>>106990996im okay with yume just being noob with a 16ch vae desu, illustrious v2 had decent nlp support so i highly doubt whatever neta(yume)'s ends up being will be worse than that
>>106991048did pretty good
>>106991048significantly better, though given the size, i hope it would lmao. hope the distilled version of this model performs close to this
>>106991048Hunyuan 3 has the same issue base sdxl did: everything has a tendency to lean towards a dull brown/grey tint
>>106990641That Qwen LoRA is so kinosovl....
>>106989042because we are powerless to right this ship, we can only generate better things digitally
so is chroma v46 better than the latest? i've been seeing you guys post more often with that version
>>106989247 Why do we even have SDXL? How'd that happen?
>>106991144no, anons just memeing
>>106991144no, every chromadome has their own preferred version
>>106989230 >>106989421 audible kek
>>106991048
>>106989452>unbridled, schizophrenic rageKek I can't stop laughing at thisI've put exactly that in driver crash reportsPLAY THE FUCKING THING NIGGER REEEEI like to imagine Chang in Taipei gets a smile out of it
>>106991170too squishy
>>106990959 Nice. I remember Chroma being trained/tested with lots of POV pics like this back when lodestone had that training preview. >>106991048 Kek, Qwen tier output, though at 80B a Chroma style finetune that takes advantage of that many params would be insane.
>>106991181 cat >>106991191 box?
>>106991158Dear lord
Fresh>>106991205>>106991205>>106991205>>106991205>>106991205
>>106988526 You can try to run a compressed version of SD1.5. Go to civitai.com and in the model section choose SD 1.5. If it's good enough for you just to mess with locally, you can try ComfyUI. It will take 2.2gb vram for a 768x768 image with this command:
.\python_embeded\python.exe -s ComfyUI\main.py --windows-standalone-build --fp8_e5m2-unet
pause
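For a rough idea of why the fp8 flag helps on small cards, here is a back-of-the-envelope sketch of weight memory alone. It assumes the commonly quoted figure of about 860M parameters for the SD1.5 UNet and ignores activations, the VAE and the text encoder, so it will not reproduce the 2.2gb figure above; `weight_gib` is just an illustrative helper.

```python
# Assumption: the SD1.5 UNet has roughly 860 million parameters.
SD15_UNET_PARAMS = 860e6


def weight_gib(num_params, bytes_per_param):
    """Memory taken by the weights alone, in GiB."""
    return num_params * bytes_per_param / 2**30


fp16_gib = weight_gib(SD15_UNET_PARAMS, 2)  # 16-bit weights, 2 bytes each
fp8_gib = weight_gib(SD15_UNET_PARAMS, 1)   # 8-bit weights, 1 byte each

print(f"fp16 UNet weights: {fp16_gib:.2f} GiB")
print(f"fp8  UNet weights: {fp8_gib:.2f} GiB")
```

Under that assumption, storing the UNet in fp8 halves its weight footprint, from roughly 1.6 GiB to roughly 0.8 GiB; the rest of the VRAM use comes from everything this estimate leaves out.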
>>106990996 that's just for part of it in the most recent version though, it's probably not a big deal. Mixed NLP / tag captions are generally what you want for this kind of model anyways. >>106991062 that's not gonna happen lmao, it would take an enormously huge amount of degradation given the text encoder itself is far superior to CLIP
How do I create hyperslop?
I'm having an issue using the stickied thread: I've managed to get all the custom nodes to work but now i've got:
backend='inductor' raised:
ImportError: cannot import name 'triton_key' from 'triton.compiler.compiler'
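An import error like this is often a sign that the installed torch and triton builds don't match, though that is an assumption here and not something the post confirms. A minimal diagnostic sketch, run inside the same Python environment ComfyUI uses, would be to print the installed versions and check whether the name exists; the helper names are made up, and `triton-windows` is only listed on the assumption that a Windows build of triton may be installed under that package name.

```python
import importlib
from importlib import metadata


def installed_version(package):
    """Return the installed version string of a package, or None if absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def has_attribute(module_name, attribute):
    """True if the module imports cleanly and exposes the given name."""
    try:
        module = importlib.import_module(module_name)
    except Exception:
        # Missing or broken module counts as "not available".
        return False
    return hasattr(module, attribute)


def report():
    """Collect the facts that usually matter for this kind of error."""
    return {
        "torch": installed_version("torch"),
        "triton": installed_version("triton"),
        "triton-windows": installed_version("triton-windows"),
        "triton_key present": has_attribute("triton.compiler.compiler", "triton_key"),
    }


for name, value in report().items():
    print(f"{name}: {value}")
```

If `triton_key present` comes back `False` while torch is installed, posting the printed versions along with the GPU makes it much easier for others to say which package to change.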
>>106991732whut gpu
>>106991770 4070 super
>>106989680nice