Discussion of Free and Open Source Text-to-Image/Video Models and UIPrev: >>106442596https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GPAniStudio: https://github.com/FizzleDorf/AniStudio/tree/dev>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://tensor.arthttps://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/sd-scripts/tree/sd3https://github.com/derrian-distro/LoRA_Easy_Training_Scriptshttps://github.com/tdrussell/diffusion-pipe>WanXhttps://rentry.org/wan22ldgguidehttps://github.com/Wan-Videohttps://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y>Chromahttps://huggingface.co/lodestones/Chroma1-BaseTraining: https://rentry.org/mvu52t46>Illustrious1girl and Beyond: https://rentry.org/comfyui_guide_1girlTag Explorer: https://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbourshttps://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards>>>/aco/csdg>>>/b/degen>>>/b/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debo
https://civitai.com/models/827184?modelVersionId=2167369https://civitai.com/models/827184?modelVersionId=2167369https://civitai.com/models/827184?modelVersionId=2167369https://civitai.com/models/827184?modelVersionId=2167369WAI-NSFW V15 IS OUT!!
best method or workflow for achieving gen with 2 people wearing different outfits? its like pulling fucking nailsonly have a face detailer/upscaler in the workflow now and I can get very close to what I want but it's always getting some element wrong
Is it normal for your computer to shit itself and freeze completely when trying to use chroma on 12GB VRAM? I used various flux models on forge in the past and just had to wait a bit and could still browse the net and other stuff in the meantime, now I'm on comfy and every attempt to use chroma resulted in everything freezing completely and having to reboot
>>106447688I have a 4070S and I'm fine
>>106447661>>106447718
>>106447640uh oh poopy made a stinky
>>106447688The offloading should keep you from ever going OOM. How much ram do you have?
>>106447661safely file that under shit i dont care about
>>106447661oh wow, finally updated>miku as cover image>update on miku's birthday
>>106447661I don't think using an image with a lot of gibberish text was a smart promo move.
>>106447701Same >>10644772932GB
>>106447754it's SDXL
>>106447661what's the point of these updates if it's still based on an outdated model? it wont really improve anymore.
>>106447661>slopNEXT! When the new Noob will arrive?
>>106447754for text all you need to do is use qwen or kontext edit anyway now. as an anime model wai 14 was the best one (so far).
>>106447765>new NoobLAX sold out and works for an AI company. When we get GPT-4o at home (it HAS to be autoregressive), then he'll come back
>>106447756I actually have the same card too. Not sure what's causing it. Are you using the default workflow? What's your output res? Steps?
>>106447661v15-added data (roughly up to May 2025, mainly popular social games and some anime).PS:The new character data hasn’t been fully fixed yet. I’ll continue to improve it in the upcoming versions.-Data adjustment, trying to reduce the chance of watermarks appearingSo is nothing burger?
>>106447661slop>>106447416chroma flash can't do paintings. it has its uses though
>>106447784pretty much. it's still using XL 1.0, so minimal improvements.
>>106447771Default, 32 stets, 1024
>>106447768>LAX sold out and works for an AI company.why all talented people ends like that?
>>106447688try gguf chroma
stream is over mr fors
>>106447784>added data (roughly up to May 2025, mainly popular social games and some anime).is there a list of stuff he added?
>>106447805money
>>106447824>up to May 2025it can't gen Eri and Kanoe, it's over...
im....im... OOOOOMIIING
>>106447886KEK
>>106447886lol
Somebody mentioned the other day that there is some kind of add-on that can help change lighting while the thing is genning. I think it was for A111. What was it?
>>106447957IC light? That was more like a gimmick than anything useful.
>>106447957oh its bad, REAL BAD
>>106447977No I don't think it was IC light. The guy who mentioned it posted some examples. The same image with completely different lighting compositions and luminosity. Supposed to run while genning. It seemed really neat. I wish I paid more attention at the time.>>106447985But it's so bad it's good, right?
I could tell that was botched from the preview
first wai v15 test with miku>update comes on miku's birthday>last update was like, Mayso they waited on purpose.
>>106448085Thats what your mother said when she saw the ultrasounds.
>>106448041VectorscopeCC?
>>106447784It's sdxl. It's plateaud tech at this point.
the man is grabbed by two doctors in white lab coats and pulled into a room on the right, and the doctors close the door.in to the asylum he goes
>>106448085it's like something out of a Guy Ritchie movie, ramping the speed to get the character to where you want instead of cutting
>>106447802Hmm. Have you tried a Q8 quant?
>>106447802Try mine. Just adjust the steps to a more human level. And with the chroma cache use at least 30 steps for interval 1 and 50+ or more for interval 2 since it needs to iron out the artifacts. Also you can probably disable Fresca since this snakeoil seems useless now.
>>106447802Forgor.https://files.catbox.moe/wdbmqx.json
>>106448150the Zulu got him
>>106448166you know what's funny, I didn't even specify uganda men grab him. wan just knew.this time it's different
>>106448108>VectorscopeCChmm yeah, this looks promising. Guess I'll check that out. thanks, saves me a trip to the archives.
>>106448045I've said this before here, but I'm going to repeat it for those that have never seen it, that AI webms like these behave like dreams. One moment you can be sitting in your room, only to turn around and dive into the sea. I find that coincidence fascinating, as if these are the recorded dreams of global AI.
>>106448180games with realtime AI could be so trippy, you could have any type of scenario/environment made on the fly.
>>106448191but we all know what terms would be in the prompt at all times
the man runs very fast to the right, as a large group of ugandan men chase him down the hallway.HOW DID I GET JEETS WTF
>>106448191You should play AI minecraft if you want to see how shitty it would be.
>>106448180And you even get the same frustration you feel inside your dreams when you can't control what's happening or how things play out
>>106448203if you had a pool of prompts with interesting themes it could be decent. hand made environments are best but it'd be interesting to see AI levels that aren't just procedurally generated based on templates.
>>106448210>And you even get the same frustration you feel inside your dreams when you can't control what's happening or how things play outExactly. The coincidence is mind blowing with its accuracy.
>>106448216I think in the near future, the closest we will get is AI powered dialogue and AI logic that calls handmade or procedural assets. Generating the entire thing on the fly and keeping it consistent would be a huge undertaking.
>>106448196is that actually a generated image?
the man gets in a car and drives down the street in Uganda, as a large group of black men chase him.forsen playing pubg:
>>106448271yeah
Qwen BJ lora I was talking about yesterday came out pretty good, even does the two-girl-one-guy stuff with decent coherence. Both boxes are direct recreations using the Lora of porn pics that WEREN'T in the dataset:https://files.catbox.moe/uzdfgg.pnghttps://files.catbox.moe/s3e3mb.png
https://civitai.com/models/827184?modelVersionId=2167369wai v15 out, make some mikus
>>106448299Actually not bad at all, can the model be fully uncensored then? I have not been paying attention to qwen at all desu.
>>106448227I remember a thread some years ago where some anon mentioned that he worked at some facility which was contracted with the military iirc, and how they were researching and studying about these sort of stuff to see if they can use it to manipulate brain behavior and such, it somehow stuck with me as eerie and lately I even started to believe that maybe he was telling the truth, especially with how they have discovered that AI can cause psychosis. Who knows what else the gov is aware of and what plans they are pushing behind the scenes. I wish I took a screenshot back then.
>>106448327NTA, but if flux can be uncensored, then qwen can 100%.
>>106448298What sampler/scheduler are you using? Love how crispy it looks.
>>106448180I find text to be the most AI like in dreams. Sometimes, right before I wake up I realized I'm dreaming and can look at text in my dream and the letters and numbers always warp and shift just enough that it's always illegible. Very AI like.
>>106448346Probably much easier since it's not distilled to shit though more gpu intensive since it's bigger. pros and cons lol
>>106448327we waiting for a finetune
>>106448299very nice. did you make more loras, or an all-in-one lora?
>>106448355I wonder how much the size is really an issue when it comes to larger scale training. Like if it fits on the 80GB enterprise GPU, it fits.
>>106448298got a bit of weirdness around the hair but the detail is damn impressive. was totally fooled, thought it was a stock image on first view.daaaaayum, shit's real.
Still not gonna pollute my HD with chroma.
>>106448357I wonder how good Chroma is going to be when the eventual porn finetune hits it, I think one of the guys behind one of the popular SDXL finetunes is training one right now
>>106448360Logically it will train slower but should also learn faster cause of the better vae and no distillation, who knows until someone does it I suppose.
>>106448291man I've been to Uganda back in like 2006 with a charity, you have no idea how fucking hot it is in Africa lol. Also the Coca-Cola company has infiltrated the entire continent in a crazy way, EVERY FUCKING WHERE YOU GO they've got the glass bottle coke, even tiny village shops, I dunno how they even do it logistically lol.
>>106448299holy shit that cock goes right through that bitch's head
>>106448406Well all of the loras I've trained for it turned out really nice, clean and flexible. I imagine fine tuning will be just as productive.
the man rides a motorcycle off a ramp high into the air, on a sunny beach.
>>106448327It already does perfect booba out of the box, so you could definitely train in downstairs genitals I think. Whether there's enough people out there who will take the time to caption their datasets with like autistically perfect accuracy and be willing to do the kind of slow burn training that seems to work best (BJ lora was trained for 100 epochs at 1x repeat) is a different story though.Something that's probably good to know though, it seems that how the model handles different languages isn't directly related to the captions themselves, my Lora is only captioned in English, but doing a verbatim translation to Chinese of one of the prompts for the two pics I posted earlier produces basically the same results. So you just don't really have to worry about that at all as far as I can tell:https://files.catbox.moe/3zb6ix.png
>>106448448I wonder if Qwen VL is the one handling that. It's a really good text encoder.
>>106448359it's one Lora, just 150 pics, but with none of them ever having the same woman appear more than once, and all of them having very thorough NLP captions. Only about ~20 of the pics have two ladies as opposed to one lady but I guess that's enough to allow it to kinda work alongside the overall concept from all the other pics.
Anyone ever notice how notrainers look at trainers like gods? >Can you train X please>OMG AMAZING CAN YOU TRAIN Y?>Can u link youtube tutorial how to train?lmao it's just a bunch of pictures in a folder and a python scrip.
>>106448413IDK what you mean, are you talking about the black guy one lol? it's a pretty normal PP I'd say. Unless you just mean how her cheeks are kinda chipmunking a bit
the man rides a motorcycle off a ramp flying far away into the distance, onto another ramp 300 feet away.can be tricky getting them to go far
>>106448494no I mean how the second girl is sucking his cock at the back of the first girl's head
>>106448462yeah I guess there must be some kind of translation layer happening. Either that or the linguistic embeddings are just pre-linked in some kind of way internally.
>>106448482anything that isn't a touchscreen UI is like black magic to them
>>106448462This is exactly why chroma is switching to qwenvl2.5 because t5 is old and busted.
>>106448505oh I see what you mean, it does kind of look like that yeah. It's similar to what the "girl pushing other girls head" pics I had in the dataset actually look like though mostly.
>>106448514even Gemma as used in Lumina 2 has 8192 tokens of context vs T5's 512 lol, despite having less params
Does anyone have a collection of sample prompts for Wan2GP I can mess around with? I can't seem to get the AI to do anything that I want it to, or maybe what I want is too specific?I basically want it to make a video of someone bouncing on their feet back and forth, kinda like what boxers and mma fighters do at the beginning of a fight, kinda like in this gif
>>106448541what we basically need is openpose controlnets for video. turn an existing video into stick men. now turn the stick men back into a video.Pretty sure somebody must be working on this if it hasn't been done already. I saw somebody mentioning to he does keyframe animation in blender, can't remember where.
>>106448565even better than human made anime. because you can see the underlying 3d model directing the shapes.
>>106448588ppl like you are why it's a good thing /adt/ exists.
>>106448595/adt/ ??
>>106448601>>106439892
>>106448588Ehh I don't know has too much of a rotoscope feel which I don't hate per se but gives you uncanny feels once you get used to usual anime animation.
>>106448604Stop sending your trash to us
>>106448595This is what autism looks like.
>>106448045Made me kek at the end
Haven't been in the game for a while. What's the best model for animu shit? Also can illustrious do text or no?
Anyone been keeping an eye on >>>/t/1377945 ? I'm brand new and don't quite know how big a find that is or if those are models you can just get elsewhere.
I'm trying to make an HD version of the sprite on the right, but I can't get the bandana to be a "top of head" bandana; it always wants to go over the forehead, insteadSuggestions?
>>106448604I don't watch anime at all desu. I enjoy western art much more. I appreciate the technical aspects of art though. And AI.>>106448605Perhaps. And I guess one could make and argument against it. But those feels were bound to become more widespread with or without AI.
>>106448482well post the script then nigga
>>106448680It's already in the thread somewhere.
>>106448672Cris?
>>106448672inpaint the bandana part and prompt bandana, and use openpose maybe?
One more Qwen BJ example:https://files.catbox.moe/wv7q64.pngOverall it took to the photographic data a little better than I expected I think.
>Julien and debo defacing the OP againPathetic
>>106448703Damn that's pretty good
>>106448703>https://files.catbox.moe/wv7q64.pngLooks fantastic, extremely high quality. Do you mind sharing your dataset? Would love to train it on chroma to compare.
>>106448703fucking awesome
>>106448703The women that most men find attractive and the women I find attractive are very different and sometimes that surprises me.
the man with glasses is packing cardboard boxes at an Amazon warehouse. The Amazon logo is visible in the background. the camera pans out to show him packing the boxes.just like the game
>>106448514is he actually retraining on qwenvl2.5
>>106448684he's been bouncing between generals and always announces when he is frustrated with: 3d, game engines and AI. it's a periodic cycle but he never finishes any of his games I think it's close to 14 years he's been doing this with AI being the recent addition to the cycle
>>106448703Donot share your dataset with Chroma kekes. Let them sink
>>106448887Dumbledore doese it again
>>106448932Holy based and cold turkey pilled.
I don't think Flux knows what a PC98 is but it makes some nice gens when I ask for the style. Kind of Stardew Valley ish.
Does anyone watercool their GPU's?
>>106449028do you think I'm a rockerfella who's gonna overclock an already very strained piece of hardware, consuming hundreds more watts for practically no gains at all?
>>106449037you can watercool without needing to OC. my cpu is watercooled to keep it quiet and always under 40C
>>106448867It's on the docket after the radiance model.
>>106449053NTA, but I don't see how my GPU running at 60 degrees during inference vs 70 degrees during inference is going to matter at all.
>>106449084docket? i looked at that one reddit thread he posted but it's not on that one
how does hercules lad's gens look so good
>>106449028both my 5090 and 4090 have a waterblock. I like a quiet room.
>>106449103hey, not her. try abu instead
>>106449103he's doing twice the resolution. also what is this smoking auitism? this is fuckin retarded
>>106449099lower temps = lower fan noise + lease wear on the fans. also means temps in room are lower>>106449123what are the temps during 100% usage?
>>106449131>he's doing twice the resolutionwrong
>>1064491584x whatever
>>106449136my 4090 block is slightly uneven and die contact isn't correct, so temps similar to air (even with phase cool pad). 5090 block is around 50c during gaming, high 50s during constant 600w AI load, memory around the same.
>>106449136>lower temps on gpu and lower temps in the roomwhere does the heat go?
>>106449169the a/c, duh. jeez anon, you take your stupid pills instead of your smart pills today?
>Watercooling GPUsAI bros are not beating the water wasting allegations are they?
>>106449103he's not using a still from a VHS rip
after being around this stuff for years now, i'm only now just finally getting around to playing with stuff beyond just "basic prompting" and wildcards. wildcards were basically as spicy as i ever went lol. been messing around with controlnet and regional prompter all evening and anons you should have told me to try these things sooner. feels like more valuable tools in the kit.picrel ultra sloppa trigger warning. feels a bit more fun again tbqhwy. what other goodies come recommended beside those two?
>>106449099if you're not hitting Tjunc then you will never have a worry. dont pay mind to the autismos here.
>>106449192you typed it like cooling the gpu would result in cooler room
>>106449102Like most of Lode's behavior, this is all on the discord only at this point.
>>106449257very cool. looking forward to another sorta cool sorta bad model
I love that this thread has a smoking section now
>try to discuss AI on the Star Trek general>get screamed at about environmentalism and AI bad by anti AI ludditesLiterally the last place I expected that kind of attitude considering the contents of the show
>>106449375Crazy right? This is basically very very early holodeck stages, you’d think they would all be for it.
>>106449375A strange and weird dissonance lol. I think some people watch science fiction like it's a impossible magical world akin to tolkien stuff
>>106448887Animate this one.
>op hijacked by troonsguess i won't be posting then
>>106449375try /hor/ they were p chill when i use to post there. too many footfags tho last time i checked in.
>>106449472But you just did, too late anon you are now a troon too. I am sorry, I don't write the rules
so qwen image is the next great hope for anime ponos in vagoo?
What quant of Qwen should I use with a 3090?
>>106447661don't give a shit about 2d sloopa.
>>106449561if someone provides a generous $300k donation we might be able to get a 5 epoch 256x256 finetune for it
>>106449375kekshow them some startrek ai porn
boring unproductive thread today
>>106449585our chinese benefactors will surely throw us a bone
>>106449622you could ask LAX from NoobAI, but I think he's contributing to some 3.6b lumina project. I don't think they would do Qwen Image because of how slow it is.
>>106448357was there even one announced?would be nice to have a chroma like version of qwen, but it'll probably be even more expensive to train
>>106449630Neta is pretty slow compared to XL almost 3x as slow. If you compare it to qwen nunchaku don't think the difference is that much (I think, haven't used nunchaku yet)
>>106449375Ai is stupid goonfuel that's mostly vaporware propping up USD. Doesn't mean I won't enjoy jerking off to it.
>>106449620i wonder why
>>106449108lol'd
>>106449585I mean any finetune needs serious hardware (unless you are doing some worthless shit tune with 5k images lol), so once you have that kinda hardware it probably wouldn't matter.
>>106449375>environmentalism and AIThe smear campaign worked wonderfully well I see.
>>106449620Kinda feels like genning in general these days
What max resolutions are recommended for chroma hd? I get a lot of weird things with 1080p gens.
>>106449661the definition of 'serious hardware' is getting higher and higher. pony was done with a fraction of the compute of illustrious which was done with a fraction of the compute of chroma. and chroma had to cope by gutting params and image size yet still cost 6 figures to train. a qwen finetune would easily be over $500k
>>106449720That depends on how Qwen responds to training. It might be the case that it picks up the needed information long before the cost to train it ever surpasses Chroma.
>>106449725possibly, but who will rent the hardware to find out? that's the difference with the newer bigger models. they no longer train on consumer GPUs. a lot of early SDXL finetunes like kohakuXL (which served as the base to illustrious) were trained on only 2x 3090s. same with vpred, it was experimented with on consumer rigs first.even asking for $10 in cloud compute is enough of a barrier to deter people from trying, as one experiment quickly leads to another and the costs start to mount. the best bet, unironically, is to hope some generous guy appears with actual direct hardware access (no renting) and partners with a finetuning team.
>>106449739Load of ass stones will switch to Qwen within the month. I have no doubt.
what UIs you all use here? everyone use Comfy or is there anything else that is actually usable?
>>106449759I don’t do video or photorealism so forge classic and sdxl-based models just werks for me. I do have a comfy install with flux and chroma but eh, for what I want to gen it’s just simpler and faster to fire up ole forge classic and fire off my 1girls. I’m used to the extensions and things like that, if it ain’t broke….
>>106449375On the surface level, yeah, Star Trek is "the holodeck show," but on a slightly deeper level it's "the communist utopia show."Same as how on the surface level, AI is a tool that lets you make whatever you want, but on a deeper level it's an expensive product promoted by the world's most powerful tech companies so they can centralize wealth even more than it already is.So it's no surprise that a Star Trek fan who has a bit of a think beyond the surface level doesn't like it.
>>106449779oh nice, didn't know forge was still a thing.. i thought i read that guy gave up on it
random question i want to try and gen a picture at like bodypillow size, around 6000 x 18000 pixels, now obviously its best to start lower res and then upscale but it keeps spitting out dawg shit, is there like a max limit at the amount you can upscale because i did times 5
>>106449739Ehh I think someone will eventually turn up, tuning this properly will easily net the best anime/porn model in the market (I won't be surprised if the NAI guys jump on it, instead of the crap they are using now), and where there is money to be made there will be someone to make it..
>>106449782>Trillions invested to try and fire all workers and replace everybody with vibe coders and prompt monkeys At least I can generate all the deranged porn I want, thanks bezos??
>>106449758I hope he does, but does he even have the funds?
>>106449782fr though, what happens when the US tech bubble bursts and everyone realizes America is a country that manufactures nothing but dollars?
>>106449795>NAI guys jump on itI'm sure they would for anime, but zero chance they'd do the same for porn.
>>106449796This is good, but it would be better if it was an anthro of some kind.Also honest question, is the Chroma checkpoint on civit still not updated?
>>106448807lucky for you the actual dataset is wildly appearance agnostic (as it should be), like I said earlier there's absolutely no multiple appearances by any individual woman, and the range of ethnicity / age is quite wide
>>106448915who?
>>106448867He has advanced access to Stable Diffusion 6 in fact, we'll be there in 10 epochs Chroma Bros
>>106449831don't you want to make a buttchin lora?
>>106449820it was updated a few days ago
>>106449758lodestone is doing his own pixel-space thing and he seems really into it. i don't think he would touch qwen image (though he wants to use qwen as a text encoder)>>106449795well for sure, just like how illustrious and noobAI appeared after pony to save us from the lack of artist tags and awful prebaked style. but will it happen soon and will it be for qwen image? i dont think so.
>>106449839I've done a few tests on Flux Krea with an earlier version of the set, I'll probably do one with this version and release it on Civit same time I release Qwen one. I see no point in training on regular Flux Dev anymore though.
i will take any model with better prompt adherence with sdxl that generates images in 30 seconds or less
>>106449816It's hard to imagine a traditional bubble burst when the entire market is captured by corporatist policy. USD losing value means that realization is already happening, though.
>>106449851It's a shitpost. Somebody at some point got qwen imagegen and qwen the llm mixed up and decided it was funny.
You don't hate Chroma shills enough.
>>106449868this shit is so fucking stupid.
>>106449868this shit is so fucking rules
>>106449783It’s a bit confusing. There was forge which was a fork of like auto/vlad. Then that died so panchovix forked it and made reforge. Then that died, but what you’re thinking is that he recently came back, promised all kinds of shit then blue balled everyone by saying he can’t actually do what he promised kek. Then there’s also forge classic, the one I use, Which is a fork of the first forge not reforge. Still actively maintained but not super quickly or up to date, but the dev just in the last couple weeks finally added flux and wan support so there’s that.
>>106449949jesus christ.. been a while since i fucked around with any of this.. just got a new gfx card and still hate comfy but it looked like everything else was dead.. glad forge is still kickin
i don't remember sdxl dmd2 and the like being as bad as chroma flash is. what happened? you would think it would be MORE intelligent, not less
>>106449925why is the human naked and the rodent between his legs
>>106449983little hamstwhore caught in the act again
>>106449925oral insertion lorado it
this is your average ai user
cozy bvread
>>106449782Star Trek has been a thing because it has made a lot of money for Paramount and previous corporate owners.
>>106449862new sdxl trained community models being released on civitai seem to be good enough for me. chroma is too much of a mess to get working on reforge and wan2gp.
>>106450055I hate the use of "image", especially in auto-generated captions. Stuff like "The image depicts", "The image is of", "The image appears to be". What a fucking waste of tokens and almost certainly introduces some messy understanding and heavy-weight to those words for any model shitty enough to train on data like that.
>>106450079What do you use then?I don't use "image" but I can use "painting" or "photo" or "painting".
>>106450079VLM captions are what appear to be a necessary evil
>>106450055people that use llm's to generate their captions weird me out. like its the one thing in ai to be creative with, and they use ai for it. its the most soulless thing ive ever seen.
>>106450087my Gemini setup starts always with "a something", and the "something" can be a whole bunch of different things (which it's told to choose based on what makes sense, and it does that well), like "a photograph", "a painting", a "digital illustration", "a drawing", "a CGI-rendered image", and so on with a lot of granular variations.
>>106450116its better than hiring a team of jeets desu are you going to caption them?
>>106450135I meant using AI to auto-generate their prompts, not captions for lora training. Like some people don't even want to come up with prompts at all and just let ai do it
>>106450118>GeminiIs it even any better than Joycaption?
>>106450087For what? Captioning? Just remove the bullshit that doesn't actually describe the contents of the image>This image appears to be a painting of..becomes>A painting of..Same goes for prompting if you're actually using a local model. "Generate me an image of.." only serves as instructions for API LLMs to switch into image-generating/prompt-parsing mode, that isn't actually part of the final prompt.>>106450096They are, but too bad nobody knows how to quality check them and managed their shitty GPT-slopped ramblings.
>>106450116it's one thing if you could just describe in plain english what you actually want, but you're not writing english, you're writing machine gobbledygook with autistic weighting and syntax because if you were to write even a few sentences description it turns the result into nonsense >artificial intelligence>actually just extremely autistic media tagging software
>>106450150>but too bad nobody knows how to quality check themmanually checking even 100 images fills me with dread
>>106450148it's better but sfw only
>>106450150>GPT-slopped ramblingsridiculous purple prose is a known issue for almost all LLMs, so no surprise it carries over captioning
>>106450163It doesn't even have to be manual, you can stack LLMs to manage this shit as well. I'm talking about large-scale finetunes though. Same with how we completely lost artist tags thanks to VLM. It's possible to just re-concatenate them naturally using an LLM but nope, everything just becomes 'a digital painting'
okay maybe i do some 1girls
>>106450148yes, big time, it has spatial awareness and understanding of NSFW that makes JoyCaption look like a fucking joke in comparison, if jailbroken properly.
>>106450270woops KEK attached it and catboxed it simultaneously my bad lmaooohere's what that said for when mods presumably delete:"(this is a self reply) here is Butiful Azn Waifu gen with Qwen Lora anyways as example lol, it's unclear what race that guy would specifically have preferred blonde chick to be though, I'm just guessinghttps://files.catbox.moe/4d0fg2.png"
>>106450277>>106450270See u in 3 days fren
>>106450270God fucking damn it. I'm actually at work.
>>106450291mods ded? no delete?
so how come qwen is able to learn better anatomy through a fucking lora than chroma learned in 4 months? what went wrong?
>>106450270idiot, post this shit on /aco/,/b/, /gif/ or /trash/. delete your og posts before mods show up. 3 day vacations aren't fun :'(
>>106450331>Why does the model that the makers explicitly did NOT WANT PEOPLE TO TRAIN train worse than the model that the makers wanted people to train?
>>106450331Distillation, basically trying to teach spelling to a kid who was forced to forget the abc's
Wan really does make some beautiful compositions sometimes.
sucks that chroma spent some much money training the shittiest version (schnell) of the shittiest most anti-local model out there (flux). could've accomplished way more with 1/10 the epochs on Qwen
>>106450392You are 100% right about this but I know someone is going to try and dispute this.
https://www.reddit.com/r/SECourses/comments/1n57u0x/people_really_doesnt_have_any_idea_what_ai_can_do/I'm confused. Is furk implying this video is AI? Is it AI?
>9:00am in Turkey
>>106450431He's president now. Renamed the country to Furkey.
>>106450402the thing with furry bakers is experimentation comes before model quality. chroma was absolutely Frankensteined, starting with the de-distillation and lobotomization of schnell. then there is the merge training process, the training on images 1/4 the size of the target inference resolution, the experimental VLM NSFW captions, the mixing of e621 and danbooru dataset tags, and the final "HD" epochs which seem to have gone wrong. the goal was to make a local base model because Flux dev and sd3 had shitty api-shill licenses but now we have hidream and qwen which have good licenses so I don't see anyone ever using chroma as a finetune base.
>>106450490>I don't see anyone ever using chroma as a finetune base.I don't get why this simple fact seems to upset people so much.
any noteworthy qwen guides
>>106450515How about asking this instead?>Any noteworthy qwen gens?
The thread can be slow. You don't need bait.
>>106450624I'm gonna do it...I'm gonna take the bait!
>>106450277It's a Qwen Image Edit Lora, yeah?How come the skin isn't as plastic as most qwen image edit gens?
I thought my Tesla M40 24gb was trash and i couldn't find a use case, but today i tried the gpt-oss:20b model and it actually works quite fast on it and the quality of the responses is also great!
>>106450664bot
>>106450690
>>106450658It's a qwen image lora not edit, at least looking at the metadata
thread is die
>>106450754Hercules has lung cancer. He is die.
>>106448350res multistep + beta
>>106450718Still, even for Qwen that's weirdly detailed skin. It usually tends to suck ass at that.Huh.
>>106450796He mentioned it learned the photoreal from the image set he trained on.>>106448703
>>106449724based cat girl enjoying architect wizard
>use reforge >never 0000M>use comfy>0000MWhat causes this
>>106449868mixing smoking with old disney is good but that is a slop gen
FUCK you, machine. FUCK you.
>>106451125>I am programmedYes. LLMs are "programmed"
>>106450942>What causes thiscomfy
>>106450777Is there any written guide on samplers + schedulers to make for chroma? I've been wasting precious electricity that niggers could be using to play 2k on da ps5 generating countless images of bound sluts in latex covered in slime to no avail.
>>106451158this ticks so many of my fetish boxes but anon's gens are ass.You may need to try a mutli upscale (image gen ---> upscale ---> ksampler ---> upscale workflow) to get this stuff right. Some of the actually intelligent people here can probably help.
>>106451134>Hey do it anyway it's only for documentation bro>Oh, okThis shit is so stupid.
can anyone share a chroma 1hd catbox? would like to try it out
>>106451219Why would you need a catbox? The model is right here for you to try.https://huggingface.co/lodestones/Chroma
>>106451219Let me save you time and point you in Qwen's direction. It's the future.
>>106450332
>>106451225just so I can get a starting point, not exactly comfortable with noodle ui>>106451227Id like to try qwen too
>>106447898go back, worthless retard>>106448565fuck off, worthless retard
>>106451393damn you are one mad little idiot
>>106451096lol, shouldve prompted for beige liquid, that s too white
>>106451410>>106451196
>>106451409don't you have some dosghit thread to spam your putrid diarrhea no one cares about in?
>>106451158>>106451196teach me your slime ways, senpai
>>106451509>spamThere is schizo's favourite word again.I'm happy to keep correctly labeling you what you are: a mentally unstable moron.
>>106451436poor aqua, is she ok?
>>106451556No. Engaging in sexual activity strips her if divinity. Drowning in semen counts. She's dead
The term "AI artist" was a mistake, way too easy for everyone to attack. We should've just gone under the umbrella of "VFX artist" to begin with, it's going to completely replace hollywood CGI soon anyway.
>>106451601I was never in love with the term. Unfortunately we have a subject of people from a certain part of Asia that adore the and refuse to let it go.
>>106451568>15 days between top post and second top post>spamOh dear, your brainrot is even worse than I thought. You are quite literally the biggest idiot on this website!
Not quite right.
>>106451601art is also technology> we are here.history shows the way we'll go
>>106451568most people here have the decency to post their shitty throwaway gens only once
>>106451254How do you get this expression?
>>106451781>shitty throwaway gens only once>Miku Hatsune does some stupid action at a low resolution at way too low steps number 2435214
Invoke status?Chroma status?Anistudio status?Dragged and shot status?Panhovix status?
local is dead
>>106451873GoodBasedCrashing wrapperNot happened yetNo idea
>>106451935>>106451935>>106451935>>106451935
real bake>>106451942>>106451942>>106451942>>106451942