[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106442596

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
AniStudio: https://github.com/FizzleDorf/AniStudio/tree/dev

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://rentry.org/wan22ldgguide
https://github.com/Wan-Video
https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
File: 1748567469641728.mp4 (1.51 MB, 640x640)
1.51 MB
1.51 MB MP4
>>
File: 00.jpg (687 KB, 758x1017)
687 KB
687 KB JPG
https://civitai.com/models/827184?modelVersionId=2167369
https://civitai.com/models/827184?modelVersionId=2167369
https://civitai.com/models/827184?modelVersionId=2167369
https://civitai.com/models/827184?modelVersionId=2167369

WAI-NSFW V15 IS OUT!!
>>
best method or workflow for achieving gen with 2 people wearing different outfits? its like pulling fucking nails
only have a face detailer/upscaler in the workflow now and I can get very close to what I want but it's always getting some element wrong
>>
File: 1753963890121730.jpg (162 KB, 1170x1072)
162 KB
162 KB JPG
Is it normal for your computer to shit itself and freeze completely when trying to use chroma on 12GB VRAM? I used various flux models on forge in the past and just had to wait a bit and could still browse the net and other stuff in the meantime, now I'm on comfy and every attempt to use chroma resulted in everything freezing completely and having to reboot
>>
>>106447688
I have a 4070S and I'm fine
>>
>>106447661
>>106447718
>>
>>106447640
uh oh poopy made a stinky
>>
>>106447688
The offloading should keep you from ever going OOM. How much ram do you have?
>>
>>106447661
safely file that under shit i dont care about
>>
>>106447661
oh wow, finally updated
>miku as cover image
>update on miku's birthday
>>
>>106447661
I don't think using an image with a lot of gibberish text was a smart promo move.
>>
>>106447701
Same
>>106447729
32GB
>>
>>106447754
it's SDXL
>>
>>106447661
what's the point of these updates if it's still based on an outdated model? it wont really improve anymore.
>>
>>106447661
>slop
NEXT! When the new Noob will arrive?
>>
>>106447754
for text all you need to do is use qwen or kontext edit anyway now. as an anime model wai 14 was the best one (so far).
>>
>>106447765
>new Noob
LAX sold out and works for an AI company. When we get GPT-4o at home (it HAS to be autoregressive), then he'll come back
>>
>>106447756
I actually have the same card too. Not sure what's causing it. Are you using the default workflow? What's your output res? Steps?
>>
>>106447661
v15
-added data (roughly up to May 2025, mainly popular social games and some anime).
PS:The new character data hasn’t been fully fixed yet. I’ll continue to improve it in the upcoming versions.
-Data adjustment, trying to reduce the chance of watermarks appearing

So is nothing burger?
>>
>>106447661
slop

>>106447416
chroma flash can't do paintings. it has its uses though
>>
>>106447784
pretty much. it's still using XL 1.0, so minimal improvements.
>>
>>106447771
Default, 32 stets, 1024
>>
>>106447768
>LAX sold out and works for an AI company.
why all talented people ends like that?
>>
>>106447688
try gguf chroma
>>
File: 1731858747381285.webm (760 KB, 640x640)
760 KB
760 KB WEBM
stream is over mr fors
>>
>>106447784
>added data (roughly up to May 2025, mainly popular social games and some anime).
is there a list of stuff he added?
>>
>>106447805
money
>>
>>106447824
>up to May 2025
it can't gen Eri and Kanoe, it's over...
>>
im....im... OOOOOMIIING
>>
>>106447886
KEK
>>
>>106447886
lol
>>
File: ComfyUI_00519_.png (1.97 MB, 896x1152)
1.97 MB
1.97 MB PNG
Somebody mentioned the other day that there is some kind of add-on that can help change lighting while the thing is genning. I think it was for A111. What was it?
>>
>>106447957
IC light? That was more like a gimmick than anything useful.
>>
>>106447957
oh its bad, REAL BAD
>>
File: ComfyUI_00518_.png (1.7 MB, 896x1152)
1.7 MB
1.7 MB PNG
>>106447977
No I don't think it was IC light. The guy who mentioned it posted some examples. The same image with completely different lighting compositions and luminosity. Supposed to run while genning. It seemed really neat. I wish I paid more attention at the time.
>>106447985
But it's so bad it's good, right?
>>
File: 1745836121044394.mp4 (1.07 MB, 640x640)
1.07 MB
1.07 MB MP4
>>
File: WanVid_00001.webm (1020 KB, 800x560)
1020 KB
1020 KB WEBM
I could tell that was botched from the preview
>>
File: 1732875074818249.png (2.59 MB, 1536x1536)
2.59 MB
2.59 MB PNG
first wai v15 test with miku
>update comes on miku's birthday
>last update was like, May
so they waited on purpose.
>>
>>106448085
Thats what your mother said when she saw the ultrasounds.
>>
>>106448041
VectorscopeCC?
>>
File: ComfyUI_00053_.mp4 (156 KB, 288x400)
156 KB
156 KB MP4
>>
File: WanVideo2_2_I2V_00264.webm (1.25 MB, 1248x720)
1.25 MB
1.25 MB WEBM
>>
>>106447784
It's sdxl. It's plateaud tech at this point.
>>
File: 1753511417965740.webm (776 KB, 640x640)
776 KB
776 KB WEBM
the man is grabbed by two doctors in white lab coats and pulled into a room on the right, and the doctors close the door.

in to the asylum he goes
>>
>>106448085
it's like something out of a Guy Ritchie movie, ramping the speed to get the character to where you want instead of cutting
>>
>>106447802
Hmm. Have you tried a Q8 quant?
>>
File: ComfyUI_temp_lpssx_00030_.png (1.1 MB, 1152x1152)
1.1 MB
1.1 MB PNG
>>106447802
Try mine. Just adjust the steps to a more human level. And with the chroma cache use at least 30 steps for interval 1 and 50+ or more for interval 2 since it needs to iron out the artifacts. Also you can probably disable Fresca since this snakeoil seems useless now.
>>
>>106447802
Forgor.
https://files.catbox.moe/wdbmqx.json
>>
>>106448150
the Zulu got him
>>
File: 1725359593446475.webm (789 KB, 640x640)
789 KB
789 KB WEBM
>>106448166
you know what's funny, I didn't even specify uganda men grab him. wan just knew.

this time it's different
>>
File: 02790-1532105264.png (1.09 MB, 896x1152)
1.09 MB
1.09 MB PNG
>>106448108
>VectorscopeCC
hmm yeah, this looks promising. Guess I'll check that out. thanks, saves me a trip to the archives.
>>
File: Happy Libra Merchant.jpg (294 KB, 1024x1024)
294 KB
294 KB JPG
>>106448045
I've said this before here, but I'm going to repeat it for those that have never seen it, that AI webms like these behave like dreams. One moment you can be sitting in your room, only to turn around and dive into the sea. I find that coincidence fascinating, as if these are the recorded dreams of global AI.
>>
>>106448180
games with realtime AI could be so trippy, you could have any type of scenario/environment made on the fly.
>>
File: chroma1HD_00050_.jpg (359 KB, 840x1408)
359 KB
359 KB JPG
>>
>>106448191
but we all know what terms would be in the prompt at all times
>>
File: 1726066669472309.mp4 (1005 KB, 640x640)
1005 KB
1005 KB MP4
the man runs very fast to the right, as a large group of ugandan men chase him down the hallway.

HOW DID I GET JEETS WTF
>>
>>106448191
You should play AI minecraft if you want to see how shitty it would be.
>>
>>106448180
And you even get the same frustration you feel inside your dreams when you can't control what's happening or how things play out
>>
>>106448203
if you had a pool of prompts with interesting themes it could be decent. hand made environments are best but it'd be interesting to see AI levels that aren't just procedurally generated based on templates.
>>
File: Beard slap.gif (492 KB, 355x255)
492 KB
492 KB GIF
>>106448210
>And you even get the same frustration you feel inside your dreams when you can't control what's happening or how things play out
Exactly. The coincidence is mind blowing with its accuracy.
>>
>>106448216
I think in the near future, the closest we will get is AI powered dialogue and AI logic that calls handmade or procedural assets. Generating the entire thing on the fly and keeping it consistent would be a huge undertaking.
>>
>>106448196
is that actually a generated image?
>>
File: 1744238951956774.webm (2.13 MB, 640x640)
2.13 MB
2.13 MB WEBM
the man gets in a car and drives down the street in Uganda, as a large group of black men chase him.

forsen playing pubg:
>>
File: chroma1HD_00054_.jpg (412 KB, 840x1408)
412 KB
412 KB JPG
>>106448271
yeah
>>
Qwen BJ lora I was talking about yesterday came out pretty good, even does the two-girl-one-guy stuff with decent coherence. Both boxes are direct recreations using the Lora of porn pics that WEREN'T in the dataset:
https://files.catbox.moe/uzdfgg.png
https://files.catbox.moe/s3e3mb.png
>>
https://civitai.com/models/827184?modelVersionId=2167369

wai v15 out, make some mikus
>>
>>106448299

Actually not bad at all, can the model be fully uncensored then? I have not been paying attention to qwen at all desu.
>>
>>106448227
I remember a thread some years ago where some anon mentioned that he worked at some facility which was contracted with the military iirc, and how they were researching and studying about these sort of stuff to see if they can use it to manipulate brain behavior and such, it somehow stuck with me as eerie and lately I even started to believe that maybe he was telling the truth, especially with how they have discovered that AI can cause psychosis. Who knows what else the gov is aware of and what plans they are pushing behind the scenes. I wish I took a screenshot back then.
>>
>>106448327
NTA, but if flux can be uncensored, then qwen can 100%.
>>
>>106448298

What sampler/scheduler are you using? Love how crispy it looks.
>>
>>106448180
I find text to be the most AI like in dreams. Sometimes, right before I wake up I realized I'm dreaming and can look at text in my dream and the letters and numbers always warp and shift just enough that it's always illegible. Very AI like.
>>
>>106448346
Probably much easier since it's not distilled to shit though more gpu intensive since it's bigger. pros and cons lol
>>
>>106448327
we waiting for a finetune
>>
>>106448299
very nice. did you make more loras, or an all-in-one lora?
>>
>>106448355
I wonder how much the size is really an issue when it comes to larger scale training. Like if it fits on the 80GB enterprise GPU, it fits.
>>
>>106448298
got a bit of weirdness around the hair but the detail is damn impressive. was totally fooled, thought it was a stock image on first view.
daaaaayum, shit's real.
>>
Still not gonna pollute my HD with chroma.
>>
File: chromamilf.jpg (882 KB, 832x1248)
882 KB
882 KB JPG
>>106448357

I wonder how good Chroma is going to be when the eventual porn finetune hits it, I think one of the guys behind one of the popular SDXL finetunes is training one right now
>>
>>106448360
Logically it will train slower but should also learn faster cause of the better vae and no distillation, who knows until someone does it I suppose.
>>
>>106448291
man I've been to Uganda back in like 2006 with a charity, you have no idea how fucking hot it is in Africa lol. Also the Coca-Cola company has infiltrated the entire continent in a crazy way, EVERY FUCKING WHERE YOU GO they've got the glass bottle coke, even tiny village shops, I dunno how they even do it logistically lol.
>>
>>106448299
holy shit that cock goes right through that bitch's head
>>
>>106448406
Well all of the loras I've trained for it turned out really nice, clean and flexible. I imagine fine tuning will be just as productive.
>>
File: WanVideo2_2_I2V_00266.webm (465 KB, 1248x720)
465 KB
465 KB WEBM
>>
File: 1749480318418691.mp4 (643 KB, 640x640)
643 KB
643 KB MP4
the man rides a motorcycle off a ramp high into the air, on a sunny beach.
>>
>>106448327
It already does perfect booba out of the box, so you could definitely train in downstairs genitals I think. Whether there's enough people out there who will take the time to caption their datasets with like autistically perfect accuracy and be willing to do the kind of slow burn training that seems to work best (BJ lora was trained for 100 epochs at 1x repeat) is a different story though.

Something that's probably good to know though, it seems that how the model handles different languages isn't directly related to the captions themselves, my Lora is only captioned in English, but doing a verbatim translation to Chinese of one of the prompts for the two pics I posted earlier produces basically the same results. So you just don't really have to worry about that at all as far as I can tell:
https://files.catbox.moe/3zb6ix.png
>>
>>106448448
I wonder if Qwen VL is the one handling that. It's a really good text encoder.
>>
>>106448359
it's one Lora, just 150 pics, but with none of them ever having the same woman appear more than once, and all of them having very thorough NLP captions. Only about ~20 of the pics have two ladies as opposed to one lady but I guess that's enough to allow it to kinda work alongside the overall concept from all the other pics.
>>
Anyone ever notice how notrainers look at trainers like gods?
>Can you train X please
>OMG AMAZING CAN YOU TRAIN Y?
>Can u link youtube tutorial how to train?

lmao it's just a bunch of pictures in a folder and a python scrip.
>>
>>106448413
IDK what you mean, are you talking about the black guy one lol? it's a pretty normal PP I'd say. Unless you just mean how her cheeks are kinda chipmunking a bit
>>
File: 1731047468445748.mp4 (898 KB, 640x640)
898 KB
898 KB MP4
the man rides a motorcycle off a ramp flying far away into the distance, onto another ramp 300 feet away.

can be tricky getting them to go far
>>
>>106448494
no I mean how the second girl is sucking his cock at the back of the first girl's head
>>
>>106448462
yeah I guess there must be some kind of translation layer happening. Either that or the linguistic embeddings are just pre-linked in some kind of way internally.
>>
>>106448482
anything that isn't a touchscreen UI is like black magic to them
>>
>>106448462
This is exactly why chroma is switching to qwenvl2.5 because t5 is old and busted.
>>
File: WanVideo2_2_I2V_00267.webm (2.02 MB, 720x1248)
2.02 MB
2.02 MB WEBM
>>
>>106448505
oh I see what you mean, it does kind of look like that yeah. It's similar to what the "girl pushing other girls head" pics I had in the dataset actually look like though mostly.
>>
>>106448514
even Gemma as used in Lumina 2 has 8192 tokens of context vs T5's 512 lol, despite having less params
>>
File: Kevin_89271_thumb.gif (18 KB, 83x109)
18 KB
18 KB GIF
Does anyone have a collection of sample prompts for Wan2GP I can mess around with? I can't seem to get the AI to do anything that I want it to, or maybe what I want is too specific?

I basically want it to make a video of someone bouncing on their feet back and forth, kinda like what boxers and mma fighters do at the beginning of a fight, kinda like in this gif
>>
File: Haruka_00117.webm (1.28 MB, 720x1072)
1.28 MB
1.28 MB WEBM
>>
>>106448541
what we basically need is openpose controlnets for video. turn an existing video into stick men. now turn the stick men back into a video.
Pretty sure somebody must be working on this if it hasn't been done already. I saw somebody mentioning to he does keyframe animation in blender, can't remember where.
>>
>>106448565
even better than human made anime. because you can see the underlying 3d model directing the shapes.
>>
>>106448588
ppl like you are why it's a good thing /adt/ exists.
>>
>>106448595
/adt/ ??
>>
>>106448601
>>106439892
>>
>>106448588
Ehh I don't know has too much of a rotoscope feel which I don't hate per se but gives you uncanny feels once you get used to usual anime animation.
>>
>>106448604
Stop sending your trash to us
>>
>>106448595
This is what autism looks like.
>>
>>106448045
Made me kek at the end
>>
Haven't been in the game for a while. What's the best model for animu shit? Also can illustrious do text or no?
>>
Anyone been keeping an eye on >>>/t/1377945 ? I'm brand new and don't quite know how big a find that is or if those are models you can just get elsewhere.
>>
>>
File: heyr.png (714 KB, 1452x1216)
714 KB
714 KB PNG
I'm trying to make an HD version of the sprite on the right, but I can't get the bandana to be a "top of head" bandana; it always wants to go over the forehead, instead
Suggestions?
>>
>>106448604
I don't watch anime at all desu. I enjoy western art much more. I appreciate the technical aspects of art though. And AI.
>>106448605
Perhaps. And I guess one could make and argument against it. But those feels were bound to become more widespread with or without AI.
>>
>>106448482
well post the script then nigga
>>
>>106448680
It's already in the thread somewhere.
>>
>>106448672
Cris?
>>
>>106448672
inpaint the bandana part and prompt bandana, and use openpose maybe?
>>
One more Qwen BJ example:
https://files.catbox.moe/wv7q64.png

Overall it took to the photographic data a little better than I expected I think.
>>
>Julien and debo defacing the OP again
Pathetic
>>
>>106448703
Damn that's pretty good
>>
>>106448703
>https://files.catbox.moe/wv7q64.png


Looks fantastic, extremely high quality. Do you mind sharing your dataset? Would love to train it on chroma to compare.
>>
>>106448703
fucking awesome
>>
>>106448703
The women that most men find attractive and the women I find attractive are very different and sometimes that surprises me.
>>
File: 1743881322280291.mp4 (1.23 MB, 640x640)
1.23 MB
1.23 MB MP4
the man with glasses is packing cardboard boxes at an Amazon warehouse. The Amazon logo is visible in the background. the camera pans out to show him packing the boxes.

just like the game
>>
File: choke.mp4 (2.42 MB, 720x1072)
2.42 MB
2.42 MB MP4
>>
>>106448514
is he actually retraining on qwenvl2.5
>>
File: WanVideo2_2_I2V_00270.webm (1.18 MB, 720x1248)
1.18 MB
1.18 MB WEBM
>>
File: 1737594875464183.mp4 (1.25 MB, 640x640)
1.25 MB
1.25 MB MP4
>>
>>106448684
he's been bouncing between generals and always announces when he is frustrated with: 3d, game engines and AI. it's a periodic cycle but he never finishes any of his games I think it's close to 14 years he's been doing this with AI being the recent addition to the cycle
>>
>>106448703
Donot share your dataset with Chroma kekes. Let them sink
>>
>>106448887
Dumbledore doese it again
>>
>>106448932
Holy based and cold turkey pilled.
>>
I don't think Flux knows what a PC98 is but it makes some nice gens when I ask for the style. Kind of Stardew Valley ish.
>>
Does anyone watercool their GPU's?
>>
>>106449028
do you think I'm a rockerfella who's gonna overclock an already very strained piece of hardware, consuming hundreds more watts for practically no gains at all?
>>
>>106449037
you can watercool without needing to OC. my cpu is watercooled to keep it quiet and always under 40C
>>
>>106448867
It's on the docket after the radiance model.
>>
>>106449053
NTA, but I don't see how my GPU running at 60 degrees during inference vs 70 degrees during inference is going to matter at all.
>>
>>106449084
docket? i looked at that one reddit thread he posted but it's not on that one
>>
File: WanVid_00002.webm (192 KB, 608x368)
192 KB
192 KB WEBM
how does hercules lad's gens look so good
>>
File: WanVideo2_1_T2V_00180.mp4 (1.59 MB, 1248x720)
1.59 MB
1.59 MB MP4
>>
>>106449028
both my 5090 and 4090 have a waterblock. I like a quiet room.
>>
>>106449103
hey, not her. try abu instead
>>
>>106449103
he's doing twice the resolution. also what is this smoking auitism? this is fuckin retarded
>>
>>106449099
lower temps = lower fan noise + lease wear on the fans. also means temps in room are lower

>>106449123
what are the temps during 100% usage?
>>
>>106449131
>he's doing twice the resolution
wrong
>>
>>106449158
4x whatever
>>
>>106449136
my 4090 block is slightly uneven and die contact isn't correct, so temps similar to air (even with phase cool pad). 5090 block is around 50c during gaming, high 50s during constant 600w AI load, memory around the same.
>>
>>106449136
>lower temps on gpu and lower temps in the room
where does the heat go?
>>
>>106449169
the a/c, duh. jeez anon, you take your stupid pills instead of your smart pills today?
>>
>Watercooling GPUs
AI bros are not beating the water wasting allegations are they?
>>
>>106449103
he's not using a still from a VHS rip
>>
after being around this stuff for years now, i'm only now just finally getting around to playing with stuff beyond just "basic prompting" and wildcards. wildcards were basically as spicy as i ever went lol. been messing around with controlnet and regional prompter all evening and anons you should have told me to try these things sooner. feels like more valuable tools in the kit.
picrel ultra sloppa trigger warning. feels a bit more fun again tbqhwy. what other goodies come recommended beside those two?
>>
>>106449099
if you're not hitting Tjunc then you will never have a worry. dont pay mind to the autismos here.
>>
>>106449192
you typed it like cooling the gpu would result in cooler room
>>
File: 00059-4033496557.png (1.67 MB, 1344x1024)
1.67 MB
1.67 MB PNG
>>
>>106449102
Like most of Lode's behavior, this is all on the discord only at this point.
>>
>>106449257
very cool. looking forward to another sorta cool sorta bad model
>>
File: smokeules.png (1017 KB, 683x1024)
1017 KB
1017 KB PNG
>>
File: 00477-3251588642.png (2 MB, 1280x1920)
2 MB
2 MB PNG
I love that this thread has a smoking section now
>>
File: 1728112475485322.png (509 KB, 632x910)
509 KB
509 KB PNG
>try to discuss AI on the Star Trek general
>get screamed at about environmentalism and AI bad by anti AI luddites
Literally the last place I expected that kind of attitude considering the contents of the show
>>
>>106449375
Crazy right? This is basically very very early holodeck stages, you’d think they would all be for it.
>>
>>106449375
A strange and weird dissonance lol. I think some people watch science fiction like it's a impossible magical world akin to tolkien stuff
>>
File: 1756691311225151.png (251 KB, 412x618)
251 KB
251 KB PNG
>>106448887
Animate this one.
>>
>op hijacked by troons
guess i won't be posting then
>>
File: Screenshot (21).png (932 KB, 1920x1080)
932 KB
932 KB PNG
>>106449375
try /hor/ they were p chill when i use to post there. too many footfags tho last time i checked in.
>>
>>106449472
But you just did, too late anon you are now a troon too. I am sorry, I don't write the rules
>>
so qwen image is the next great hope for anime ponos in vagoo?
>>
What quant of Qwen should I use with a 3090?
>>
File: 00186-819204502.jpg (164 KB, 1824x1248)
164 KB
164 KB JPG
>>106447661
don't give a shit about 2d sloopa.
>>
>>106449561
if someone provides a generous $300k donation we might be able to get a 5 epoch 256x256 finetune for it
>>
>>106449375
kek
show them some startrek ai porn
>>
boring unproductive thread today
>>
>>106449585
our chinese benefactors will surely throw us a bone
>>
>>106449622
you could ask LAX from NoobAI, but I think he's contributing to some 3.6b lumina project. I don't think they would do Qwen Image because of how slow it is.
>>
>>106448357
was there even one announced?
would be nice to have a chroma like version of qwen, but it'll probably be even more expensive to train
>>
File: ComfyUI_00118_.png (622 KB, 768x768)
622 KB
622 KB PNG
>>
>>106449630
Neta is pretty slow compared to XL almost 3x as slow. If you compare it to qwen nunchaku don't think the difference is that much (I think, haven't used nunchaku yet)
>>
>>106449375
Ai is stupid goonfuel that's mostly vaporware propping up USD. Doesn't mean I won't enjoy jerking off to it.
>>
>>106449620
i wonder why
>>
>>106449108
lol'd
>>
>>106449585
I mean any finetune needs serious hardware (unless you are doing some worthless shit tune with 5k images lol), so once you have that kinda hardware it probably wouldn't matter.
>>
>>106449375
>environmentalism and AI
The smear campaign worked wonderfully well I see.
>>
File: ComfyUI_08569_.jpg (2.29 MB, 2000x1498)
2.29 MB
2.29 MB JPG
>>
>>106449620
Kinda feels like genning in general these days
>>
What max resolutions are recommended for chroma hd? I get a lot of weird things with 1080p gens.
>>
>>106449661
the definition of 'serious hardware' is getting higher and higher. pony was done with a fraction of the compute of illustrious which was done with a fraction of the compute of chroma. and chroma had to cope by gutting params and image size yet still cost 6 figures to train. a qwen finetune would easily be over $500k
>>
File: WanVideo2_1_T2V_00182.mp4 (1.52 MB, 1248x720)
1.52 MB
1.52 MB MP4
>>
>>106449720
That depends on how Qwen responds to training. It might be the case that it picks up the needed information long before the cost to train it ever surpasses Chroma.
>>
>>106449725
possibly, but who will rent the hardware to find out? that's the difference with the newer bigger models. they no longer train on consumer GPUs. a lot of early SDXL finetunes like kohakuXL (which served as the base to illustrious) were trained on only 2x 3090s. same with vpred, it was experimented with on consumer rigs first.
even asking for $10 in cloud compute is enough of a barrier to deter people from trying, as one experiment quickly leads to another and the costs start to mount. the best bet, unironically, is to hope some generous guy appears with actual direct hardware access (no renting) and partners with a finetuning team.
>>
>>106449739
Load of ass stones will switch to Qwen within the month. I have no doubt.
>>
what UIs you all use here? everyone use Comfy or is there anything else that is actually usable?
>>
>>106449759
I don’t do video or photorealism so forge classic and sdxl-based models just werks for me. I do have a comfy install with flux and chroma but eh, for what I want to gen it’s just simpler and faster to fire up ole forge classic and fire off my 1girls. I’m used to the extensions and things like that, if it ain’t broke….
>>
>>106449375
On the surface level, yeah, Star Trek is "the holodeck show," but on a slightly deeper level it's "the communist utopia show."
Same as how on the surface level, AI is a tool that lets you make whatever you want, but on a deeper level it's an expensive product promoted by the world's most powerful tech companies so they can centralize wealth even more than it already is.
So it's no surprise that a Star Trek fan who has a bit of a think beyond the surface level doesn't like it.
>>
>>106449779
oh nice, didn't know forge was still a thing.. i thought i read that guy gave up on it
>>
random question i want to try and gen a picture at like bodypillow size, around 6000 x 18000 pixels, now obviously its best to start lower res and then upscale but it keeps spitting out dawg shit, is there like a max limit at the amount you can upscale because i did times 5
>>
>>106449739
Ehh I think someone will eventually turn up, tuning this properly will easily net the best anime/porn model in the market (I won't be surprised if the NAI guys jump on it, instead of the crap they are using now), and where there is money to be made there will be someone to make it..
>>
File: WanVideo2_1_T2V_00184.mp4 (3.63 MB, 1248x720)
3.63 MB
3.63 MB MP4
>>
>>106449782
>Trillions invested to try and fire all workers and replace everybody with vibe coders and prompt monkeys
At least I can generate all the deranged porn I want, thanks bezos??
>>
>>106449758
I hope he does, but does he even have the funds?
>>
>>106449782
fr though, what happens when the US tech bubble bursts and everyone realizes America is a country that manufactures nothing but dollars?
>>
>>106449795
>NAI guys jump on it
I'm sure they would for anime, but zero chance they'd do the same for porn.
>>
>>106449796
This is good, but it would be better if it was an anthro of some kind.

Also honest question, is the Chroma checkpoint on civit still not updated?
>>
>>106448807
lucky for you the actual dataset is wildly appearance agnostic (as it should be), like I said earlier there's absolutely no multiple appearances by any individual woman, and the range of ethnicity / age is quite wide
>>
>>106448915
who?
>>
>>106448867
He has advanced access to Stable Diffusion 6 in fact, we'll be there in 10 epochs Chroma Bros
>>
>>106449831
don't you want to make a buttchin lora?
>>
>>106449820
it was updated a few days ago
>>
>>106449758
lodestone is doing his own pixel-space thing and he seems really into it. i don't think he would touch qwen image (though he wants to use qwen as a text encoder)
>>106449795
well for sure, just like how illustrious and noobAI appeared after pony to save us from the lack of artist tags and awful prebaked style. but will it happen soon and will it be for qwen image? i dont think so.
>>
>>106449839
I've done a few tests on Flux Krea with an earlier version of the set, I'll probably do one with this version and release it on Civit same time I release Qwen one. I see no point in training on regular Flux Dev anymore though.
>>
i will take any model with better prompt adherence with sdxl that generates images in 30 seconds or less
>>
File: WanVideo2_2_I2V_00271.webm (317 KB, 1248x720)
317 KB
317 KB WEBM
>>
>>106449816
It's hard to imagine a traditional bubble burst when the entire market is captured by corporatist policy. USD losing value means that realization is already happening, though.
>>
>>106449851
It's a shitpost. Somebody at some point got qwen imagegen and qwen the llm mixed up and decided it was funny.
>>
You don't hate Chroma shills enough.
>>
>>106449868
this shit is so fucking stupid.
>>
File: 00002-2187239102.png (1.49 MB, 896x1152)
1.49 MB
1.49 MB PNG
>>
>>106449868
this shit is so fucking rules
>>
>>106449783
It’s a bit confusing. There was forge which was a fork of like auto/vlad. Then that died so panchovix forked it and made reforge. Then that died, but what you’re thinking is that he recently came back, promised all kinds of shit then blue balled everyone by saying he can’t actually do what he promised kek. Then there’s also forge classic, the one I use, Which is a fork of the first forge not reforge. Still actively maintained but not super quickly or up to date, but the dev just in the last couple weeks finally added flux and wan support so there’s that.
>>
>>106449949
jesus christ.. been a while since i fucked around with any of this.. just got a new gfx card and still hate comfy but it looked like everything else was dead.. glad forge is still kickin
>>
i don't remember sdxl dmd2 and the like being as bad as chroma flash is. what happened? you would think it would be MORE intelligent, not less
>>
>>106449925
why is the human naked and the rodent between his legs
>>
>>106449983
little hamstwhore caught in the act again
>>
File: 1740962259794891.gif (3.71 MB, 420x420)
3.71 MB
3.71 MB GIF
>>106449925
oral insertion lora
do it
>>
File: file.png (73 KB, 1115x482)
73 KB
73 KB PNG
this is your average ai user
>>
cozy bvread
>>
>>106449782
Star Trek has been a thing because it has made a lot of money for Paramount and previous corporate owners.
>>
>>106449862
new sdxl trained community models being released on civitai seem to be good enough for me. chroma is too much of a mess to get working on reforge and wan2gp.
>>
>>106450055
I hate the use of "image", especially in auto-generated captions. Stuff like "The image depicts", "The image is of", "The image appears to be". What a fucking waste of tokens and almost certainly introduces some messy understanding and heavy-weight to those words for any model shitty enough to train on data like that.
>>
>>106450079
What do you use then?
I don't use "image" but I can use "painting" or "photo" or "painting".
>>
>>106450079
VLM captions are what appear to be a necessary evil
>>
>>106450055
people that use llm's to generate their captions weird me out. like its the one thing in ai to be creative with, and they use ai for it. its the most soulless thing ive ever seen.
>>
>>106450087
my Gemini setup starts always with "a something", and the "something" can be a whole bunch of different things (which it's told to choose based on what makes sense, and it does that well), like "a photograph", "a painting", a "digital illustration", "a drawing", "a CGI-rendered image", and so on with a lot of granular variations.
>>
>>106450116
its better than hiring a team of jeets desu
are you going to caption them?
>>
>>106450135
I meant using AI to auto-generate their prompts, not captions for lora training. Like some people don't even want to come up with prompts at all and just let ai do it
>>
>>106450118
>Gemini
Is it even any better than Joycaption?
>>
>>106450087
For what? Captioning? Just remove the bullshit that doesn't actually describe the contents of the image
>This image appears to be a painting of..
becomes
>A painting of..
Same goes for prompting if you're actually using a local model. "Generate me an image of.." only serves as instructions for API LLMs to switch into image-generating/prompt-parsing mode, that isn't actually part of the final prompt.
>>106450096
They are, but too bad nobody knows how to quality check them and managed their shitty GPT-slopped ramblings.
>>
>>106450116
it's one thing if you could just describe in plain english what you actually want, but you're not writing english, you're writing machine gobbledygook with autistic weighting and syntax because if you were to write even a few sentences description it turns the result into nonsense
>artificial intelligence
>actually just extremely autistic media tagging software
>>
>>106450150
>but too bad nobody knows how to quality check them
manually checking even 100 images fills me with dread
>>
>>106450148
it's better but sfw only
>>
>>106450150
>GPT-slopped ramblings
ridiculous purple prose is a known issue for almost all LLMs, so no surprise it carries over captioning
>>
>>106450163
It doesn't even have to be manual, you can stack LLMs to manage this shit as well. I'm talking about large-scale finetunes though. Same with how we completely lost artist tags thanks to VLM. It's possible to just re-concatenate them naturally using an LLM but nope, everything just becomes 'a digital painting'
>>
okay maybe i do some 1girls
>>
>>106450148
yes, big time, it has spatial awareness and understanding of NSFW that makes JoyCaption look like a fucking joke in comparison, if jailbroken properly.
>>
>>106450270
woops KEK attached it and catboxed it simultaneously my bad lmaooo

here's what that said for when mods presumably delete:

"(this is a self reply) here is Butiful Azn Waifu gen with Qwen Lora anyways as example lol, it's unclear what race that guy would specifically have preferred blonde chick to be though, I'm just guessing

https://files.catbox.moe/4d0fg2.png"
>>
>>106450277
>>106450270
See u in 3 days fren
>>
>>106450270
God fucking damn it. I'm actually at work.
>>
>>106450291
mods ded? no delete?
>>
so how come qwen is able to learn better anatomy through a fucking lora than chroma learned in 4 months? what went wrong?
>>
File: 00029-3035661566.jpg (152 KB, 1824x1248)
152 KB
152 KB JPG
>>106450270
idiot, post this shit on /aco/,/b/, /gif/ or /trash/. delete your og posts before mods show up. 3 day vacations aren't fun :'(
>>
File: 1731327496494646.png (1.64 MB, 1416x2120)
1.64 MB
1.64 MB PNG
>>
>>106450331
>Why does the model that the makers explicitly did NOT WANT PEOPLE TO TRAIN train worse than the model that the makers wanted people to train?
>>
>>106450331
Distillation, basically trying to teach spelling to a kid who was forced to forget the abc's
>>
File: WanVideo2_1_T2V_00188.mp4 (1.69 MB, 1248x720)
1.69 MB
1.69 MB MP4
Wan really does make some beautiful compositions sometimes.
>>
sucks that chroma spent some much money training the shittiest version (schnell) of the shittiest most anti-local model out there (flux). could've accomplished way more with 1/10 the epochs on Qwen
>>
>>106450392
You are 100% right about this but I know someone is going to try and dispute this.
>>
https://www.reddit.com/r/SECourses/comments/1n57u0x/people_really_doesnt_have_any_idea_what_ai_can_do/

I'm confused. Is furk implying this video is AI? Is it AI?
>>
>9:00am in Turkey
>>
File: WanVideo2_1_T2V_00189.mp4 (1.55 MB, 1248x720)
1.55 MB
1.55 MB MP4
>>106450431
He's president now. Renamed the country to Furkey.
>>
>>106450402
the thing with furry bakers is experimentation comes before model quality. chroma was absolutely Frankensteined, starting with the de-distillation and lobotomization of schnell. then there is the merge training process, the training on images 1/4 the size of the target inference resolution, the experimental VLM NSFW captions, the mixing of e621 and danbooru dataset tags, and the final "HD" epochs which seem to have gone wrong.
the goal was to make a local base model because Flux dev and sd3 had shitty api-shill licenses but now we have hidream and qwen which have good licenses so I don't see anyone ever using chroma as a finetune base.
>>
>>106450490
>I don't see anyone ever using chroma as a finetune base.
I don't get why this simple fact seems to upset people so much.
>>
any noteworthy qwen guides
>>
>>106450515
How about asking this instead?
>Any noteworthy qwen gens?
>>
The thread can be slow. You don't need bait.
>>
>>106450624
I'm gonna do it...I'm gonna take the bait!
>>
File: WanVideo2_1_T2V_00190.mp4 (3.19 MB, 1248x720)
3.19 MB
3.19 MB MP4
>>
>>106450277
It's a Qwen Image Edit Lora, yeah?
How come the skin isn't as plastic as most qwen image edit gens?
>>
I thought my Tesla M40 24gb was trash and i couldn't find a use case, but today i tried the gpt-oss:20b model and it actually works quite fast on it and the quality of the responses is also great!
>>
>>106450664
bot
>>
File: file.png (22 KB, 435x166)
22 KB
22 KB PNG
>>106450690
>>
>>106450658
It's a qwen image lora not edit, at least looking at the metadata
>>
File: WanVideo2_1_T2V_00191.mp4 (3.01 MB, 1248x720)
3.01 MB
3.01 MB MP4
>>
thread is die
>>
>>106450754
Hercules has lung cancer. He is die.
>>
>>106448350
res multistep + beta
>>
>>106450718
Still, even for Qwen that's weirdly detailed skin. It usually tends to suck ass at that.
Huh.
>>
>>106450796
He mentioned it learned the photoreal from the image set he trained on.
>>106448703
>>
>>106449724
based cat girl enjoying architect wizard
>>
>use reforge
>never 0000M
>use comfy
>0000M
What causes this
>>
>>106449868
mixing smoking with old disney is good but that is a slop gen
>>
File: WanVideo2_1_T2V_00192.mp4 (2.82 MB, 1248x720)
2.82 MB
2.82 MB MP4
>>
>>
File: file.png (25 KB, 2356x65)
25 KB
25 KB PNG
FUCK you, machine. FUCK you.
>>
>>106451125
>I am programmed
Yes. LLMs are "programmed"
>>
>>106450942
>What causes this
comfy
>>
File: ComfyUI_00052_.png (960 KB, 1024x1024)
960 KB
960 KB PNG
>>106450777
Is there any written guide on samplers + schedulers to make for chroma? I've been wasting precious electricity that niggers could be using to play 2k on da ps5 generating countless images of bound sluts in latex covered in slime to no avail.
>>
>>106451158
this ticks so many of my fetish boxes but anon's gens are ass.

You may need to try a mutli upscale (image gen ---> upscale ---> ksampler ---> upscale workflow) to get this stuff right. Some of the actually intelligent people here can probably help.
>>
>>
File: file.png (41 KB, 1250x122)
41 KB
41 KB PNG
>>106451134
>Hey do it anyway it's only for documentation bro
>Oh, ok
This shit is so stupid.
>>
can anyone share a chroma 1hd catbox? would like to try it out
>>
>>106451219
Why would you need a catbox? The model is right here for you to try.

https://huggingface.co/lodestones/Chroma
>>
>>106451219
Let me save you time and point you in Qwen's direction. It's the future.
>>
File: WanVideo2_2_I2V_00274.webm (1.08 MB, 1248x720)
1.08 MB
1.08 MB WEBM
>>106450332
>>
>>106451225
just so I can get a starting point, not exactly comfortable with noodle ui
>>106451227
Id like to try qwen too
>>
>>106447898
go back, worthless retard
>>106448565
fuck off, worthless retard
>>
>>106451393
damn you are one mad little idiot
>>
>>106451096
lol, shouldve prompted for beige liquid, that s too white
>>
>>106451410
>>106451196
>>
>>106451409
don't you have some dosghit thread to spam your putrid diarrhea no one cares about in?
>>
>>106451158
>>106451196
teach me your slime ways, senpai
>>
>>106451509
>spam
There is schizo's favourite word again.
I'm happy to keep correctly labeling you what you are: a mentally unstable moron.
>>
>>106451436
poor aqua, is she ok?
>>
File: 1753516936683783.jpg (2.07 MB, 2016x1152)
2.07 MB
2.07 MB JPG
>>
>>106451556
No. Engaging in sexual activity strips her if divinity. Drowning in semen counts. She's dead
>>
The term "AI artist" was a mistake, way too easy for everyone to attack. We should've just gone under the umbrella of "VFX artist" to begin with, it's going to completely replace hollywood CGI soon anyway.
>>
>>106451601
I was never in love with the term. Unfortunately we have a subject of people from a certain part of Asia that adore the and refuse to let it go.
>>
>>106451568
>15 days between top post and second top post
>spam
Oh dear, your brainrot is even worse than I thought. You are quite literally the biggest idiot on this website!
>>
File: fuck.webm (3.21 MB, 640x832)
3.21 MB
3.21 MB WEBM
Not quite right.
>>
>>106451601
art is also technology
> we are here.
history shows the way we'll go
>>
>>106451568
most people here have the decency to post their shitty throwaway gens only once
>>
>>106451254
How do you get this expression?
>>
>>106451781
>shitty throwaway gens only once

>Miku Hatsune does some stupid action at a low resolution at way too low steps number 2435214
>>
Invoke status?
Chroma status?
Anistudio status?
Dragged and shot status?
Panhovix status?
>>
local is dead
>>
>>106451873
Good
Based
Crashing wrapper
Not happened yet
No idea
>>
>>106451935
>>106451935
>>106451935
>>106451935
>>
real bake
>>106451942
>>106451942
>>106451942
>>106451942



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.