/g/ - Technology



Boing Boing Edition

Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106547629

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video
https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2122326
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
Blessed thread of frenship
>>
File: 1727826041995371.jpg (861 KB, 2048x1257)
https://tencent.github.io/srpo-project-page/
this is pretty bad, what the hell?
>>
>>106551018
Seems like you're fucking something up there, bucko.
>>
>>106551018
>fp8
Maybe because of that. Try to give it a wall of text prompt to properly gauge it.
>>
where the fuck is jenny
>>
>>106551018
every time someone tries to finetune flux, the details get destroyed (Chroma, Pixelwave, now this...)
>>
>>106550879
>for pony
Well shit, is one LoRA worth downloading Pony for the first time ever?
>>
How do you increase wan 2.2 motion? I tried bumping the lightning loras strength, using random workflows from civit but it just produces static. I'm going to have to add a retarded 3rd sampler, arent I?
>>
>>106551043
Don't use lightning at all
>>
File: 1742641546961832.jpg (602 KB, 2048x1257)
>>106551024
>Try to give it a wall of text prompt to properly gauge it.
All right, seems like it's really undistilled though
https://github.com/Tencent-Hunyuan/SRPO?tab=readme-ov-file#inference
>>
File: 1_00140_.mp4 (856 KB, 640x640)
>>
File: 1_00141_.mp4 (1.03 MB, 640x640)
>>
>>106551063
By wall of text I meant some sprawling boomer prompt lol. But this is fine too.
>>
>>106551057
Ok, how do I do it without it taking 15 minutes to gen?
>>
File: 1_00142_.mp4 (1.23 MB, 640x640)
>>
>>106551077
ohhhh, my b kek
>>
>>106551043
try not using the lightx2v lora on the high noise model.
>>
>>106551018
I mean it's clearly more of an optimization than a large-scale finetune along the lines of Flux Krea, Tencent literally says
>To further accelerate the process, our method supports replacing online rollouts entirely with a small dataset of real images; we find that fewer than 1500 images are sufficient to effectively train FLUX.1.dev.
>>
File: 1732524380469499.jpg (1.1 MB, 2048x1335)
>>106551077
>>106551090
>boomer prompt
meh... I really believe that Flux is impossible to improve, maybe they'll have more luck when they go for Qwen Image
https://github.com/Tencent-Hunyuan/SRPO/issues/2#issuecomment-3274994781
>>
File: 1_00133_.mp4 (1 MB, 640x640)
>>
>>106551118
Seems like 3.5 is baking it too hard.
>>
>>106551038
Nah, Flux Krea is very very good
>>
>>106551043
Don't use the lora on the high noise model. That's the one "in charge" of motion so to speak.
>>
>>106551122
uoh
>>
>>106550316
>how to make that work with wan22 two models
pic related is with native nodes
here is a workflow with kijai's nodes: https://pastebin.com/bZKuAKmx

wan "unlimited" frames
>>
File: 1733329024626790.png (1.9 MB, 1024x1024)
>>106551127
>Seems like 3.5 is baking it too hard.
I went for 2.5 and yeah it seems to be the right spot, but it still looks bad, Krea is still the only successful finetune of Flux
>>
>>106551118
Why didn't you just plop that shit into comfy and start prompting? Your results look nothing how they're supposed to.
>>
File: top kek.gif (259 KB, 500x354)
>>106551018
>he thought Tencent would save local
>>
>the 640x640 menace is back

Gen something bigger for once you hack
>>
File: its_raw.jpg (221 KB, 2127x1148)
>>106551177
Krea wasn't finetuned from the dev model that we got. They got a base distilled model to finetune off of. Picrel.
>>
>>106551230
You will be accused of being a vramlet but we all know what you said is true
>>
>>106551238
that explains a lot of things, I think it's clear that distilled models can't be saved at all
>>
>>106551149
Do you have to use radial attention, or can I change that back to regular sage?
>>
>>106551230
Let's see one of your videos.
>>
removing --fast from the comfyui launch arguments has really improved things, the quality jump is worth waiting an extra ~3s
>>
>chroma wasted $150k on flux
why didn't he realize it was unsalvageable? why is he still wasting time with it?
>>
File: FluxKrea_Output_255635.jpg (3.06 MB, 2432x1664)
saaar
>>
>>106551093
>>106551137
Thanks, much better. However, it creates a new issue, there seems to be a drunk effect where it adds "double". I assume high noise is slopping out
>>
File: 1_00151_.mp4 (959 KB, 640x960)
>>106551230
>>106551249
samefag mad because he got called out yesterday
>>
File: AnimateDiff_00302.mp4 (2.45 MB, 720x1072)
>>106551269
>>
>>106551287
How many steps are you using for each sampler? It's easy to accidentally introduce ghosting with the loras.
>>
>>106551288
Whatever helps you sleep at night anon
>>
>>106551287
the problem is if you don't use light with the high noise you might not generate a coherent enough latent for the low noise model to work with. how many steps are you doing?
>>
>>106551300
4 for high and 4 for low. Yeah, pretty new to 2.2. If I had the actual time to run these normally like 20 steps or something, I would
>>
File: step up senpai.jpg (70 KB, 786x595)
>>106551318
>>
>>106551332
>end at step 2
>start at step 2
>end at step 4
>>
I've tried running Wan 2.2 and ran out of pagefile memory; it used 40GB of memory (drive and RAM combined). What does it take to run this thing?
>>
>>106551323
Is the model with the Lora set to cfg 1? If you're using 4 steps, try setting the "stop at" to 2 for the high noise.
>>
>>106551332
you're not doing 4 steps with each. you're doing 2 steps with each for 4 total. change steps to 8 and adjust the "end at" and "start at"
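
To make the step math here concrete, a minimal Python sketch follows. It assumes both samplers share one schedule of `total_steps` and that the high noise sampler hands off to the low noise sampler at a single switch step. `split_steps` and `steps_run` are hypothetical helper names, and the dict keys only mirror the "start at" / "end at" labels used in the thread; this is not any UI's actual API.

```python
def split_steps(total_steps, switch_at):
    """Return (high, low) settings for two chained samplers on one schedule.

    Assumption: the high noise sampler covers [0, switch_at) and the
    low noise sampler covers [switch_at, total_steps).
    """
    if not 0 < switch_at < total_steps:
        raise ValueError("switch_at must fall strictly between 0 and total_steps")
    high = {"steps": total_steps, "start_at_step": 0, "end_at_step": switch_at}
    low = {"steps": total_steps, "start_at_step": switch_at, "end_at_step": total_steps}
    return high, low


def steps_run(cfg):
    """Number of steps a sampler actually executes."""
    return cfg["end_at_step"] - cfg["start_at_step"]


# steps=4 with a switch at 2 runs only 2 steps per model (4 total).
# steps=8 with a switch at 4 runs 4 steps per model (8 total).
for total, switch in [(4, 2), (8, 4)]:
    high, low = split_steps(total, switch)
    print(f"steps={total}: high runs {steps_run(high)}, low runs {steps_run(low)}")
```

With this framing, "4 for high and 4 for low" corresponds to `split_steps(8, 4)`, not `split_steps(4, 2)`.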
>>
>>106551287
I use at least 3 steps for the high noise model without the lora, CFG set to 3.5, euler/simple.
>>
>>106551093
>>106551318
Different anon, do I just increase the steps for high noise if I'm not using the 4steps lora? So it should be like ~10steps high no lora 4~6 steps low with lora?
>>
>>106551332
The cfg for the high noise should probably be higher than 1.
>>
>>106551149
I tried going for 165 frames but it freezes, though it works fine until at least 140 frames
>>
>>106551341
>>106551356
Yeah I'm confused. So I have to now increase the steps to 8 for each sampler? Then High end step 4 / Low start step 4 end step 8?

>>106551366
>>106551371
Increasing the cfg just burns it for me
>>
>>106551421
>So I have to now increase the steps to 8 for each sampler? Then High end step 4 / Low start step 4 end step 8?
yes if you want to do 8 steps
>>
>>106551461
Thanks for the update rocketnon.
>>
File: 1_00138_.mp4 (1.92 MB, 640x640)
>>
>>106551466
Would increasing it to 8 steps get rid of the ghosting issue? Think I'll just try the pusa loras
>>
File: wan.mp4 (2.01 MB, 640x1144)
>>106551354
vastly depends on how you configured things

you can reduce resolution and/or frames

you can use the comfyui-multigpu distorch node to configure how exactly your RAM/VRAM tradeoff works (one of the most efficient options to do that, also works with safetensors apart from gguf now).
definitely do NOT let the nvidia drivers swap to system RAM, that solution sucks

you can use gguf q8 or lower

there is also the "rapid" combined checkpoint that has a fairly average requirement and performance

maybe your browser still has hardware acceleration enabled and offloads quite a lot to the gpu vram? you may not want that.

and so on. just try one after the other until it's fine for your machine

>>106551290
nice and sloppy
>>
>>106551461
anime models have been perfected, all we need is a video model that can animate them well and we are set for life
>>
>>106551461
expected but still based
>>
>>
>>106551670
>4ch sloppa sdxl vae
>perfection
you're brown
>>
File: 1733073867741519.jpg (1.3 MB, 1248x1824)
>>
>>106551719
Reminds me of the yokai hundred eyes.
>>
>>106551705
Works fine for me, anything more and you're just fooling yourself into thinking artists can crank out something way better than slop.
>>
>>106551731
Shit can barely follow prompts, look like melted shit. Anime has gotta long way to go man. You gotta stop huffing that 1girl copium
>>
Say, are there any podman/dockerfiles to serve an image generation instance ready to go?
I'm making my own from scratch, and it's so boring...
Also, is it worth getting two RTX 5060 Ti 16G?
Or maybe most AI can't do multiple GPU?
Lastly, I'm a fucking noob (well, I knew a lot, but that was two years ago, so in practical terms, I'm a noob now). Where can I catch up a bit on the current state of AI?
>>
Retard here. I bricked my ComfyUI and had to reinstall from scratch. Usually that fixes it, but it's not working now.
I have a 4090. The Wan 2.2 rentry guide installed:
-Torch 2.8 with CUDA 12.8
-Triton 3.4

Before I had Torch 2.7 and Triton 3.3. Do I need to downgrade?
>>
>>106551769
Do you have the extra folders from the triton github readme?
>>
>>106551751
>Shit can barely follow prompts, look like melted shit.
hasn't been the case since SD1.5
>Anime has gotta long way to go man.
consistency and video generation maybe
>You gotta stop huffing that 1girl copium
it's a hentai generator, what else am I supposed to do with it?
>>
>>106551788
Spoken like a true gooner lol. If it satisfies you, you do you but I am personally sick of looking samey shit for years now
>>
>>106551781
looks like i have to update those, i'll give it a try. thanks.
>>
>>106551802
>looking samey shit
skill issue, git gud
>>
File: 1754517623251643.mp4 (1.25 MB, 832x640)
>>
>>106551848
>reposting my gens
I am confused, anon.
>>
File: 1756754049072100.png (1.41 MB, 1080x1080)
>>106551854
>>
>>106551544
Keep using 4 in each for now. Do you have a screenshot of your whole workflow? I'm using the stock comfyui one myself.
>>
Prompt better.
>>
Be better.
>>
>>106551781
>>106551807
yeah, no good. It's definitely something with triton/sage because without the speedups it runs fine (just slow)
I'll try to downgrade Torch & Triton and see if that helps, I guess others had issues with Torch 2.8 as well
>>
>>106552024
What's the exact error you got?
>>
Coom better.
>>
>>106552030
It runs, but my VRAM spikes up and down from 0% to 100% over and over, getting to the point where it freezes my computer and I have to close Comfy
>>
File: 1_00164_.mp4 (1.88 MB, 720x1072)
>>
Progress: Step 19 of 20 | Time: 2.72341s
ggml_backend_cuda_buffer_type_alloc_buffer: allocating 27263.30 MiB on device 0: cudaMalloc failed: out of memory
alloc_tensor_range: failed to allocate CUDA0 buffer of size 28587643136
[ERROR]: ggml_extend.hpp:1503 - Wan2.2-I2V-14B alloc runtime params backend buffer failed, num_tensors = 1095
[ERROR]: ggml_extend.hpp:1671 - Wan2.2-I2V-14B offload params to runtime backend failed
ggml_backend_cuda_buffer_type_alloc_buffer: allocating 27263.30 MiB on device 0: cudaMalloc failed: out of memory
alloc_tensor_range: failed to allocate CUDA0 buffer of size 28587643136
[ERROR]: ggml_extend.hpp:1503 - Wan2.2-I2V-14B alloc runtime params backend buffer failed, num_tensors = 1095
[ERROR]: ggml_extend.hpp:1671 - Wan2.2-I2V-14B offload params to runtime backend failed
Progress: Step 20 of 20 | Time: 2.71643s
[INFO]: stable-diffusion.cpp:2673 - sampling completed, taking 54.78s
[INFO]: stable-diffusion.cpp:2680 - generating latent video completed, taking 115.02s
ggml_new_object: not enough space in the context's memory pool (needed 1834938448, available 1738539008)
F:\Stable_Diffusion_Stuff\AniStudio\external\stable-diffusion.cpp\ggml\src\ggml.c:1663: GGML_ASSERT(obj_new) failed


awee maaaan. Imma wait until conv2d is all fixed up before I go for it again. this sampling speed is fucking crazy without all the snake oils. the vae absolutely ruins it all tho. fucking hell. have this hot reload to the face. I'm just going to clean shit up, test on linux and get the new binaries out
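
For anyone reading the numbers in that log, here is a small Python sketch that converts the byte counts into more readable units. The three values are copied from the log lines above; the variable and function names are hypothetical, and the sketch only does unit arithmetic, it says nothing about how ggml sizes its buffers.

```python
MIB = 1024 ** 2
GIB = 1024 ** 3


def to_mib(n_bytes):
    """Convert a byte count to mebibytes."""
    return n_bytes / MIB


def to_gib(n_bytes):
    """Convert a byte count to gibibytes."""
    return n_bytes / GIB


# Values copied from the log above.
failed_alloc = 28_587_643_136   # "failed to allocate CUDA0 buffer of size ..."
pool_needed = 1_834_938_448     # "needed ..." in the ggml_new_object line
pool_available = 1_738_539_008  # "available ..." in the same line

print(f"failed CUDA alloc: {to_mib(failed_alloc):.2f} MiB ({to_gib(failed_alloc):.2f} GiB)")

shortfall = pool_needed - pool_available
print(f"context pool shortfall: {shortfall} bytes ({to_mib(shortfall):.2f} MiB)")
```

The first figure matches the "allocating 27263.30 MiB" line, so the failed request is larger than the VRAM of any single 24 GB card. The final assert is a separate, much smaller shortfall in the host-side context pool.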
>>
File: 1_00166_.mp4 (1.72 MB, 736x928)
>>
>>106552108
I giggled
>>
>>106552082
>115.02s
what precision were you running?
>>
File: ComfyUI_00665_.png (2.07 MB, 1328x1328)
>>
File: 00063-3612921603.png (1.78 MB, 1024x1024)
>>
>>106552108
kek
>>
>>106552123
low/high noise fp8, umt5 Q6_k
>>
File: 1725545289666750.jpg (4 KB, 233x216)
did you guys make any celebrities for chroma or qwen?
>>
>106552158
>fp8
lol
>Q6 TE
lmao
>106552159
glow
>>
>>106552159
I unironically find most celebrities physically repulsive so no. Glow.
>>
>>106552168
it's not my fault ggml makes the vae so FAT
>>
>>
File: 1_00169_.mp4 (1.46 MB, 736x928)
fixed
>>
>>106552082
it might not work well now but god damn that speed is kind of nuts. I didn't think I would be excited for your app (no offense) but this is really good news
>>
>>106552241
she seems to jiggle according to her own "local physics", looks weird, and the lack of runners in the background makes this even less amusing
>>
>>
>>106552219
Praise be, Furkman.
>>
>>106552272
like bags of sand
>>
Are flux models generally the only ones that are 10gb+ in size?
>>
>>106552347
No. How new are you?
>>
>>106552347
non-pruned models meant for training can be that large, too, or something like that
>>
>>106552347
mine are larger but thats only a reflection of the size of my cock
>>
>>106552355
Is there much of a benefit to the non pruned models?
>>
>>106552359
none for regular image gen, as far as I know
>>
>>106552347
sdxl models are generally the only ones that are NOT 10gb+ in size
>>
>samefagging
>>
>>
Why did you stop posting anime? It's not enough.
>>
>>106552371
I meant for pruned models
>>
File: ups.png (814 KB, 711x558)
I thought upscale needs to be on low denoise...
>>
File: 2660229885.png (1.4 MB, 896x1152)
Edgar Allan Poe really had it figured out

https://voca.ro/11JHaUfcKJmb
>>
Any good Charlie Kirk Lora’s to celebrate the passing of a hero by genning?
>>
File: AnimateDiff_00771.webm (3.97 MB, 960x960)
>post clean AF furk gens
>crickets
>Someone posts a woman with weirdly wobbling boobs
>5+ replies

You have no taste
>>
>>106552159
Put Ryan gosling in a KFC
>>
>>106552472
waah waah gugu gaga
>>
anything new around here? 5s isn't doing it for me anymore.
>>
>>106552490
Make it so she pees in the chair and we can see the pee thru the chair
>>
>>106552490
notice hand go through chair after I post darnit
>>
>>106552490
>gooning yourself into impotence
>>
Still cooking 50% of the way there
>>
>>106552490
>5s isn't doing it for me anymore.
Slaanesh is with us.
>>
>>106552472
What makes you think people want to see that turkroach?

Get a grip on reality.
>>
>>106552540
You're gonna see him regardless.
>>
File: SAASgods.jpg (361 KB, 720x1117)
!!!!!!!Baby wake up new SAAS model came out!

Prepare your Comfy nodes!
Prepare your update.bat!
Check your ComfyUI account balance,
check your payment method is loaded.
IT'S HAPPENING!
>>
>>106552543
We don't care. You can complain about it again when you get no (you)s and we give them all to coomgens, bitchboy.
>>
>>106552543
Take your meds.
>>
>>106552415
Depends on multiple factors. Chads are able to denoise the second pass at 0.7
>>
File: comfy1.jpg (1.47 MB, 1440x2752)
>>
tfw didn't log into ComfyUI for 2 days and lost my monthly achievement. Now getting ads for using nunchaku nodes again. Pain.
>>
File: Comparison.png (3.85 MB, 2496x1216)
SRPO is a huge nothingburger, it deviates WAY less from Dev than Krea already did
>>
>>106552589
chinkbros....
>>
>>106552589
Yeah but it buried the last thread in shitposting anyway
>>
File: 2_00005_.mp4 (2.26 MB, 752x1104)
>>106552472
stay mad and brown
>>
File: file.png (139 KB, 1459x559)
why won't the first bar update at all?
It's pinokio and wan2.1 I2V 14B
>>
>>106552614
is she real bros
>>
>>
File: 2_00006_.mp4 (2.17 MB, 752x1104)
>>106552631
Yes that is a real woman, no source though sorry.
>>
File: 00076-889644807.png (1.75 MB, 1024x1024)
>>
>>106552589
But it only cost tencent, so it's still worth it
>>
>>106552589
it deviates less but that doesnt matter when krea is slopped, srpo is less slopped and could also be applied to krea if you actually like that trash lora too, retard
>>
File: 2_00008_.mp4 (537 KB, 528x1024)
>>106552583
cute
>>
>>106552655
that guy looks like a monster
>>
File: file.png (57 KB, 1320x288)
>>106552616
oh nvm is it a progression, like showing each and every step? Sorry I'm retarded... ._.''
>>106552614
hot!
>>
>>106552688
Was there a guy in this video ?
>>
>>106552705
No, whoever coded that just sucks, or it could be your terminal settings.
>>
>Complain about no replies,
>get 3 replies
Heh, gg no re.
>>
what models do you guys daily drive?
>>
>>106552758
Chroma obviously
>>
>>106552758
Illustrious and Wan
>>
>>106552758
Chroma, Wan, once OneTrainer Qwen support is done I'll try training some loras for it again since OneTrainer is so much faster than Diffusion-Pipe

If I can get qwen loras to train as well as Chroma does, it might become Qwen, Wan
>>
>>106552758
wan & noob eps 1.1
>>
>>106552773
>qwen
i tried qwen and didnt like it at all. went back to chroma. what makes you like qwen so much? also, what gpu are you using to train for chroma?
>>
>>106552758
What's it to you?
>>
>>106552788
Osmosis.
>>
there's so many fucking chroma models with no changelog ahhhhhhhHHHHHHHHHHHHHHH
>>
when the fuck are we getting the technology to combine two character loras and not have them blend together?
>>
>>106552783
>what makes you like qwen so much
Nothing really at the moment outside of its potential given its anatomy and prompt comprehension, but if I can get lora training to overcome its sloppiness and soft censorship, it could be a great model.

It's too slow in training for me to experiment effectively with when using Diffusion-Pipe, but for Chroma, OneTrainer cuts training time by half compared to Diffusion-Pipe, so with Qwen support soon in OneTrainer, it could become viable to experiment again.

Chroma is incredibly easy to train though and the results are great, and fast to train, so it's really hard to beat in GPU time / results ratio.
>>
>>106552783
Oh, forgot, I use a 3090, and sometimes a 5060 ti as well.
>>
>>106552472
am i supposed to know who that is?
>>
>>106552820
>>106552820
>to overcome its sloppiness and soft censorship
yeah, i dropped it because of that.

>Chroma is incredibly easy to train though and the results are great, and fast to train, so it's really hard to beat in GPU time / results ratio.
that's good to hear. ill mess with onetrainer, i assume the default profile for chroma should be fine.
>>
>>106551122
Top cute
>>
>>106552801
Chroma1-HD is the official high definition model trained on lots of resolutions, 512-1152

Chroma Base is 48 epochs trained only at 512

The other models are their specific epochs trained at 512, with the exception of v49, which is the same as Base but with a single epoch trained at 1024
>>
>>106552831
that's our king
>>
>>106551691
Do come in, Anon-kun
>>
>>106552819
they're not two characters but weight differences applied to a large neural network. *you* come up with the complex trick to do that reliably, figures we also get the solution to train every concept individually at the same time. maybe we won't even need whole models anymore, just have all your concepts one file each.

good luck anon
>>
File: ComfyUI_17153.png (2.9 MB, 1200x1600)
I only have time to test a few pulls before heading to work tonight, but I like the SRPO outputs so far.
>>
>>106552758
Chroma and occasionally Wan for a particularly good output.
>>
>>106552836
>default profile for chroma should be fine
There are quite a few Chroma presets, adjusted to your vram availability

If you can train at full bfloat16 even with offloading, I'd suggest doing that since it's better quality and the speed isn't much lower than float8

I can even fit full bfloat16 Chroma training on the 5060 ti 16gb, using 0.3 offloading, and it gets ~2s / iteration on 512 res, meaning you can train a person of 25 training images for 100 epochs in ~1h 30 minutes on said 5060 ti. OneTrainer is really well optimized.

On my 3090, I think the same training takes ~25-30 minutes
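
The timing estimate above can be checked with a quick Python sketch. It assumes batch size 1 and one iteration per image per epoch, which the post does not state; `training_seconds` is a hypothetical helper, and only the 25 images, 100 epochs, and ~2 s/iteration figures come from the post.

```python
def training_seconds(images, epochs, sec_per_iter, batch_size=1):
    """Rough wall-clock estimate: iterations per epoch * epochs * seconds per iteration."""
    iters_per_epoch = -(-images // batch_size)  # ceiling division
    return iters_per_epoch * epochs * sec_per_iter


# Figures quoted for the 5060 Ti: 25 images, 100 epochs, ~2 s per iteration.
secs = training_seconds(images=25, epochs=100, sec_per_iter=2.0)
print(f"{secs:.0f} s = {secs / 60:.1f} min")

# Working backwards from the quoted 25-30 minutes on the 3090.
iterations = 25 * 100
for minutes in (25, 30):
    print(f"{minutes} min -> {minutes * 60 / iterations:.2f} s/iteration")
```

Under those assumptions the pure iteration time comes to a bit under an hour and a half, which lines up with the "~1h 30 minutes" figure once caching and startup overhead are counted.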
>>
>>106552874
Thanks for that information. I have 16gbs so I can just set it to train and go do something else.
>>
File: comfy3.jpg (3.25 MB, 2160x2160)
>>106552679
Nice one anon, but you are the cute one here
>>
>>106552842
To add, there's a flash model available as well which simplifies outputs but still works fine.

There's also a radiance model (VAE-less) that isn't done yet, but it shouldn't take much longer at this point.

Any others fall under "experiment" territory and probably are janky as fuck. Annealed, supermix, 2k, etc.
>>
File: 2_00016_.mp4 (1.19 MB, 640x960)
>>106552862
>>
>>106552862
Would
>>
>>106552911
>ai is good enough for anons to goon themselves retarded to their oneitis
Truly a golden age we are living in
>>
>>106552905
i tried 2k and i like it. the radiant one gave me pitch black results but as you said, it isnt done yet. thanks for the descriptions.
>>
>>106552936
I haven't tried the 2k yet since I rarely output past 1024 to begin with. I'll probably still give it a try.
>>
>>106552936
radiance requires a modified version of comfy
>>
>>106552952
yeah, i figured it was me not doing it right since i assume he wouldnt upload something that gives those results.
>>
any suggestions on no-reg web img2img?
>>
>>106552672
this has to be bait lmao, there's no way anyone has a take this retarded
>>
>>106552862
it's objectively worse in every way than Krea, anyone claiming otherwise is gaslighting lmao. SRPO has absolutely none of the improved understanding of various styles and concepts or improved prompt adherence, it's barely different from normal Dev at all.
>>
>>106552987
sir this is long dick general. you're looking for small dick general
>>
File deleted.
ugh what are some good loras, model, and prompts for blowjobs..? all i'm getting is this nightmare fuel
>>
>>106552990
great argument
nobody gives a shit about krea bro
>>
>>106553006
Some guy a few days ago was going around being the oral insertion Lora bandit, he was making EVERYTHING suck dicks lol, was pretty funny.
>>
>>106553006
all you need is wan + a lora
>>
>>106553010
I'll see if i can find anything on archive thanks anon
>>
>>106553004
fair
>>
>>106553007
again this has to be bait, you would have to be an absolute moron to not think Krea is objectively better than Dev in a way that SRPO is not, because SRPO is nowhere remotely close to as large-scale a finetune.
>>
File deleted.
>>106553016
Yeah this was done with wan 2.1 and a blowjob lora... not what i was expecting when another anon made this with the same image, wan 2.1 and a 2070s.
https://files.catbox.moe/wkn0y8.mp4
>>
>wan 2.1
>>
>>106553032
again, nobody uses the censormaxxed bluefilterslop that krea is, its even worse than default flux when it comes to that blueish filter all krea images have, theres a reason its dead
>>
>>106553034
What lora? And use Wan 2.2
>>
>>106553050
gr8 b8 m8
dead by what standard lmao, images you see posted here that happen to state the model name?
>>
>>106553051
I don't know what lora that anon used, and he said he used 2.1
If you have a 2.2 lora you recommend I'll give it a try.
>>
>>106553058
just go on civitai, download blowjob loras, try one until you get one that works the best. simple
>>
>>106553054
notice how your low iq brain cant actually respond to the recognizable blueish filter of krea and censored arguments i made?

if gooners and foss community wont use your censored slopped krea model, then who will? the corpos? the people who have access to top tier proprietary models will use blue slop tune of FLUX? lmao.
>>
File: ComfyUI_17164.png (3.04 MB, 1200x1600)
>>106552911
Cute!

>>106553003
I personally didn't find Krea as flexible as vanilla Dev. It felt "smaller" and more limited in its output.
>>
>>106553058
Use Wan 2.2 and this lora; https://civitai.com/models/1874153/oral-insertion-wan-22
You only need to enter the trigger words and that's it, everything will end up with a dick in their mouths.
>>
File: comfy2.jpg (2.71 MB, 1440x2560)
>>
Hunyuan Image updated their repository to fix the refiner, which didn't run properly before. First test here. The refiner seems to be adding some dithering-like noise, similar to what's often visible in subtle textures from Wan. It's most evident in the sweater. Not sure if I can see any positive effects but need to try some more tests.
>>
>>106553084
wow much more plastic!!!!
>>
>>106553081
Oh yeah i was just seeing that one. I'll try it out thanks!
And by trigger words you mean the promp? Like Large White Cock will be enough?
>>
>>106553087
just read the description and test the model bro
>>
>>106553081
>>106553087
nevermind I get what you mean now. saw it on the description
>>
>>106553091
yea sorry bro I'm a little retarded .__.
>>
>>106553086
brutal
>>
How the fuck do I get Wan 2.2 i2v shit to actually look animated when using an anime/cartoon image? No matter what combo of buzzwords and phrases I use, it always ends up looking like the Live2d puppeteer/paperdoll bullshit instead of actual animation.
I've tried mentioning anime, animation, fuck I even detailed some of the 12 principles trying to get literally anything.
Still either get paperdoll animation or 3d model shit.
>>
File: eye unref-ref.jpg (94 KB, 604x298)
Tested Hunyuan refiner on a moeblob image and it actually helped a lot. Crop of eyeball. You can see that the details are significantly better with the refiner (right) than the base model (left).
>>
>>106553099
watch and learn, grasshopper
https://litter.catbox.moe/si0bgk6lhp87vt30.mp4
>>
>A Chroma lora trained entirely on >>>/mu/127706934 kpop imgs

Would be worth a try
>>
>>106553072
lmao, there's nothing you can say that will change the fact that here, for example, SRPO is very arguably WORSE than the original Dev due to how poorly resolved all of the details (like the watch, the necklace, and to some extent her eyes) are in it. Krea also did a better job IMO of getting the appearance right since the prompt there specified "mixed ethnicity woman".
>>
>>106552472
Missing his shots so bad the dudes don't even see him as a threat and keep walking.
>>
>>106553124
woops, didn't attach properly
>>
File: comfy6.jpg (3.57 MB, 2176x2880)
>>
File: Comparison2.jpg (2.66 MB, 2496x1216)
>>106553130
wtf kek
>>
>>106553121
Holy shit that's perfect!! I hope I'll be able to get a result as good as this
>>
>>106553123
>>>/r/
>>
>>106552904
https://litter.catbox.moe/kos2fpu1u90dit9y.mp4
>>
Here's a test showing output of Qwen and Hunyuan Image (no refiner) with the same prompt. I was seriously surprised how closely the character matched. The prompt wasn't that specific.

Suspicion of shared datasets still strong, but another more charitable explanation might be that they both used Qwen VLM for TE and probably captioning as well.
>>
>>106553134
srpo is a direct improvement towards actual realism and is better than krea because the problem is that krea is very opinionated in its colors and shadows/shading, for example try to generate anything similar to the realism of srpo in picrel with krea, it's literally impossible
>>
>>106553172
>>
>>106553154
>>>/r/

Nigga I'm doing it myself
>>
>>106553179
What was the prompt?
>>
>>106553197
>a painting by mucha of a female space marine wearing a robotic exoskeleton. she has medium-length layered cyan hair and dark orange glasses. her armor is painted in a worn arctic camo pattern. she is standing in a forest with one foot raised on top of a small boulder. her arms are crossed, and she is looking at the viewer. the painting is by mucha.

If you're curious about the repetition of the artist name, I'd noticed that occasionally it managed to extract a bit more style.
>>
>>106552801
Every Chroma version leading up to v48 is the same. V49 and v50 are substantial improvements (changelog: multiple subjects, higher quality images). Chroma HD Flash takes those improvements and then fixes small details. The best Chroma experience is by playing around with settings figuring out which version is best for whatever prompt you're going for.
>>
>>106553214
>V49 and v50 are substantial improvements
bro lives in an alternate timeline
>>
>>106553124
You're acting like it's the devil incarnate when you don't understand the significance of why people are hyping it up. It's supposed to help you get better results from training a model vs prior methods, full stop. The finetune is to show what it can do in a limited time and hardware budget to steer a model towards what "better" means and that the method works. A finetune of the magnitude Tencent did with Flux to get the current model is not going to solve anything you have been complaining about. Also, yes, it's going to "resolve" details like the watch, necklace and eyes because SRPO actually understands and steered the model towards how a photograph's blur and focus actually works.
>>
>>106553219
Prompt following jumped up a lot with v50
>>
>>106552108
you can't run
https://litter.catbox.moe/2spkjf7gbz5j30pd.mp4
>>
>>106553214
I heard 29 was the last good one from another poster but I can try 49 and 50 too.

>Chroma HD Flash
Chroma1-HD-dc-fl.safetensors
this one, i assume?
>>
File: comparison3.jpg (1.87 MB, 2048x1024)
>>106553178
Here's an actual same-seed comparison of SRPO to Krea I did for that example prompt from Tencent myself (cause it's not my problem that their original images aren't reproducible since there's no params provided).

I don't think either are bad here but I prefer the less intense bokeh in Krea (which is an advantage it has over Dev in general pretty often).
>>
>>106551063
https://litter.catbox.moe/uvc9k0ryzm1snbnm.mp4
>>
>>106553214
This was covered a bunch already. The "official" versions are now Base and HD. Base is v48 and HD is that + additional hi-res training up to 1152. 49 and 50 are both deprecated and there's no real reason to use them.
>>
>>106553244
>v29 was best
Nah, it's on par with v26-v48 which are all on par with each other.

>HD Flash
https://huggingface.co/lodestones/Chroma1-Flash/tree/main
>>
no one is safe
https://litter.catbox.moe/usenaq5afndz2n7a.mp4
>>
>>106553219
I agree with bro, in fact v49 is my slight favorite over Chroma1-HD for lora training

Have never used Chroma Flash though
>>
File: comfy7.jpg (1.78 MB, 2560x1440)
>>
>>106553237
the other guy is the one yelling about MUH SLOP with regards to Krea and claiming that I should care about SRPO from a "right now" end user perspective despite it being barely different from Dev at all.

>Also, yes, it's going to "resolve" details like the watch necklace and eyes because SRPO actually understands and steered the model towards how a photograph's blur and focus actually works.

I don't understand what you mean by this, it does not, in fact, do anything other than what it actually did IRL when I ran inference with it. I don't care about the hypotheticals of a larger application of SRPO that doesn't actually exist currently, if that's what you mean.
>>
>>106553264
you seem pretty knowledgeable about chroma so i'll try these out. thanks
>>
>>106553268
Fuckin kekd, the bandit has returned
>>
>>106553278
Keep in mind the proper settings for HD Flash are the heun sampler at 8 steps.
>>
>>106553290
And CFG set to 1 (no negs). It's slightly worse at prompt following, but it's fast.
>>
>>106553268
Not like this furkansisters
>>
>>106553272
>the other guy is the one yelling about MUH SLOP with regards to Krea
And he is right, Krea is "slopped" per the definition of having biased preferences for certain tones, lighting exposure, and detail texturing that results in, for example, oily/glossy skin on humans.
>I don't care about the hypotheticals of a larger application of SRPO that doesn't actually exist currently, if that's what you mean.
Why are you out doing A/B testing and pooh-poohing it then if you "don't care"? You're a terrible liar because you clearly do care enough to complain and do several comparison shots and posts about it. If "slop" doesn't bother you, by all means, keep enjoying it until the end of time, but I would actually like my models to generate realistic images when I specify realism, without the issues I pointed out above.
>>
>>106553134
Krea has sharper details but imo looks really fake and uncanny, I think SRPO won in this one, same for the goth one but yeah Krea looks better for the cat. Though I feel like I have seen much better realism from Krea so could be your prompt.
>>
File: ComfyUI_00016.mp4 (1.53 MB, 480x832)
Chroma is good but Wan T2V with a good Lora is just too convenient
>>
>>106553290
>>106553295
yeah, i saw it. thanks.
>>
>>106553327
It looks like Stephanie for like half a second and then it's ruined. Nice touch on the docile choker.
>>
File: ggirs5ouh5l91.jpg (40 KB, 640x366)
>>106553270
colors reminded me of this shit
>>
>My furk gens
Exciting, dynamic, high fidelity, broad subject matter

>Your one girl gens
Noisy, lazy, boring, 1 girl

The people who scream against furk gens are just a vocal minority. I know my true fans number in the billions.
>>
File: ComfyUI_00938.mp4 (1.11 MB, 544x896)
>>106553358
I think I need to retrain with a better dataset
>>
>>106553407
Yeah, the likeness is there but it's clearly not her, which ruins it in my opinion. Might want to train on some of her with the rorschach face paint she wears too, I think it's nice.
>>
>>106553402
buy an ad, turkroach
>>
File: 00113-1290074669.jpg (442 KB, 2112x1184)
The power crystal in the enchanted forest of Valenvar has awakened.
>>
>>106553402
stay true king
>>
File: 00103-955750093.jpg (447 KB, 2112x1184)
>>106553420
Collect it, or it may eventually shatter in a year, bringing doom to the lands through crystallization.
>>
File: 12.jpg (1.29 MB, 2048x2048)
>>106553179
>>106553213
seedream aces this btw. neither qwen nor hunyuan had even the slightest idea who mucha was despite your repetition. another total SaaS victory while local can only cope with slop. local has no artstyle
>>
>>106553402
>Exciting, dynamic, high fidelity, broad subject matter
>Furkan
And this is why you lose
>>
File: 00106-1122131952.jpg (471 KB, 2112x1184)
>>106553427
A pretty, but ultimate fate as all becomes crystals.
>>
>>106553428
First law of image generation: If you can run an image model locally, it's garbage.
>>
>>106553429
Furk is always in an exciting and unique situation, 1girls just stand there boobing around boobily.
>>
>>106553439
Stop being gay

Worse, you're gay with shitty taste
>>
>>106553438
>source: my ass
no
>>
File: 1743989819271220.png (2.11 MB, 1328x1328)
>>106553428
>>
>>106553449
God you're dumb.
>>
>>106553438
Go and generate safely in your padded cell
>>
>>106553428
Not wrong. Really, it's very embarrassing how downhill it's been in this area since SD 1.5.

>"byzantine mosaic"
>>
>>106552048 here. The problem was actually Firefox - it does not like the new ComfyUI update. I switched over to Chrome and everything is running perfectly (even better than before I fucked it - 70 seconds to generate 480p videos, down from 90 seconds).

Thank you >>106551781 and >>106552030 for trying to help.
>>
>>
Ok I’m not terminally online like some of you fuckers itt, what’s with the shitskin posting?
>>
>>106553530
That "shitskin" invented the LoRA. Watch your tone.
>>
>>106553530
>what’s with the shitskin posting?
Mental illness.
>>
>>106553530
>what’s with the shitskin posting?
Mental wellness.
>>
File: IMG_2968.jpg (305 KB, 1206x537)
>>106553538
? The original low rank adaptation paper was published by chinks, the fuck you on about?
>>
File: unnamed.png (490 KB, 1024x448)
>>106553550
ru okay?
>>
>>106553530
The shitskin became notorious after taking anons training script from this thread and paywalling it. That's the real answer.
>>
>>106553538
Are you retarded ? Furkan hasn't invented shit. Lora was created by chinks
>>
File: file.png (66 KB, 885x565)
>>106553121
What settings was this made with?
This is what mine came out as: https://files.catbox.moe/otb5sj.mp4
I think the settings in picrel are what I used, on Wan2.2 Image2video 14B.
>>
>>106553530
>>106553573
He now apparently makes enough money from repackaging guides and resources to get a hair transplant and probably some other shit too but I'm not that big a TurkFurk connoisseur. Anon will call me schizo but he absolutely drops in here from time to time still.
>>
>>106553574
>>106553567
????
>>
>>106553567
Give link to paper
>>
>>106553582
Here's the paper: https://arxiv.org/abs/2106.09685

Show me Furkan
>>
>>106553583
When did furk haters get so lazy?
>>
File: unnamed (1).png (470 KB, 1024x402)
>>106553587
I'm not sure I understand. He's right there. Last name in the credits.
>>
>current time in Turkey: 8:00
>>
File: WanVideo2_1_T2V_00197.mp4 (1.67 MB, 1248x720)
>>
>>106553596
Why continue with this lie ?

I posted a link to the paper, no Furkan, the jig is up
>>
>>106553611
Maybe you should visit the optometrist. Are you okay my friend?
>>
Oh ok I get it now this is just some schizo shit, gotcha.
>>
>>106553588
remember the good ol days when entire threads would be furk discussion
>>
>>106553634
You can tell who's new here by their abject hatred of the man rather than enjoying clowning on him.
>>
furk if youre in here give us a sign
>>
>>106553647
>>
>>106553579
just kijai's 2.2 workflow with 2.1 lightx2v loras. 6 steps, dpm++ sde
>>
>>106553634
Thankfully no
>>
File: WanVideo2_1_T2V_00198.mp4 (2.01 MB, 1248x720)
desu. I think you're dumb.
>>
>>
File: file.png (686 KB, 1080x524)
>>106553264
>>106553278
>>v29 was best
>Nah
he's full of shit, if you want sovl you have to go for versions before 30, after that he slopped the model by forcing it to go for lower steps
>>
>>106553775
noob?
>>
>>106552589
that's because Krea was trained on the undistilled version of flux dev, and you can see that SRPO unslopped the skin, which is all we asked from Flux dev >>106551238
>>
>>106553794
>>106553794
>>106553794
>>106553794
>>106553794
>>
File: WanVideo2_1_T2V_00199.mp4 (2.35 MB, 1248x720)
>>
>>106553792
Ya
>>
>>106553845
Actually, wait, that one was with a Noob variant: https://civitai.com/models/1045588?modelVersionId=1767015
>>
>>106553514
we should put this on his discord kek
>>
>>106553688
>lightx2v loras
You're lazy which is why your slop will only ever be highly polished incoherent simplistic garbage. You're the type of faggot that immediately demands a workflow before downloading a lora.

Stop using lightx2v lora's and any other gimmick and watch as you no longer have to keep genning failure after failure because dur that is how cfg fucking works you fucking pea-brained cunt, i hate everyone of you. Those lora's have always only ever been retarded and shit. even 2.1 version is shit, it always fucking ignores my prompts.
>>
>>106553466
>requiring a chastity belt
>>
>>106553853
Nice pic, and actual feminine hands, not like Chroma. Is it the new Chinese model?


