/g/ - Technology

Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106830604

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
File: cuda_throne.png (1.23 MB, 1024x1024)
1.23 MB
1.23 MB PNG
first time qwen failed text this bad for me altho I didn't prompt it
>>
File: 1733666995063777.png (1.14 MB, 1376x760)
1.14 MB
1.14 MB PNG
the dog on the left is wearing a chinese rice farming hat, and has glowing white eyes. the dog is shooting lightning from its paws at the man in the middle who looks surprised.

qwen image edit 2509 Q8 quants + new edit lora, 8 steps

kek
>>
>>106833555
Lyra wins! Cockroachlity!
>>
File: WanVideo2_2_I2V_00511.webm (742 KB, 704x1248)
742 KB
742 KB WEBM
>Ugughugh
>I'm a street fighter
>>
File: 1742981747010385.png (1.15 MB, 1376x760)
1.15 MB
1.15 MB PNG
>>106833555
the dog on the left is wearing a chinese rice farming hat, and the dog has glowing white eyes. keep the dog's appearance the same. a large lightning bolt is approaching the man in the middle from the right.

better
>>
>>106830631
strange biting pattern, probably needs to be re-genned?
>>
>>106833588
what's the news on that story? Hasan made a video "showing" that it wasn't a dog shock collar (that it was only a vibrating one), then some twitter posts showed that it was a dog shock collar
>>
>>106833627
he lied about a shock collar not being a shock collar and /pol/ already found the exact model he used.
>>
>>106833627
It was obviously a shock collar, but the story will be spinned as Hassan wants, his fans will follow his every word, and others will just have more disgust on how utterly horrible this human being is.
>>
>>106833633
>/pol/ already found the exact model he used.
/pol/ is always useful when dealing with shit like this, that's why I can't hate this board, they fight the right fight
>>
File: 1741544148475389.png (1.02 MB, 1376x760)
1.02 MB
1.02 MB PNG
remove the man with glasses. replace the blue "HASAN" text with "KAYA". Add a large dog bone and dog treats to the room.

saved.
>>
>>106833652
qwen edit even copied the neon sign style, such a neat tool.
>>
oh, oh no
https://civitai.com/models/1901521
>>
>>106833686
https://en.wikipedia.org/wiki/Shock_collar
>>
>>106833689
wake me when i can download and shit on it properly
>>
Anyone else wish we could place shock collars on some of the posters here?
>>
>>106833689
Everyone saw this coming.
>>
Local Diffusion?
>>
File: 1754463850207224.png (1.08 MB, 1024x1024)
1.08 MB
1.08 MB PNG
the anime girl is giving a thumbs up and wearing pixelized black sunglasses.

cleansing my system with a Miku cause fuck that dog abusing faggot.
>>
File: 1750570060379494.png (54 KB, 568x202)
54 KB
54 KB PNG
>>106833689
What a sad and pathetic ending.
>>
>>106833737
they said its for a week, im talking about the shitty gens
>>
File: file.png (2.79 MB, 1280x1536)
2.79 MB
2.79 MB PNG
>>106833689
>>106833743
SD1.5 tier
>>
>>106833689
https://civitai.com/images/104983866
kek'd
>>
>>106833582
If someone trained, for example, Sniper Elite-style x-ray bulletcams for WanVideo, where could they publish this type of lora? (I didn't train one, just curious if someone did.)
>>
>>106833758
Still cute freckles.
>>
>>106833750
>subject expands to include one more example that mentions a rabbi
>a regular anonynous poster (tm) starts kvetching like his dad was accused of something
smartest brown lmao
>>
>>106833743
"Style clusters" clearly contribute to the god awful results.
>>
File: WanVideo2_2_I2V_00512.webm (699 KB, 1248x704)
699 KB
699 KB WEBM
>When you left the ez bake oven on.
>>
>>106833737
surpassed by noobai vpred and illustrious. wai v15 does literally everything you can ask for and you rarely need a lora cause it knows so many characters natively.

https://github.com/DominikDoom/a1111-sd-webui-tagcomplete

with this extension I can get exact tags for characters/styles/concepts with ease, no need to look it up. add in adetailer and controlnets and you can do basically anything easily.
>>
>>106833821
*also note it works with reforge too, you dont just need auto1111 GUI.
>>
File: 00002-3815787921.png (2.23 MB, 1024x1280)
2.23 MB
2.23 MB PNG
>>
in the grand scheme of things you are just a faggot boot licker an anti jewish and christian person. Just following orders.
>>
File: 00003-323737886.png (2.89 MB, 1024x1280)
2.89 MB
2.89 MB PNG
>>
https://old.reddit.com/r/StableDiffusion/comments/1o0v232/pony_v7_release_imminent_on_civitai_weights/nih4khl/
https://old.reddit.com/r/StableDiffusion/comments/1o0v232/pony_v7_release_imminent_on_civitai_weights/nih4h1z/
>ponyfag working with lodestones to train v8 on qwen
fucking lol. i guess chroma came and went.
>>
>>106833928
and you complain?
>>
File: 104970654.jpg (230 KB, 1024x1536)
230 KB
230 KB JPG
https://civitai.com/images/104970654
>With big text "It's so real" on top and small text "You can almost taste it!" on bottom.
lel
>>
>>106833945
yes
>>
enough talking to the retarded hate everything crowd tonight.
>>
File: WanVideo2_2_I2V_00513.webm (658 KB, 704x1248)
658 KB
658 KB WEBM
>Oh noo. I can't stop worlding my warcraft!
>>
File: 1750262188069430.png (2.12 MB, 1040x1560)
2.12 MB
2.12 MB PNG
>>
how excruciatingly slow would doing something like this >>106833582 on rtx 20xx 8gb be ?
>>
>>106834030
At that res? You probably could have saved up for a new gpu in that time otherwise what like... ten minutes? idk.
>>
>>106834030
probably quicker with cpu
>>
>>106834033
>... ten minutes? idk.
lol no.
>>
File: qwen syuen bf16 test.png (1.33 MB, 1584x1312)
1.33 MB
1.33 MB PNG
I downloaded Qwen 2509 BF16 and tested it. It can match Nanobanana at creating back views of weird and complicated costumes. The Qwen 2509 FP8 e4m3fn quantized version did not understand avant-garde character designs well. Very happy that I can kick Nanobanana to the curb with its bullshit filters.

DisTorch2 Model Final Device/Layer Assignments
--------------------------------------------------
Device              Layers   Memory (MB)   % Total
--------------------------------------------------
cuda:0 (<0.01%)        485          2.33      0.0%
cuda:0                 447      20543.58     52.7%
cuda:1                 396      18421.99     47.3%
--------------------------------------------------
[MultiGPU_DisTorch2] DisTorch loading completed. Total memory: 38967.90MB
100%|| 8/8 [00:20<00:00, 2.51s/it]
Requested to load WanVAE
loaded completely 4538.394530296326 242.02829551696777 True
Prompt executed in 48.43 seconds


https://github.com/pollockjj/ComfyUI-MultiGPU
>>
>>106834033
>>106834037
is wan iq3_km adequate?
>>
>>106834053
No idea, but in my experience anything below Q4 is basically unusable and even Q4 itself is a huge compromise.
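
For a rough sense of what each quant level costs in memory, here is a minimal sketch. It only covers the two simplest GGUF block formats, Q8_0 and Q4_0, whose layouts are 32 weights plus one fp16 scale per block. K-quants and IQ quants use different layouts and are not modeled here, and the 14e9 parameter count in the usage note is a hypothetical number, not a measurement of any specific Wan checkpoint.

```python
# Bytes per 32-weight block for two simple GGUF quant types.
# Q8_0: 32 int8 values (32 bytes) + one fp16 scale (2 bytes) = 34 bytes.
# Q4_0: 32 4-bit values (16 bytes) + one fp16 scale (2 bytes) = 18 bytes.
BLOCK_BYTES = {"Q8_0": 34, "Q4_0": 18}
BLOCK_WEIGHTS = 32


def bits_per_weight(quant: str) -> float:
    """Average storage cost of one weight, including the per-block scale."""
    return BLOCK_BYTES[quant] * 8 / BLOCK_WEIGHTS


def weights_gib(n_params: float, quant: str) -> float:
    """Approximate size of the quantized weights alone, in GiB."""
    return n_params * bits_per_weight(quant) / 8 / 2**30
```

Calling `weights_gib(14e9, "Q4_0")` versus `weights_gib(14e9, "Q8_0")` gives a ballpark for how much memory dropping a quant level saves. It says nothing about how much quality it costs, which is what the posts above are arguing about.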
>>
>>106834030
maybe 10-15 mins? you might have to find out.
>>
Anyone use Pinokio? I hear it is very messy to uninstall and akin to bloatware.
>>
>>106834049
Huh, did multigpu get better or something?
>>
>>106834078
What is pinokio doing that you can't?
>>
>>106834100
Makes installing much easier.
>>
>>106834078
It's bloatware. But so are the others. The only thing that's not bloatware is a program written in C/C++ that uses dll libraries for clean usage.

Like stable-diffusion.cpp. Clean/simple shit.
>>
>>106834111
I've only ever seen pinokio in reference to issues with it. I feel it's better to spend five minutes writing the basic command line arguments to install a repo rather than obfuscate it behind whatever pinokio is doing.
>>
>>106834049
I had no speed difference between using ram + gpu and second gpu vram instead.
So it was faster (twice as fast) for me to use 2 separate comfyui sessions, each one with one gpu and offloading to ram.
Now I had 2 x 3090, not sure what you use.
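
A minimal sketch of the "one ComfyUI session per GPU" setup described above, assuming ComfyUI is started with `python main.py` and accepts a `--port` argument. Each process is pinned to one card through the standard `CUDA_VISIBLE_DEVICES` environment variable. The function names, the base port and the install path are placeholders, not anything from the post.

```python
import os
import subprocess
import sys


def instance_specs(gpu_ids, base_port=8188, entry="main.py"):
    """Build one (environment, argv) pair per GPU, each on its own port."""
    specs = []
    for offset, gpu in enumerate(gpu_ids):
        # Each process only sees its own GPU, so it behaves like a single-GPU box.
        env = dict(os.environ, CUDA_VISIBLE_DEVICES=str(gpu))
        argv = [sys.executable, entry, "--port", str(base_port + offset)]
        specs.append((env, argv))
    return specs


def launch(gpu_ids, comfy_dir):
    """Start the sessions; comfy_dir is the (assumed) ComfyUI checkout folder."""
    return [
        subprocess.Popen(argv, env=env, cwd=comfy_dir)
        for env, argv in instance_specs(gpu_ids)
    ]
```

With two cards, `launch([0, 1], "/path/to/ComfyUI")` would give two independent web UIs on ports 8188 and 8189. The same workflow can then be pasted into both, as the post describes.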
>>
I am genuinely confused as to wheter ramtorch is actually a real thing or just the furfag dipping his toes where they don't belong again and causing a fuss over nothing.
>>
>>106833928
More like Auraflow came and went.
>>
>>106834128

Interesting strategy, but I don't prompt fast enough to need batch size 2 gens.

5090+4090+ 124gb sysram.
>>
>>106834133
He's training Radiance with it and Ostris seems to manage to train Qwen with it.

Also when did he cause a fuss over nothing? Altering the Flux Schnell model and training Chroma on it also worked. Radiance works too after adopting the PixNerd scientific paper into Chroma.
>>
>>106834164
>Also when did he cause a fuss over nothing?
I don't like this tone. I'm going to assume you're one of his lackeys and hide your response. Can I get a response from someone who isn't a member of his patreon?
>>
>>106834133
>>106832028
>>
File: WanVideo2_2_I2V_00514.webm (965 KB, 1248x704)
965 KB
965 KB WEBM
>Braps then leaves.
>>
>>106834142
It's just a way to prompt more, I copy paste the workflow in both, it's essentially giving me twice the speed.
Of course, this is useful for slow stuff like wan.
>>
Can anyone post a funny?
>>
File: 1739774532460391.png (1.28 MB, 1040x1560)
1.28 MB
1.28 MB PNG
>>
File: AnimateDiff_00329.mp4 (1.91 MB, 1280x720)
1.91 MB
1.91 MB MP4
>>
>>106834049
Did you try the GGUF Q8? I got the impression it's one of the most used variants around here.

>>106834182
> this tone
don't make asspulls if you don't want to be challenged on them.
>>
>>106834269
>Challenged.
Don't take an accusatory tone with me. Blocked.
>>
>>106834256
Apparently wan doesn't understand how violence works. Battle Royale with Cheese?
>>
>>106834287
yes, and I'd guess most chinese video models will filter violence
>>
File: AnimateDiff_00331.mp4 (1.91 MB, 1280x720)
1.91 MB
1.91 MB MP4
>>106834287
yes, but lightx2v makes it worse
this one is without lora
>>
File: 1735082152733117.png (1.91 MB, 1000x1127)
1.91 MB
1.91 MB PNG
>>106833689
>in the past:"guys buy a high pc, let's goooo"
>now:"only api bros, no one care about your rtx 500gb vram"
>>
>>106834360
No, I think it'll work on rather normal hardware. It just probably isn't very good overall.
>>
>>106833928
now he's gonna finetune a 20b model? jesus, how much money does he have?
>>
>>106834414
He could slap his supporters in the face and fuck their sister and they would still donate to him.
>>
File: 1730278165465832.png (1.55 MB, 1728x1344)
1.55 MB
1.55 MB PNG
>>
File: WanVideo2_2_I2V_00515.webm (2.09 MB, 1488x848)
2.09 MB
2.09 MB WEBM
>>
>>106834424
he DID release pony and you can complain about AuraFlow - but it wasn't an impossible idea and it's not a rugpull or anything.

most commercial models devoured more money behind the scenes too, we just didn't see it.
>>
>>106834445
bruh, it took him more than a year to get this piece of shit lol, by the time he has finished finetuning Qwen, we'll probably have a stronger base model, he's taking way too long
>>
https://www.reddit.com/r/StableDiffusion/comments/1o1u2zm/text_encoders_in_noobai_are_dramatically_flawed_a/
Uh oh, noobai sissies, how do we respond?
>>
File: 1744868365595611.png (2.91 MB, 1728x1344)
2.91 MB
2.91 MB PNG
>>106833928
holy fuck v8 is going to be a disaster. the furry has been ok but flawed. the horsefucker is going to drag him down.

>>106834455
exactly, qwen is fun but too fucking bloated. we need a new model that has a local-friendly architecture that uses newer optimizations.
>>
>>106834466
Is this the same clip schizo as always?
>>
>>106834472
no, looks like someone else
>>
>>106834476
Is it possible to have a clip schizo get to their point in under three paragraphs?
>>
>>106833928
>ponyfag working with lodestones
I mean...
>>
>>106834486
kek you have a point, everytime a guy talks about clip he writes a fucking bible about it, I don't know why
>>
File: radiance.png (2.94 MB, 848x1488)
2.94 MB
2.94 MB PNG
>>106834455
so far it is not looking like anyone is publishing local and open nsfw (including furry and most of *booru) base models

these finetuning efforts that currently take months will likely usually be first for now
>>
>>106834443
How is it such high quality?
>>
>>106834506
BUNDA BUNDA BUNDA
>>
>>106834506
>such high quality
there's a lot of motion weirdness goin on though, and after looking at thousands of sora's videos, Wan looks like shit now :(
>>
>>106833928
fuck it we needed a qwen finetune anyway
>>
>>106833928
desu if I was lodestone I would take it personally, the ponyfag is literally making him understand that his chroma model is shit and is not deserving of a finetune lmao
>>
>The anti chroma schizo was right about the qwen finetune all along
>>
>>106834466
Stopped reading at "Noob 1.1" as the last and defacto release is vp 1.0
>>
File: radiance.png (2.65 MB, 848x1488)
2.65 MB
2.65 MB PNG
>>106834471
>we need a new model that has a local-friendly architecture that uses newer optimizations.
that was likely what he tried with auraflow but obviously it didn't go as well as hoped?

yes, neta yume lumina and chroma worked out better but that's also somewhat larger models... still it's fairly obvious to me why they'd want qwen also with the edit mode features.
>>
File: 1756338247316909.png (137 KB, 2092x714)
137 KB
137 KB PNG
>>106833928
I don't get it, he wants to transform Qwen Image into an edit model? then why not finetuning Qwen Image Edit instead?
>>
>>106834501
Neta Yume Lumina isn't even a month old and has great potential.
>>
this pony faggot is intentionally making his models worse by censoring the dataset and removing artists time and time again and yet you dumbasses still have some hope in him?
>>
>>106834589
I don't, I'm passing time desu kek
>>
>>106834589
>and yet you dumbasses still have some hope in him?
this thread is not a collective consciousness dont lump me in with the rewards
>>
>>106834599
you're such a reward anon!
>>
>>106834584
unless that notorious fraud actually makes a noobxl size finetune i don't see it actually being something besides a 1girl simulator
>>
>>106834589
the lode of stones will set him straight.
>>
File: WanVideo2_2_I2V_00516.webm (2.7 MB, 848x1488)
2.7 MB
2.7 MB WEBM
>>
File: radiance.png (2.63 MB, 848x1488)
2.63 MB
2.63 MB PNG
>>106834584
yes, certainly, that one still seems to be learning useful stuff!

the overall tuning of lumina, neta lumina to neta yume lumina took a while tho. it is simply good that it continued working until now and therefore likely more into the future.

for auraflow from what I gather it hit a wall and retries and trying to "go around the blockage" figuratively didn't seem to work, so unless you had ideas how to fix the architecture I guess it's better to abandon auraflow
>>
this level of ESL has to be SEA
i know its you, jungle asian
>>
>>106834638
You have to link to the posts you're accusing, schizo.
>>
File: radiance.png (2.85 MB, 848x1488)
2.85 MB
2.85 MB PNG
>>106834506
yea, wan is great. it could be even a little better if you burn more gpu time... most don't have the patience tho
>>
Do the differently scaled models change the animation dramatically, or is it mainly the quality that differs?
When trying to make a prompt work, would it make sense to spam gens on the lower-quality models and then get final results on the bigger models?
>>
File: WanVideo2_2_I2V_00517.webm (1.76 MB, 848x1488)
1.76 MB
1.76 MB WEBM
>When you forgot to alter the framerate
>>
>>106834615
>i don't see it actually being something besides a 1girl simulator
Some of the examples show multiple. In fact, one of them shows 5girls all individually prompted. https://civitai.com/images/99675719
>noobxl size finetune
Obviously desired but not crucial for adoption. Illustrious was great even before it was dicked down with e621.
>>
File: file.png (2.88 MB, 848x1488)
2.88 MB
2.88 MB PNG
>>106834680
the quants can affect what happens in the animation pretty strongly. it isn't just "the same with less visual fidelity".
>>
>>106834680

Video gen is a wild horse. Change one setting and the whole motion flow changes. I doubt you can swap quantization and get the same motion. There is some semblance of control if you use start/end frames, but even that doesn't guarantee reproducibility.
>>
>>106834696
>>106834711
Damn, guess I'll stick to 3-4min gens.
>>
File: file.png (1.27 MB, 1280x720)
1.27 MB
1.27 MB PNG
Diella lora when.
>>
>>106834689
i'm afraid without e621 you can forget about complex porn positions. and i don't want another model that's only capable of standing and pov cowgirl.
>>
File: catbox_mzdun7.png (1.72 MB, 1040x1520)
1.72 MB
1.72 MB PNG
Beginner here, using SwarmUI. Trying to replicate civitai images and it always comes out slightly wrong.

https://civitai.com/images/105020580 as a random example.

It always has this weird yellow tint, it's blurrier even with me using remacri as an upscaler, and it's less detailed. The detail and blur I guess I can solve by upping the refiner upscale, but I can't get rid of that weird yellow tint.
>>
File: WanVideo2_2_I2V_00518.webm (677 KB, 1488x848)
677 KB
677 KB WEBM
>>
>>106834745
What's an example of a position Noob can do that Illustrious can't? But if we're talking about which model has the easiest path to noob-but-better while inference still being within the reach of the majority, I can't think of another besides Yume.
>>
>>106834443
oh shiiii
>>
File: WanVideo2_2_I2V_00519.webm (1.78 MB, 848x1488)
1.78 MB
1.78 MB WEBM
>>
cozy
>>
File: WanVideo2_2_I2V_00520.webm (1.25 MB, 1488x848)
1.25 MB
1.25 MB WEBM
>>
File: ComfyUI_0003-urd.png (2.4 MB, 1728x1296)
2.4 MB
2.4 MB PNG
>>106832615
Model isn't quite as good for loraless Urd as it is for Bell.
>>
How come China is so based

https://files.catbox.moe/ztpndq.mp4
>>
File: ComfyUI_temp_auxpg_00001_.png (2.83 MB, 1728x1296)
2.83 MB
2.83 MB PNG
>>106835058
>>
Is this node banned or something? It won't download and other people report it missing as well but the links to those pages are fucking deleted.
>>
File: ComfyUI_temp_irfic_00001_.png (3.17 MB, 1296x1728)
3.17 MB
3.17 MB PNG
>>106835116
>>
Is a gpu with 16gb of vram much faster than one with 12 when using wan?
>>
File: radiance.png (3.13 MB, 848x1488)
3.13 MB
3.13 MB PNG
>>106835115
getting wan 2.2 and animate was extremely nice, yea
>>
>>106835171
Extremely cool texture.
>>
File: radiance.png (2.66 MB, 848x1488)
2.66 MB
2.66 MB PNG
>>106835157
depends on the exact settings, but usually not a huge amount. it's more noticeable for the 4090 and 5090 because they have both more RAM and quite a lot more processing power.

same could again be said for H200 presumably but no one here seems to have that.
>>
>>106835132
There were and there always will be nodes that fail to install via Manager. Use git clone manually.
>>
>>106835132
the official seedvr2 node is garbage, you gotta install from here:
https://github.com/AInVFX/ComfyUI-SeedVR2_VideoUpscaler/tree/nightly

youre welcome
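
The manual install suggested in the replies above (git clone into ComfyUI's `custom_nodes` folder, here from the nightly branch of the linked repo) could be scripted roughly like this. It is a sketch, not the Manager's own logic: the helper name and the install path are made up, and many node packs also need their `requirements.txt` installed afterwards, which this does not handle.

```python
from pathlib import Path
import subprocess


def clone_command(repo_url, comfy_dir, branch=None):
    """Build the git command that clones a node pack into custom_nodes/."""
    # Folder name = last path segment of the URL, minus an optional ".git".
    name = repo_url.rstrip("/").removesuffix(".git").rsplit("/", 1)[-1]
    target = Path(comfy_dir) / "custom_nodes" / name
    cmd = ["git", "clone"]
    if branch:
        cmd += ["--branch", branch]
    cmd += [repo_url, str(target)]
    return cmd


def install_node(repo_url, comfy_dir, branch=None):
    """Run the clone; raises CalledProcessError if git fails."""
    subprocess.run(clone_command(repo_url, comfy_dir, branch), check=True)
```

For the repo linked above that would be `install_node("https://github.com/AInVFX/ComfyUI-SeedVR2_VideoUpscaler", "/path/to/ComfyUI", branch="nightly")`, followed by a ComfyUI restart.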
>>
can I use qwen lora on qwen edit?
>>
>>106834016
>>106833582
lora for this feel?
>>
With large enough batch, you can even use nvme ssd on ramtorch.
>>
>>106834745
>i'm afraid without e621 you can forget about complex porn positions
Bbbut... Don't animals have different porn positions than humans, including those reverse joints/elongated feet, somesuch. Won't it mess up concepts?
>>
File: WanVideo2_2_I2V_00521.webm (2.35 MB, 848x1488)
2.35 MB
2.35 MB WEBM
omg migu?!
>>
>>106835115
Any thoughts on how to smooth out the tremor in the DW "bones"?
>>
>>106834745
idk about yume but neta lumina did have e621 in its dataset
>>
>>106835658
I can't believe she feels empowered
>>
Realistic models for i2i refinement after qwen edit?
>>
>>106835058
>>106835116
>>106835135
very nice
>>
>>106835796
maybe you can use a flux finetune but even then I'm not sure you won't lose more than you gain unless you limit it to masked/segmented areas.
>>
>>106832401
Sometimes colors work; other times the name doesn't get recognized and gives you different colors each time.

Something like coral gives you actual corals sometimes. Pantone could work but I'm not sure, has no one tested this?
>>
holy fuck v7 is worse than v6
how did he manage to fuck it up that badly
aside from removing artists, using auraflow
>>
>>106836350
>aside from removing artists
Removing artist tags and replacing with a random number has to be one of the most retarded things a hobbyist could do. Even novelai has artists, and it's a fucking company, they just don't ever talk about it.
At this point, I truly believe Pony being good was just pure cosmic coincidence.
>>
File: 1744544534389429.png (11 KB, 1435x59)
11 KB
11 KB PNG
it is finally time to download all wan loras I've seen since last week
I expect half of them to be banned
>>
>>106833514
*checks collage*
I like it a lot, gj u guise
>>
Bros, my comfyui venv folder disappeared, help.
>>
>>106836688
rip
>>
>>106836740
But it's still working..
>>
the prior few weeks of diffusion has been so busy with new stuff that the last few days have felt like a drag.
where is qween 2510?
>>
File: file.png (818 KB, 1135x773)
818 KB
818 KB PNG
>>
What model do you guys use for image-to-video?
>>
>>106837017
what do you mean? there is only one model for that.
>>
>>106837017
japanese barking dog
>>
File: ComfyUI_02891_.png (1.74 MB, 1024x1024)
1.74 MB
1.74 MB PNG
>>
File: ComfyUI_02899_.png (1.61 MB, 1024x1024)
1.61 MB
1.61 MB PNG
>>
>>106836350
It's more retarded how it's being released like it's a huge improvement. The gens are so blatantly bad that he should've just been upfront about it being shit.
>>
File: Face.png (403 KB, 600x900)
403 KB
403 KB PNG
What realism model produces this face phenotype?
>>
To the anon wondering last thread, I report and block trash like this.
Why even upload this? It's a shitty quality image, gibberish, bad artstyle. Waste of drive space.
>>
>>106837168
Anon, pretty sure that looks like a child...
>>
>>106837214
and?
>>
why does she do it?
>>
File: sisyphus.png (12 KB, 1729x73)
12 KB
12 KB PNG
>>
File: 00013-2026355533.png (2.62 MB, 1824x1248)
2.62 MB
2.62 MB PNG
>>
File: 00000-50913402.png (1.76 MB, 1024x1024)
1.76 MB
1.76 MB PNG
>>
File: 00022-2930050790.png (2.73 MB, 1248x1824)
2.73 MB
2.73 MB PNG
>>
File: ComfyUI_05648_.png (653 KB, 840x1240)
653 KB
653 KB PNG
>>
File: 00001-1903339760.png (2.34 MB, 1024x1024)
2.34 MB
2.34 MB PNG
>>
File: qwen_fluxkrea__0016.jpg (609 KB, 3328x1216)
609 KB
609 KB JPG
>>106835796
depends on your aesthetics but i did some tests with qwen (t2i) and a flux krea detail pass
>>
File: wan22___0001.png (1.49 MB, 832x1216)
1.49 MB
1.49 MB PNG
kinda tired of civit being fucked all the time, is there a better place to dump all my gens
>>
Hello from /adt/. Our baker is unavailable right now, so we're asking the neighboring thread for help. Could someone bake our thread?
>>
>>106837168
cute, looks like some kind of mix between 75% realistic and 25% anime
>>
>>106837537
Cool I like it
>>
>>106837577
Recycle bin
>>
File: videoframe_4984.png (2.78 MB, 1188x1740)
2.78 MB
2.78 MB PNG
>>106833514
Please can someone help, I've trained over 20,000 lora models for previous generations and am settling on converting my catalog to Chroma1-HD.

For efficiency/ hardware limits I need to reduce vram/ram to 12gb ram, 15gb vram during lora training.

I'm trying to get Chroma1-HD ggufs/fp8 scaled to work but most trainers fail so far.

I read that fp8 unscaled could work... does anyone have this model? Trying to create it in python overloads my system.

Thanks, all my models are open source also and I'm rank top 100 in the world for lora.

Does anyone have kohya_ss settings/presets for chroma too? Even locally I can't get it to work.

Can GGUF ever be used for lora training?
>>
>we're not getting a nunchaku chroma
>we're not getting any finetunes of chroma
>lodestone abandoned chroma to help ponyfag make another danbooru abortion model instead of going the extra mile and unfucking chroma
i am so fucking tired of pony and of anime models in general man. the idea of chroma was at least nice because it was capable of doing other things but the furry fucked it up and now he's abandoning it. five million more years of sdxl i guess.
>>
>>106837664
you have to train on the full model
>>
>>106837691
WHERE THE FUCK IS NUNCHAKU WAN
WHERE THE FUCK IS NUNCHAKU LORA SUPPORT
WHERE THE FUCK ARE LIGHTX2V WAN I2V LORAS
>>
>>106837664
This just came out low VRAM king https://xcancel.com/ostrisai/status/1975642220960072047

https://github.com/ostris/ai-toolkit/
>>
>>106837716
sorry anon they're busy working on nunchaku pony v7
>>
>>106837664
from what i understand lora training is best done with base models and may be near impossible with GGUF. even if it is done with GGUF it wouldn't be very good.

i have trained lora with fp8 models before but they are prone to model degeneration when used at regular strengths
>>
can u guys btfo grok
>>
>>106837716
>nunchaku team constantly changing their goal and focusing on whatever new shiny model
>lightx team delivering wan t2v v2, promising i2v v2, then immediately focusing on qwen instead

adhd devs, they literally can't focus on one thing
>>
Does Chroma radiance still take twice as long to gen with as normal Chroma or was that issue fixed?
>>
>>106837717
this looks promising... he says he's implemented it but it's not on the main or dev branch...? is this only for Qwen?

>>106837707
iirc most sdxl lora were done at lower fp etc
>>
>>
>>106835135
Based Megami-sama boomer
>>
>>106837733
The lora are some kind of approximation/compression anyway, and some of the "detail" can be regained by running the full model at inference time.

I hope.
>>
File: 00004-364763086.jpg (1.5 MB, 1536x1920)
1.5 MB
1.5 MB JPG
>>
>>106837050
heh
>>
>>106837168
Qwux
>>
>>106837168
literally all pony shitmixes. it's total slop and disgusting.

>>106837214
did that make your dick twitch? kys
>>
>>106837664
> 've trained over 20,000 lora models for previous generations

why start with such an obvious lie?
you have no idea what you are doing, gtfo
>>
>>106837310
That's a myth.
>>
File: wan22___0012.png (1.55 MB, 832x1216)
1.55 MB
1.55 MB PNG
>>106837629
>>
>>106837892
shitpost about qwen + flux?
>>
>>106837691
It is a shame, now that I finally figured out how to get photorealistic, non-slopped outputs I'm quite happy with it, but it isn't exactly plug and play so I feel like a lot of people gave up on chroma early. What's worked for me
>hyper specific negative prompt stolen from a good set of gens on civitai
>hyper specific positive "style" prompt concat'd with main prompt, again stolen from those same gens on civitai
>clownsharksampler + chain samplers from the same pack starting with res_5s or higher for the first few steps , then decreasing to res_3s, for the next several steps, and so on, and finishing with multistep 2m or 3m for about or a little less than half of the total number of steps, all using beta57 and bongmath.
About 45-55 seconds on 4090 for base gen (no upscale).
>>
>>106837908

You have no idea...

Also please help, I just got it to load most of the fp8 scaled model but then OOM, maybe I need just a little more system ram reduction in ai-toolkit. with chroma fp8 scaled.
>>
>>106837691
Chroma can't be fixed without starting from scratch
>>
>>106837966
I'm using 48GB RAM when loading the full model
>>
If the world were clear, AI would not exist.
>>
>>106837986
what vram have you got....? I'm fine tuning on the full model and my ram for the python process is 2,600 MB....
>>
>>106838018
Not the anon, but finetuning or training a lora?
>>
File: file.png (2.84 MB, 1024x1536)
2.84 MB
2.84 MB PNG
>add convenient censoring tag
>out of 10 gens none of them had it
I guess the natural prompt is taking precedence.
>>
>>106838018
12GB vram, 48GB ram. But I use onetrainer. I haven't managed to run it in Ostris' ai-toolkit because it oomed even with the low vram mode.
>>
>>106838028
training a lora, with ai-toolkit
>>
>>106833689
>>106833764
I fucking hate living in Bongistan so much
>>
>>106835115
>How come China is so based
they're not based, their models produce plastic slop, there's no IP in there, they have no balls and they have shit taste
>>
SDXL will reign eternal.
>>
>>106837962
all that bullshit and you get a result that you can do with sdxl + lora and an upscale
>>
how do we fix wan?
>>
>>106838070
This but SD1
>>
>>106838074
same as how you fix all the other models that need fixing: find a millionaire to fund your finetuning
>>
>spend more time intentionally crippling model than making it good
>model is not good
>How could this possibly be happening to me?!
I hope IL makes Ponyfag seethe eternally.
>>
File: wan22___0018.png (1.37 MB, 832x1216)
1.37 MB
1.37 MB PNG
>>106838074
>>
For those having 200gb of memory, you can now run HunyuanSlop 3.0 through ComfyUi! LFG!
https://www.reddit.com/r/StableDiffusion/comments/1o1z71x/hunyuan_30_available_in_comfyui_through_custom/
>>
https://civitai.com/articles/19986
>Previously, we were afraid it would affect the model's style too much without better style control, but our research in style clusters helped alleviate this issue. We'll continue increasing synthetic content, including our own generation loops, to improve character recognition and especially style blending.
>We'll continue increasing synthetic content
is this retard acually serious?
>>
>>106837664
could something like this work anyone?

Training Specific Layers
To train specific layers with LoRA, you can use the only_if_contains network kwargs. For instance, if you want to train only the 2 layers used by The Last Ben, mentioned in this post, you can adjust your network kwargs like so:

network:
  type: "lora"
  linear: 128
  linear_alpha: 128
  network_kwargs:
    only_if_contains:
      - "transformer.single_transformer_blocks.7.proj_out"
      - "transformer.single_transformer_blocks.20.proj_out"

from ai-toolkit
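
To make the quoted option concrete, here is a small sketch of what a substring filter like `only_if_contains` presumably does, going by the name and the quoted description: a module gets a LoRA adapter only if its name contains one of the listed strings. This is an illustration, not ai-toolkit's actual code, and the module names in the usage note are hypothetical.

```python
def select_modules(module_names, only_if_contains):
    """Keep only module names that contain at least one of the given substrings."""
    return [
        name
        for name in module_names
        if any(key in name for key in only_if_contains)
    ]
```

Given names such as `transformer.single_transformer_blocks.7.proj_out` and `transformer.single_transformer_blocks.8.proj_out`, the two filter strings from the config keep the block 7 and block 20 projections and skip everything else. Fewer trained layers means fewer adapter weights and optimizer states, but whether that is enough to fit the memory budget asked about is untested here.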
>>
what dataset do I need to create porn lora for wan2.2?

how large videos...? how long? how many? etc
>>
>>106838148
>how large videos...?
saar do not redeem the booba and vagene!
>>
>>106838160
i fuck
>>
>>106837664
>Can GGUF ever be used for lora training?
can it? i dunno.
has it? no.
>>
>>106838148
i was able to train a titty drop lora with 13 x 5s clips


