/g/ - Technology

Dynamic Network Freegate Edition

Discussion of Free and Open Source Diffusion Models

Prev: >>107889954

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Flux Klein
https://huggingface.co/collections/black-forest-labs/flux2

>WanX
https://github.com/Wan-Video/Wan2.2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>NetaYume
https://huggingface.co/duongve/NetaYume-Lumina-Image-2.0
https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
this is the real and correct continuation of the /ldg/ general
>>
based maintainer of thread quality
>>
File: 1737563391481000.mp4 (1.61 MB, 1312x768)
>>107892557
Need to fix collage script but in the meantime this was supposed to be included
>>
Blessed thread of frenship
>>
alt collage
>>
File: 1747177936798224.mp4 (735 KB, 1180x572)
>>107892589
And this

>>107891027
>>107891808
>>
File: 1767939446138031.png (8 KB, 255x110)
>>
flex klein 4b is so good. i hope we get at least one noob tier finetune out of it.
>>
File: ComfyUI_00048_.png (1.13 MB, 1024x1024)
tehe
>>
File: chinese culture.jpg (230 KB, 1280x720)
https://www.youtube.com/watch?v=uHQZNNajxOk
>>
In Wan2.2, is there a way to fix wan mouth flapping on anime characters? NAG isn't consistent.
>>
>>107892614
would rather a bigger model get trained since theres a bunch of copes to run them even without a lot of vram now
>>
File: klein.png (17 KB, 560x87)
>>107892614
>flex klein 4b is so good. i hope we get at least one noob tier finetune out of it.
sd3.5m tier
>>
File: zimg_00335_.png (2.52 MB, 1248x1872)
thanks mods
>>
>>107892667
Is this supposed to mean something?
>>
>>107892667
Are you retarded or something?
>>
What's the deal with manchild japanese cartoon addicts who have 0 creativity and absolutely need to use their beloved copyrighted characters and styles?
>>
>>107892702
benchod
>>
>>107892708
kurwa
>>
File: 1766324990700334.jpg (3.99 MB, 4800x6912)
>>
>>107892613
but big black is a punk band anon https://www.youtube.com/watch?v=jtPFzBLDSPk
>>
>>107892680
They will continue finetuning lumina 2.0 instead.
>>
>>107892641
>>107892667
you need to compare it with XL. once it's finetuned it will be great; we had no good-quality models in the 4b ~ 6b range before. the reason bigger models won't work is the cost to train them: it always ends up with a bunch of bs settings to cope with the cost of the hardware needed to create a free model. i would rather press F to dance on XL's grave than spend another year coping about that supposed bigger model finetune.
>>
>>107892738
what's experimental about that?
>>
File: 00107-1850405957.png (812 KB, 1024x1024)
>>
>>107892754
it's just how i organize my gens
>>
File: 1768674024287186.png (1.13 MB, 1024x1024)
>>107892557
Whoever posted this, thank you. Gorgeous.
>>
>>107892831
my wife
>>
Even the klein base has terrible variance in seeds. That's why it looks so good I guess?
>>
Yeah I suggest we all just move to /bant/ for a week or two until they get bored.
>>
File: 1767414986901821.jpg (1.83 MB, 3112x4914)
>>107892757
>>
should i throw 100 grand at finetuning flux2 4b or z-image-turbo or wait for future z-image-copium releases etc
>>
File: 00114-3250353421.png (1.43 MB, 1024x1024)
>>107892850
>>
File: 1742435947470836.jpg (1.3 MB, 3112x2178)
>>107892831
>>
>>107892850
Neat thanks
>>
>>107892880
The 9b model is great for removing watermarks, logos etc. I put it through a batch of 275 anime illustrations and only had to manually edit 3 (!!!) images afterwards
>>
File: ComfyUI_00032_.png (1.65 MB, 1024x1024)
>>
>>107892980
the flux vae is pretty good, but your images are still getting fucked by it, i would be cautious if you want to use them for training
>>
How do I prompt for Z? I want new camera angles and shit but majority of the time it's just the default one or like if i use "off angle" it shifts it slightly but using prompts that are obvious like birds eye view or other stuff does nothing. I rarely truly get the actions I want through my prompts even when explaining them pretty good.
>>
>>107892995
ai slop
>>
If I merge WAI 16 with a Chris Chan lora at low weight, will i have the same quality as NoobAI?
>>
>>107893039
maybe
>>
>>107893010
A trick is to have whichever LLM you prefer to caption an image with the angle you want and use that in your prompt.
>>
>>107892998
I didn't see any degradation, some got slight zoom effect. But it's good to keep in mind, you are right.
>>
Just a bit of banter :]
>>
>this other guy is so obsessed with me
>proceed to post 80 posts about him
yep another day another melty
>>
If I download ggufs, do I need to grab clip/text encoder files from the main versions for best results anyway?
It's my first day
>>
>>107893126
Ye
>>
what no z image base does to a general
>>
>>107893160
there's still a lot to explore with turbo
>>
>>107893183
i mostly care about the fine tunes/loras, after the disappointment of v7 there has been nothing exciting in that segment
>>
has anyone tried training flux2 klein loras so far? for example with this PR https://github.com/Nerogar/OneTrainer/pull/1261
>>
>>107893055

What is an easy to use free LLM? How do I even get/use them I am a brainlet at this stuff clearly.
>>
>>107893183
box?
>>
>>107893252
i use this https://github.com/comfyanon/llm-utilities
>>
>>107893259
https://files.catbox.moe/winvi8.png
>>
>>107893272
thanks
>>
File: 1748718086085430.jpg (1.95 MB, 3112x2178)
>>107892869
>>
>>107893272
preciate it anon
>>
yes posting such images is totally normal and not schizo behavior
>>
I propose this solution to the schizo problem: All schizos fight to the death, and the survivor then gets tortured to death
>>
File: img_00101_.jpg (523 KB, 1520x1728)
>>107893272
doing the bamboozling
>>
>>107893341
proof?
>>
File: 1girl_00096_edit.jpg (501 KB, 1216x832)
its ok guys its safe to post
>>
File: chrome_mLNQMNtc7V.png (32 KB, 1049x460)
>>107893272
>>107893305
lmao kekd u got me
>>
>>107893288
You are welcome.
>>
File: Flux2-Klein_00037_.png (1.71 MB, 1136x912)
holy schizo nuke, based mods
>>
what causes a person to do this for more than a year
>>
>>107893393
reaL?
>>
File: Flein_00278_.png (2.3 MB, 1152x1440)
finally, my paradise
>>
>thread blesser anon is not one of the schizos
based and frenship pilled
>>
>>107893411
their faces were swapped with flux klein
>>
File: 1755905518006501.jpg (178 KB, 2048x1186)
>klein demolished in a single comparison image
just as reminder the current year is now 2026
>>
File: img_00117_.jpg (483 KB, 1520x1728)
>>
qwen 8b Q8 working good with klein. Any sense in looking for a fp16/bf16/whatever? How does it work with these LLM's?
>>
>>107893412
switch, top, bottom
>>
File: Flein_00281_.png (2.21 MB, 1152x1440)
if god were real he'd make mondo girls real
>>
>>107893436
what are you comparing here?
>>
>>107893128
Thanks
>>
>>107893461
Left: z-image
Right: Klein
>>
>>107893469
oh okay. can you post an edit comparison between the two?
>>
File: 1759660611794961.jpg (1.79 MB, 3448x1954)
>>107893393
>>
File: ls4.png (2.4 MB, 2048x3720)
>if god were real he'd make mondo girls real
>>
>>107892557
thx 4 bake
>>
>>107892602
ty 4 2nd fagollage
>>
What do I have to prompt to get sharp backgrounds in ZIT? I tried describing the background as sharp, crisp, shot with high depth of field etc. but get blurry backgrounds most of the time.
>>
File: Flein_00284_.png (2.3 MB, 1152x1440)
>>107893519
it's better than setting a bush on fire, any jackass with a bic can do that
>>
File: 1767569632625656.jpg (3.43 MB, 6184x7570)
>>107893519
>>
>>107893539
use nag and put 'blurry background'
>>
>>107893547
nag?
>>
File: 1738274979737068.png (979 KB, 1136x896)
try 8 steps with 9b klein distilled, can fix any issue with text/give more detail. it's already super fast anyways (8 steps was 15s)
>>
>>107892602
based alert
>>
File: 1756667763158427.jpg (2.3 MB, 3496x3010)
>>107893543
>>
>>107893545
NIGGA THAT'S CUTE!
>>
File: 1767452167809402.png (963 KB, 1136x896)
>>107893562
change the text at the bottom to "I'm in a heckin reddit game, rick!". the man is holding a bag of "onions chips"
>>
File: 1766896378771685.jpg (870 KB, 3496x1922)
>>107893562
>>
File: img_00133_.jpg (543 KB, 1520x1728)
>>
>>107893581
i dont see any onion chips, FAIL
>>
File: 1759007130682918.jpg (1.75 MB, 4600x3586)
>>107893590
>>
File: 1742983909153620.jpg (2.09 MB, 3496x3010)
>>107893459
>>
is it just me or are 90% of the Z loras on civitai total garbage
>>
File: Flux2-KleinEdit_00041_.png (2.11 MB, 2509x840)
Colorize the black and white photo
>>
uh oh
>>
>>107893627
duh
no base
>>
>>107893633
is that a kid on the right?
>>
File: 1753988423434039.jpg (2.22 MB, 4600x3586)
>>107893440
>>
>>107893635
yeah but I've tried a few test loras and the results looked at least halfway decent, the example images on most Z loras look like they were done with early XL shitmixes, you have to wrangle Z to do shit this bad
>>
>>107893627
98% of loras on civitai are shit which means z is actually doing well comparatively
>>
File: 1747143228452594.jpg (2.03 MB, 3496x3010)
>>107893412
>>
File: 1744551669522595.png (1.25 MB, 736x1392)
the anime girl with white hair is wearing a white hoodie with a picture of hatsune miku on it, blue jeans, and white sneakers. keep her head the same and blindfold the same. she is holding a bottle of water instead of a tea cup. change the background to a sunny beach.
>>
>>107893638
idk much about history, i think that photo is world war 2 and not sure what the conscription age was
>>
>>107893667
kek
>>
>>107893667
klein is racist
>>
File: Flux2-KleinEdit_00042_.png (2.48 MB, 1806x1153)
one more
>>
>>107893687
Who is this old geezer?
>>
File: 1744037741784888.png (972 KB, 736x1392)
>>107893668
make a low polygon 3d render of the anime girl in the style of a playstation 1 game.

2B version 0.1:
>>
File: 1749043392249359.jpg (1.06 MB, 3736x1794)
>>107893379
>>
>>107893689
isaac newton
>>
>>107893689
i think he invented physics or some shit
>>
>>107893705
before that people would just float?
>>
>>107893689
thats me
>>
>>107893689
it's me
>>
File: 1737675544670581.jpg (999 KB, 2248x2946)
>>107893668
>>
File: 1764637111570295.png (960 KB, 896x1152)
make a low polygon 3d render of the anime girl in the style of a playstation game. keep her black blindfold the same.

low poly thighs: also I think 8 steps is the way to go, generally better results and it's already fast as a model. 15s vs 10s or so.
>>
File: hp9.png (3.1 MB, 3568x2560)
>idk much about history, i think that photo is world war 2 and not sure what the conscription age was
>>
>>107893689
it's him >>107893715
>>
File: 1766670741408997.jpg (1.8 MB, 4600x3586)
>>107893353
>>
File: 1739777243646873.png (990 KB, 896x1152)
>>107893725
make the anime girl in a pixel art style like a nintendo game. keep her black blindfold, white hair, and dress color the same.
>>
no Z-Base
BFL won
chinese century cancelled
>>
>>107893760
now make it an anime figure with a bikini outfit, same pose and sword and pod droid but black lace sling bikini
>>
File: ComfyUI_00128_.png (1.65 MB, 848x1216)
>>
File: Hitler.jpg (22 KB, 474x624)
>>107893687
>"Hmm, my senses are tingling"
>>
File: 1737824616712606.jpg (775 KB, 2728x2434)
>>107893725
>>
File: img_00148_.jpg (456 KB, 1520x1728)
>>107893739
I liked stuff like "Remove sepia filter. Improve image with flat anime colors. Improve shading with chromatic aberration."
>>
Why is it almost impossible to make a petite girl in z-image without going full degen? Like petite girl small breasts is fucking impossible.
>>
>>107893804
just use the lora
>>
File: 1754603594288373.jpg (782 KB, 2680x2466)
>>107893786
>>
>>107893590
This is the gayest shit I’ve seen here, im fucking puking rainbows.
>>
So as someone new with edit models, whats the best practice for prompting.

Say I'm just doing faceswaps. Do I reference the original picture? Is there a preference for picture order? Does it prioritize things like resolution or anything? Having success with some prompts but issues with others, and I feel like it boils down to how some things are easier to put into words than others. Thoughts?
>>
>>107893827
prompt?
>>
>update diffusers
>cant import Flux2KleinPipeline
huh?
>>
File: ly2.png (2.63 MB, 2560x3288)
>make her into my wife pretty please
>>
File: 1751064563253851.jpg (1.9 MB, 2584x2562)
>>107893834
>>107893783
Just time stuff and pull gacha
>>
File: 1758238396536745.png (893 KB, 896x1152)
>>107893782
make a plastic anime figure of the white hair anime girl, remove the black dress and change her clothing to a black bikini. keep her black blindfold, and white hair the same.
>>
File: Flein_00308_.png (1.04 MB, 1488x1600)
>>107893827
>>
>>107893847
very cool
>>
>>107893837
oh wait its not officially in there yet
>pip install git+https://github.com/huggingface/diffusers.git
werks
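
For anyone scripting around this: a small check for whether your installed build already exposes a given class, so the script can bail with a useful message instead of an ImportError. Only a sketch. `has_symbol` is a made-up helper, and the class name `Flux2KleinPipeline` is taken from the post above, not verified against any release.

```python
import importlib


def has_symbol(module_name: str, symbol: str) -> bool:
    """Return True if `module_name` imports cleanly and exposes `symbol`."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        # Package not installed (or broken install): treat as "not available".
        return False
    return hasattr(module, symbol)


if __name__ == "__main__":
    name = "Flux2KleinPipeline"  # name as written in the post, unverified
    if has_symbol("diffusers", name):
        print(f"{name} is available")
    else:
        print(f"{name} missing; try the git install from the post")
```

If the check fails after a normal `pip install -U diffusers`, that matches the experience above: the class may only exist on the development branch.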
>>
File: 1759104323335939.jpg (3.47 MB, 7720x6706)
>>107893840
>>
File: 1740194383294639.png (2.56 MB, 1632x928)
>>
File: 1745949177175852.png (812 KB, 1088x944)
the man is holding a black and white polaroid picture of hatsune miku with one hand, and is pointing to it with the other.

check em
>>
>>107893847
cover it in egg white
>>
File: 1754134087627842.png (2.64 MB, 1600x960)
>>
File: 1741256015614226.jpg (2.08 MB, 4600x3586)
>>107893791
>>
>>107893791
>improve
>anime colors
you can only choose one, anon
>>
File: 1757347137083308.png (1.08 MB, 1088x944)
>>107893876
this is a job for the 2 image gen though:

the man is holding a black and white polaroid picture of the girl in image 2 with one hand, and is pointing to it with the other.
>>
do weez got 9b q8 yet fp8 is booty
>>
>>107893909
fp8 is still good, but yes

https://huggingface.co/unsloth/FLUX.2-klein-9B-GGUF
>>
File: 1742562078185330.png (2.59 MB, 1632x928)
>>
File: ComfyUI_00146_.png (1.73 MB, 1376x752)
>>
>>107893938
prompt: make it into slop
>>
File: 1765563501288521.jpg (1.39 MB, 4936x1986)
>>107893875
>>
File: img_00161_.jpg (518 KB, 1720x1147)
>>107893827
>>
>>107893729
oh hey what's up dude. i remember getting vacations with you like two years ago here.
>>
>>107893996
why she crang?
>>
Want to install reforge
Should I install the latest Python or should I install the 3.7 like the github says?
>>
File: 1741527007430668.jpg (1.4 MB, 4540x3330)
>>107893827
>>
File: ComfyUI_00152_.png (1.02 MB, 1024x1024)
>>107893964
yes
>>
File: 1754857821672773.png (1 MB, 1088x944)
replace the head of the man in image 1 with the man in image 2 in the same proportions. make the image black and white.

so BFL is fine with face swaps now I guess (even though you can do it with reactor anyway)
>>
>>107894036
they're probably fine with it cause it looks like a bad photoshop
>>
File: 1754699174499411.png (1.1 MB, 1088x944)
>>107894036
todd:
>>
File: 1741032103427354.jpg (1.8 MB, 4168x1634)
>>107893938
>>
File: 1742724234524339.png (1.24 MB, 784x1312)
>>107894048
replace the head of the man wearing a blue shirt in image 1 with the man in image 2. replace the black man on the floor with a cardboard box that says "SKYRIM".
>>
File: test.png (1.59 MB, 1024x1305)
Using the not-yet-merged klein branch of stable-diffusion.cpp with Flux2 klein 4B
When image editing, it OOMs rather quickly at Q8_0 on 12GB VRAM, so I'm limited to very low resolutions. The editing capabilities are great but the output quality tends to be poor. I assume that's down to the low resolution and being the 4B model?
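
Rough arithmetic on where the 12GB goes, as a sanity check. GGML's Q8_0 stores weights in blocks of 32 int8 values plus one fp16 scale, so 34 bytes per 32 weights (8.5 bits per weight). The sketch below assumes "4B" means exactly 4e9 parameters and that every tensor is Q8_0, which real files usually aren't, so treat the number as a ballpark.

```python
def q8_0_bytes(n_params: int) -> int:
    """Approximate size in bytes of n_params weights stored as GGML Q8_0."""
    block_weights = 32      # weights per Q8_0 block
    block_bytes = 32 + 2    # 32 int8 values + one fp16 scale
    n_blocks = -(-n_params // block_weights)  # ceiling division
    return n_blocks * block_bytes


params = 4_000_000_000  # assumption: "4B" taken as exactly 4e9 parameters
weights_gb = q8_0_bytes(params) / 1e9
print(f"Q8_0 weights: ~{weights_gb:.2f} GB")
print(f"left on a 12 GB card: ~{12 - weights_gb:.2f} GB")
```

By this estimate the diffusion weights are only about a third of the card, which suggests the OOM comes from everything else that has to fit (text encoder, VAE, activations at the chosen resolution) and not from the 4B weights themselves.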
>>
>>107894062
yeah the other anon wasn't kidding about bad photoshop lmao
>>
File: 1747992478542082.jpg (2.08 MB, 5200x2424)
>>107893996
>>
File: 1756691895463329.png (1.24 MB, 784x1312)
>>107894070
the fent man image is potato quality, I swear these fucking news orgs cant put up a single good image of it, despite how much they cried about it.

regardless this is for memes not high art
>>
>>107894087
who
>>
File: 1763471721443132.jpg (971 KB, 2392x2754)
>>107894062
>>
File: 1767197375670258.png (1.42 MB, 896x1152)
replace the head of the man in image 1 with the man in image 2.

kek, drive but blade runner:
>>
>>107894105
You were not born back then when Bateman was a thing.
>>
File: 1737684686772799.jpg (1.44 MB, 4936x1986)
>>107893930
>>
File: 1762779490471660.png (1.25 MB, 1440x720)
>>107894122
see, high quality source yields a better output:
>>
>>107894128
i was watching dark knight in the cinemas, zoomer
>>
File: fs8.webm (3.7 MB, 1024x1024)
>>107893999
good times. I started genning again like a month ago.
>>
File: 1756741946989563.jpg (1.73 MB, 3400x3746)
>>107893883
>>
>>107893814
Chadolf Hitler.
>>
>flux-2-klein-9b-fp8

>t2i
>CFG 1
>5s/it

>single input imgedit
>1.5s/it

wat
>>
>>107894067
Which is the base resolution?
>>
>>107894154
4 and 6 are the best alt Lauras
>>
>>107894160
retard
>>
klein training when?
>>
>>107893885
model / lora?
>>
>>107894172
zit no lora (loras are for faggots)
>>
File: 1763745830344306.png (1.42 MB, 896x1152)
literally Miku
>>
>>107894144
Ok.
>>
File: 1742143993673448.png (2.44 MB, 912x1136)
>>
>>107894169
when kohya is done adding support for qwen layered for some goddamn reason
what does that model even need loras or tuning for?
>>
>>107894067
I couldn't find the recommended resolution for 4B on the huggingface or their blogpost.
If I don't specify a resolution when editing, it defaults to 512x512.
>>
>>107894179
thanks, i should have guessed a chink model would know xianxia
>>
>>107894167
please help me not be a retard
>>
>>107894206
ok
>>
File: lol.png (1.44 MB, 1024x768)
>>107894196
was for >>107894162

also lol Barbie anatomy (it could have just put panties there, I didn't specify otherwise.)
>>
>>107894192
>what does that model even need loras or tuning for?
penus and vagooper
>>
File: 1766078545455204.jpg (1.6 MB, 2776x2402)
>>107894189
>>
whoever you are, you told me that flux was always fine with boob, but I'm not gettin flux2dev to boob the same pictures I boobed with klein
hang your head in shame
>>
>>107892557
ACEStep 1.5 got me thinking about a possible way one could leverage a dataset of music without copying the artist's voice, and that's where music cover solutions come in handy. Does anyone know if RVC is still SOTA? Perhaps using https://github.com/Mangio621/Mangio-RVC-Fork
wouldn't be too bad. I also know ACEStep can do music in a reference style, but I assume tunes or LoRAs are more flexible and lead to better quality.
>>
>>107894256
damn that's crazy
>>
>>107893883
catbox?
>>
File: 1765631292139296.jpg (866 KB, 2720x1168)
almost but still so far
>>
>>107894267
here you go
https://files.catbox.moe/8jqkb7.png
>>
File: 1757805784591906.jpg (1.24 MB, 3544x1890)
>>107894215
>>
Is it just my shitty loras or does ZIT still have some difficulties with maintaining likeness when the character is further away? I've tried some celeb loras and the likeness is spot on from portraits up to cowboy shots, but the further away the character is, the more generic the facial features seem to become. There's still some resemblance but not nearly as close as in closer views.
>>
>>107894280
cheeky
>>
>>107894226
Slopped as it may be, Qwen is most accurate to the model's face. That's why a 2 pass, one with Qwen then next with Flux is better.
>>
>>107894308
tldr retard
>>
>>107894330
nigger
>>
>>107894336
useless
>>
>>107894345
faggot
>>
File: 1757784443391257.mp4 (3.78 MB, 2048x1228)
>>
>>107894267
here is real version

https://files.catbox.moe/uny7m8.png
https://files.catbox.moe/89wfqk.png
https://files.catbox.moe/y7hx60.jpg
https://files.catbox.moe/mglace.png
https://files.catbox.moe/yi8fvb.png
>>
File: Flux2Img_00019_.png (2.47 MB, 1152x1440)
>>107893412
dev got that... flux look
>>
File: 1761146008226286.png (39 KB, 800x256)
me too haha...
>>
>>107893725
>>107893760
>>107893847
9B or 4B?
>>
File: 1743248205358553.png (1.07 MB, 1280x800)
>>
File: 4.png (2.06 MB, 1088x1088)
>>107893436
>a single datapoint is all you need
>>
>>107893436
damn everyone start training Z-Base NOW
>>
File: 1746888655587296.png (2.3 MB, 1248x1248)
>>
>>107894368
thanks, appreciate it
>>
>>107894036
>>107894048
this looks so bad
>>
>>107894424
its the ad schizo making them
>>
should I run distilled or non distilled klein for best quality?
>>
>>107894445
retard
>>
File: 142.png (668 KB, 2098x1499)
>>107894445
Distilled retard kun
>>
>>107893814
These faces are all the same but none look like the original lol
>>
The Z Base model they have is so bad they're embarrased to share it. Otherwise they would've done so a long time ago. It's simple. That's the part about "Chinese culture" you must understand. Wonder where's DeepSeek R2? Chinese simply are not good as the West.
>>
File: 1762227713820122.png (2.81 MB, 1520x1024)
hm...
>>
Is there a node that pauses the current cycle until user input? eg., I want to pause+notify after 1 video is created before it works on the 2nd, and so forth.
>>
>>107894464
>Chinese simply are not good as the West.
Yet z-image turbo is still the current goat
>>
>>107894463
hmm i wonder why
>>
>>107894477
yeap
>>
>>107893412
make the characters wear identical black and yellow colored cheerleader outfits
>>
File: 1742551110430903.jpg (1.36 MB, 3304x2386)
>>107894445
>>107894393
>>
>>107894477
just generate one video at a time? then it stops till you hit generate again
>>
>>107894532
how do i do that?
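
One way to get that behaviour outside the graph is a small driver script that queues one job, waits for it, and asks before queuing the next. Sketch only: `submit`, `wait_done` and `confirm` are hypothetical callables you would supply yourself (for example wrapping whatever HTTP API your UI exposes). Nothing here is an actual ComfyUI node.

```python
from typing import Callable, Iterable, List, TypeVar

Job = TypeVar("Job")


def run_one_at_a_time(
    jobs: Iterable[Job],
    submit: Callable[[Job], str],
    wait_done: Callable[[str], None],
    confirm: Callable[[str], bool],
) -> List[str]:
    """Run jobs sequentially, asking for confirmation before each next job."""
    finished: List[str] = []
    for index, job in enumerate(jobs):
        # Ask before every job except the first; stop if the user declines.
        if index > 0 and not confirm(
            f"{finished[-1]} finished. Start job {index + 1}?"
        ):
            break
        job_id = submit(job)   # hypothetical: queue the job, return its id
        wait_done(job_id)      # hypothetical: block until that job completes
        finished.append(job_id)
    return finished


def ask_on_terminal(message: str) -> bool:
    """Ring the terminal bell as a notification, then wait for y/N."""
    print("\a", end="")
    return input(f"{message} [y/N] ").strip().lower() == "y"
```

Passing `ask_on_terminal` as `confirm` gives the pause+notify after each video; swapping in a different `confirm` (desktop notification, chat message) doesn't touch the loop.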
>>
>>107894532
fool of atuk
>>
>>107894506
>fluxhands
>>
File: 1757800449653966.png (1.8 MB, 1104x1392)
>>
>>107894551
ai slop
>>
>>107894459
>>107894506
thank you anons
>>
File: Laura window.jpg (537 KB, 1744x2240)
>>107894551
is this a gen or some program stuff?
>>
>>107894563
loooooool
>>
>>107894464
big finetunes are unlikely to use it either way cause it uses a far worse vae (flux 1) instead of klein's flux 2 vae which is 2x as accurate and should converge much faster
>>
File: 1767489780752632.png (1.76 MB, 848x1824)
>>107894563
klein with a pic from the /hr/ playboy thread and this prompt https://pastebin.com/drZ0yjyb
>>
>>107894579
I'm pretty normie

>>107894592
that's cool, looks like that old vector tank game or the intro to the 90s Jonny Quest
>>
>>107894605
isnt that comfy?
>>
>>107894617
no I'm still using Illustrious
>>
File: 1759963590691463.png (72 KB, 357x200)
>>107894624
>>
>>107894605
nice
>>
File: 1760213374432816.png (1.65 MB, 1072x1456)
its a little too clean but still neat


