/g/ - Technology



Discussion of Free and Open Source Diffusion Models

Prev: >>107894964

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Flux Klein
https://huggingface.co/collections/black-forest-labs/flux2

>WanX
https://github.com/Wan-Video/Wan2.2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>NetaYume
https://huggingface.co/duongve/NetaYume-Lumina-Image-2.0
https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
Blessed thread of friendship
>>
File: 1747233206253473.jpg (3.99 MB, 4800x6912)
>>
Where the hell is unsharded Qwen3-8B in BF16?
>>
>>107896312
>unsharded Qwen3-8B
what?
>>
File: 1744254943357984.mp4 (3.79 MB, 1638x2048)
>>107896284
ltx
>>
>>107896301
>picrel
that's very cursed

>4B? Skill issue if you are using 9B. 9B also occasionally refuses to do nipples across seeds but I got a lot of it still.
9B, I just asked to make an anime image realistic, I think the model didn't know what to do with nipple piercings
I didn't test multiple seeds so there's that
>>
>>107896312
https://huggingface.co/Comfy-Org/flux2-klein-9B/tree/main/split_files/text_encoders
>>
File: 1764701272526086.png (1.77 MB, 2282x1225)
>>107896245
lul
>>
>>107896322
one giant safetensor instead of multiple safetensor files you can't import in comfyui
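For context on what "unsharded" means here: a sharded checkpoint spreads its tensors across several .safetensors files, and merging them is mostly bookkeeping. Below is a minimal stdlib-only sketch, assuming the published safetensors layout (8-byte little-endian header length, JSON header, then raw tensor bytes). `parse_safetensors`, `build_safetensors` and `merge_safetensors` are hypothetical helper names, not part of any library, and everything is held in RAM, so for a real 8B BF16 model the official `safetensors` package or a pre-merged download is the saner route.

```python
import json
import struct


def parse_safetensors(blob):
    """Split a safetensors file into (header dict, raw tensor byte buffer)."""
    (header_len,) = struct.unpack("<Q", blob[:8])
    header = json.loads(blob[8:8 + header_len].decode("utf-8"))
    return header, blob[8 + header_len:]


def build_safetensors(tensors):
    """Serialize {name: (dtype, shape, raw_bytes)} into one safetensors blob."""
    header, chunks, offset = {}, [], 0
    for name, (dtype, shape, raw) in tensors.items():
        # Offsets are relative to the start of the tensor byte buffer.
        header[name] = {
            "dtype": dtype,
            "shape": list(shape),
            "data_offsets": [offset, offset + len(raw)],
        }
        chunks.append(raw)
        offset += len(raw)
    header_bytes = json.dumps(header, separators=(",", ":")).encode("utf-8")
    # Pad the header with spaces so the tensor data starts 8-byte aligned.
    header_bytes += b" " * (-len(header_bytes) % 8)
    return struct.pack("<Q", len(header_bytes)) + header_bytes + b"".join(chunks)


def merge_safetensors(shards):
    """Combine several shard blobs into a single unsharded blob."""
    merged = {}
    for blob in shards:
        header, data = parse_safetensors(blob)
        for name, info in header.items():
            if name == "__metadata__":
                continue  # optional string metadata, dropped in this sketch
            if name in merged:
                raise ValueError(f"tensor {name!r} appears in more than one shard")
            begin, end = info["data_offsets"]
            merged[name] = (info["dtype"], info["shape"], data[begin:end])
    return build_safetensors(merged)
```

In use you would read each shard with `open(path, "rb").read()`, pass the list to `merge_safetensors`, and write the result out as one file.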
>>
>>107896330
asking for cosplay might work better
>>
>>107896333
Thank you!
>>
>>107896311
Is she pointing at her feet?
>>
Juggernaut the best model for making anime?
>>
>>107896336
whats qie
>>
>>107896348
Someone gotta clean 'em soles bucko
>>
>>107896350
Qwen Image Edit
>>
File: I volunteer.gif (29 KB, 220x214)
>>107896360
>>
Can any one of you troglodytes inform me as to what the current meta is?
I'm still using noobAI (29+1+h)
>>
>>107896374
No fuck you, maybe don't name call while asking to be spoon fed
>>
>>107896374
you are in the meta. it's going to be six months before zimage base releases then you have to wait for the finetune
>>
>>107896128
>retard here. since Klein can do edits does that mean if you finetune it you'll need to include edit pairs as well?
>>
>>107896393
2 weeks*
>>
>>107896408
6 years*
>>
>>107896341
you are either a jaded loser or a zoomer
>>
>>107896384
I asked it before and not one faggot volunteered so I figured acting like a cunt would do it
>>107896393
Thanks anon
Since you said base, I'm assuming the turbo ver currently released isn't all that good, yeah?
>>
File: 1742500495187424.png (215 KB, 2524x1052)
https://github.com/Tongyi-MAI/Z-Image/issues/126
SHUT UP GWELLO YOU ARE IN NO POSITION OF POWE-
>Klein gets released
ACK, Chingsisies, what should we do?
>>
File: Z-image turbo.png (1.72 MB, 1280x720)
>>107896429
>I'm assuming the turbo ver currently released isn't all that good, yeah?
are you joking? this model is amazing, that's why people are begging for Z-image base to be released
>>
>>107896432
my guess : internal politics + turbo model more popular than they expected + finetuning because the base model isn't that good and turbo was lightning in a bottle moment
>>
>>107896440
Well, that anon said I was in the meta when I said I'm using a noobai finetune so the natural conclusion is that zimage turbo doesn't replace it
What's the pros and cons of zimage turbo? Is it good for anime, realism, classical paintings etc?
>>
>>107896432
would be funny if these github randos piss off Tongyi so much that they don't release it, memeing Chinese Culture into reality
>>
>>107896457
z-image if you make only 10-20 images a day is the best for realism
klein 9b if you want diversity
chroma if you want 2d goon
>>
>>107896457
For its size it's incredibly good for realism while being fast.
It can also do complex prompts SDXL can never do without controlnet and regional prompt autism. The text, texture detail and backgrounds are infinitely better than SDXL too.
>>
>>107896432
I get the frustration but that's just asking for "get a refund" response
>>
File: Z-image turbo.png (2.13 MB, 1280x720)
>>107896457
Z-image turbo is distilled, so it can't be finetuned and can't replace SDXL base, which is why we need Z-image base to be able to move on
>Is it good for anime, realism, classical paintings etc?
it's insanely good at realism, anime is ok I guess
>>
>>107896473
>chroma if you want 2d goon
and 10 out of 100 images without anatomy abomination
>>
>>107896465
kek, i guess this is what happened to wan 2.5. if i recall they said something like "if you ask nicely" jokingly then the "community" collectively lost their shit lmao
>>
>>107896484
the quality of the good one makes up for it, it is a gacha alright
>>
>>107896374
Zimage and Chroma 2k. Klein for editing. 8gb vram minimum, 16gb is ideal.
>>
File: 1743459197440173.png (1.37 MB, 896x1152)
ryan gosling in drive, netflix edition:
>>
>>107896507
why are you so obsessed with this dude, at least pick a cute girl
>>
no styles is not meta
>>
File: 1765667012223871.png (1.44 MB, 1168x880)
>>107896507
it would appear you arent high enough off fent, mr. bond.
>>107896516
cause it makes libtards mad on social media, and he's a good test case for meme gens
>>
File: 1744018335711699.png (1.91 MB, 1296x1200)
>>
>>107896516
because it makes you mad
>>
>>107896516
>why are you so obsessed with this dude
I'm asking myself the same question when I see endless Charlie Kirk memes on tiktok desu
>>
>>107896530
but it doesn't?
>>
>>107896533
>>107896516
enough, we all know what they want, they are all fags in deep denial and end up worshiping males as a way to cope with their lust
>>
>>107896446
you're probably close, my guess is that they want to make this right and have a really solid base model, but I'm afraid they'll overdo it and slop this shit instead of letting it act like a normal base model
>>
>>107896530
I'm tired of seeing his ugly face anon, it's not about being mad, this shit is gay and boring

>>107896533
same shit
>>
>>107896542
come on, CK and GF are ugly motherfuckers, if they were homo they would go for more handsome dudes lol
>>
File: Flux2-Klein_00115_.png (2.49 MB, 1440x1440)
>>
>>107896556
You vill hear about trannies every day, you vill see the big lipped babboon every hour and you vill be happy.
>>
>>107896561
>applying logic to a coping mechanism
>>
>>107896561
did you miss the deep denial part? if they genned handsome men and got hard they'd have to accept they're gay
>>
File: 1742654724389070.png (952 KB, 896x1152)
>>107896473
I'm always in a complex situation in these cases because the things I enjoy in AI are pretty niche
Noob allows me to do pretty show-accurate anime/manga and the style of almost any artist (even some esoteric mfers) on danbooru
>>107896475
>>107896480
I see, thank you guys for the context
>>107896506
>16gb is ideal
I gotta get a new card
>>
>>107896581
buy a 6000 btw
>>
File: 1768495058857827.png (1.59 MB, 1360x752)
>>107896528
one more with this guy.

replace the face of the black man in image 1 with the man in image 2, who is holding a white bag of powder in a ziploc bag. Change the text "RUSH HOUR" to "FENT HOUR". leave the asian man on the left unchanged.

klein 9b distill (grab the q8, it's a small model) is a lot of fun. also the ability to copy font styles is neat, seems to do it better than qwen.
>>
>>107896542
zoomers are just poisoned by politics, there is nothing much you can do about it, they will make a billion trump floyd and whatever else american politics related instead of obsessing over nice female curves
>>
>>107896581
>Noob allows me to do pretty show-accurate anime/manga and the style of almost any artist (even some esoteric mfers) on danbooru
At the expense of having some bad anatomy, background, composition and overall consistency/cohesion. It was good a year ago but I can't help but see all its faults now.
>>107896584
For sure, tomorrow by midnight I'll be shipping you the receipt
>>
Highest resolution you've gone with Klein9b and Zimage before it broke down?
With klein I tried up to 2.5MP and it dealt with it fine.
>>
File: 1750027553857730.jpg (676 KB, 1504x1728)
>>
>>107896596
yeah it works fine at high resolutions, those modern models don't shit their pants like their predecessors, that's when you can see the field is improving for real
>>
File: Flux2-Klein_00129_.png (2.63 MB, 1264x1632)
>>107896507
>>
>>107896618
lmao
>>
>>107896618
faggot
>>
Is zimage natural language or does it understand booru prompts?
>>
>>107896629
i'm not 100% sure
>>
File: 1737928024700966.png (866 KB, 640x1632)
make the image a wireframe like a technical document for the anime girl:
>>
File: 1750630152729472.png (2.59 MB, 1796x1800)
>>107896631
can you make him into a girl?
>>
>>107896629
why would a model not trained on danbooru understand danbooru tags?
>>
>>107896629
>>107896630
not sure
https://www.youtube.com/watch?v=sVyRkl5qNb8
>>
File: 1756049948050662.png (1.28 MB, 640x1632)
make a chibi size plush doll of the anime girl made of fabric.
>>
File: 1767257510018198.png (1.31 MB, 816x1264)
>>107896670
>>
File: 1757623176966597.mp4 (1.18 MB, 320x768)
>>107896670
>>
File: 1768465468395546.mp4 (3.31 MB, 1592x2048)
>>107896581
>>
>>107896592
>having some bad anatomy
it gets hands right more often than not compared to the new models but they haven't been tuned yet so maybe in time it will be better
>backgrounds
I agree with you here
>consistency/cohesion
the main complaint about dit models is the lack of variation, I wouldn't mark this as a point against sdxl
>>
>>107896670
>>107896686
I'd get these
>>
>>107896670
>>107896686
that's cute!
>>
>>107896697
>it gets hands right more often than not compared to the new models but they haven't been tuned yet so maybe in time it will be better
you are gonna have to learn the hard way that that is 100% tied to aesthetically tuning it to one specific style like anime for noob or photorealism for z image (plus tons of rl training)
>>
>>107896432
The real answer is internal company review processes.
In research, the release of things backlogs for a billion different reasons because of internal reviewers.
You know how we got all this "model coming soon" stuff on Github? People forget about it, then 6 months later the researchers are like "Model live on Huggingface!"
It sucks but it's the truth.
>>
File: 1749034714536976.mp4 (3.74 MB, 1292x2048)
>>107896686
>>
File: 1743266224367751.png (1.13 MB, 816x1264)
make a plastic anime figure of the girl on a round pedestal
>>
>>107896716
>You know how we got all this "model coming soon" stuff on Github? People forget about it, then 6 months later the researchers are like "Model live on Huggingface!"
yeah I know what you mean, I remember this same situation happening once but I don't remember what model it was though, probably something mid lol
>>
Where can I download Klein image edit workflow for comfy? I can only find the text to image workflow.
>>
>>107896696
Endearing
Now do him turning into a titan

>>107896697
Good to know
I suppose I'll just wait a while before I try another model, doesn't seem like there's a local option that can fit my needs better than noob rn
>>
>>107896738
https://github.com/BlenderNeko/ComfyUI_ADV_CLIP_emb
>>
File: 1754846995826639.png (216 KB, 820x952)
how the fuck does klein know what image 1/2/3 is?
>>
File: file.png (2.52 MB, 2047x1388)
>>107896738
look at templates in comfyui
>>
>>
>>107896745
it stitches them together I think from left to right. You can describe it in other ways though.
>>
File: 1746728656431076.mp4 (2.74 MB, 640x1024)
>>107896732
>>
>>107896697
yeah i mostly agree with you. the newer models that /ldg/ are fawning over are powerful for realism but dont know characters, series, or proper danbooru tags. not really suited to anime gooner material if thats your thing
>>
>>107896793
with i2v or an image reference it doesn't need to know them natively

for anime image sources you have stuff like wai/illustrious which knows everything
>>
>>107896756
if black forest labs was nintendo they would start suing plastic doll factories for copyright infringement
>>
>>107896807
racist
>>
>>107896800
this is just cope and legwork to make a model work the way you expect ootb at this point. models are great when it's easy to use. loras are just as bad imo
>>
>>107896800
having a structured prompt tag system, which applies to noob, but also illustrious finetunes in general is a huge advantage. you might not be good at first, but you can study and get good
>>
>>107896825
if you aren't using an llm to write your prompts you are doing it wrong
>>
>>107896732
>>
File: 1758609721508638.mp4 (1.5 MB, 576x896)
>>107896756
>>
File: 1766929188848771.jpg (431 KB, 1920x1344)
>>
File: 1739721515300015.mp4 (1.63 MB, 768x576)
>>107896836
>>
File: vz5.png (3.01 MB, 2560x2560)
>I stopped using 4chan since the hack. I now browse alt chans that actually care about their users, and don't need an userscript fighting their shitty design.
>>
>>107896618
How are you getting Klein to maintain the likeness so sharply? It's changing every person I try and manipulate to a horrible mess of JPEG compressed Flux-face.
>>
>>107896885
i'll tell you
>>
>>107896807
show me whatever chink model you think would do better at that kind of closeup while still realistically representing the peach fuzz and such
>>
File: Comparison.jpg (2.69 MB, 2496x1872)
>>107896885
be less retarded
Klein 9B Distilled, 8 steps, "The man is now wearing blackface. Everything else is exactly the same." Input image resolution same as output resolution.
>>
File: 1745099928200689.mp4 (3.95 MB, 2048x2048)
>>107896875
>>
>>107896905
Not what I asked about, fuck you piece of shit
>>
File: 1754479483592823.jpg (541 KB, 1872x1392)
>>
File: Flux2-Image_00102_.png (1.46 MB, 848x1232)
>>107896848
nice
>>
>>107896908
wat
why am i a piece of shit for pointing out that Klein shouldn't be making significant changes to anything, generally
>>
>>107896914
(samefag) like in general, literally make the output the same res as the input, you'll get best results that way
>>
>>107896913
that head tho
>>
why didnt NAI switch to chroma?
>>
>>107896913
good body, face needs to be 20s and not early 30s
>>
File: file.png (1.33 MB, 2100x1350)
>>107896885
pretty much default
>>
>>107896929
>early 30s
do gweilo women really age like that? lol
>>
>>107896927
what the fuck are you talking about nigger, Noob stopped training when they ran out of money, in general
>>
>>107896927
they made their own from scratch model that is better than everything else for animie. They refuse to do realism since they allow loli so of course that would get them in trouble
>>
>>107896942
>Noob stopped training when they ran out of money, in general
wait what? what happened?
>>
File: 1737513274129046.png (2.44 MB, 1120x1392)
>>
>>107896927
isn't nai novel ai?
in that case this:
>>107896943
>>
File: Flux2-Concat_00109_.png (2.57 MB, 1694x1232)
>>107896920
it matches the original, and stubbornly refuses to get smaller no matter how I prompt it
>>107896929
better? can't get it to do any ages between this and the original
>>
>>107896951
i bet she fucks desi men
>>
>>107896955
her head looks too big lol, face is fine sure
I guess it's the input image
>>
>>107896948
it was a long time ago dude lmao, like at least a year
also the other guy is right, NAI usually means NovelAI
only zoomer retards associate "NAI" strictly with Noob
>>
>>107896954
who is noob and why did he run out of money
>>
File: 1768709490.png (1.65 MB, 992x1056)
>>
File: 1740897416925863.mp4 (1.81 MB, 832x1216)
>>107896955
>>
File: 1759315082119941.png (132 KB, 819x1044)
I added a "max_images_allowed" parameter so that you won't have to bypass anything to switch from "2 images mode" to "1 image mode" or "no image mode" or whatever
https://github.com/BigStationW/ComfyUi-TextEncodeEditAdvanced
https://github.com/BigStationW/ComfyUi-TextEncodeEditAdvanced/blob/main/workflow/workflow_Flux2_Klein_9b.json
>>
>>107896987
buy an ad
>>
>>107896964
>I guess it's the input image
anime has always had huge heads proportionally to the body compared to a real human
>>
>>107896987
Your intro is about QIE. Does Klein have the same issue?
>>
>>107896954
>isn't nai novel ai?
i'll never not love the chinks that made noob for stealing nai's name because of the seethe it causes them
>>
File: 1768709707.png (1.78 MB, 1088x944)
>>
>>107897002
the zoom-in issue? it has, but it's way less severe than QiE, anyways, the vl_megapixels value must always be at 0 for Klein since it's used only for QiE
>>
File: ChinkDreamFourPointFive.jpg (3.81 MB, 3072x4096)
>>107896896
(intentional samefag self reply)
Seedream 4.5 gives this at max resolution for the exact same prompt, for the record. Continues to suffer from the retarded obsession of incel Chinks with weird fucking glowing eyes on white women and a general facial structure that resembles no real person who exists anywhere on earth.

Like when I train loras I train them on actual photographs of actual normal looking real people, literally go out of my way most of the time to have a good spread of ethnic and age variance, and also caption the specific ethnicity and age of all people who appear, both male and female. As such I have no respect whatsoever for anyone who does anything less, and I especially don't give a shit about ugly retarded Chinese beauty standards that don't look good in any way to anyone who isn't nativly Chinese.
>>
>>107896913
make a 3d printed recreation of the character
>>
>>107897012
No, this :
>The default TextEncodeQwenImageEdit node downscales your images to 0.15 megapixels before feeding them to the VLM.

But you answered my question. You should write somewhere that your node can be used with Klein too.
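To put the 0.15 megapixel figure in perspective, here is a rough sketch of what an aspect-preserving downscale to a pixel budget does to an input. It assumes 1 MP = 1,000,000 pixels and plain rounding; the actual node may count or round differently, and `fit_to_megapixels` is a made-up helper name, not something from ComfyUI.

```python
import math


def fit_to_megapixels(width, height, megapixels):
    """Scale (width, height) to roughly `megapixels` total pixels, keeping aspect ratio."""
    target_pixels = megapixels * 1_000_000  # assumption: 1 MP = 1,000,000 px
    scale = math.sqrt(target_pixels / (width * height))
    # Never return a zero-sized side, even for absurdly small budgets.
    return max(1, round(width * scale)), max(1, round(height * scale))


# A 1024x1024 input squeezed into a 0.15 MP budget ends up around 387x387.
small = fit_to_megapixels(1024, 1024, 0.15)
```

Under these assumptions a 1024x1024 reference image shrinks to under 400 pixels per side, which would explain why fine detail gets lost before the model ever sees it.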
>>
>>107897015
now prompt it to turn her back to og asuka
>>
File: 1751321610817989.png (2.67 MB, 1008x1536)
>>
File: 1741221835943003.mp4 (3.86 MB, 1890x2048)
>>107896976
>>
wan2gp now supports flux klein 4b and 9b.
>>
>>107897021
yeah I should probably change the readme at some point
>>
File: Flux2-Image_00120_.png (3.19 MB, 1232x2048)
>>
>>107897033
>it's the anon with the shitty skin fetish
fuck me
>>
klein loras
klein loras
klein loras
klein loras
klein loras
>>
>>107897044
post "good skin" the you nogen faggot, I dare you
>>
>>107897022
make this into an anime illustration
>>
what schedulers and samplers are yall folx using with klein for realism? Res2/beta seems to kinda work for me at 1.5 cfg, steps
>>
>>107897049
a bit washed out colors but pretty good
>>
>>107897044
>shitty skin
you say that because she's black racist :'(
>>
>>107896987
>ConditioningNoiseInjection

Neat, zimage severely lacks variation so might give z another go. Will it support other models in the future?
>>
>>107897058
correct
>>
File: 1767425939385074.mp4 (2.18 MB, 576x1024)
>>107897041
>>
File: Flux2-Klein_00013_.jpg (2.32 MB, 2048x2048)
17.6 seconds on a 4090, 9b distilled
The rug is a large, intricately patterned tapestry with a dominant circular mandala design in the center. The primary motif is a concentric radial pattern with multiple layers radiating outward, resembling a blooming flower or sunburst. At the center is a small, dark blue circle encircled by lighter blue and grey petal-like shapes forming a tight rosette. Surrounding this core, layers alternate in warm and cool tones—rust red, deep navy, soft purples, and desaturated oranges.
The second ring features diamond-shaped petals with red centers and dark blue outlines, creating a sharp, rhythmic contrast. The next ring is a wide band of leaf- or feather-like shapes, pointing outward and alternating between muted blue and copper tones. These are bordered by thin concentric rings that divide each section cleanly, maintaining geometric precision.
Beyond the central mandala, the rug transitions into a repeating motif of teardrop and eye shapes, arranged in a circular rhythm. These shapes are filled with detailed internal patterns: dots, lines, and floral curves, mostly in subdued hues of indigo, grey-blue, and maroon.
The outer background of the rug is a deep violet or midnight blue, filled with tiny floral emblems and star-like dots, scattered evenly to create a celestial ambiance. There’s also a visible fabric texture throughout, giving the sense of a woven tapestry rather than a printed rug.
>>
>>107897052
cfg 2 + res6s + whatever the default node uses for scheduler since it's not explicit
>>
>>107897078
>res6s
bro you gen 1 image per hour huh
>>
NovelAI is still better than everything local, right? At least generally
>>
File: 1761575028856961.mp4 (2.01 MB, 576x896)
>>107897033
>>
lol. klein doesn't generate genitalia, but it swaps them in no problem.
>>
File: 1750031417049621.mp4 (3.03 MB, 640x512)
>>107897009
>>
File: Flux2-Image_00126_.jpg (773 KB, 1280x1904)
>>
>>107897060
>Will it support other models in the future?
I think it works on every model
>>
>>107897079
120s per image, I'm fine with it
>>
>>107897088
swaps them??
for me half the time it adds panties, always conservative types
>>
>>107896342
Amusingly (and correctly) asking to make the image like cosplay = background will by default look like inner city sprawl because of conventions all being hosted in those environments, though of course you can just ask for whatever location. still, i've found just asking to make it into a real photo works completely fine; the big issue is more unnatural elements (wings, coloured eyes, wands/staffs, tails etc, or ofc very stylised art) stack up and make the face worse and more uncanny the more of them there are whether you call it cosplay or not. if you really badly want a nice aesthetic pic of a character like this, the play might honestly be (1) image edit to remove those, (2) convert to photo, (3) show it photo & original drawing & ask to add the staff/tail/wings/etc to the photo. Those individual elements might still look a little bit tacky but at least the face will preserve its good quality instead of coming across with poison baggage.
>>
>>107897106
>picture 1 with a skin mound
>picture 2 with genitals
>prompt "swap the genitals"
>boom
sample size of 2 kek
>>
File: Flux2-Concat_00141_.png (2.76 MB, 1796x1200)
my god, it's flawless, you actually can't even tell it's ai
>>
>>107897141
I see, I'll definitely try, this is a good way to leverage other models' good understanding of genitals
>>
File: 1760020691226156.mp4 (3.3 MB, 448x704)
>>107897015
>>
File: ZImageTurbo_Output_1251.png (3.23 MB, 1248x1824)
>>107897013
Z Image version genned and upscaled the same way as the Klein one
wow can't believe she looks like a completely generic as fuck unrealistic Asian-injected SD 1.5 esque sloppa 1girl with significantly worse detail than the Klein version
who could have imagined these results
>>
>>107897044
its ok to admit that you want to colonize her anon. This isn't /pol/ or reddit.
>>
>>107897166
black women are based, that anon and his obsession with damaged leathery dirty skin is not
>>
File: 1740083143952022.png (1.99 MB, 896x1744)
>>
File: Flux2-Klein_00001_.png (913 KB, 1024x1024)
My very first Flux Klein gen. (9b base, fp8)

>A woman with large breasts wearing a long gown at midnight. The image is so dark it is barely possible to see anything.

I must admit I'm impressed. Many models would not be able to do an adequate job with this prompt. Whether it can do a gen that looks good on close inspection remains to be seen, but so far so good.
>>
File: 1762382441486374.mp4 (3.88 MB, 1024x2048)
>>107897187
>>
>>107897235
I think it goes too hard on the night though, it's almost completely black lol
>>
>>107896432
>>107896446
since base model will be so much slower than turbo, it needs to be sufficiently smarter that waiting 60s for a base gen is more appealing than waiting 50s for 5x turbo gens and filtering for the best result, e.g. needs to be so much better or so much more competent at edge case difficulty tasks that the turbo model frequently fucks up. W/ that said i dont know why theyd even call the models "turbo" and "base" of the "same" model after this much work, sounds like it's practically gonna be Z-Image V2. Might as well have co-released base on the day but told people not to directly use it for inference, as with Flux 2 Klein's base.
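The 60s-base versus 50s-for-5-turbo comparison can be put in rough numbers. This is a toy model only: it assumes each turbo gen is independently "good enough" with some probability, and the 0.3 used below is an invented figure, not a measurement of any model. `best_of_n_success` and `seconds_per_image` are hypothetical helper names.

```python
def best_of_n_success(p_single, n):
    """Chance that at least one of n independent gens is acceptable."""
    return 1.0 - (1.0 - p_single) ** n


def seconds_per_image(batch_seconds, batch_size):
    """Average wall-clock cost of one image in a batch."""
    return batch_seconds / batch_size


# Numbers from the post: 5 turbo gens in 50s versus 1 base gen in 60s.
turbo_cost = seconds_per_image(50, 5)   # 10.0 seconds per turbo image
base_cost = seconds_per_image(60, 1)    # 60.0 seconds per base image

# Hypothetical: if 30% of turbo gens are keepers, picking the best of 5
# succeeds about 83% of the time, so a single base gen would have to beat
# that hit rate to be worth the longer wait.
turbo_hit_rate = best_of_n_success(0.3, 5)
```

The lower the per-gen hit rate on a hard prompt, the less the best-of-5 filter helps, which is where a slower but smarter base model would start to pay off.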
>>107897148
i know you're just joking around but i actually do think the clothes and hair etc on this gen look great. the chunky fingers and retarded eyes matched the input image which is a pretty valid decision you can easily explicitly prompt against and same for the neotenous jawline, the only other blatant mistake is one pinky finger being in grey glove material / almost unnoticeable, and i guess the zipper being gone. frankly i think its mindblowing that raw unfinetuned local can do this now.
>>
>>107897250
Did you read the prompt?
>>
>>107897250
nta but to be fair
>The image is so dark it is barely possible to see anything.
it did what it was asked lol
>>
>input a low res image of a person
>tell klein to upscale the image, and use a high res picture of the same person to grab detail from
why doesnt this work?
>>
File: Flux2-Klein_00001_.png (1.77 MB, 1024x1024)
Uh oh.. gen #2 is SLOPPED. Time to play with settings and maybe img2img if necessary
>>
File: 1748706960440617.png (2.93 MB, 1248x1248)


