/g/ - Technology


Thread archived.
You cannot reply anymore.




File: collage.jpg (3.03 MB, 4300x4210)
3.03 MB JPG
Discussion and Development of Local Image, Video, and Music Models

Previous: >>109143837

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
SDWebUI: https://rentry.org/ldg-lazy-getting-started-guide#the-stable-diffusion-web-ui-lineage
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://huggingface.co/models
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Krea 2
https://huggingface.co/krea/Krea-2-Raw
https://huggingface.co/krea/Krea-2-Turbo

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
https://animadex.net

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
gm saars
>>
File: 1759239721939930.jpg (388 KB, 2560x812)
388 KB JPG
>>
File: 1758695702235703.png (2.47 MB, 2560x812)
2.47 MB PNG
>>109146228
>>
File: 1764796703913377.jpg (447 KB, 2560x812)
447 KB JPG
>>109146238
Noo don't eat migu :d
>>
File: animatunetest1_00214_.jpg (625 KB, 1152x1568)
625 KB JPG
>>
File: 1760401851642719.jpg (795 KB, 3072x963)
795 KB JPG
>>109146247
>>
File: Krea2_turbo_00385_.png (1.71 MB, 1672x944)
1.71 MB PNG
>>
File: image_00001_.png (3.26 MB, 1320x1984)
3.26 MB PNG
It's a winner. It's over for the rest.
>>
File: ComfyUI_Krea2__00115_.png (2.09 MB, 1120x1496)
2.09 MB PNG
>candle in the wind
>>
>>109146273
>It's a winner.
if it's participating in the "Plastic skin contest", yeah
>>
File: 001.png (1.72 MB, 1024x1536)
1.72 MB PNG
>>
>>
File: 1753415267803990.jpg (409 KB, 2560x812)
409 KB JPG
>>109146279
Bruh, I asked it to write "Trade Offer". Sometimes Krea 2 just refuses to listen to prompts, it's frustrating
>>
Blessed thread of frenship
>>
File: 745434577.webm (3.97 MB, 420x291)
3.97 MB WEBM
where are the kinos at?
>>
File: animatunetest1_00228_.jpg (583 KB, 1152x1568)
583 KB JPG
bisley test, nsfw: https://files.catbox.moe/itdfam.jpg
>>
>>109146336
You did it. It has the essence.
>>
>>109146336
>catbox
>no metadata
>>
File: NOSOUPFORYOU.gif (131 KB, 220x154)
131 KB GIF
>>109146350
NO METADATA FOR YOU
>>
File: Krea2_turbo_00391_.png (1.72 MB, 1672x944)
1.72 MB PNG
>>109146279
what's the meaning of this?
>>
File: animatunetest1_00232_.jpg (647 KB, 1152x1568)
647 KB JPG
>>109146345
I think it's pretty damn close. I need to tag characters like Lobo way better and perhaps try to separate 90s style and his modern style that are completely different

>>109146350
workflow is filled with embarrassing vibecoded nodes
>best quality, highres, intricately detailed, muscular blue-skinned woman with enormous breasts, long wild blue hair flowing, completely nude, glossy sweaty skin, prominent nipples, holding two large liquor bottles, one raised to drink, powerful stance with wide hips and thick thighs, scars and tattoos across body, gritty industrial bar background with broken bottles and metal pipes, smoke and neon highlights, dutch angle, simon bisley style, painterly, visible brush strokes, gritty 90s comic art, heavy metal illustration, exaggerated anatomy, dramatic lighting, textured skin, raw sensual energy, mixed media, thick impasto
>>
>>109146373
Would be interesting to do another pass to further mess up the style or something like that
with ink and more heavy painterly stuff
>>
>>109146370
Prompt?
>>
>>109146290
skill issue
>>109146370
>>109146380
^ redditors
>>
>>109146373
Fine, thanks for something instead of nothing.
>>
File: 55454645654.jpg (706 KB, 3414x2474)
706 KB JPG
VAE autism solved
https://xcancel.com/PhotogenicWeekE/status/2070641554784768187#m
It's convenient to talk about the Qwen Image VAE, but very little is understood in general about the others. What plagues the Qwen VAE doesn't affect the Wan VAE, which might be on par with Flux's VAE.
>>
>>109146385
>skill issue
you have a skill issue, even with Krea standards the skin is way too plastic
>>
>>109146380
a high quality CGI illustration of Rick and Morty and Summer.They are inside the cockpit of the Space Cruiser and flying in space.Rick has a serious expression and his mouth is open now as he is talking to Morty. Summer Smith without lipstick sits in the middle and is looking happy while texting on her phone.Morty is on the right looking at Rick , with a worried expression on his face. The subtitle at the bottom in white text reads "That's not important right now Morty. Remember what we're here for. A once in a lifetime experience Morty. Anything you can dream of Morty" The style of the image is in Pixar style volumetric 3D CGI
>>
>tfw it's the hottest day of the year and I'm doing video gens
I'm not a very smart fella
>>
File: 1768813200815541.jpg (750 KB, 2048x1128)
750 KB JPG
>>109146390
>VAE autism solved
it barely changes the image, I wouldn't call that solved
>>
>>109146400
doesn't matter for my computer. always the same max gpu temperature whether it is winter or summer
>>
>>109146399
thanks anon
>>
File: Krea2_turbo_00393_.png (1.86 MB, 1536x1024)
1.86 MB PNG
>>109146385
>>
File: 2026 sucks.jpg (507 KB, 2560x812)
507 KB JPG
2025 was the golden year for local diffusion, the best models (Z-image turbo and Wan) are from this year
>>
File: 549169926151280.png (1.4 MB, 1152x1472)
1.4 MB PNG
So now that Comfy has native support for int8-convrot, krea2 basically gets a 2x speedup for free? I feel like this should be advertised better.
>>
>>109146433
yeah, this shit is huge, Krea 2 would have been too slow without it
>>
File: animatunetest1_00248_.jpg (615 KB, 1152x1568)
615 KB JPG
>>109146433
Does it actually work? I don't want to pull
>>
File: kino.mp4 (872 KB, 1248x832)
872 KB MP4
>>109146331
still digging
>>
>>109146440
>Does it actually work?
it does, you can load this model for example, it's literally 2x faster and the quality is between fp8 and Q8
https://huggingface.co/Winnougan/Krea-2-Base-Turbo-NVFP4-FP8-INT8/blob/main/Krea2_Turbo_convrot_int8mixed.safetensors
>>
>>109146405
yeah it's not that bad on the hardware it's just the psychological factor of it being so fucking hot already and I'm just continuously pumping out more heat
>>
File: 00005-2883732523.jpg (695 KB, 2016x2592)
695 KB JPG
>>
File: Krea 2.png (1.18 MB, 1280x720)
1.18 MB PNG
>>109146447
>Peach and Daisy
kek
>>
>>109146439
I have 6 gb of vram and it's 120 seconds for a gen. It's not that slow. 1024 x 2048 or so.
>>
>>109146458
I mean for regular fp8.
>>
>>109146458
>120 seconds for a gen. It's not that slow.
>>
File: 363567545.webm (3.84 MB, 420x291)
3.84 MB WEBM
>>109146447
99% of kino miners give up right before striking gold
>>
>>109146464
You kids need to learn some patience.
>>
File: 2026-06-27_krea2_24.jpg (1.77 MB, 2160x3840)
1.77 MB JPG
>>109143939
Thanks Anon.
>>
>>109146471
high iq kino miners are genning 24/7
>>
File: Krea 2.png (1.46 MB, 1280x720)
1.46 MB PNG
>>109146458
I had to use the bypass lora to make this dirty, fuck that filter shit lol
https://civitai.red/models/2728234/krea2filterbypass?modelVersionId=3067151&dialog=commentThread&commentId=1232042
>>
>>109146433
should it be this noisy?
>>
If you never waited 50 minutes to gen 1 5 second video with Wan 2.1 when it released, lower your tone complaining about speed here.
>>
File: minimum.png (1.56 MB, 1024x1536)
1.56 MB PNG
>>
File: 1116687357130230_00001_1.png (2.87 MB, 2304x1619)
2.87 MB PNG
>>109146439
>>109146440
Yeah, surprisingly it works exactly as advertised. The image is different, but not worse in any discernible way and it's ~2x faster.
>>
>>109146494
not only is it faster, but the quality is higher, it's literally free food, feelsgoodman
https://github.com/BobJohnson24/ComfyUI-INT8-Fast/blob/main/Metrics.md
>>
i catastrophically forgot and convolutedly rotated my balls
>>
File: HunyuanVideo_00038.mp4 (1.01 MB, 960x544)
1.01 MB MP4
>>109146485
>wan 2.1
I remember trying to use hunyuan before the optimizations like tea cache, sage attention and torch compile. It was a nightmare
>>
>>109146485
>If you never waited 50 minutes to gen 1 5 second video with Wan 2.1 when it released, lower your tone complaining about speed here.
only subhumans would go through this humiliation ritual lol
>>
File: HunyuanVideo_00060.mp4 (552 KB, 1280x720)
552 KB MP4
Going through these some of them are actually quite nice kek. Reminds me what a huge leap Hunyuan was over shit like mochi and cogvideo
>>
>>109146521
>lol
>>
>>109146514
>tea cache, sage attention and torch compile.
only sageattention is worth it, and now we also have int8-conv, I wonder if they can use that method to make gguf quants, int8 is a bit big for my pc if I want to try LTX 2.3 for example
>>
>>109146521
Only men who have the capacity of future thinking and delayed gratification (White) would use the new technology in its infancy and then optimize it to what we have today, newniggy
>>
File: Krea2_turbo_00397_.png (1.52 MB, 1536x1024)
1.52 MB PNG
>>109146410
pleasure anon
>>
File: Absolute Kino.webm (477 KB, 960x544)
477 KB WEBM
>>109146522
HunyuanVideo still has made the most kino AI video ever
>>
>int8-convrot

rot-shills really are loud, but it doesn't have 10k downloads in civit so it's shit btw, if normies aren't using it it's useless
>>
>>109146526
>>109146533
>subhumans spotted
>>
>>109146540
It's the same principle as rotating kv cache vectors in LLMs. It's not magic but it is helping with free accuracy.
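
To illustrate the rotation idea from the post above, here is a minimal pure-Python sketch. It assumes a toy 256-value tensor with one outlier, symmetric per-tensor absmax int8 quantization, and an orthonormal Hadamard rotation. None of this is taken from the ComfyUI or ConvRot code, which may work differently; it only shows why spreading an outlier across all channels before quantizing can reduce the error.

```python
import math
import random

def fwht(values):
    """Orthonormal fast Walsh-Hadamard transform (length must be a power of two)."""
    out = list(values)
    n = len(out)
    h = 1
    while h < n:
        for start in range(0, n, 2 * h):
            for i in range(start, start + h):
                a, b = out[i], out[i + h]
                out[i], out[i + h] = a + b, a - b
        h *= 2
    norm = math.sqrt(n)
    return [v / norm for v in out]

def quantize_int8(values):
    """Symmetric per-tensor absmax quantization to int8 levels, then dequantize."""
    scale = max(abs(v) for v in values) / 127.0
    return [round(v / scale) * scale for v in values]

def mse(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)

random.seed(0)
x = [random.gauss(0.0, 1.0) for _ in range(256)]
x[17] = 100.0  # hypothetical outlier channel that dominates the quantization scale

# Quantize directly: the outlier forces a coarse step for every other value.
plain_err = mse(x, quantize_int8(x))

# Rotate, quantize, rotate back. The orthonormal Hadamard transform is its own
# inverse, so applying fwht a second time undoes the rotation.
rot_err = mse(x, fwht(quantize_int8(fwht(x))))

print("plain int8 MSE:  ", plain_err)
print("rotated int8 MSE:", rot_err)
```

The rotation itself loses nothing (it is orthogonal); the gain comes only from the smaller quantization step once the outlier no longer sets the scale.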
>>
File: lewd peach.jpg (923 KB, 1776x1184)
923 KB JPG
>>109146457
>>
>>109146539
I remember this, it's so good kek
>>
>>109146542
>no argument
Concession accepted, brown newfag.
>>
File: lmao.png (713 KB, 1280x720)
713 KB PNG
>>109146571
>n-no u
kek
>>
>>109146582
i already called you a newnig before that

browny cant even follow a 2 reply convo, grim
>>
File: coffin.jpg (416 KB, 1023x1534)
416 KB JPG
>>
Krea 2 convrot workflow with at least 1 lora node pl0x, it's too cryptic to follow what everyone is saying
>>
File: image_00013_.png (3.47 MB, 1320x1984)
3.47 MB PNG
>>
>>109146540
it's a popular method on LLMs, it's just being used on diffusion models now
>>
File: Krea2_turbo_00403_.png (1.7 MB, 1536x1024)
1.7 MB PNG
>>
File: 1755202031573695.png (3.74 MB, 3072x1024)
3.74 MB PNG
Babe wake up, the guy who created the rebalance node improved it
https://github.com/huwhitememes/comfyui-krea2-conditioning
>>
>>109146676
>Why it works
>Bunch of em dashes
I hate those retards who let their LLM write everything for them, at least put some effort into hiding that this is written by AI
>>
>>109146676
it's not the same guy, that one made a fork out of the rebalance thing
>>
>>109146690
Was just about to post the same thing.
It's not that hard to write a simple readme especially after all the hard work..
>>
Is there any quality reason to go beyond 8 steps with Krea Turbo or are you just wasting gpu time ?

inb4 VAE fag throwing a tantrum over me using Krea
>>
>>109146694
Okay so it's worthless LLM speculation then.
I'm not going to even try it.
>>
File: taps sign.png (1.63 MB, 1536x1024)
1.63 MB PNG
>>
File: mall confrontation.jpg (496 KB, 1021x1521)
496 KB JPG
>>
>>109146722
Is that 50 cent?
>>
>>109146733
I think it captures Floyd really well even from side profile.
>>
File: Krea 2.png (1.6 MB, 1024x1024)
1.6 MB PNG
It's really too bad it's slopped, the meme potential of this model is huge
>>
File: Krea 2.png (1.85 MB, 1024x1024)
1.85 MB PNG
>>109146761
I swear if they still use Qwen VAE for the edit model I'm gonna scream so hard
>>
File: ComfyUI_00264_.jpg (492 KB, 1023x1534)
492 KB JPG
>>
File: 00012-2945369007.jpg (811 KB, 2016x2592)
811 KB JPG
>>
File: Krea2_turbo_00408_.png (1.43 MB, 1672x944)
1.43 MB PNG
>>
File: 1770984034468126.jpg (505 KB, 2048x1128)
505 KB JPG
History repeats itself
>>
>>109146784
I think it's not that different from ZiT. I have posted these gens with zit just testing with Krea 2. It's the same slop but with a different flavor.
>>
>>109146804
vicc isn't that fat come on lol
>>
File: file.png (96 KB, 1007x397)
96 KB PNG
is this kind of subgraph really necessary?
>>
File: bruh.png (3.01 MB, 2046x1128)
3.01 MB PNG
>>109146808
>I think it's not that different from ZiT.
it's night and day
>>
>>109146832
It's same slop, different flavour.
>>
>>109146808
What if you were in control of what they put out?
>>
>>109146824
Cum UI isn't Houdini
if you see something like this it's a big red flag
>>
File: 96531173817088.png (1.35 MB, 832x1216)
1.35 MB PNG
>>
>>109146835
>more realism is slop
(You)
>>
>>109146695
to esl - hard
>>
>>109146837
I don't think there is that much difference except more flexibility when it comes to art styles. It wouldn't be that much different because it's already tied to an existing tech market.
So if I was in charge I would say fuck it and let's expose the art style controls.
Model itself wouldn't change that much because it would need to be a future iteration.
>>
>>109146844
Hmm, yes - I am from India and you are probably from Seattle or from California. Why are you this racist then?
>>
File: file.png (1.46 MB, 1280x720)
1.46 MB PNG
Say what you want about its VAE, but I think it's a cool model just for the fact it has so much knowledge, you don't see that often from modern models
>>
>>109146849
My shitty english probably failed me again. I meant it's just a tool like every other model. Use it for the kinds of things it’s good at, if there are any.
>>
>>109146874
Yeah this is what I understood.
>>
>>
File: ComfyUI_255270_.jpg (377 KB, 1010x1515)
377 KB JPG
>>
File: 03d-4046939421-3.jpg (41 KB, 405x720)
41 KB JPG
>>109146761
>>
File: Krea2_turbo_00422_.png (1.49 MB, 1672x944)
1.49 MB PNG
>>
File: Ideogram__00136_.jpg (1.19 MB, 2048x2048)
1.19 MB JPG
>>
:) it would be heaven on earth if there was a man of culture willing to make loras of multiple celebrities, onlyfans/fansly e-thots and all the characters from very ecchi/harem anime. I worry that anima will still remain preferred over krea2 by the dominant anime character lora makers on civitai. A lot of the anime character lora makers are vramlets that don't have a beefy rig to train a lora for krea2 :'(
>>
>>109146943
Krea 2 edit will have a perfect character reproduction, no need for loras anymore, trust the plan
>>
>>109146932
what is this oldhag doing in classroom?
>>
>>109146943
>>109146950
it already does https://photos.google.com/share/AF1QipNIONmNur4qtfMg7ar2MD5z-1opZQBBzoefJfVEAKLyjwmU-wOphoVyyUuKK6gcWA?key=VC0zX0ZUd0diQUJpTWRxYThBelA5QWNQc3EzT3p3
>>
VAEtroons wanting their realistic models to have flux2vae despite the fact that the images they've been gooning to their whole life probably have less definition than flux2vae. 2000s gooning material like
Playboy magazines' definition was solved with SDXL vae. For illustrations, specify anime, you don't need more than an SDXL vae.
Only zoomers want max definition because they were born in the ultra HD era.
>>
>>109146954
Learning, is that not allowed?
>>
File: clouds.jpg (2.16 MB, 3840x2160)
2.16 MB JPG
>>
>>109146970
>token weights
Heh
>>
File: ComfyUI_Krea2__00045_.png (1.65 MB, 1088x1448)
1.65 MB PNG
>>109146973
>>109146919
>no UFO
>>
File: 27-56-2026_01.jpg (2.62 MB, 3295x4928)
2.62 MB JPG
>>
File: 1764591063341858.gif (949 KB, 352x200)
949 KB GIF
>>109146959
>gooning to magazines
ok unc time to go to bed
>>
File: Krea2_turbo_00424_.png (1.4 MB, 1672x944)
1.4 MB PNG
>>
>>109146932
Ideogram looks like Z-image turbo with Flux 2 vae, it's giving really impressive images, it would have been a slam dunk without the bbox autism...
>>
>>109146989
Why do you want pornhub soulless hd slop definition?
>>
>>109146986
I'll try UFO next, my prompt is shrimple:
Atmospheric photography, 
Cumulus clouds in the sky, lighting, aerial footage,
mist, haze, atmospheric, analog film grain,
teal background, gradient, film grain, chromatic aberration,

Testing my old prompts, I still have some left before I deleted my stuff.
This actually produced nice results with SDXL...
>>
>>109146998
>would have been a slam dunk without the bbox autism
What? That's the best part. The control you have over your gens is incredible
>>
>>109146998
Pretty sure you can use it without the boxes just fine.
>>
>>109147000
I want choices, with a good VAE, you have the choice to go for something sharp or blurry (if you prompt for it) if you like, but with a MID vae all you have is smoothed out shit
>>
File: 134910CUI_00001_.png (1.92 MB, 1152x1536)
1.92 MB PNG
>updoot system
>comfy stops working
>updoot comfy
>it starts asking for a nvidia gpu on my amd setup
no idea what caused this, just nuked the venv folder and downgraded rocm and python
>>
File: 548179884147484.png (2.01 MB, 1344x1344)
2.01 MB PNG
>>
>>109147007
Some SDXL shots had nice lighting streaks but Krea 2 doesn't react to this.
Of course I could run this through a LLM and see what sort of word salad it spits out.
>>
File: 1781658574392069.gif (1.73 MB, 220x206)
1.73 MB GIF
>>109147008
>That's the best part.
absolutely not, I want my gambling slop, I want to write some shit, press enter, and then see how the model handles it, I want the surprise, I don't want control
https://www.youtube.com/watch?v=IPFiKEm-oNI
>>
less than 20 of the images in slay the princess actually contain *the princess* (unless you reuse identical images with different facial expressions), but my actual STP dataset after curating down to remove crap or repeated mirror/arm/etc stuff is 300 images, so it's not learning the concept of "the princess" very well and just has other stuff like witch and adversary's hair/eyes bleed into her. what solves this? can i double all those images without them getting burnt into the lora too hard (or de facto double them with some training setting to see them more often)? will it be better if i double them but mirror them? add a dozen decent booru/fanart pics? train a separate set of character loras on just the ~20 pics per character instead of trying to have the one lora learn the style and all the characters?
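
The "see them more often" option in the post above comes down to simple arithmetic, sketched below. The counts (20 character images in a 300-image set) are rounded from the post, and the 20% target share is purely an assumed example value. Whether a given share avoids burning the images into the lora is something only a training run can tell.

```python
def repeats_for_share(subset_count, total_count, target_share):
    """Smallest whole repeat count so the subset makes up at least
    target_share of the samples seen per epoch."""
    other_count = total_count - subset_count
    repeats = 1
    while subset_count * repeats / (other_count + subset_count * repeats) < target_share:
        repeats += 1
    return repeats

# Rounded numbers from the post; target_share is an assumption for illustration.
princess_images = 20
dataset_images = 300
repeats = repeats_for_share(princess_images, dataset_images, target_share=0.20)

seen_per_epoch = princess_images * repeats
epoch_size = (dataset_images - princess_images) + seen_per_epoch
print(repeats, seen_per_epoch, epoch_size, round(seen_per_epoch / epoch_size, 3))
```

Mirrored copies would count toward the same total, so the calculation is identical whether the extra exposure comes from repeats or from flipped duplicates.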
>>
File: I said what I said.png (956 KB, 1080x762)
956 KB PNG
>>109146932
You remove the refiner meme, you make a better licence, you allow people to write prompts like normal human beings, and Krea 2 wouldn't have existed, we would have ridden Ideogram's dick until the end of time
>>
>>109147031
I don't think so, you just perceive it that way because she is much larger than her environment would suggest she should be.
>>
File: 1776430184295457.png (67 KB, 1024x1024)
67 KB PNG
>>109147013
>Pretty sure you can use it without the boxes just fine.
no lol
>>
File: UAP.jpg (627 KB, 1920x1080)
627 KB JPG
>>109146986
>>
File: actors__00201_.png (2.61 MB, 1088x1920)
2.61 MB PNG
>>109146957
I've looked through those images. it's not bad but it's lacking the nuanced details of the celebrities and looks too basic, oversimplified in a rigid way. some of them need loras with a dataset that shows them in multiple views, different kinds of clothes, camera shot types, emotional expressions, viewing angles and strong captioning details of their facial and body features.
>>
>>109147020
that's krea 2?
>>
>>109147050
Well that sucks. I could’ve sworn I prompted it with just a normal text prompt at some point when I was testing workflows for it.
>>
>>109147065
ya
>>
>>109147020
>>109146932
I mean, come on, let's be real for a second shall we?
>>
>>109146840
Nice
>>
>>109147039
yes gpt image 2 or grok to make new synthetic images with uploaded references.
>>
File: ComfyUI_temp_sbotc_00003_.png (2.74 MB, 1616x1616)
2.74 MB PNG
>>109146960
>>109147020
you will never be a real student, oldhag! you come here to sip bubble tea and show your ass!
>>109147052
noice!
>>
>>109147122
It's just like Adamski UFO photos.
>>
File: Krea2_turbo_00440_.png (1.41 MB, 1672x944)
1.41 MB PNG
>>
File: 514580590926753.png (2.64 MB, 1536x1536)
2.64 MB PNG
>>109147083
Ideogram definitely looks better in that example. Maybe with tuning or different prompting Krea 2 could look more like that, but it's also turbo with just 8 steps so there's that too.
>>109147089
Thanks.
>>
>>109147169
>Good porn needs
a good VAE, there's a reason why no NSFW loras worked on Qwen Image, this airbrushed shit can't keep the details and it'll always look like AI
>>
File: ComfyUI_Krea2__00135_.png (1.76 MB, 1448x1448)
1.76 MB PNG
>>109147142
>>
File: the goat.png (2.88 MB, 6154x4608)
2.88 MB PNG
>>109147207
>a huge model that uses flux 2 vae
it already exists
>>
>>109147240
>Ideogram is too huge
what? it's a 9b model
>but muhh refiner
don't use it!
>>
File: ComfyUI_88514_.jpg (604 KB, 1920x1080)
604 KB JPG
How does RTX upscale work? I did one previously but it is just dumb eg it scales up noise and all that.
Should I use an upscale model before feeding it to rtx upscale?
>>
File: ComfyUI_88514_b.jpg (2.07 MB, 3840x2160)
2.07 MB JPG
>>109147270
It's not a proper upscale.
>>
>>109147207
>train a huge model that uses flux 2 vae and release it for free.
https://huggingface.co/fancyfeast/bigasp-3
>>
File: 482723142680830.png (2.48 MB, 1536x1536)
2.48 MB PNG
>>109147169
I tried it, didn't find it all that useful though.
>>
>la krea2ra
>>
>>109147259
https://www.reddit.com/r/StableDiffusion/comments/1tzimq0/ideogram_4_single_model_conditional_vs_double/
>>
>>109147283
nice
there is a special node similar to convrot node for rtx upscaling
but i dont know
>>
File: yes I love hags, and??.png (1.01 MB, 1080x607)
1.01 MB PNG
I don't get the hype for Krea 2, looks like your regular slop model
>>
Does anyone have a recommendation for a text to voice model? 16gb vram if that matters.
>>
File: image_00036_.png (3.62 MB, 1616x1616)
3.62 MB PNG
>>109147305
its good and quick but lacks details without extra effort
>>
File: Krea2_00201_.png (3.64 MB, 1616x1616)
3.64 MB PNG
white boy summer
>>
>>109147286
speaking of which, why is everyone ignoring this model? large model, flux2 vae, full finetune on millions of images, no censorship. and apparently nobody gives a fuck
>>
>>109147346
the training is not over (he has to do RL), and he also said that he has to restore its edit capabilities, Klein without the ability to edit is lame imo
>>
File: 466525734553281.png (1.96 MB, 1984x832)
1.96 MB PNG
>>
File: krea2-turbo_00513_.png (1.85 MB, 1280x1280)
1.85 MB PNG
>Ja, herr Doktor ?
>>
>>109147286
>>109147346
>>109147350
is it good as it is? did anyone try it?
>>
>>109147346
>why is everyone ignoring this model?
bruh, it's a non distilled model, do you really want to run a 9b model at cfg > 1 + 35 steps? it's gonna take ages
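
The cost complaint above can be put in numbers with a rough sketch. It assumes the usual classifier-free guidance setup, where any cfg above 1 needs both a conditional and an unconditional model pass per step, and it compares against an 8-step turbo run at cfg 1 on a model of the same size. The cfg value of 4.0 is only a placeholder.

```python
def forward_passes(steps, cfg_scale):
    """Model evaluations per image: cfg above 1 costs two passes per step."""
    passes_per_step = 2 if cfg_scale > 1 else 1
    return steps * passes_per_step

base_cost = forward_passes(steps=35, cfg_scale=4.0)   # non-distilled, cfg value assumed
turbo_cost = forward_passes(steps=8, cfg_scale=1.0)   # distilled turbo-style run

print(base_cost, turbo_cost, base_cost / turbo_cost)
```

Under those assumptions the non-distilled run needs 70 model evaluations against 8, so roughly 8.75 times the compute per image before any quantization or caching tricks.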
>>
Krea seems to really like adding a bunch of nasty skin details completely unprompted. sometimes they look like acne, or boils in this case. wtf could that be from in their dataset?
The gen after this has none of that. Noticed that one guy's grid test of celebrities has some of that.
>>
File: krea2-turbo_00520_.png (1.82 MB, 1280x1280)
1.82 MB PNG
>Du hast ein gross...
>>
File: Krea2_turbo_00448_.png (1.67 MB, 1672x944)
1.67 MB PNG
>>
>>109146676
it's pretty meh imo, the older node had its issues but it managed to fully get rid of the filter, that one isn't that potent
>>
>>109147420
literally why is every local model still so ass at text when it's a solved problem on the frontier models?
>>
>>109147424
Because it doesn't make any sense. Claude doesn't know what it is supposed to normalize or emphasize. It's snake oil.
>>
File: krea2-turbo_00526_.png (1.89 MB, 1280x1280)
1.89 MB PNG
Schnell bitte
>>
>>109147439
thats because no one care about text, look at SenseNova U1, this shit is specialized on text, do you hear a single fag whether it's from here, leddit or twitter that talks about it? lol
>>
File: 151400CUI_00001_.png (1.74 MB, 1152x1536)
1.74 MB PNG
>>
>>109147458
I think adding a safety filter is a retarded move, what's wrong with simply not training your model on nudity and porn? I'd much rather have a model not knowing some concepts than one that knows it all but decides to refuse to cooperate based on the weather (no filter is perfect, you always have false positive/negative bullshit)
>>
>>109147474
pfft i mean, i can do more out of the box with krea 2 being censored than i ever could with z-image tardbo, and that's after the many memetunes and slopmerges and loras to add the nudity.
i'll take softcore censoring where the model was clearly still trained on NSFW over completely not training on it.
>>
>>109147482
>pfft i mean, i can do more out of the box with krea 2 being censored than i ever could with z-image tardbo
looks slopped though, so what's the point? plastic doll is your fetish?
>>
>>109147474
>I think adding a safety filter is a retarded move
It keeps them safer when someone starts genning CP on /b/ using their model, since it means the poster had to circumvent the safety mechanisms, something they can't be directly blamed for.

Overall I think it's better if the model understands genitalia etc as long as you can circumvent it, worst is what BFL does, which is to train on synthetic data with botched nipples and naked people with no genitalia, thus poisoning the model and making it really hard to train it back in.
>>
>>109147503
the poison is more potent when the model straight up refuses to follow prompts, because it's not filtering just NSFW, but also normal prompts (false positive), it's possible to add new concepts to a model (pussy, dicks...) with more training, but can you remove a filter with a finetune? time will tell
>>
Do you anons know if someone ever released an automated inpainting workflow using LLM?
For example using Gemma31B:
- You generate x
- Output is sent to Gemma
- It checks and boxes anything like blurry face/eyes, limbs, weird text, etc
- It generates by itself again

It's basically what gpt 2 imagegen does without the autoregressive nature.
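
The loop described in the post above could be sketched like this. Everything here is hypothetical: `find_defects` stands in for a vision LLM that returns boxes around bad regions, `inpaint` stands in for a masked regeneration pass, and the toy "image" is just a set of defect names so the control flow can run without any model.

```python
from dataclasses import dataclass

@dataclass
class Box:
    x: int
    y: int
    w: int
    h: int
    issue: str

def auto_inpaint(image, find_defects, inpaint, max_rounds=3):
    """Repeat detect -> inpaint until the checker finds nothing or rounds run out.
    Returns the final image and the number of repair rounds that were run."""
    for round_index in range(max_rounds):
        boxes = find_defects(image)
        if not boxes:
            return image, round_index
        for box in boxes:
            image = inpaint(image, box)
    return image, max_rounds

# Toy stand-ins (assumptions for illustration, not real model calls).
def fake_checker(image):
    return [Box(0, 0, 64, 64, name) for name in sorted(image)]

def fake_inpaint(image, box):
    return image - {box.issue}

start = frozenset({"blurry eyes", "garbled text"})
final_image, rounds_used = auto_inpaint(start, fake_checker, fake_inpaint)
print(sorted(final_image), rounds_used)
```

The `max_rounds` cap matters in practice, since a checker that keeps flagging the same region would otherwise loop forever.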
>>
File: Krea2_turbo_00451_.png (1.65 MB, 1672x944)
1.65 MB PNG
>>109147439
I must say Krea is pretty good out of all the models that I've used. It only seems to become a problem when you use particularly long words and once your sentences start getting somewhat long.
>>
File: krea2-turbo_00540_.png (1.81 MB, 1280x1280)
1.81 MB PNG
>>
File: 1769418036802239.png (852 KB, 1080x817)
852 KB PNG
>>109147503
>it means the poster had to circumvent the safety mechanisms, something they can't be directly blamed for.
depends, if the safety mechanism is straight up easy to remove (it has been removed less than 24 hours after its release), I'm sure they can be blamed for not having implemented tight enough security
>>
File: ComfyUI_Krea_2_00199_.png (2.21 MB, 1672x1256)
2.21 MB PNG
>>
>>109147569
Yeah, the key word is 'safer', not 'safe'

That said people bypass 'Big Tech' security measures all the time, and they have way more resources, so I think it will be hard to go after some small team.
>>
how is krea2 fp16 vs fp8 for text? does it matter or is the difference indiscernible?
>>
>>109147599
>Yeah, the key word is 'safer', not 'safe'
like I said, it also depends on how tight your security is, the judge will also see how hard it is to jailbreak the security, if they've done a half assed job of that and didn't take it seriously enough (they clearly didn't, it's so fucking easy to bypass the filter), this can be taken into account
>I think it will be hard to go after some small team.
again, it depends on the quality of the model, no one will bat an eye if it's SDXL quality in terms of realism, but if this shit is really powerful, a lot of people will talk about it, and here comes the trouble...
>>
>>109147169
>snofs lora
what's that?
>>
>>109147597
But can it do s5-s10 Homer?
>>
File: debo_k_00119_.png (2.75 MB, 1792x977)
2.75 MB PNG
>>
File: Krea2_turbo_00454_.png (1.61 MB, 1672x944)
1.61 MB PNG
Krea 2 is pretty good for story-boarding Rick and Morty episodes, I won't lie
>>
>>109147646
Go back in your splinter general thread schizo
>>
>>109147322
imagine being that fat
sweaty
heavy breathing
rubbing thighs when walking
>>
File: debo_k_00120_.png (2.94 MB, 1792x977)
2.94 MB PNG
>>109147656
>>
>>109147666
Thanks for posting here.
I'm out of inspiration but I'll gen something cool soon.
>>
>>109147653
come on, they're not that far and some eyes are fucked, it's straight up dalle 2 level of details kek
https://www.youtube.com/watch?v=5pBArfeBVkk
>>
File: debo_k_00121_.png (2.83 MB, 1792x977)
2.83 MB PNG
>>109147673
the best part of new models is you can just rerun your entire library of old prompts. everything old is new again :)
>>
>>109147689
That's what I did, I picked up some of the specific prompts. I deleted my old images anyway.
>>
>>109147689
This man speaks the truth
>>
>>109147689
Great to have you debo
>>
File: rockstar.png (2.67 MB, 1920x1080)
2.67 MB PNG
>>
File: debo_k_00123_.png (2.84 MB, 1792x977)
2.84 MB PNG
>>109147696
>I deleted my old images anyway.
even better. the true zen of creation. unburdened by the cruft of past ideas; an empty pot ready to receive all new inspiration

>>109147701
>>109147704
:)
>>
>>109147689
>>109147713
you should bake the next thread. you'd be a way better OP baker than catjack
>>
>>109147713
>>109147711
Look at this - krea 2 doesn't handle film grain at all.
>>
File: debo_k_00126_.png (2.17 MB, 1792x977)
2.17 MB PNG
>>109147716
I have a strict no baking rule
although it would be funny
>>
>>109147726
Why? Just bake some debo
>>
>>109147726
>>109147733
>debo bakes thread
>a dozen splits and two weeks of non stop drama ensue
i agree, it would be funny
>>
>>109146216
>Qwen
>https://huggingface.co/collections/Qwen/qwen-image
>Klein
>https://huggingface.co/collections/black-forest-labs/flux2
what are their use cases when we have bernini for wan 2.2? bernini is fucking insane btw.
>>
>>109147718
It's the size of snow flakes
>>
File: 1754095595450544.png (138 KB, 498x354)
138 KB PNG
>>109147758
>bernini is fucking insane btw.
>>
File: krea2-turbo_00581_.png (1.94 MB, 1280x1280)
1.94 MB PNG
>>
>>109147767
put it this way, you can do a lot with one reference image as your main actor. it can edit videos but who the fuck cares about that when you can just use it in image to video mode and feed it more than one reference image if you want.
>>
File: Krea2_turbo_00459_.png (1.8 MB, 1672x944)
1.8 MB PNG
>>
>>109147767
erm it also uses a thing called rope? so it's much better than vace or any other crap out there, it does not need a mask or controlnet. it just does as you ask in the prompt.
>>
File: Beach(0)_0.png (1.09 MB, 1280x768)
1.09 MB PNG
disco diffusion had sovl
>>
>>109147802
You can use rope with ZiT as well
>>
>>109147785
>>109147767
>>
File: FY2j49fWIAY5nb5.jpg (442 KB, 1664x1664)
>>109147806
good ol days
>>
File: ComfyUI_Krea_2_00238_.png (2.64 MB, 1928x1088)
>>
Krea can only surpass ZIT and get on top of open source if they release uncensored Krea Large Turbo as open weights. Everything else is a censored jack of all trades master of none cope.
>>
>>109147810
sure, what would you like me to create? give me a reference image or images, i'm not doing that part of a demo.
>>
File: bruh.png (168 KB, 498x498)
>>109147758
then fucking show it, what are you waiting for?
>>
File: Untitled.png (3.85 MB, 2048x1280)
>>
>>109147758
Gian Lorenzo Bernini (7 December 1598 – 28 November 1680) was an Italian sculptor, architect, painter and city planner.
>>
>>109147815
that won't happen lol, they didn't even release the style transfer adapter, to be fair I can't blame them, if they release their good shit, they are dead, there will be no reason to try their API models anymore
>>
>>109147825
well fucking give me the reference images and the action to be performed, has to be work safe mate...
>>
File: table.jpg (729 KB, 1920x1080)
>>
>>109147815
Just use the 160 bytes big krea2filterbypass3 lora to bypass the censorship, it's already been circumvented

AND/OR use one of the already many NSFW loras out there
>>
>>109147844
>shits on the quality in your path
>>
>>109147844
>Just use the 160 bytes big krea2filterbypass3 lora to bypass the censorship
I tried that shit, it makes the prompt adherence worse overall. there's no free lunch, I can tell that filter shit will be hard to remove properly
>>
>>109147835
to say that it's "incredible" it means that you've seen videos of it right? then show that first of all
>>
>>109147851
No it doesn't, it doesn't affect the quality in any noticeable way.

And if you somehow imagine it does, just use one or several of the many Krea 2 NSFW loras already out, and there will be MANY more.
>>
>>109147873
It doesn't even matter since it won't reach ZIT realism
>>
>>109147873
Krea won't go far with that retarded VAE. either they find a way to swap it for something else, or it's DOA
>>
File: Krea2_turbo_00466_.png (1.62 MB, 1672x944)
>>
File: krea2-turbo_00587_.png (1.91 MB, 1280x1280)
>>109147863
I've posted these images in this thread:
>>109147384
>>109147419
>>109147447
... etc
All of them used krea2filterbypass3 with a setting of 15.0, only thing quality-wise that happens if I disable it is that it censors any cleavage.
>>
File: debo_k_00128_.png (2.31 MB, 1792x977)
>>109147900
this image goes pretty hard ngl
>>
>>109147911
CJ is always there.
>>
>>109147875
It already has ZiT realism, and also a ton of artstyle and character knowledge that ZiT totally lacked
>>
>>109147941
>It already has ZiT realism
why do you keep lying it's exhausting, it's not even close
>>109146832
>>
using realism engine lora is probably the best solution for nsfw. don't need any bypass nodes or other loras.
>>
>>109147879
prove it
>>
>>109147948
Enough with your shitty cherry-picking, every single krea2_turbo_* image in this thread is better than your hand picked example, you are pathetic beyond belief
>>
File: this has to be bait.png (73 KB, 625x656)
>>109147966
>every single krea2_turbo_* image
they're all Morty 3d render shit, what does this have to do with realism? are you trolling or something?
>>
>>109147957
does it have any downside?
>>
>>109147978
>they're all Morty 3d render shit
lel, go troll somewhere else
>>
>>109146786
nice. stylish and cute ghosts/ghost moths!
>>
>>109147987
show me a krea2_turbo_ image that isn't that (you won't because it doesn't exist, you're just bored and decided to troll lol), last (You) for you
>>
>>109147990
krea btw
>>
>>109147996
Go cry in a corner, schizo
>>
Reminder the schizo is so hyperfixated on this vae argument because comfy brought it up in that one livestream he keeps posting in the thread.
>>
File: Untitled-1.jpg (145 KB, 912x1360)
>>109146216
Nice ryu
>>
>>109147997
>krea btw
I fucking hope it's good at illustrations, they sacrificed flux 1's vae so that the other schizos can spam some rick and morty, completely worth it...
>>
>>109148021
It was a test but with bubbly artifacts.
>>
>>109148018
>comfy brought it up in that one livestream
he didn't though? he never talked about the VAE, his only complaint was about the fact that krea 2 didn't listen to his prompt from time to time (because of the safety filter)
>>
>>109147982
Don't notice anything but maybe I'm just blind.
>>
>>109148018
nobody cares about what comfy says because he just lies
>>
File: ComfyUI_Krea2__00139_.png (2.13 MB, 1448x1448)
>>
>>109148040
KEEP MY PLASTIC SKIN, OUT YOUR FUCKING MOUTH
>>
>>109148034
The schizo does. And has been jerking off to his face ever since the livestream was posted.
>>
File: ComfyUI_00020_.png (3.67 MB, 1696x1696)
>>109148040
>>109148044
This one is on flux
>>
>>109148050
>And has been jerking off to his face ever since the livestream was posted.
sanest Kreatard behavior btw
>>
>>109148050
weird. julien is sexier than yannik by far. schizo has poor taste
>>
>>109148059
you're kinda ugly ani ngl
>>
>>109148057
>Kreatard
You have to be insanely bored to come up with these nintendont juvenile levels of insults, sharty.
>>
>>109147824

>>109147020 camera pans to the left to show >>109147122
>>
>>109148069
>juvenile
says the guy writing homoerotic fanfictions btw
>>
>>109148067
>schizo has poor taste
>>
>>109148075
>the ugly schizo is crying in the corner
kek
>>
>>109147825
I'm doing it using this image >>109147813
and https://www.wired.com/story/the-making-of-the-atomic-bomb-artificial-intelligence/
See, i just learned something new with the bernini node: you need to make sure the max reference image resolution is set to the largest dimension of your reference images, or it will crop too much and look like shit. The video resolution however can be anything you want and it will fit whatever into the video based on your prompt.

You will see what i mean by that when i upload the next output. plus the first time i tried some loras and i think that affected the mushroom cloud way too much, so i won't even upload that because it's a bad example due to my previous settings.
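
The resolution rule above can be sketched in a few lines of python. This is only an illustration of "use the largest dimension of your reference images", not the bernini node's actual code; the function name and the example sizes are made up.

```python
from typing import Iterable, Tuple


def max_reference_resolution(sizes: Iterable[Tuple[int, int]]) -> int:
    """Return the largest single dimension (width or height) over all reference images.

    Hypothetical helper: the result is the value you would type into the
    node's max reference resolution setting so nothing gets cropped.
    """
    sizes = list(sizes)
    if not sizes:
        raise ValueError("need at least one reference image size")
    return max(max(width, height) for width, height in sizes)


# Example (width, height) pairs, chosen only for illustration.
reference_sizes = [(2048, 1280), (1792, 977)]
print(max_reference_resolution(reference_sizes))
```

The output video resolution is a separate setting and does not have to match this value.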
>>
>>109148085
>wall of text
where's the fucking video dude? show us some kino :(
>>
>>109148072
that can be done easy desu, i might do that next.
>>
Please tell me how you support open source Yannik! I love the adoption of supporting commercial licences and apis! Extremely open source behavior! I love it! I also love forcing people into dynamic vram! What a funny prank!
>>
>>109147621
https://civitai.red/models/1972981/sex-nudes-other-fun-stuff-snofs?modelVersionId=3072664
>>
>>109148095
>I also love forcing people into dynamic vram!
the schizo is right on that one, this is fucking bullshit
>>
>>109148096
this and the Realism Engine loras make me OOM using the int8 version
anyone know of a simple way to merge the lora and save the model in int8-convrot?
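
For reference, the merge step itself is just arithmetic. A minimal sketch, assuming the usual LoRA convention (merged = W + strength * alpha / rank * up @ down) and tiny plain-python matrices. It does not cover loading the checkpoint, re-quantizing to int8, or the convrot format, and all names here are hypothetical.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def merge_lora(weight, lora_down, lora_up, alpha, strength=1.0):
    """Fold one LoRA pair into a base weight matrix.

    weight:    out x in   (base layer weight, in full precision)
    lora_down: rank x in
    lora_up:   out x rank
    """
    rank = len(lora_down)
    scale = strength * alpha / rank
    delta = matmul(lora_up, lora_down)  # out x in, same shape as weight
    return [
        [w + scale * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(weight, delta)
    ]


base = [[1.0, 0.0], [0.0, 1.0]]
down = [[1.0, 2.0]]    # rank 1
up = [[0.5], [1.0]]
print(merge_lora(base, down, up, alpha=1.0))
```

The merge has to happen on dequantized weights; whatever int8 scheme the model uses would then be reapplied to the merged result.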
>>
>>109148096
thanks anon
>>
>post thing
>schizos start duking it out on my behalf again
kek
>>
File: ComfyUI_00105_.jpg (87 KB, 800x800)
>>
why is someone reposting people's posts like they're their own. this isn't xitter
>>
File: 1060697532051002.png (1.93 MB, 1472x1152)
>>
>>109148115
Realism Engine is a HUGE lora though, 1.5gb

That said it worked on 16gb vram before a new commit today, but now it's gone from the commit log, so maybe you should roll the dice and git pull, it could have been reverted.
>>
File: bernini testing_00001.mp4 (1.01 MB, 656x656)
>>109148090
It's not as accurate, probably because of the cartoonish style of the first reference. i could always try to make something better with that later. but now onto the 2girls video that the other anon wanted to see if it can do.
>>
File: 1117437249058121.png (1.94 MB, 1472x1152)
>>
>>109148162
>but now it's gone from the commit log
what, that would be a very silly thing for comfy to do, you'd see a revert commit if they were to do that
>>
>>109148166
is it limited to 5sec?
>>
File: Krea2_turbo_00471_.png (1.77 MB, 1672x944)
>>
File: ComfyUI_Krea_2_00310_.png (2.02 MB, 1672x1256)
>>
>>109148193
yup, 81 frames, that is just a wan 2.2 thing mate sadly, but if you're smart like i am you can extend the video easily by chaining and using some quality control on the last frames to keep consistency. It requires highly detailed images though, otherwise it will fall apart after about 20 seconds. if you go past 81 frames it just begins to ping-pong loop on itself.

wan prompts should be a literal description of the scene and characters plus 1-2 actions that can normally happen within 5 seconds. as long as you stick to that rule everything is fine.

SVI pro helps with video extension, so bernini really is good for setting up the first scene for those who don't care for image editor models that produce shitty results.
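
The chaining math above can be sketched like this. Assumptions, none of them from a specific workflow: 16 fps output (which makes 81 frames roughly 5 seconds), and each chained segment starting from the previous segment's last frame, so every extra segment adds 80 new frames. Function names are made up.

```python
import math

FRAMES_PER_SEGMENT = 81  # per-generation limit mentioned above
FPS = 16                 # assumed output frame rate


def total_frames(segments: int, frames_per_segment: int = FRAMES_PER_SEGMENT) -> int:
    """Frames in a chained video where each segment reuses the previous last frame."""
    if segments < 1:
        raise ValueError("need at least one segment")
    return frames_per_segment + (segments - 1) * (frames_per_segment - 1)


def segments_needed(seconds: float, fps: int = FPS,
                    frames_per_segment: int = FRAMES_PER_SEGMENT) -> int:
    """Smallest number of chained segments covering the requested duration."""
    target = math.ceil(seconds * fps)
    if target <= frames_per_segment:
        return 1
    extra = target - frames_per_segment
    return 1 + math.ceil(extra / (frames_per_segment - 1))


for secs in (5, 10, 20):
    n = segments_needed(secs)
    print(secs, "s ->", n, "segment(s),", total_frames(n), "frames")
```

Under these assumptions the ~20 second mark where things fall apart corresponds to about four chained segments.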
>>
>>109148230
Why not use LTX with LTX director node?
https://github.com/WhatDreamsCost/WhatDreamsCost-ComfyUI
>>
>>109148181
My bad, I had another tab with another commit log up, the commit is this one: https://github.com/Comfy-Org/ComfyUI/commit/470ac36a0a807471a0fb78dc0a5548490c9abae4

which makes Realism Engine OOM on 16gb vram, which it didn't previously, so likely it will be reworked
>>
>>109148248
https://www.youtube.com/watch?v=o0l6Ikvn5Q0
>>
>>109148193
>>109148230
in fact i don't even think you would need SVI pro now, since you could just feed the last frame after some polish into the bernini node and keep chaining it. and if you wanted to add something new into the scene, like a character, you could do that just by feeding in the last frame plus the new character as reference.

>>109148248
because melting faces....
>>
>>109148166
garbage
>>
>>109148248
>>109148258
thats great and all, but the underlying model is just not good enough, ltx 2.5 maybe
>>
File: SmoothVid_00007.webm (706 KB, 704x768)
>>109148263
>because melting faces....
>>
>>109148291
Ok. Enjoy exclusively genning asian girls dancing.
>>
File: Krea2_turbo_00472_.png (1.67 MB, 1672x944)
>>
>>109148072
i've tested this and i don't think it works like that, instead it combines the reference images to create the video. maybe my prompt was not perfect but desu this is a first frame last frame job you are asking for.

But I will change my prompt to be less descriptive of the 2 subjects and see if that helps.
>>
>>109148375
did you think that nobody gens with wan here?

next time think twice before making such >>109147758 claims
>>
>>109148317
I will, thanks!
>>
File: 496028815775948.png (1.82 MB, 1472x1152)
>>
File: 662171280850953.png (1.53 MB, 1472x1152)
>>
https://github.com/capitan01R/ComfyUI-Krea2T-Enhancer
https://youtu.be/v6KRngGo10U?t=147
looks like this node can bypass the filter (like rebalance) but without the artifacts (noise, more sloppiness etc...), nice
>>
Are there any advanced noise generator nodes? Where you can start genning with a more crazy noise pattern from the beginning?
>>
>>109148429
what the fuck is going on with the legs broski
>>
File: debo_k_00134_.png (2.12 MB, 1792x977)
>>
>>109148440
here's a noise generated for you
*braaaaaaaaaaaaaaaaaaaap*
>>
File: bernini testing_00003.mp4 (1012 KB, 656x656)
>>109148072
>>109148375
yeah i got it, you need to not describe the subjects and it works properly. you will see what i mean.
prompt changed to just:
You are a helpful assistant specialized in image-to-video generation. Using the 2 reference images. The camera pans left from the first woman to the second woman.

maybe even a prompt like:
You are a helpful assistant specialized in image-to-video generation. Using the 2 reference images. The camera pans left from the first person to the second person.

the less descriptive the better imo since describing them will only make the model change them according to prompt.

it's only 656x656 though, so the quality could be better
>>109148413
you're gay.
>>
>>109148492
garbage
>>
>>109148520
>>109148520
>>
>>109148518
yeah i'm gonna fucking listen to a one word reply moron. if you don't like me or that model that is fine but at least explain why.
>>
File: 1002421413265059.png (1.93 MB, 1472x1152)
>>109148456
beats me, brotherman
>>
>krea 2 is another qwen

so basically everything is flux or qwen, I'm getting tired of this shit


