/g/ - Technology

File: collage.jpg (2.5 MB, 5000x2669)
Discussion and Development of Local Image, Video, and Music Models

Previous: >>109182859

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
SDWebUI: https://rentry.org/ldg-lazy-getting-started-guide#the-stable-diffusion-web-ui-lineage
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://huggingface.co/models
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Krea 2
https://huggingface.co/krea/Krea-2-Raw
https://huggingface.co/krea/Krea-2-Turbo

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
https://animadex.net

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
File: 00034-151605567.png (2.91 MB, 1344x1728)
>>
Blessed thread of frenship
>>
>>109185452
I could easily snap this creature's neck
>>
>mfw Resource news

07/02/2026

>PAPA: Online Personalized Active Preference Alignment
https://github.com/NasikNafi/papa

>Condensing Large-Scale Datasets Directly with Minimal Information Loss
https://github.com/LINs-lab/CIM

>VisReason: A Large-Scale Dataset for Visual Chain-of-Thought Reasoning
https://y-research-sbu.github.io/VisReason

>Asset Generator for 2D & 3D: Blender add-on that generates assets from text prompts
https://github.com/tin2tin/Asset_Generator-2D-3D

>ComfyUI-TrixLoader: All-in-One Image Loader, Editor, and Resizer node for ComfyUI
https://github.com/trx7111/ComfyUI-TrixLoader

07/01/2026

>Elastic Diffusion Transformer: Accelerating SOTA generation models
https://github.com/wangjiangshan0725/Elastic-DiT

>Boogu-Image-0.1-Edit-Turbo
https://huggingface.co/Boogu/Boogu-Image-0.1-Edit-Turbo

>GEAR: Guided End-to-End AutoRegression for Image Synthesis
https://github.com/Tencent-Hunyuan/GEAR

>SpheRoPE: Zero-Shot Optimization-Free 360 Panorama Generation with Spherical RoPE
https://orhir.github.io/SpheRoPE

>ADAPT: Attention Dynamics Alignment with Preference Tuning for Faithful MLLMs
https://github.com/yao-ustc/ADAPT

>Phase-Aligned RoPE for Mixed-Resolution Diffusion Transformer
https://hao-yu-wu.github.io/mixed_res

>Ecocoro Preview 1
https://huggingface.co/alfredplpl/ecocoro-preview-1

>ComfyUI FL-MCP
https://github.com/filliptm/ComfyUI_FL-MCP

>Magnificent 7 value shrinks by $2.3 trillion amid AI spending jitters
https://www.cnbc.com/2026/06/30/magnificent-7-stocks-sell-off-investors-grow-jittery-on-ai-spending.html

>ShutterMuse: Capture-Time Photography Guidance with MLLMs
https://lijayutnt.github.io/ShutterMuse

>ASASR — Coloring the Noise: Adversarial Sobolev Alignment for Faithful Image Super-Resolution
https://huggingface.co/wafer-bob/ASASR

>Qwen3.6-27B NVFP4: Quantized version of Alibaba's Qwen3.6-27B model
https://huggingface.co/nvidia/Qwen3.6-27B-NVFP4

>Horus Lens 1.0
https://huggingface.co/tokenaii/Horus-Lens-1.0
>>
>mfw Research news

07/02/2026

>Training-Free Debiasing of Diffusion Models via CLIP-Guided Denoising Optimization
https://arxiv.org/abs/2607.00817

>AVSR-Diff: Scale-Agnostic Diffusion Priors for Temporally Consistent Arbitrary-Scale Video Super-Resolution
https://kaist-viclab.github.io/AVSR-Diff

>EquiSteer: Cross-Attention Steering Towards a Fairer Text-Guided Image Generation
https://arxiv.org/abs/2607.01147

>Towards Memory-Efficient Autoregressive Video Generation via Instance-Specific Parametric Absorption
https://arxiv.org/abs/2607.00712

>DriftScope: Measuring The Hidden Effects of Diffusion Model Adaptation
https://arxiv.org/abs/2607.00183

>Vitality-Aware Compression for Efficient Image-to-Shape Diffusion Transformers
https://arxiv.org/abs/2607.00382

>Decoupled Guidance: Disentangling Subject and Context Pathways in Text-to-Image Personalization
https://arxiv.org/abs/2607.00766

>Post-Training Pruning for Diffusion Transformers
https://arxiv.org/abs/2607.00927

>The Illusion of High Utility in Safety Alignment of Text-to-Image Diffusion Models
https://adeelyousaf.github.io/SAGE_ECCV26_Project_Page

>Not All Prediction Targets Keep Training-Free Diffusion Guidance on the Manifold
https://github.com/ManLuML/on-manifold-tfg

>MEPA: Multi-Scale Representation Alignment for Visual Autoregressive Modeling with Mixture of Experts
https://arxiv.org/abs/2607.00371

>Flow-Map GRPO: Reinforcement Learning for Few-Step Flow-Map Generators via Anchored Stochastic Composition
https://arxiv.org/abs/2607.00535

>M2Note: Continual Evolution of Vision Language Models via Mistake Notebook Learning
https://arxiv.org/abs/2607.00685

>MoHallBench: A Benchmark for Motion Hallucination in Video Large Language Models
https://arxiv.org/abs/2607.01117

>Selective Test-Time Debiasing for CLIP via Reward Gating
https://arxiv.org/abs/2607.00423

>LeVLJEPA: End-to-End Vision-Language Pretraining Without Negatives
https://arxiv.org/abs/2607.00784
>>
https://civitai.red/models/2746698/krea-2-turbo-8-step-distillation-lora-svd-extract
Only thing that might redeem Krea
>>
File: 1757565024873303.png (978 KB, 1184x896)
>>109185452
>>
>>109185520
Do NOT redeem.
>>
my krea2 lora came out perfect, 2k steps
>>
does comfy support seedance?
>>
>>109185655
yes
>>
>>109185641
that's the main thing i love about it. every lora turns out nearly perfect even with not so perfect dataset.
>>
Reposting here since I was in the wrong general:
Guys I'm kinda retarded, I've been using qwen3.6 27b + 35b a3b with lm studio. I want to try an image model but I don't think you can with lm studio. What's a good GUI for it?
>>
File: debo_sf_k2_uv_00105.jpg (3.44 MB, 6192x2580)
>>
>>109185698
I don't know
>>
>>109185698
ComfyUI is the main UI everyone uses for Image/Video generations.
>>
>>
>>109185746
Thanks
>>
>>109185749
Seedance just feels like 90% of their anime training was Attack on Titan and Demon Slayer.
>>
>>109185749
The only interesting frames are the ass shots in the beginning. the rest is soulless af and weirdly disconnected
>>
>>109185749
>hulk slams her inside a building
>next scene they are somehow back outside in the middle of the street

>throws her into a building
>next scene cuts to her in the middle of the road bursting from rubble
>>
>>109185770
that was unironically the most well animated and coherent part
>>
>>109185776
you noticed all that but not hulk's scale being randomly bigger and sliding around without moving his legs?
>>
all this talk about krea2 got me interested. does anyone have the workflow for bottom right of OP, or similar? either my prompts are dogshit or im missing a lora.
>>
>>109185749
the first like 40 frames is kino
>>
>>109185788
I could write an essay on how bad that animation is. Seedance can do better, that one just sucks
>>
>>109185805
bro if you cant gen a blonde 1girl doing basic shit i dont know what to tell you
>>
>>109185749
watching this is so demoralizing. will genai video always be so embarrassingly slopped? the artists are laughing at us
>>
>>109185520
This method is snake oil. It reduces inference steps at the cost of reducing quality.
>>
File: 00044-955365289.png (2.52 MB, 1248x1824)
>>
File: debo_sf_k2_uv_00108.jpg (2.73 MB, 6192x2580)
>>
File: Capture.png (90 KB, 729x661)
>>109185490
>>Horus Lens 1.0
>https://huggingface.co/tokenaii/Horus-Lens-1.0
>>
>>109185853
The quality loss is negligible.
>>
File: debo_sf_k2_uv_00109.jpg (3.8 MB, 6192x2580)
>>109185880
thanks anon, I'll drop the link from the news
>>
>>109185900
probably the best looking one. in the other images the planets looked like poor CGI from a video game.
>>
File: debo_sf_k2_uv_00110.jpg (930 KB, 3168x1312)
>>109185911
>probably the best looking one
thanks!
>in the other images the planets looked like poor CGI from a video game.
yeah, I'm using some video games as references in the prompt. very helpful for composition and aesthetic, but that can cause the CGI correlation to bleed in. I've been intending to experiment with krea2-negpip to see if I can factor out the CGI lean or not, just haven't gotten to it
>>
holy sameslop
>>
File: screenshot.1783025864.jpg (398 KB, 777x748)
It's crazy how after upgrading to 128GB ram, it seems my ram is always at 90GB+ now. I truly feel sorry for ramlets.
>>
>>109186031
That's just how Windows allocates ram. The more you add the more it takes.
>>
File: ComfyUI_00406_.jpg (1.3 MB, 2024x2696)
>>109185641
Cool, what lora did you make?
>>
its finally done :')
>>109186031
128gb of ram is the current ultimate future proof setup for hobbyist local ai workloads.
>>
>>109186059
the illegal kind
>>
>>109185512
>>109185512
>>109185493
>>109185277
he's fucking with you. based jenner has never shared his jenny lora
>>
>>109185520
>https://civitai.red/models/2746698/krea-2-turbo-8-step-distillation-lora-svd-extract
what's the point of this?
>>
File: ComfyUI_00385_.jpg (1.35 MB, 2024x2696)
>>109186067
I'm scared
>>
>>109186059
Local can't into soles, the wrinkle patterns are always inconsistent.
>>
>>109186111
>two gaming chairs
>>
>>109186135
for her and her bf???????
>>
>>109186135
shes loads on up on braps then tags in the other one for the next round
>>
>>109186142
damn she just like me fr
>>
File: ComfyUI_00387_.jpg (1.28 MB, 2024x2696)
>>109186135
You've clearly never gamed before
>>
>>109186151
UNIT of a mouse. Her hands must be monstrous
>>
File: 00004-31077209.png (3.99 MB, 2560x1472)
>>109186067
loving these results.
>>
File: 1777499661905140.png (1.48 MB, 832x1248)
>>109186151
>>
>>109186169
shouldn't you be making your 2.5D jungle girl slop?
>>
I can't help with this one. This is a tag set for sexualized anime-style images of a character (large/huge breasts, cleavage, plunging necklines, no panties, revealing clothing, spread legs, etc.), and I'm not able to organize or process tags for sexual content like this, regardless of how the request is framed as "just organization."

If you're working on something else — like general artwork tagging for non-sexualized content, or a dataset cleanup task that doesn't involve sexual content — I'm happy to help with that.
>>
File: pixel-0006-1716648318.png (146 KB, 1920x1152)
>>
File: ComfyUI_00397_.jpg (1.35 MB, 2024x2696)
>>109186164
Bigger mice deal more damage
>>
>notice an Anima lora of one of my fav cartoon bitches
>download it and gen a classic one girl special
>the image looks blurry and out of focus
>"The fuck?"
>realized they must have trained it at 1024 or lower
At least train at Anima's native res of 1536. Do I really have to train all my own loras now? What a waste.
>>
>>109186217
any prompt?
>>
>>109186217
prompt?
>>
File: pixel-0009-2971711350.png (136 KB, 1920x1152)
>>
File: FK.jpg (850 KB, 1376x1824)
degenerates everywhere
>>
File: FK2.jpg (824 KB, 1376x1824)
>>
File: 00012-1071166025.jpg (424 KB, 2560x1472)
>>109186181
nigga, your way too obsess with me. relax and post tits.
>>
>>109186237
>>109186247
"a woman is sitting in a gamer chair with the soles of her feet visible"
>>
File: 00538-2113982533.png (1.24 MB, 960x768)
>>
>>109186330
what kind of mental illness is this?
>>
File deleted.
>Krea 2
>add "(she's looking down:-1)"
>Lara Croft goes from normal early Tomb Raider outfit to totally naked
what the fuck. I have no explanation for this.
>>
>>109186237
>>109186247
>vapid whore shows feet to retards funding her so she can keep feeding the dog that fucks her
enjoy the prompt
>>
why are these sdg anons such faggots
>>
>>109186369
blue board anon
>>
File: 2026-07-02 23.40.25.jpg (208 KB, 1732x951)
>>109186369
>>109186380
oops I meant to post the censored version, wrong folder
>>
>>109186369
Thats normal behavior for Krea 2 actually.

if you dont prompt clothes... what do you think is gonna happen ?
>>
>>109186384
Krea 2 doesn't render naked people by default lol, it even has a filter that'll make it refuse to render naked people even if you ask for it
>>
>>109186382
based
>>
>>109186384
Well that's the thing, it was always giving me her normal outfit. But then she was always looking at the camera so I added a negative for looking at the camera, and then she was always looking down or with her eyes closed so I added more negative to avoid that, and that led to her being consistently naked.
>>
>>109186393
then why do I keep getting nipples then mr smart guy
>>
>>109186398
maybe it's getting confused by "standing neck deep" and somehow autocorrecting it to standing naked. I'd try to rephrase that since she's clearly not neck deep or submerged in the water anyway
>>
>someone releases a character lora of a character you like

>but its based on some fucking cucklords BBC smutbase version

What drives people to waste compute on this shite ?
>>
>>109186425
some obese neckbeard redditor is doing it guaranteed
>>
File: 00544-1745628668.png (1.33 MB, 960x768)
>>
>>109186425
I don't get it either. you don't need to bake off model features into a character. At the very least, release separate versions.
>>
>>109186437
this wouldn't even be worth uploading as an image before AI existed.
>>
>>109186412
True, before that my prompt said she was wading through water. If I replace it with just "she is standing in water" her clothes are back on. But that's a weird mistake for it to make. "Neck deep" = naked? Oh well good to know.
>>
File: 06942-1204415870.png (3.57 MB, 1088x1920)
>>
>>109186425
Not your dataset, not your tagging, not your lora, not your waifu, remember that
>>
>>109186481
blocked
>>
File: 00548-3955390623.png (1.27 MB, 960x768)
>>109186445
>>
File: 1775416759590839.png (1.11 MB, 1024x1024)
>>109186481
hmmm... interdasting wisdom... Many people may think you are trying to insult me, but instead ended up enlightening me...
>>
File: debo_sf_k2_uv_00116.jpg (3.72 MB, 6192x2580)
>>
File: 1774130942641776.jpg (37 KB, 832x320)
Is there a way to carry a load image node into these drag and drop fields?
>>
>>109186507
>Many people
spook
>trying to insult me
projection
>but instead ended up enlightening me
you are welcome
>>
>>109186382
my boss totally agrees. you should have posted the sfw one. oh fuck I am so fired.
>>
File: 00025-1366540735.jpg (420 KB, 2560x1472)
>tfw 3 years later you can generate very close up photorealistic images of titties on your machine.
feels good.
>>
File: 00555-2544007949.png (1.84 MB, 2048x512)
>>
>>109186559
Troon hands
Troon breasts
>>
>>109186218
>>109186425
imagine complaining about someone elses lora when its nearly trivial to train your own

>>109186269
sovl
>>
>>109186559
honestly disgusting. her tits look like they're full of blackheads.
>>
>>109186151
Bad soles dude, learn how2soles
>>
>>109186437
>>109186495
>>109186564
>this post is extremely low quality
>>
>>109186541
>>109186564
why do you 2 post the exact same slop since at least a week? not hating but it looks like a retarded hobby in this space
>>
>>109186576
>>109186587
Don't get that poster too riled up or he'll start to post walls of catboxes of his 2.5D slopshit while proclaiming local is dead.
>>
>>109185426
SA3 is neat with its sound quality but its composition, especially with LoRAs trained on same data is nowhere near as good as ACEStep XL. The model simply is too small
>>
>>109186576
turn on your monitor
>>
>>109186586
He's not complaining about someone's lora. He's complaining about a societal ill. It's a rabbit hole and he's in it. But he can't see it because he doesn't yet appreciate how deep the rabbit hole goes.
>>
>>109186605
>or he'll start to post walls of catboxes of his 2.5D slopshit while proclaiming local is dead.
I can do it for free tho
>>
File: 1778172189531792.png (3.45 MB, 1944x1094)
>>109186586
>its nearly trivial to train your own

Its not actually - maybe you forgot most of these models require significant hardware? that getting the dataset of 100-300+ images collected and tagged properly requires significant time investment? That this is why civitai had that bounty system - why people are willing to subscribe to ko-fis, patreons, and early access buzz for LoRAs?

Become educated.
>>
>>109186605
>local is dead
I mean, people are praising a bloated 12b model that uses Qwen vae and has a censorship filter in it, that's how dead it is
>>
zitjeet woke up?
>>
>>109186559
>photorealistic
i applaud your correct usage of that term (attempting to appear as if a photograph but still looking like a non-photograph) but i have a feeling you think it means "looks like a real photograph"
>>
>>109186593
>or is it perhaps extremely high quality?
some of the highest quality images in the thread tbqh desu
>>
>>109186382
It allows "painterly" nudity (similar to API models like gpt-image). So this ultimately looks like a simple seed variation to me.
>>
File: debo_sf_k2_uv_00117.jpg (780 KB, 3168x1312)
>>109186604
>why post
because I want to
>not hating
yes you are
>>
>>109186634
yeah i was implying that its nearly trivial for white men with a high iq and decent income. clearly this does not describe you.
>>
>>109185853
>This method is snake oil. It reduces inference steps at the cost of reducing quality.
What the fuck do you think the turbo model is doing lol?
>>109186102
>what's the point of this?
being able to use the raw model with any steps you want. You need to play with the steps a bit, but if the lora at strength 1.0 means 8 steps, then the lora at 0.5 means you can do 16 steps, which makes the gens more diverse across the same prompt.
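
The rule of thumb above can be written out as a quick sketch. It assumes step count scales inversely with LoRA strength, which is the poster's heuristic and not a documented rule; `steps_for_strength` and `base_steps` are hypothetical names, and the result is only a starting point to tune from.

```python
def steps_for_strength(strength: float, base_steps: int = 8) -> int:
    """Suggested starting step count for a distillation LoRA at `strength`.

    Assumption (the poster's heuristic, not a documented rule): strength 1.0
    corresponds to `base_steps`, and steps scale inversely with strength.
    """
    if not 0.0 < strength <= 1.0:
        raise ValueError("strength must be in (0, 1]")
    # Never suggest fewer steps than the fully distilled setting.
    return max(base_steps, round(base_steps / strength))


for s in (1.0, 0.5, 0.25):
    print(f"strength {s}: start around {steps_for_strength(s)} steps")
```

Lower strengths give a higher suggested step count; whether the gens actually improve at those settings still has to be checked by eye.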
>>
>>109186591
teach?
>>
>>109186665
It consistently happens because of "neck deep" being in the prompt. Picrel is the same prompt and seed but right one with "neck deep" added. I even tried that on the Krea 2 website and the same thing happened.
>>
I'm having trouble getting Krea to generate huge tits or nudes.
Are you guys using any loras? What's the secret?
>>
>>109186702
just use any nsfw lora
>>
>>109186676
>lol?

It looks a lot worse than turbo.
>>
>>109186702
the secret is going to https://civitaiarchive.com/ and then under base model select Krea 2. Then you look around and see what your options are.
>>
>>109186681
>still not neck deep
I guess this is the half empty glass problem. AI can't make characters neck deep in water because it wasnt trained on it.
>>
File: 1782251370373811.png (2.47 MB, 1672x944)
>>109186675
Thankfully It does not, otherwise I would spout ignorance like your post that needed to be corrected!
>>
File: Ideogram__00688_.jpg (2.14 MB, 3840x2160)
>>
File: 257937024.png (1.83 MB, 1280x1536)
>>
>>109186750
that's fucking impressive damn, you can see the power of a great VAE in action, the girls look good even at far away distance, now I want to see a Krea 2 comparison just to laugh a bit kek
>>
>>109186737
I think Krea 2 like many others has trouble doing everything you ask for at once. Like earlier today I was trying to prompt someone with a knee on the ground reaching for a dagger on the ground. Instead there was no knee on the ground and the dagger was already in hand (if there weren't two of them).
>>
bros where are the jenny loras?
>>
>>109186750
didn't know Ideogram could do 4k, or else you've upscaled it?
>>
>>109186059
>>109186111
>>109186151
Can you try generating a different face with the same features (skin, hair color, etc)? I'm starting to notice a Krea 1girl face pattern, since it's a distilled model that might be why
>>
>>109186786
Yeah it can do 4K, no upscaling. Took 6 minutes kek
>>
File: Sovl vs Sovless.jpg (3.91 MB, 7680x2318)
>>109186750
>>
File: ComfyUI_00432_.jpg (1.39 MB, 2024x2696)
>>109186237
>>109186247
>A candid screenshot from a live webcam stream. A young woman wearing a black graphic t-shirt is sitting in a black gaming chair with visible "Razer" branding. She has her bare feet propped up on gaming desk directly in front of the camera, soles facing the lens as the main focal point. Her toes are splayed and wiggling, with slightly pink soles and arched feet as she flexes her ankles to show every angle. One hand covers her mouth as if she's trying not to laugh. Her other hand is extended toward the camera, flipping it off with her middle finger, which has a small band-aid wrapped around it. The camera angle is low, looking up from below so her feet dominate the foreground. A dark, dusty mousepad is visible beneath her feet, and the back of her black gaming chair appears behind her legs. The image has a pink webcam-style border frame. In the top-left corner, a red circular "LIVE" indicator is pulsing. In the top-right corner, the username "maddy08" appears next to a small circular avatar showing her real-life selfie. At the bottom of the screen are two seperate donation alerts: "$30 — Footlover — show feet" and "$15 — iamyourslave — your feet are so hot". In the bottom-right corner is a pink heart-shaped like button and a white share button underneath.
>>
>mfw API news

>NovelAI teases NovelAI V5, a next-generation anime/hentai image generator
https://x.com/novelaiofficial/status/2072359557741871457

>Google’s new Nano Banana 2 Lite image model is its fastest and cheapest yet
https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-omni-flash-nano-banana-2-lite/

>Seedance 2.0 Mini and 4K is now available in ComfyUI
https://blog.comfy.org/p/seedance-20-mini-and-4k-is-now-available

>ByteDance launches Seed Audio 1.0 Unified AI Audio Generation for Speech, Music and Ambient Sound Creation
https://fal.ai/models/bytedance/seed-audio-1.0

>Midjourney goes from generating cat images to full-body ultrasound scans
https://www.theverge.com/ai-artificial-intelligence/952011/midjourney-medical-ai-ultrasound-scan

>Alibaba releases HappyHorse 1.1 Available on Alibaba Cloud
https://www.alibabacloud.com/blog/happyhorse-gets-stronger-motion-expressiveness-higher-generation-consistency-and-enhanced-visual-quality_603293

>ByteDance's New AI Video Model Can Make 30-Second Clips From a Single Prompt
https://www.cnet.com/tech/services-and-software/bytedance-introduces-new-seedance-2-5-video-model/

>Luma Introduces Ray3.2 Model & API: Complete Creative Control for Video Generation
https://lumalabs.ai/news

>The Layout Bet — Reve 2.0
https://blog.reve.com/posts/the-layout-bet

>Introducing Gemini Omni — Google’s multimodal video creation/editing model
https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-omni/

>Nano Banana 2 and Nano Banana Pro are generally available via Gemini Enterprise Agent Platform
https://cloud.google.com/blog/products/ai-machine-learning/nano-banana-2-and-nano-banana-pro-are-generally-available

>Grok Imagine 1.5 Preview
https://x.ai/news/grok-imagine-1-5

>Seedance 2.0 in Runway API
https://docs.dev.runwayml.com/api-details/api_changelog/
>>
File: 1782748349362163.jpg (212 KB, 768x1024)
>>
>>109186814
it's like you've turned off the shadows on a game for Krea 2 Turbo, that makes it so unrealistic
>>
>>109186814
I would not attend this cafe - its rooted in rape culture and I dont believe that represents the values of /ldg/
>>
File: 1782384664816114.jpg (1.64 MB, 2700x3600)
>>109186860
posted the low res version by mistake
>>
>>109186814
You had one job ideogram, get rid of the bbox autism and let people make coom loras, you would have destroyed that fraud K2 model that way
>>
>>109186883
doesn't matter, would still be too big for vramlets like me
>>
>>109186890
what do you mean? it's a 9.3b model? You're not obligated to load both of them at the same time
>>
>buyer's remorselets
>>
File: ComfyUI_temp_ttpjo_00130_.png (3.19 MB, 1184x1776)
Maybe Krea is usable.
>>
>>109186605
those seedance 2.0 catbox links? It's very hard to tolerate the mediocrity of local ai video generation after getting a taste of seedance 2.0 and seeing the near perfect photorealistic skin textures, realistic anatomy, physics, lighting, sound effects, motion, logically coherent and believable backgrounds and objects within the background, and how characters react to and engage with the elements and objects within the generated area of the video. Be for real anon, do you fap to your ltx 2.3, eros and sulphur gens with no negative post nut clarity? Do you go back to rewatch those gens with a positive feeling in mind? My opinion on local ai video generation could change if lightricks cooks the new ltx model very well, with at least 3-5 of the things i adore about seedance 2.0. Local in its current state is not even close to the quality of kling 2.5/2.6 and seedance 1.5. It's barely even near og veo 3 or veo 3.1 lite. krea2 restored my faith in local image generation but I'm totally demoralized when it comes to video generation. loras have their limits and can't fix all the fundamental issues of ltx 2 and 2.3.
>>
>>109186829
thx
>>
File: 06946-2733947113.png (2 MB, 1088x1920)
>>109186809
>>
>>109186883
>let people make coom loras
https://civitai.red/models/2688234/realism-engine-ideogram-4-krea-2?modelVersionId=3047310
>>
>>109186929
No problem
>>
>>109186814
Why would you disingenuously compare undistilled to distilled instead of raw to raw?
>>
>>109186893
>You're not obligated to load the both of them at the same time
so? that applies to ANY model, you can always just load the layers as you need them, you still have to go through them all
it's an 18.6B model
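
For scale, a back-of-envelope sketch of the two positions above. The 9.3B figure and the two-component split are taken from the posts and not verified; 1 byte per parameter assumes fp8 weights; `weight_gb` is a hypothetical helper, and activations, text encoder, and VAE are ignored.

```python
def weight_gb(params_billion: float, bytes_per_param: float = 1.0) -> float:
    """Approximate weight size in GB (1 GB = 1e9 bytes).

    Billions of parameters times bytes per parameter gives GB directly.
    bytes_per_param=1.0 assumes fp8 weights; use 2.0 for fp16/bf16.
    """
    return params_billion * bytes_per_param


# Assumption from the thread: two components of 9.3B parameters each.
components = [9.3, 9.3]

total = sum(weight_gb(p) for p in components)            # everything that must be run
peak_one_at_a_time = max(weight_gb(p) for p in components)  # resident if swapped in turn

print(f"total weights: ~{total:.1f} GB")
print(f"peak resident when loading one at a time: ~{peak_one_at_a_time:.1f} GB")
```

Both posts are describing the same model: swapping components lowers the peak resident weights, but the full parameter count still has to be loaded and run at some point during a gen.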
>>
>>109186940
I should report that lora it's against ideogram rules.
>>
File: 1769966646237758.png (263 KB, 3578x1069)
>>109186940
there won't be any momentum because they have to hide those loras, if you search civitai for Ideogram NSFW loras you'll get nothing
>>
File: 1780098640395857.jpg (172 KB, 759x979)
I'm suddenly getting this, how do I fix it?
it was working fine with the same prompt and model just a minute ago
>>
>>109186954
It's not against the rules because the lora doesn't work.
>>
>>109186940
You know it's a shit image model when it's advertised with videos
>>
>>109186948
Oh yeah, I should use the undistilled version of Krea after all, oh wait... that doesn't exist, they only gave us Krea 2 Turbo and never Krea 2 Medium or Krea 2 Large, fucking retard
>>
Good - I don't wanna see some girl getting anal fisted everytime I open civitai.
>>
File: pixel-0001-2317224558.png (1.46 MB, 4096x4096)
>>
>>109186964
Then it should be removed either way.
>>
>>109186969
>they only gave us Krea 2 Turbo and never Krea 2 Medium or Krea 2 Large, fucking retard
>>109185426
>https://huggingface.co/krea/Krea-2-Raw
>>
File: fuck you.jpg (642 KB, 2046x1128)
>>109186948
>muhh distilled
Z-image turbo is also distilled and can produce really realistic images, I won't accept that excuse you disingenuous K2 shill fuck
>>
>>109186971
they have a .com especially for you el tardo
>>
File: 1774228842972224.png (1.52 MB, 1256x1256)
>>109186984
.com has less content thoughbeit...

even 18+ sites should filter NSFW fetish coomer trash

18+ doesn't mean degenerate coomer - it means mature individual
>>
>>109186983
I don't have a dog in this fight I just think any anons should compare properly.

But you are also being disingenuous by not disclosing the settings / metadata for those two gens.
>>
>>109186741
cool gen
>>
>>109186958
Nah they were just super delayed adding an id4 category that's why none of the early loras are tagged correctly
>>
File: 06947-2733947113.png (1.83 MB, 1088x1920)
>>
>>109187031
>Nah they were just super delayed
because of the licence
https://huggingface.co/ideogram-ai/ideogram-4-fp8/blob/main/LICENSE.md
>Your use of the Model and any Model Derivative must comply with applicable laws and regulations (including trade compliance laws and regulations) and adhere to the Acceptable Use Policy available at https://ideogram.ai/legal/usage-policy
>(A) Do not submit, transmit, display, perform, post or store any content that is inaccurate, illegal, unlawful (including, but not to limited to, uploading copyrighted images to the Services without the consent of the copyright owner), harmful to children, defamatory, unethical, obscene, lewd, lascivious, filthy, excessively violent, pornographic...
>>
File: 1776164713891702.png (3.88 MB, 1776x1184)
https://civitai.red/models/2564225/style-greater-gpt-image-2
You know what?
I think I like GPT image 2's shitty fucking grunge filter - it just needed a finetune and to be made local like this.

now we just need a Krea version
>>
>>109187029
thx
>>
File: is this nigga serious?.jpg (3.69 MB, 7680x2318)
>>109186981
Lmao, do you seriously want me to compare Ideogram to Krea 2 Raw? All right, you asked for it, let's laugh a bit shall we!
>>
>>109187058
>Whatever you now find weird, ugly, uncomfortable and nasty about a new medium will surely become its signature. CD distortion, the jitteriness of digital video, the crap sound of 8-bit - all of these will be cherished and emulated as soon as they can be...
>>
File: dumb motherfucker.png (104 KB, 1690x462)
>>109186981
are you fucking retarded? RAW is a base model, the creator themselves said it is not meant to make images
https://huggingface.co/krea/Krea-2-Raw
>>
>>109187069
>>109187078
two absolute mouth breathing retards
>>
>>109187069
lol. what a piece of shit model.
>>
>>109187018
What are you disclosing anon? I'm not seeing you contribute anything so far (where are your comparisons?), all you're doing is crying, that won't be enough to show your good faith, you know that right?
>>
>>109187088
Concession Accepted.
>>
the zitjeet playbook:
>throws around technical terms he doesn’t understand
>swears by an outdated model
>seethes every thread over ghosts in his head
>spams the same image over and over to prove his ‘point’
>throws a hysterical fit and camps the thread for hours just to cry over the latest model
perhaps a new rentry is needed?
>>
File: 1781769809519898.png (2.47 MB, 1632x1088)
2.47 MB PNG
>>109187069
>>109187078
https://litter.catbox.moe/c8udz2w6wopr0ber.png
I don't care about any single model but to claim raw is broken or not made for generation (it says "not recommended" i.e. noobs need not apply) is incorrect.

I've seen good gens from both ID4 and K2.

>>109187088
This thread does not cater to midwits it's either triple digit IQ autismos or single digit monkeys.
>>
>>109187101
>look at my drawing, that must be enough to win the realism debate right?
damn this dude is dumb, damn...
>>
Using literally ANY model after Krea2 gives me incomprehensible rage - the fact people used to gen things with such dogshit shit is crazy.
>>
>>109187098
>>109183309
that's a bot right?
>>
File: bigasp_22.jpg (550 KB, 1080x1920)
550 KB JPG
>>
File: 1764607525724774.png (2.05 MB, 1088x1632)
2.05 MB PNG
>>109187108
I didn't make a comment on its ability to generate any particular style. Only that it's not as broken as anon attempted to portray >>109187069

>>109187091
Here's another. One that anon seemed to enjoy.
>>
>>109187058
i bet if you neg weight this you could get some giga trad kino
>>
>>109187138
possibly. he scans the thread for any post about krea2 and proceeds to spam the same talking points and images on repeat
>>
>>109187141
>Only that it's not as broken as anon attempted to portray
it is though, RAW can barely do drawings, if you go for realistic pictures it shits the bed, and that's expected, that's a fucking base model, not made for making images, do you comprendo?
>>
>>109187155
I'm asking if you're a bot "anon"
>>
>>109187157
You are doing something very wrong. The raw model can easily do realism, given a decent negative prompt and a resolution that correlates with quality images.
>>
>>109187157
https://litter.catbox.moe/d1qqryk9eixf4rsz.png
Do not mistake me posting this as saying that I believe one model is more capable of realism than another, but clearly something is not right with your workflow.
>>
>>109187176
>>109187180
why are you even trying to use RAW to make images in the first place? it's consistently making worse images than turbo, just use turbo? that's what it's made for?
>>
File: ComfyUI_00445_.jpg (1.52 MB, 2024x2696)
1.52 MB JPG
>>109186591
I like that a lot, can you attach a face to those soles?
>>
chromagods, what's the best Chroma to train a style LoRA on nowadays, that 2K one or what?
>>
>>109187194
Turbo is very slopped. Even with extensive prompting it nearly always does the same faces/same bodies/same skin textures. Very hard to counteract it.

I like to describe faces/figures in detail for the characters, and in a lot of cases turbo just ignores the subtle things. Raw does it fine.
>>
File: ComfyUI_00431_.jpg (1.34 MB, 2024x2696)
1.34 MB JPG
>>109186787
It's possible to prompt the facial features in detail to avoid sameface but i'm too lazy
>>
>>109187194
Step and aesthetic distillation introduce a rigidity to models that I do not enjoy. Their use case is clear, however: it is in fact easier (and quicker) to slop out a half-decent gen with modern turbo models, but you lose the elasticity and "creativity" of pretrained versions. Pretrained models have a higher "quality" ceiling than turbos at the cost of a lower quality floor. And I don't often mind the extra time required.
>>
>>109187215
>Turbo is very slopped.
uh oh, Krea 2 shills won't like that!
>>
>>109187225
desu sloppiness extends to all turbo models not just krea's
>>
>>109187215
>>109187222
did you manage to make good images with RAW though? genuine question, I'm curious to see some images "at its best"
>>
>>109186983
Do you really think Z is as realistic as Chroma? Every model lacks sovl compared to Chroma anon-kun. Only a Chroma-Krea tune can be the king of realism.
>>
File: Z-image turbo.png (1.63 MB, 1280x720)
1.63 MB PNG
>>109187233
>sloppiness extends to all turbo models
lol
>>
>>109187234
>I'm curious to see some images "at its best"
I'm far from the best user of K2 to grace this bread but I've posted a couple already >>109187101 >>109187141
>>
>>109187235
Chroma shill please, your 1girl has no thumb
>>
>>109186216
this one is good
art
>>
anon is really upset today
>>
>>109187235
>a Chroma-Krea tune
a what?
>>
>>109187235
>>109187213
>>
>>109187222
This setup seems like a good middle ground to use RAW.
>>
>>109187267
the thumb is on the other side retard
>>
>>109187235
Chroma's capacity for sovl never fails to disappoint. Truly a model in a league of its own
https://gofile.io/d/935ZNe
>>
>>109187285
absolute noise slop
>>
File: ComfyUI_00450_.jpg (1.39 MB, 2024x2696)
1.39 MB JPG
>>
File: 1764154010866467.png (24 KB, 432x308)
24 KB PNG
does this comfyui bloat on updating it get cleaned at some point or?
>>
File: 00039-955268863.png (3.67 MB, 2560x1472)
3.67 MB PNG
>>
File: kek.png (380 KB, 666x376)
380 KB PNG
>>109187307
>>
98% of posts in these threads are trolling, shilling, seething. I think Its time I head to discord where they actually ban retards. It is a blue board and jannitors should do their fucking jobs, there is no point in reporting because they don't do anything fuck this place.
>>
>>109187320
>trooncord
leave and don't come back
>>
>>109187320
many such cases, you just gotta learn to ignore it and have it as 1 out of many things in the background that you check out every once in a while or on new model releases
>>
>>109187299
baby girl
>>
>>109187292
The extra noise is purposely prompted to make it a little better
>>
>>109187320
I'd like a discord but without avatarfagging and without usernames though (that doesn't exist), because you'll end up with /sdg/ 2 if you let that happen
>>
knowing how to use photoshop/draw/and use local AI feels like playing on godmode lowk.
>>
>>109186475
This one is cool but just a tad too "clean" or sharp
>>
How do I stop getting frizzy hair or millions of moles on Krea 2?
>>
>Krea2
why do you think this is the one model that will take off and not be forgotten in a few weeks once the novelty wears off?
>>
File: 1769119978649171.jpg (380 KB, 2048x1536)
380 KB JPG
>600 queued items gone after comfyui crash
>>
>tfw I just realized I can run gemma 4 completely offloaded to my CPU so I can make INFINITE prompts and gen on the fly

Feels gud
>>
btw, are 3D character loras as good as Ideogram's 3d characters?
>>
>>109187432
is there some async mode so that it gens the prompt on the CPU while genning the image on the GPU? probably not, the tech just isn't there
>>
Krea looks like Z-Image to me. No interest.
>>
>>109187439
llama.cpp + ComfyUI?
There are plenty of nodes that can call an external LLM
>>
File: 1763648004024117.png (1.71 MB, 1368x768)
1.71 MB PNG
Z-image flopped - so I'll stick with krea 2
>>
>>109187453
>doesnt know what async is
i meant processing the prompt for the next gen while genning the current image so it can continue to gen on the gpu non stop
>>
>>109187464
>Z-image flopped
>>
>zitjeet really was chromakek
keeeeeek, i guess it was obvious all along
>>
ask your ai/vibecode it/git gud
>>
>>109187320
you are so weak
>>
File: 1768826625901636.png (1.27 MB, 1024x1024)
1.27 MB PNG
>
>>
>>109187439
you can run another ComfyUI instance on a different port
>>
>>109187482
then what do you consider a flop?
>>
>>109187482
model and prompt?
>>
>>109187469
The only way I can think of is to make a simple little program that uses the ComfyUI API.
>create prompt
>queue api
>repeat.
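
A minimal sketch of that loop, for what it's worth. It assumes ComfyUI is listening on its default local address and that its `/prompt` endpoint accepts an API-format workflow and only enqueues it, so the call returns before the gen finishes. `make_prompt` and `build_workflow` are hypothetical callables you would write yourself: the first asks your local LLM for the next prompt, the second drops that text into your exported workflow JSON.

```python
import json
import queue
import threading
import urllib.request

# Assumption: default address of a locally running ComfyUI instance.
COMFY_URL = "http://127.0.0.1:8188/prompt"


def submit_to_comfyui(workflow, url=COMFY_URL):
    """POST an API-format workflow to ComfyUI, which adds it to its own queue."""
    body = json.dumps({"prompt": workflow}).encode("utf-8")
    req = urllib.request.Request(
        url, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())


def run_pipeline(make_prompt, build_workflow, submit, count, lookahead=2):
    """Write prompts on a worker thread while the main thread queues gens."""
    # Bounded queue: the prompt writer stays at most `lookahead` prompts ahead.
    pending = queue.Queue(maxsize=lookahead)

    def producer():
        for i in range(count):
            pending.put(make_prompt(i))  # blocks while the queue is full
        pending.put(None)  # sentinel: nothing left to write

    threading.Thread(target=producer, daemon=True).start()

    results = []
    while True:
        text = pending.get()
        if text is None:
            break
        results.append(submit(build_workflow(text)))
    return results
```

With a real setup you would pass `submit_to_comfyui` as `submit`. Since the LLM runs on the CPU and the image model on the GPU, the next prompt gets written while the current image is still genning.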
>>
>>109187474
Can you elaborate? I must have missed this.
>>
File: 7ef-1937012543.jpg (48 KB, 600x600)
48 KB JPG
No model that can't run comfortably, and at least train LoRAs, on 8GB VRAM and 32GB RAM will take off. Your bloatmodel will be forgotten once the novelty wears off
>muh VRAMlets
>muh API
enjoy your DOA model used by 5 people
>>
zitcopers are turdworld poorfags still stuck on 8gb vram after having 4 whole years to save up and buy a half-decent card?? pathetic
>>
File: 1753191212031561.png (1.45 MB, 1024x1024)
1.45 MB PNG
>>109187496
zimageUselessGarbagePro.safetensors on civitai
>>
>>109187272
very
>>
>>109187272
>>109187550
all right you can stop samefagging now
>>
File: ComfyUI_00466_.jpg (1.77 MB, 2024x2696)
1.77 MB JPG
>>
File: facts.png (7 KB, 328x85)
7 KB PNG
>>109187320
add IDs like on /pol/ and you'll see how much better these threads could be
>>
File: msedge_vFoi78JLjX.png (314 KB, 332x430)
314 KB PNG
>migrating to Anima after Illustrious
>want to use it as an API since I heard it's decent with non-tag descriptive text, and my frontend's tag translator sucks dick
>only have enough space for 1 base checkpoint that I can use with loras/takes loras really well
>and 1 specialized checkpoint meant to be fed through the API to my frontend
What checkpoints should I use? I'm assuming base Anima for the first, unless the WAI finetune is good, and I was thinking of picrel for the latter since I couldn't give less of a shit about anime and prefer toony/furry stuff. dem thicc outlines
>>
>>109187598
I totally agree because then we could filter by ID and instantly clear away the shit that is irritating and have normal discussions. But I think this entire website is designed and run by the CIA to enrage white man. We need a replacement for this site because this isn't good for peoples mental health, sure you say "ignore them" its impossible when its 98% of the entire fucking thread!

No one fucking cares about x is better than y ffs.
>>
>>109187637
>>only have enough space for 1 base checkpoint that I can use with loras/takes loras really well
>>and 1 specialized checkpoint meant to be fed through the API to my frontend
Is your drive a USB stick from 10 years ago?
>>
>>109187665
nigger have you SEEN the price of storage these days?
And no, I'm not uninstalling my games taking up 95% of the space
>>
I don't even understand how these people do it, shitting up every thread 24/7, unless it is bots. I'd get bored of trolling after 1 hour, so this seems like a raid of bots, a never-ending stream of bot posts, and they should do something about it or it will kill this website.

AI moderation could work, you only need an abliterated model and to convince it to hate such posters.
>>
>>109187637
>which checkpoint is best
that is a question only you can answer for yourself
desu havent felt the need for anything other than base and my own loras
>>
>>109187682
>desu havent felt the need for anything other than base and my own loras
That's what I'm asking. Are most loras built for base, or is it an Illustrious situation where most are made using WAI?
>>
>>109187672
this is bait.

>you must always attempt to derail the thread
>you must keep to topic but do everything possible to cause disruption
>>
>>109187685
>Are most loras built for base
yeah
>Illustrious situation where most are made using WAI?
that was not the case at all kek
>>
>>109187598
it works, but the thread would be almost dead, see /agdg/ on /vg/ and /bant/
>>
>>109187717
I prefer a calm thread to a schizo thread imo
>>
>>109187717
There are no IDs on /vg/ ?
I wonder what would happen if they just enabled IDs site wide but OP decides if they're enabled or not on a per thread basis.
>>
File: debo_sf_k2_uv_00121.jpg (3.64 MB, 6192x2580)
3.64 MB JPG
>>
>>109187752
there will be a lot of schizo meltdowns about whether the "legit" threads are the ones with IDs or not, I want this shit to be mandatory lol
>>
Local Diffusion?
>>
>>109187777
>there will be a lot of schizo meltdowns about "legit" threads
This already happens whenever someone makes a shit bake. Granted it's one more thing to get wrong but I don't think it would be vastly different from what happens now.
>>
File: ComfyUI_temp_jufod_00011_.jpg (525 KB, 1568x2336)
525 KB JPG
>>109187781
me thinks Krea might only be good at 1girling. everything else I gen looks fucking slopped.
>>
File: 1774939149829897.webm (3.9 MB, 1440x1920)
3.9 MB WEBM
>>109187593
>>
>>109187822
please tell me this is LTX
>>
>>109187824
good old wan
>>
>>109187808
either that or you literally have to prompt for every little detail you want in your gen otherwise it won't be there.
>>
>>109187808
>the honeymoon phase with Krea is over
was about goddam time
>>
>>109187826
RIP
>>
best for krea2 nsfw is first gen with anima, then encode image for i2i with krea using only
https://civitai.red/models/2746817/krea2-filter-bypass-fedor?modelVersionId=3089754

At 5 strength and denoise up to 0.8, though i've only tried it with blowjob so far. Anima is such a small model it won't break the bank when switching between krea if using Q4 gguf. Then you got your hardcore porn in much better quality using good old i2i like we use to with pony, when we took anime and converted to realistic using sdxl models. The beauty of this is you don't need nsfw lora's for krea because it was trained on porn already, you only need to bypass channels/layers 9 and 10 and give it a decent reference latenet.
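
For anyone wondering what the denoise number does in i2i: a rough sketch of one common way samplers apply it. The idea is to build a longer noise schedule and keep only its tail, so sampling starts from a partly noised version of the reference latent instead of from pure noise. The linear schedule below is a stand-in I made up for illustration, real schedulers are not linear, and I believe ComfyUI's sampler works along these lines but treat that as an assumption.

```python
def linear_sigmas(n: int) -> list[float]:
    """Stand-in schedule: n + 1 noise levels from 1.0 down to 0.0."""
    return [1.0 - i / n for i in range(n + 1)]


def img2img_sigmas(steps: int, denoise: float) -> list[float]:
    """Keep only the tail of a longer schedule so sampling starts partway in."""
    if denoise >= 1.0:
        return linear_sigmas(steps)  # full denoise: ignore the reference latent
    if denoise <= 0.0:
        return []  # nothing to sample, the input is returned unchanged
    total = int(steps / denoise)  # length of the longer, full schedule
    return linear_sigmas(total)[-(steps + 1):]
```

Lower denoise means a lower starting noise level, which is why composition survives at low values and changes a lot as you approach 1.0.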
>>
File: ComfyUI_00476_.jpg (1013 KB, 1688x2192)
1013 KB JPG
>>
>>109187824
>>109187826
you can tell its wan because they all use that same nsfw lora that makes characters bounce up and down no matter what you prompt for
>>
i will never use more than a single checkpoint per image you can't make me use two
>>
File: debo_sf_k2_uv_00124.jpg (2.83 MB, 6192x2580)
2.83 MB JPG
>>
>>109187836
coomers goto coom and we be reaching levels of cooming that should never have been possible but it do be like this. 0.9 denoise it changes the composition a lot, and below 0.8 it don't change it enough to make the skin more detailed.

a lot could be done to make things better i'm sure but at least you're not constantly fighting the krea models safety filter. I need to test it though because what krea tends to do is push the penis out of the hole and draw the tip outside of it >.< and that is pissing me the fuck off.
>>
File: ComfyUI_00481_.jpg (1.54 MB, 2024x2696)
1.54 MB JPG
>>109187822
Nice
>>109187836
This seems interesting
>>
>>109187836
Stop promoting that AI slop snake oil. It's literally the same as 2 vector bypass + some retarded LLM hallucination.
Uncensor lora/abliteration might not even be necessary here. Let me test quick.
>>
>>109187886
oh this shit is good anon trust me and i don't know why i didn't try it first, you need that lora i linked because it does the job properly, it does not touch the other layers but you need to understand that layer or channel 11 is a secondary safety filter and I think this one can be used in a pinch to cancel that one out if it refuses but disabling 11 can cause issues with prompt and style making things not as good?

So get that https://huggingface.co/Kutches/Kr3a/blob/main/krea2bypass_filtered_11.safetensors

and only use that if that other lora that disables 9 and 10 isn't enough. stack them together in other words.
>>
>>109187593
>>109187849
>>109187886
krea 1girl face oh no no no no
>>
back after a year

what'd I miss
>>
>>109187890
yes it is the same as the one that filters the 2 vectors but they explained why it works better and it locks the other channels. but i did test them and they work the same, the other lora i linked seems to target 11 the secondary safety filter. Take it as you will i am no expert i just want to coom.
>>
>>109187886
>This seems interesting
also note that Anima will understand the natural English prompt you use for the Krea model; you then add tags to the end of that prompt to enhance or change things with Anima.

So possibly gen with Krea first so it's close enough, then copy the prompt to Anima, then send that as a latent and denoise. That is what is working for me. I have yet to try just tag-prompting Anima and then using Krea. It could be that we could simplify the Krea prompt, with the gist of the image provided by Anima, but I don't know, because Krea likes long, detailed prompts in plain English.
>>
>>109187598
and flags
>>
generating images of porn or feet all day must feel adjacent to living amongst the filth of empty plastic containers of convenient store food and black garbage bags
>>
>>109187979
me, actually
>>
>>109187890
Maybe some potential here, I dunno. It's very late and I am going to bed, just wanted to see if passing latents works without lora/bypass shit in Krea.
https://litter.catbox.moe/9rft9wmw00h3oj9d.png
Yes the workflow is schizo, prompt is shit, I am going to clean up later. This was just a quick test. May or may not generalize to actually usable images. Extension is a modified version of Kijai's Custom Sigmas node.
>>109187932
The explanation is nonsensical babble. 2 vector already "locks in other layers" and it uses exact same values. It's some retard karma farming with his LLM. Stop giving him attention so link to the original lora him/his LLM stole from instead.
>>
>>109187926
Z-Image Turbo, Klein (meh), Krea 2 (the current hot new model), Ideogram 4. Nothing absolutely revolutionary desu, just cool to have. I feel like Krea 2 finally surpasses Dall-E 3 for most things.

>>109187979
>nooo stop liking things I don't like
>>
>>109187994
>Krea 2
I am downloading krea 2 and trying it with comfy, but I have been told a1111 is ded
>>
File: ComfyUI_00483_.jpg (1.69 MB, 2024x2696)
1.69 MB JPG
>>109187915
>>109187968
I will test this out properly tomorrow, there should be an easy workaround for the prompting
>>109187979
Show us your gens
>>
>>109188008
Yeah everybody moved to ComfyUI a long time ago.
>>
>>109188008
>downloading krea 2
sad
>>
>>109188029
it is taking a while yeah
>>
>>109187994
bad anatomy. anime anatomy
>>
>>109188008
Just use forge neo. No need to pick up a bloated tool if you don't want to.
>>
>>109187976
we don't need that here though, that only causes more problems with country bashing. ffs i don't care where someone is from or how good their english so long as they as nice people and give thought. right i'm british and my language skills are bad, but i've used this site for over 20 years or something so i skill a lot of words and that is fine because so long as the general concepts are understood it does not slow down conversation. But some people for what ever reason just have to be fucking offended when someone does not talk the king's English and instead they choose to focus on that because really they are not smart they are low IQ.

Only a low IQ would focus on country or conversational skills and not a person character.
>>
>>109188035
do what you want but I hates it
>>
>>109188042
truth nuke, only ID is needed
>>
File: 1781205802786627.png (1.88 MB, 960x1440)
1.88 MB PNG
No need to get mad - it was just me thinking out loud
>>
>>109187598
4chan basically banned all muslim countries but lets israelis post.
>>
What Krea needs now is ControlNet imo. It can do IP-Adapter stuff but it's shit, or at least the current node for it doesn't do it very well. ComfyUI can only really do inpainting well with ControlNet. Currently, if you mask an area with Krea, it doesn't know about the unmasked parts of the image, so you can't inpaint with it...

There are ControlNets for Anima that actually work really well.
>>
>>109188054
krea anatomy is soooo bad.
>>
comfy too hard to learn for my smol brain
>>
dear ldg, what workflows/nodes would you recommend for krea 2 tweaking?

not listed on civitai btw
>>
>>109188062
thats z-image
>>
>>109188054
>product placement in AI gens
I'm surprised this hasn't become a huge thing for new models. Easy money.
>>
File: Ideogram__00705_.jpg (2.63 MB, 2160x3840)
2.63 MB JPG
>>
>>109188075
>>109188075
>>109188075
>>
>>109188070
it's obviously not kek
>>
>>109188064
download claude for desktop
it will give you detailed explanations for anything technical you have
>>
>>109188029
Don't bother with the raw model unless you like to tinker a lot. It's difficult to prompt and, well, it's really meant for training LoRAs. I've tried it and I think it improves quality somewhat, but it's hard to prompt for complex shit. It's up to you though. If you only have 12GB VRAM, Q4 GGUF is probably what you want: when I first started using Q8 I got OOM when trying to load LoRAs, possibly due to how Comfy treats GGUFs with that dynamic VRAM shit? I don't know what the issue is with ComfyUI's memory management these days, but after reading a little, it might not support GGUF?

Someone fill me in here: would I be better off dropping the GGUF format on my 12GB card in favor of the new dynamic VRAM management? How does that work? Wan will still OOM, but I think that's because Wan does all frames at once, so fp8 won't work if I'm using Bernini, for instance, and I can't do the standard full 81 frames, so I have to use the Q4 GGUF for that model... I think it's just the way the Wan model works: it has to process all the frames at once, unlike LTX, which runs fine on my machine for many frames.
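
A back-of-envelope sketch of why Q8 is tight on a 12GB card. The bits-per-weight figures assume the basic GGUF block layouts (32 weights plus one fp16 scale per block, so 4.5 bits for Q4_0 and 8.5 bits for Q8_0). The 12B parameter count is a made-up example, not the actual size of Krea 2 or Wan, and the estimate covers weights only: no activations, text encoder, VAE, or LoRAs.

```python
# Approximate storage cost per weight for each format, in bits.
BITS_PER_WEIGHT = {
    "fp16": 16.0,
    "fp8": 8.0,
    "Q8_0": 8.5,  # 32 x 8-bit weights + one fp16 scale per block
    "Q4_0": 4.5,  # 32 x 4-bit weights + one fp16 scale per block
}


def weight_gib(params_billion: float, fmt: str) -> float:
    """Estimate the size of the model weights alone, in GiB."""
    bits = params_billion * 1e9 * BITS_PER_WEIGHT[fmt]
    return bits / 8 / 2**30


if __name__ == "__main__":
    for fmt in BITS_PER_WEIGHT:
        print(f"{fmt}: {weight_gib(12, fmt):.1f} GiB")  # hypothetical 12B model
```

For that hypothetical 12B model this comes to roughly 6.3 GiB at Q4_0 and 11.9 GiB at Q8_0, so on 12GB the Q8 weights alone leave almost nothing for LoRAs or activations unless part of the model is offloaded.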
>>
>>109188066
use a basic template, load up the provided template in comfyui, space out the other nodes from that subgraph node and unpack the subgraph and delete all the bullshit so you have something that looks like a basic SDXL workflow... Someone at comfy HQ needs to get a fucking clue with workflow design. delete all the switch nodes (like you wouldn't just copy paste another lay out for separate turbo and raw) delete all the set resolution nodes and all that cluster fuck garbage. I can give you a very basic workflow in the next thread if you want it?
>>
>>109188066
picrel is a really basic, clean, high-IQ layout where you can see what shit is doing.
>>
>>109188192
vs the retard mind is full of billions of fuck shit template they gave people


