/g/ - Technology

Previous /sdg/ thread : >>109122905

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Z-Image
https://comfyanonymous.github.io/ComfyUI_examples/z_image
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Flux.2 Dev/Klein
https://comfyanonymous.github.io/ComfyUI_examples/flux2
https://huggingface.co/black-forest-labs/FLUX.2-dev
https://huggingface.co/black-forest-labs/FLUX.2-klein-4B
https://huggingface.co/black-forest-labs/FLUX.2-klein-9B

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Anima
https://huggingface.co/circlestone-labs/Anima

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/aco/csdg/
>>>/b/degen
>>>/d/ddg
>>>/e/edg
>>>/gif/vdg
>>>/h/hdg
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vp/napt
>>>/vt/vtai

OP https://rentry.co/twkuk8tz
>>
First for slop containment general
>>
>>
>mfw Resource news

06/25/2026

>Bernini-R — GGUF (high & low noise experts)
https://huggingface.co/neuregex/Bernini-R-GGUF

>Physics Question Scene Graph: Fine-grained Evaluation of Physical Plausibility in Text-to-Video Generation
https://github.com/atinpothiraj/pqsg

>VPA-Guard: Defending and Benchmarking Image-to-Video Generation Against Visual Prompt Attacks
https://huggingface.co/datasets/CSU-JPG/VVA-Bench

>Minimalist Preprocessing Approach for Image Synthesis Detection
https://github.com/vohoaidanh/adof

06/24/2026

>Krea-2-Turbo Training Adapter
https://huggingface.co/ostris/krea2_turbo_training_adapter

>Vera: A Layered Diffusion Model for Content-Preserving Video Editing
https://vera-layered-diffusion.github.io

>Advancing WordArt-Oriented Scene Text Recognition: Datasets and Methods
https://github.com/YesianRohn/WATER

>DramaDirector: Geometry-Guided Short Drama Generation
https://github.com/iLearn-Lab/DramaDirector

>PG-MAP: Joint MAP Optimization for Inference-Time Alignment of Diffusion and Flow-Matching Models
https://github.com/sophialanlan/PG-MAP

>Safe Few-Step Generation via Velocity Editing
https://uzn36.github.io/VESFlow

>Co-occurring associated retained concepts in Diffusion Unlearning
https://github.com/damilab/CARE

>MeshFlow: Mesh Generation with Equivariant Flow Matching
https://qiisun.github.io/MeshFlow

>Arbor: Explicit Geometric Conditioning for Controllable 3D Asset Generation
https://arbor.jdihlmann.com

>VideoAgent: All-in-One Framework for Video Understanding and Editing
https://github.com/HKUDS/VideoAgent

>Krea 2 GGUF
https://huggingface.co/molbal/krea2-gguf

>Anima updates license to version 1.2. Clarifies commercial use restrictions
https://huggingface.co/circlestone-labs/Anima/commit/8cca6bb7b35f7f6abb2e21616ae44de083dbb8fa

>ComfyUI-Krea2T-Enhancer
https://github.com/capitan01R/ComfyUI-Krea2T-Enhancer

>PiD: Plug-and-play diffusion decoder that replaces VAE/RAE decoders
https://github.com/nv-tlabs/PiD
>>
>mfw Research news

06/25/2026

>DomainShuttle: Freeform Open Domain Subject-driven Text-to-video Generation
https://arxiv.org/abs/2606.26058

>Causal-rCM: A Unified Teacher-Forcing and Self-Forcing Open Recipe for Autoregressive Diffusion Distillation in Streaming Video Generation and Interactive World Models
https://arxiv.org/abs/2606.25473

>Chorus II: Cross-Request Sparsity Reuse for Efficient Image-to-Video Generation
https://arxiv.org/abs/2606.25040

>EchoStyle: Unlocking High-Fidelity Video Stylization with Reverse Data Synthesis
https://arxiv.org/abs/2606.25465

>Concept Removal for Frontier Image Generative Models
https://arxiv.org/abs/2606.25548

>Structuring Sparsity: Block-Sparse Featurizers Capture Visual Concept Manifolds
https://arxiv.org/abs/2606.25234

>MIMFlow: Integrating Masked Image Modeling with Normalizing Flows for End-to-End Image Generation
https://arxiv.org/abs/2606.26016

>FreeStory: Training-Free Character Consistency for Free-Form Visual Storytelling
https://arxiv.org/abs/2606.25079

>TryOnCrafter: Unleashing Camera Trajectories for Realistic Video Virtual Try-on via a Renderable 4D Try-on Proxy
https://sunhao242.github.io/TryOnCrafter_web.github.io

>In-context Region-based Drag: Drag Any Region to Any Shape
https://arxiv.org/abs/2606.25907

>Wan-Streamer v0.1: End-to-end Real-time Interactive Foundation Models
https://wan-streamer.com

>Steering Vision-Language Models with Joint Sparse Autoencoders
https://arxiv.org/abs/2606.25657

>Brevity is the Soul of Inference Efficiency: Inducing Concision in VLMs via Data Curation
https://arxiv.org/abs/2606.25432

>Do vision-language models search like humans? Reasoning tokens as a reaction-time analog in classic visual-search paradigms
https://arxiv.org/abs/2606.25066
>>
>>109136737
>>109136746
you're not welcome in /ldg/ thread schizo
>>
>>109136646
thx, yah that came out nicer than the first few attempts
i'm going for the 2nd lora now
src=200, same settings on musubi
this one should fly by
>>
>>
>>
File: debo_agf_wah_00055_.png (2.25 MB, 1792x977)
2.25 MB PNG
>>109136700
nice OP

>>109136813
are they sprinting for the window
>>
File: debo_agf_wah_00056_.png (2.47 MB, 1792x977)
2.47 MB PNG
fugg, I have to update comfy for krea2
I also have to try to unfuck this stupid subgraph workflow
>>
File: debo_k_00001_.png (2.88 MB, 1792x977)
2.88 MB PNG
first successful krea2 gen
>>
>>109137088
most of the shit in the subgraph is pointless. i took a zit workflow, got rid of auraflow swapped in empty latent and otherwise it's basically the same
>>
File: debo_k_00005_.png (2.69 MB, 1792x977)
2.69 MB PNG
>>109137288
yeah it was just hard to read cuz subgraphs are shit. just hiding stuff on purpose for no reason. I got it sorted out tho
>>
>>109137300
the krea2 one was especially bad, all this janky shit for loras and switches and other crap. the generate text from clip is pretty clever but who wants a 4b quantized qwen3 doing jack shit? not me. the ideogram one was really dumb too, doing json fiddling and all this nonsense. i get it's part of comfy's "you don't need custom nodes" schtick but i'd rather deal with a node pack than wire up 250 nodes to approximate the same thing
>>
File: debo_k_00006_.png (2.87 MB, 1792x977)
2.87 MB PNG
>>109137359
>generate text from clip
I'm trying out the prompt enhancement thing, just out of curiosity
>>
>>
>>
File: debo_k_00011_.png (2.55 MB, 1792x977)
2.55 MB PNG
>>
>>
>>
File: debo_k_00015_.png (2.64 MB, 1792x977)
2.64 MB PNG
>>
i like krea2 so far. its fast too. nice
>>
>>
>>
>>
Is img 2 vid and txt 2 vid still balls to set up? Last time I barely got it working on my 4070 and just used cloud hosted stuff but I'm paranoid and want to host locally. Is wan still the top video gen nowadays?
>>
File: debo_k_00021_.png (2.43 MB, 1792x977)
2.43 MB PNG
>>109137513
i havent formed an opinion yet but its lookin good
>>
>>109137634
i mean if u just want retarded/lazy, the comfy wan2.whatever template will tell you what to download and just work, within its limits. i haven't messed with it bc i'm a 16gb vramlet so it's fuckin aids, 4070 it's iffy, videogen is a rich man's game frankly
>>
>>109137659
not enough latent jankery or what?
>>
File: 1763774527983063.jpg (91 KB, 1080x777)
91 KB JPG
Do you think it's possible for Krea 2 to get Booru tag support?
>>
File: 42100060149146125.png (3.96 MB, 1444x2152)
3.96 MB PNG
>>
>>109137716
it seems to be fine tune friendly so yes, either be the change you want to see in the world or wait for some autist to do it for you. or feed ur booru tags through an llm that's what i do
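A minimal sketch of the "booru tags through an llm" idea, assuming the LLM just gets a plain text instruction. The function name `build_rewrite_prompt` and the instruction wording are made up for illustration and are not anon's actual setup; sending the string to a model is left out because no specific LLM or API is named in the thread.

```python
def build_rewrite_prompt(tags: str) -> str:
    """Turn a comma-separated booru tag string into an instruction for an LLM.

    Hypothetical helper: the wording of the instruction is an assumption.
    """
    # Booru tags conventionally use underscores instead of spaces.
    cleaned = [t.strip().replace("_", " ") for t in tags.split(",") if t.strip()]
    tag_list = ", ".join(cleaned)
    return (
        "Rewrite the following image tags as one natural-language "
        "image description. Keep every concept, add nothing new.\n"
        f"Tags: {tag_list}"
    )


prompt = build_rewrite_prompt("1girl, short_hair, space_station, looking_at_viewer")
print(prompt)
```

The returned string would then be pasted or piped into whatever LLM you run, and its reply used as the image prompt.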
>>
>>
File: 000000_75742_.png (2.86 MB, 1920x800)
2.86 MB PNG
G'mornin Anons, have a great day!
>>
i miss schizo anon
>>
File: 00001-3086059860.png (2.87 MB, 1728x1344)
2.87 MB PNG
>>109138877
good morning
>>
>leave lora training overnight
>the 3rd epoch (out of like 30) is the best one
i knew it was going to be fast
>>
>>109136996
>sprinting for the window
Maybe they're Russian.

>>109139274
gm
>>
>gm
>>
>>
>>
File: 00006-2765994258.jpg (779 KB, 2016x2592)
779 KB JPG
>>
File: 00008-1170368417.jpg (1.02 MB, 2016x2592)
1.02 MB JPG
>>
>>109139545
gm
What's for breakfast?
>>
>>
>>
>>109139842
Jabba the Hutt jr back there?
>>
>>109139853
mopey the hutt
>>
>>
File: 00013-3120248225.jpg (712 KB, 2016x2592)
712 KB JPG
>>
>>
>>
krea2 text2img+comfyui then grok

very grindy, installed krea yday, its been a year since trying local gen, I'm impressed, yet to find a grok equivalent for a 3060Ti
>>
>>
File: 00018-1575739514.jpg (919 KB, 2016x2592)
919 KB JPG
>>
>>
heh
>>
>>
>>
>>
>>
File: output.mp4 (3.66 MB, 1280x1024)
3.66 MB MP4
>>
File: pixel-0000-2261510643.png (742 KB, 2560x2048)
742 KB PNG
>>
File: pixel-0001-688170757.png (839 KB, 2560x2048)
839 KB PNG
>>
>>109136700
How do you guys maintain consistent characters across images? I get the impression faceID is kinda old.
>>
File: pixel-0002-224804035.png (1.28 MB, 2048x2560)
1.28 MB PNG
>>
>>109140835
>>>/g/ldg
>>
>>109140835
lora
>>
morning anons
>>
File: 00025-2584806310.jpg (611 KB, 1872x2736)
611 KB JPG
>>109141103
morning
>>
File: debo_k_00023_.png (2.56 MB, 1792x977)
2.56 MB PNG
>>109141103
>>109141207
gm
>>
>>109141103
gm
>>
>>
File: debo_k_00029_.png (2.98 MB, 1792x977)
2.98 MB PNG
>>
>>
>>
>>
File: debo_k_00031_.png (2.53 MB, 1792x977)
2.53 MB PNG
>>
>>
>>
>>
File: debo_k_00033_.png (2.39 MB, 1792x977)
2.39 MB PNG
>>109141708
now that your lora is cooking and you've spent some time with krea2, do you think its the new gold standard?
>>
File: 1751046583149.jpg (226 KB, 1024x1536)
226 KB JPG
desuarchive.org/g/thread/105712607
1 year ago
>>
>>109141741
I'm not sure. I think z is more "creative" or rather, easier to go crazy with, ig like i said before seems to know more non-media related concepts, but krea seems a bit better at putting my stuff together. it's a tough choice
i still need a third lora to finish her tho, so we'll see how it does after
>>109141811
good ol' chroma days
>>
File: 00028-2552546708.jpg (579 KB, 1728x2880)
579 KB JPG
a bit out-of-season
>>
>>109141811
>rocketgirl anon
sad, sad
>>
File: debo_k_00035_.png (2.47 MB, 1792x977)
2.47 MB PNG
>>109141811
feels like yesterday

>>109141869
pour one out for a fallen homie
>>
lolwut
time to prep the third lora
60 images, super quality, so hopefully it wont take 6 hours like the first
>>
File: 00031-3193865500.png (1.48 MB, 1152x1920)
1.48 MB PNG
>>
>>109141937
do you train on your own images? like pick the ones with the best face or whatever?
>>
>>109142036
yah, i have a few source sets that i've collected over time, and have a "chromagirl" set that i use for that specific look
i've curated them over time, setting to a good resolution, enhancing, captioning, etc (although i'm always recaptioning)
basically start with 10MP+ pictures/sources, and set them to what you want the lora to do
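The "start with 10MP+ sources" rule can be sketched as a simple filter. This is only an illustration under assumptions: the filenames and dimensions are invented, the 10.0 cutoff is read straight from "10MP+", and the image sizes are passed in as plain tuples instead of being read from disk.

```python
def megapixels(width: int, height: int) -> float:
    """Pixel count in millions."""
    return width * height / 1_000_000


def filter_sources(images: dict, min_mp: float = 10.0) -> list:
    """Return the names of images at or above min_mp, sorted by name.

    images maps a filename to a (width, height) tuple.
    """
    return sorted(
        name for name, (w, h) in images.items() if megapixels(w, h) >= min_mp
    )


# Hypothetical source set, dimensions made up for the example.
candidates = {
    "a.jpg": (4000, 3000),  # 12.0 MP, kept
    "b.jpg": (1920, 1080),  # about 2.07 MP, dropped
    "c.png": (3872, 2592),  # about 10.04 MP, kept
}
keep = filter_sources(candidates)
print(keep)
```

Resizing, enhancing and captioning would still happen afterwards; this only covers the first cut.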
>>
good quality src = good training
it's going fast but i think it may take a bit of time for it to cook
>>
>>109142174
>>109142112
i've never tried it. can't really think of what i'd train one on.
>>
>>109142186
you could probably train a style lora using some of your outputs lel
i've never done a style lora cuz i just use whatever the models are capable of, but i have a specific look i want for the girl
>>
File: 00035-1598581741.png (3.09 MB, 1728x2880)
3.09 MB PNG
>>
>>
File: debo_k_00040_.png (2.83 MB, 1792x977)
2.83 MB PNG
>>109141937
lmao this gen is awesome

>>109142002
I'm 100% for this mohawk arc

>>109142036
>I know I dropped my eyeball around here somewhere....
>>
File: 00038-4169151840.png (2.12 MB, 1152x1920)
2.12 MB PNG
>>109142387
added it to my favored hair style wildcard, just a few styles in it, pixie, bob, and hime, mainly
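For anyone who hasn't used wildcards: a rough sketch of what a hair style wildcard does, assuming the common `__name__` placeholder convention. The `expand_wildcards` function is hypothetical and not taken from any particular node pack, and "mohawk" being the style that got added is a guess from the reply chain.

```python
import random
import re

# Styles named in the post; "mohawk" is assumed from the reply context.
WILDCARDS = {"hair": ["pixie cut", "bob cut", "hime cut", "mohawk"]}


def expand_wildcards(prompt: str, wildcards: dict, rng: random.Random) -> str:
    """Replace each __name__ placeholder with a random entry from wildcards[name].

    Unknown placeholders are left untouched.
    """
    def pick(match):
        options = wildcards.get(match.group(1))
        return rng.choice(options) if options else match.group(0)

    return re.sub(r"__([A-Za-z0-9-]+)__", pick, prompt)


rng = random.Random(0)  # seeded so runs are repeatable
result = expand_wildcards("1girl, __hair__, space station", WILDCARDS, rng)
print(result)
```

Each gen then gets one of the listed styles at random, so adding a style to the list is all it takes to put it in rotation.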
>>
i think she's done
that didn't seem to take long
>>
File: debo_k_00041_.png (2.19 MB, 1792x977)
2.19 MB PNG
>>109142640
awaken, kreagirl
>>
File: 00041-2935961542.jpg (544 KB, 1728x2880)
544 KB JPG
>>
File: 000000_75761_.png (2.92 MB, 1965x818)
2.92 MB PNG
omg >https://huggingface.co/ilkerzgi/fal-Krea-2-Style-LoRAs
>>
>>109142830
sloppedy slop
you can just prompt for 99% of that
>>
File: debo_k_00046_.png (2.63 MB, 1792x977)
2.63 MB PNG
>>109142830
lora overload
>>
>>109142863
it'd be good for those slop saas things like freepik or whatever the fuck they call it now
>>
File: 000000_stylez_.png (373 KB, 488x686)
373 KB PNG
>>109142863
>>109142872
I have a styles node running, same?
>gens look better than anima
>>
File: IMG_2311.jpg (74 KB, 934x2000)
74 KB JPG
>>109142296
>>
>>
File: pixel-0004-3042867688.png (1.13 MB, 3072x2048)
1.13 MB PNG
>>
File: debo_k_00049_.png (2.12 MB, 1792x977)
2.12 MB PNG
>>109142974
literally me
>>
>>109142809
she's near
>>109142891
probably better since the prompts used on those nodes are more accurate
>>
File: pixel-0006-1359450471.png (438 KB, 2048x2048)
438 KB PNG
>>
File: debo_k_00051_.png (2.62 MB, 1792x977)
2.62 MB PNG
>>
File: 00047-4069428809.jpg (341 KB, 2048x2048)
341 KB JPG
>>
File: pixel-0009-812956108.png (804 KB, 2048x2048)
804 KB PNG
>>
File: debo_k_00053_.png (2.45 MB, 1792x977)
2.45 MB PNG
krea2 still falls victim to the same "planets everywhere" syndrome as other models

>>109143151
heckin cute
>>
>>109143158
Only one sun. That's pretty good.
>>
>>
File: debo_k_00054_.png (2.53 MB, 1792x977)
2.53 MB PNG
>>109143262
I don't think I've really had issues with multiple stars when I've done these space gens with other models. the biggest problems are usually 1) way too many planets or 2) really goofy scaling (picrel). but these krea2 tests def make me wanna tune this prompt in and pump out a bunch of these

oh your gens, what are your prompts for realism styling? I was noticing it didn't like doing realism with some of my other tests
>>
>>109143289
>what are your prompts for realism styling
Still trying to figure it out. I get 2-3 out of 10 which are 3D art or drawings. It's not bad. Some new loras might help. TBD.

Yeah, you're right. It was almost impossible to gen a spaceship without having a planet nearby.
>>
File: 00168-4261606951.png (1.49 MB, 1024x1024)
1.49 MB PNG
>>
File: debo_k_00058_.png (2.73 MB, 1792x977)
2.73 MB PNG
>>
>>
File: debo_k_00059_.png (2.39 MB, 1792x977)
2.39 MB PNG
>>109143391
based
i mean, spaced
>>
File: 000000_75769_.png (2.86 MB, 1977x824)
2.86 MB PNG
>>109143064
The station is perfect.
>>
File: debo_k_00060_.png (2.89 MB, 1792x977)
2.89 MB PNG
>>109143499
yeah, krea2 was understanding a lot of stuff that other models didn't. space stations were usually a mess or absent
>>
>>
>>109143570
>>109143499
>>109143429
nice
>>
File: debo_k_00061_.png (2.9 MB, 1792x977)
2.9 MB PNG
>>109143596
:)
>>
File: 000000_75780_.png (2.8 MB, 1965x818)
2.8 MB PNG
>>109143596
TY
>>
>>109143596
Thank you and gm :D
>>
File: debo_k_00063_.png (2.81 MB, 1792x977)
2.81 MB PNG
>>109143696
>this is green leader, beginning my attack run. watch for cannon fire on the flank
>>
File: 000000_75786_.png (2.86 MB, 1680x960)
2.86 MB PNG
>>109143770
>Green leader checking in, commies on my left, just entered the chaotic atmosphere!
>>
File: 000000_75791_.jpg (3.3 MB, 4868x2782)
3.3 MB JPG
>>
File: debo_k_00067_.png (2.64 MB, 1792x977)
2.64 MB PNG
>>
me and my 1girl
>>
>>
File: debo_k_00078_.png (2.27 MB, 1792x977)
2.27 MB PNG
>>
>>
>>
File: debo_k_00079_.png (2.5 MB, 1792x977)
2.5 MB PNG
>>
File: debo_k_00085_.png (2.57 MB, 1792x977)
2.57 MB PNG
>>
i may have too many feet pics in my sources
>>
still too clean, but better than zit ever did. can't remember if i ever tried nb2
>>
File: debo_k_00086_.png (2.31 MB, 1792x977)
2.31 MB PNG
>>109144577
maybe getting screwed over by the distillation
>>
>>109144594
probably
>>
>>
meant to say it doesn't seem to understand bokeh/tilt shift very well
>>
File: debo_k_00091_.png (2.5 MB, 1792x977)
2.5 MB PNG
>>
jebus there's already a million slop krea2 loras on civit, not even counting >>109142830
>>
bruh
>>
>>109144686
welcome to the future. it's claude's world now we just live in it
>>
File: debo_k_00093_.png (2.77 MB, 1792x977)
2.77 MB PNG
>>109144686
fal shit out a flood of loras for some reason
https://huggingface.co/ilkerzgi/fal-Krea-2-Style-LoRAs
>>
>>109144705
slooper gonna slop
>>
spidergoat
>>
File: debo_k_00095_.png (2.58 MB, 1792x977)
2.58 MB PNG
>>
>>
>>
File: debo_k_00099_.png (3 MB, 1792x977)
3 MB PNG
>>
>>
>>
File: debo_k_00102_.png (2.76 MB, 1792x977)
2.76 MB PNG
>>
>>109144886
Trypophobia seems like a great idea for a workflow
>>
>>109144922
personally i'm starting to get massive-foot phobia
>>
File: debo_k_00105_.png (2.87 MB, 1792x977)
2.87 MB PNG
>>109144922
do I have any veto powers I can invoke
>>
fin
>>
baking
>>
>>109144957
>>109144957
>>109144957


