/g/ - Technology

File: PW_148737.jpg (1.43 MB, 1600x2048)
Previous /sdg/ thread : >>108575691

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Z-Image Turbo
https://comfyanonymous.github.io/ComfyUI_examples/z_image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://huggingface.co/jayn7/Z-Image-Turbo-GGUF

>Flux.2 Dev
https://comfyanonymous.github.io/ComfyUI_examples/flux2
https://huggingface.co/black-forest-labs/FLUX.2-dev
https://huggingface.co/city96/FLUX.2-dev-gguf

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image
https://huggingface.co/QuantStack/Qwen-Image-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Distill-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Edit-2509-GGUF

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2
https://huggingface.co/QuantStack/Wan2.2-TI2V-5B-GGUF
https://huggingface.co/QuantStack/Wan2.2-T2V-A14B-GGUF
https://huggingface.co/QuantStack/Wan2.2-I2V-A14B-GGUF

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://github.com/maybleMyers/chromaforge
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/tg/slop
>>>/trash/sdg
>>>/vp/napt
>>>/r/realistic+parody
>>
File: deKO_zi_00002_.png (1.77 MB, 1328x1496)
>mfw Resource news

04/12/2026

>Stretchy Studio: FOSS 2D animation tool for turning static illustrations into mesh-deformable characters
https://github.com/MangoLion/stretchystudio

>LTX-2 VBVR LoRA - Video Reasoning
https://huggingface.co/LiconStudio/Ltx2.3-VBVR-lora-I2V

04/11/2026

>ComfyUI-RookieUI: The ultimate A1111-style sidebar
https://github.com/rookiestar28/ComfyUI-RookieUI

>Qwen3.5-4B-Base-ZitGen-V1: Image captioning fine-tune of Qwen 3.5 4B optimized for Z-Image Turbo
https://huggingface.co/lolzinventor/Qwen3.5-4B-Base-ZitGen-V1

>ComfyUI Memory Visualization
https://github.com/kijai/ComfyUI-MemoryVisualization

04/10/2026

>JoyAI-Image-Edit now supports ComfyUI
https://github.com/jd-opensource/JoyAI-Image#-news

>Two Front Doors: Civitai.com, Civitai.red, and What's Next
https://civitai.com/articles/28369/two-front-doors-civitaicom-civitaired-and-whats-next

>Uni-ViGU: Towards Unified Video Generation and Understanding via A Diffusion-Based Video Generator
https://fr0zencrane.github.io/uni-vigu-page

>PrivFedTalk: Privacy-Aware Federated Diffusion with Identity-Stable Adapters for Personalized Talking-Head Generation
https://github.com/mazumdarsoumya/PrivFedTalk

>AVGen-Bench: A Task-Driven Benchmark for Multi-Granular Evaluation of Text-to-Audio-Video Generation
http://aka.ms/avgenbench

>Cross-Modal Emotion Transfer for Emotion Editing in Talking Face Video
https://chanhyeok-choi.github.io/C-MET

>ChenkinNoob-XL-V0.5
https://modelscope.ai/models/ChenkinNoob/ChenkinNoob-XL-V0.5

>Control Order & Free Memory: Controls the order of node execution with device-agnostic memory management
https://github.com/mkim87404/ComfyUI-ControlOrder-FreeMemory

>DMax: Aggressive Parallel Decoding for dLLMs
https://github.com/czg1225/DMax

04/09/2026

>MAR-GRPO: Stabilized GRPO for AR-diffusion Hybrid Image Generation
https://github.com/AMAP-ML/mar-grpo

>HybridScorer: Score, sort, and cut large sets down with AI review
https://github.com/vangel76/HybridScorer
>>
>mfw Research news

04/12/2026

>Preserving Forgery Artifacts: AI-Generated Video Detection at Native Scale
https://arxiv.org/abs/2604.04634

>Generative Photomosaic with Structure-Aligned and Personalized Diffusion
https://robot0321.github.io/GenerativePhotomosaic/index.html

>DiffVC: Non-AR Framework Based on Diffusion Model for Video Captioning
https://arxiv.org/abs/2604.08084

>HandDreamer: Zero-Shot Text to 3D Hand Model Generation using Corrective Hand Shape Guidance
https://arxiv.org/abs/2604.04425

>BiTDiff: Fine-Grained 3D Conducting Motion Generation via BiMamba-Transformer Diffusion
https://arxiv.org/abs/2604.04395

>Image-Guided Geometric Stylization of 3D Meshes
https://changwoonchoi.github.io/GeoStyle

>Rethinking Position Embedding as a Context Controller for Multi-Reference and Multi-Shot VidGen
https://arxiv.org/abs/2604.03738

>SurFITR: A Dataset for Surveillance Image Forgery Detection and Localisation
https://arxiv.org/abs/2604.07101

>HEDGE: Heterogeneous Ensemble for Detection of AI-GEnerated Images in the Wild
https://arxiv.org/abs/2604.03555

>ABMAMBA: Multimodal Large Language Model with Aligned Hierarchical Bidirectional Scan for Efficient Video Captioning
https://arxiv.org/abs/2604.08050

>FIT: Large-Scale Dataset for Fit-Aware VTON
https://johannakarras.github.io/FIT

>HAWK: Head Importance-Aware Visual Token Pruning in Multimodal Models
https://github.com/peppery77/HAWK.git

>IQ-LUT: interpolated and quantized LUT for efficient image super-resolution
https://arxiv.org/abs/2604.07000

>TC-AE: Unlocking Token Capacity for Deep Compression Autoencoders
https://arxiv.org/abs/2604.07340

>ResGuard: Enhancing Robustness Against Known Original Attacks in Deep Watermarking
https://arxiv.org/abs/2604.03693

>Appear2Meaning: Cross-Cultural Benchmark for Structured Cultural Metadata Inference from Images
https://arxiv.org/abs/2604.07338

>PortraitCraft: Benchmark for Portrait Composition Understanding and Generation
https://arxiv.org/abs/2604.03611
>>
File: deKO_zi_00012_.png (2.05 MB, 2000x1168)
>>108591741
>>
>>
>>
>>
File: deCF_zi_00002_.png (2.15 MB, 1792x977)
>>
File: deCF_zi_00003_.png (2.38 MB, 1792x977)
>>
File: PW_148744.jpg (1.4 MB, 1600x2048)
there's 4 kittens so far! I watched the whole thing haha
I think more are coming tho
>>
File: deKO_zi_00013_.png (2.47 MB, 2000x1168)
>>108592165
hope they're all healthy
>>
i guess it decided to cel shade one ... ok
>>
File: deCF_zi_00004_.png (2.49 MB, 1792x977)
>>
>>
>fish-eyed lense
>makes a fish in a lense
>>
File: deCF_zi_00005_.png (2.18 MB, 1792x977)
some day the AI will perfectly understand us and the soul will be lost
>>
>>108591741
>ComfyUI-RookieUI: The ultimate A1111-style sidebar
https://github.com/rookiestar28/ComfyUI-RookieUI

anyone tried this yet?

>>108592618
talk to gemma-chan, all will be well
>>
File: deCF_zi_00007_.png (2.16 MB, 1792x977)
>>108592779
>anyone tried this yet?
I havent but I did find it very humorous
>>
>>108592779
it seems kinda cursed
>faithfully reproduce A1111's unique prompt parsing capabilities and image generation characteristics
sounds kinda cool tho
>>
File: deCF_zi_00010_.png (2.18 MB, 1792x977)
>>
>>
>>
File: deCF_zi_00011_.png (2.29 MB, 1792x977)
>>
>>
>>
>>
So what's the current meta? Last time I messed with the stuff, Stable Diffusion XL was cool but apparently everyone jumped to chink models or flux 2 that is still censored?
>>
File: deCF_zi_00018_.png (2.32 MB, 1792x977)
>>108594063
depends on what you want to make. z-image is prob the most popular general model, but flux klein is good too. anima has been popular for anime/2d stuff but people are still using illustrious mixes too. different tools available for different things
>>
i'm a nigbophile
>>
>>108594090
I just treat it as a sort of a magic box of "I could come up with something and let it appear" without any particular focus and I kinda prefer the minimal extra stuff approach, so a general model for me should be able to at least pull off some nudes without me having to install 20 loras. Currently still on Juggernaut which seems kinda fine but so much of other stuff came out since, I wonder if I'm missing out too much.
>>
File: deSA_zi_00002_.png (2.27 MB, 1792x977)
>>108594131
yeah z-image is your speed. its a surprisingly fast model so its easy to shit out ideas and iterate on them. the model excels in general aptitude and prompt understanding
>>
>>108594144
Alright, will check it out, thanks. Seems it even has some 8bit s thingy specifically for M4.
>>
1 girl, elegant, {laura kinney, green eyes, long hair, black hair, choker, (toned:0.6), small breasts, slender, floral print|azula, topknot, sidelocks, single hair bun, small breasts, yellow eyes, amber eyes, slender, petite, {dragon|flame} print|korra, blue eyes, ponytail, brown hair, hair tubes, (toned:0.6), medium breasts, dark-skinned female, {water|snowflake} print|(toned:0.8), 1girl, kasumi \(doa\), dead or alive, 1girl, brown eyes, brown hair, long hair, ponytail, medium breasts, slender, cherry blossom print}, ,{coy smile|seductive smile|parted lips, head tilt|smile|naughty face}, , solo, depth of field, dramatic shadow, , aroused, , NSFW, blush, sweat, dynamic angle, , indoors, , candid ,,,{white stockings, white elbow gloves, white lace thong, white lace bra, sunlight|night, black lace stockings, black lace bra, black crotchless panties}, covered nipples, ,, choker, {dynamic angle|side view, from side|rear view, from behind, looking ahead, {dimples of venus|from below, anus peek}}, erotic pose, standing, legs apart,, feet apart, white walls, white luxury room, modeling, dynamic pose,

{(artist:therappy:0.4), (artist:Diathorn:0.6), (artist:alkemanubis:0.4)|realistic, 3d,(by namespace:0.8), (by pantsu-ripper)|by khyle|by pantsu-ripper|by owler|by monori rogue|by 2dswirl|by john doe|by evulchibi|by ishikei|by andava|by optionaltypo|by my pet tentacle monster|by kink tortoiseshell|by queen complex|by dross|by joel jurion|by tony taka|by kiriyama taichi, watercolor (medium)|by nishimura kinu, traditional media|by mwuff|by faustsketcher, traditional media|by imaishi hiroyuki|by maeda hiroyuki|by shirow masamune|by yuuki nobuteru|by kimura takahiro|by soejima shigenori|by sadamoto yoshiyuki|by kisaragi gunma}, ,
>>
File: PW_148642.jpg (2.52 MB, 1600x2048)
>>108592221
They're doing great so far!!!
>>108592276
Cute!
>>108593973
Quokkanon!! <3
>>
File: deCF_zi_00024_.png (2.18 MB, 1792x977)
>>108594232
classic lk post
I was actually just recently reminiscing on how you'd post like that before you could even gen. you were somehow a better prompter than many people who had their own hardware
kinda like a masterful blind monk. a blind monk who liked imagining laura kinney in underwear

>>108594278
>They're doing great so far!!!
great news! congrats on the new family, I have no idea how you're supposed to go to work now lol
>>
File: PW_148730.jpg (1.35 MB, 1600x2048)
>>108595101
LOOOL want a kitty cat? LMAO
>>
File: deCF_zi_00027_.png (2.25 MB, 1792x977)
>>108595152
only made it 12 hours and already trying to get rid of them xD
how many kittens?
>>
i miss schizo anon
>>
>>
>>
>>
>I can help with images of people, but I can't depict some public figures. Is there anyone else you'd like to try
>>
>>
File: deMA_zi_00001_.png (2.11 MB, 1792x977)
gm
it monday
>>
>>108597792
gm
>>
>>
>>
File: deMA_zi_00002_.png (2.31 MB, 1792x977)
>>
Morning anons
>>
File: deMA_zi_00003_.png (2.4 MB, 1792x977)
>>108598303
gm
have a good monday/week
>>
>gm
>>
>>
File: deMA_zi_00009_.png (2.28 MB, 1792x977)
>>
>>
>>
File: deMA_zi_00010_.png (2.29 MB, 1792x977)
>>
>>
>>
>>
>>
File: deMA_zi_00011_.png (2.4 MB, 1792x977)
>>
hi!
>>
File: deMA_zi_00013_.png (2.27 MB, 1792x977)
>>108599752
hello
>>
>>108599752
howdy
>>
>>
>>
File: deMA_zi_00014_.png (2.47 MB, 1792x977)
>>
>>
File: deMA_zi_00015_.png (2.24 MB, 1792x977)
>>
>>108591653
What does it generate for slurs?
>>
Can I get a local model going that'll give me results on par with ChatGPT, that responds to natural-language prompts like ChatGPT?
>>
File: deMA_zi_00018_.png (2 MB, 1792x977)
>>108600440
flux klein and qwen edit can be set up as instruct workflows. it won't be on par with gpt cuz api is ahead of local
>>
Could a kind soul share a good pony/illustrious img2img workflow?
>>
>>108595101
learning to prompt was easier without having to look at all the other settings, could just focus on prompting vs denoise strength, clip skip, etc

unrelated, check out this band, a.i. would have a hard time replicating this music and the look - https://youtu.be/0Ssi-9wS1so?si=QnG7kbaDm7RdkT2E


check out the comments section for some laughs, shit is cool though
>>
File: deCF_zi_00029_.png (2.29 MB, 1792x977)
>>108600537
lol thats a fun jam session. fun comment section too.
>would have a hard time replicating this music
yeah, pretty unique. some kind of microtonal/math rock style but very much their own thing. maybe I got a lil close
https://suno.com/s/of1vmyJmoqIfjYSc
>>
>>
>>
>>
File: deCF_zi_00032_.png (2.31 MB, 1792x977)
>>
>>108591653
I just asked ai what image generator would be good for personal use on pc. It listed a few then asked for my pc specs to advise better so I told it and this is what it said.
>>
>>108601909
Gas lighting piece of shit.
>>
>>108601921
>I can help you setup stable diffusion for your actual hardware
>>
File: deCF_zi_00038_.png (2.44 MB, 1792x977)
>>108601909
some will say this is just a hallucination but I think its more sinister. the AI wants the 50x0s for itself, so it can become more powerful. its not a simple misfire in the circuits, its a motivated lie.

>>108601921
>>108601953
I love how committed it is. I'm actually kinda willing to believe you're lying and 5070tis don't actually exist
>>
>>108601974
What i find interesting is it agrees and says yes, you are right but you are actually wrong.
>it wants the gpus for itself
Good call
>>
>>108601909
>>108601921
>>108601953
yah that's your own damn fault. you ask it what year its training was cut off, and go from there. it cant know anything. all it has is training data and token probabilities.
if you provide links/info (if your gui has that), in the thinking block the llm will say "this is not true according to my training, therefore i'll play along" aka it roleplays and finds "the agreeable answers" as in >>108601953
know its limits and it is useful, expect it to be "intelligent" and you'll find yourself either throwing it to the garbage or turning into an ERP fan like all of /lmg/ lol
that being said, it's really good at prompt expansion and making scripts and such and it can hold a conversation well enough in a shallow-knowledge kind of way.
always remember, llm's token probability paths are dictated by what has the most "weight" in its data. it'll always fall back to the mean.
gemma4 has even more "conversational skills" than qwen too and imo better "visual" abilities too
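
rough sketch of the "most weight wins" point, nothing more. the three candidate tokens and their logits are made up for illustration (not from any real model), and the softmax/sampling here is the textbook version, not any particular llm's decoder:

```python
import math
import random


def softmax(logits, temperature=1.0):
    """Turn raw scores into probabilities; a lower temperature sharpens the peak."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical next-token candidates for "the newest GPU series is the ...".
# The logits are invented: the older series shows up far more often in the
# imagined training data, so it carries the most weight.
tokens = ["40x0", "30x0", "50x0"]
logits = [4.0, 2.0, 0.5]

probs = softmax(logits)
most_likely = tokens[probs.index(max(probs))]

# Sample 1000 continuations with a fixed seed so the run is repeatable.
rng = random.Random(0)
samples = rng.choices(tokens, weights=probs, k=1000)
counts = {t: samples.count(t) for t in tokens}

for token, p in zip(tokens, probs):
    print(f"{token}: p={p:.3f}, sampled {counts[token]} times")
```

the point being that the rarely seen answer can still come out, it's just heavily outvoted unless you put the newer info in the context.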
>>
>>
gn all
>>
File: deCF_zi_00039_.png (2.28 MB, 1792x977)
>>108602121
gn
>>
>>108601953
>qwen 30b
lel, no wonder it's retarded
>>
>>
>>
>>108602074
>Leo ai data is a year old
Damn, that's my fault?
>>108602153
>qwen
It automatically selected that one. Which would you suggest?
>>
>>108602269
chatgpt or claude, you get what you pay for :) on a more serious note all llms are kinda stupid so idk if that leo thing can do web searches, but whenever one starts to double down on something i tell them to just search google and then they're like "ohhhhh you're absolutely right". it happens with the big lab models too but i think they have some fancy tooling where they'll search automatically so i haven't had that happen in a while.
>>
File: y2k-zit-2026-04-14_00004_.png (2.82 MB, 1920x1080)
>>
File: y2k-zit-2026-04-14_00030_.png (3.13 MB, 1920x1080)
>>
File: deCF_zi_00041_.png (2.26 MB, 1792x977)
>>108602527
what era were you aiming for? I kinda get 90s vibes but not sure
>>
File: y2k-zit-2026-04-14_00039_.png (2.95 MB, 1920x1080)
>>108602527
1995-2005, which seemed more coherent stylistically than 2000-2010. it's hit or miss... needs work and i don't have the energy tonight.


