[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: 1779160809319074.png (1.8 MB, 1920x1080)
1.8 MB PNG
Previous /sdg/ thread : >>108853572

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Z-Image
https://comfyanonymous.github.io/ComfyUI_examples/z_image
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Flux.2 Dev/Klein
https://comfyanonymous.github.io/ComfyUI_examples/flux2
https://huggingface.co/black-forest-labs/FLUX.2-dev
https://huggingface.co/black-forest-labs/FLUX.2-klein-4B
https://huggingface.co/black-forest-labs/FLUX.2-klein-9B

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Anima
https://huggingface.co/circlestone-labs/Anima

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/aco/sdg
>>>/b/degen
>>>/d/ddg
>>>/e/edg
>>>/gif/vdg
>>>/h/hdg
>>>/r/realistic+parody
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vp/napt
>>>/vt/vtai

OP https://rentry.co/twkuk8tz
>>
>>
>>
>>
File: comp_01.jpg (1.41 MB, 3845x1124)
1.41 MB JPG
>>108860784
>>
File: comp_02.jpg (2.46 MB, 3845x1124)
2.46 MB JPG
>>108860784
2
>>
File: comp_03.jpg (1.26 MB, 3845x1124)
1.26 MB JPG
>>108860784
3. i'm partial to the er_sde for this workflow. i'll try the bong thing on a zit later, been usin euler simple forever
>>
>>
>>
>mfw Resource news

05/19/2026

>HierEdit: Region-Aware Hierarchical Diffusion for Efficient High-Resolution Editing
https://peteryyzhang.github.io/HierEdit-page

>HighSync: High-Quality Lip Synchronization via Latent Diffusion Models
https://github.com/saeed5959/high_sync

>CAM-VFD: Cross-Attention Multimodal Video Forgery Detection
https://github.com/Hoda-Osama/CAM-VFD/tree/main

>WOW-Seg: A Word-free Open World Segmentation Model
https://github.com/AAwcAA/WOW-Seg-Meta

>Vision Inference Former: Sustaining Visual Consistency in Multimodal Large Language Models
https://github.com/Dong-Xinpeng/VIF

>A More Word-like Image Tokenization for MLLMs
https://github.com/snuviplab/DiVT

>EchoSR: Efficient Context Harnessing for Lightweight Image Super-Resolution
https://github.com/funnyWang-Echoes/EchoSR

>Forget-It-All: Multi-Concept Machine Unlearning via Concept-Aware Neuron Masking
https://github.com/kaiyuan02415/Forget-It-All

>LTX OmniNFT RL-Lora from Kijai
https://huggingface.co/Kijai/LTX2.3_comfy/tree/main/loras

>Google's SynthID AI Watermarking Tech Adopted by OpenAI, Nvidia, And More
https://arstechnica.com/google/2026/05/googles-synthid-ai-watermarking-tech-is-being-adopted-by-openai-nvidia-and-more

05/18/2026

>Lance: Unified Multimodal Modeling by Multi-Task Synergy
https://lance-project.github.io

>GridLoraTester: Workbench for character LoRA training on FLUX.2: dataset curation
https://github.com/Mandrakia/GridLoraTester

>FLUX MCP server
https://docs.bfl.ai/api_integration/mcp_integration

>Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimization
https://shredded-pork.github.io/Flash-GRPO.github.io

>LongLive2.0 5B BF16: AR-trained Wan2.2-TI2V-5B generator
https://huggingface.co/Efficient-Large-Model/LongLive-2.0-5B

>DealMaTe: Multi-Dimensional Material Transfer via Diffusion Transformer
https://github.com/haha-lisa/DealMaTe

>Deep Pre-Alignment for VLMs
https://github.com/THUMAI-Lab/Deep-Pre-Alignment
>>
>mfw Research news

05/19/2026

>VISTA: Triplet-Supervised Video Style Transfer with Diffusion Transformers
https://arxiv.org/abs/2605.17312

>The Silent Brush: Evaluating Artistic Style Leakage in AI Art Generation
https://arxiv.org/abs/2605.17500

>Curriculum Group Policy Optimization: Adaptive Sampling for Unleashing the Potential of Text-to-Image Generation
https://arxiv.org/abs/2605.17807

>Omni-Customizer: End-to-End MultiModal Customization for Joint Audio-Video Generation
https://arxiv.org/abs/2605.17488

>AutoRubric-T2I: Robust Rule-Based Reward Model for Text-to-Image Alignment
https://arxiv.org/abs/2605.17602

>HL-OutPaint: Coarse-to-Fine Video Outpainting for High-Resolution Long-Range Videos
https://arxiv.org/abs/2605.17543

>Image-to-Video Diffusion: From Foundations to Open Frontiers
https://arxiv.org/abs/2605.17248

>Beyond Point-Wise Matching: Structural Representation Alignment for Accelerating Diffusion Transformers
https://arxiv.org/abs/2605.16949

>DreamEdit3D: Personalization of Multi-View Diffusion Models for 3D Editing
https://arxiv.org/abs/2605.16990

>Stabilizing, Scaling & Enhancing MeanFlow for Large-scale Diffusion Distillation
https://arxiv.org/abs/2605.17834

>Temporal Aware Pruning for Efficient Diffusion-based Video Generation
https://arxiv.org/abs/2605.17837

>Dual-Rate Diffusion: Accelerating diffusion models with an interleaved heavy-light network
https://arxiv.org/abs/2605.18190

>FrequencyBooster: Full-Frequency Modeling for High-Fidelity Pixel Diffusion
https://arxiv.org/abs/2605.17759

>Dimension-Free Convergence of Discrete Diffusion Models: Adjoint Equations Induce the Right Space
https://arxiv.org/abs/2605.17232

>SAS: Semantic-aware Sampling for Generative Dataset Distillation
https://arxiv.org/abs/2605.18012

>Latent Action Control for Reasoning-Guided Unified Image Generation
https://arxiv.org/abs/2605.16961

>Generation Navigator: A State-Aware Agentic Framework for Image Generation
https://arxiv.org/abs/2605.17969
>>
>>
>>
gn all
>>
File: debo_cs-f_anima1_00059_.png (2.74 MB, 1792x1140)
2.74 MB PNG
>>108862852
>>108862880
>>108862889
alright, time to ogle these

>>108863098
gn
>>
File: debo_cs-f_anima1_00060_.png (2.68 MB, 1792x1140)
2.68 MB PNG
>>108862889
this one isnt labeled. this was euler simple?
>>
>>108863234
oh weird, i did it in photoshop shoulda made 5.5 do a script for it, but they were all the same settings. i did some runs on models i need to prepare too, so euler bong_tangent is OK but probably depends on the prompting
>>
How would you go around creating a nice looking RPG character sheet?

I have the contents and don't need any character pictures. Just make the text and tables look nice. One that fits the character.

Also it is not a standard D&D sheet. A custom system.

Right now, the most obvious way is to create a rought draft manually, and then have image edit add the nice design that fits the character.
>>
File: file.png (2.77 MB, 1086x1448)
2.77 MB PNG
>>108863629
big lab imagen like Nano Banana 2 or Gpt Image 2, those are the best at heavy text nb2 is like $0.06 per image so it's not free but it ain't pricy either. i told gpt-image-2 "make a character sheet w/ arpg stats i don't care what else" but if you have it all specified out you could get pretty fancy with it
>>
>>108863646
Very interesting.
No that I think of it, local is not going to cut it. Text and structure are ver parameter-hungry.
>>
File: debo_cs-f_anima1_00061_.png (2.55 MB, 1792x1140)
2.55 MB PNG
>>108863646
its so over for local
>>
>>108863759
it always was. wb
>>
>>
File: debo_cs-f_anima1_00062_.png (2.66 MB, 1792x1140)
2.66 MB PNG
>>
>>108863646
>Gpt Image 2
This works amazing.
But gets blocked when I try to generate an R-18 character sheet. Bummer.
>>
File: debo_cs-f_anima1_00063_.png (2.8 MB, 1792x1140)
2.8 MB PNG
>>108863840
such is the curse of saas. it can do anything, except the things you want it to do
>>
>>108863873
Shame.

Now I gotta remake this macro with cute anime girl.

Maybe I can finnagle it by having big AI make me an empty sheet template, and then have local fill in the blanks? But local can't even do text properly. Let alone complex layout. Fuck.
>>
>>
File: debo_cs-f_anima1_00065_.png (2.76 MB, 1792x1140)
2.76 MB PNG
>>108863902
you could have saas cook up the sheet in a PG way then use a local edit model to give it tiddies
>>
>>
File: debo_cs-f_anima1_00067_.png (2.79 MB, 1792x1140)
2.79 MB PNG
>>108863933
how close are those twitter tags to the artist you're using?
>>
>>108863944
good question. i don't have any artist specified in the wildcards but who knows what deepseek did...
masterpiece, very aesthetic, score_9, newest, highres,chibi anime cel illustration, smooth bold outlines, flat bright colors, hard-edged shadows, simplified shapes, adult female with ample curves, low-tied long hair with drill sidelocks, black sclera, playful expression, three-quarter view, diving posture with hands on knees, wearing provocative court attire, amidst clouds and shifting sands, one eye bearing an illuminati symbol, beeswax seals stamped with unfamiliar marks near faintly glowing Norse runes, full-frame composition

my guess? chicks posing like that usually have socials watermarks. the latents giveth and the latents taketh away
>>
File: debo_cs-f_anima1_00068_.png (2.59 MB, 1792x1140)
2.59 MB PNG
>>108863956
classic chicks and their watermarks
>>
should probably put that shit in the negatives. been a minute since i had to worry about those
>>
>>
File: debo_cs-f_anima1_00070_.png (2.79 MB, 1792x1140)
2.79 MB PNG
>>108863988
i keep forgetting negatives exist. spent too much time with zimg turbo
>>
File: 1895025810015826.png (3.46 MB, 1156x1722)
3.46 MB PNG
>>
File: 000000_70925_.png (2.21 MB, 1532x1048)
2.21 MB PNG
>>
File: debo_cs-f_anima1_00072_.png (2.91 MB, 1792x1140)
2.91 MB PNG
>>108864103
interesting
>>
>>
>>
>>
>>
i miss schizo anon
>>
>>
>>
I'm kind of sad, vageen heavy prompts are almost non existing
>>
>>108865624
>>>/r/realistic+parody
>>
File: 00000-3379244253.jpg (1.06 MB, 1536x1536)
1.06 MB JPG
>>
>>
File: 00001-3246186562.jpg (1.04 MB, 1344x1728)
1.04 MB JPG
>>
>>
>>
>>
>>
File: debo_cs-f_anima1_00073_.png (2.79 MB, 1792x1140)
2.79 MB PNG
>>
>>
File: debo_cs-f_anima1_00074_.png (2.85 MB, 1792x1140)
2.85 MB PNG
now thats a hand
>>
>>108866731
idk man, it just keeps getting worse lel
>>
File: debo_cs-f_anima1_00075_.png (2.68 MB, 1792x1140)
2.68 MB PNG
>>108866831
fun not allowed
>>
>>108866848
whatever floats your boat in terms of fun i guess
i like seeing pretty 1girl doing funny with ani-pals
if the model doesnt show me that, it gets dropped
>>
File: debo_cs-f_anima1_00076_.png (2.92 MB, 1792x1140)
2.92 MB PNG
>>
gm
>>
>>108866731
Even by your "standards" that's super slop
>>
>>108866889
gm
>>
File: debo_cs-f_anima1_00078_.png (2.47 MB, 1792x1140)
2.47 MB PNG
>>108866889
gm

>>108866892
sorry you're mad
>>
>gm
>>
It's interesting how you can gen with very narrow proportions and still get a decent image.
>>
>>108866944
was that a "three side-by-side panels" type of prompt?
>>
File: debo_cs-f_anima1_00079_.png (2.72 MB, 1792x1140)
2.72 MB PNG
>>108866944
weird dimensions is always fun. depending on the model, you can sometimes get very unique things from weird dimensions. we had an anon for a bit who did very very wide landscapes, which were very cool
>>
>>108866966
now do anima with tall dimensions/multipanel lel
>>
File: debo_cs-f_anima1_00080_.png (2.73 MB, 1792x1140)
2.73 MB PNG
>>108867000
I might add that to the list later to see what it does
>>
>>
>>
>>
File: 00004-294175878.jpg (1.1 MB, 1152x3456)
1.1 MB JPG
>>
>>
I am installing Easy Diffusion and trying out local image generation for the first time. What to expect?
>>
File: 00006-4234640722.jpg (1.06 MB, 3456x1152)
1.06 MB JPG
>>108867270
unbounded fun
>>
>>108867270
1girls
>>
File: debo_cs-f_anima1_00081_.png (2.43 MB, 1792x1140)
2.43 MB PNG
>>108867270
addiction
>>
>>
>>108867277
>>108867285
>>108867293
Easydiffusion is disgusting and severely limited, nvm
>>
>>108867414
come to the dark side and install comfy. it's only mostly named ironically... it's not so bad.
>>
File: 00008-725080685.jpg (2.09 MB, 2016x2592)
2.09 MB JPG
>>
>>
>>
>>108866960
Yes. Three separate gens copy/pasted into one using xvview mp.

>>108866966
yes. the "seen from behind" must be interesting.
>>
File: debo_cs-f_anima1_00083_.png (3.02 MB, 1792x1140)
3.02 MB PNG
>>
>>
>>
File: debo_cs-f_anima1_00085_.png (2.69 MB, 1792x1140)
2.69 MB PNG
>>108867925
this deserves an upscale
>>
File: ComfyUI_00017_.jpg (1.21 MB, 4800x1664)
1.21 MB JPG
>>108867963
rtx upscale
took longer to save than upscale lol
>>
File: 00010-2328029523.jpg (1.6 MB, 2592x2016)
1.6 MB JPG
>>
>>
File: debo_cs-f_anima1_00086_.png (2.58 MB, 1792x1140)
2.58 MB PNG
>>108867984
based

>>108867986
love this

>>108868037
the first chroma girl in history
>>
File: 00012-1260196396.jpg (1.44 MB, 2592x2016)
1.44 MB JPG
>>
>>108868125
she's a time traveller now
>>
>>
File: 00014-963094721.jpg (1.02 MB, 1152x3456)
1.02 MB JPG
>>
>>
File: 00016-4610538.jpg (1.86 MB, 2304x2304)
1.86 MB JPG
>>
>>
File: 43654654675.jpg (174 KB, 1024x1024)
174 KB JPG
>>
File: debo_cs-f_anima1_00087_.png (2.92 MB, 1792x1140)
2.92 MB PNG
>>108868694
can you give him a jar of scub
>>
File: 00017-2839510173.png (2.83 MB, 1344x1728)
2.83 MB PNG
>>108868709
whatever scub is, the civitai generator now seems frozen.
i have been using civit for the occasional flux dev gen, lately.
>>
>>
File: debo_cs-f_anima1_00090_.png (2.92 MB, 1792x1140)
2.92 MB PNG
>>108868823
>the civitai generator now seems frozen.
meaning broken or turned off?
civit seems to be having lots of problems lately
>>
>>
File: 465546546.jpg (154 KB, 1024x1024)
154 KB JPG
>>108868894
was stuck on 'generating', but now works. this is a jar of scub, i guess
>>
File: debo_cs-f_anima1_00091_.png (2.83 MB, 1792x1140)
2.83 MB PNG
>>108868916
thanks. that could be scub
>>
>>
>>
>>
File: 00021-3401221313.jpg (1.2 MB, 2304x2304)
1.2 MB JPG
>>
>>
Afternoon anons
>>
File: debo_cs-f_anima1_00092_.png (2.88 MB, 1792x1140)
2.88 MB PNG
>>
>ga
>>
File: 00022-268162525.jpg (786 KB, 3072x1024)
786 KB JPG
>>108869281
afternoon
>>
>>
>>
File: 00024-1815171712.jpg (1.38 MB, 1800x1800)
1.38 MB JPG
>>
File: debo_cs-f_anima1_00094_.png (2.64 MB, 1792x1140)
2.64 MB PNG
>>108868984
>>
>>
>>108869478
hi there
>>
File: 00025-1362711709.jpg (1.42 MB, 1920x2456)
1.42 MB JPG
>>
>>
>>
File: 00026-3483876447.jpg (2.3 MB, 1920x2272)
2.3 MB JPG
>>
>>
>>
File: 00028-4287572478.png (3.62 MB, 1920x1464)
3.62 MB PNG
>>
>>108869838
oswalt shooting?
>>
>>108869885
looks like it. hmm. AI trying to tell us something?
>>
>>108869885
>>108870187
ok good it wasnt just me seeing that.
>>
>>
>>108869838
>>108869885
>>108870187
>>108870206
yah it is, here's the original with some blur
>>
gn all
>>
File: 00770-2884799089.png (2.26 MB, 1024x1024)
2.26 MB PNG
>>
i miss schizo anon
>>
>>
File: 00001-3163062873.jpg (1.08 MB, 1728x2000)
1.08 MB JPG
>>
File: 00002-1563444576.jpg (1.41 MB, 1440x2144)
1.41 MB JPG
>>
>>
>>
>>
>>108872530
Thank you for the page 10 nigbobump.
>>
File: 01009-774896671.jpg (1.51 MB, 2048x2048)
1.51 MB JPG
>>
File: 0521_104518.jpg (727 KB, 3136x3564)
727 KB JPG
>>
>>
File: 01010-2932444744.png (3.67 MB, 1344x1728)
3.67 MB PNG
>>
>>
>>
File: 01011-1224880702.png (3.78 MB, 1344x1728)
3.78 MB PNG
>>
>>
File: debo_cr-s_anima1_00004_.png (3.51 MB, 1792x1194)
3.51 MB PNG
bad news for chromagirl. I've got a lot of imperfect gens to post today
>>
File: 01012-2499053853.png (3.77 MB, 1344x1728)
3.77 MB PNG
>>
>>
Morning Anons
>>
File: 01014-1422823216.png (3.34 MB, 1344x1728)
3.34 MB PNG
>>108874531
morning
>>
>>
>>
File: debo_cd-a_anima1_00004_.png (2.77 MB, 1792x1194)
2.77 MB PNG
>>
gm
>>
>>
baking
>>
>>108874794
>>108874794
>>108874794
>>
>>
Some time ago I've found a way to train embeddings from just a text prompt, no images, no Unet, no VAE.

I am not talking about regular embedding / textual inversion training. Those require images.

I didn't gave it much attention but I searched and asked around and I couldn't seem to find this existing anywhere so I just want to double check. Asking on reddit was a mistake I won't repeat.

Takes 2min, 1.6gb vram and produces a 70kb .safetensors that works normally in comfy via embedding:name.

use case is character identity locking and prompt bleed reduction.

Again, my questions are. Does this already exist and if it doesn't how useful do you find it?



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.