/g/ - /sdg/ - Stable Diffusion General - Technology


08/21/20	New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17	New trial board added: /bant/ - International/Random
10/04/16	New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]

Anonymous
/sdg/ - Stable Diffusion Gener(...) 07/03/24(Wed)02:11:23 No.101252896

File: 1719977660696950.png (3.03 MB, 1280x1920)

/sdg/ - Stable Diffusion General Anonymous 07/03/24(Wed)02:11:23 No.101252896

Previous /sdg/ thread : >>101245506

>Beginner UI local install
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io
Metastable: https://metastable.studio

>Local install
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
AMD GPU: https://rentry.org/sdg-link#amd-gpu
Intel GPU: https://rentry.org/sdg-link#intel-gpu

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Auto1111 forks
SD.Next: https://github.com/vladmandic/automatic
Anapnoe UX: https://github.com/anapnoe/stable-diffusion-webui-ux

>Run cloud hosted instance
https://rentry.org/sdg-link#run-cloud-hosted-instance

>SD3 info & download
https://rentry.org/sdg-link#sd3
https://education.civitai.com/quickstart-guide-to-stable-diffusion-3
https://aitracker.art/viewtopic.php?t=57

>Try online without registration
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://openmodeldb.info

>Animation
https://rentry.org/AnimAnon
https://rentry.org/AnimAnon-AnimDiff
https://rentry.org/AnimAnon-Deforum

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>View and submit GPU performance data
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Share image prompt info
4chan removes prompt info from images, share them with the following guide/site...
https://rentry.org/hdgcb
https://catbox.moe

>Discord
6wUwtcJsr2

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg

Anonymous
07/03/24(Wed)02:18:09 No.101252963

Anonymous 07/03/24(Wed)02:18:09 No.101252963

File: 1694585461251945.jpg (474 KB, 600x870)

474 KB JPG

please help me im useless and stupid >>101252944

Anonymous
07/03/24(Wed)02:18:23 No.101252964

Anonymous 07/03/24(Wed)02:18:23 No.101252964

File: 21694457859353946.png (3 MB, 2048x2048)

3 MB PNG

>>101252896
https://civitai.com/models/188114/rouge-the-bat

Anonymous
07/03/24(Wed)02:19:24 No.101252974

Anonymous 07/03/24(Wed)02:19:24 No.101252974

File: _DG_News_00031_.png (1.75 MB, 1560x896)

1.75 MB PNG

>mfw Resource news

07/02/2024

>16ch-VAE: Open source 16ch VAE reproduction for SD3
https://huggingface.co/AuraDiffusion/16ch-vae

>DiffIR2VR-Zero: Zero-Shot Video Restoration with Diffusion-based Image Restoration Models
https://jimmycv07.github.io/DiffIR2VR_web/

>MimicMotion wrapper for ComfyUI
https://github.com/kijai/ComfyUI-MimicMotionWrapper

>E.T. the Exceptional Trajectories: Text-to-camera-trajectory generation with character awareness
https://www.lix.polytechnique.fr/vista/projects/2024_et_courant/

>FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds
https://foleycrafter.github.io/

>FORA: Fast-Forward Caching in Diffusion Transformer Acceleration
https://github.com/prathebaselva/FORA

>Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion
https://boyuan.space/diffusion-forcing/

>RunwayML: Gen-3 Alpha Text to Video is now available to everyone.
https://x.com/runwayml/status/1807822396415467686

07/01/24

>Segment Anything without Supervision
https://github.com/frank-xwang/UnSAM

>MimicMotion: High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
https://tencent.github.io/MimicMotion/

>PopAlign: Population-Level Alignment for Fair Text-to-Image Generation
https://github.com/jacklishufan/PopAlignSDXL

>B-LoRA for Kohya-SS
https://github.com/ThereforeGames/blora_for_kohya

>Nearly half of US firms using AI say goal is to cut staffing costs
https://www.baka.com.au/world/north-america/nearly-half-of-us-firms-using-ai-say-goal-is-to-cut-staffing-costs-20240629-p5jpsl.html

06/30/2024

>HunyuanDiT-v1.2
https://huggingface.co/Tencent-Hunyuan/HunyuanDiT-v1.2

>Hunyuan-Captioner
https://huggingface.co/Tencent-Hunyuan/HunyuanCaptioner

>MotionClone: Training-Free Motion Cloning for Controllable Video Generation
https://github.com/Bujiazi/MotionClone

>SDFX, no-code platform to build and share AI apps, goes open source
https://github.com/sdfxai/sdfx

Anonymous
07/03/24(Wed)02:20:24 No.101252983

Anonymous 07/03/24(Wed)02:20:24 No.101252983

>mfw Research news

07/02/24

>Improving Diffusion Inverse Problem Solving with Decoupled Noise Annealing
https://arxiv.org/abs/2407.01521

>MIA-Bench: Towards Better Instruction Following Evaluation of Multimodal LLMs
https://arxiv.org/abs/2407.01509

>Expressive and Generalizable Low-rank Adaptation for Large Models via Slow Cascaded Learning
https://arxiv.org/abs/2407.01491

>FastCLIP: A Suite of Optimization Techniques to Accelerate CLIP Training with Limited Resources
https://arxiv.org/abs/2407.01445

>StyleShot: A SnapShot on Any Style
https://styleshot.github.io/

>TransferAttn: Transferable-guided Attention Is All You Need for Video Domain Adaptation
https://arxiv.org/abs/2407.01375

>Restyling Unsupervised Concept Based Interpretable Networks with Generative Models
https://jayneelparekh.github.io/VisCoIN_project_page/

>Unaligning Everything: Or Aligning Any Text to Any Image in Multimodal Models
https://arxiv.org/abs/2407.01157

>Evaluation of Text-to-Video Generation Models: A Dynamics Perspective
https://arxiv.org/abs/2407.01094

>Blind Inversion using Latent Diffusion Priors
https://arxiv.org/abs/2407.01027

>Cross-Modal Attention Alignment Network with Auxiliary Text Description for zero-shot sketch-based image retrieval
https://arxiv.org/abs/2407.00979

>InstantStyle-Plus: Style Transfer with Content-Preserving in Text-to-Image Generation
https://arxiv.org/abs/2407.00788

>Diffusion Models and Representation Learning: A Survey
https://arxiv.org/abs/2407.00783

>LLM4GEN: Leveraging Semantic Representation of LLMs for Text-to-Image Generation
https://arxiv.org/abs/2407.00737

>Consistency Purification: Effective and Efficient Diffusion Purification towards Certified Robustness
https://arxiv.org/abs/2407.00623

>GenderBias-VL: Benchmarking Gender Bias in Vision Language Models via Counterfactual Probing
https://arxiv.org/abs/2407.00600

>Toward a Diffusion-Based Generalist for Dense Vision Tasks
https://arxiv.org/abs/2407.00503

Anonymous
07/03/24(Wed)02:21:25 No.101252991

Anonymous 07/03/24(Wed)02:21:25 No.101252991

>mfw MORE Research news

>The Factuality Tax of Diversity-Intervened Text-to-Image Generation: Benchmark and Fact-Augmented Intervention
https://arxiv.org/abs/2407.00377

>SVG: 3D Stereoscopic Video Generation via Denoising Frame Matrix
https://daipengwa.github.io/SVG_ProjectPage/

>OccFusion: Rendering Occluded Humans with Generative Diffusion Priors
https://arxiv.org/abs/2407.00316

>Prompt Refinement with Image Pivot for Text-to-Image Generation
https://arxiv.org/abs/2407.00247

>Transformer-based Image and Video Inpainting: Current Challenges and Future Directions
https://arxiv.org/abs/2407.00226

>The impact of model size on catastrophic forgetting in Online Continual Learning
https://arxiv.org/abs/2407.00176

>Analyzing Quality, Bias, and Performance in Text-to-Image Generative Models
https://arxiv.org/abs/2407.00138

>DaBiT: Depth and Blur informed Transformer for Joint Refocusing and Super-Resolution
https://arxiv.org/abs/2407.01230

>Multimodal Conditional 3D Face Geometry Generation
https://arxiv.org/abs/2407.01074

>Human-like object concept representations emerge naturally in multimodal large language models
https://arxiv.org/abs/2407.01067

>An Expectation-Maximization Algorithm for Training Clean Diffusion Models from Corrupted Observations
https://arxiv.org/abs/2407.01014

>Unveiling Glitches: A Deep Dive into Image Encoding Bugs within CLIP
https://arxiv.org/abs/2407.00592

>Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models
https://arxiv.org/abs/2407.00569

>From Local Concepts to Universals: Evaluating the Multicultural Understanding of Vision-Language Models
https://arxiv.org/abs/2407.00263

>GalLoP: Learning Global and Local Prompts for Vision-Language Models
https://arxiv.org/abs/2407.01400

>GaussianStego: Embedding Invisible Information within Generative 3D Gaussian Splatting
https://gaussian-stego.github.io/

Anonymous
07/03/24(Wed)02:22:40 No.101253006

Anonymous 07/03/24(Wed)02:22:40 No.101253006

File: 00015-3963437392.png (1.81 MB, 1064x1192)

1.81 MB PNG

Anonymous
07/03/24(Wed)02:28:50 No.101253056

Anonymous 07/03/24(Wed)02:28:50 No.101253056

File: sd3bo_00018_.png (2.14 MB, 1536x880)

2.14 MB PNG

Anonymous
07/03/24(Wed)02:31:26 No.101253078

Anonymous 07/03/24(Wed)02:31:26 No.101253078

File: 1716313071156993.png (521 KB, 733x950)

521 KB PNG

>>101253018
I tried that, and added another himiko lora, and i guess its """comprehensible""" now but its realistic garbo instead of an artstyle, what did i do wrong anon...

Anonymous
07/03/24(Wed)02:34:29 No.101253099

Anonymous 07/03/24(Wed)02:34:29 No.101253099

>>101253078
also I think {{}} don't work on stable diffusion, but I could be wrong, try a different model like this https://civitai.com/models/7371/rev-animated

Anonymous
07/03/24(Wed)02:36:51 No.101253123

Anonymous 07/03/24(Wed)02:36:51 No.101253123

File: PW_74096_.jpg (301 KB, 2048x1536)

301 KB JPG

Sorry! Neighbors came to chat!
>>101252410
I like my steak medium well haha
>>101252421
Hello, anon! Welcome to /sdg/! Sometimes we do, they come and go

Anonymous
07/03/24(Wed)02:50:13 No.101253240

Anonymous 07/03/24(Wed)02:50:13 No.101253240

>>101253099
>{{}}
you are right, it doesn't work in the ui, only on the nai site

Anonymous
07/03/24(Wed)02:58:11 No.101253306

Anonymous 07/03/24(Wed)02:58:11 No.101253306

File: 00023-2830588716_cleanup.png (1.56 MB, 1064x1192)

1.56 MB PNG

Didn't notice the cameltoe

Anonymous
07/03/24(Wed)03:15:41 No.101253432

Anonymous 07/03/24(Wed)03:15:41 No.101253432

help i cant stop genning futa

Anonymous
07/03/24(Wed)03:16:00 No.101253434

Anonymous 07/03/24(Wed)03:16:00 No.101253434

File: PW_74137_.jpg (298 KB, 2048x1536)

298 KB JPG

Anonymous
07/03/24(Wed)03:33:20 No.101253557

Anonymous 07/03/24(Wed)03:33:20 No.101253557

File: 62banaok_lxi3b3ij_p.png (63 KB, 1024x1024)

63 KB PNG

>>101253306
fun style! can you catbox?
>>101253434
hi pw, haven't been in the general in many moons

i did do an art project generating portraits for people from selfies using SD tho, it was a lot of fun

Anonymous
07/03/24(Wed)03:36:50 No.101253579

Anonymous 07/03/24(Wed)03:36:50 No.101253579

File: j6bc8nom_lxi39akh_p.png (63 KB, 1024x1024)

63 KB PNG

>>101253056
oh hi debo didn't see you there, who else still hangs out here?

Anonymous
07/03/24(Wed)03:38:23 No.101253591

Anonymous 07/03/24(Wed)03:38:23 No.101253591

File: 00025-392711799.png (1.48 MB, 1064x1192)

1.48 MB PNG

>>101253557
https://files.catbox.moe/oxa9zl.png

Anonymous
07/03/24(Wed)03:41:44 No.101253613

Anonymous 07/03/24(Wed)03:41:44 No.101253613

File: PW_74163_.jpg (929 KB, 4096x3072)

929 KB JPG

>>101253557
Hey there, anon! It's great to have you back!! I hope you've been doing well :]
That pixel style is so cool!

Anonymous
07/03/24(Wed)03:49:44 No.101253673

Anonymous 07/03/24(Wed)03:49:44 No.101253673

File: PW_74166_.jpg (353 KB, 2048x1536)

353 KB JPG

>>101253557
>i did do an art project generating portraits for people from selfies using SD tho, it was a lot of fun
I just read this part haha
That sounds really fun! I haven't tried anything like that yet!

Anonymous
07/03/24(Wed)04:04:03 No.101253756

Anonymous 07/03/24(Wed)04:04:03 No.101253756

File: image - 2024-07-03T030354.245.png (765 KB, 1272x720)

765 KB PNG

Anonymous
07/03/24(Wed)04:17:40 No.101253857

Anonymous 07/03/24(Wed)04:17:40 No.101253857

>>101252896
> LDSR has the best quality upscale but uses 40GB of RAM and somehow destroys the color space in the process.
> R-ESRGAN-4x is quick and fast but everything ends up looking flat.
What upscaler do you use?

Anonymous
07/03/24(Wed)04:22:56 No.101253915

Anonymous 07/03/24(Wed)04:22:56 No.101253915

File: 202402074_tune.jpg (248 KB, 1504x1256)

248 KB JPG

Anonymous
07/03/24(Wed)04:27:30 No.101253954

Anonymous 07/03/24(Wed)04:27:30 No.101253954

File: 0a_lxi1sgtd_p.png (60 KB, 1024x1024)

60 KB PNG

>>101253673
it was a lot of stress but it was fun, i think i had like 450 people come by and get portraits done

>>101253756
ahhaha hey azulanon, your gens have improved a lot

>>101253591
thanks by the way, looks like i gotta try pony

Anonymous
07/03/24(Wed)04:34:18 No.101254001

Anonymous 07/03/24(Wed)04:34:18 No.101254001

File: image - 2024-07-03T031024.675.png (1.22 MB, 1064x1192)

1.22 MB PNG

>>101253954
thanks

Anonymous
07/03/24(Wed)04:36:35 No.101254015

Anonymous 07/03/24(Wed)04:36:35 No.101254015

File: PW_74209_.jpg (512 KB, 2048x1536)

512 KB JPG

>>101253954
Oh wow! That's a lot LOL
I bet that took quite some time haha
That looks really cool, is that one of em?

Anonymous
07/03/24(Wed)04:39:26 No.101254028

Anonymous 07/03/24(Wed)04:39:26 No.101254028

File: peace_SD-12.jpg (357 KB, 1600x1200)

357 KB JPG

Good morning SDG

Anonymous
07/03/24(Wed)04:43:24 No.101254066

Anonymous 07/03/24(Wed)04:43:24 No.101254066

File: PW_74200_.jpg (380 KB, 2048x1536)

380 KB JPG

>>101254028
Good morning, anon! :]
I hope you've slept well!

Anonymous
07/03/24(Wed)04:44:23 No.101254076

Anonymous 07/03/24(Wed)04:44:23 No.101254076

File: peace_SD-10.jpg (292 KB, 1600x1200)

292 KB JPG

>>101254066
I did thanks did you sleep well

Anonymous
07/03/24(Wed)04:48:34 No.101254101

Anonymous 07/03/24(Wed)04:48:34 No.101254101

File: 07129-2830588717_a50ea6fb(...).png (1.25 MB, 968x1088)

1.25 MB PNG

>>101253591
>>101253306
trying my own spin on it, yours are better though

>>101254015
it was about 20 seconds per image, yeah the last three i posted were from that project
>>101254001
curious for a catbox here as well

Anonymous
07/03/24(Wed)04:48:41 No.101254104

Anonymous 07/03/24(Wed)04:48:41 No.101254104

>>101253078
>what did i do wrong
garbage in, garbage out, prompt, model, etc. literally everything is crap

Anonymous
07/03/24(Wed)04:50:08 No.101254114

Anonymous 07/03/24(Wed)04:50:08 No.101254114

File: Default_Minecraft_voxel_s(...).jpg (445 KB, 1224x600)

445 KB JPG

is it horny 1girl day again?

Anonymous
07/03/24(Wed)04:52:59 No.101254134

Anonymous 07/03/24(Wed)04:52:59 No.101254134

Cursed thread

Anonymous
07/03/24(Wed)04:53:23 No.101254140

Anonymous 07/03/24(Wed)04:53:23 No.101254140

File: PW_74214_.jpg (393 KB, 2048x1536)

393 KB JPG

>>101254076
I'm glad to hear it! :D
I haven't slept yet actually hahaha
I woke up late cause it was super hot today, but i'm probably gonna go in an hour or so
>>101254101
Oh nice! That's not too bad!
Niceee I bet they were all really happy with the results :]

Anonymous
07/03/24(Wed)04:55:05 No.101254153

Anonymous 07/03/24(Wed)04:55:05 No.101254153

>>101254101
https://files.catbox.moe/ubjamo.png

Anonymous
07/03/24(Wed)05:08:12 No.101254249

Anonymous 07/03/24(Wed)05:08:12 No.101254249

File: BMP_15039_.png (3.03 MB, 1536x1536)

3.03 MB PNG

>>101254140
Same, spent all evening cooking for the 4th

Anonymous
07/03/24(Wed)05:22:04 No.101254353

Anonymous 07/03/24(Wed)05:22:04 No.101254353

File: PW_74024_.jpg (1.54 MB, 4096x3072)

1.54 MB JPG

>>101254249
Heyy Mouse anon! :D
Nice! Whacha make?
Cute gen!

Anonymous
07/03/24(Wed)05:27:26 No.101254395

Anonymous 07/03/24(Wed)05:27:26 No.101254395

File: landscape4.jpg (196 KB, 1536x1024)

196 KB JPG

>>101253915
cool, can you make something like this as wallpaper?

Anonymous
07/03/24(Wed)06:09:57 No.101254705

Anonymous 07/03/24(Wed)06:09:57 No.101254705

>>101254395
Give it a rest

Anonymous
07/03/24(Wed)06:11:51 No.101254715

Anonymous 07/03/24(Wed)06:11:51 No.101254715

>people who made SD 1.5
>Beat open ai to public release of Sora tier video model
>Stability foundation
>Static image models actually getting worse with each release
Imagine if Emad was competent and the runway team was a stability lab

Anonymous
07/03/24(Wed)06:19:32 No.101254797

Anonymous 07/03/24(Wed)06:19:32 No.101254797

File: Seiran.jpg (191 KB, 1024x1024)

191 KB JPG

Anonymous
07/03/24(Wed)06:24:52 No.101254856

Anonymous 07/03/24(Wed)06:24:52 No.101254856

File: 00002-[humu_v10]_[DPM++ 2(...).jpg (724 KB, 2048x3072)

724 KB JPG

Sometimes fingers turn out nice

Name
Options
Comment
Verification	4chan Pass users can bypass this verification. [Learn More] [Login]
File
Please read the Rules and FAQ before posting. You may highlight syntax and preserve whitespace by using [code] tags.