/g/ - Technology

File: 1719977660696950.png (3.03 MB, 1280x1920)
Previous /sdg/ thread: >>101245506

>Beginner UI local install
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io
Metastable: https://metastable.studio

>Local install
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
AMD GPU: https://rentry.org/sdg-link#amd-gpu
Intel GPU: https://rentry.org/sdg-link#intel-gpu

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Auto1111 forks
SD.Next: https://github.com/vladmandic/automatic
Anapnoe UX: https://github.com/anapnoe/stable-diffusion-webui-ux

>Run cloud hosted instance
https://rentry.org/sdg-link#run-cloud-hosted-instance

>SD3 info & download
https://rentry.org/sdg-link#sd3
https://education.civitai.com/quickstart-guide-to-stable-diffusion-3
https://aitracker.art/viewtopic.php?t=57

>Try online without registration
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://openmodeldb.info

>Animation
https://rentry.org/AnimAnon
https://rentry.org/AnimAnon-AnimDiff
https://rentry.org/AnimAnon-Deforum

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>View and submit GPU performance data
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Share image prompt info
4chan strips prompt info from uploaded images; share it using the following guide and file host instead.
https://rentry.org/hdgcb
https://catbox.moe
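
A rough sketch of where that prompt info lives, assuming the image is a PNG whose generator wrote its settings into a tEXt chunk (Automatic1111 is assumed to use the keyword `parameters`, ComfyUI `prompt` and `workflow`). It uses only the Python standard library; `read_png_text_chunks` and `make_chunk` are hypothetical helper names, and compressed (zTXt) or international (iTXt) chunks are not handled.

```python
import struct
import zlib

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def read_png_text_chunks(data: bytes) -> dict:
    """Return {keyword: text} for every uncompressed tEXt chunk in a PNG."""
    if not data.startswith(PNG_SIGNATURE):
        raise ValueError("not a PNG file")
    chunks = {}
    pos = len(PNG_SIGNATURE)
    while pos + 8 <= len(data):
        # Each chunk: 4-byte big-endian length, 4-byte type, body, 4-byte CRC.
        length, ctype = struct.unpack(">I4s", data[pos:pos + 8])
        body = data[pos + 8:pos + 8 + length]
        if ctype == b"tEXt":
            # tEXt body is "keyword\0text", both Latin-1 encoded.
            keyword, _, text = body.partition(b"\x00")
            chunks[keyword.decode("latin-1")] = text.decode("latin-1")
        if ctype == b"IEND":
            break
        pos += 12 + length
    return chunks


def make_chunk(ctype: bytes, body: bytes) -> bytes:
    """Build one PNG chunk (used here only to fake a file for the demo)."""
    crc = zlib.crc32(ctype + body) & 0xFFFFFFFF
    return struct.pack(">I", len(body)) + ctype + body + struct.pack(">I", crc)


if __name__ == "__main__":
    # Fake, pixel-less PNG skeleton: enough for the chunk walker, not a real image.
    demo = (
        PNG_SIGNATURE
        + make_chunk(b"IHDR", b"\x00" * 13)
        + make_chunk(b"tEXt", b"parameters\x001girl, Steps: 20")
        + make_chunk(b"IEND", b"")
    )
    print(read_png_text_chunks(demo))
```

Run against a file saved from 4chan, this should come back empty, while the same image fetched from catbox should still carry its chunk.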

>Discord
6wUwtcJsr2

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg
>>
File: 1694585461251945.jpg (474 KB, 600x870)
please help me im useless and stupid >>101252944
>>
File: 21694457859353946.png (3 MB, 2048x2048)
>>101252896
https://civitai.com/models/188114/rouge-the-bat
>>
File: _DG_News_00031_.png (1.75 MB, 1560x896)
>mfw Resource news

07/02/2024

>16ch-VAE: Open source 16ch VAE reproduction for SD3
https://huggingface.co/AuraDiffusion/16ch-vae

>DiffIR2VR-Zero: Zero-Shot Video Restoration with Diffusion-based Image Restoration Models
https://jimmycv07.github.io/DiffIR2VR_web/

>MimicMotion wrapper for ComfyUI
https://github.com/kijai/ComfyUI-MimicMotionWrapper

>E.T. the Exceptional Trajectories: Text-to-camera-trajectory generation with character awareness
https://www.lix.polytechnique.fr/vista/projects/2024_et_courant/

>FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds
https://foleycrafter.github.io/

>FORA: Fast-Forward Caching in Diffusion Transformer Acceleration
https://github.com/prathebaselva/FORA

>Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion
https://boyuan.space/diffusion-forcing/

>RunwayML: Gen-3 Alpha Text to Video is now available to everyone.
https://x.com/runwayml/status/1807822396415467686

07/01/2024

>Segment Anything without Supervision
https://github.com/frank-xwang/UnSAM

>MimicMotion: High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
https://tencent.github.io/MimicMotion/

>PopAlign: Population-Level Alignment for Fair Text-to-Image Generation
https://github.com/jacklishufan/PopAlignSDXL

>B-LoRA for Kohya-SS
https://github.com/ThereforeGames/blora_for_kohya

>Nearly half of US firms using AI say goal is to cut staffing costs
https://www.smh.com.au/world/north-america/nearly-half-of-us-firms-using-ai-say-goal-is-to-cut-staffing-costs-20240629-p5jpsl.html

06/30/2024

>HunyuanDiT-v1.2
https://huggingface.co/Tencent-Hunyuan/HunyuanDiT-v1.2

>Hunyuan-Captioner
https://huggingface.co/Tencent-Hunyuan/HunyuanCaptioner

>MotionClone: Training-Free Motion Cloning for Controllable Video Generation
https://github.com/Bujiazi/MotionClone

>SDFX, no-code platform to build and share AI apps, goes open source
https://github.com/sdfxai/sdfx
>>
>mfw Research news

07/02/2024

>Improving Diffusion Inverse Problem Solving with Decoupled Noise Annealing
https://arxiv.org/abs/2407.01521

>MIA-Bench: Towards Better Instruction Following Evaluation of Multimodal LLMs
https://arxiv.org/abs/2407.01509

>Expressive and Generalizable Low-rank Adaptation for Large Models via Slow Cascaded Learning
https://arxiv.org/abs/2407.01491

>FastCLIP: A Suite of Optimization Techniques to Accelerate CLIP Training with Limited Resources
https://arxiv.org/abs/2407.01445

>StyleShot: A SnapShot on Any Style
https://styleshot.github.io/

>TransferAttn: Transferable-guided Attention Is All You Need for Video Domain Adaptation
https://arxiv.org/abs/2407.01375

>Restyling Unsupervised Concept Based Interpretable Networks with Generative Models
https://jayneelparekh.github.io/VisCoIN_project_page/

>Unaligning Everything: Or Aligning Any Text to Any Image in Multimodal Models
https://arxiv.org/abs/2407.01157

>Evaluation of Text-to-Video Generation Models: A Dynamics Perspective
https://arxiv.org/abs/2407.01094

>Blind Inversion using Latent Diffusion Priors
https://arxiv.org/abs/2407.01027

>Cross-Modal Attention Alignment Network with Auxiliary Text Description for zero-shot sketch-based image retrieval
https://arxiv.org/abs/2407.00979

>InstantStyle-Plus: Style Transfer with Content-Preserving in Text-to-Image Generation
https://arxiv.org/abs/2407.00788

>Diffusion Models and Representation Learning: A Survey
https://arxiv.org/abs/2407.00783

>LLM4GEN: Leveraging Semantic Representation of LLMs for Text-to-Image Generation
https://arxiv.org/abs/2407.00737

>Consistency Purification: Effective and Efficient Diffusion Purification towards Certified Robustness
https://arxiv.org/abs/2407.00623

>GenderBias-VL: Benchmarking Gender Bias in Vision Language Models via Counterfactual Probing
https://arxiv.org/abs/2407.00600

>Toward a Diffusion-Based Generalist for Dense Vision Tasks
https://arxiv.org/abs/2407.00503
>>
>mfw MORE Research news

>The Factuality Tax of Diversity-Intervened Text-to-Image Generation: Benchmark and Fact-Augmented Intervention
https://arxiv.org/abs/2407.00377

>SVG: 3D Stereoscopic Video Generation via Denoising Frame Matrix
https://daipengwa.github.io/SVG_ProjectPage/

>OccFusion: Rendering Occluded Humans with Generative Diffusion Priors
https://arxiv.org/abs/2407.00316

>Prompt Refinement with Image Pivot for Text-to-Image Generation
https://arxiv.org/abs/2407.00247

>Transformer-based Image and Video Inpainting: Current Challenges and Future Directions
https://arxiv.org/abs/2407.00226

>The impact of model size on catastrophic forgetting in Online Continual Learning
https://arxiv.org/abs/2407.00176

>Analyzing Quality, Bias, and Performance in Text-to-Image Generative Models
https://arxiv.org/abs/2407.00138

>DaBiT: Depth and Blur informed Transformer for Joint Refocusing and Super-Resolution
https://arxiv.org/abs/2407.01230

>Multimodal Conditional 3D Face Geometry Generation
https://arxiv.org/abs/2407.01074

>Human-like object concept representations emerge naturally in multimodal large language models
https://arxiv.org/abs/2407.01067

>An Expectation-Maximization Algorithm for Training Clean Diffusion Models from Corrupted Observations
https://arxiv.org/abs/2407.01014

>Unveiling Glitches: A Deep Dive into Image Encoding Bugs within CLIP
https://arxiv.org/abs/2407.00592

>Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models
https://arxiv.org/abs/2407.00569

>From Local Concepts to Universals: Evaluating the Multicultural Understanding of Vision-Language Models
https://arxiv.org/abs/2407.00263

>GalLoP: Learning Global and Local Prompts for Vision-Language Models
https://arxiv.org/abs/2407.01400

>GaussianStego: Embedding Invisible Information within Generative 3D Gaussian Splatting
https://gaussian-stego.github.io/
>>
File: 00015-3963437392.png (1.81 MB, 1064x1192)
>>
File: sd3bo_00018_.png (2.14 MB, 1536x880)
>>
File: 1716313071156993.png (521 KB, 733x950)
>>101253018
I tried that, and added another himiko lora, and i guess its """comprehensible""" now but its realistic garbo instead of an artstyle, what did i do wrong anon...
>>
>>101253078
also I think {{}} doesn't work in stable diffusion, but I could be wrong. try a different model like this: https://civitai.com/models/7371/rev-animated
>>
File: PW_74096_.jpg (301 KB, 2048x1536)
Sorry! Neighbors came to chat!
>>101252410
I like my steak medium well haha
>>101252421
Hello, anon! Welcome to /sdg/! Sometimes we do, they come and go
>>
>>101253099
>{{}}
you are right, it doesn't work in the ui, only on the nai site
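
For anyone pasting NAI prompts into a local UI, a minimal sketch of converting the braces, assuming each `{}` layer on NovelAI multiplies attention by 1.05 and that the target UI accepts Automatic1111-style `(text:weight)` syntax. `nai_braces_to_weights` is a hypothetical helper, and it only handles simple, non-nested spans like `{{tag}}`.

```python
import re

NAI_STEP = 1.05  # assumed per-brace attention multiplier on NovelAI


def nai_braces_to_weights(prompt: str) -> str:
    """Rewrite {{tag}} spans as (tag:weight) with weight = NAI_STEP ** depth."""

    def repl(match: re.Match) -> str:
        depth = len(match.group(1))       # number of opening braces
        weight = NAI_STEP ** depth
        return f"({match.group(2)}:{weight:.2f})"

    # One or more '{', then text without braces, then one or more '}'.
    return re.sub(r"(\{+)([^{}]+)\}+", repl, prompt)


if __name__ == "__main__":
    print(nai_braces_to_weights("{{himiko toga}}, 1girl, {smile}"))
```

Prompts without braces pass through unchanged, so it should be safe to run on everything before pasting.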
>>
File: 00023-2830588716_cleanup.png (1.56 MB, 1064x1192)
Didn't notice the cameltoe
>>
help i cant stop genning futa
>>
File: PW_74137_.jpg (298 KB, 2048x1536)
>>
File: 62banaok_lxi3b3ij_p.png (63 KB, 1024x1024)
>>101253306
fun style! can you catbox?
>>101253434
hi pw, haven't been in the general in many moons

i did do an art project generating portraits for people from selfies using SD tho, it was a lot of fun
>>
File: j6bc8nom_lxi39akh_p.png (63 KB, 1024x1024)
>>101253056
oh hi debo didn't see you there, who else still hangs out here?
>>
File: 00025-392711799.png (1.48 MB, 1064x1192)
>>101253557
https://files.catbox.moe/oxa9zl.png
>>
File: PW_74163_.jpg (929 KB, 4096x3072)
>>101253557
Hey there, anon! It's great to have you back!! I hope you've been doing well :]
That pixel style is so cool!
>>
File: PW_74166_.jpg (353 KB, 2048x1536)
>>101253557
>i did do an art project generating portraits for people from selfies using SD tho, it was a lot of fun
I just read this part haha
That sounds really fun! I haven't tried anything like that yet!
>>
>>101252896
> LDSR has the best quality upscale but uses 40GB of RAM and somehow destroys the color space in the process.
> R-ESRGAN-4x is quick and fast but everything ends up looking flat.
What upscaler do you use?
>>
File: 202402074_tune.jpg (248 KB, 1504x1256)
>>
File: 0a_lxi1sgtd_p.png (60 KB, 1024x1024)
>>101253673
it was a lot of stress but it was fun, i think i had like 450 people come by and get portraits done

>>101253756
ahhaha hey azulanon, your gens have improved a lot

>>101253591
thanks by the way, looks like i gotta try pony
>>
>>101253954
thanks
>>
File: PW_74209_.jpg (512 KB, 2048x1536)
>>101253954
Oh wow! That's a lot LOL
I bet that took quite some time haha
That looks really cool, is that one of em?
>>
File: peace_SD-12.jpg (357 KB, 1600x1200)
Good morning SDG
>>
File: PW_74200_.jpg (380 KB, 2048x1536)
>>101254028
Good morning, anon! :]
I hope you've slept well!
>>
File: peace_SD-10.jpg (292 KB, 1600x1200)
>>101254066
I did thanks did you sleep well
>>
>>101253591
>>101253306
trying my own spin on it, yours are better though

>>101254015
it was about 20 seconds per image, yeah the last three i posted were from that project
>>101254001
curious for a catbox here as well
>>
>>101253078
>what did i do wrong
garbage in, garbage out, prompt, model, etc. literally everything is crap
>>
is it horny 1girl day again?
>>
Cursed thread
>>
File: PW_74214_.jpg (393 KB, 2048x1536)
>>101254076
I'm glad to hear it! :D
I haven't slept yet actually hahaha
I woke up late cause it was super hot today, but i'm probably gonna go in an hour or so
>>101254101
Oh nice! That's not too bad!
Niceee I bet they were all really happy with the results :]
>>
>>101254101
https://files.catbox.moe/ubjamo.png
>>
File: BMP_15039_.png (3.03 MB, 1536x1536)
>>101254140
Same, spent all evening cooking for the 4th
>>
File: PW_74024_.jpg (1.54 MB, 4096x3072)
>>101254249
Heyy Mouse anon! :D
Nice! Whacha make?
Cute gen!
>>
File: landscape4.jpg (196 KB, 1536x1024)
>>101253915
cool, can you make something like this as wallpaper?
>>
>>101254395
Give it a rest
>>
>people who made SD 1.5
>Beat open ai to public release of Sora tier video model
>Stability foundation
>Static image models actually getting worse with each release
Imagine if Emad was competent and the runway team was a stability lab
>>
File: Seiran.jpg (191 KB, 1024x1024)
>>
Sometimes fingers turn out nice


