/g/ - Technology

File: 1722049297682495.jpg (694 KB, 3616x2048)
Previous /sdg/ thread : >>101582499

>Beginner UI local install
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Local install
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SD.Next: https://github.com/vladmandic/automatic
AMD GPU: https://rentry.org/sdg-link#amd-gpu
Intel GPU: https://rentry.org/sdg-link#intel-gpu

>Use a VAE if your images look washed out

>Run cloud hosted instance

>SD3 info & download

>Try online without registration
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Models, LoRAs & upscaling


>Index of guides and other tools

>View and submit GPU performance data

>Share image prompt info
4chan removes prompt info from images, share them with the following guide/site...


>Related boards
File: _DG_News_00304_.png (1.69 MB, 1560x896)
>mfw Resource news


>ComfyUI implements native Hunyuan-DiT support, packages model to single file

>Fooocus v2.5.1 Update

>HVM-1: Video models trained on ~5k hours of human-like video data

>Move and Act: Enhanced Object Manipulation and Background Integrity for Image Editing

>LLMImageIndexer: Use local LLM to create descriptive metadata

>Generative AI for Krita: Version 1.21.0

>Rope-Live: Customized Rope for Streaming


>ViPer: Visual Personalization of Generative Models via Individual Preference Learning

>ComfyUI-Kolors-Translator: Translate prompts into Chinese



>SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency

>Open-Sora-Plan Report v1.2.0

>Official global launch of Kling AI's International Version 1.0

>INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal LLMs

>FoRA: Low-Rank Adaptation Model beyond Multimodal Siamese Network


>Mammoth - Extendible (General) Continual Learning Framework for Pytorch

>Differentiable Convex Polyhedra Optimization from Multi-view Images

>Cinemo: Consistent and Controllable Animation with Motion Diffusion Models
>mfw Research news


>RegionDrag: Fast Region-Based Image Editing with Diffusion Models

>Imagine yourself: Tuning-Free Personalized Image Generation

>Sparse vs Contiguous Adversarial Pixel Perturbations in Multimodal Models: An Empirical Analysis

>Quasar-ViT: Hardware-Oriented Quantization-Aware Architecture Search for Vision Transformers

>GaussianSR: High Fidelity 2D Gaussian Splatting for Arbitrary-Scale Image Super-Resolution

>RestoreAgent: Autonomous Image Restoration Agent via Multimodal Large Language Models

>AttentionHand: Text-driven Controllable Hand Image Generation for 3D Hand Reconstruction

>Scaling Training Data with Lossy Image Compression

>Guided Latent Slot Diffusion for Object-Centric Learning

>Reasoning and Correcting Diffusion for HOI Generation

>Amortized Posterior Sampling with Diffusion Prior Distillation

>FlexiEdit: Frequency-Aware Latent Refinement for Enhanced Non-Rigid Editing

>DragText: Rethinking Text Embedding in Point-based Image Editing

>How Lightweight Can A Vision Transformer Be

>Quality Assured: Rethinking Annotation Strategies in Imaging AI

>Diffusion Models for Multi-Task Generative Modeling

>ReDiFine: Reusable Diffusion Finetuning for Mitigating Degradation in the Chain of Diffusion

>A Survey of Accessible Explainable Artificial Intelligence Research
how was the movie
File: ComfyUI_00122_.png (1.34 MB, 960x1088)
You're alright. Don't go to Paris tomorrow.
File: 00018-212246229.jpg (1.37 MB, 2240x3360)
That's a shame...
File: 00068-1143213473.png (1.81 MB, 1200x1496)
Why does the American government keep tall white alien waifus from us? Do they want to have em all for themselves?
File: up_0002.jpg (714 KB, 3072x5120)
File: 00075-2941897621.jpg (198 KB, 2560x1440)
File: ComfyUI_00124_.png (1.35 MB, 960x1088)
goo night
File: 00007-1642856451.jpg (640 KB, 2240x3360)
Night night.
File: 00000-171560374_cleanup.png (1.56 MB, 1112x1248)
I like the film grain, good aesthetic
I liked it more than I thought I would, it was pretty fun.
File: 00011-27647891.jpg (719 KB, 2240x3360)
Thanks, it's definitely the look I'm going for with this latest batch (phone camera/low quality). Adds a bit of extra realism I think.
A lot of the upscalers don't like it though, and it's just pure luck that they don't ruin the whole image 90% of the time. Still trying to find consistency when it comes to that.
File: image (13).png (1.3 MB, 1112x1248)
Can it run on 6GB vram yet?
File: LivePortrait_00001.webm (2 MB, 1280x768)
Nice, I didn't know it worked with video. I used SVD and it's not as good as the cloud models, but I made it work with liveportrait.
How much VRAM do you think these video models need? (runway, kling, etc.)
File: 00013-2341014822.png (1.56 MB, 1112x1248)
Who posts the news when news guy is sleeping?
No one actually reads news beyond pointing out absurd headlines like
>Rope-Live: Customized Rope for Streaming
When people do repost the news in their absence, it's just a thread of news posts.
File: 00089-1624372523.jpg (342 KB, 2560x1440)
File: 00093-4167790712.jpg (380 KB, 2560x1440)
File: HomoCity2.jpg (560 KB, 1720x1152)
File: file.jpg (174 KB, 1024x1792)
217m from ~95 days
half way to Diffusion-1B with the other sets
might do some more on ait, either carry on debugging why stable-cascade-prior works and stable-cascade doesnt or just test another model
its a weird bug where a tensor gets removed from the graph or overwritten or something, tracked it to up_blocks_repeat_mappers being >1 compared to -prior, decoder works with it set to 1, -prior doesnt work either if i change the config to 2
no wonder meta abandoned the project desu
>Model recognizes every cartoon character I can think of

>Except Spongebob

So annoying
>Mr. Sponge Bob, sponge with big cartoon eyes, half human half sponge,
File: sponge.jpg (59 KB, 415x550)
>Mr. Sponge Bob, sponge with big cartoon eyes, half human half sponge,
Works for me, same exact prompt...
File: Nope.jpg (81 KB, 1024x1024)

Took me a bit to get it. I had a couple already queued to finish.

Let's see what I get.

She's cute at least.
File: sponge2.jpg (67 KB, 1152x768)
File: sponge3.jpg (159 KB, 1152x768)
File: sponge4.jpg (71 KB, 1152x768)
>So, about your application...
File: sponge5.jpg (252 KB, 1152x768)
Last one.
This is CinematicRedmond model btw.
File: ComfyUI_00562_.png (2.03 MB, 1280x1024)
This roasted chicken isn't roasted at all! lololo
File: ComfyUI_00528_.png (1.72 MB, 1280x1024)
i'm bored of portrait already
there's very little you can do with it
File: 1704752501211500.jpg (128 KB, 963x1024)
Need some tips on training LoRAs.
I'm doing a concept LoRA based on a niche fetish (if you really must know, google r34 xxx urethral insertion). My number of training images is large and I'm very pedantic about the quality of the dataset.

1. Should I keep everything on 1012x1012 or is the aspect ratio whatever?
2. Captioning. Should I focus on the concepts only or should I go fullblown autism like the booru taggers?
1. Aspect ratio should be rectified and anally inspected to be exact same.
2. Better go autism.
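
If you do go the "everything the exact same aspect ratio" route, here is a minimal sketch of working out a centered crop box for each image before resizing. The function name `center_crop_box` and the default 1:1 target are assumptions for illustration, not something from any trainer; many trainers also offer aspect-ratio bucketing instead, so treat this as one option only.

```python
def center_crop_box(width, height, target_ratio=1.0):
    """Return (left, top, right, bottom) for the largest centered crop
    whose width/height ratio equals target_ratio (hypothetical helper)."""
    if width <= 0 or height <= 0 or target_ratio <= 0:
        raise ValueError("width, height and target_ratio must be positive")
    if width / height > target_ratio:
        # Image is too wide for the target: keep full height, trim the sides.
        new_w = round(height * target_ratio)
        left = (width - new_w) // 2
        return (left, 0, left + new_w, height)
    # Image is too tall (or already matches): keep full width, trim top/bottom.
    new_h = round(width / target_ratio)
    top = (height - new_h) // 2
    return (0, top, width, top + new_h)


if __name__ == "__main__":
    # A 1920x1080 frame cropped to a square keeps the middle 1080 columns.
    print(center_crop_box(1920, 1080))
```

The 4-tuple is in the same (left, upper, right, lower) order that Pillow's `Image.crop` takes, so it can be passed straight to it and followed by a resize to the training resolution.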
File: 000000_15408_.png (2.39 MB, 952x1667)
>SpongeBob Squarepants
File: 1697423567582515.png (7 KB, 289x81)
so is this now forge level performance or what
File: 00096-2900360052.jpg (504 KB, 2560x1440)
if they didnt fix the bugs in it then its slower than 1.9.4
File: 00104-1505696446.jpg (244 KB, 1440x2560)
>tfw no bizarre alien gf who lives in dark basement and only eats rodents
i miss schizo anon
File: file.jpg (224 KB, 1024x1792)
Are you that data hoarder?
how do I sandbox SD on Windows so a rogue GitHub contributor doesn't pwn my machine?
I think you should not bother with genning images at all if this is the first question that comes to your mind.
File: file.jpg (340 KB, 1024x1792)
>red scare
I've been genning for months, and even training my own LoRAs. I just think it isn't smart to run so much ever-changing code without a sandbox.
airgap your inference machine
Well, you could use a virtual machine with gpu passthrough but I'm sure it will cause complications. Or maybe not. You could test it.
File: 00363-4129701192.jpg (310 KB, 1920x1152)
anyone know anything about goomaxing?
File: ComfyUI_00648_.png (1.86 MB, 1280x1024)
File: HomoForest1.jpg (631 KB, 1720x1152)
File: HomoForest2.jpg (714 KB, 1720x1152)
Might as well post the second one.
yeah, because I have so many machines to spare...
I was considering it, but it's the nuclear option. don't want to deal with two graphics cards etc. I guess I could run it as some restricted user, but it's Windows so I wouldn't expect this to help much.
You don't need two cards because you already (in most cases) have an integrated gpu on your cpu.
I'm not familiar with virtual machines and gpu passthrough anyway but I know using integrated one is possible and you won't need two cards. Besides, you could have something very cheap as a second one anyway...
I'm sure an ordinary firewall configuration will help to isolate any unwanted traffic, for Windows I recommend SimpleWall by Henry++.
File: ComfyUI_00665_.png (1.79 MB, 1280x1024)
How did you define this painterly style?
I just use
>oil painting by Brian LeBlanc
for example but I find it hard to get specific styles like this. They all look like 'something but not quite'. It should be doable with almost any SDXL checkpoint.
File: 00005-3811351947.jpg (563 KB, 1024x1536)
good morning, i hope you are feeling well
File: ComfyUI_00674_.png (1.98 MB, 1280x1024)

Sometimes I use realistic, or oil painting, or realistic anime, but it's quirky.

It's more like the combination of euler, simple, and the right amount of denoise and cfg parameters that triggers it.
Was listening to
Thought about your muse in this situation
File: 000000_15415_.png (2.39 MB, 998x1747)
>bizarre alien gf who lives in dark basement and only eats rodents
thanks anon, some good advice there. not all of it applies to my situation (for example, I don't have integrated GPU in my 5950x), but thanks anyway. I guess I was hoping Windows would have some better sandboxing mechanism by now, without resorting to virtualization. for now I will try using a restricted user. I guess it's marginally better than just raw dogging it as an admin user, kek.
File: ComfyUI_00125_.png (1.48 MB, 960x1088)
>want to gen a fox peeking his head up from the snow
>get a 1girl
thanks, pony
anyway, goo morning!
I think you would be more vulnerable on Linux (as Python is integrated way deeper) than on Windows especially if you're using standalone venv things. I mean you are exaggerating this.
File: forest1.jpg (108 KB, 1152x768)
File: 00018-77904884.jpg (382 KB, 1496x1000)
indeed, it is caturday, expecting lots of feisty felines aujourdhui
File: 000000_15419_.png (3.01 MB, 998x1747)
>cats are egotistical, look at me I'm a cat. lizard hybrid, look at its eyes.
Very nice style combination, amazing.
Kolors model with ipadapter,
Art by (Theodoros Ralli:1.05) and (Michael Garmash
Jean Giraud)

posted workflow for base gen and upscale last thread.
File: 0.jpg (291 KB, 1024x1024)
Anyone that paranoid has something to hide
My gpu is so slow I haven't bothered with ipadapter, lmao.
Try adding
>(art by Paul Lehr:x.x)
He has very colorful paintings. Don't have a new example of this so I won't post any images.
>>(art by Paul Lehr:x.x)
thank you, mine's a 12GB 3060, need a 16GB now. I have multiple workflows to finish one image kek...
File: cables1.jpg (840 KB, 1720x1152)
Not as strong as I wanted it to be but whatever.
File: 00036-2599334433.jpg (707 KB, 1559x2250)
File: 000000_15423_.png (3.09 MB, 998x1747)
Very nice.

>(art by Paul Lehr:1.2)
lmao well... I can swear it works on some things
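
For anyone wondering what the `(art by Paul Lehr:1.2)` form is doing: it is the `(text:weight)` emphasis syntax, where the number scales how strongly that chunk of the prompt counts. Below is a rough sketch of pulling those chunks out of a prompt string. The `parse_weights` name and the flat regex are assumptions made up for this post, it ignores nested parentheses and escapes, and each UI has its own real parser and its own way of applying the weight.

```python
import re

# Matches one flat "(some text:1.2)" group; nested parentheses are not handled.
_WEIGHTED = re.compile(r"\(([^():]+):([0-9]*\.?[0-9]+)\)")


def parse_weights(prompt, default=1.0):
    """Split a prompt into (text, weight) chunks (hypothetical helper).

    Text outside a (text:weight) group gets the default weight.
    """
    chunks = []
    pos = 0
    for match in _WEIGHTED.finditer(prompt):
        # Unweighted text sitting between the previous group and this one.
        plain = prompt[pos:match.start()].strip(" ,")
        if plain:
            chunks.append((plain, default))
        chunks.append((match.group(1).strip(), float(match.group(2))))
        pos = match.end()
    # Whatever is left after the last weighted group.
    tail = prompt[pos:].strip(" ,")
    if tail:
        chunks.append((tail, default))
    return chunks


if __name__ == "__main__":
    for text, weight in parse_weights("oil painting, (art by Paul Lehr:1.2), cat"):
        print(f"{weight:.2f}  {text}")
```

A placeholder like `(art by Paul Lehr:x.x)` does not match the pattern, so it comes back as plain text at the default weight until a real number is filled in.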
File: cables2.jpg (267 KB, 1720x1152)
Super saturated, took away a bit of realism, but I also have other artists in the prompt so most likely it didn't transfer well, moved on. lol
i think trani is shit (as a human)
Any groundbreaking realism models for SD3?
File: FeistyFelines01.jpg (612 KB, 1720x1152)
File: FeistyFelines02.jpg (557 KB, 1720x1152)
File: FeistyFelines03.jpg (663 KB, 1720x1152)
no one has been training for sd3 because of the shit license
I'm wondering what has happened with these Chink models?
I tried the latest Comfy portable with Hunyuan DiT but it couldn't recognize the checkpoint (all from the comfy example page). I wonder maybe it was because the portable wasn't updated.
I mean aside from the initial propaganda I do think these might have potential when adequately trained.
File: 000000_15428_.png (3.16 MB, 998x1747)
File: FeistyFelines04.jpg (568 KB, 1720x1152)
>Conned by a Bangledeshi and cucked by a Jeet
What's next for Stability AI?
File: ComfyUI_00798_.png (1.85 MB, 1280x1024)
File: PurpleWizard01.jpg (86 KB, 1152x768)
Finally, after so much struggle, I've made the scene I had in mind.
Multiple characters are a pain in the ass, also making them fight.
File: ComfyUI_00800_.png (1.96 MB, 1280x1024)
here it is
File: 00051-610656515.jpg (1.22 MB, 1560x2064)
File: file.jpg (314 KB, 1024x1792)
File: 00000_15430_.png (3.02 MB, 998x1747)
based schizophrenic thread guardian of frenship
File: grid-0001.jpg (1.6 MB, 3200x3200)
File: Watercolor01originalxl.png (2.53 MB, 1920x1080)
Testing. I don't remember if this was done with albedoXl or vanillaSdxl.
Next one is the same prompt with Redmond but upscaled with sd1.5 and also in different resolution.
File: Watercolor01Redmond.jpg (172 KB, 1720x1152)
The aspect ratio makes it more squeezy.
Original is better because it was upscaled with the same sdxl model.
What was the point? I don't know, there is no point.
