[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


File: 1722049297682495.jpg (694 KB, 3616x2048)
694 KB
694 KB JPG
Previous /sdg/ thread : >>101582499

>Beginner UI local install
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Local install
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SD.Next: https://github.com/vladmandic/automatic
AMD GPU: https://rentry.org/sdg-link#amd-gpu
Intel GPU: https://rentry.org/sdg-link#intel-gpu

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Run cloud hosted instance
https://rentry.org/sdg-link#run-cloud-hosted-instance

>SD3 info & download
https://rentry.org/sdg-link#sd3

>Try online without registration
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://aitracker.art
https://openmodeldb.info

>Animation
https://rentry.org/AnimAnon
https://rentry.org/AnimAnon-AnimDiff
https://rentry.org/AnimAnon-Deforum

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>View and submit GPU performance data
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Share image prompt info
4chan removes prompt info from images, share them with the following guide/site...
https://rentry.org/hdgcb
https://catbox.moe

>Discord
6wUwtcJsr2

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg
>>
File: _DG_News_00304_.png (1.69 MB, 1560x896)
1.69 MB
1.69 MB PNG
>mfw Resource news

07/26/2024

>ComfyUI implements native Hunyuan-DiT support, packages model to single file
https://comfyanonymous.github.io/ComfyUI_examples/hunyuan_dit

>Fooocus v2.5.1 Update
https://github.com/lllyasviel/Fooocus/releases/tag/v2.5.1

>HVM-1: Video models trained on ~5k hours of human-like video data
https://github.com/eminorhan/hvm-1

>Move and Act: Enhanced Object Manipulation and Background Integrity for Image Editing
https://github.com/mobiushy/move-act

>LLMImageIndexer: Use local LLM to create descriptive metadata
https://github.com/jabberjabberjabber/LLavaImageTagger/

>Generative AI for Krita: Version 1.21.0
https://github.com/Acly/krita-ai-diffusion/releases/tag/v1.21.0

>Rope-Live: Customized Rope for Streaming
https://github.com/argenspin/Rope-Live

07/25/2024

>ViPer: Visual Personalization of Generative Models via Individual Preference Learning
https://viper.epfl.ch

>ComfyUI-Kolors-Translator: Translate prompts into Chinese
https://github.com/BetaDoggo/ComfyUI-Kolors-Translator

>ComfyUI-FollowYourEmojiWrapper
https://github.com/kijai/ComfyUI-FollowYourEmojiWrapper

07/24/2024

>SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency
https://sv4d.github.io

>Open-Sora-Plan Report v1.2.0
https://github.com/PKU-YuanGroup/Open-Sora-Plan/blob/main/docs/Report-v1.2.0.md

>Official global launch of Kling AI's International Version 1.0
https://klingai.com

>INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal LLMs
https://github.com/WeihuangLin/INF-LLaVA

>FoRA: Low-Rank Adaptation Model beyond Multimodal Siamese Network
https://github.com/zyszxhy/FoRA

07/23/2024

>Mammoth - Extendible (General) Continual Learning Framework for Pytorch
https://github.com/aimagelab/mammoth

>Differentiable Convex Polyhedra Optimization from Multi-view Images
https://github.com/kimren227/DiffConvex

>Cinemo: Consistent and Controllable Animation with Motion Diffusion Models
https://maxin-cn.github.io/cinemo_project
>>
>mfw Research news

07/26/2024

>RegionDrag: Fast Region-Based Image Editing with Diffusion Models
https://arxiv.org/abs/2407.18247

>Imagine yourself: Tuning-Free Personalized Image Generation
https://ai.meta.com/research/publications/imagine-yourself-tuning-free-personalized-image-generation

>Sparse vs Contiguous Adversarial Pixel Perturbations in Multimodal Models: An Empirical Analysis
https://arxiv.org/abs/2407.18251

>Quasar-ViT: Hardware-Oriented Quantization-Aware Architecture Search for Vision Transformers
https://arxiv.org/abs/2407.18175

>GaussianSR: High Fidelity 2D Gaussian Splatting for Arbitrary-Scale Image Super-Resolution
https://arxiv.org/abs/2407.18046

>RestoreAgent: Autonomous Image Restoration Agent via Multimodal Large Language Models
https://arxiv.org/abs/2407.18035

>AttentionHand: Text-driven Controllable Hand Image Generation for 3D Hand Reconstruction
https://arxiv.org/abs/2407.18034

>Scaling Training Data with Lossy Image Compression
https://arxiv.org/abs/2407.17954

>Guided Latent Slot Diffusion for Object-Centric Learning
https://guided-sa.github.io/

>Reasoning and Correcting Diffusion for HOI Generation
https://alberthkyhky.github.io/ReCorD/

>Amortized Posterior Sampling with Diffusion Prior Distillation
https://arxiv.org/abs/2407.17907

>FlexiEdit: Frequency-Aware Latent Refinement for Enhanced Non-Rigid Editing
https://arxiv.org/abs/2407.17850

>DragText: Rethinking Text Embedding in Point-based Image Editing
https://arxiv.org/abs/2407.17843

>How Lightweight Can A Vision Transformer Be
https://arxiv.org/abs/2407.17783

>Quality Assured: Rethinking Annotation Strategies in Imaging AI
https://arxiv.org/abs/2407.17596

>Diffusion Models for Multi-Task Generative Modeling
https://arxiv.org/abs/2407.17571

>ReDiFine: Reusable Diffusion Finetuning for Mitigating Degradation in the Chain of Diffusion
https://arxiv.org/abs/2407.17493

>A Survey of Accessible Explainable Artificial Intelligence Research
https://arxiv.org/abs/2407.17484
>>
>>
>>101590591
how was the movie
>>
File: ComfyUI_00122_.png (1.34 MB, 960x1088)
1.34 MB
1.34 MB PNG
>>101590591
You're alright. Don't go to Paris tomorrow.
>>
File: 00018-212246229.jpg (1.37 MB, 2240x3360)
1.37 MB
1.37 MB JPG
>>101590550
That's a shame...
>>
File: 00068-1143213473.png (1.81 MB, 1200x1496)
1.81 MB
1.81 MB PNG
Why does the American government keep tall white alien waifus from us? Do they want to have em all for themselves?
>>
File: up_0002.jpg (714 KB, 3072x5120)
714 KB
714 KB JPG
>>
File: 00075-2941897621.jpg (198 KB, 2560x1440)
198 KB
198 KB JPG
>>
File: ComfyUI_00124_.png (1.35 MB, 960x1088)
1.35 MB
1.35 MB PNG
goo night
>>
File: 00007-1642856451.jpg (640 KB, 2240x3360)
640 KB
640 KB JPG
>>101590882
Night night.
>>
>>101590882
Goodnight.
>>
File: 00000-171560374_cleanup.png (1.56 MB, 1112x1248)
1.56 MB
1.56 MB PNG
>>101590893
I like the film grain, good aesthetic
>>
>>101590593
I liked it more than i thought i would, it was pre fun.
>>101590626
Kek
>>
File: 00011-27647891.jpg (719 KB, 2240x3360)
719 KB
719 KB JPG
>>101590965
Thanks, it's definitely the look I'm going for with this latest batch (phone camera/low quality). Adds a bit of extra realism I think.
A lot of the upscalers don't like it though, and it's just pure luck that they don't ruin the whole image 90% of the time. Still trying to find consistency when it comes to that.
>>
File: image (13).png (1.3 MB, 1112x1248)
1.3 MB
1.3 MB PNG
>>
>>101590480
>Hunyuan-DiT
Can it run on 6GB vram yet?
>>
File: LivePortrait_00001.webm (2 MB, 1280x768)
2 MB
2 MB WEBM
>>101586231
>https://files.catbox.moe/7662s9.mp4
Nice, I didn't know it worked with video. I used SVD and it's not as good as the cloud models, but I made it work with liveportrait.
How much VRAM do you think these video models need? (runway, kling, etc.)
>>
File: 00013-2341014822.png (1.56 MB, 1112x1248)
1.56 MB
1.56 MB PNG
>>
Who posts the news when news guy is sleeping?
>>
>>101591487
No one actually reads news beyond pointing out absurd headlines like
>Rope-Live: Customized Rope for Streaming
we people do repost the new in their absence it's just a thread of news posts
>>
File: 00089-1624372523.jpg (342 KB, 2560x1440)
342 KB
342 KB JPG
>>
File: 00093-4167790712.jpg (380 KB, 2560x1440)
380 KB
380 KB JPG
>>
File: HomoCity2.jpg (560 KB, 1720x1152)
560 KB
560 KB JPG
>>
File: file.jpg (174 KB, 1024x1792)
174 KB
174 KB JPG
217m from ~95 days
half way to Diffusion-1B with the other sets
might do some more on ait, either carry on debugging why stable-cascade-prior works and stable-cascade doesnt or just test another model
its a weird bug where a tensor gets removed from the graph or overwritten or something, tracked it to up_blocks_repeat_mappers being >1 compared to -prior, decoder works with it set to 1, -prior doesnt work either if i change the config to 2
no wonder meta abandoned the project desu
>>
>Model recognizes every cartoon character I can think of

>Except Spongebob

So annoying
>>
>>101591840
Try
>Mr. Sponge Bob, sponge with big cartoon eyes, half human half sponge,
>>
>>
File: sponge.jpg (59 KB, 415x550)
59 KB
59 KB JPG
>>101591865
>>101591840
>Mr. Sponge Bob, sponge with big cartoon eyes, half human half sponge,
Works for me, same exact prompt...
>>
>>
File: Nope.jpg (81 KB, 1024x1024)
81 KB
81 KB JPG
>>101591865
>>101591890

Took me a bit to get it. I had a couple already queued to finish.

Let's see what I get.


She's cute at least.
>>
File: sponge2.jpg (67 KB, 1152x768)
67 KB
67 KB JPG
>>
File: sponge3.jpg (159 KB, 1152x768)
159 KB
159 KB JPG
>>
File: sponge4.jpg (71 KB, 1152x768)
71 KB
71 KB JPG
>So, about your application...
>>
File: sponge5.jpg (252 KB, 1152x768)
252 KB
252 KB JPG
Last one.
This is CinematicRedmond model btw.
>>
File: ComfyUI_00562_.png (2.03 MB, 1280x1024)
2.03 MB
2.03 MB PNG
This roasted chicken isn't roasted at all! lololo
>>
File: ComfyUI_00528_.png (1.72 MB, 1280x1024)
1.72 MB
1.72 MB PNG
>>
i'm bored of portrait already
there's very little you can do with it
>>
File: 1704752501211500.jpg (128 KB, 963x1024)
128 KB
128 KB JPG
Need some tips on training LoRAs.
I'm doing a concept LoRA based on a niche fetish (if you really must know, google r34 xxx urethral insertion). My number of training images is large and I'm very pedantic about the quality of the dataset.

1. Should I keep everything on 1012x1012 or is the aspect ratio whatever?
2. Captioning. Should I focus on the concepts only or should I go fullblown autism like the booru taggers?
>>
>>101592590
1. Aspect ratio should be rectified and anally inspected to be exact same.
2. Better go autism.
>>
File: 000000_15408_.png (2.39 MB, 952x1667)
2.39 MB
2.39 MB PNG
>>101591840
>SpongeBob Squarepants
>>
File: 1697423567582515.png (7 KB, 289x81)
7 KB
7 KB PNG
so is this now forge level performance or what
>>
File: 00096-2900360052.jpg (504 KB, 2560x1440)
504 KB
504 KB JPG
>>101592888
if they didnt fix the bugs in it then its slower than 1.9.4
>>
File: 00104-1505696446.jpg (244 KB, 1440x2560)
244 KB
244 KB JPG
>>
>>101593216
>tfw no bizarre alien gf who lives in dark basement and only eats rodents
>>
i miss schizo anon
>>
File: file.jpg (224 KB, 1024x1792)
224 KB
224 KB JPG
>>101591840
>>
>>101593482
Are you that data hoarder?
>>
>>101590306
how do I sandbox SD on Windows so a rouge github tranny doesn't pwn my machine?
>>
>>101593518
I think you should not bother with genning images at all if this is the first question what comes to your mind.
>>
File: file.jpg (340 KB, 1024x1792)
340 KB
340 KB JPG
>>101593512
yh
>>101593518
>red scare
>>
>>101593538
I've been genning for months, and ever training my own loras. I just think it isn't smart to run so much ever-changing code without a sandbox.
>>
>>101593518
airgap your inference machine
>>
>>101593597
Well, you could use a virtual machine with gpu passthrough but I'm sure it will cause complications. Or maybe not. You could test it.
>>
File: 00363-4129701192.jpg (310 KB, 1920x1152)
310 KB
310 KB JPG
>>101590306
anyone know anything about goomaxing?
>>
File: ComfyUI_00648_.png (1.86 MB, 1280x1024)
1.86 MB
1.86 MB PNG
>>
File: HomoForest1.jpg (631 KB, 1720x1152)
631 KB
631 KB JPG
>>
File: HomoForest2.jpg (714 KB, 1720x1152)
714 KB
714 KB JPG
Mght as well post second one.
>>
>>101593657
yeah, because I have so many machines to spare...
>>101593668
I was considering it, but it's an atomic option. don't want to deal with two graphics cards etc. I guess I could run it as some restricted user, but it's Windows so I wouldn't expect this helping much.
>>
>>101594025
You don't need two cards because you already (in most cases) have an integrated gpu on your cpu.
I'm not familiar with virtual machines and gpu passthrough anyway but I know using integrated one is possible and you won't need two cards. Besides, you could have something very cheap as a second one anyway...
I'm sure an ordinary firewall configuration will help to isolate any unwanted traffic, for Windows I recommend SimpleWall by Henry++.
>>
File: ComfyUI_00665_.png (1.79 MB, 1280x1024)
1.79 MB
1.79 MB PNG
>>
>>101594095
How did you define this painterly style?
I just use
>oil painting by Brian LeBlanc
for example but I find it hard to get specific styles like this. They all look like 'something but not quite'. It should be doable with almost any SDXL checkpoint.
>>
File: 00005-3811351947.jpg (563 KB, 1024x1536)
563 KB
563 KB JPG
mornin
>>
>>101594192
good morning, i hope you are feeling well
>>
File: ComfyUI_00674_.png (1.98 MB, 1280x1024)
1.98 MB
1.98 MB PNG
>>101594143

Sometimes i use realistic , or oil painting or realistic anime, but it's quirky

t's more like the combination of euler, simple, and the right amount of denoise and cfg parameters that triggers it
>>
>>101594192
Gmorn
Was listening to
https://youtu.be/tpKCqp9CALQ?feature=shared
Thought about your muse in this situation
>>
File: 000000_15415_.png (2.39 MB, 998x1747)
2.39 MB
2.39 MB PNG
>>101593319
>bizarre alien gf who lives in dark basement and only eats rodents
>>
>>101594062
thanks anon, some good advice there. not all of it applies to my situation (for example, I don't have integrated GPU in my 5950x), but thanks anyway. I guess I was hoping Windows would have some better sandboxing mechanism by now, without resorting to virtualization. for now I will try using a restricted user. I guess it's marginally better than just raw dogging it as an admin user, kek.
>>
File: ComfyUI_00125_.png (1.48 MB, 960x1088)
1.48 MB
1.48 MB PNG
>>101594192
>want to gen a fox peeking his head up from the snow
>get a 1girl
thanks, pony
anyway, goo morning!
>>
>>101594406
I think you would be more vulnerable on Linux (as Python is integrated way deeper) than on Windows especially if you're using standalone venv things. I mean you are exaggerating this.
>>
File: forest1.jpg (108 KB, 1152x768)
108 KB
108 KB JPG
>>
File: 00018-77904884.jpg (382 KB, 1496x1000)
382 KB
382 KB JPG
indeed, it is caturday, expecting lots of feisty felines aujourdhui
>>
File: 000000_15419_.png (3.01 MB, 998x1747)
3.01 MB
3.01 MB PNG
>cats are egotistical, look at me I'm a cat. lizard hybrid, look at its eyes.
>>
>>101594639
Very nice style combination, amazing.
>>
>>101594660
Kolors model with ipadapter,
Art by (Theodoros Ralli:1.05) and (Michael Garmash
Jean Giraud)

posted workflow for base gen and upscale last thread.
>>
File: 0.jpg (291 KB, 1024x1024)
291 KB
291 KB JPG
>>
Anyone that paranoid has something to hide
>>
>>101594673
My gpu is so slow I haven't bothered with ipadapter, lmao.
Try adding
>(art by Paul Lehr:x.x)
He has very colorful paintings. Don't have a new example of this so I won't post any images.
>>
>>101594702
>>(art by Paul Lehr:x.x)
thank you, mines a 12GB 3060,, need a 16GB now. I have multiple workflows to finish one image kek...
>>
File: cables1.jpg (840 KB, 1720x1152)
840 KB
840 KB JPG
Not as strong as I wanted it to be but whatever.
>>
File: 00036-2599334433.jpg (707 KB, 1559x2250)
707 KB
707 KB JPG
>>
File: 000000_15423_.png (3.09 MB, 998x1747)
3.09 MB
3.09 MB PNG
>>101594841
Very nice.

>>101594702
>(art by Paul Lehr:1.2)
>>
>>101594889
lmao well... I can swear it works on some things
>>
File: cables2.jpg (267 KB, 1720x1152)
267 KB
267 KB JPG
>>
>>101594923
Super saturated, took away a bit of realism, but I also have other artists in prompt so most likely didn't transfer well, moved on.lol
>>
i think trani is shit (as a human)
>>
Any groundbreaking realism models for SD3?
>>
File: FeistyFelines01.jpg (612 KB, 1720x1152)
612 KB
612 KB JPG
>>
File: FeistyFelines02.jpg (557 KB, 1720x1152)
557 KB
557 KB JPG
>>
File: FeistyFelines03.jpg (663 KB, 1720x1152)
663 KB
663 KB JPG
>>
>>101595087
no one has been training for sd3 because of the shit license
>>
>>101595309
I'm wondering what has happened with these Chink models?
I tried the latest Comfy portable with Hunyuan DiT but it couldn't recognize the checkpoint (all from the comfy example page). I wonder maybe it was because the portable wasn't updated.
I mean aside from the initial propaganda I do think these might have potential when adequately trained.
>>
File: 000000_15428_.png (3.16 MB, 998x1747)
3.16 MB
3.16 MB PNG
>>
File: FeistyFelines04.jpg (568 KB, 1720x1152)
568 KB
568 KB JPG
Final.
>>
>Conned by a Bangledeshi and cucked by a Jeet
What's next for Stability AI?
>>
howdy
>>
File: ComfyUI_00798_.png (1.85 MB, 1280x1024)
1.85 MB
1.85 MB PNG
>>
File: PurpleWizard01.jpg (86 KB, 1152x768)
86 KB
86 KB JPG
>>
FInally,after so much struggle, i've made the scene i had in mind.
Multiple characters are a pain in the ass,also making them fight
>>
File: ComfyUI_00800_.png (1.96 MB, 1280x1024)
1.96 MB
1.96 MB PNG
here it is
>>
File: 00051-610656515.jpg (1.22 MB, 1560x2064)
1.22 MB
1.22 MB JPG
>>101595364
appreciated
>>
File: file.jpg (314 KB, 1024x1792)
314 KB
314 KB JPG
headache
>>
File: 00000_15430_.png (3.02 MB, 998x1747)
3.02 MB
3.02 MB PNG
>>
>>101595511
>>
>>101595364
neat
>>
>>101594978
based schizophrenic thread guardian of frenship
>>
File: grid-0001.jpg (1.6 MB, 3200x3200)
1.6 MB
1.6 MB JPG
>>
File: Watercolor01originalxl.png (2.53 MB, 1920x1080)
2.53 MB
2.53 MB PNG
Testing. I don't remember if this was done with albedoXl or vanillaSdxl.
Next one is the same prompt with Redmond but upscaled with sd1.5 and also in different resolution.
>>
File: Watercolor01Redmond.jpg (172 KB, 1720x1152)
172 KB
172 KB JPG
>>101596038
The aspect ratio makes it more squeezy.
>>
>>101596052
Original is better because it was upscaled with the same sdxl model.
What was the point? I don't know there is no point.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.