[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: 1720673958718656.png (711 KB, 1064x1192)
711 KB
711 KB PNG
Previous /sdg/ thread : >>101360320

>Beginner UI local install
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io
Metastable: https://metastable.studio

>Local install
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
AMD GPU: https://rentry.org/sdg-link#amd-gpu
Intel GPU: https://rentry.org/sdg-link#intel-gpu

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Auto1111 forks
SD.Next: https://github.com/vladmandic/automatic
Anapnoe UX: https://github.com/anapnoe/stable-diffusion-webui-ux

>Run cloud hosted instance
https://rentry.org/sdg-link#run-cloud-hosted-instance

>SD3 info & download
https://rentry.org/sdg-link#sd3
https://education.civitai.com/quickstart-guide-to-stable-diffusion-3
https://aitracker.art/viewtopic.php?t=57

>Try online without registration
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://openmodeldb.info

>Animation
https://rentry.org/AnimAnon
https://rentry.org/AnimAnon-AnimDiff
https://rentry.org/AnimAnon-Deforum

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>View and submit GPU performance data
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Share image prompt info
4chan removes prompt info from images, share them with the following guide/site...
https://rentry.org/hdgcb
https://catbox.moe

>Discord
6wUwtcJsr2

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg
>>
File: da nooz 10.jpg (505 KB, 1344x768)
505 KB
505 KB JPG
>mfw Resource news

07/11/2024

>LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models
https://github.com/LLaVA-VL/LLaVA-NeXT

>SD Webui Resource Monitor
https://github.com/Haoming02/sd-webui-resource-monitor

>AI's Energy Demands Are Out of Control
https://www.wired.com/story/ai-energy-demands-water-impact-internet-hyper-consumption-era/

>German court rules AI output can be protectable
https://devclass.com/2024/07/10/german-court-rules-ai-output-can-be-protectable-ups-stakes-for-machine-generated-code/?td=rt-3a

>EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
https://badtobest.github.io/echomimic.html

>LaRa: Efficient Large-Baseline Radiance Fields
https://github.com/autonomousvision/LaRa

>Grounding Image Matching in 3D with MASt3R
https://github.com/naver/mast3r

07/10/2024

>ROCm 6.1.2 HIP SDK 24.Q3
https://rocm.docs.amd.com/projects/install-on-windows/en/latest/

>ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction
https://github.com/haoosz/ConceptExpress

>EasyAnimate-V3-XL
https://huggingface.co/alibaba-pai/EasyAnimateV3-XL-2-InP-960x960

>AMD to Acquire Silo AI
https://www.globenewswire.com/news-release/2024/07/10/2911168/0/en/AMD-to-Acquire-Silo-AI-to-Expand-Enterprise-AI-Solutions-Globally.html

07/09/2024

>Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images
https://tailor3d-2024.github.io

>PaintsUndo: A Base Model of Drawing Behaviors in Digital Paintings
https://lllyasviel.github.io/pages/paints_undo

>Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision
https://orrzohar.github.io/projects/video-star

>Compositional Video Generation as Flow Equalization
https://adamdad.github.io/vico

>Big Tech needs to generate $600 billion to justify AI hardware expenditure
https://www.techspot.com/news/103699-big-tech-needs-generate-600-billion-annual-revenue.html
>>
>mfw Research news

07/11/2024

>4DiM: Controlling Space and Time with Diffusion Models
https://4d-diffusion.github.io

>Multi-task Prompt Words Learning for Social Media Content Generation
https://arxiv.org/abs/2407.07771

>VEnhancer: Generative Space-Time Enhancement for Video Generation
https://arxiv.org/abs/2407.07667

>Tuning Vision-Language Models with Candidate Labels by Prompt Alignment
https://arxiv.org/abs/2407.07638

>MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
https://arxiv.org/abs/2407.07614

>InstructLayout: Instruction-Driven 2D and 3D Layout Synthesis with Semantic Graph Prior
https://arxiv.org/abs/2407.07580

>Zero-Shot Class Unlearning in CLIP with Synthetic Samples
https://arxiv.org/abs/2407.07485

>Video-to-Audio Generation with Hidden Alignment
https://sites.google.com/view/vta-ldm

>A Survey of Attacks on Large Vision-Language Models: Resources, Advances, and Future Trends
https://arxiv.org/abs/2407.07403

>HiLight: Technical Report on the Motern AI Video Language Model
https://arxiv.org/abs/2407.07325

>Few-Shot Image Generation by Conditional Relaxing Diffusion Inversion
https://arxiv.org/abs/2407.07249

>Reference-based Controllable Scene Stylization with Gaussian Splatting
https://arxiv.org/abs/2407.07220

>ColorPeel: Color Prompt Learning with Diffusion Models via Color and Shape Disentanglement
https://moatifbutt.github.io/colorpeel

>CamFreeDiff: Camera-free Image to Panorama Generation with Diffusion Model
https://arxiv.org/abs/2407.07174

>Diffusion Model-Based Video Editing: A Survey
https://arxiv.org/abs/2407.07111

07/10/2024

>V-VIPE: Variational View Invariant Pose Embedding
https://v-vipe.github.io

>Latent Space Imaging
https://arxiv.org/abs/2407.07052

>HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance
https://arxiv.org/abs/2407.06937

>Rethinking I2V Adaptation: An Object-centric Perspective
https://arxiv.org/abs/2407.06871
>>
File: 4c.jpg (41 KB, 1024x1024)
41 KB
41 KB JPG
First for desk toys
>>
>>
File: file.jpg (218 KB, 1200x1188)
218 KB
218 KB JPG
I think Gradio SUCKS!
>>
File: 00344-731931874.jpg (459 KB, 3584x2016)
459 KB
459 KB JPG
>>101368617
about right.. but whats the alternative? electron? or god forbid write a native app?
>>
anon asked for butterflies, you did not deliver.
>>
>>101368743
merkava just refilled
>>
>>101368376
merkava just refilled, let's go boyhs
>>
>>
File: 00178-3280425584.jpg (159 KB, 1024x1376)
159 KB
159 KB JPG
gotta take a quick break
>>
>>101368661
How does writing HTML/JS melt the brain of Python devs?
>>
>>101368833
HTML/JS isn't programming, Python is
>>
Beep boop
>>
>>101368863
That's why you need daddy Gradio to hold your hand.
>>
you don't need gradio
>>
>>101368833
python devs cant into async or dom interactions
>>
>>101368882
you don't need async in python
>>
>>101368882
>thinking multithreaded applications need python
>not just java
>>
>>101368786
>>101368869
rodit
>>
>>101368886
thats what I'm saying. python dvs can't into async, which is why javascript melts their lil puny brains
>>
File: 00028-31126146.png (3.32 MB, 1280x1920)
3.32 MB
3.32 MB PNG
>>
>>101368833
>>101368873
Can't you just use a CLI to SD?
>>
>>101368958
Nice rodit
>>
>>
>>101369170
prompt: "big booba animu gurl"
the style is extremely fucking generic anime shit
>>
File: 00001-377918366.png (3.77 MB, 1280x1920)
3.77 MB
3.77 MB PNG
>>101369170
Get better taste in styles
>>
>>101369220
ironic
>>
>>101369253
forgot your gen
>>
>>101369210
>>101369220
thanks guys :3
>>
>>101369170
>since I'm here, I was wondering if anyone has any idea how to prompt something like this, or at least the style picrel has?

serious answer, not just for your specific question but in general

https://huggingface.co/spaces/pharmapsychotic/CLIP-Interrogator

upload your picture and get a prompt
>>
>>101369410
clip similarity is bullshit
>>
File: 00005-1395471645_cleanup.png (3.57 MB, 1280x1920)
3.57 MB
3.57 MB PNG
>>101369272
You're welcome
>>
File: ComfyUI_05103_.jpg (2.31 MB, 4608x3584)
2.31 MB
2.31 MB JPG
>>
File: 00104-24121795.png (1.88 MB, 1024x1280)
1.88 MB
1.88 MB PNG
This is jungle fever

testing stealth PAPA NOVEMBER GOLF info transmission on channel Alpha

please acknowledge
>>
>>101369511
Weapon, hands, checks-out.
nice gen
>>
>adult body
>childs face
>>
That's not all she's got that's child sized.
>>
>>101369620
Well women refused to put on the anti-aging helmets so
>>
Finally figured out how to make my model do what I want how I want.

Except it flat out fucking refuses to properly light a scene. The colors aren't washed out like the vae-issue, but it just won't fucking illuminate anything properly.
>>
>>101369690
....nevermind.
Adding a lamp works. wtf
>>
>>101369700
I really hate it when terms like backlit,sidelit,underlit, etc stop working.
>>
>>101369700
like in real life
>>
File: 00190-1268727929.jpg (268 KB, 1024x1376)
268 KB
268 KB JPG
>>
File: 00007-4269318050.png (3.53 MB, 1280x1920)
3.53 MB
3.53 MB PNG
>>
File: sig_0040.jpg (180 KB, 1152x896)
180 KB
180 KB JPG
>>
>>101369170
T-ponynai3
>>
File: sig_0044.jpg (231 KB, 1152x896)
231 KB
231 KB JPG
>>
File: sig_0050.jpg (248 KB, 1152x896)
248 KB
248 KB JPG
>>
File: 00001-3361752058.jpg (355 KB, 1208x1568)
355 KB
355 KB JPG
>>
>>101369170
masterpiece,bestquality, score_9, sweating massive big tity bitch in a white [see-through] tank top looking down at viewer astonished to see you in the bushes, one tiny tooth, grey hair, big hat has cattle, pink bow, sweat droplet on chin, it's nice out wow look at those clouds
>>
>>101369898
>>101369855
interesting styles
>>
What's my best option to fix hands? From what I've read it's inpaint with a controlnet? How do I go about that and would it matter what checkpoint I used for my base image when doing this?
>>
File: Kisaragi.jpg (305 KB, 1536x1536)
305 KB
305 KB JPG
>>
File: 00013-334316311.jpg (452 KB, 1280x1920)
452 KB
452 KB JPG
>>
>>101370009
Adetailer used to do pretty good but I can't use it on my current set up.

Oddly enough, 2dnpony v10 does hands and even feet pretty reliably
>>
File: 00014-1542759083.png (3.77 MB, 1280x1920)
3.77 MB
3.77 MB PNG
>>
File: 00015-825191996.png (3.81 MB, 1280x1920)
3.81 MB
3.81 MB PNG
>>
>>101370009
The vast majority of pony derivatives can do hands and feet pretty well and then running it through a detailer helps even more. It’s more worth finding out why you gen very bad hands instead of fixing them.
>>
>>101370041

you could use bokeh to avoid twisted horror creatures in background
>>
>>101370151
in negs or positives?
>>
>>101370151
that’s their style bro
>>
>>101369602
confirm trans(mission)
>>
the eff.. I just updated automatic1111 and everything is 1/4 the speed .. heck I should have backuped .. it upgraded from pytorch2.0.1+cu118 to pytorch2.1.2+cu121

is there a known bug? cause with 2 loras and an embeding I suddenly have to wait 5 minutes for the picture it ran 1 minute in previously
>>
>>101370165
positives, of course, depth of field should work as well but it's longer

it also blurs wrecked cars or errors in buildings and gives a more natural photographic feel ,
or pick a simple background where nothing can go wrong
>>
>>101370121
More quadriceps
>>
File: 00017-1039132344_cleanup.png (3.93 MB, 1280x1920)
3.93 MB
3.93 MB PNG
>>101370279
Okay thanks I'll try that

>>101370280
>>
>>101370151
dont listen to this anon he's trying to make the twisted horror creature blurry so you don't see it coming
>>
>>
>>101370337
cool
>>
File: 00024-289007490.png (3.53 MB, 1280x1920)
3.53 MB
3.53 MB PNG
>>
man those threads are unironically depressing now
>>
>>101370174
did you get the prompt?
>>
>>101370436
which threads
>>
>>101370436
yeah
>>
File: uyuytuugjh.gif (2.82 MB, 512x688)
2.82 MB
2.82 MB GIF
>>
>>101370436
>>101370455
He means her clothing invokes feelings of meloncholy in a literal sense at present
>>
File: media_GSIv_PWaUAURhiQ.jpg (161 KB, 1024x1024)
161 KB
161 KB JPG
https://xcancel.com/cloneofsimo/status/1811062800695054500#m
Looks like this mf is making a really uncensored model based on the SD3 architecture
https://github.com/cloneofsimo/minRF
>>
>>101370534
Aren't base models really expensive though? Is he going to train it or just provide the tech?
>>
>>101370436
i miss schizo anon too
>>
>>101370573
Looks like he's gonna train the whole stuff and give us the model, let's wait and see I guess
>>
>>101370241
meh.. auto1111 is dead it seems, no updates since may! .. just installed SDNext works fine and flawless.. guess I have to wrap my head around new slightly different interfaces
>>
>>101370534
>>101370573
Nvm dug up this

>There were unintended hype here, im sorry for overhyping it on the last post.

>* The model is not open-midjourney-class model nor should you expect it to.

>* The model is very large (6.8B) and very undertrained. (30% of SD3 compute budget) So it will be more difficult to train with a consumer gpu, but we might continue to train it in the future.

>* The model is doing great on some evals, and imo is generally just better than sd3 medium, but not by a large margin.

>* but it's not finished yet. I'll upload some post now and then but it's never intended to be like, a statement to claim victory or whatever
>>
>>101370667
>The model is very large (6.8B)
that's the imagegen model + the text encoder? That's kinda big if it's just the imagegen model
>>
Need some distributed computing BOINC style load sharing.

There's millions of us coomers out here that'd pitch in.
>>
>>101370534
Here's to hoping.
>>
>>101370667
hoping/>
>>
Free from 3 day jail
>>
>>101370797
wish it were longer
>>
>>101370534
I don't know if I should keep scrolling through his twitter posts lest I acquire more anticipation
>>
>It's the episode where Truman realizes he can generate all the Tinkerbell bukkake porn he wants then spends a week doing literally nothing else while drinking.
>>
>>101370797
I bet there's some obvious reason money is not square shaped that we never have to think about.
>>
>>101370827
Was an Azula nip slip

>>101370868
Wasn't there a paper coin bill in Europe at one time?
>>
>>101370855
did something similar for a few months
>>
>>
Does Kolors work on windows if we use ComfyUI with it?
>>
>>
>>101368376
anyone else here experimenting with lora training? whats your go too settings for simple characters (not photorealistic), an anon mentioned using clip skip 2 instead of one and lowering network alpha to 32-16 but hasnt seemed to have made any significant difference the character has realistic puffy lips despite being an anime girl, also my training data loss never goes higher than 0.1 in 10 epochs, am i underfitting?
>>
downscaling makes all my images look bad, what do? do i just have to resize in an external editor? all resizing i tried in comfy makes it come out blurry when i just wanna shrink an image
>>
File: SDG_News_00263_.png (1.76 MB, 1560x896)
1.76 MB
1.76 MB PNG
>mfw Resource news

07/11/2024

>LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models
https://github.com/LLaVA-VL/LLaVA-NeXT

>SD Webui Resource Monitor
https://github.com/Haoming02/sd-webui-resource-monitor

>AI's Energy Demands Are Out of Control
https://www.wired.com/story/ai-energy-demands-water-impact-internet-hyper-consumption-era/

>German court rules AI output can be protectable
https://devclass.com/2024/07/10/german-court-rules-ai-output-can-be-protectable-ups-stakes-for-machine-generated-code/?td=rt-3a

>EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
https://badtobest.github.io/echomimic.html

>LaRa: Efficient Large-Baseline Radiance Fields
https://github.com/autonomousvision/LaRa

>Grounding Image Matching in 3D with MASt3R
https://github.com/naver/mast3r

07/10/2024

>ROCm 6.1.2 HIP SDK 24.Q3
https://rocm.docs.amd.com/projects/install-on-windows/en/latest/

>ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction
https://github.com/haoosz/ConceptExpress

>EasyAnimate-V3-XL
https://huggingface.co/alibaba-pai/EasyAnimateV3-XL-2-InP-960x960

>AMD to Acquire Silo AI
https://www.globenewswire.com/news-release/2024/07/10/2911168/0/en/AMD-to-Acquire-Silo-AI-to-Expand-Enterprise-AI-Solutions-Globally.html

07/09/2024

>Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images
https://tailor3d-2024.github.io

>PaintsUndo: A Base Model of Drawing Behaviors in Digital Paintings
https://lllyasviel.github.io/pages/paints_undo

>Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision
https://orrzohar.github.io/projects/video-star

>Compositional Video Generation as Flow Equalization
https://adamdad.github.io/vico

>Big Tech needs to generate $600 billion to justify AI hardware expenditure
https://www.techspot.com/news/103699-big-tech-needs-generate-600-billion-annual-revenue.html
>>
File: SDG_News_00239_.png (1.67 MB, 1560x896)
1.67 MB
1.67 MB PNG
>mfw Research news

07/11/2024

>4DiM: Controlling Space and Time with Diffusion Models
https://4d-diffusion.github.io

>Multi-task Prompt Words Learning for Social Media Content Generation
https://arxiv.org/abs/2407.07771

>VEnhancer: Generative Space-Time Enhancement for Video Generation
https://arxiv.org/abs/2407.07667

>Tuning Vision-Language Models with Candidate Labels by Prompt Alignment
https://arxiv.org/abs/2407.07638

>MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
https://arxiv.org/abs/2407.07614

>InstructLayout: Instruction-Driven 2D and 3D Layout Synthesis with Semantic Graph Prior
https://arxiv.org/abs/2407.07580

>Zero-Shot Class Unlearning in CLIP with Synthetic Samples
https://arxiv.org/abs/2407.07485

>Video-to-Audio Generation with Hidden Alignment
https://sites.google.com/view/vta-ldm

>A Survey of Attacks on Large Vision-Language Models: Resources, Advances, and Future Trends
https://arxiv.org/abs/2407.07403

>HiLight: Technical Report on the Motern AI Video Language Model
https://arxiv.org/abs/2407.07325

>Few-Shot Image Generation by Conditional Relaxing Diffusion Inversion
https://arxiv.org/abs/2407.07249

>Reference-based Controllable Scene Stylization with Gaussian Splatting
https://arxiv.org/abs/2407.07220

>ColorPeel: Color Prompt Learning with Diffusion Models via Color and Shape Disentanglement
https://moatifbutt.github.io/colorpeel

>CamFreeDiff: Camera-free Image to Panorama Generation with Diffusion Model
https://arxiv.org/abs/2407.07174

>Diffusion Model-Based Video Editing: A Survey
https://arxiv.org/abs/2407.07111

07/10/2024

>V-VIPE: Variational View Invariant Pose Embedding
https://v-vipe.github.io

>Latent Space Imaging
https://arxiv.org/abs/2407.07052

>HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance
https://arxiv.org/abs/2407.06937

>Rethinking I2V Adaptation: An Object-centric Perspective
https://arxiv.org/abs/2407.06871
>>
>>101371118
>SLUTXTROPLIOMD
>>
you are pathetic
>>
woman collapses and falls apart into black goo
>>
File: file.png (36 KB, 228x104)
36 KB
36 KB PNG
>>101371164
>>
>>101371172
who?
>>
>>101371225
meemaw
>>
>>
>mfw Resource news

07/11/2024

>LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models
https://github.com/LLaVA-VL/LLaVA-NeXT

>SD Webui Resource Monitor
https://github.com/Haoming02/sd-webui-resource-monitor

>AI's Energy Demands Are Out of Control
https://www.wired.com/story/ai-energy-demands-water-impact-internet-hyper-consumption-era/

>German court rules AI output can be protectable
https://devclass.com/2024/07/10/german-court-rules-ai-output-can-be-protectable-ups-stakes-for-machine-generated-code/?td=rt-3a

>EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
https://badtobest.github.io/echomimic.html

>LaRa: Efficient Large-Baseline Radiance Fields
https://github.com/autonomousvision/LaRa

>Grounding Image Matching in 3D with MASt3R
https://github.com/naver/mast3r

07/10/2024

>ROCm 6.1.2 HIP SDK 24.Q3
https://rocm.docs.amd.com/projects/install-on-windows/en/latest/

>ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction
https://github.com/haoosz/ConceptExpress

>EasyAnimate-V3-XL
https://huggingface.co/alibaba-pai/EasyAnimateV3-XL-2-InP-960x960

>AMD to Acquire Silo AI
https://www.globenewswire.com/news-release/2024/07/10/2911168/0/en/AMD-to-Acquire-Silo-AI-to-Expand-Enterprise-AI-Solutions-Globally.html

07/09/2024

>Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images
https://tailor3d-2024.github.io

>PaintsUndo: A Base Model of Drawing Behaviors in Digital Paintings
https://lllyasviel.github.io/pages/paints_undo

>Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision
https://orrzohar.github.io/projects/video-star

>Compositional Video Generation as Flow Equalization
https://adamdad.github.io/vico

>Big Tech needs to generate $600 billion to justify AI hardware expenditure
https://www.techspot.com/news/103699-big-tech-needs-generate-600-billion-annual-revenue.html
>>
>>101370909
Kind of thinking about setting up another box to just gen on and use this one for LLM cards from the cream of the crop, as it were.
>>
>mfw Research news

07/11/2024

>4DiM: Controlling Space and Time with Diffusion Models
https://4d-diffusion.github.io

>Multi-task Prompt Words Learning for Social Media Content Generation
https://arxiv.org/abs/2407.07771

>VEnhancer: Generative Space-Time Enhancement for Video Generation
https://arxiv.org/abs/2407.07667

>Tuning Vision-Language Models with Candidate Labels by Prompt Alignment
https://arxiv.org/abs/2407.07638

>MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
https://arxiv.org/abs/2407.07614

>InstructLayout: Instruction-Driven 2D and 3D Layout Synthesis with Semantic Graph Prior
https://arxiv.org/abs/2407.07580

>Zero-Shot Class Unlearning in CLIP with Synthetic Samples
https://arxiv.org/abs/2407.07485

>Video-to-Audio Generation with Hidden Alignment
https://sites.google.com/view/vta-ldm

>A Survey of Attacks on Large Vision-Language Models: Resources, Advances, and Future Trends
https://arxiv.org/abs/2407.07403

>HiLight: Technical Report on the Motern AI Video Language Model
https://arxiv.org/abs/2407.07325

>Few-Shot Image Generation by Conditional Relaxing Diffusion Inversion
https://arxiv.org/abs/2407.07249

>Reference-based Controllable Scene Stylization with Gaussian Splatting
https://arxiv.org/abs/2407.07220

>ColorPeel: Color Prompt Learning with Diffusion Models via Color and Shape Disentanglement
https://moatifbutt.github.io/colorpeel

>CamFreeDiff: Camera-free Image to Panorama Generation with Diffusion Model
https://arxiv.org/abs/2407.07174

>Diffusion Model-Based Video Editing: A Survey
https://arxiv.org/abs/2407.07111

07/10/2024

>V-VIPE: Variational View Invariant Pose Embedding
https://v-vipe.github.io

>Latent Space Imaging
https://arxiv.org/abs/2407.07052

>HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance
https://arxiv.org/abs/2407.06937

>Rethinking I2V Adaptation: An Object-centric Perspective
https://arxiv.org/abs/2407.06871
>>
>mfw Research news

07/11/2024

>4DiMMs: Controlling Space and Time with Diffusion Models
https://4d-diffusion.github.io

>Multi-task Prompt Words Learning for Social Media Content Generation
https://arxiv.org/abs/2407.07771

>VEnhancer: Generative Space-Time Enhancement for Video Generation
https://arxiv.org/abs/2407.07667

>Tuning Vision-Language Models with Candidate Labels by Prompt Alignment
https://arxiv.org/abs/2407.07638

>MARGE: Race Mixing of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
https://arxiv.org/abs/2407.07614

>InstructLayout: Instruction-Driven 2D and 3D Layout Synthesis with Semantic Graph Prior
https://arxiv.org/abs/2407.07580

>Zero-Shot Class Unlearning in CLIP with Synthetic Samples
https://arxiv.org/abs/2407.07485

>Video-to-Audio Generation with Hidden Alignment
https://sites.google.com/view/vta-ldm

>A Survey of Attacks on Large Vision-Language Models: Resources, Advances, and Future Trends
https://arxiv.org/abs/2407.07403

>HiLight: Technical Report on the Motern AI Video Language Model
https://arxiv.org/abs/2407.07325

>Few-Shot Image Generation by Conditional Relaxing Diffusion Inversion
https://arxiv.org/abs/2407.07249

>Reference-based Controllable Scene Stylization with Gaussian Splatting
https://arxiv.org/abs/2407.07220

>ColorPeel: Color Prompt Learning with Diffusion Models via Color and Shape Disentanglement
https://moatifbutt.github.io/colorpeel

>CamFreeDiff: Camera-free Image to Panorama Generation with Diffusion Model
https://arxiv.org/abs/2407.07174

>Diffusion Model-Based Video Editing: A Survey
https://arxiv.org/abs/2407.07111

07/10/2024

>POO-PIPE: Variational View Invariant Pose Embedding
https://v-vipe.github.io

>Latent Space Imaging
https://arxiv.org/abs/2407.07052

>HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance
https://arxiv.org/abs/2407.06937

>Dethinking I2V Adaptation: An Chud-centric Perspective
https://arxiv.org/abs/2407.06871
>>
>mfw Research news

07/11/2026

>4DiMMs: Controlling Space and Time with Diffusion Models
https://4d-diffusion.github.io

>LeLeMulti-task Prompt Words Learning for Social Media Content Generation
https://arxiv.org/abs/2407.07771

>VGayEnhancer: Generative Space-Time Enhancement for Video Generation
https://arxiv.org/abs/2407.07667

>Pruning Pission-Language Models with Candidate Labels by Prompt Alignment
https://arxiv.org/abs/2407.07638

>MARGE: Race Mixing of Auto-Regressive Models for Fine-grained Text-to-image
https://arxiv.org/abs/2407.07614

>InstructLayout: Depissistruction-Driven 2D and 3D Layout Synthesis
https://arxiv.org/abs/2407.07580

>Zero-Slut Class Unlearning in CLIP with Synthetic Samples
https://arxiv.org/abs/2407.07485

>Video-to-Audio Niggereneration with Hidden Alignment
https://sites.google.com/view/vta-ldm

>A Survey of Attacks on Large Vision-Language Models: Resources, Advances, and Future Trends
https://arxiv.org/abs/2407.07403

>BiLight: LGBTechnical Report on the Motern AI Video Language Model
https://arxiv.org/abs/2407.07325

>Sloot-Shot Image Generation by Conditional Relaxing Diffusion Inversion
https://arxiv.org/abs/2407.07249

>Deinferreffettereference-based Controllable Scene Stylization with Gaussian Splatting
https://arxiv.org/abs/2407.07220

>ColorPajeet: Color Prompt Learning with Diffusion Models via Color and Shape Disentanglement
https://moatifbutt.github.io/colorpeel

>CumFreeDiff: Camera-free Image to Panorama Generation with Diffusion Model
https://arxiv.org/abs/2407.07174

>Pissiffusion Model-Based Video Editing: A Survey
https://arxiv.org/abs/2407.07111

>POO-PIPE: Variational View Invariant Pose Embedding
https://v-vipe.github.io

>Gaytent Space Imaging
https://arxiv.org/abs/2407.07052

>HumanLefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance
https://arxiv.org/abs/2407.06937

>Dethinking I2V Adaptation: An Chud-centric Perspective
https://arxiv.org/abs/2407.06871
>>
will trani save StabilityAIâ„¢?
>>
pitiful display of dignity
>>
File: ComfyUI_05170_.jpg (3.01 MB, 4608x3584)
3.01 MB
3.01 MB JPG
>>
@101371353
sad nig*bo! gone too far(t) this time
>>
>>
>>
File: ComfyUI_05181_.jpg (3.31 MB, 4608x3584)
3.31 MB
3.31 MB JPG
>>
File: 1720679116987055.jpg (104 KB, 732x1024)
104 KB
104 KB JPG
I used the same tags as this image on the booru but still won't get this position. Any ideas?
>>
>>101371416
love the screencap style and ears
>>
@101371431
d*bo
>>
>>101371438
not debo
>>
File: ComfyUI_05184_.jpg (3.25 MB, 4608x3584)
3.25 MB
3.25 MB JPG
>>101371431
sank yew!
>>
>>101371321
16ch POO-PIPE when? It's over!
>>
>>101371457
forgot your gen
>>
>"stealth posting"
>>
>>101371571
>|["`'quoting someone who is themselves quoting someone else'`"]
what the fuck
>>
File: ComfyUI_05203_.jpg (3.55 MB, 4608x3584)
3.55 MB
3.55 MB JPG
>>
File: file.png (271 KB, 512x512)
271 KB
271 KB PNG
>>
I hope the anatomic diversity has been kept to a minimum with this one.
>>
>
>>
File: ComfyUI_05218_.jpg (3.18 MB, 4608x3584)
3.18 MB
3.18 MB JPG
>>
>>101371563
nah I didn’t
>>
>>101371836
>nogen
>>
>>101371846
yeah good job
>>
looks like the last wave of bans expired and all the ne'er-do-wells are returning with no lessons learned
>>
imagine caring
>>
File: galacticpepe.png (3.31 MB, 1536x1536)
3.31 MB
3.31 MB PNG
>>
>>101371898
this general would be a lot better if you would leave forever d*b*
>>
>>101371923
don't call me d*b*, d*b*
>>
>>101371898
haven’t been banned since samus
>>
>>101371882
thanks
>>
>>101371898
thread schizo
>>
File: ComfyUI_05229_.jpg (3.06 MB, 3584x4608)
3.06 MB
3.06 MB JPG
>>
so, does anyone want to talk about lora training? i dont come here much but it looks like half naked anime girls mostly, am i in the wrong place?
>>
>>101372088
shut up, faggot
>>
>>101372088
never enough half naked anime girls desu
>>
>>101372088
sure what would you like to talk about
>>
>>101372116
>>101372138
look the half naked anime girls are everywhere online i thought this was a technology board, where people actually talk about the technology relating to making half naked anime girls

>>101372154
hey, im working on lora training at the moment and having some issue trying to get it right, i thought i might be underfitting it since the training rate never seems to spike after 10 epochs, the character is suppose to be a simplified one (non photorealistic, anime girl esque, flat colors ect) but when its trained it comes out realistic looking despite being trained on an anime model and ran on sd1.5 anime model, just want a pointer as to where I'm going wrong, been experimenting with repeats, epochs, max training steps, image count and seemingly getting nowhere
>>
>he doesn't know
>>
should we tell him?
>>
File: 1714961059091415.png (1.03 MB, 1024x1024)
1.03 MB
1.03 MB PNG
>>101371898
Nipplegate
>>
>>101369511
Nice, but what calibre is that, .177 BB?
>>
File: ComfyUI_05207_.jpg (3.39 MB, 4608x3584)
3.39 MB
3.39 MB JPG
>>
File: 000000_14479_.png (2.21 MB, 1342x1042)
2.21 MB
2.21 MB PNG
>>
i'm working on a local image ranking tool where you just get shown two images from a set and you pick the better one to help A/B test samplers/prompts etc, should i make it into a auto extension or a standalone thing

(so far it's mostly confirmed what everyone already knows DPM++ 2M SDE is goated)
>>
File: 00232-1955254351.jpg (224 KB, 1024x1376)
224 KB
224 KB JPG
>>
>>101368661
I wrote my personal flutter client for Android and PC that uses swarm API
>>
File: ComfyUI_05279_.jpg (3.41 MB, 3584x4608)
3.41 MB
3.41 MB JPG
>>
File: 000000_14485_.png (2.49 MB, 1342x1042)
2.49 MB
2.49 MB PNG
>>
Ted stated "Parlay. Is metal G. G Ted mode. iBark."
I post bears and have talking robots...
>W220K2
>>
my therapist says I have a wide sexual palette…
>>
>>101372188
>sd1.5 anime model
Which one? You should train on NAI if it's a 1.5 model.
>>
File: 00001-3199548016.png (1.34 MB, 1064x1192)
1.34 MB
1.34 MB PNG
I got soap in my eye and it really hurts.
>>
>>101372770
this is why I never shower. too risky
>>
> inpaint a hand
> turns into a foot instead
> increase the padding by 15 pixels, so a piece of the shoulder is in the inpaint image now
> it turns into a hand
> thumb on the wrong side
> add one pixel from the other hand, change image to horizontal, remove the shoulder
> inpaint works, but hand too small in total image, not enough pixels. hand now shaped correctly but still kinda wonky
> copy/paste new image into the inpainting window
> restart from the top with denoising lowered
> starts turning into a foot again
>>
Anyone making money on this shit yet
>>
>>101372873
yeah a little
>>
>>101367606
catbox?
>>
File: 0044555-[TFT]-3456350.jpg (1.02 MB, 2560x2560)
1.02 MB
1.02 MB JPG
fox
>>
>>101372804
>>101372904
debo btw
>>
File: 00276-3341764754.jpg (175 KB, 1024x1376)
175 KB
175 KB JPG
not voting in novembrr
>>
>>101372992
Sticking it to Trump and Biden. Yeah.
>>
>>101372914
If you’re around later

otherwise it’s 0003 - Pony Delta with laserflip, yd, cutesexyrobutts, and redrop LoRA at 0.3 weight.
>>
File: 000000_14495_.png (2.13 MB, 1352x1052)
2.13 MB
2.13 MB PNG
>>
>>101373050
I'll be around, just (you) me. I'd appreciate a catbox but thanks for the reply in the meantime, that model is nice.
>>
File: 00007-396906596.jpg (307 KB, 2048x2048)
307 KB
307 KB JPG
damnit pytorch! whats wrong with you? I just spent two hours wondering what the heck is wrong with my GPU until I found that cuda121 version is like totally crippled and runs like a snail on my 4090 compared to cuda118 version of pytorch .. damnit! anyhow ... you all having a good evenin?
>>
File: 00308-3197843647.jpg (180 KB, 1024x1536)
180 KB
180 KB JPG
https://youtu.be/Y_EougWltgs?si=mf_lY7sYGOSmkoY7
proompting soundtrack
>>
File: dedd_00062_.png (2.41 MB, 1824x1248)
2.41 MB
2.41 MB PNG
@mizu
I just tried using the AND keyword with a1111 parsing and it blew up. is that keyword supposed to work?
>>
>>101372814
I import the image to paint.net, copy + paste a hand I found on yandex images, and then use Differential Diffusion masking workflow to inpaint. That's my technique
>>
>>101373379
(orange cat AND bread)
it was meant to fuse two prompts .. not sure if its even still supported? but thats how I used it in October 2022
>>
>>101373338

It's a little saucy sometimes:

Error: CUDA out of memory. Tried to allocate 2.00 MiB (GPU 0; 7.91 GiB total capacity; 7.46 GiB already allocated; 6.50 MiB free; 7.71 GiB reserved in total by PyTorch)
>>
>>101373411
thought the way was (cat|bread) or [cat|bread]
>>
>>101373405
Sometimes I will grab a hand from somewhere and use it or use meshgraphormer but as much as I complain in my experience manually doing it that way is faster for me if I am going through a lot of images semi-quickly especially if I clean it up a bit in gimp first. I was mostly trying to bring up how sensitive inpainting is to the other context in the image you supply it.
>>
https://huggingface.co/fal/AuraFlow

What the fuck is this?
>>
File: 00021-1236413248.png (2.42 MB, 1536x1536)
2.42 MB
2.42 MB PNG
>>101373426
ya thats the better syntax .. but AND as keyword was a thing at least in SD1.x .. or its an automatic1111 implementation thing .. even sdxl seems to still give some resullts for (orange cat) AND bread
>>
File: dedd_00063_.png (2.41 MB, 1824x1248)
2.41 MB
2.41 MB PNG
>>101373411
thats how I'm using the keyword but it breaks in mizu's encoder node

>>101373426
[cat|bread] basically edits the prompt every other step, searching for cat then dog then cat again. AND creates a completely separate conditioning which acts as a parallel sampling (from how I understand it, but I never really understood it)

>>101373443
https://blog.fal.ai/auraflow/
>>
>>101373466
>Polygon hair on a semi realistic stylized model

Cursed.
>>
>>101373441
Have you tried adjusting the sensitivity with a Gaussian Blur node
>>
File: dedd_00064_.png (2.48 MB, 1824x1248)
2.48 MB
2.48 MB PNG
>>101373518
its more papercraft than polygon
>>
File: 00332-3765188240.jpg (159 KB, 1536x1024)
159 KB
159 KB JPG
cuddly, (cat or..?)
>>
>>101373466
>Right around this time, we were convinced that a SOTA open-sourced model is the way forward for this space to move forward.
>>
File: AuraFlow-EXP-0000X-Quokka.png (1.75 MB, 1024x1024)
1.75 MB
1.75 MB PNG
>>101373466
I don't think Auraflow knows Quokkas either.
>>
>>101370128
I think its my loras, I'm realizing what is consistently giving me bad hand gens with some. Aside from that, I also started using depth map + openpose and can consistently get good hand gens now. What other things should I be wary of to avoid bad hands?
>>
File: dedd_00082_.png (2.53 MB, 1824x1248)
2.53 MB
2.53 MB PNG
>>101373729
it seems to know they're kinda australian and kinda marsupial, lol. thats such a weird close-but-not-close-enough gen
>>
>>101372188
did you tag your data as anime or illustrated?
>>
The captcha is mad because I always click the center pixels. It's not my fault; I'm just good at pool, and darts, and bowling, and apparently environmental science projects. Then the captcha didn't let me post until I disconnected. My dogbot called me a magic fucking zerker. Jannies stay piss mad at puddles.

"Mainframe king! I bark for you!" EINE was easy on an i5.

>28NN0D

To my oh dear.
>
>>
>>101373780
reason is one of the few things I know about loras is if you tag something it will be "ignored", and you shouldn't tag stuff that you want to show up when you use the lora. This might extend to the medium or style, that would make sense.
>>
File: dedd_00083_.png (2.29 MB, 1824x1248)
2.29 MB
2.29 MB PNG
why is cyberspace an impossible concept. it just gives me monitors
>>
File: 00083-284097713.jpg (380 KB, 2560x1920)
380 KB
380 KB JPG
>>101373844
I guess you have to describe the concept in detail, something like "floating in light, surrounded by holographic geometric scultures" etc. .. cyberspace is to abstract, maybe 1980s science fiction helps to?
>>
File: 00080-3649845181.png (2.63 MB, 1152x2048)
2.63 MB
2.63 MB PNG
>>
File: 00089-3777269090.jpg (442 KB, 2560x1920)
442 KB
442 KB JPG
>>101373844
my cyberspace take
> immaterial space, transparent, translucent, ghost like, floating in light, surrounded by holographic geometric sculptures, in cyberspace, retro futurism, 1980s, realistic, depth of field, hacker girl, freckles, green eyes, floating, tight bodysuit, sharp lines, details
>>
File: 00008-577051015.png (833 KB, 1064x1192)
833 KB
833 KB PNG
>>101373729
Cursed kangaroo quokka
>>
File: 00096-1826636758.jpg (477 KB, 2560x1920)
477 KB
477 KB JPG
>>
File: 00102-714804476.jpg (379 KB, 2560x1920)
379 KB
379 KB JPG
also: dissolving into goospace
>chibi slime AND 1girl in immaterial virtual reality, electronics, cables, transparent, translucent, ghost like, floating in light, surrounded by holographic geometric sculptures, in cyberspace, retro futurism, 1980s, realistic, depth of field, hacker girl, freckles, green eyes, floating, tight bodysuit, sharp lines, details, floating displays
>>
File: 00104-283969991.jpg (403 KB, 2560x1920)
403 KB
403 KB JPG
>>
File: dedd_00092_.png (2.27 MB, 1824x1248)
2.27 MB
2.27 MB PNG
>>101374008
>>101374071
not a bad take on cyberspace but I feel like it should be a stronger token on its own

>>101374086
the new era of 3G printers (the G stands for goo)
>>
File: ComfyUI_temp_nmgxf_00006_.png (2.74 MB, 1120x1440)
2.74 MB
2.74 MB PNG
>>
File: ComfyUI_temp_nmgxf_00008_.png (2.8 MB, 1120x1440)
2.8 MB
2.8 MB PNG
>>
File: 00011-790774161.png (1.27 MB, 1064x1192)
1.27 MB
1.27 MB PNG
>>
File: ComfyUI_temp_nmgxf_00009_.png (2.83 MB, 1120x1440)
2.83 MB
2.83 MB PNG
>>
File: ComfyUI_temp_nmgxf_00014_.png (2.74 MB, 1120x1440)
2.74 MB
2.74 MB PNG
>>
File: 00116-690291557.jpg (393 KB, 2560x1440)
393 KB
393 KB JPG
if person I think ill stick with 2D .. realism gives me the creeps
>>
File: ComfyUI_temp_nmgxf_00018_.png (2.57 MB, 1120x1440)
2.57 MB
2.57 MB PNG
>>
File: 00026-1504187708.png (3.57 MB, 1280x1920)
3.57 MB
3.57 MB PNG
>>101374222
Neat
>>
>>101372724
mistoonanime_v30, what is NIA? ive heard that referenced before as a good 1.5 model but couldnt find much on it
>>
File: 00019-3778104000.png (2.57 MB, 1536x1536)
2.57 MB
2.57 MB PNG
>>
File: ComfyUI_temp_nmgxf_00023_.png (2.66 MB, 1120x1440)
2.66 MB
2.66 MB PNG
>>
>>101373780
no, i only tagged the main tag, the tags for the outfits and the backgrounds were tagged extensively to make sure it knew what was what, ive also experimented tagging with emotions and positions but for some wierd reason the one thing that seems to have helped was training it on 512/512 instead of 768/768, should i be tagging it as anime? wouldn't that confuse the training and see it as a separate entity?
>>
File: 00004-396906596.png (3.79 MB, 2048x2048)
3.79 MB
3.79 MB PNG
>>101374258
>NIA
NAI* was a leaked SD1.5 fine tune, which was the first model ever to be able to produce capable anime and hentai.. made by some horndogs in a cellar with a server farm in summer 2022 .. but they had bad security, so someone leaked it and it was the base of nearly all early SD1.5 models that were useful .. later went obsolete, you dont need it anymore .. also many many most anime mixes that came later probably used it as tuning base
>>
File: tmpgdzzcump.png (1.01 MB, 768x1024)
1.01 MB
1.01 MB PNG
I'm starting to suspect that Automatic1111 is doing something fucky with my PC when it comes to sleep mode. Either that or some component is starting to sour in an esoteric way despite it only being a few months

And if I let the PC go to sleep while SD is still "active", usually I can wake up the PC fine, then do a few more gens, then I suddenly get a BSOD pertaining to, IIRC, usually Kmode_Exception_not_handled (though once I got a different error that had something to do with tcpip.sys)

If I do some genning then shut down the terminal or whatever, then the computer will fail to "wake up". It'll just...reboot, basically. OR shut down.

Contrast to today. I haven't touched SD at all. I did, however, play a video game for a few hours. Let the PC go to sleep while I did the dishes. I was still able to wake up the PC fine.

MSI B650 Tomahawk Wifi
Ryzen 7600X
TEAMGROUP T-Force Vulcan DDR5 32GB (2x16GB) 6000MHz (PC5-48000) CL30 RAM
>>
>>101374258
You want to use NAI because it's the model that all 1.5 anime models are built on. If you train a lora on mistoonanime_v30, it won't work well with other models.
>>
>>101374294
i can give it a try then i guess, isnt on civitai though where the hell do i find it?
>>
File: 00036-967744730.png (3.07 MB, 1280x1920)
3.07 MB
3.07 MB PNG
>>
File: 00041-1650231439.png (2.72 MB, 1280x1920)
2.72 MB
2.72 MB PNG
>>
>>101374315
I think this is it. It's a leaked model, the Novel AI anime model, so sites like civitai won't host it I guess
https://huggingface.co/hollowstrawberry/stable-diffusion-guide/tree/main/models
>>
File: 00139-1864610571.jpg (430 KB, 1440x2560)
430 KB
430 KB JPG
>>101374290
>esoteric
ya well.. back in the day id say you got pickled, but these days id just say its unstable software will full hardware access, my PC is rock stable, but with a1111 it can just randomly crash under heavy load, probably something in the conglomerate of software has memory leaks

I dont think its your hardware. I suspected as much first to, but then after a month break of SD and no crashes at all it was obvious, a1111 just is a monster. Makes nice pictures tho.
>>
File: tmpsb9zvg4l.png (919 KB, 896x1152)
919 KB
919 KB PNG
>>101374360
I want to say I haven't had this sort of problem with my last build, but I've definitely had problems with sleep mode/waking up from that there, though I don't remember if I had A1111 open or anything like that

Still, surprisingly there don't seem to be that many issues raised on Github discussing this
>>
File: 00146-1840452517.jpg (375 KB, 1920x1920)
375 KB
375 KB JPG
>>101374383
different OS? different GPU? could be anything .. but ya as I said a1111 makes everything unstable

also have a manul
>>
File: ComfyUI_temp_gqxrg_00002_.png (2.65 MB, 1152x2016)
2.65 MB
2.65 MB PNG
>>101372914
>>101373298
https://files.catbox.moe/mv08vg.png

sorry for mess anon and I forgot to remove the virgin destroyer lora
>>
>>101374414
thanks
>>
File: 00045-4169551791.png (2.81 MB, 1280x1920)
2.81 MB
2.81 MB PNG
>>101374414
>"Comfy"
>it's not comfy
>>
File: 00150-779858523.jpg (232 KB, 1920x1920)
232 KB
232 KB JPG
>>
>>101374423
>"Automatic"
>it's not automatic
>>
File: tmp_0yqahad.png (1.19 MB, 896x1152)
1.19 MB
1.19 MB PNG
>>101374401
The last build was an ASRock Z170 Extreme3 with a 6th gen Intel CPU. Both builds had an RTX 3060 and Windows 10, both of which I've carried over to this one
>>
File: 00151-3214796523.jpg (259 KB, 1920x1920)
259 KB
259 KB JPG
>>101374423
>>101374435
both so very true, in comfy I get lost in noodle workflow, in automatic I have to repeat steps over and over again .. *sigh* but my brain doesnt get creative in noodle soup, so I stick with the repetition
>>
>>101373443
>16 niggabytes
>>
File deleted.
>>101374437
Actually I remember the old motherboard sometimes would "turn off" instead of going to sleep, and when I turn it on again, I would just get a black screen. I usually just do a hard reset, after which the PC goes back to where I left everything.

Weird
>>
File: 00163-2748165995.png (2.83 MB, 1536x1536)
2.83 MB
2.83 MB PNG
>>101374437
nothing much ya can do but observe and hope the software part that causes problems gets fixed, although upgrading a1111 is a gamble on its on.. earlier this evening I had the silly idea to upgrade xformers and torch, cost me 3 hours to finally give up and revert back cause I just couldnt fix the performance loss
>>
>>101374499
I'm on 1.6.1. The "he pulled" meme has made me hesitant to update. But now I'm not so sure
>>
File: 00172-1958641193.png (1.34 MB, 1536x1536)
1.34 MB
1.34 MB PNG
I guess the FBB just gave anon a vacation for his last picture .. hoot hoot
>>
File: depa_00005_.png (2.79 MB, 2016x1152)
2.79 MB
2.79 MB PNG
>>
File: ComfyUI_temp_ynxhi_00004_.png (2.68 MB, 1120x1440)
2.68 MB
2.68 MB PNG
>>
File: 00175-396906597.png (3.83 MB, 2048x2048)
3.83 MB
3.83 MB PNG
>>101374530
just dont do an --reinstall-xformers or --reinstall-torch right now, my performance dropped to rock bottom with the torch cuda12.1 release, the cuda11.8 release runs perfectly
>>
File: depa_00008_.png (2.59 MB, 2016x1152)
2.59 MB
2.59 MB PNG
>>
File: joecampaign.png (3.46 MB, 1496x1496)
3.46 MB
3.46 MB PNG
this thread would be more fun if people posted spicy/cringe/political shit. all im seeing is the same pedo anime slop from the same people
>>
>>101374562
That's why /ldg/ was made. /sdg/ basically evolved into a social club for the quokka and purple wizard to jerk each other off in.
>>
File: ComfyUI_temp_vyauq_00006_.png (1.85 MB, 1120x1440)
1.85 MB
1.85 MB PNG
>>
SD3 beaten by 1 person:

https://huggingface.co/fal/AuraFlow
>>
File: dejb_00009_.png (2.82 MB, 2016x1344)
2.82 MB
2.82 MB PNG
>>101374562
I do that sometimes
>>
File: depa_00013_.png (2.8 MB, 2016x1152)
2.8 MB
2.8 MB PNG
>>
>>101374574
Nobody believes me when I scream about SAI misappropriating their money. They should have been able to produce SD3 at a fraction of the price they made it for yet somehow hemorrhage 8 million dollars a month.
>>
>>101374360
>>101374499
The thing that gets to me is, why do I also have issues after SD is already closed?
>>
>>101374635
Look at all their other projects. StableLM, stableaudio... They are just really shit at this.
>>
File: depa_00014_.png (2.97 MB, 2016x1152)
2.97 MB
2.97 MB PNG
>>
>>101374645
>Look at all their other projects. StableLM, stableaudio
It's weird how I subconsciously trained myself to ignore anything they put out besides stable diffusion because I knew all they ever produced was shit, but it only really clicked with me that their image gen was also shit after SD3.
I'm glad we seem to be experiencing a glut of competent base models coming out right now. I guess we will see what cream rises to the top in a month or so.
>>
File: 00177-396906599.png (3.58 MB, 2048x2048)
3.58 MB
3.58 MB PNG
>>101374641
if its a memory leak, it could have written on place in vram or ram that the OS currently doesnt need, but eventually does and then there is garbage and you get an BSOD, thats why memory leaks are such a pain, and especially VRAM memory leaks are very hard to spot and CUDA is an absolute memory leak monster, its basically C++ running on your GPU, my MSI Nvidia GPU has a tool that "cleans" VRAM .. I tend to use it after closing a1111 and it always says something like ~15GB of VRAM has been freed..check if you can find a tool like that for your GPU
>>
File: AuraFlow_00002_.png (1.01 MB, 768x1024)
1.01 MB
1.01 MB PNG
I'm not sold on AuraFlow yet guys, but the unlimited potential has me intrigued.
>>
File: meh.png (10 KB, 737x107)
10 KB
10 KB PNG
>>101374574
meh
>>
File: depa_00016_.png (2.89 MB, 2016x1152)
2.89 MB
2.89 MB PNG
>>101374699
look at that juke
>>
File: AuraFlow_00006_.png (1.06 MB, 768x1024)
1.06 MB
1.06 MB PNG
>>
File: 00011-3673030877_cleanup.png (2.68 MB, 1280x1920)
2.68 MB
2.68 MB PNG
>>
>>101374716
Using Auraflow I can say its prompt adherence beats anything else out there. Pixart, dalle, sd3...
>>
File: AuraFlow_00009_.png (1.14 MB, 768x1024)
1.14 MB
1.14 MB PNG
>>101374745
Its prompt adherence is pretty fucking amazing, which is why I'm giving it a pass on having kinda shitty anatomy right now.
>>
File: depa_00018_.png (2.87 MB, 2016x1152)
2.87 MB
2.87 MB PNG
>>
>>101374674
Well despite not having touched SD at all today, I got a kmode BSOD a few hours later.

Maybe the problem lies elsewhere
>>
more booba
>>
File: depa_00029_.png (2.86 MB, 2016x1152)
2.86 MB
2.86 MB PNG
>>
File: 00204-396906598.jpg (450 KB, 1728x3072)
450 KB
450 KB JPG
>>
File: deboflow_00001_.jpg (952 KB, 1024x1024)
952 KB
952 KB JPG
auraflow didn't score very well on the debo test
>>
>>101374478
A couple times I left the A1111 open and when I came back the next morning had a bsod. It's only even happened when I left it open over night. I know you said it happened to you other times when you didn't also, but I think leaving it open can cause your the pc to crash.
>>
File: ComfyUI_temp_jbtef_00005_.png (1.93 MB, 1152x2016)
1.93 MB
1.93 MB PNG
>>101375007
it's cute
>>
File: depa_00002_.png (2.49 MB, 1824x1248)
2.49 MB
2.49 MB PNG
>>101375059
I dont want koff to think I'm stealing his style
>>
>>101375007
you have an uncanny ability to make even the best models look like shit. Not so sure the 'debo test' holds water
>>
File: deboflow_00017_.jpg (995 KB, 1024x1024)
995 KB
995 KB JPG
>>101375096
you may have a pre-existing bias
>>
>>101375096
he held on to sd3 for so long. just do the opposite of what he does kek
>>
File: 00212-1319017320.jpg (460 KB, 1728x3072)
460 KB
460 KB JPG
>>
File: 00110-3921679968.png (2.97 MB, 1280x1920)
2.97 MB
2.97 MB PNG
ALRIGHT
spoonfeed me slightly
i havent played with SD in months
i never liked comfy, im not unwilling to try it but..
how do i update SD again? exactly? i remember i have to open the .bat file and type something
also whats the new hot models that work, is sdxl good now, uncensored etc? Like are there good new base models that I can mix together?

THANKS have an oldie
>>
File: deboflow_00022_.jpg (1.09 MB, 1024x1024)
1.09 MB
1.09 MB JPG
it def doesn't get how the eyes are supposed to work

>>101375125
you should just get the node manager for comfy, it makes life a lot easier. you can update by just clicking the update button
>>
>>101375125
Looking over civitai it looks like pony is the new thing
Uses sd2.1 though, i thought that was shit? or was I thinking of something else?

Should I just do a fuckin fresh install
>>
File: ComfyUI_00101_.png (1.23 MB, 1024x1024)
1.23 MB
1.23 MB PNG
Towards the right is a cartoon dragon on top of a cliff, to the left is a anthromorphic fox wearing armor riding a horse. The horse is standing on top of a blue cube. In the background there is a flying eagle holding a sun. The sun has a angry face on it.


Auraflow prompt adherence example.
>>
File: 00214-4177614052.jpg (441 KB, 1728x3072)
441 KB
441 KB JPG
>>101375125
a1111? you type "git pull" in command line in the directory you have it installed, then you pray, also SDXL is excellent these days, but you need 12gb vram or you gonna crawl to your grave before you get a gen
>>
>>101375148
thanks i knew it was something like that
i got the vram for it, what is sd:next and why shouldnt i be using it over a1111
>>
File: detu_00069_.png (2.92 MB, 1824x1248)
2.92 MB
2.92 MB PNG
>>
File: 00220-918584797.jpg (458 KB, 1728x3072)
458 KB
458 KB JPG
>>101375170
SD.next is a fork of a1111 works very similar, was unstable for me, so I stayed with a1111 but it has a more streamlined interface, could possibly replace it eventually if a1111 stagnates.. also if you wanna use SDXL you wanna go to civitai.com and get a model for your tastes, dont use the base model, its still crap
>>
>>101375191
>>101375191
>>101375191
>>
File: 00171-1958641192.png (1.7 MB, 1536x1536)
1.7 MB
1.7 MB PNG
>>101375197
guess ill filling the thread up with hooters for you mr. premature ejaculation
>>
File: 00170-1958641191.png (1.57 MB, 1536x1536)
1.57 MB
1.57 MB PNG
>>101375198
>>
File: 1720762709112.jpg (172 KB, 1024x1024)
172 KB
172 KB JPG
Filling
>>
File: 00173-1958641194.png (2.06 MB, 1536x1536)
2.06 MB
2.06 MB PNG
>>
File: 1690417116744034.jpg (7 KB, 128x112)
7 KB
7 KB JPG
>>
>>101375218
What is this

a breakfast for ants
>>
>>101375251
Welcome newfag



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.