[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


File: PW.webm (740 KB, 1238x752)
740 KB
740 KB WEBM
Previous /sdg/ thread : >>102421511

>Beginner UI local install
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Local install
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
SD.Next: https://github.com/vladmandic/automatic
AMD GPU: https://rentry.org/sdg-link#amd-gpu
Intel GPU: https://rentry.org/sdg-link#intel-gpu

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Run cloud hosted instance
https://rentry.org/sdg-link#run-cloud-hosted-instance

>Try online without registration
flux-dev: https://huggingface.co/spaces/black-forest-labs/FLUX.1-dev
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://aitracker.art
https://openmodeldb.info

>Black Forest Labs: Flux
https://huggingface.co/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>View and submit GPU performance data
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Share image prompt info
4chan removes prompt info from images, share them with the following guide/site...
https://rentry.org/hdgcb
https://catbox.moe

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/c/kdg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/tg/slop
>>>/trash/sdg
>>
File: fSDG_News_000101_.jpg (508 KB, 896x512)
508 KB
508 KB JPG
>mfw Resource news

09/17/2024

>Mamba-ST: State Space Model for Efficient Style Transfer
https://github.com/FilippoBotti/MambaST

>ComfyUI-Fluxtapoz: Nodes for editing images using Flux
https://github.com/logtd/ComfyUI-Fluxtapoz

>2S-ODIS: Two-Stage Omni-Directional Image Synthesis by Geometric Distortion Correction
https://github.com/islab-sophia/2S-ODIS

>Towards Kinetic Manipulation of the Latent Space
https://github.com/PDillis/stylegan3-fun

>DreamMover: Leveraging the Prior of Diffusion Models for Image Interpolation with Large Motion
https://dreamm0ver.github.io/

>Beta-Sigma VAE (BS-VAE)
https://github.com/overnap/BS-VAE

>Only Train Once (OTO): Automatic One-Shot DNN Training And Compression Framework
https://github.com/microsoft/only_train_once

>Microsoft, BlackRock form group to raise $100 billion to invest in AI data centers
https://www.cnbc.com/2024/09/17/microsoft-blackrock-form-gaiip-to-invest-in-ai-data-centers-energy.html

09/16/2024

>CogVideo Image2Video Released
https://github.com/kijai/ComfyUI-CogVideoXWrapper/issues/54
https://cloud.tsinghua.edu.cn/d/5cc62a2d6e7d45c0a2f6/?p=%2F1&mode=list

>HF Dev adds true CFG and negatives to Flux, refuses to elaborate
https://huggingface.co/spaces/multimodalart/flux-cfg

>USTC-TD: A Test Dataset and Benchmark for Image and Video Coding in 2020s
https://esakak.github.io/USTC-TD/

>Forge Space Ollama
https://github.com/Haoming02/forge-space-ollama

>Sam Altman departs OpenAI’s safety committee
https://techcrunch.com/2024/09/16/sam-altman-departs-openais-safety-committee

>CivitAI Introduces Rapid Flux Training
https://education.civitai.com/quickstart-guide-to-flux-1/#rapid-flux-training

>DrawingSpinUp: 3D Animation from Single Character Drawings
https://github.com/LordLiang/DrawingSpinUp

>ComfyUI-DataSet: Data research, preparation, and manipulation nodes
https://github.com/daxcay/ComfyUI-DataSet

>RunwayML Announces Runway API
https://runwayml.com/api
>>
>mfw Research news

09/17/2024

>Do Pre-trained Vision-Language Models Encode Object States?
https://arxiv.org/abs/2409.10488

>SimInversion: A Simple Framework for Inversion-Based Text-to-Image Editing
https://arxiv.org/abs/2409.10476

>MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion
https://lehongwu.github.io/ECCV24MacDiff/

>Taming Diffusion Models for Image Restoration: A Review
https://arxiv.org/abs/2409.10353

>On Synthetic Texture Datasets: Challenges, Creation, and Curation
https://arxiv.org/abs/2409.10297

>Enhancing Image Classification in Small and Unbalanced Datasets through Synthetic Data Augmentation
https://arxiv.org/abs/2409.10286

>RealDiff: Real-world 3D Shape Completion using Self-Supervised Diffusion Models
https://arxiv.org/abs/2409.10180

>Fit and Prune: Fast and Training-free Visual Token Pruning for Multi-modal Large Language Models
https://arxiv.org/abs/2409.10197

>MotionCom: Automatic and Motion-Aware Image Composition with LLM and Video Diffusion Prior
https://arxiv.org/abs/2409.10090

>AttnMod: Attention-Based New Art Styles
https://arxiv.org/abs/2409.10028

>GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion
https://arxiv.org/abs/2409.09896

>Generalizing Alignment Paradigm of T2I Generation with Preferences through f-divergence Minimization
https://arxiv.org/abs/2409.09774

>Finetuning CLIP to Reason about Pairwise Differences
https://arxiv.org/abs/2409.09721

>EditBoard: Towards Comprehensive Evaluation Benchmark for Text-based Video Editing
https://arxiv.org/abs/2409.09668

>TextureDiffusion: Target Prompt Disentangled Editing for Various Texture Transfer
https://arxiv.org/abs/2409.09610

>Bias Begets Bias: Impact of Biased Embeddings on Diffusion Models
https://arxiv.org/abs/2409.09569

>One missing piece in Vision and Language: Survey on Comics Understanding
https://arxiv.org/abs/2409.09502

>PrimeDepth: Efficient Monocular Depth Estimation with a Stable Diffusion Preimage
https://arxiv.org/abs/2409.09144
>>
File: PW_86800_.png (1.26 MB, 1280x768)
1.26 MB
1.26 MB PNG
>>
Don't forget to do your part and report PW for signature/avatar usage!
>>
>>
File: PW_86817_.png (956 KB, 1280x768)
956 KB
956 KB PNG
>>102435663
Great gen, anon :]
The detail is hella good!
>>
File: delux_pr_00018_.png (1.66 MB, 1536x968)
1.66 MB
1.66 MB PNG
>>
File: tmp6dr3pwl7.png (934 KB, 768x1024)
934 KB
934 KB PNG
>>
SUCK
>>102434927
I see, you can make a LoRA of her, currently there's many options to do that easily on 1.5, a lot of people here always mention the Civitai trainer, as far as I know it's pretty easy to use, all you need is a number of high quality image of her and it should work look into it
>>
>>102435739
Thanks
>>
>>
File: delux_pr_00020_.png (1.66 MB, 1536x968)
1.66 MB
1.66 MB PNG
>>
>>
File: tmpz9yhim_r.png (524 KB, 768x768)
524 KB
524 KB PNG
>>
File: PW_86828_.png (1.26 MB, 1280x768)
1.26 MB
1.26 MB PNG
>>
File: checkpoint.png (2 KB, 257x29)
2 KB
2 KB PNG
found an old checkpoint, from 2 years ago, what could it be?
>>
>>102436064
SD 1.5
>>
File: tmpaz49yg10.png (944 KB, 768x1024)
944 KB
944 KB PNG
>>
File: bra.png (2.79 MB, 1536x1536)
2.79 MB
2.79 MB PNG
god forbid vir/g/ins lay their eyes upon nipple
>>
Man this shit is still vaseline town eh?
I thought by now there would be a really good model that could do more than smear random ass art together.
>>
>>102436142
pony is supposed to look sketchy
>>
File: delux_me_00083_.jpg (357 KB, 896x512)
357 KB
357 KB JPG
>>102436142
>A hot woman covered in Vaseline
ermm
>>
File: tmp02gfwrtr.png (1.01 MB, 768x1024)
1.01 MB
1.01 MB PNG
>>
>>102436174
Ok but it doesn't. It looks like someone just ran healing brush tool over like 100 different pieces of art till it started to look like something for 1000 years and then said ok this is good enough.
>>
>>102436264
whatever dude, post gen or gtfo
>>
>>102436285
gen or gtfo
are you happy now
>>
File: flamer_.png (1.42 MB, 1024x1024)
1.42 MB
1.42 MB PNG
>>102436301
why are you wasting your own time
>>
File: tmpa74l597d.png (844 KB, 768x1024)
844 KB
844 KB PNG
>>
>>102436312
I'm not wasting my time. I'm wasting your time while my gen finishes.
>>
File: bidenfortrump.png (1.91 MB, 1024x1024)
1.91 MB
1.91 MB PNG
>>102436338
it better be fucking good anon
>>
File: tmp1ytypiqr.png (1005 KB, 768x1024)
1005 KB
1005 KB PNG
>>
>>102436358
I can't tell if this is bush or biden
>>
File: tmpou2e06xv.png (1.03 MB, 768x1024)
1.03 MB
1.03 MB PNG
>>
File: delux_pr_00021_.png (1.39 MB, 1536x968)
1.39 MB
1.39 MB PNG
>>102436312
people like him think arguing with people online over things he doesn't even care about is a valuable use of time.
>>
>>
File: tmpgac71jzc.png (890 KB, 768x1024)
890 KB
890 KB PNG
>>
File: 00004-2980457558.png (1.05 MB, 896x1088)
1.05 MB
1.05 MB PNG
>>
>>
File: 00006-3016158533.png (1.08 MB, 1088x896)
1.08 MB
1.08 MB PNG
>>
>>102436601
Wow, something like this in portrait please? With her in a standing pose.
>>
>>102436358
why are trump supporters always the worst prompters?
>>
>>102436645
Am a Trump supporter and also am one of the finest in this general
>>
File: swiftelontrump.png (3.51 MB, 1568x1568)
3.51 MB
3.51 MB PNG
>>102436645
whats wrong with joe biden peacefully enjoying his retirement?
>>
File: pimp.png (1.91 MB, 1440x992)
1.91 MB
1.91 MB PNG
>>102436645
i have so much for you anon. run away
>>
File: tmppf7pqbe3.png (955 KB, 768x1024)
955 KB
955 KB PNG
>>
>>102436633
1
>>
>>102436698
Lovely, exquisite!
>>
>>102436633
2
>>
>>102436633
3
>>
File: banding.png (695 KB, 1045x813)
695 KB
695 KB PNG
Having banding issues. Anyone know how to fix?
>>
>>
>>
>>102436747
Are you using FreeU / Kohya HiRes Fix / PAG?
>>
>>102436899
I'm using Forge.
>>
>schizo thread
>>
>>102436940
>I'm using Forge.
Which has addons for Kohya/FreeU/etc builtin. Scroll down and toggle them off, one at a time, and gen an image each time - you'll figure out where the issue is. Same deal for stuff like, hires fix or samplers, toggle/switch things one by one until the issue resolves - and you'll have identified the issue.
Debugging 101
>>
>>
File: 00181-1318949849.png (2.12 MB, 1080x1440)
2.12 MB
2.12 MB PNG
>>
>>102436979
I wasn't using FreeU/Kohya HiRes Fix/PAG.
>>
>>102437023
I'm trying to inpaint away the banding and it's not working. Every inpaint is also banding. Not the whole image - the anime girl part is fine. It's just the dark wooden wall it's having trouble with.
>>
>>102436747
Man that is a very sexy wall
keep up the good work.
>>
>>
>>102437034
Well that sucks
>>
>>
>>
File: delux_pr_00022_.png (1.96 MB, 1536x968)
1.96 MB
1.96 MB PNG
>>
File: 00430-2965200226.png (2.15 MB, 1440x960)
2.15 MB
2.15 MB PNG
>>
>>
Hi anon, I haven't touched SD for a year. Is the flux the new meta? How am I supposed to use someone else's SD 1.5 loras on that?
>>
>>
File: delux_pr_00023_.png (1.89 MB, 1536x968)
1.89 MB
1.89 MB PNG
>>102437252
>Is the flux the new meta?
yes but prob not forever. it has some dramatic shortcomings that will impact its longevity
>How am I supposed to use someone else's SD 1.5 loras on that?
you can't
>>
>>102437426
So most people are still on sd 1.5?
What do people use flux for?
>>
>>
File: delux_pr_00024_.png (1.78 MB, 1536x968)
1.78 MB
1.78 MB PNG
>>102437447
>So most people are still on sd 1.5?
no, but some are. 1.5 is pretty dated in overall performance but it has some niches it still fills better than others. most people moved into sdxl and there's lots of loras for it too
>What do people use flux for?
flux has very good prompt adherence; much closer to dalle3 than any of the sd models have achieved. that max flux excel at complex scene, text, fine details, etc. it also has higher overall quality for a base model
>>
>>102437447
>So most people are still on sd 1.5?
No, only Illuminati anon uses 1.5. Pony is the most popular SD model, it's based on XL.
>>
>>102437522
What is the good model for realistic photos that focuses on variety? Last time I was still on SD 1.5 I remember deliberate models were the best for this.
>>
File: PW_86812_.png (1.37 MB, 1280x768)
1.37 MB
1.37 MB PNG
>>
File: 00180-1318949849.png (1.96 MB, 1600x896)
1.96 MB
1.96 MB PNG
>>
File: 00009-3994750808.png (733 KB, 896x1152)
733 KB
733 KB PNG
>>102436747
I managed to fix this, believe it or not, by switching to the DPM2 sampling method while inpainting. For some reason this particular sampling method handles the dark wooden wall well.
>>
File: cnxl.jpg (676 KB, 816x1686)
676 KB
676 KB JPG
Why is there so many fucking models?
Which one should I use?
>>
File: 1697267318268274.png (1.37 MB, 1024x1024)
1.37 MB
1.37 MB PNG
I've been trying for an hour to get any model (SD1.5, SDXL, Flux) to generate an image of a person throwing a steaming hot pot of coffee and it seems to be an impossible task. Skeptics were right, AI sucks.
>>
>>102437849
Skill issue
>>
File: 00185-1318949849.png (2.16 MB, 1440x1080)
2.16 MB
2.16 MB PNG
>>
File: 1714822370897060.png (1.65 MB, 1280x1280)
1.65 MB
1.65 MB PNG
>>102437862
Nope
>>
>>102437897
what do you are prompt be like?
>>
Is there something for comfy yet that displays lora tags?
>>
>>102437897
is the skill issue reading
>generate an image of a person throwing a steaming hot pot of coffee
>>
File: 00430-1318949849.png (2.32 MB, 1440x1080)
2.32 MB
2.32 MB PNG
>>
File: 00431-1318949849.png (2.12 MB, 1440x1080)
2.12 MB
2.12 MB PNG
>>
i miss schizo anon
>>
>>102438175
I miss her too
>>
>>102438185
hope she's alright
im worried
>>
File: 00074-2965200228.png (2.1 MB, 1440x1080)
2.1 MB
2.1 MB PNG
>>
File: 00525-1053045054.png (1.42 MB, 1400x896)
1.42 MB
1.42 MB PNG
>>
File: river night.webm (1.4 MB, 1920x960)
1.4 MB
1.4 MB WEBM
>>
>>102439646
gem
>>
>>102437746
Follow your heart
>>
File: file.jpg (321 KB, 1792x1024)
321 KB
321 KB JPG
>>
File: river night 2.webm (1.65 MB, 1920x960)
1.65 MB
1.65 MB WEBM
>>102439650
Thans anon
>>
File: centaurarmor.png (3.38 MB, 1552x1552)
3.38 MB
3.38 MB PNG
>>
File: 0.jpg (171 KB, 1024x1024)
171 KB
171 KB JPG
Do you like my hat?
>>
File: waterfalls night.webm (1.44 MB, 1920x960)
1.44 MB
1.44 MB WEBM
*k
>>102438862
Cool atmosphere.
>>102439743
Nice Kirby, very happy gen.
>>102440025
Great job with the details. reminds me of AOM but yours is clean.
>>
File: tigervswoman_.png (1.91 MB, 1018x1018)
1.91 MB
1.91 MB PNG
>>
What's the result if you add "while high on MDMA" to your usual prompt?
>>
File: waterfalls night 2.webm (1.7 MB, 1920x960)
1.7 MB
1.7 MB WEBM
>>102440280
Not sure if the other dog is convinced, but I like it.
>>102430286
>>102430315
>>102430361
These are very creative.
>>
File: 00004-3202776543.jpg (1.2 MB, 1536x2304)
1.2 MB
1.2 MB JPG
>>
File: waterfalls night 3.webm (1.73 MB, 1920x960)
1.73 MB
1.73 MB WEBM
>>
File: 1.jpg (140 KB, 896x1152)
140 KB
140 KB JPG



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.