[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


3-Year duration 4chan Passes are now available for $45

[Advertise on 4chan]


File: 1709109465070877.png (1.14 MB, 1280x1536)
1.14 MB
1.14 MB PNG
Previous /sdg/ thread : >>102140480

>Beginner UI local install
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Local install
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
SD.Next: https://github.com/vladmandic/automatic
AMD GPU: https://rentry.org/sdg-link#amd-gpu
Intel GPU: https://rentry.org/sdg-link#intel-gpu

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Run cloud hosted instance
https://rentry.org/sdg-link#run-cloud-hosted-instance

>Try online without registration
flux-dev: https://huggingface.co/spaces/black-forest-labs/FLUX.1-dev
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://aitracker.art
https://openmodeldb.info

>Black Forest Labs: Flux
https://huggingface.co/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>View and submit GPU performance data
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Share image prompt info
4chan removes prompt info from images, share them with the following guide/site...
https://rentry.org/hdgcb
https://catbox.moe

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/c/kdg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/trash/sdg
>>
First for debo is the thread schizo
>>
For thread quality
https://rentry.org/debo
>>
File: fSDG_News_00027_.jpg (351 KB, 896x512)
351 KB
351 KB JPG
>mfw Resource news

08/30/2024

>MistoControlNet-Flux-dev: ControlNet collections for Flux1-dev by TheMisto.ai
https://github.com/TheMistoAI/MistoControlNet-Flux-dev

>RunwayML Deletes SD15
https://huggingface.co/runwayml/stable-diffusion-v1-5

>bigdata-pw/Spotify: Dataset of ~25M tracks from Spotify
https://huggingface.co/datasets/bigdata-pw/Spotify

>SAM2Point: Segment Any 3D as Videos
https://github.com/ZiyuGuo99/SAM2Point

>CogVLM2: Visual Language Models for Image and Video Understanding
https://github.com/THUDM/CogVLM2
https://github.com/THUDM/GLM-4
https://arxiv.org/abs/2408.16500

>Adapting Vision-Language Models to Open Classes via Test-Time Prompt Tuning
https://github.com/gaozhengqing/TTPT

>Spiking Diffusion Models (IEEE Transactions on Artificial Intelligence)
https://github.com/AndyCao1125/SDM

>Enhanced Control for Diffusion Bridge in Image Restoration
https://github.com/Hammour-steak/ECDB

>SAU: A Dual-Branch Network to Enhance Long-Tailed Recognition via Generative Models
https://github.com/lgX1123/gm4lt

>Pony Diffusion Supported Characters List
https://ponychar.com/

>Non-profit GPU cluster that runs open-source paper demos
https://github.com/camenduru/non-profit-gpu-cluster

08/29/2024

>NVIDIA-Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
https://github.com/NVlabs/Eagle

>OpenAI and Anthropic will share their models with the US government
https://www.theverge.com/2024/8/29/24231395/openai-anthropic-share-models-us-ai-safety-institute

>Distribution Backtracking Distillation for One-step Diffusion Models
https://github.com/SYZhang0805/DisBack

>MAD: Semantically Coherent Montages by Merging and Splitting Diffusion Paths
https://github.com/aimagelab/MAD

>MMDRFuse: Distilled Mini-Model with Dynamic Refresh for Multi-Modality Image Fusion
https://github.com/yanglinDeng/MMDRFuse

>ComfyUI-Rpg-Architect: Custom Node for ComfyUI to create RPG Characters
https://github.com/talon468/ComfyUI-Rpg-Architect
>>
Cursed thread of hate
>>
>mfw Research news

08/30/2024

>ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model
https://liuff19.github.io/ReconX

>CSGO : Content-Style Composition in Text-to-Image Generation
https://csgo-gen.github.io/

>One-Shot Learning Meets Depth Diffusion in Multi-Object Videos
https://arxiv.org/abs/2408.16704

>GradBias: Unveiling Word Influence on Bias in Text-to-Image Generative Models
https://arxiv.org/abs/2408.16700

>GRPose: Learning Graph Relations for Human Image Generation with Pose Priors
https://arxiv.org/abs/2408.16540

>Alignment is All You Need: A Training-free Augmentation Strategy for Pose-guided Video Generation
https://arxiv.org/abs/2408.16506

>What to Preserve and What to Transfer: Faithful, Identity-Preserving Diffusion-based Hairstyle Transfer
https://arxiv.org/abs/2408.16450

>Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks
https://arxiv.org/abs/2408.16445

>COIN: Control-Inpainting Diffusion Prior for Human and Camera Motion Estimation
https://arxiv.org/abs/2408.16426

>Law of Vision Representation in MLLMs
https://arxiv.org/abs/2408.16357

>Learned Image Transmission with Hierarchical Variational Autoencoder
https://arxiv.org/abs/2408.16340

>Rethinking Sparse Lexical Representations for Image Retrieval in the Age of Rising Multi-Modal Large Language Models
https://arxiv.org/abs/2408.16296

>CNN Compression Based on Low-Rank Decomposition
https://arxiv.org/abs/2408.16289

>Improving Diffusion-based Data Augmentation with Inversion Spherical Interpolation
https://arxiv.org/abs/2408.16266

>Enhancing Conditional Image Generation with Explainable Latent Space Manipulation
https://arxiv.org/abs/2408.16232

>Does Data-Efficient Generalization Exacerbate Bias in Foundation Models?
https://arxiv.org/abs/2408.16154

>Many-Worlds Inverse Rendering
https://arxiv.org/abs/2408.16005

>Revisiting 360 Depth Estimation with PanoGabor: A New Fusion Perspective
https://arxiv.org/abs/2408.16227
>>
>>102157895
i love you
>>
>>102157922
:*
>>
File: 1724415104764437.png (1.66 MB, 1280x768)
1.66 MB
1.66 MB PNG
who is debo and why does his specter casts a dark shadow over stable diffusion threads on /g/?
>>
File: maxresdefault.jpg (79 KB, 1280x720)
79 KB
79 KB JPG
>>102158000
dont worry he is contained here forever
you can thank me
and checked
>>
File: ComfyUI_00240_.png (1.57 MB, 1280x768)
1.57 MB
1.57 MB PNG
>>102158065
that seems dangerous. this guy seems like a dangerous fellow. are you sure you know what you're doing?
>>
File: R.gif (813 KB, 500x348)
813 KB
813 KB GIF
>>102158080
leave it to me
>t. schizo anon
>>
Quokka If he high
>>
>>102158155
yeah its fine
>>
File: delux_sf_00099_.png (2.19 MB, 1536x968)
2.19 MB
2.19 MB PNG
>>102158065
roshi doesn't suceed in containing king piccolo. also, isn't king piccolo a god? this metaphor works a lot better than you intended it to

>>102158142
>bruh i bet i could make a bong out of that lighthouse

>>102158155
its risky. depends entirely on the janny's mood
>>
>>102158155
I never snitch, cause I ain't no bitch.
>>
What is the difference between this and /ldg/? Last time I dabbled in this stuff there was just /sdg/.
>>
>
>>
>>102158216
Not sure desu but /ldg/ has the better gens for some reason
>>
File: delux_sf_00100_.png (2.23 MB, 1536x968)
2.23 MB
2.23 MB PNG
>>102158216
there is no difference
>>
File: that's me btw.png (1.5 MB, 1280x960)
1.5 MB
1.5 MB PNG
>>
File: debo.jpg (14 KB, 478x361)
14 KB
14 KB JPG
>>102158181
>>
>>102158216
/sdg/ is an unofficial general designated for containing a particular individual so he doesn't wander off and shit up /ldg/
>>
File: 000000_17046_.png (2.02 MB, 1032x1428)
2.02 MB
2.02 MB PNG
>>
Did debo win? I've been out for a few weeks.
>>
File: file.png (12 KB, 1390x75)
12 KB
12 KB PNG
>Repository size: The total size of the data you’re planning to upload. We generally support repositories up to 300GB. If you would like to upload more than 300 GBs (or even TBs) of data, you will need to ask us to grant more storage. Please provide details of your project. You can contact us at datasets@huggingface.co or on our Discord.
>know it's not a hard limit because some of my datasets are already over 300gb
>decide to be considerate and contact them before uploading over 20tb
>get a reply, guy ive interacted with before, he's a prick, just reiterates the apparent limit of 300gb
yeah ok. not my problem. they can delete it if they want idc
>>
>>102158600
He succeeded in killing off this general
>>
>>102158680
god bless debo
>>
i miss schizo anon
>>
>>102158944
I'm here
>>
>>102158960
did the interview happen?
>>
File: not_de_00044_.png (2.1 MB, 1536x1536)
2.1 MB
2.1 MB PNG
>>102158600
>>102158680
>>102158803
https://suno.com/song/71b244e0-6a38-467f-9471-f49192cb3701
>>
>>102159104
thank you for your valuable opinion ****** ******
>>
File: ComfyUI_13834_.jpg (368 KB, 2304x1296)
368 KB
368 KB JPG
>>
File: file.png (584 KB, 3792x1875)
584 KB
584 KB PNG
scaled flickr stage 2 to 8 instances per node, ~7m per hour now
>>
Since this is the stable diffusion only general im gonna ask here
Is SD3 finally usable?
>>
File: castle.webm (3.91 MB, 768x922)
3.91 MB
3.91 MB WEBM
>>102157704
ty for taking my pic kek
here is a small clip that fit in 4mb btw
>>
File: delux_sf_00102_.png (2.44 MB, 1536x968)
2.44 MB
2.44 MB PNG
>>102159334
>this is the stable diffusion only general
it isnt
>Is SD3 finally usable
it isnt
>>
>schizo general
>>
>>102159409
see >>102159186
>>
>>102159398
I saw it
>>
File: blood_bath.webm (3.98 MB, 768x1122)
3.98 MB
3.98 MB WEBM
>>102159439
nice anon q_q
here is a new one then lol
>>
>>102159398
>>102159456
how are you doing these? how long does a full animation take?
>>
File: templar3_w.webm (1.97 MB, 640x438)
1.97 MB
1.97 MB WEBM
>>102159471
i use flux to get pics i really like, i fix them with pony,, then i animate with either Luma or Kling

it can go as far as you wish if you use the last frame for the next clip, it extend 5s each time
>>
>>102159518
>this is gonna be so cool, look at all these people
>wow, everyone's setting off, better get moving
>oh shit, I'm in the wrong group!
in all seriousness though, these are very cool
>>
>>102159518
>its *ai step*
>then *ai step*
>after that another *ai step*
can it do booba?
>>
>>102159641
Are you blind?
>>
>>102159701
no im schizophrenic
please give me (you)s
>>
>debo's flux gens are somehow worse than gens from 2 years ago
That takes some effort
>>
>>102159720
based as fuck
thanks
>>
Is there a way to train a lora for image-text-models? Let's say you trained a lora to generate a new concept, but then you also want the clip interrogator to recognize your new concept when the image is fed to it. Using the existing models, it would only give you generic descriptions but not recognize your specific concept.
>>
>no one cares about the "news" in any way
based
>>
>>102159736
>>102159789
>>
>>102159789
ain't nobody got time to skim all that every thread

>>102159804
no debo, not the same anon
>>
>>
>koff and fran moved to /ldg/ to get away from debo
the absolute state of affairs
>>
hey debo how do you feel now that you know i won?
>>
i like /sdg/ threads to be slow so debo doesnt post his "news" too often
its a great compromise
>>
>hlky is also fed up of debo
lmao
>>
>>102160073
i'm fed up of you. shut the fuck up jesus wept
>>
> >102160085
>nigbo
>>
>>102160059
>hlky moves to /ldg/ to get away from debo too
the absolute state of affairs
>>
>>102160121
>hlky moves to /ldg/
guess ani was right about him
>>
no surprise all good genners left kek
>>
>sxhizophrenical
>>
I'm generating some really good images with novelai's proprietary stable diffusion model. However, they all look pretty same-y. Can I use img2img on my own machine with other models to change the art style a bit, or is it more complicated than that?
>>
>>102160650
You should ask in the non schizo thread
>>>/g/ldg
>>
>>102160685
Thanks!
>>
File: causality.png (1.01 MB, 651x1200)
1.01 MB
1.01 MB PNG
>>
>>102160888
yep, that's the style
>>
File: Chen.jpg (347 KB, 1536x1536)
347 KB
347 KB JPG
>>
>>102161172
punchable
>>
File: 1725054492296.jpg (389 KB, 1536x1536)
389 KB
389 KB JPG
>mf hfz
Holy shit. He won't stop doing meth. I shoved a tire iron in his face and told his bitch ass to do it in front of me instead of being a fucking coward.
>4JBWP0K
LMAO
>Y8HT4D
LMAO
>>
File: 1724781014045.jpg (651 KB, 1536x1536)
651 KB
651 KB JPG
>>102159104
You went so ham. This is boss dude.
>>
File: delux_sf_00105_.png (2.27 MB, 1536x968)
2.27 MB
2.27 MB PNG
>>102161546
the gen and the big data gang gens are some of my favorites
>>
if purple schizo is allowed to break the rules so am i
>>
>>102161861
you certainly love to necrobump
>>
>>102161861
>its ok to break the rules
then shut up about the rules
>>
File: 1725054322123.jpg (362 KB, 1536x1536)
362 KB
362 KB JPG
I>>102161618
>the gen and the big data gang gens are some of my favorites
Did you straight up gen the full lyrics? I've been experimenting with teaching Suno my acoustic guitar. It's pretty painful. Including lyrics gives me bad experiences in public.
>SJ4HYK
I miss hlky.
>Phone breaking so bad I can't post in thermodynamic conditions *sigh*
>>
File: nvitop.png (64 KB, 1884x315)
64 KB
64 KB PNG
pip install nvitop
nvitop

You're welcome.
>>
File: forest town.webm (1.77 MB, 1920x960)
1.77 MB
1.77 MB WEBM
>>
File: 00017-2441846351.png (1.31 MB, 896x1088)
1.31 MB
1.31 MB PNG
>>
File: 00018-2869200114.png (1.23 MB, 896x1088)
1.23 MB
1.23 MB PNG
>>
File: IMG_5041.png (3.72 MB, 1536x2304)
3.72 MB
3.72 MB PNG
>>102163333
Checked
>>
File: IMG_5038.jpg (309 KB, 853x1280)
309 KB
309 KB JPG
>>
File: IMG_5008.jpg (254 KB, 853x1280)
254 KB
254 KB JPG
Xjvsa
>>
File: IMG_4993.jpg (216 KB, 853x1280)
216 KB
216 KB JPG
>>
File: IMG_4989.png (3.96 MB, 1536x2304)
3.96 MB
3.96 MB PNG
>>
File: IMG_4977.jpg (213 KB, 853x1280)
213 KB
213 KB JPG
>>
File: IMG_4970.png (3.23 MB, 1536x2304)
3.23 MB
3.23 MB PNG
>>
File: IMG_4948.jpg (204 KB, 853x1280)
204 KB
204 KB JPG
>>
File: IMG_4578.png (3.49 MB, 1536x2304)
3.49 MB
3.49 MB PNG
>>
how are iphones able to make generative AI without the cloud or nvidia chips?
>>
>>102163613
Websites like Tensor art
>>
>>102163372
Nice woman of elven descent
>>
>>102163518
Catbox or model maybe?
I like the style
>>
File: 00019-1533612836.png (1.15 MB, 896x1088)
1.15 MB
1.15 MB PNG
>>
File: IMG_4571.png (3.36 MB, 1536x2304)
3.36 MB
3.36 MB PNG
>>102164201
https://tensor.art/images/766435548450648901?post_id=766435548446454600
>>
File: 00021-520396057.png (1.06 MB, 896x1088)
1.06 MB
1.06 MB PNG
>>
>>
>>102164936
That's a traffic cone.
What are you using to make these?
>>
>>102164965
flux then kling

yeah that's in the prompt lol, because they have no buttplugs in the dataset :( (YET >:D)
>>
>>102164982
>YET
I don't think it will ever have.
>>
>>102165061
fr, has anyone figured out how to make flux not do butt chin?
>>
>>102165061
there are a shit tons of loras pouring each days, i did that nsfw anime one myself yesterday
https://files.catbox.moe/jdfar0.png

q_q kek
>>
>>102165107
provide an example wtf
>>
File: 000000_17076_.png (2.04 MB, 952x1587)
2.04 MB
2.04 MB PNG
>>
File: test_s_00384_.png (2.4 MB, 1024x1600)
2.4 MB
2.4 MB PNG
>>102165161
flux people have cleft chins very often
>>
>>102165134
Are the nipples still undefeated?
>>
File: 00023-1629574226.png (1.4 MB, 896x1088)
1.4 MB
1.4 MB PNG
>>102165221
undefeated?
>>
>>102165221
They're pretty defeated. Along with the "Flux can't depict women being railed by monsters, demons, and aliens" problem that plagued us for so long:
https://civitai.com/models/697026/unrealistic-nsfw-concepts-for-flux-photorealistic-dataset
>>
>>102164555
>Paid model
Huh, thanks I guess
>>
>>102165340
I saw a booba one today that looked pretty okay. I haven't tried it yet. Also saw a girls getting railed by monsters lora yesterday, but I'd have to dig through "new" a while to find that one
https://civitai.com/models/704013/photorealistic-nsfw
>>
>>102165365
>I saw a booba one today
I've never seen booba
>>
>>102165340
I realized if I just searched flux + monster it'd pop up
https://civitai.com/models/697026/unrealistic-nsfw-concepts-for-flux-photorealistic-dataset
>>
File: 00025-2937626534.png (1.21 MB, 896x1088)
1.21 MB
1.21 MB PNG
>>
File: BMP_00069_.png (2.82 MB, 1328x1328)
2.82 MB
2.82 MB PNG
>>
File: aayyy.png (30 KB, 742x151)
30 KB
30 KB PNG
mixed feelings, glad the day of reckoning has come for Mr Türkiye however
>>
File: 00000-1577192088.png (1.27 MB, 896x1088)
1.27 MB
1.27 MB PNG
>>102165688
training XL?
>>
File: 00443-2864839643.png (1.68 MB, 1280x960)
1.68 MB
1.68 MB PNG
>>
>>102165822
I'll be honest with you, I think some sort of pavlovian conditioning has occurred. Anytime I see one of their videos or post I'm certain it will either be a link to one of their "tutorials" or a link to their patreon.
At this point it's like a reflex.
>>
File: 00004-1427964154.png (1.36 MB, 896x1088)
1.36 MB
1.36 MB PNG
>>
File: 00442-2864839643.png (1.8 MB, 1280x960)
1.8 MB
1.8 MB PNG
>>102166011
hey cutie
>>
File: 00022-308747946.png (1.01 MB, 896x1088)
1.01 MB
1.01 MB PNG
>>102166049
2spoopy
>>
Newfag here, just installed Easy Diffusion. What's a good model/lora for realistic clouds?
>>
Stable Diffusion 1.5 finally shoad from hugging face
Which model is next?
>>
>>102166240
idk realisticvision or cyberrealistic
>>
>>102166253
What happened?
>>
File: delux_oc_00023_.png (1.49 MB, 1536x968)
1.49 MB
1.49 MB PNG
>>102166240
you making a skybox?

>>102166317
runway officially took it down
>>
>>102166320
>you making a skybox?
Yeah
>>
>>102166317
Reading the news, it was taken down because of the LAION dataset. While understandable it's a shame it met it's end rather unceremoniously, considering how influential it was.
>>
File: 00007-3817021165.png (1.39 MB, 896x1088)
1.39 MB
1.39 MB PNG
>>102166384
>because of the LAION dataset
What happened with it?
>>
>>102163485
This is one of the best and immersive gens I've seen in a while. Impressive.
>>
>>102166426
It had cuck elements which were discovered with some statistical hash identification algorithm.
>>
File: ComfyUI_02696_.png (897 KB, 768x1024)
897 KB
897 KB PNG
>>102165688
Oh shit, are people finally doing something about the turkey?
Roast turkey and it's not even thanksgiving
>>
File: 00009-3965182793.png (1.35 MB, 896x1088)
1.35 MB
1.35 MB PNG
>>
>>102166563
I was seeing if they made a statement and found a couple articles about it
>Unfortunately, it’s still available on Civitai, as are hundreds of derivative models. When we contacted Civitai, a spokesperson told us that they have no knowledge of what training data Stable Diffusion 1.5 used, and that they would only take it down if there was evidence of misuse.
>>
>>102166817
That's written by the journalist.
>Runway did not respond to a request for comment.
>suggested that we contact Runway—which we did, again, but we have not yet received a response.
Runway's new closed source mission and subsequent removal of SD1.5 has nothing to do with LAION dataset.
https://spectrum.ieee.org/stable-diffusion
>>
File: 000000_17086_.png (2.1 MB, 952x1587)
2.1 MB
2.1 MB PNG
>>
File: sorry.png (1.66 MB, 856x1440)
1.66 MB
1.66 MB PNG
my sincerest apologies
>>
File: 000000_17083_.png (2.03 MB, 952x1587)
2.03 MB
2.03 MB PNG
>I promised you I wouldn't flood this Realm again.
>>
File: so sorry.png (1.71 MB, 864x1440)
1.71 MB
1.71 MB PNG
>>
>>102160650
Just use artist tags like this: {{artist:ratatatat74}}
>>
>>102165618
>>102165822
>>102166200
>>102166426
>>102166807
These are pretty awesome, they remind me of WLOP's art. Care to share the model and lora (if any)? I've been trying to reach similar results for some time now.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.