/g/ - Technology


Thread archived.
You cannot reply anymore.




walking on grassy mountains Edition

Previously on /sdg/: >>101053907

>Beginner UI local install
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io

>Local install
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI (Node-based): https://rentry.org/comfyui
AMD GPU: https://rentry.org/sdg-link#amd-gpu
Intel GPU: https://rentry.org/sdg-link#intel-gpu

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Auto1111 forks
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
Anapnoe UX: https://github.com/anapnoe/stable-diffusion-webui-ux
Vladmandic: https://github.com/vladmandic/automatic

>Run cloud hosted instance
https://rentry.org/sdg-link#run-cloud-hosted-instance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
Inpainting: https://huggingface.co/spaces/fffiloni/stable-diffusion-inpainting
pixart: https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma

>Models, LoRAs & embeddings
https://civitai.com
https://huggingface.co
https://rentry.org/embeddings

>Animation
https://rentry.org/AnimAnon
https://rentry.org/AnimAnon-AnimDiff
https://rentry.org/AnimAnon-Deforum

>SDXL info & download
https://rentry.org/sdg-link#sdxl

>Index of guides and other tools
https://codeberg.org/tekakutli/neuralnomicon
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>View and submit GPU performance data
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html

>Share image prompt info
4chan removes prompt info from images, share them with the following guide/site...
https://rentry.org/hdgcb
https://catbox.moe

>Discord
r2yJJDkm

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg
>>
>mfw Resource news

06/19/2024

>TroL: Traversal of Layers for Large Language and Vision Models
https://github.com/ByungKwanLee/TroL

>diff-sampler: open-source toolbox for fast sampling of diffusion models
https://github.com/zju-pi/diff-sampler

>Diffusers Adds SD3 ControlNet and Multi-ControlNet Support
https://github.com/huggingface/diffusers/pull/8566

>Better & Faster Large Language Models via Multi-token Prediction
https://huggingface.co/facebook/multi-token-prediction

06/18/2024

>Release of training script of PCM-LoRA with Stable Diffusion 3.
https://github.com/G-U-N/Phased-Consistency-Model#news

>Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation
https://peizesun.github.io/llamagen/

>Flash Diffusion: FlashSD3
https://huggingface.co/jasperai/flash-sd3

>SD3-Controlnet-Tile
https://huggingface.co/InstantX/SD3-Controlnet-Tile

>Sharing new research, models, and datasets from Meta FAIR
https://ai.meta.com/blog/meta-fair-research-new-releases/

>The next chapter for ComfyUI
https://blog.comfyui.ca/comfyui/update/2024/06/18/Next-Chapter.html

>DeepFuze: Deep Learning Tool Integrating Lipsync, Face Swap, VidGen and more
https://github.com/SamKhoze/ComfyUI-DeepFuze

>Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Multilingual Visual Text Rendering
https://huggingface.co/GlyphByT5/Glyph-SDXL-v2

>MegaScenes: Scene-Level View Synthesis at Scale
https://megascenes.github.io/

>microsoft/Florence-2-large - New 0.23B | 0.77B Model for image captioning
https://huggingface.co/microsoft/Florence-2-large

06/17/2024

>CivitAI: Temporary Stable Diffusion 3 Ban
https://civitai.com/articles/5732

>VEGA: Learning Interleaved Image-Text Comprehension in Vision-Language Large Models
https://github.com/zhourax/VEGA

>ComfyUI-LuminaWrapper
https://github.com/kijai/ComfyUI-LuminaWrapper

>Generating audio for video
https://deepmind.google/discover/blog/generating-audio-for-video/
>>
>mfw Research news

06/19/2024

>Synergizing Foundation Models and Federated Learning: A Survey
https://arxiv.org/abs/2406.12844

>LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging
https://arxiv.org/abs/2406.12837

>VIA: A Spatiotemporal Video Adaptation Framework for Global and Local Video Editing
https://arxiv.org/abs/2406.12831

>Neural Approximate Mirror Maps for Constrained Diffusion Models
https://arxiv.org/abs/2406.12816

>AITTI: Learning Adaptive Inclusive Token for Text-to-Image Generation
https://itsmag11.github.io/AITTI/

>Extracting Training Data from Unconditional Diffusion Models
https://arxiv.org/abs/2406.12752

>AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention
https://arxiv.org/abs/2406.12718

>SUPER: Selfie Undistortion and Head Pose Editing with Identity Preservation
https://arxiv.org/abs/2406.12700

>Do More Details Always Introduce More Hallucinations in LVLM-based Captioning?
https://arxiv.org/abs/2406.12663

>Mixing Natural and Synthetic Images for Robust Self-Supervised Representations
https://arxiv.org/abs/2406.12368

>Immiscible Diffusion: Accelerating Diffusion Training with Noise Assignment
https://arxiv.org/abs/2406.12303

>COT Flow: Learning Optimal-Transport Image Sampling and Editing by Contrastive Pairs
https://arxiv.org/abs/2406.12140

>ARTIST: Improving the Generation of Text-rich Images by Disentanglement
https://arxiv.org/abs/2406.12044

>Not All Prompts Are Made Equal: Prompt-based Pruning of Text-to-Image Diffusion Models
https://arxiv.org/abs/2406.12042

>Decomposed evaluations of geographic disparities in text-to-image models
https://arxiv.org/abs/2406.11988

>MAC: Benchmark for Multiple Attributes Compositional Zero-Shot Learning
https://arxiv.org/abs/2406.12757

>Disturbing Image Detection Using LMM-Elicited Emotion Embeddings
https://arxiv.org/abs/2406.12668

>What makes two models think alike?
https://arxiv.org/abs/2406.12620
>>
Everyone posting after me is a faggot.
>>
>>101059885
thx baker
>>101059657
>(Cohesium:1.5)
i guess it works
>>101059612
>>
File: 000000_13644_.png (2.74 MB, 1190x1190)
>>101059717
All top notch!
>>
File: 1708097238678709.png (3.49 MB, 1344x1984)
>>
File: 00102-TFT_124819.png (3.68 MB, 1536x2560)
>>
File: 00084-TFT_124817.png (3.53 MB, 1536x2560)
>>101059926
>>
SHANAANAKEE

>>101059920
very nice
>>101059910
>>
>>101059926
>>101059931
nice xayah. i haven't played league in years but that looks cool
>>
File: 00093-TFT_124818.png (3.22 MB, 1536x2560)
>>101059976
thanks, I don't play league much anymore either but some of my favorite character designs outside anime/eastshit
>>
imagine the smell

>>101059957
>>
>>101059915
is this the muslim cube
>>
File: ComfyUI_00867_1.png (2.97 MB, 1796x1397)
>>
File: grid-0012.jpg (654 KB, 2400x4000)
>>
Holy quokkamole.
>>
>>101059999
>>
Why are there both /sdg/ and /ldg/ nowadays?
>>
>>101060047
i still dont know why
but ldg posts sound more... reddit? to me
>>101060028
>>
>>101060047
Ran and other redditors wanted their own thread and made /ldg/.
>>
If you don't know at this point there's no reason to help you.
>>
>>101060000
Checked, Satan/Saturn Blackrock..
>>
File: ComfyUI_00877_1.png (3.39 MB, 1796x1397)
>>
>>101060047
autism
>>
>>101060065
>>
File: ghosts.png (7 KB, 887x81)
https://www.youtube.com/watch?v=BSkOzlWEmoI
>>
File: 000000_13645_.png (3.77 MB, 1349x1349)
>>
>>101060131
>>
File: RA_2_00264_.jpg (1.02 MB, 1920x2808)
>>
>>101059910
>(Cohesium:1.5)
whether or not anon was joking about this it actually worked for my purposes lel
so thank you
>>101060159
>>
https://arxiv.org/html/2406.11831v1

I wonder if it's actually better than Dall-E. The benchmarks don't show anything to do with people doing stuff or styles, just composition (no one cares about this).
>>
>>101060195
>>
>>101060300
>>
File: RA_2_00265_.jpg (1.06 MB, 1920x2808)
>>
>>101060144
Hajj
>>
SD3 tile CN working in comfy yet?
https://huggingface.co/InstantX/SD3-Controlnet-Tile
>>
captcha: /g/ vag
>>101060346
>>
File: 000000_13652_.png (2.2 MB, 1190x1190)
>>101060393
>Hajj
>>
>>101060421
>*praying intensifies*
>>
>>101060418
>>
Hello bros! I am having trouble generating an image. Everything I try just takes it further away from the desired goal! Can somebody help me? :(
I can't seem to get anything close to my prompt!
My prompt is
>tall Japanese woman. Street alley at night. beautiful Japanese face covered in scars. two big scars on each side of the face. small scars on the nose. Small scars on the forehead. Small scars on the chin. reddish-orange hair. messy ponytail with two loose bangs that hang on either side. eye retina is a pure sunlike yellow color. the white part of the eye is deep black in color. The surrounding area of the eyes is black. A petite body that is covered in huge and small scars. Small scars on the arms, huge scars on the arms, small scars on the thighs, huge scars on the thighs. stomach is well toned and has a 6-pack. pointy teeth. turtleneck. lab coat. jeans. She is holding a knife covered in red liquid. The image is shot on a camera. Anime style.
>>
File: RA_2_00266_.jpg (1.08 MB, 1920x2808)
>>
File: ComfyUI_SDXL_0193.jpg (2.21 MB, 2048x2048)
>>
File: 0042.jpg (253 KB, 1536x2304)
>>
File: 000000_13657_.png (2.3 MB, 1190x1190)
>Do not direct your focus outward, go within.
>>
File: XL_ComfyUI_00044_.jpg (570 KB, 1088x896)
>>
File: RA_2_00267_.jpg (811 KB, 1920x2808)
>>
Any good SD3 samples?
>>
>>101060703
no
>>
>>101060437
>>
File: file.png (1.96 MB, 1024x1024)
>>101060703
>>
File: 1699508874769345.png (854 KB, 1024x1024)
>>101060703
ok bet
>>
Based Ella Freya enjoyer.
>>
File: ComfyUI_SDXL_0185.jpg (2.14 MB, 2048x2048)
>>
File: RA_2_00268_.jpg (954 KB, 1920x2808)
>>
File: 1694340969098329.png (1.25 MB, 1024x1024)
>>101060703
>>
File: ComfyUI_temp_dldgs_00072_.png (2.28 MB, 1120x1440)
>>
File: 1718128130284401.png (692 KB, 1024x1024)
>>101060703
fuck you 4chan
>>
File: 1718234175786658.png (1.08 MB, 1024x1024)
>>101060825
>>
>>101060837
>>
File: 1692977684423188.png (1.37 MB, 1024x1024)
>>101060841
>>
File: ComfyUI_SDXL_0203.jpg (2.4 MB, 2048x2048)
>>
File: 1700569488999396.png (968 KB, 1024x1024)
>>101060863
>>
File: RA_2_00269_.jpg (841 KB, 1920x2808)
>>
File: 1691705086269235.png (749 KB, 1024x1024)
>>101060874
>>
File: sd15iter_0044.jpg (562 KB, 2304x2304)
>>
File: 1700514598085111.png (1.65 MB, 1024x1024)
>>101060885
there's a million more amazing images
if someone says SD3 is bad, don't take them seriously. It has its issues but it is NOT bad.

I zipped the last time I ran this experiment and posted it. I'll probably do that again. These gens are all T5 only. no clip g or l
>>
File: 000000_13666_.png (2.24 MB, 1190x1190)
>>
>>101060963
Neat orb
>>
File: 0.jpg (308 KB, 1024x1024)
>>101060963
>>
>>101060974
Ty, SD3 with +5 on modelsamplingsd3
>>
>>101060980
Nice.
>>
File: 1689546868701104.png (1.59 MB, 1024x1024)
>>101060951
>>
File: stolen4.png (1.03 MB, 768x1344)
>>
File: RA_2_00270_.jpg (729 KB, 1920x2808)
>>
>>101060755
>>
File: ComfyUI_temp_xclih_00101_.png (2.7 MB, 1120x1440)
>>
File: 0.jpg (285 KB, 1024x1024)
>>101061001
thx
>>
File: ComfyUI_temp_xclih_00106_.png (2.51 MB, 1120x1440)
>>
File: ComfyUI_temp_xclih_00107_.png (2.85 MB, 1120x1440)
>>
>>101061112
Sun in the cube!!soo good.
>>
me on the right

>>101061052
>>
File: ComfyUI_temp_xclih_00110_.png (2.79 MB, 1120x1440)
>>
File: ComfyUI_temp_xclih_00111_.png (2.48 MB, 1120x1440)
>>
>>101059899
>>101059890
you ever going to accept my friend request on discord?

at this point there are far more people in the discord than not.
>>
File: ComfyUI_00906_.png (3.78 MB, 1796x1397)
>>
File: RA_2_00271_.jpg (1.01 MB, 1920x2808)
>>
>>101061166
>>
>>101061231
deep fried as fuck
>>
>>101061233
first time?
>>
>>101061233
>>101061241
it's actually not lel
cmyk colors
>>101061231
>>
>>101061254
oh it absolutely is
>>
>>101061271
you won't get anywhere with this
>>
>>101061271
feel free to think what you wish
>>101061231
>>
fucking kek
>>
>>101061282
>>
File: RA_2_00272_.jpg (946 KB, 1920x2808)
>>
>>101061231
>>101061282
you see how the floor is getting goosebump like textures?
You see the knees?
the bad chromatic aberration?
the latex skin texturing?
All classic signs of a deep fried gen, but feel free to be an oblivious moron
>>
>>101061314
you're assuming a lot of things
>>101061303
>>
File: ComfyUI_temp_dldgs_00122_.png (2.95 MB, 1704x1168)
>>
>>101061329
>points out specific examples
>"you're assuming"
da fuq
how retarded can you be
>>
>>101061335
nice
>>101061329
>>
>>101061352
you pointed out things that would suit what you wish to see. in no way does your list of specific examples require me to agree with what you wish. that's all. as i said, feel free to think what you wish.
>>101061353
>>
File: 3195126433.jpg (69 KB, 896x768)
>>
Do any of you know of some software that can crop images really quickly? I'm thinking like a fixed resolution box that moves with the cursor and when I click it saves the new cropped picture. I have a lot of cropping I want to do for loras.
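
The box behaviour described above (a fixed-resolution box that follows the cursor and never leaves the image) can be sketched in plain Python. This is only a rough sketch of the geometry, not a cropping tool: the function name `crop_box` and all its parameters are made up for illustration, and hooking it up to a mouse click and a save step is left out.

```python
def crop_box(cursor_x, cursor_y, box_w, box_h, img_w, img_h):
    """Return (left, top, right, bottom) for a fixed-size box centered on the cursor.

    Hypothetical helper: the box is shifted back inside the image whenever
    the cursor is too close to an edge, so the crop size never changes.
    """
    if box_w > img_w or box_h > img_h:
        raise ValueError("crop box is larger than the image")
    # Center the box on the cursor.
    left = cursor_x - box_w // 2
    top = cursor_y - box_h // 2
    # Clamp so the whole box stays inside the image.
    left = max(0, min(left, img_w - box_w))
    top = max(0, min(top, img_h - box_h))
    return (left, top, left + box_w, top + box_h)


# Example: a 512x512 box on a 1920x2808 image, cursor in the middle.
print(crop_box(960, 1404, 512, 512, 1920, 2808))
```

The returned tuple uses the (left, upper, right, lower) order that Pillow's `Image.crop` expects, so it could be passed straight to it if Pillow is what you end up scripting with.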
>>
bring back the notables, this shit sucks
>>
>>101061382
who is a notable?
>>
>>101061380
FastStone Photo Resizer
>>
>>101061394
Ran
>>
File: 00008-150071481.jpg (297 KB, 1400x1400)
>>
>>101061373
this isn't about agreeing or disagreeing, deep fried gens have specific qualities which is why they're immediately identifiable as being deep fried. It's not a subjective term, it has a fixed meaning. You're just unaware of what you're trying to refute and sounding retarded in the process.
>>
>>101061413
lol
yeah real notable
>>
>>101061413
I still have no idea who ran is
>>
File: RA_2_00273_.jpg (789 KB, 1920x2808)
>>
>>101061373
>>
>>101061425
Quiet debo
>>101061421
>>
>>101061438
>dogshit gens
>sucking ran's cock
goes hand in hand
>>
>>101061437
>>
>>101061404
Saving is a little slow but this is good, thanks.
>>
>>101061456
what lora are you going to make?
>>
>>101059915
cube outta nowhere!
>>101061449
>>
>>101059885
If this image was made by a real person, I would marvel at it. "Wow," I'd go, "I don't even like Halo, but it must have taken hundreds of hours of cumulative practice and knowledge to be able to make the lighting just perfectly so, to make the sheen of the armour look just that good."
Instead, this was made by some 20 iq retard who wrote "Grassy hills background, Videogame in-game screenshot happy Adorable Quokka wearing halo spartan armor" into a text prompt.
Nothing was gained.
Nothing was lost, except some poor computer's clock cycles.
Nobody worked for anything in this picture, except for the poor likely underpaid researchers who engineers who unwittingly opened this pandora's box. No craft was perfected. No skills were honed. You didn't learn anything about yourself, life, and least of all art by "making" this picture. All you did was take one small step towards the new future of humanity, one so utterly deprived of aspiration, ambition, talent, and meaning that they comfortably slob around in their chairs while the machines that they built for them feed them an endless mindless cycle of soulless entertainment.
Good luck, I say to you. You will find me in my cave, safe from the horrors of the world.
>>
>>101061492
>t. kaczynski
>>
>>101061492
researchers AND engineers*
my apologies
>>
>>101060992
Are you using comfy??
>>
File: ComfyUI_00915_.png (3.78 MB, 1796x1397)
>>
File: RA_2_00274_.jpg (856 KB, 1920x2808)
>>
File: ComfyUI_00669_.png (1.34 MB, 1024x1024)
First try SD3 output
Much better than expected based on the hate
>>
File: 1695839583176479.png (22 KB, 512x512)
>>101061483
FFT loras. I will be cropping faces from images like this and for another lora I'll be cropping the upper half or portraits of fanart to supplement style loras. I've also made some datasets for Yoshida's FF3 art, FFTA, a couple other things, and I got someone to send me raw video files from the FMVs. I think taking a bunch of screenshots of those would make a cool lora.

I made one already if you want to try it. It's just using most of the full body job art.
https://files.catbox.moe/r6e5fo.safetensors
>>
>>101061492
If this post was made by a real person, I would marvel at it. "Wow," I'd go, "I don't even like img gen, but it must have taken hundreds of hours of cumulative practice and knowledge to be able to make the shitposting just perfectly so, to make the sheen of the nogen look just that good."
Instead, this was made by some 20 iq bot who wrote "Shit on prompt of grassy hills background, Videogame in-game screenshot happy Adorable Quokka wearing halo spartan armor" into a text prompt.
Nothing was gained.
Nothing was lost, except some poor computer's clock cycles.
Nobody worked for anything in this shitpost, except for the poor likely underpaid anons whose bots opened this pandora's box. No craft was perfected. No skills were honed. You didn't learn anything about yourself, life, and least of all art by "making" this shitpost. All you did was take one small step towards the new future of humanity, one so utterly deprived of aspiration, ambition, talent, and meaning that they comfortably slob around in their chairs while the machines that they built for them feed them an endless mindless cycle of soulless entertainment.
Good luck, I say to you. You will find me in my cave, safe from the horrors of the world.
>>
>>101061508
Yes
>>
File: 00009-150071482.jpg (286 KB, 1400x1400)
>>
>>101061492
>>101061504
I don't like the armor that much, i did try to make it look like halo but the model kept making armor that looked like Doom
>>
File: ComfyUI_temp_xclih_00125_.png (2.62 MB, 1120x1440)
>>
File: ComfyUI_temp_xclih_00126_.png (2.64 MB, 1120x1440)
>>
>>101061485
>>
Dead thread

Dead general
>>
bred
>>
>>
>>101061590
>>
File: ComfyUI_temp_dldgs_00135_.png (2.97 MB, 1168x1704)
maybe it does know
>>
>>101061813
Is that the guy from the Pattinson Batman movie?
>>
>>101061536
Neurons activated.
>>
File: frame_00375_.png (1.14 MB, 1728x1248)
>>
>>101061757
>>
>>101061915
>>
File: ComfyUI_temp_xclih_00103_.png (2.68 MB, 1120x1440)
>>
File: ComfyUI_temp_cmvqb_00159_.png (3.21 MB, 1248x1824)
>>
File: ComfyUI_temp_xclih_00158_.png (2.95 MB, 1120x1440)
>>
File: 00018-3766957486_cleanup.png (1.47 MB, 1064x1192)
>>
time to go
gn all

>>101061954
>>
Florence-2 seems to be finally a good NSFW captioner. The short ones are actually the best I'm finding. some funny (accurate) samples:

>A 3D rendering of a woman being fucked by two men.
>An anime illustration of a woman being fucked by a werewolf in a barn.
>A 3D rendering of a woman with long black hair and red lipstick being fucked by a black man.

Short caption mode exclusively uses "fuck", "fucking" and "fucked" to describe the action for some reason lol
>>
File: sd3bo_00070_.png (2.5 MB, 1536x1152)
I think I assembled my 1girl wrong

>>101062118
gn

>>101062159
thats kinda funny since its a microsoft model
>>
File: 00069-343979817.png (1.35 MB, 768x1024)
>>101057220
>I'll use inpainting to fix it for him
>inpaint: let me just get that out of there for you
>>
File: ComfyUI_temp_xclih_00177_.png (2.68 MB, 1232x1584)
>>
>>101062159
very interesting and somewhat unexpected
>>
File: 00025-1972451512.png (3.08 MB, 1280x1920)
>>101061929
>>101061929
>>101061929
Move to /ldg/, with Stability on its deathbed /sdg/ will be a thing of the past
>>
I'm not moving to a containment thread
>>
File: 00026-2733976969.jpg (289 KB, 1640x1304)
Good night fellas
>>
File: 00021-876370624.png (1.23 MB, 1112x1248)
>>
>>101062322
gn
>>
File: 00000-3679030649.png (3.19 MB, 1280x1920)
>>101062316
Yes you are
>>
File: sd3bo_00071_.png (2.92 MB, 1536x1152)
>>101062322
gn
>>
>>101062312
You dont go to a funeral when someone is on their deathbed. You go after they die
>>
File: ophanedagain.png (3.02 MB, 1208x1552)
>>101062338
only if alig comes back
>>
File: 00031.png (3.04 MB, 1840x1432)
>>
>>101062159
i dont think its good enough and the other more descriptive modes seem to hallucinate
perhaps as a supplement to another tagger or maybe if someone wanted to dropout the more descriptive tags to be able to have good results with short prompts when generating
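
The tag dropout idea could look roughly like this. It is a minimal sketch, assuming captions are already split into a list of tags; `drop_tags`, `keep_prob` and `always_keep` are made-up names, not part of any trainer or tagger.

```python
import random


def drop_tags(tags, keep_prob=0.5, always_keep=(), seed=None):
    """Randomly drop tags so the model also sees short captions in training.

    Hypothetical helper: each tag survives with probability keep_prob,
    except tags listed in always_keep, which are never dropped.
    """
    rng = random.Random(seed)  # seeded so a run can be reproduced
    kept = []
    for tag in tags:
        if tag in always_keep or rng.random() < keep_prob:
            kept.append(tag)
    return kept


# Example: keep the subject tag, thin out the descriptive ones.
tags = ["knight", "red cape", "castle background", "dramatic lighting"]
print(drop_tags(tags, keep_prob=0.5, always_keep=("knight",), seed=0))
```

Order is preserved, so whatever comes first in the caption (e.g. a trigger word) stays first after dropout.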
>>
File: 00040.png (3.22 MB, 1840x1432)
>>
>please you have to come to our dead thread and post to no one
>>
no, stay here but i know you love shitting up the place so that wont happen
>>
File: 6lns2brtsm7d1.png (2.17 MB, 1830x1148)
SAI will make an announcement about the SD3 fiasco kek, was about fucking time
>>
>stable diffusion is dead
>posts image made with stable diffusion
okay then
>>
>>101062349
Anon will just double bake an /sdg/ regardless
>>
>>101062418
people won't stick with SDXL forever, alternatives will come and move the imagegen model quality forward, like Pixart, HunyuanDiT or this shit maybe?

https://arxiv.org/abs/2406.11831
>Benefiting from the inherent ability of the LLMs and our innovative designs, the prompt understanding performance of LI-DiT easily surpasses state-of-the-art open-source models as well as mainstream closed-source commercial models including Stable Diffusion 3, DALL-E 3, and Midjourney V6. The powerful LI-DiT-10B will be available after further optimization and security checks.
>>
>>101062415
>SAI will make an announcement
I hope it'll be more epic than this one
https://www.youtube.com/watch?v=fueRUi5AWWQ
>>
File: sd3bo_00072_.png (3.11 MB, 1536x1152)
>>101062415
>announcement: we're releasing 8b on our api and giving everyone a free month subscription as an apology!
>>
File: 00009-1450292717.png (1.36 MB, 768x1024)
>>101062234
>>101057220
figsed
>>
>>101062436
this is the 2nd paper to call sd3 closed source kek
>>
>>101062239
It seems the "FT" versions are the sanitized ones like in terms of what they will and won't say. The "Base" version of Large is what I've been using to get good NSFW results.
>>
File: BurglarBeeFinal.jpeg.png (2.41 MB, 1600x1095)
>>101058111

Like so:

https://civitai.com/articles/5799
>>
File: 00011-1286151331.png (2.95 MB, 1280x1920)
>>101062344
It will be a while before SAI actually dies, but better to move on now while you can
>>
File: 00057.png (3.02 MB, 1840x1432)
>>
>>101062370
which version were you using? Large non-FT is the best, I'm finding.

I agree about using it in conjunction with something else though. What I'm doing is outputting Florence first, then running wd-vit-tagger-v3 and concatenating the list of Booru tags so it comes immediately after the Florence output.
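
The concatenation step is simple enough to sketch. This assumes you already have the Florence caption as a string and the tagger output as a list of tag strings; `build_caption` is a made-up name, and how you actually call Florence-2 or wd-vit-tagger-v3 is not shown here.

```python
def build_caption(florence_caption, booru_tags):
    """Join a natural-language caption with booru-style tags.

    Hypothetical helper: the caption comes first, the tags follow
    immediately after it as a comma-separated list.
    """
    # Drop surrounding whitespace and the trailing period so the
    # result reads as one comma-separated line.
    caption = florence_caption.strip().rstrip(".")
    tags = [t.strip() for t in booru_tags if t.strip()]
    if not tags:
        return caption
    if not caption:
        return ", ".join(tags)
    return caption + ", " + ", ".join(tags)


# Example with placeholder inputs.
print(build_caption("A photo of a cat sitting on a sofa.", ["cat", "sofa", "indoors"]))
```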
>>
>>101062476
If you look at the prompts people post on there, it seems a large number use the on site image generator, which costs buzz. Hosting all those models has to be expensive though
>>
>>101062415
>Hopefuly
Why the fuck are SAI posts always so cryptic and vague? If you don't know, then shut the fuck up.
>>
These really aren't nsfw despite being posted on /aco/ so I'm reposting them here.
Gwen is better when she is masked.
>>
>>101062510
ty for the trans representation during pride month
>>
>>101062491
also using base large on hf
>>
File: 00033-2838093706.png (321 KB, 512x512)
do you have to use either SD 1.5 OR an alternate model like AOM3 and not both? I was trying to somehow use 1.5 with AOM3/WaifuDiff and a VAE but couldn't figure out how to get all 3 to work together..
>>
File: 00010-2581749588.png (3.14 MB, 1280x1920)
>>101062535
Gwen isn't trans
>>
I have a confession to make, I only fap to my slop.
>>
>>101062556
What are you trying to achieve in base 1.5 that you can't in aom3? You can always do a second pass, that way you gen on either base1.5/aom3 then pass that latent to another ksampler for the other model (and repeat for a third pass if you want)

What are you trying to do ultimately is what I'm getting at? An anon might be able to put you in the right direction
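
The multi-pass idea above boils down to feeding the output of one sampler into the next. Here's a toy sketch of just that chaining; it is not the ComfyUI API, `multi_pass` is a made-up name, and the "samplers" are stand-in functions so the snippet runs on its own.

```python
def multi_pass(latent, passes):
    """Run a latent through several sampling passes in order.

    Hypothetical helper: each entry in passes is any callable that
    takes a latent and returns a new one (one per model/ksampler).
    """
    for sample in passes:
        latent = sample(latent)
    return latent


# Stand-in samplers that only record which model touched the latent.
def base15_pass(latent):
    return latent + ["base1.5"]


def aom3_pass(latent):
    return latent + ["aom3"]


print(multi_pass([], [base15_pass, aom3_pass]))
```

In ComfyUI the same thing is done by wiring the LATENT output of one KSampler into the `latent_image` input of the next, usually with a lower `denoise` on the later pass so it refines instead of starting over.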
>>
File: sd3bo_00074_.png (2.78 MB, 1536x1152)
>>
>>101062556

ty for the trans representation during pride month
>>
>>101062535
Thank you anon, but don't sell yourself so short, You are representation enough ;)
>>
File: 00021-42766219.png (2.81 MB, 1280x1920)
>>
https://stability.ai/news/stable-diffusion-3-medium
2b is all we gonna get
>Future Plans
>We plan to continuously improve Stable Diffusion 3 Medium based on user feedback, expand its features, and enhance its performance. Our goal is to set a new standard for creativity in AI-generated art and make Stable Diffusion 3 Medium a vital tool for professionals and hobbyists alike.
>>
>>101062612
>based on user feedback
My fucking ass
>>
>12 Jun
sd3 is good and I don't give a shit what you guys say
>>
File: Sophistication.jpg (324 KB, 980x980)
>>101062612
>Announcing the Open Release of Stable Diffusion 3 Medium, Our Most Sophisticated Image Generation Model to Date
>>
>>101062567
She died in The Amazing Spider-Man #121–122 and it fucked Peter Parker up so bad he started dressing like her and being Spider-Gwen. So, yes.
>>
>>101062645
REEEEEEEEEEEEEEEEEEEEEEEEE spoilers
>>
>>101062574
Does aom3 include 1.5 or something? I thought they were completely separate methods or whatever and I was trying to have them influence each other. I may just not have a good understanding of what models actually are... I tried merging them and now I can't seem to switch back to aom3.. it's stuck on v1-5.pruned despite that not even being in my model folder

Just trying to create good weebshit stuff, and definitely not imitate the work of any artists I enjoy.
>>
File: NO.jpg (591 KB, 1523x1532)
>>101062631
>sd3 is good
>>
File: 00025-3100364930.png (2.39 MB, 1280x1920)
>>101062645
Good one lol
>>
>>101062645
so now everyone that crossdresses is trans? I like how the woke reinforce stereotypes: "If you wear a dress you're a girl"
>>
File: ComfyUI_temp_spdlr_00011_.png (3.51 MB, 1400x1800)
>>101062654
At least put in some effort into highlighting some of the problems
>hint it's still a great model

>>101062653
>Does aom3 include 1.5 or something
In short, yes it does. Choose it as your checkpoint/model. The main thing that relies on the checkpoint/model (yes, there are other things) would be LoRAs, which are tiny little models that influence your main checkpoint/model.

tl;dr set Aom3 as your checkpoint/model and have fun.
>>
File: 00026-1522488286.png (2.48 MB, 1280x1920)
>>
File: Copium.jpg (30 KB, 1353x284)
>>101062693
>hint it's still a great model
the SA researchers literally called it a "failed experiment" but you still wanna pretend it's a good model?
>exhibit 101 of the Stockholm syndrome
>>
>>101062715
I'm not coping, I see the results. It can yield some incredible gens and you're delusional if you think otherwise (including all of the faults).
>>
>>101062725
So you know better than the SAI researchers that created the model and came to the conclusion it was a failure? All right, you're entitled to your opinion I guess?
>>
>>101062731
We can go back and forth on this but the bottom line, for me, is the results. It yields what I want. Yes the training dataset was pruned, yes there are a plethora of safety precautions used and at the end of the day it doesn't matter because I like what it gens. It's truly that simple.
>>
File: 00027-4048852779.png (2.35 MB, 1280x1920)
>>101062725
>I'm not coping
that's what someone who's coping would say
>>
>>101062715
I don't see why they would push to put it out then. Releasing a model that people think isn't good isn't going to make anyone want to buy the 8b model.
>>
File: sd3bo_00075_.png (2.91 MB, 1536x1152)
>>101062679
>I like how the woke reinforce stereotypes
he actually typed this

>>101062715
>made by people who left a while ago
does that mean that their attempts at training models were so bad that they had to dig through the old researchers' discard bin in the hopes that it'd be marketable?
>>
>>101062747
>he actually typed this
where's the lie, that mf literally said that because Spiderman decided to use Gwen's clothes, that means he's trans, since when "wearing dresses = trans"? That's a fucking stereotype
>>
>>101062747
or
>let's give them this one we can't profit off of for free
>>
>>101062725
I have stock in the other companies so I need it to be bad. Please say it's bad.
>>
>>101062745
>I don't see why they would push to put it out then.
So that they can say "See? We fulfilled our promise of releasing SD3... it's just that it was only sd3 medium, but that's sd3 nonetheless"
>>
>>101062759
But again, by calling it SD3, that makes the 4b and 8b "look" bad
>>
>official Stability AI thread
>>
File: 00006-1652927726.png (1.29 MB, 1112x1248)
>>
>>101062776
Even the SAI reddit is a dumpster fire, it became so bad it's now a place to make Luma dream machine ads kek
https://reddit.com/r/StableDiffusion/comments/1djhtzt/im_testing_to_animate_cats_with_dream_machine_and/
>>
File: 1718863314826.jpg (204 KB, 1024x1024)
>>
File: sd3bo_00087_.png (2.59 MB, 1536x1152)
>>
File: sd3bo_00090_.png (3.29 MB, 1536x1152)
I don't think we're getting a pre-release drop tonight
>>
File: sd3bo_00096_.png (3.17 MB, 1536x1152)
>>
File: 1718863984600.jpg (216 KB, 1024x1024)
>>
>>101062887
guess I can go to sleep
gn sdg
>>
File: sd3bo_00101_.png (2.97 MB, 1536x1152)
>>101062947
gn
>>
File: ComfyUI_00932_1.png (2.86 MB, 1796x1397)
>>
>>101062725
post a catbox of something good made with sd3
>>
File: ComfyUI_00948_1.png (2.96 MB, 1796x1397)
>>
File: sd3bo_00104_.png (3.19 MB, 1536x1152)
>>
File: sd3bo_00107_.png (2.86 MB, 1536x1152)
why didn't anyone gen justin timberlake drunkenly driving into children or something? seems like it coulda been a fun genre for the day
>>
>>101063127
>>101063127
>>101063127
>>
File: 1718866174890.jpg (216 KB, 1024x1024)
I guess I'll fill this
>>
File: 1718866188414.jpg (223 KB, 1024x1024)
>>
File: 1718866199608.jpg (232 KB, 1024x1024)
>>
File: 1718866208609.jpg (224 KB, 1024x1024)
END
>>
>>101062478
>but better to move on now while you can
And why is that?
>>
>>101061492
holy based
>>
>>101062453
where's the other one
>>
>>101064312
https://github.com/Tencent/HunyuanDiT?tab=readme-ov-file#-comparisons
they actually call it open source in the paper itself but on github comparison they have it as closed
>>
>>101059885
It's so strange i knew this was halo from the landscape in the thumbnail alone
>>
>>101061149
Any way to modify these so they look like they were shot in the late 90s? On tape




All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.