/g/ - Technology


Thread archived.
You cannot reply anymore.




File: 27-6-274551.jpg (352 KB, 1440x1920)
Previous /sdg/ thread : >>100368968

>Beginner UI local install
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io

>Local install
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI (Node-based): https://rentry.org/comfyui
AMD GPU: https://rentry.org/sdg-link#amd-gpu
Intel GPU: https://rentry.org/sdg-link#intel-gpu

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Auto1111 forks
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
Anapnoe UX: https://github.com/anapnoe/stable-diffusion-webui-ux
Vladmandic: https://github.com/vladmandic/automatic

>Run cloud hosted instance
https://rentry.org/sdg-link#run-cloud-hosted-instance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
Inpainting: https://huggingface.co/spaces/fffiloni/stable-diffusion-inpainting
pixart: https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma

>Models, LoRAs & embeddings
https://civitai.com
https://huggingface.co
https://rentry.org/embeddings

>Animation
https://rentry.org/AnimAnon
https://rentry.org/AnimAnon-AnimDiff
https://rentry.org/AnimAnon-Deforum

>SDXL info & download
https://rentry.org/sdg-link#sdxl

>Index of guides and other tools
https://codeberg.org/tekakutli/neuralnomicon
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>View and submit GPU performance data
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html

>Share image prompt info
4chan removes prompt info from images, share them with the following guide/site...
https://rentry.org/hdgcb
https://catbox.moe

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg

Official: discord.gg/stablediffusion
>>
File: SDG_News_00227_.png (1.77 MB, 1560x896)
>mfw Resource news

05/07/2024

>CCDM: Continuous Conditional Diffusion Models for Image Generation
https://github.com/UBCDingXin/CCDM

>MediaPipe Hand Crop Fix
https://github.com/sign-language-processing/mediapipe-hand-crop-fix

>LGTM: Local-to-Global Text-Driven Human Motion Diffusion Model
https://github.com/L-Sun/LGTM

>AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding
https://github.com/X-LANCE/AniTalker

>DVMSR: Distillated Vision Mamba for Efficient Super-Resolution
https://github.com/nathan66666/DVMSR

>ImageInWords: Unlocking Hyper-Detailed Image Descriptions
https://google.github.io/imageinwords/

>MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model
https://dai-wenxun.github.io/MotionLCM-page/

>comfy-cli: Command Line Interface for Managing ComfyUI
https://github.com/yoland68/comfy-cli

>Performance Profiling Report (Forge/A1111/ComfyUI)
https://github.com/lllyasviel/stable-diffusion-webui-forge/discussions/716

>ComfyUI-Video-Editing-X-Attention
https://github.com/chaojie/ComfyUI-Video-Editing-X-Attention

>AM-RADIO: Reduce All Domains Into One
https://github.com/NVlabs/RADIO

05/06/2024

>Detector-Free Structure from Motion
https://zju3dv.github.io/DetectorFreeSfM/

05/05/2024

>ComfyUI Prompt Quill
https://github.com/osi1880vr/prompt_quill_comfyui

>Efficient Implementation of Kolmogorov-Arnold Network [KAN]
https://github.com/Blealtan/efficient-kan

>controlnetXL_line2color
https://huggingface.co/kataragi/controlnetXL_line2color

05/04/2024

>PuLID now supported in sd-webui-controlnet!
https://github.com/Mikubill/sd-webui-controlnet/discussions/2841

>ThemeStation: Generating Theme-Aware 3D Assets from Few Exemplars
https://github.com/3DTopia/ThemeStation

05/03/2024

>Virtuoso Nodes: Set of nodes to give Photoshop-like functionality within ComfyUI.
https://github.com/chrisfreilich/virtuoso-nodes
>>
>mfw Research news

05/07/2024

>Generated Contents Enrichment
https://arxiv.org/abs/2405.03650

>Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond
https://arxiv.org/abs/2405.03520

>Gaussian Splatting: 3D Reconstruction and Novel View Synthesis, a Review
https://arxiv.org/abs/2405.03417

>Animate Your Thoughts: Decoupled Reconstruction of Dynamic Natural Vision from Slow Brain Activity
https://arxiv.org/abs/2405.03280

>Mind the Gap Between Synthetic and Real: Utilizing Transfer Learning to Probe the Boundaries of Stable Diffusion Generated Data
https://arxiv.org/abs/2405.03243

>Adapting Dual-encoder Vision-language Models for Paraphrased Retrieval
https://arxiv.org/abs/2405.03190

>Video Diffusion Models: A Survey
https://arxiv.org/abs/2405.03150

>SketchGPT: Autoregressive Modeling for Sketch Generation and Recognition
https://arxiv.org/abs/2405.03099

>Matten: Video Generation with Mamba-Attention
https://arxiv.org/abs/2405.03025

>Paintings and Drawings Aesthetics Assessment with Rich Attributes for Various Artistic Categories
https://arxiv.org/abs/2405.02982

>VectorPainter: A Novel Approach to Stylized Vector Graphics Synthesis with Vectorized Strokes
https://arxiv.org/abs/2405.02962

>iSEARLE: Improving Textual Inversion for Zero-Shot Composed Image Retrieval
https://arxiv.org/abs/2405.02951

>MVIP-NeRF: Multi-view 3D Inpainting on NeRF Scenes via Diffusion Prior
https://arxiv.org/abs/2405.02859

>Stable Diffusion Dataset Generation for Downstream Classification Tasks
https://arxiv.org/abs/2405.02698

>Enhancing Social Media Post Popularity Prediction with Visual Content
https://arxiv.org/abs/2405.02367

>Efficient Text-driven Motion Generation via Latent Consistency Training
https://arxiv.org/abs/2405.02791

>Adapting to Distribution Shift by Visual Domain Prompt Generation
https://arxiv.org/abs/2405.02797

>U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers
https://arxiv.org/abs/2405.02730
>>
File: 00031-98381640.png (3.76 MB, 1792x1792)
>Cold? Why would I be cold?
>>
File: sig_0001.jpg (146 KB, 1024x1024)
>>
File: sigma240507-143814_2609.png (1.56 MB, 1344x768)
>>
File: 57070.jpg (922 KB, 1440x3120)
>>
File: 57071.jpg (942 KB, 1440x3120)
just crusin
>>
File: 00034-2202329006.png (3.82 MB, 1792x1792)
>>
seeking the truth
>>
File: 57072.jpg (660 KB, 1440x3120)
>>100371151
whatever will the eurofags think, when they wake up to dumb big titty whores? they'll probably ban the lot of us, on grounds of heterosexuality. lel (if its happened to me, it can happen to you!)
>>
File: 57073.jpg (875 KB, 1440x3120)
>bounce
https://www.youtube.com/watch?v=gdHao9henOs
>>
gaze upon and seek
>>100371229
goes hard
>>
>>100371191
The line seems to be full coverage of the areola, so we'll probably be fine
>>
File: 57074.jpg (916 KB, 1440x3120)
>>100371246
hey anon, it's me. demon jesus.

what's up?
>>
File: sigxl_0012.jpg (441 KB, 2048x2048)
>>
File: 00059-621300543.png (3.92 MB, 1792x1792)
>AIYY YAI YAI YAIIIIIIIIII

>>100371279
My cholesterol
>>
File: 57075.png (3.69 MB, 1440x1440)
>>100371279
what's the difference, you ask?
well let me tell you!
regular jesus loves you and stuff,
demon jesus is all mad about ... uh stuff!
>>
File: 57076.jpg (805 KB, 1440x3120)
>>
Out of the loop for a year, whats the QRD
>>
>>100371296
A dwarf yakiniku restaurant sounds comfy
>>
File: 57077.png (3.7 MB, 1440x1440)
>>100371360
shit's all fucked, yo!
>>
>>100371377
1girl posters are still a waste of space i see
>>
File: 57078.png (3.93 MB, 1440x1440)
>>100371381
it is what it is.

we long to see your superior proompting skills! put us to shame, anon-sama!
>>
File: sigxl_0014.jpg (355 KB, 2048x2048)
>>
>>100371395
okay this confirmed my suspicion - complete and utter stagnation. I'll come back in another year
>>
>>100371409
See you soon!
>>
Last one from me, good night anons
>>
File: 57079.jpg (878 KB, 1440x3120)
>>100371409
dear diary,

some fag barged in and declaimed his retardness. it was pretty weird.

anyway, the ""CRITICAL SHIPMENT""" is coming soon so, look out.

Cheers,
Wyatt Mann

PSPS: TAKE A GOOD LOOK AT -53.2332W,23.NISHJW;

WOW
oh
my
OOOWOW
that's doog.
good. that's good.
>>
File: 00001-1571287515.png (560 KB, 512x768)
>>100371451
good night.
>>
you did not create you
you were made
>>
File: 00052-622650871.png (3.96 MB, 1792x1792)
>>100371451
sleep tight
>>
i would be honored to have my work enshrined into latent space. "don't use my drawings in your dataset" boo hoo.
>>
what model do people use for CLIP interrogator?
>>
>>100371579
I don't think many people use it, it's 100% inaccurate.
>>
File: 00063-2528690524.png (3.97 MB, 1792x1792)
>>
>dude's been genning giant woman with massive tits and wide hips for hours now

Having a nice goon sesh there?
>>
File: Pixart_00486_.png (1.32 MB, 1216x832)
>>
>>100371773
Actually, I've been watching old nostalgia critic videos while waiting on gens
>>
File: Pixart_00488_.png (1.44 MB, 1216x832)
>>
Anyone know if there is a node that lets me see what the T5 encoder is doing with my prompt?
>>
File: 00084-1342356445.png (3.95 MB, 1792x1792)
Good night everybody
>>
burn SAI burn
>>
>>100372050
*Released unusable 3B parameter llm*
>>
i dont understand how to use this. so i downloaded the model and installed everything but it just gives me cave painting like things. what settings do i use for things that are high res and stuff?
>>
>>100371976
night
>>
>>100372081
You downloaded what model on what UI following what guide?
Your post tells me nothing except you're some kind of cave man.
>>
>>100372186
i was using that retard guide
https://rentry.org/voldy
and downloaded the 1.5 stable diffusion model
>>
File: 00000-513098803.png (1.25 MB, 1216x832)
>>
>>100372232
Go to civitai and download a couple of newer models. Or use sentences to describe what you want with the one you have.

For example:

A painting by matisse of a woman. She is saying "paint me like one of your french girls".
>>
>>100372232
This is why I never read the OPs in generals.
https://github.com/lllyasviel/stable-diffusion-webui-forge
Just download the one click package from here and you're off. Talk about getting new and better models once you confirm it works.
>>
File: 00007-4140919231.png (2.37 MB, 1824x1248)
>>
Anyone have any cool styles they want to recommend?
>>
>>100372288
my latest ones.

Christian Wilhelm Allers painting -- try drawing

Christoph Niemann minimal illustration -- might not work

Christophe Vacher concept art -- seems to favour fantasy scenes

Clarence Gagnon painting

Clarence Holbrook Carter painting

Claude Monet painting

Clayton Crain digital comic

Clive Barker illustration

Clyde Caldwell illustration -- fantasy

Clyfford Still painting -- limited palette?
>>
>>100372255
>>100372259
oh i think i got it. i just put the sampling steps all the way up to 150
>>
>>100372302
Bingo
>>
>>100372296
Hit me like a fucking truck that I asked a question and then just got a fucking answer.
Thank you. That never happens anymore.
>>
File: 00012-2539234504.png (3.61 MB, 1824x1248)
damn extra leg
>>
>>100372259
Sampling steps or cfg?
>>
>>100372069
what i dont understand is, if they wanted to make more different stuff, then why not make things that would help themselves make their main thing better, this way they just giga split their resources and are below mediocre at everything
like making their own vlm (which ironically im pretty sure is the ONLY thing they have not done) and releasing it to help people align with base checkpoint captioning when finetuning would probably go a long way
>>
>>100372389
I sound like a broken record at this point, but there is no way a huge amount of SAI's financials weren't improperly used or straight up embezzled.
They had basically an unlimited money spigot to train endless top of the line models and they somehow screwed it up.
>>
File: 00023-2081245455.png (2.82 MB, 1344x1728)
>>
>>
>>100372450
cool
>>
>>100372450
milton bradley dark souls
>>
File: Pixart_00506_.png (1.68 MB, 1216x832)
>>
File deleted.
>>
A1111 > comfyui
SD 1.5 > SDXL

if you believe otherwise you're a midwit
>>
File: image(8).jpg (1.02 MB, 2610x2088)
>>100371451
good night

>>100368842
>this is the quokka we've come to know
dem macropods
>>
>>100371451
night
>>
sigma > balls

if you believe otherwise you're a midwit
>>
File: Pixart_00511_.png (1.71 MB, 1216x832)
>>
File: 00003-1669600554.png (1.26 MB, 1152x896)
>>
>>100372507
I decoded your autism. You got better results and were more pleased with the supposedly objectively better options.
Rather be a midwit than a dipshit.
>>
>>100372595
lol
>>
File: 00031-2329841529.png (2.65 MB, 1824x1248)
>>
>>100371875
hmm wouldn't this have to be added as output to the t5 text encode node or something like that?

I actually don't remember what format it encoded to tho.
>>
File: Pixart_00515_.png (1.75 MB, 1216x832)
One thing I find pixart is really good at following is where things go on the screen, if I say a demon is on the right, it puts the demon on the right.
>>
>>100372631
Yeah I actually don't know what it's spitting out after the text goes in, but I'd kinda like to see it.
>>
File: 00032-1737443900.png (1.19 MB, 1216x832)
>>
>>100372634
Cool. Is there a high success rate?
>>
File: Pixart_00517_.png (1.58 MB, 1216x832)
>>100372673
Actually details concerning the people themselves can be a bit of a crapshoot sometimes, but it listens very clearly to what side of the images something is supposed to go on.
>>
>>100372689
That's what I meant. Thanks
>>
File: Pixart_00519_.png (1.48 MB, 1216x832)
Okay, I'll stop posting faux fantasy art. I just really liked this last one.
>>
File: 00039-4211477696.png (1.27 MB, 1536x640)
>>
why is cumfart so silent since weeks?
>>
File: Pixart_00521_.png (1.29 MB, 1216x832)
>>100372717
>>
File: t.jpg (1.12 MB, 2610x2088)
>>100372649
might have to go and add it to the node yourself
>>
>>100372751
Huh.
That gives me an idea for my comics.
what node are you using to add text?
>>
>>100372717
maybe d*b* could ask him in their discord?
>>
File: 00046-1382365984.png (1.63 MB, 1152x896)
>>
File: Pixart_00541_.png (1.2 MB, 1216x832)
>>
>>100372738
>correct fennec ears
SAI is so done kek
>>
File: Pixart_00558_.png (1.25 MB, 1216x832)
>>
File: PASigma_00002_.png (928 KB, 1344x768)
Good morning anons
>>
Hello fellow sigma male
>>
File: Pixart_00567_.png (1.22 MB, 1216x832)
>>
>>100372507
Straight diffusers scripts > any gay ui
SDXL > 1.5

You suffer from dunning kruger.
>>
File: PASigma_00001_.png (1.03 MB, 1344x768)
So sigma we don't even >> each other?
>>
>>
>>
>>
>>100371114
>the gothic vintage hair dryer
>>100371144
Very hot
>>100371191
As a eurofag, I like big tits.
>>
File: PASigma_00015_.png (1.3 MB, 1344x768)
>>
File: PASigma_00016_.png (1012 KB, 1344x768)
>>
>>100373184
gib
>>
File: PASigma_00004_.png (1.45 MB, 1344x768)
>>100373216
Hungry anon?
>>
File: Pixart_00586_.png (1.12 MB, 1216x832)
>>
File: PASigma_00011_.png (1.19 MB, 1344x768)
>>100373216
I have extra cupcakes after you finish breakfast!
>>
>>100373235
very, your gens make me more hungry, stupid office days
great gens, they look really good anon
>>
File: PASigma_00013_.png (1.04 MB, 1344x768)
>>100373272
Thank you anon. I made one without sprinkles. Just how you like it
>>
what's the best furry model outside of pony diffusion?
>>
File: PASigma_00030_.png (1.26 MB, 1344x768)
>>100373300
Sameface problems?
>>
>>100373292
can you give me a quick rundown on how to get sigma setup? would love to try it this evening
iirc you only need the extra models node for comfyui or?
>>
Prompt: wawaweewaawoopeepeepeooopooodawd ad aw dwad9
>>
>>100373353
needs 20gb ram for the text encoder, do you have that?
>>
>>100373373
yeah got 64gb ram
>>
Prompt: wawaweewaawoopeepeepeooopooodawd ad aw dwad9 faF AFAF awfawf3 3t3 4ef hdafsrgv eat3 R
>>
>>100373382
looks like a well trained model
>>
File: file.png (1.43 MB, 1356x875)
name my company
>>
File: PASigma_00003_.png (1.22 MB, 1344x768)
>>100373353
https://github.com/city96/ComfyUI_ExtraModels
https://huggingface.co/PixArt-alpha/pixart_sigma_sdxlvae_T5_diffusers (copy the text_encoder files into models/t5 and the vae files into models/vae)
Get your favorite finetune (bunline or vintage knockers), or the default model at https://huggingface.co/PixArt-alpha/PixArt-Sigma/resolve/main/PixArt-Sigma-XL-2-1024-MS.pth?download=true

And here's a catbox https://files.catbox.moe/uv0h8c.png
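For anyone who wants the file placement spelled out, here is a rough sketch of where things end up. It assumes the downloaded repo has text_encoder/ and vae/ subfolders and that your install lives in a folder called ComfyUI. The file names in the demo are only placeholders, so check the ExtraModels README for the real layout.

```python
from pathlib import PurePosixPath
from typing import Optional

# Mapping taken from the instructions in this post:
# text_encoder files go to models/t5, vae files go to models/vae.
DEST_BY_SUBFOLDER = {
    "text_encoder": "models/t5",
    "vae": "models/vae",
}


def comfy_destination(repo_file: str, comfy_root: str = "ComfyUI") -> Optional[str]:
    """Return the path a repo file should be copied to, or None if it is not needed.

    comfy_root is a hypothetical install location; adjust it to your setup.
    """
    path = PurePosixPath(repo_file)
    if len(path.parts) < 2:
        return None  # top-level files (README and so on) are skipped
    dest = DEST_BY_SUBFOLDER.get(path.parts[0])
    if dest is None:
        return None  # other subfolders are not used by this setup
    return str(PurePosixPath(comfy_root) / dest / path.name)


if __name__ == "__main__":
    # Placeholder file names, only to show the mapping.
    for name in ["text_encoder/config.json", "vae/config.json", "README.md"]:
        print(name, "->", comfy_destination(name))
```

This only computes the target paths, the actual copy is still up to you.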
>>
anyone has a good anim diff workflow ?
>>
>>100373400
Probably just the text encoder trying to make sense of it.
>>
File: PASigma_00034_.png (1.38 MB, 1344x768)
>>
>>100373401
microsoft
>>
>>100373414
thank you very much anon
sounds like my smooth brain should be able to handle that
>>
>>100373418
only guy doing animations here doesnt explain shit
>>
>>100372312
Easily fixed. It is the castle on the background that seems covered in cobwebs that I don't like much. Great gen though.
>>
File: PASigma_00035_.png (1.22 MB, 1344x768)
>>100373546
In the meantime, you can make some okayish gens online for free https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
>>
File: PASigma_00037_.png (1.15 MB, 1344x768)
PLANT
>>
File: PASigma_02810_.png (1.55 MB, 1344x768)
time for another day of high art (especially coomer elves)
>>
File: 1691105054422116.png (177 KB, 488x680)
controlnet lineart inpainting is fun, just mask the face or body and poof, edits that follow the original
>>
File: 1709475677666003.png (681 KB, 592x848)
>>100373720
>>
File: 1700771102214221.png (1.1 MB, 1208x680)
>>100373741
>>
File: PASigma_00092_.png (1.28 MB, 832x1216)
>>
File: 1699672115851674.png (1.28 MB, 1024x1024)
>>
File: PASigma_02837_.png (1.69 MB, 1344x768)
>>
File: p.jpg (1.03 MB, 2610x2088)
>>100372762
>That gives me an idea for my comics.
Nice. But it's actually not a text node but a whole cool new thing that keeps a character very consistent by comparison to what SD[XL] otherwise does: https://github.com/HVision-NKU/StoryDiffusion

That said, I also remember a neat example of just doing stuff with text and bg images:
https://civitai.com/models/377483/artismysteriums-basic-playing-card-workflow
>>
File: myl.jpg (6 KB, 103x113)
made you look
>>
If the T5 encoder is doing a lot of the lifting for prompt adherence (I understand the model tags play a part as well), then isn't it worth using everywhere as a default?
The two nodes i use in comfy are the default and advanced CLIP encode nodes, I presume they are not "T5" encoders??
>>
File: PASigma_02853_.png (1.83 MB, 1344x768)
>>
>>100373892
Think of the T5 encoder as a pre-sorting mechanism for meaning. Its output is a sequence of up to roughly 300 token embeddings stored as 16-bit values, which is what the transformer gets as input (if you max out the prompt). It's just a cleaner source of info. You don't get access to T5's weights.

This is literally as bad as it's ever going to be. T5 isn't the only encoder model and it isn't even close to SOTA. Wait until we have smaller and better encoder models on transformers.
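To put a rough number on the encoder output: assuming the 300 refers to tokens, and each token comes out as a 4096-wide vector of 16-bit values, the size works out as below. The 4096 width is my assumption for a T5-XXL-sized encoder and is not stated anywhere in this thread, so treat it as back-of-the-envelope only.

```python
def embedding_bytes(tokens: int, hidden_size: int, bytes_per_value: int = 2) -> int:
    """Size in bytes of a (tokens x hidden_size) block of prompt embeddings."""
    return tokens * hidden_size * bytes_per_value


# 300 tokens is the maxed-out prompt mentioned above;
# 4096 is an assumed hidden width; 2 bytes per value means 16-bit.
size = embedding_bytes(tokens=300, hidden_size=4096)
print(f"{size} bytes, about {size / 2**20:.2f} MiB")
```

Either way it is tiny next to the encoder weights themselves.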
>>
>>100373822
sloppy sloppy slop.

>Le AI face
>Chair not even facing in the correct direction
>1 girl
>>
File: s.jpg (760 KB, 2610x2088)
>>100373892
Everyone is trying stuff. SD3 also uses T5:
https://stability.ai/news/stable-diffusion-3-research-paper
>>
File: PASigma_00128_.png (1.21 MB, 832x1216)
>>100373929
>>100373865
>>100373715
Your posts are deliciously artistic
>>
>>100372407
you underestimate bad management, Emad wanted to make ai for african kids or something
https://www.forbes.com/sites/kenrickcai/2024/03/29/how-stability-ais-founder-tanked-his-billion-dollar-startup
>>
>>100373971
I think I misspoke in my post. I wanted to say that his financials were misused. (if not straight up embezzled).

All he had to do was make SOTA AI image models. He had the funding, the knowhow and still somehow screwed it up.
>>
>>100373933
>>100373956
Thanks anons, I was worrying about my 32GB ram being enough for the near future.
>>
File: 6.png (2.33 MB, 1024x1024)
>>100373990
Still, don't attribute to malice what can be explained by stupidity
>>
File: ComfyUI_temp_dvetf_00014_.jpg (849 KB, 2048x2048)
>>
>>100373971
The jews will never get me to sub to forbes, kys
>>
File: PASigma_02871_.png (1.77 MB, 1344x768)
>>100373969
Thank you :3

Your brownies are making me hungry. Promptmaxxing in sigma is so much fun
>>
>>100373971
but according to d*b* emad is a good IT worker by default because emad is brown?
>>
>>100374013
Here, dear dumb ni****, a bypass for your pea sized brain
https://12ft.io/
>>
File: ComfyUI_temp_dvetf_00015_.jpg (967 KB, 2048x2048)
>spend 80 seconds rendering this beautiful 4k prompt
>wizard is facing the wrong way

d'oh
>>
>>100374024
no thanks not clicking that dolphin porn
>>
>>100373994
So far there is little evidence that the FP16 version of T5 won't work just fine on Pixart Sigma and even that may be bloat. On the other hand, having a text model with llama3's level of understanding connected to an image AI would be sort of nice, but I'm not sure that will happen soon, and even if it does, running THAT model on CPU may be enough for imagegen. We'll find out.

They'll still find you a reason to get moar VRAM than 24GB and more system RAM than 128GB sooner rather than later
>>
File: PASigma_00133_.png (1.23 MB, 832x1216)
>>100374045
FP16 straight up works. Why are you even mentioning there being little evidence of it not working? Just try it instead of going full no-gen.
>>
File: ComfyUI_temp_dvetf_00017_.jpg (1.15 MB, 2048x2048)
>>
>>100374072
>there is little evidence that the FP16 version of T5 won't work just fine on Pixart Sigma and even that may be bloat
>>
>>100374045
>llama3 ... connected to an image AI
This is where i was sort of thinking things were going, perhaps faster than i expected, not knowing much about what's in the pipeline however.
>>
forbes shills and dolphin porn general
>>
Why can't I use more vram to make 4k pixart gen faster?
>>
>>100374084
>>100374045
4bit and 8bit T5 work w/ bitsandbytes too. You're literally just noise. Try things out instead of pontificating about things you don't know.
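If you want to see why the lower-precision variants matter for RAM, here is a quick estimate of weight size per precision. The 4.7 billion parameter count is an assumed ballpark for a T5-XXL-sized encoder, not a figure from this thread, and real quantized checkpoints carry some extra overhead that this ignores.

```python
def weights_gib(n_params: float, bits_per_param: int) -> float:
    """Approximate size of the model weights alone, in GiB."""
    return n_params * bits_per_param / 8 / 2**30


# Assumed ballpark parameter count for the text encoder (hypothetical figure).
ASSUMED_T5_ENCODER_PARAMS = 4.7e9

for label, bits in [("fp32", 32), ("fp16", 16), ("8bit", 8), ("4bit", 4)]:
    gib = weights_gib(ASSUMED_T5_ENCODER_PARAMS, bits)
    print(f"{label}: about {gib:.1f} GiB")
```

Under that assumption, full precision lands close to the 20gb figure people quote, and each halving of precision roughly halves it.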
>>
>>100374109
for the same reason you cant use more RAM to make the CPU faster
>>
File: 1704253440868455.png (261 KB, 520x512)
>>
>>100374112
I literally said there is little evidence that it won't work fine and that it probably still is bloat (=>less precise variants probably also still work)

stop making up shit
>>
>>100374021
i think indians self identify as white therefore emad is shit according to debo
>>
File: 00000-2368744184.jpg (79 KB, 1024x768)
>>
File: PASigma_00140_.png (962 KB, 832x1216)
>>100374132
My point: there's only evidence that it works on FP16 (and below), so your words are wasted on everyone. The trainer uses FP16 by default if you look at the repo. Llama3 isn't an encoder model. And there's no "they" trying to get you to use more than 24GB of RAM. Read up bobble head.

picrel of your no-gen additions
>>
File: ComfyUI_temp_dvetf_00021_.jpg (1.26 MB, 2048x2048)
>>100374155
There was a time when coomer art interested me. But now I fully comprehend that stable diffusion can do that because that's all it has been fed on. For the better part of a year every model has been dripfed porn, merged and fed more porn, then merged again. Of course they're good at it.
And it no longer impresses me.
>>
>>
File: PASigma_00149_.png (570 KB, 832x1216)
>>100374198
Post-coomer age of enlightenment?
>>
File: ComfyUI_temp_dvetf_00022_.jpg (1.16 MB, 2048x2048)
>>
File: 1712947250377742.png (1.1 MB, 888x1904)
cammy, except as a fire emblem character (was tharja)
>>
File: 02049-4288545742.jpg (132 KB, 1020x2048)
>>100374198
thanks for the essay
>>
>>100374198
>>100374228
niiiice
>>
File: ComfyUI_temp_dvetf_00023_.jpg (1.24 MB, 4096x1024)
>>
>>100374021
Maybe he is good at IT, but he is a demented CEO/manager
>>
>>100374194
> refuting what I didn't say in the first place, then saying the same thing as I did
very helpful sar

As for the other topic: More than 24GB VRAM / more than 32GB system RAM total is a fact for all sorts of published AI models, they'll use and publish more of them. Perhaps Pixart or Stability.ai won't, who knows, but even then yet more useful additional nodes will - sooner or later.
>>
File: PASigma_02904_.png (1.95 MB, 1344x768)
>>
>>100374234
>same ai face
>>
File: 00073-3654959161.png (1.96 MB, 1296x1536)
>>100374227
>Post-coomer age of enlightenment?
My enlightenment is that I like gyarus
>>
File: d.jpg (490 KB, 1970x1576)
>>100374099
how
>>
File: 1702321028692812.png (865 KB, 1392x784)
bonus points if you can guess the game:
>>
>>
File: PASigma_00167_.png (1.39 MB, 832x1216)
>>100374292
Crazy
>>
File: q.png (1.5 MB, 1024x1024)
>>100374310
kingdom cammy: dead redemption 2?
>>
>>100374263
ohhh i see

@debo can you clarify which skin colors CEOs need to have to not be absolute garbage?
>>
>>100374343
>trying to talk to debo when he's not here
peak obsessed schizo behavior
>>
File: 1695907685170910.png (733 KB, 640x784)
>>100374331
yes sir

also "prompt more important" in controlnet seems to work well
>>
File: PASigma_02926_.png (2.03 MB, 1344x768)
>>
File: ComfyUI_temp_dvetf_00029_.jpg (1.86 MB, 4096x1024)
Not quite what I was going for, but cool nonetheless
>>
File: ComfyUI_00938_.jpg (2.32 MB, 2048x2048)
>>
>>100374343
>brown, black and shti
>>
File: PASigma_02942_.png (1.8 MB, 1344x768)
>>
File: 0.jpg (660 KB, 1024x1600)
>>
File: PASigma_00182_.png (1005 KB, 832x1216)
>>
File: PASigma_02948_.png (1.79 MB, 1344x768)
nobody fucks with Miyamoto Musashi
>>
File: ComfyUI_temp_dvetf_00032_.jpg (1.89 MB, 4096x1024)
Almost actually got it that time.
>>
File: PASigma_02953_.png (1.88 MB, 1344x768)
>>
File: 1711682008418894.png (755 KB, 504x784)
>>
File: ComfyUI_temp_dvetf_00035_.jpg (2.16 MB, 2048x2048)
>>
File: 00120-2864877723.png (3.03 MB, 1896x1368)
>1girl
>>
File: 1696443236263341.png (183 KB, 384x784)
>>100374497
>>
File: ComfyUI_00950_.jpg (2.79 MB, 2304x1792)
>>
File: PASigma_02954_.png (1.92 MB, 1344x768)
>>
File: ComfyUI_temp_dvetf_00038_.jpg (1.38 MB, 4096x1024)
>>
>>
File: Sigma.jpg (176 KB, 2048x2048)
>>100374483
i like it
>>
>>100373971
>Emad infuriated our initial investors so much it’s just making it impossible for us to raise more money under acceptable terms - A current Stability executive
lol, lmao even
>>
File: PASigma_02971_.png (1.71 MB, 1344x768)
>1girl
>>
>>
File: PASigma_02982_.png (1.61 MB, 1344x768)
can't wait for T5 FaceDetailer, it still uses CLIP :(
>>
File: ComfyUI_temp_dvetf_00042_.jpg (1.42 MB, 2816x1408)
Man I fucking love pixart
>>
>>100374663
do you have a reason to believe this is actively being worked on, or is this just hopium
>>
File: ComfyUI_temp_dvetf_00044_.jpg (1.81 MB, 2816x1408)
>>
>>100374364
he reads all messages in every thread sooner or later anyway, he can answer then
>>
File: PASigma_02993_.png (1.51 MB, 1344x768)
>>100374707
pure hopium that some of the most popular SD tools will work on Pixart in the coming months
>>
File: 02032-3386605448.jpg (56 KB, 816x1024)
>>
>>100374727
You can use the unsampler node, plug that latent into a ksampler at cfg 1 with dpmpp_2m karras, then plug those samplers into a model of your choice and use your SD tools on it from there.
>>
>>100374723
the fact you spend time thinking about him when he's not even here is pretty sad
>>
File: PASigma_00245_.png (1.19 MB, 1024x1024)
Good bye for now anons
>>
>pixart shill gone
finally some breathing room
>>
>>100374727
what are the ram/vram requirements for sigma-1024? 2k? I'm going oom w/ 32gb RAM & 16gb VRAM
>>
>>100374804
they call me schizo for a reason
>>
File: ComfyUI_temp_dvetf_00048_.jpg (1.26 MB, 1408x2816)
>>100374837
It's like 4-5gb of VRAM at most, even for the 2k model. 20gb of RAM tho.
>>
>>100374815
later
>>
>>100374865
much of which was for T5 where you can apparently actually use a less precise version, right?
>>
File: PASigma_03016_.png (1.7 MB, 1344x768)
>>100374764
Thanks I'll keep it in mind when I'm feeling more technically inclined :3

>>100374837
Less than that. Load the T5 encoder on CPU, that's probably what's maxing out for you and it's fast enough like that. Otherwise close your porn tabs.
>>
>>100371229
Good taste
>>
>>100374902
I found this schizo workflow that takes a pixart image and uses unsamplers etc. to refine it using SDXL.
From the few times I've used it, it works really well.
https://files.catbox.moe/gfpxpc.json
>>
File: PASigma_03050_.png (1.61 MB, 1344x768)
Save pupper

>>100375012
TYVM I'll try it out later
>>
File: PASigma_03060_.png (1.74 MB, 1344x768)
It worked!
>>
>>100375156
The schizo workflow?
>>
>>100375167
joan saving pupper, I'll have to schizo it up when my schizo energy is higher
>>
File: 00023.jpg (306 KB, 1776x2496)
>>
>>100375156
Bruh, Your shit's all retarded.
>>
>>100374858
based schizo
>>
File: ComfyUI_PixArt_00008_.jpg (2.25 MB, 2048x2048)
>>100374902
okay, it's working now... for whatever reason after it initially failed, python was stuck in the background with 20gb of committed memory and wouldn't release it, so I had to restart my pc
>>
File: PASigma_03062_.png (1.19 MB, 1344x768)
>>100375191
oh look, a child skipping school and getting mad at watercolor style
>>
File: PASigma_03066_.png (1.28 MB, 1344x768)
>>100375233
nisu! that's a pretty watercolor
>>
to the baker of the next thread: remember to NOT, i repeat, NOT include the pastebin
>>
File: ComfyUI_PixArt_00009_.jpg (2.41 MB, 2048x2048)
>>100375251
sank yew, pretty impressed with the initial results.. wondering if you can concat t5 prompts now..probably not, but will try
>>
File: 00000-610461859.png (1.79 MB, 1328x992)
>>
>>100375263
By mentioning the pastebin you are drawing the attention of those who otherwise knew nothing about it.
What is the pastebin?
>>
>>100375299
pure autism and it doesnt have an effect to post it anyway
>>
>>100375299
Newfag or just playing dumb
>>
>>100375322
>>100375317
I've been here almost as long as these generals have been a thing, I just never read the OP because only autistic hall monitor faggy boys do that.
>>
>>100375317
>it doesnt have an effect to post it anyway
wrong
>>
>>100375337
>I've been here almost as long as these generals have been a thing
Then it's impossible for you to have missed the pastebin
>>
File: ComfyUI_PixArt_00010_.jpg (2.59 MB, 2048x2048)
seems like concat conditioning does work. Wish I had more time to mess around with these workflows this morning
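For anyone wondering what concat is doing conceptually: as far as I understand it, the two encoded prompts are joined along the token axis, so the model sees one longer sequence. The toy below only illustrates that idea with plain lists. It is not ComfyUI's actual code, and the tiny vectors are made up.

```python
from typing import List

Vector = List[float]


def concat_conditioning(first: List[Vector], second: List[Vector]) -> List[Vector]:
    """Join two encoded prompts along the token axis (illustration only)."""
    widths = {len(vec) for vec in first + second}
    if len(widths) > 1:
        raise ValueError("all token embeddings must have the same width")
    return first + second


# Toy embeddings: 2 tokens and 1 token, each of width 2 (made-up values).
prompt_a = [[0.1, 0.2], [0.3, 0.4]]
prompt_b = [[0.5, 0.6]]

combined = concat_conditioning(prompt_a, prompt_b)
print(len(combined), "tokens after concat")
```

Whether the model makes good use of the longer sequence is a separate question, which is why it is worth testing with actual gens.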
>>
>>100375356
that gorilla got a nice pair of tits
10/10
>>
>>100375235
That's a cope, ngl, looked through your past ones, and some of them aren't too bad. But if you aren't willing to admit you could do better with some of them, you're ngmi.
>>
>>100375235
Also, this is a good water color example you namefag.

>>100375269
>>100375356
>>
>>100375356
What are you using concat conditioning for here?
>>
Why does civitai have an SD3 tag in filters? Did it come out or something?
>>
>>100375479
some people requested it so they can tag their blurred out api 1girls (god bless SAI for making it safe)
>>
>>100375479
>Did it come out or something?
Yeah but it's called sigma now. Check it out under that tag.
>>
>>100375479
Been there for weeks anon.
>>
>>100375479
>Did it come out or something?
no, they're just getting ready for when it does. At the moment only select few people have local access to it. No they haven't leaked it.
>>
File: s.jpg (208 KB, 2048x2048)
>>100375479
I think the civitai dev simply sometimes adds stuff ahead of time.
>>
File: PASigma_00821_.png (2 MB, 1344x768)
>>100375376
Sorry you're not entitled to more than the roughly 10 seconds I spend writing and generating each batch of pics I pick from. I spend actually zero seconds inpainting or refining these. Pixart is just that good :3
>>
File: sigxl_0002.jpg (262 KB, 1792x2304)
>>
File: ComfyUI_PixArt_00018_.jpg (2.1 MB, 2048x2048)
>>100375470
that gen wasn't actually using any concats. I did some others with different hair colors to confirm if it even worked or would error out, but didn't like the images enough to post
>>
>>100375527
>I spend actually zero seconds inpainting or refining these.
That's why you're ngmi.
>>
File: 00007.jpg (437 KB, 1536x2176)
>>
File: ComfyUI_temp_kvoce_00118_.png (1.21 MB, 1024x1024)
>>100375527
Low effort, high quality
>>
Next
>>100375528
>>100375528
>>100375528
Thread
>>
>>100372595
>>100372600
utter nonsense: check
samefag: check

sad!
>>
>>100373076
>use thing
>hate it
>"NUH UH YOU LIKE IT"
92 iq at best


