/g/ - Technology

File: 1720812309194749.jpg (1.43 MB, 3264x3264)
General dedicated to creative use of free and open source text-to-image models

Previous /ldg/ bread : >>101405779

Fresh Edition

>Beginner UI
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
ComfyUI: https://github.com/comfyanonymous/ComfyUI

>Auto1111 forks
SD.Next: https://github.com/vladmandic/automatic
Anapnoe UX: https://github.com/anapnoe/stable-diffusion-webui-ux

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Kolors
https://gokaygokay-kolors.hf.space
Nodes: https://github.com/kijai/ComfyUI-KwaiKolorsWrapper

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>View and submit GPU performance data
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Share image prompt info
https://rentry.org/hdgcb
https://catbox.moe

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg
>>
blessed thread of frenship
>>
>>101432234
i think since he still wants to do subscription based it wouldnt work out
>>
>>101433885
I don't think NSFW will ever work with a traditional monetization strategy because all the payment processors are anti basically everything Pony does. AI stuff will need to be pseudo voluntary or run on crypto.
>>
File: Sigma_04676_.png (1.07 MB, 1024x1024)
>>101433582
Thank you baker!
>>
buh
>>
File: PixArt-Sigma_00010_.png (1.83 MB, 944x1408)
>>
Two questions:
>1
Is Diffusion on CPU memory-bandwidth bound in the same sense as LLMs? Or compute-bound? I'm trying to work out whether OpenCL would grant significant speed-boosts on devices without a dedicated GPU (with VRAM).
>2
Does any kind of pass-through need to be specified to allow OpenCL to leverage GPU from within a docker container?
>>
File: anon.png (581 KB, 737x754)
>I wrote a userscript for the Kolors generator on huggingface, it converts the .webp output to .png, adds a filename hint for downloaders (subdomain-seed-timestamp_num.png) so it's not just "image.webp" all the time, and injects a json metadata packet of the prompt/seed/sliders/etc into the png comment field of the new images. Would anyone else be interested in something like this?
Are you still here?
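
For anyone curious how the "metadata in the png comment field" part can work, here is a minimal sketch in Python rather than a userscript. It is not the anon's script. It only shows one way to put a JSON string into a PNG `tEXt` chunk using the standard library, and it skips the .webp to .png conversion entirely. The helper names (`make_chunk`, `add_text_chunk`, `read_text_chunks`, `tiny_png`) and the example prompt/seed values are made up for illustration.

```python
import json
import struct
import zlib

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def make_chunk(chunk_type: bytes, data: bytes) -> bytes:
    """Build one PNG chunk: 4-byte length, type, data, CRC32 over type+data."""
    crc = zlib.crc32(chunk_type + data) & 0xFFFFFFFF
    return struct.pack(">I", len(data)) + chunk_type + data + struct.pack(">I", crc)


def add_text_chunk(png: bytes, keyword: str, text: str) -> bytes:
    """Insert a tEXt chunk right after IHDR (tEXt payloads are Latin-1)."""
    if not png.startswith(PNG_SIGNATURE):
        raise ValueError("not a PNG file")
    # IHDR is always the first chunk: signature(8) + length(4) + type(4) + data(13) + CRC(4)
    ihdr_end = 8 + 4 + 4 + 13 + 4
    payload = keyword.encode("latin-1") + b"\x00" + text.encode("latin-1")
    return png[:ihdr_end] + make_chunk(b"tEXt", payload) + png[ihdr_end:]


def read_text_chunks(png: bytes) -> dict:
    """Walk the chunk list and collect every tEXt keyword/value pair."""
    found = {}
    pos = len(PNG_SIGNATURE)
    while pos < len(png):
        (length,) = struct.unpack(">I", png[pos:pos + 4])
        chunk_type = png[pos + 4:pos + 8]
        data = png[pos + 8:pos + 8 + length]
        if chunk_type == b"tEXt":
            key, _, value = data.partition(b"\x00")
            found[key.decode("latin-1")] = value.decode("latin-1")
        pos += 12 + length  # length + type + data + CRC
    return found


def tiny_png() -> bytes:
    """A 1x1 grayscale PNG, only used here so the example is self-contained."""
    ihdr = struct.pack(">IIBBBBB", 1, 1, 8, 0, 0, 0, 0)
    idat = zlib.compress(b"\x00\x00")  # filter byte + one black pixel
    return (PNG_SIGNATURE + make_chunk(b"IHDR", ihdr)
            + make_chunk(b"IDAT", idat) + make_chunk(b"IEND", b""))


if __name__ == "__main__":
    # Hypothetical generation settings; json.dumps escapes non-ASCII by default,
    # so the result is safe to store as Latin-1.
    meta = json.dumps({"prompt": "a cat", "seed": 123})
    tagged = add_text_chunk(tiny_png(), "Comment", meta)
    print(read_text_chunks(tagged))
```

`Comment` is one of the predefined `tEXt` keywords in the PNG spec, so most metadata viewers should show it. Whether a given UI reads prompts back from it is a separate question.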
>>
File: 0.jpg (303 KB, 832x1216)
>>
>>101433885
He has a lot of choices now
>pixart
>hunyuan
>Kolors
fuck SAI, I'm glad there's some serious rivals to those cucks now
>>
File: Pixart_00030_.png (1.77 MB, 1024x1024)
>>
File: 0.jpg (311 KB, 832x1216)
>>
File: Pixart_00032_.png (1.67 MB, 1024x1024)
>>101437145
>>
>>101436717
He could put the new model behind voluntary $5+ paywall. I would pay it.

>>101435117
Psycho-pass police
>>
File: ComfyUI_temp_cpuhn_00172_.jpg (1.62 MB, 1792x2304)
>>
File: ComfyUI_temp_cpuhn_00177_.jpg (1.35 MB, 2432x1664)
>>
what killed the threads today
>>
File: 00010-1157975464.jpg (779 KB, 1613x2419)
>>
>>101437682
Looks like avatar spammers
>>
File: ComfyUI_temp_cpuhn_00187_.jpg (1.02 MB, 2432x1664)
>>
File: ComfyUI_temp_cpuhn_00193_.png (2.5 MB, 1536x1536)
>>
File: ComfyUI_temp_cpuhn_00196_.png (2.86 MB, 1344x1728)
>>
File: ComfyUI_temp_cpuhn_00198_.png (2.75 MB, 1344x1728)
>>
>>101435417
sars pls respond
>>
File: PixArt-Sigma_00149_.png (2.07 MB, 944x1408)
>>
>Huber Schedule: SNR
>Huber Param: 0,1
Is there any need to tweak these settings?
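
For context on what the parameter controls, here is a rough sketch of a pseudo-Huber loss next to plain L2. This is the textbook formula, not the trainer's actual code, and it assumes "0,1" means delta = 0.1. The SNR schedule presumably changes delta depending on the noise level of each timestep, which is not shown here. The function names are made up for illustration.

```python
import math


def pseudo_huber(residual: float, delta: float = 0.1) -> float:
    """Textbook pseudo-Huber: ~0.5*r^2 for |r| << delta, ~delta*|r| for |r| >> delta."""
    return delta ** 2 * (math.sqrt(1.0 + (residual / delta) ** 2) - 1.0)


def l2(residual: float) -> float:
    """Plain squared-error loss with the same 0.5 scaling, for comparison."""
    return 0.5 * residual ** 2


if __name__ == "__main__":
    # Small errors are treated almost like L2, large errors grow only linearly,
    # so outlier pixels pull on the gradient much less.
    for r in (0.001, 0.1, 1.0, 10.0):
        print(f"r={r:<6} l2={l2(r):.6g}  pseudo_huber={pseudo_huber(r):.6g}")
```

A smaller delta makes the loss switch to the linear regime earlier, so it is more robust to outliers but less like ordinary MSE training.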
>>
Is there any chance to get decent results with a rtx 2070?
I tried some llama V3 model yesterday and at most I could get one-liner replies. Escaping Claude's grasp isn't that easy, it seems. Didn't tinker with the settings in oobabooga yet, however.
>>
>other threads archived already
this general lost, pack it up
>>
>>101438826
for tagging? theres moondream2 and florence2
>>
File: 1705474142237652.png (55 KB, 267x235)
any good linux programs for captioning training images? i've noticed that i'm getting better results when training without captions at all, compared to the auto caption function on kohya. but i believe there's some middle ground, like only prompting the hairstyle, makeup or clothes, that must be done manually, and doing that by creating a bunch of .txt files with a text editor is too much work.
>>
>>101439832
this should work on Linux https://github.com/jhc13/taggui/

Using tags means the training takes more time, so you want to add a few epochs. You could also use the caption tag dropout setting to achieve a middle ground (this works nicely with style loras).
>>
>>101439869
thanks that looks really good. i only train character loras. does tag dropout randomly select tags to train? i was thinking of only tagging things that stick out like weird facial expressions or haircuts. maybe clothing. not 20+ tags that the auto tagging gives me.
>>
>>101439916
>does tag dropout randomly select tags to train
you can set it up different ways. I usually use drop every 2 epochs, which means it drops all the captions every second epoch. I'm not sure if you want to drop captions for character loras, but it's worth testing. I run taggui with wd-swinv2-tagger-v3 and 0.25 probability, it's decent even for realistic photography. It's worth tagging those weird expressions manually if you want to replicate them.
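
The two dropout modes mentioned above could be sketched roughly like this. It is an illustration of the idea only, not the trainer's real implementation: the function name, the parameter names, and the 1-based epoch counting are all assumptions.

```python
import random


def caption_for_epoch(tags, epoch, drop_every_n_epochs=0, tag_dropout_rate=0.0, rng=None):
    """Return the caption string a sample would be trained with in this epoch.

    tags                -- list of tag strings for one image
    epoch               -- current epoch, counted from 1 (assumption)
    drop_every_n_epochs -- if > 0, the whole caption is dropped on every n-th epoch
    tag_dropout_rate    -- chance of dropping each individual tag otherwise
    """
    rng = rng or random.Random()
    # Mode 1: drop the entire caption on every n-th epoch.
    if drop_every_n_epochs > 0 and epoch % drop_every_n_epochs == 0:
        return ""
    # Mode 2: drop each tag independently with the given probability.
    kept = [tag for tag in tags if rng.random() >= tag_dropout_rate]
    return ", ".join(kept)


if __name__ == "__main__":
    tags = ["1girl", "smirk", "short hair", "red dress"]
    rng = random.Random(0)  # seeded so reruns give the same picks
    for epoch in range(1, 5):
        caption = caption_for_epoch(tags, epoch, drop_every_n_epochs=2,
                                    tag_dropout_rate=0.3, rng=rng)
        print(epoch, repr(caption))
```

With `drop_every_n_epochs=2` the even epochs train captionless, which is the middle ground between fully tagged and no captions at all.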
>>
File: 1719917825410077.png (101 KB, 784x586)
>>101439967
>It's worth tagging those weird expressions manually if you want to replicate them.
yeah that's what i'm thinking. adding "smiling" would too often just generate a generic smile on top of their faces instead of their actual smile. thanks for the help
>>
>>101439986
np dude. South Park characters?
>>
>>101439997
nah. doing tyra banks for pony right now
>>
File deleted.
>>
File deleted.
>>
File: file.png (1.48 MB, 832x1123)
>>
File: 0.jpg (286 KB, 832x1216)
>>
File: 0.jpg (120 KB, 1024x512)
>>
IM IN THE OP COLLAGE TWICE AGAIN WOO
>>
File: R8_SD3_00001_.jpg (376 KB, 1024x1024)
https://replicate.com/p/nq6z3mghs1rm40cgqa08qcbdvg
>inequality dating moy ppc indianclaudieyed tim astrbuddipc peas watts search continent trailblazer
Wtf SD3M?
>>
File deleted.
Hello anons!
>>
>>101441895
maybe "peas" is in the prompt because there's green in the image, heh
>>
File deleted.
>>101441988
jannies, no ban pls
>>
File deleted.
>>101442075
>>
File deleted.
>>101442327
>>
>>101442075
>>101442327
>>101442415
why don't you just put a link instead, you can put NSFW in there and it seems accepted by the jannies
>>
>>101442075
Nice
>>101442451
It's censored tho
>>
>>101442460
dunno, some people put uncensored pictures on links and they never got deleted so I thought it was an accepted process
>>
>kolors finetuned on midjourney images
Grim.
>>
>>101442531
how do you know that?
>>
>>101442460
>It's censored tho
what that other anon means is to catbox.moe the uncensored then you won't have to worry about being banned.
>>
File: grid-0520.jpg (242 KB, 1200x1600)
>>
wowie
>>
why were those images deleted?
>>
>>101443865
>>101441988
>>101442075
>>101442327
>>101442415
>>
fuck jannies
>>
>>101443887
they were censored thooooo wtf
>>
>>101443898
and two of them didnt even have nudity lmao
>>
seems like the janny is on a powertrip or pure jealousy... if youre reading this janny, fuck you
>>
>>101443909
>>101443898
Probably got mass reported by the usual suspects
>>
>>101444040
fuckkkk


