/g/ - Technology






File: tmp.jpg (1.5 MB, 3264x3264)
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>101532879

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://www.modelscope.cn/home
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Kolors
https://gokaygokay-kolors.hf.space
Nodes: https://github.com/kijai/ComfyUI-KwaiKolorsWrapper

>AuraFlow
https://fal.ai/models/fal-ai/aura-flow
https://huggingface.co/fal/AuraFlows

>Index of guides and other tools
https://rentry.org/ldg-link
https://rentry.org/rentrysd

>View and submit GPU performance data
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>
my saas senses are tingling
>>
baker has little taste
>>
official pixart bigma and lumina 2 waiting room
>>
File: Sigma_09302_.png (1.68 MB, 1024x1024)
No thread blessings again.. this is a bad omen
>>
File: Sigma_09308_.png (1.34 MB, 1024x1024)
>>
File: Sigma_09323_.png (1.62 MB, 1024x1024)
>>
File: Sigma_09328_.png (2.03 MB, 1024x1024)
>>
praying for the health and safety of bigma
>>
File: Sigma_09341_.png (1.95 MB, 1024x1024)
>>
>>101559437
do we have any news about what's coming next for pixart, hunyuan, auraflow, SD3?
>>
File: Sigma_09343_.png (1.29 MB, 1024x1024)
>>101559450
Hunyuan released a few updates but you still need to know Chinese
>>
>>101559450
>pixart
no
>hunyuan
holding a finetuning contest for their newest model
>>
File: Sigma_09351_.png (1.41 MB, 1024x1024)
>>
>>101559450
>do we have any news
none at all unfortunately, although hunyuan shill mentioned a few threads back that they plan on releasing a model that specializes in anime or something, a pony level base model he said. i did not look into it unfortunately.
>>
File: Sigma_09356_.png (1.62 MB, 1024x1024)
>>
File: Sigma_09362_.png (1.81 MB, 1024x1024)
>>
File: Sigma_09366_.png (1.7 MB, 1024x1024)
>>
File: Sigma_09376_.png (1.99 MB, 1024x1024)
>>
>>101559450
>auraflow, SD3
Kek
>>
File: Sigma_09379_.png (1.51 MB, 1024x1024)
>>
File: Sigma_09385_.png (992 KB, 1024x1024)
>>
File: Sigma_09394_.png (2.3 MB, 1024x1024)
>>
File: Sigma_09401_.png (1.73 MB, 1024x1024)
>>
>>101559665
this is cool
>>
File: Sigma_09402_.png (1.56 MB, 1024x1024)
>>
File: Sigma_09423_.png (1.34 MB, 1024x1024)
>>101559674
Would stay at that hotel
>>
File: Sigma_09432_.png (1.43 MB, 1024x1024)
>>
File: Sigma_09443_.png (1.2 MB, 1024x1024)
>>
File: Sigma_09448_.png (1.5 MB, 1024x1024)
>>
File: Sigma_09450_.png (1.82 MB, 1024x1024)
>>
File: Sigma_09465_.png (1.57 MB, 1024x1024)
>>
File: Sigma_09472_.png (2.03 MB, 1024x1024)
>>
File: Sigma_09484_.png (1.68 MB, 1024x1024)
Wait a second.. baker forgot reference to >>>/g/sdg again.. I filled the last thread so we could be healed from this mistake
>>
>>101559846
explains the lack of thread blessings. blesser anon is still out there, lost, wondering where /ldg/ went.
>>
File: Sigma_09510_.png (1.28 MB, 1024x1024)
>>101558992

>Related boards
>>>/g/sdg
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>
File: Sigma_09523_.png (2.04 MB, 1024x1024)
Well I'm out for a bit
>>
>>101559888
Why wouldn't he be searching the catalog for "ldg" like everyone else here is?
>>
>>101560033
people can't think straight when they are afraid and alone
>>
>>101560033
habits are hard to break
>>
newfags dont know about ldg. they only search sdg.
>>
the only reason to change out the rentry link is to make it so ldg doesnt show when you search sdg
curious isnt it
>>
File: Sigma_09527_.png (1.97 MB, 1024x1024)
>>101560215
>https://rentry.org/ldg-link
redirects to
>https://rentry.org/sdg-link

Very curious.. what's the point other than obfuscation?
>>
File: Sigma_09529_.png (1.42 MB, 1024x1024)
Bump goes the typewriter
>>
File: Sigma_09540_.png (1.91 MB, 1024x1024)
>>
>>101560272
was thred blesser killed?
>>
File: Sigma_09525_.png (1.88 MB, 1024x1024)
>>101560371
An hero for our sins
>>
File: Sigma_09520_.png (1.58 MB, 1024x1024)
Good night anon
>>
>>101560426
gn
>>
File: 0.jpg (108 KB, 1024x512)
>>
>>101560272
>what's the point other than obfuscation?
Kill the bred
>>
artflow.ai is pretty good - upload five or more images of a subject and it creates a passable model that you can pose by prompting into any setting but it's stupid expensive. New model? 9 dollars please. More images? More shekels.

I expect I can do better with controlnet or something - is there a standard workflow for this kind of thing?
>>
>>101560793
No but there's new stuff all the time. https://huggingface.co/spaces/TencentARC/PhotoMaker-V2
>>
File: ComfyUI_temp_cixtt_00216_.jpg (1.69 MB, 2048x2048)
hidiffusion is an upgraded kohya deep shrink, recommend you try it
https://github.com/blepping/comfyui_jankhidiffusion
pic related raw gen at 2048x2048, 30 steps, took 38 seconds on a 3080ti mobile
>>
>>101560949
with these hidiff-specific settings, and full pic catbox
the apply mswmsa node has some severe limitations (bugs?), and mucking with the cross-attention blocks can either improve perspective/proportions/framing or fry your shit up even with very small values
https://files.catbox.moe/hfm9a0.png
>>
>>101560949
>>101560988
>2048x2048, 30 steps, took 38 seconds on a 3080ti mobile
>catbox
Impressive.
>Ancestral samplers seem to work a lot better than the non-ancestral ones when using RAUNet and ControlNet simultaneously. I recommend using the ancestral version if possible.
Have you tried many sampler/scheduler combos?
>>
https://stability.ai/news/stable-video-4d
https://huggingface.co/stabilityai/sv4d
lets you turn video into 3d video or something with different angles
>>
>>101561789
oh i think it lets you make 3d models
>>
File: 00_sig14.jpg (339 KB, 1336x1336)
>>101560426
Great stuff. Gn.
>>101559808
Love this
>>
>>101561789
>Users start by uploading a single video and specifying their desired 3D camera poses
>start by uploading
go back to your containment thread
>>
>>101560272
Remove it from the op
>>
Now that the dust has settled, which has more promising architecture, pixart, hunyuan, auraflow or SD3? Which one has better comprehension and adherence to prompt? Which one has better colors?
>>
>>101563216
I like the pixart and hunyuan look the most. All of them have decent colors, but these can be tweaked with other programs.
>>
>>101563266
which other programs?
>>
>>101563304
Photoshop, GIMP, ffmpeg etc
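
A minimal sketch of the kind of after-the-fact color tweak the tools above are used for, written in plain Python instead of any of those programs. The helper names `adjust_pixel` and `adjust_image` and the nested-list image layout are assumptions for illustration only; a real workflow would load and save pixels with an image library.

```python
import colorsys


def adjust_pixel(rgb, saturation=1.0, value=1.0):
    """Scale saturation and brightness of one 8-bit RGB pixel in HSV space."""
    r, g, b = (c / 255.0 for c in rgb)
    h, s, v = colorsys.rgb_to_hsv(r, g, b)
    # Clamp so the scaled channels stay inside the valid [0, 1] range.
    s = min(1.0, s * saturation)
    v = min(1.0, v * value)
    r2, g2, b2 = colorsys.hsv_to_rgb(h, s, v)
    return tuple(round(c * 255) for c in (r2, g2, b2))


def adjust_image(pixels, saturation=1.0, value=1.0):
    """Apply adjust_pixel to every pixel of a row-major list of rows."""
    return [[adjust_pixel(p, saturation, value) for p in row] for row in pixels]
```

Calling `adjust_image(pixels, saturation=1.2)` would nudge colors up; a factor below 1.0 mutes them. Hue is left untouched.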
>>
>>101563427
any tips for correcting a high cfg whitewash? Besides lowering my cfg?
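
For context on the question above: classifier-free guidance extrapolates from the unconditional prediction toward the conditional one, so a high scale inflates the spread of the prediction, which is one commonly cited cause of blown-out results. One published mitigation rescales the guided prediction back to the conditional prediction's standard deviation and blends it with the unrescaled one. The sketch below assumes that is what "whitewash" refers to, works on toy lists instead of real latents, and uses hypothetical names (`cfg`, `cfg_rescaled`, `phi`); whether a given UI exposes this is not covered here.

```python
from statistics import pstdev


def cfg(uncond, cond, scale):
    """Plain classifier-free guidance: uncond + scale * (cond - uncond)."""
    return [u + scale * (c - u) for u, c in zip(uncond, cond)]


def cfg_rescaled(uncond, cond, scale, phi=0.7):
    """Guidance with std rescaling; phi blends rescaled and plain outputs."""
    guided = cfg(uncond, cond, scale)
    std_cond = pstdev(cond)
    std_guided = pstdev(guided)
    if std_guided == 0:
        # Nothing to rescale for a constant prediction.
        return guided
    factor = std_cond / std_guided
    # phi = 0 gives plain CFG, phi = 1 gives the fully rescaled prediction.
    return [phi * (g * factor) + (1 - phi) * g for g in guided]
```

With `phi=0` this reduces to ordinary guidance, so the blend weight can be tuned independently of the guidance scale.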


