/g/ - Technology



File: collage.jpg (2.26 MB, 2878x3566)
Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107594109

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://comfyanonymous.github.io/ComfyUI_examples/z_image/

>WanX
https://github.com/Wan-Video/Wan2.2
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2485296
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
Blessed thread of frenship
>>
what the fuck.
go away
fucking troll
>>
>>107597478
tanks 4 bake
>>
Opinions on https://github.com/zai-org/SCAIL ?
>>
>>107597525
>zai
>not affiliated with z image or alibaba
hm...
>>
>>107597525
Some of those community works look almost too good. I wonder if it handles nudity and stills
>>
>>107597525
Do they ever even look like what was promised?
>>
Base status?
>>
>>107597525
>v2v
I sleep
>>
Which is the real thread
>>
>>107597620
Trust the plan, 2 weeks
>>
uhh not sure which thread i should speak in
>>
>>107597478
>flamewar links in OP
This is not okay
>>
>>107597620
>Base status?
they'll probably release everything on christmas day
>>
>>107597669
What if Chinese Christmas is like Chinese new year. Do we have a knower
>>
File: 1748927874573129.png (3.71 MB, 1152x2312)
>>
Does the z controlnet work or is it still bugged?
>>
thank fuck the real thread is here
>>
>>107598120
Z is a nice model but I don't like the frame it almost forces on paintings
>>
>>107598167
every model has weird ass limitations and quirks. I'm rotating between SPARK chroma, Z, and qwen right now because of this. none of them are a clear winner/superior. in this case, chroma favors framing for this prompt too. idk why
>>
>>107598167
never mind, the problem was that I included "on paper" in my style prompt
>>
cozy bread
>>
>>107598211
fug
>>
>>107598184
Have to try z to chroma i2i workflow sometime. Getting good composition saves so much time
>>
>>107598237
depends on what you consider good composition... chroma has much more varied and creative compositions.
>>
>>107598237
>z to chroma
that sounds ass backwards
>>
>>107598256
Yeah true, but z will give me the correct number of limbs almost every time. Good start
>>
File: kek.jpg (63 KB, 910x586)
https://www.reddit.com/r/StableDiffusion/comments/1ppa8x9/zimageedit_news
Congrats retards, you made Tongyi cry :(
>>
>>107598284
Conflicted because SegFault is an OG chad from the Pixart days but that pic is very homosexual.
>>
>>107598284
Tongyi be like:
https://youtu.be/bYzKJ91KBzE?t=22
>>
>>107598284
they don't answer, that's the problem
>>
>>107598322
>where is base???
>it's coming.
>5 minutes later
>where is base???
>...
>wtf they don't answer!!!
>>
>>107598349
>>5 minutes later
*2 weeks later (after they said that base would be released "before the weekend" btw)
>>
File: 1754521073194063.png (3.57 MB, 1152x2312)
>>
>>107598273
SPARK improved a lot on that. though this gen isn't a great example lol
>>
>>107598349
>between now and the end of time
well I could have told you that
>>
File: orz.png (1.03 MB, 1280x720)
impossible to get an actual low angle orz
>>
so the new ani shitter strategy is to bump his thread with nonsense advice

>>107598428
>>
File: 1748505506798805.mp4 (3.86 MB, 2048x1152)
https://xcancel.com/aisearchio/status/2001365588980175153#m
sovl vs sovless
>>
>>107598443
whocars just post bifusion here
>>
>>107598456
but those are both sovless
>>
File: 2MW.png (150 KB, 897x773)
SOON™
>>
soon
>>
File: OTL.png (1.13 MB, 832x1248)
>>
File: 1744231120338829.png (362 KB, 576x448)
>>
>>107598488
her name is soon yon
>>
>>107598530
y so smol
>>
>>107598575
if her name aint soon base i dont care
>>
>>107598530
>>107598584
>>>/g/adt
>>
>>107598580
my gpu can only run sd1.5 at low resolution
>>
>>107598584
>>107598595
No, it's fine here. Good gen too.
>>
File deleted.
>>107598595
fuck off
>>
File: 1765814453579586.png (1.13 MB, 1216x832)
https://rentry.org/ranfaggot
>>
>>107598608
It's the schizo. See in the 70 posts that were IP nuked in that thread, it included redirects to /adt/ to shit up the thread. https://desuarchive.org/g/search/tnum/107570316/deleted/deleted/page/2/
>>
File: 1763581455933377.jpg (701 KB, 2000x1336)
>>
>>107598599
i applaud your dedication anon
have you tried ultimate sd upscale? it divides the image up into smaller squares so you can gen at a larger res without needing the extra vram
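a rough sketch of the tiling idea in python, in case it helps. this is not the actual upscaler code: the function name, tile size and overlap are made up for illustration. each box is small enough to process on its own, and the overlap gives you something to blend across the seams

```python
def tile_boxes(width, height, tile=512, overlap=64):
    """Return (left, top, right, bottom) boxes that cover a width x height image.

    Hypothetical helper: tile and overlap are illustrative defaults,
    not values taken from any particular upscaler.
    """
    if overlap >= tile:
        raise ValueError("overlap must be smaller than tile")
    stride = tile - overlap  # how far each new tile is shifted
    boxes = []
    top = 0
    while True:
        bottom = min(top + tile, height)  # clamp the last row to the image edge
        left = 0
        while True:
            right = min(left + tile, width)  # clamp the last column too
            boxes.append((left, top, right, bottom))
            if right == width:
                break
            left += stride
        if bottom == height:
            break
        top += stride
    return boxes


if __name__ == "__main__":
    boxes = tile_boxes(2048, 1152)
    print(len(boxes), "tiles, largest side", max(r - l for l, t, r, b in boxes))
```

peak memory then depends on the tile size instead of the full output resolution, which is why it can help on a small card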
>>
>>107598616
>https://desuarchive.org/g/search/tnum/107570316/deleted/deleted/page/2/
damn, that's some serious mental illness
>>
File: 1735520266883826.png (3.42 MB, 1920x1088)
>>107598617
>>107598366
these are sick. box/prompt?

>>107598599
based. sd15 was peak, it's been downhill since.
>>
>>107598615
get out schizo!
>>
>>107598617
>>107598629
>>107598648
guys I need you to tell the schizo to tell off too
>>
>>107598615
prompt?
>>
File: 1762218061397260.png (726 KB, 832x1248)
>>107598652
>>
File: z_mod_00143_.jpg (1.09 MB, 1344x1728)
>>107598648
Those floating swords are cool
>>
File: 1743064119841539.png (3.88 MB, 1336x2008)
>>107598648
nice gen yourself
lora: https://files.catbox.moe/jelpf6.safetensors
wf: https://files.catbox.moe/epo278.png
the custom nodes are for resolution stuff so you can remove them
>>107598676
i should prune that dataset and retrain. its got a lot of nonsensical images
>>107598686
cool style anon
>>
>>107598666
nta but it's in the rentry
>>
>>107598749
Thanks, buddy.
>>
>>107598749
yes it unlearned the shape of things. i have briefly tried training zit on a strong style and it seems very prone to unlearning the shape of things before successfully training styles. maybe the datasets need to be higher quality than sdxl, maybe it's a setting issue. it's hard to find the right lora training inputs when each iteration takes so long to complete.
>>
File: disco_i2i.png (2.7 MB, 1664x1216)
I tried anon's Disco Elysium lora with a simple 0.5 denoise i2i on an alt portrait of Octavia I made a while back. The art style is awesome, but it's also an uglifier (unsurprisingly). I just want painterly beautiful women for my CRPG portraits!
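For anyone unsure what 0.5 denoise means in practice, here is a minimal sketch of one common convention, where the denoise strength decides how much of the step schedule is skipped. Different UIs implement this differently, so treat the function name and the exact rounding as assumptions rather than what any particular sampler does.

```python
def i2i_step_range(total_steps, denoise):
    """Return (start_step, steps_run) for an img2img pass.

    Assumed convention: the input image is noised to the level of
    start_step, then only the remaining steps are sampled.
    """
    if not 0.0 <= denoise <= 1.0:
        raise ValueError("denoise must be between 0 and 1")
    steps_run = min(int(total_steps * denoise), total_steps)
    start_step = total_steps - steps_run
    return start_step, steps_run


if __name__ == "__main__":
    start, run = i2i_step_range(20, 0.5)
    print(f"skip the first {start} steps, sample the last {run}")
```

Under that assumption, a 0.5 denoise on 20 steps keeps the composition fixed by the skipped half and only lets the model restyle during the second half.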
>>
File: 1765108483657514.png (2.51 MB, 1920x1088)
>>
File: 1750091942698036.png (3.09 MB, 1336x2008)
>>107598765
if your dataset has poor quality images or subpar captions, itll look bad. for example that dataset had images like https://files.catbox.moe/evwsef.jpeg but when the dataset is clean itll look really nice with little to no bad fingers or nonsensical text
but i also didnt use a regularization set
for settings, i just use the default onetrainer config
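if you want a quick sanity check before training, something like this lists images that have no caption. its only a sketch: it assumes captions are sidecar .txt files with the same name as the image, and the function name and extension list are made up, so adjust for however your trainer expects the dataset

```python
from pathlib import Path

# Assumed set of image extensions; extend as needed.
IMAGE_EXTS = {".png", ".jpg", ".jpeg", ".webp"}


def missing_captions(dataset_dir):
    """Return image paths that have no non-empty sidecar .txt caption."""
    missing = []
    for path in sorted(Path(dataset_dir).iterdir()):
        if path.suffix.lower() not in IMAGE_EXTS:
            continue  # skip captions and anything that is not an image
        caption = path.with_suffix(".txt")
        if not caption.is_file() or not caption.read_text(encoding="utf-8").strip():
            missing.append(path)
    return missing


if __name__ == "__main__":
    for image in missing_captions("."):
        print("no caption:", image.name)
```

it wont tell you whether a caption is any good, only whether one exists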
>>
>>107598749
i know no one wants to hear this, but with illustrious as a base i always got pretty great results with no captioning haha.
>>
>>107598780
>simple 0.5 denoise i2i
Would probably do what you want more if you used a cnet. Not sure if it's "still bugged" though >>107598064
>>
File: 1747545091884906.jpg (815 KB, 2000x1336)
>>107598835
>i always got pretty great results with no captioning
youre right. but still, having captions is better than not imo


