[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


Merry Christmas Edition

Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107662604

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2485296
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
Blessed thread of frenship
>>
File: deBS_zi_00015_.png (3.03 MB, 1792x1152)
3.03 MB
3.03 MB PNG
>mfw
>>
>>107668208
Go back
>>
Any word on when Flux.2 Klein is coming? Dev is bringing my computer to its knees crying.
>>
>lain diffusion general
MERRY CHRISTMAS /LDG/
>>
>>107668223
they're trying to make it safer anon, be patient
>>
>>107668224
go to sleep Ani no one likes your shitty UI
>>
>>107668234
Go back to discord
>>
Enjoying your base model?
>>
>>107668307
not "ran". ran doesn't exist. take your meds, you are truly the most mentally ill person i've ever seen
>>
File: z_mod_00026_.jpg (813 KB, 2064x1408)
813 KB
813 KB JPG
>>
File: 1752600593415338.png (483 KB, 640x640)
483 KB
483 KB PNG
https://docs.comfy.org/tutorials/image/qwen/qwen-image-edit-2511

new workflow/etc.
>>
>>107668299
Send me the invite link
>>
>>107668324
What's better with the new edit iteration?
>>
>>107668331
everything. it's basically nano banana pro at home
>>
>>107668331
it's like 2509 but more refined, should respond to even more specific prompts, just started using it but seems good
>>
>>107668336
have a look in the mirror my dude
>>
>>107668337
I doubt that.

>>107668344
I will try it then, thanks anon.
>>
File: 1766707370.png (2.14 MB, 1504x1024)
2.14 MB
2.14 MB PNG
>>
File: trellis2.png (468 KB, 910x843)
468 KB
468 KB PNG
Trellis 2
>>
>>107668368
qrd
>>
I am creatively bankrupt today. There is nothing I have generated worth posting. See you anons tomorrow.
>>
>>107668337
>it's basically nano banana pro at home
lmao, not even close, that plastic generator will never be as good
>>
File: 7.mp4 (420 KB, 448x416)
420 KB
420 KB MP4
>>107668368
>>
No one tried Kandinsky pro? That's crazy, i2v and t2v is fully uncensored...

The image model is also interesting, but it a Z-image refinement. Clearly very fresh training data, that's cool.
>>
>>107668234
>lain
based anon
https://www.youtube.com/watch?v=XtOsfHoDDdI
>>
File: Polish_20251221_160654071.jpg (424 KB, 1080x1321)
424 KB
424 KB JPG
>>107668336
>post deleted immediately
Well if this isn't a reminder that the 'person' splitting and derailing these threads isn't a seething tranny janny begging for mean replies so they can b& us I don't know what is.
Lol, lmao even!
>>
File: 1758571985844884.png (59 KB, 578x444)
59 KB
59 KB PNG
>why don't we remove everything that was working and put it all into this tiny section of the screen that also has constant useless pop-ups and alerts
masterful UI design. Truly exquisite.
>>
>>107668324
thank you migu :)
>>
>>107668423
that's why I went back to older frotends, it's getting worse and worse
https://www.reddit.com/r/StableDiffusion/comments/1pnizap/for_those_unhappy_with_the_modern_frontend_ui_of/
>>
>>107668337
Sucks at realism but if you're just trying to make memes or using non-realistic characters as a reference for new scenarios then I guess it's fine.
>>
>>107668423
popup toast notifications were a mistake (not just in comfy, in general)
>>
>>107668337
People praise nano banana pro for the fact it can reproduce humans perfectly, QiE is the antithesis of that it's way too slopped to be good on realism
>>
File: z_mod_00036_.jpg (824 KB, 2064x1408)
824 KB
824 KB JPG
>>
File: 1761667252882586.png (620 KB, 1237x1157)
620 KB
620 KB PNG
>>107668449
(((the man who invented pop ups))) apologized for that btw loool
>>
>>107668337
kek
>>
File: 1761944390352503.png (2.66 MB, 1440x1512)
2.66 MB
2.66 MB PNG
>>107668324
heres a test, outfit swap + beer mug to candy cane, success.
>>
>>107668388
>Still no open source text to armature animation model.
>>
>>107668400
Let's all love /ldg/
>>
>>107668194
Now that we know base won't be released on christmas, do we have another cope for a day's release?
>>
File: 1765788800421537.png (2.45 MB, 1272x1920)
2.45 MB
2.45 MB PNG
>>107668477
also, it did a good edit of anri while preserving her appearance.

the japanese girl is wearing a low cut red christmas bikini and a santa hat. there is snow on the walls. she is holding a sign saying "merry christmas!" in festive christmas text.

comfy site workflow seems good to me.
>>
>>107668328
invite/6wUwtcJsr2
>>
File: 1745859318212322.png (2.13 MB, 1200x1800)
2.13 MB
2.13 MB PNG
>>107668509
same prompt, diff girl:
>>
File: 1740508795704898.png (1.91 MB, 1200x1800)
1.91 MB
1.91 MB PNG
>>107668531
>>
>>107668502
Christmas is not Chinese culture.
>>
File: z_mod_00038_.jpg (760 KB, 2064x1408)
760 KB
760 KB JPG
>>
>>107668554
when's their next celebration or something? I wanna cope on another day :(
>>
>>107668515
Is that tr*nis doxxcord?
>>
>>107668585
No lol
>>
File: 1751173525938399.png (1.74 MB, 1048x1592)
1.74 MB
1.74 MB PNG
imagescaletototalpixels node seems to be effective

another test with saint floyd:
>>
>>107668601
>merry fentmas
kek
>>
File: 1758416104746594.png (2.04 MB, 1592x1048)
2.04 MB
2.04 MB PNG
>>107668601
okay, it's really clean with that enabled. now it can scale above the original image size.
>>
I want Zedit
>>
File: 1754613203523591.png (1.86 MB, 1080x1544)
1.86 MB
1.86 MB PNG
>>107668654
diff girl
>>
>>107668585
follow the instructions to get access to the rest of the server
>>
>>107668392
>That's crazy, i2v and t2v is fully uncensored
yeah, I'm not falling for it
>>
File: 1748356370432865.png (1.9 MB, 1120x1488)
1.9 MB
1.9 MB PNG
>>107668673
pretty neat, good workflow on the comfy site.
>>
File: z_mod_00057_.jpg (844 KB, 2064x1408)
844 KB
844 KB JPG
>>
Like a crack fiend I'm considering renting just for a short time until I'm back to my Home Computer
>>
File: 1742075250614308.jpg (889 KB, 1142x2536)
889 KB
889 KB JPG
>decide to get back into imagegen for the first time since flux launch
>update comfy
>the entire ui has now been redesigned to huge-padding giant-buttons-and-margins mobileshit
AIIIEEEEEEEEEEEEEE
>>
File: z_mod_00066_.jpg (909 KB, 2064x1408)
909 KB
909 KB JPG
>>
>>107668731
go back to tradition old man >>107668439
>>
File: 1736727188591431.png (2.1 MB, 1072x1552)
2.1 MB
2.1 MB PNG
>>
>>107668481
Even no decent video to armature animation or just static armature.
>>
>>107668811
It's so weird. It's totally possible yet nobody does it.
>>
File: ComfyUI_03541_.png (1.55 MB, 1024x1024)
1.55 MB
1.55 MB PNG
>>
File: trellis2.png (3.67 MB, 1440x2560)
3.67 MB
3.67 MB PNG
Trellis 2
>>
File: 1761822862353359.png (1.34 MB, 1344x1240)
1.34 MB
1.34 MB PNG
give the white anime dog a santa hat and christmas outfit. there is snow on the ground. keep their pink hair the same.

cute christmas doro!
>>
>>107668811
>>107668816
did you try SAM3?
https://www.aidemos.meta.com/segment-anything/gallery
>>
>>107668848
sovl
>>
File: 1735904725231078.png (1.69 MB, 1344x1240)
1.69 MB
1.69 MB PNG
>>107668848
>>
>>107668824
Is this fullsize or quanted? Have you done quant comaprisons how much it kills quality?
>>
>>107668848
That dog has autism
>>
>>107668905
Full size. I'm not aware of any quants as it's a 4b model.
>>
File: 1761633454577562.png (1.38 MB, 1344x1240)
1.38 MB
1.38 MB PNG
>>107668893
>>
>>107668848
I mean yeah QiE is good at anime character but we already knew that on its previous versions, I'd like it to be good on realism too now, why can't they just borrow the Z-image turbo method, they're on the same company after all
>>
>>107668875
I did. Its image to pose with an armature thing is really cool, but I feel the dropped the ball but not going the extra step and adding video to that.
>>
>>107668909
>piss filter
>>
>>107668925
it does work good on realism too.

not lewd but the image is 7mb.

https://files.catbox.moe/0mjvun.png
>>
>>107668223
who gives a shit? Z-image is better for T2I and qwen is better for editing. It's over for flux.
>>
>>107668959
it's just a simple edit anon, obviously it's not gonna fuck up if you change 3 pixels (it still fucks it up by zoomin it though), make her skate or some shit and see how slopped she'll become
>>
>>107668223
>Any word on when Flux.2 Klein is coming?
wait you're waiting for that? it's gonna be worse than flux 2 dev and flux 2 dev is inferior to z-image turbo lmao
>>
File: 1735666752859267.png (1.62 MB, 1448x1152)
1.62 MB
1.62 MB PNG
qwen edit but with a pepe as input:
>>
comfy should he
>>
>>107668969
>make her skate
why would I want to do that?
>>
>>107669027
>he did make her skate and finally noticed the shortcommings of QiE
>>
has anyone used qwen layered? I got this shit installed up and running but I have no idea how its supposed to work. it said its a slow model and to use 50 steps. Currently at 14 percent. Feels like I can do anything with 64GB
>>
are there any good and not bloated comfyui webui gui wrappers? i hate looking at spaghetti
>>
testo
>>
>>107669104
swarm
>>
>>107669106
Welcome back from your ban.
>>
>>107669077
It's really good if you have a specific use case for it. If you're asking for ideas then you probably don't need it
>>
supersonics looking like a 1-and-done
poverty franchise
>>
>>107669133
I think you've lost your way, this isn't /sp/ kek
>>
I love LDG
>>
File: 1746095081533806.jpg (691 KB, 1600x2048)
691 KB
691 KB JPG
>>
File: 1753162967083995.jpg (447 KB, 1944x1328)
447 KB
447 KB JPG
>>
File: ComfyUI_09624_.png (1.33 MB, 1096x944)
1.33 MB
1.33 MB PNG
>>
>>107668234
i wish lainchan was more active but its cool to see the progression of ai art 5 years back in a single thread
>>
File: 1764756763922351.png (1.98 MB, 1608x1040)
1.98 MB
1.98 MB PNG
>>
>>107669218
lmaooo, Real Migumunism hasn't been tried yet!
>>
File: 1765525777492044.png (1.57 MB, 1608x1040)
1.57 MB
1.57 MB PNG
>>107669218
>>
>>107669112
There was an anon who shared an unofficial extension not a node that made a Swarm like interface in Comfy. It was just a sidebar with all the sampler tabs together. I don't know why this isn't in Comfy by default.
>>
>>107669283
this?
https://github.com/chrisgoringe/cg-controller
it hasn't been updated in forever so it might be broken
>>
File: ComfyUI_09625_.png (121 KB, 280x360)
121 KB
121 KB PNG
>>
File: 1742099589148411.jpg (308 KB, 1756x1019)
308 KB
308 KB JPG
Making neat workflows is as fun as genning.
I got a Z-workflow that enables prompt rewriting with a single switch. I2I is also enabled by another switch. Node grouping is a godsend.
Prompt rewriting adds almost flat 10s to generation. Not a big impact when genning at high resolution.
Also, if you have 16 gb vram, load clip on cpu. Text encoding will be slower, but you might save time on vram juggling. Running TE on cpu shaves off 1s from total time for me.
>>
>>107669351
>Making neat workflows is as fun as genning
no it's not, you're just autistic
>>
>>107668875
Is it local?
>>
File: 1738880877409680.png (1.27 MB, 1488x1120)
1.27 MB
1.27 MB PNG
pretty neat
>>
>>107669351
Looks interesting. Share it?
>>
File: 1761472154860676.png (1.95 MB, 1488x1120)
1.95 MB
1.95 MB PNG
the anime man in image1 is pointing a gun at the black man in image3. change the text at the bottom to "I came here to take your fent."

kek
>>
File: crazypepe.png (2.76 MB, 1504x1504)
2.76 MB
2.76 MB PNG
>maek post
>do captcha
>click Post instead of Next
>have to wait another 30 seconds
>>
>>107669351
are you using a custom node to rewrite your prompts? and what llm are you using?
>>
>>107669384
kek, same, I have to get used to it too, but if it filters more bots and schizos I'm all for it
>>
>>107669392
v&



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.