[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107395519

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe
https://github.com/ostris/ai-toolkit

>Z
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://comfyanonymous.github.io/ComfyUI_examples/z_image/

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
>>107398266
>>Z
needs to be "Z Image Turbo" btw
>>
comfy should be dragged out on the street and shot
>>
>>107398278
RUSSIA RUSSIA DERZHAVA NASHA
RUSSIA RUSSIA AMERIAK PA RUSSIA
>>
Can I please have a tutorial on wan 2.2 for linux?
the one in the OP is for windows only.
>>
>>107398305
amerika* :(
>>
Ok I wll install AI toolkit, I wonder how many dozens of pyhton dependencies I will have to download
>>
how long until the full z image model that's massive bros. Another rugpull I fear :(
>>
>>107398311
nobody here uses linux since it's a troon OS
>>
>>107398311
Got matrix/element? I wouldn't mind helping you out.
>>
>>107398311
just ask chatgpt how to install comfy, triton and sageattention
>>
File: zimg.png (1.25 MB, 1024x1536)
1.25 MB
1.25 MB PNG
>>
File: 00153-1047012706.png (1.71 MB, 1024x1544)
1.71 MB
1.71 MB PNG
>>
File: z-image_00069_.png (1.3 MB, 1024x1024)
1.3 MB
1.3 MB PNG
>>
>>107398311
If you installed Linux you should be able to figure it out
>>
>>107398335
Is that Z with no loras?
>>
>>107398334
kewl
>>
File: comparison_models_small.jpg (2 MB, 4412x3250)
2 MB
2 MB JPG
Ok here it is. My big comparison of Qwen Image, Chroma, Z-Image Turbo, and SD3.5 Medium, with a few notable SAAS models thrown in. The full image (twice as big) is here:
https://files.catbox.moe/2i5oy9.jpg
>>
>>107398347
Yes sir
>>
File: 177.jpg (910 KB, 857x579)
910 KB
910 KB JPG
>>107398323
Never. Base model will never be released. Doomposters are always right.
>>
>>107398353
>zit
arr rook same
>>
>>107398353
oof... ZIT's prompt variation is inexistant lmao
>>
>>107398354
Please gib the prompt
>>
>>107398353
I forgot to mention that I was testing not performance in general (in which this kind of test would be too forgiving) but variety
>>
>>107398353
based
>>
File: zimg.png (1.68 MB, 1024x1536)
1.68 MB
1.68 MB PNG
>>
>>107398353
>hd fp8
>q4
>>
>>107398394
I'm a promptlet though

Hyperrealistic Picture of a beautiful Russian girl with large breasts blonde hair and light freckles wearing a french maid outfit black choker and a stockings.
Reclining on a chair with one of her knees up and speaking on an old rotatory phone. One of her hands is holding the phone, the other is adjusting her hair
>>
File: chroma_00013_.png (1.72 MB, 896x1344)
1.72 MB
1.72 MB PNG
>>107398353
interesting that qwen labels the image so much more often
>>
>>107398335
please more
>>
>>107398406

ho ho ho
>>
Is the gook ZIT face the new Flux buttchin?
>>
>>107398389
IT'S FOOKING OVERCOOKED
>>
File: Nothing ever happens.png (1.38 MB, 1280x720)
1.38 MB
1.38 MB PNG
>>107398363
>Doomposters are always right.
Doomposters said they wouldn't release turbo, and that if they released turbo it wouldn't look as good as the demo showcase though lol, trust the chinks on that one
>>
>>107398389
Base will fix this
>>
File: ComfyUI_00092_.png (953 KB, 1504x1024)
953 KB
953 KB PNG
>>107398456
>>
File: ComfyUI_temp_gvfih_00009_.png (3.45 MB, 1496x1120)
3.45 MB
3.45 MB PNG
testing out the ZIT anime lora
>>
>>107398353
I blame the distillation, Turbo has 2 distillations (guidance + steps) it kills the variety
>>
File: kit.png (4 KB, 410x109)
4 KB
4 KB PNG
Does this do something? I forgot to include it, but the lora turned out fine.
>>
File: ComfyUI_00093_.png (1.19 MB, 1504x1024)
1.19 MB
1.19 MB PNG
>>107398472
I think we have a happening
>>
File: ComfyUI_00094_.png (1.02 MB, 1504x1024)
1.02 MB
1.02 MB PNG
>>107398483
hey, YOU 1girl, *I* wapple.

this one is a little less perfect.
>>
>>107398475
Very deboesque look.
>>
>>107398353
This is why vramlets should shut the fuck up when the adults are talking in these threads. The only reason your toy models can spit out decent looking images at their size is because they're heavily distilled and overfitted
>>
https://github.com/ChenDarYen/ComfyUI-NAG/pull/64
Oh shit, this PR makes NAG work on Z-image!! dude!!
>>
>>107398475
qwen synthslop
>>
>download quen_3_4b.safetensors to clip folder
>try to gen
>comfyretardUI is now downloading quen_3_4b.safetensors to clip folder
I don't get it
>>
>>107398353
SD 3.5 and Flux 2 seem to be the winners. A "disposable" doesn't have the white polaroid/instax framing.
>>
Z-image seems to have difficulty learning new manga art styles.
>>
>>107398525
>SD 3.5 and Flux 2 seem to be the winners
So why is no one using them
>>
File: combined_0119.jpg (931 KB, 2040x3840)
931 KB
931 KB JPG
>>
>>107398529
Get a decently sized diverse dataset and train at rank 32
Do not use "trigger words", use simple prompts like "an illustration of (...)"
>>
File: soon.png (1.43 MB, 1536x1024)
1.43 MB
1.43 MB PNG
>>
File: ComfyUI_00096_.png (976 KB, 1504x1024)
976 KB
976 KB PNG
>>107398497
heheheheh it's happening.

>blurry kodak disposable photo of the face of my buddy al's sister julia. a really intriguing beautiful girl in Brisbane, Australia, 2003. Her eyes are really interesting.

>>107398501
>>107398353
thanks, will give it a go
>>
>>107398537
they're not as efficient at generating 1girl, standing, looking at viewer
>>
>>107398353
try zit fp8. looks like i have more variations than with bf16
>>
File: 4164.jpg (638 KB, 832x1248)
638 KB
638 KB JPG
Perfect female body type
>>
File: ComfyUI_00097_.png (1.13 MB, 1504x1024)
1.13 MB
1.13 MB PNG
>>107398497
so I take that seed, with basically medium negative, and go full negative here. This is way better than nothing!
>>
>>107398450
Literally can't run larger on my card and I stated that as a limitation at the outset. I've done a lot of Q4 Chroma prompting in my time and can attest it works very well and looks nearly as good, so I don't see an issue. If someone wants to run 75 images with full Qwen Image and otherwise the same prompt/settings as laid out here, I'll happily replace the images in the chart. But they have to promise not to cherry-pick.

>>107398476
Yeah it would have been more fair to compare ZIT and Chroma HD Flash, as I did in this image a few threads ago

>>107398525
I think some blurring of the lines between related but distinct concepts like Kodak Disposables vs Polaroids etc is ok and basically unavoidable in most models, it doesn't seriously bother me because I consider it an example of the same phenomenon that yields so many different faces.
>>
>>107398582
z-image cannot into style instructions.
>>
>>107398513
>https://github.com/ChenDarYen/ComfyUI-NAG/pull/64
I can't import NAG on ComfyUi since his new update that broke some of the nodes though
>>
>>107398335
Now try changing the age to 18.
>>
File: ComfyUI_00098_.png (1.4 MB, 1504x1024)
1.4 MB
1.4 MB PNG
>>107398558
>>blurry kodak disposable photo of the face of my buddy al's sister julia. a really intriguing beautiful girl in Brisbane, Australia, 2003. Her eyes are really interesting.
>>107398501
>>107398353

yep, asian...
>>
>>107398593
It also can't into "enormous bosom... ample cleavage", which once upon a time would have been completely disqualifying in these threads (we have since been so thoroughly beaten down by these cucked models that we have learned to accept it)
>>
File: ComfyUI_00002_.mp4 (602 KB, 640x832)
602 KB
602 KB MP4
>>
>>107398335
very nice, the second I ask for a blonde zimage makes her like a 40yo milf automatically, it's a bit annoying
>>
File: 1749523157890529.png (176 KB, 1000x577)
176 KB
176 KB PNG
>>107398571
model?
>>
>>107398620
You have to set your caucasian 1girl's age really low with zimage to make her look cute.
>>
>>107398614
Damn that's nice

>>107398620
Try using young and Russian, or just specify an age
>>
>>107398513
>>107398596
Do this
>git fetch origin pull/59/head:pr-59
>git fetch origin pull/64/head:pr-64
>git checkout pr-59
>git checkout -b combined-59-64
>git merge pr-64
you'll get a weird unix command shit, you have to write this
>:wq
and you press Enter

and you're good to go you can try out this new node
>>
>>107398456
>>107398406
>>107398472
>>107398483
NAG doubles gen times right?

According to this, NegPIP also works with Z:
https://github.com/hako-mikan/sd-webui-negpip

NegPIP doesn't effect gen times. Which tech is better?
>>
>>107398332
>just ask chatgpt

NEVER do this for linux anything. ShatGPT pulls and mixes sources together giving you the wrong steps and omitting crucial information.

>>107398353
Chroma<3. I knew Z suffered from same face but goddamn. Chroma-Z when?
>>
>>107398673
fucking chatgpt helped me lose 1tb of my games when i was trying to fix a problem in sandboxie

it just loves to fuck shit up constantly
>>
>>107398582
>3 feet

oh man, Chroma is amazing but also horrid
>>
>>107398266
ANIME DIFFUSION NEWS ANCHOR!

>Noob Models!
SeeleNoobAI (2048 native resolution): https://civitai.com/models/1445275/seele-noobai-sdxl
Chenkin Noob XL:(NoobAI ESP with new dataset of character)
https://civitai.com/models/2167995/chenkin-noob-xl
WAI Shuffle Noob
https://civitai.com/models/989367/wai-shuffle-noob

>Anime LoRa Making Guide!
https://civitai.com/models/22530/guide-make-your-own-loras-easy-and-free

>Model News!
ZiT Zeta Image Turbo Model: a new 6b model, It's fast, open-source but the main problem is it doesn't understand booru tags.
UIs that supports it: Comfy, Krita AI Diffusion, Neo Forge, Swarm, SD Next

>Anime ZiT LoRas!:
Frieren LoRA
https://civitai.com/models/2176854/frieren-beyond-journeys-end-sousou-no-frieren-z-image-lora
Flat Anime Style:
https://civitai.com/models/2175307/z-image-flatanimestyle
Ra Lilium Style:
https://civitai.com/models/2125529/ra-lilium-style
Nyalia Style:
https://civitai.com/models/2180136/nyalia-style
Anime Flat Style:
https://civitai.com/models/1952560/anime-flat-style

ALSO ANIME CHARACTER LORA REQUESTS GO HERE!
>>
File: 3567547547.png (24 KB, 705x642)
24 KB
24 KB PNG
>>107398681
>fucking chatgpt helped me lose 1tb of my games

>people like this exist and you have to share your oxygen with them
>>
>>107398627
THAT'S MY GIRLFRIEND

stop staring, wow, you are a PERVERT
>>
File: 00153-10470127034.png (1.45 MB, 1024x1544)
1.45 MB
1.45 MB PNG
>>
>>107398691
>filter
>/ANIME DIFFUSION NEWS ANCHOR!/i
>>
File: combined_0016.jpg (977 KB, 3958x2040)
977 KB
977 KB JPG
>>
>>107398673
>NEVER do this for linux anything. ShatGPT pulls and mixes sources together giving you the wrong steps and omitting crucial information.
If you're using the thinking model and have basic knowledge of commands/scripts, and not just mindlessly copy pasting, you can do quite a lot and won't break anything.
>>
>>107398725
no you can't
>>
File: ComfyUI_00101_.png (875 KB, 1504x1024)
875 KB
875 KB PNG
>>107398609
>>>blurry kodak disposable photo of the face of my buddy al's sister julia. a really intriguing beautiful girl in Brisbane, Australia, 2003. Her eyes are really interesting.
>>107398501
>>107398353

that prompt is very bad. it can't really be recovered through a negative. Like... kind of lol. picrel.
>>
File: ComfyUI_00102_.png (918 KB, 1504x1024)
918 KB
918 KB PNG
>>107398746
idk, kind of worked. It's not the zit azn hooker.
>>
>>107398691
https://civitai.com/models/2175612/kasane-teto-z-image-lora
>>
>>107398725
it also requires not being a moron, and so many people fail that test and just mindlessly follow whatever the model says, not asking for confirmation, check etc
>>
File: ComfyUI_00595_.png (2.11 MB, 1600x960)
2.11 MB
2.11 MB PNG
>>
What tool would be best to caption images for z training? I took my set from SDXL but it didn't work out that great.
>>
>>107398596
>>107398661
>NAG doubles gen times right?

!!!!!!!

I'm not using NAG!!!!!!

for now just euler.
schedulers linear_quadratic and beta.
cfg 5 with the apples.

>>107398746
>>107398759
these are burned, but cfg 12 with this negative:

oriental, asian, Chinese, Indian, Indonesian, Pakistani, Bangladeshi, Japanese, Filipino, Vietnamese, Iranian, Turkish, China, India, Japan, South Korea, Indonesia, Saudi Arabia, Taiwan, UAE, Thailand, Philippines

this negative might can be improved, maybe it's too long, idk.

>>107398560
lmao yeah better watt:1girl ratio.
>>
>>107398801
nobody has made a good zit lora yet.
>>
File: Eva-00 64k.jpg (317 KB, 1600x1037)
317 KB
317 KB JPG
>>107398723
chef's kiss pc-captioned
>>
never prompt zimage for natalya poklonskaya in russian it makes hillary clinton
>>
File: ComfyUI_00008_.mp4 (2.33 MB, 832x640)
2.33 MB
2.33 MB MP4
>>
>>107398611
>>107398593
Faggots who can't be precise and instead rely on "vibes"
Fucking kill yourself
>>
File: ComfyUI_00009_.mp4 (2.29 MB, 832x640)
2.29 MB
2.29 MB MP4
always fucking yapping
>>
>>
File: nbp-x37.jpg (1.97 MB, 1482x2048)
1.97 MB
1.97 MB JPG
>>
>>107398820
I've made one but I don't think my tagging was that good
>>
File: Zurbo_00100_.jpg (698 KB, 2688x1536)
698 KB
698 KB JPG
Just posting.
>>
can you make money making loras
>>
File: 1746585788307612.jpg (302 KB, 2048x1024)
302 KB
302 KB JPG
>>107398642
it's even worse when you ask for bigger boobs or glossy lips, it just default to 40yo milf even if you type "young woman"

>>107398644
picrel was specifying mid twenties news anchor, one time a blonde american, second time a korean one
>>
File: 1750865958258189.png (3 KB, 365x88)
3 KB
3 KB PNG
>>107397636
>6s per 1280x1280 zimage with powerlimited 3090 and no fp16 accumulation, big
Ok so this didnt last,
I think theres a problem in comfy/pytorch or whatever that prevents double performance speed boost, because every once in a while, as im queueing a lot of prompts, the next batch gets allocated differently in vram/ram and goes at double the speed.

I don't think it's my setup thats usually by default running at half the speed or anything like that since i believe my numbers match with other people online

This kind of thing happened before to me even when running old wan 2.1 generations, a few queued videos in between 30 queued ones for example finish at half the time randomly at no quality loss.

I tried playing around with cli flags, nvidia p states, windows power profiles etc and cant reproduce these fast generations.

During those fast generations, I notice that during the allocation of the actual transformer model stage, a ~2-4gb get allocated less in vram, but instead i see them allocated in "Shared GPU memory" blue graph in task manager, which usually has 0 in it.

Can anyone with a 3090 give me their numbers of how long it takes them to generate 8 1280 1280 images in a single batch of for example z image turbo bf16, 8 steps euler simple shift 3 cfg 1, full text encoder model and ae.safetensors basic workflow?

I'm using:
latest update of comfy, all nodes, drivers, windows, triton windows
--windows-standalone-build --disable-api-nodes --disable-auto-launch --async-offload --use-sage-attention
Python version: 3.12.8
Total VRAM 24576 MB, total RAM 130985 MB
pytorch version: 2.9.0+cu130
Set vram state to: NORMAL_VRAM
Device: cuda:0 NVIDIA GeForce RTX 3090 : cudaMallocAsync
Using async weight offloading with 2 streams
Enabled pinned memory 58943.0
>>
>>107398934
The prompt was precise, "thick dark outlines". Where are they?
>>
>>107398997 (me)
I also tried 380w power and p0 gpu state and it still took ~85s for those 8 images in total so ~10.5s per image, but before even with 270w p2 normal nvidia power profile it got to 6s per image for a few batches of 8 queued generations like i initially said.
>>
File: ComfyUI_00009_.png (1.2 MB, 1024x1024)
1.2 MB
1.2 MB PNG
Is there a particular reason why nunchaku can't into multigpu setups besides them just not caring?
>>
File: 1746638938389387.jpg (314 KB, 2048x1024)
314 KB
314 KB JPG
>>107398986
and this was specifying 20 years old instead of mid 20s (and not changing hair color for the korean)
I really can't wait for an age slider lora, this is ridiculous
>>
>comfy should be dragged out ont he street and shot
>>
File: NAG on Z-Image Turbo.jpg (861 KB, 3072x1314)
861 KB
861 KB JPG
>>107398513
>>107398657
It works.
>>
>>107398971
gawddamn
>>
>Ask zit for tits
>Get nothing
>Don't ask for tits
>Get them in every fucking image
what causes this
>>
>>107398513
>NAG
huh
>>
File: 1754781809944788.jpg (281 KB, 2048x1024)
281 KB
281 KB JPG
>>107399033
and this is going at 15 years old, which finally gave something looking like 20-30 from the american anchor, (but didn't change much on the korean one) which means you were right >>107398642, you have to make it dog years or something for it to actually look like the age you want
>>
File: ComfyUI_08340_.png (3.42 MB, 1280x2048)
3.42 MB
3.42 MB PNG
>>
File: ComfyUI_00010_.mp4 (1.04 MB, 832x640)
1.04 MB
1.04 MB MP4
>>107398971
>>
>>107398763
Cool!
>>
File: 1749909271209612.jpg (1.44 MB, 3726x2900)
1.44 MB
1.44 MB JPG
>>107399089
NAG allows you to simulate a cfg on guidance distilled models, it had been used on Kontext and now it's working on Z-Image Turbo
https://chendaryen.github.io/NAG.github.io/
>>
>>107399147
Very interesting! Thanks for sharing anon^^
>>
I got this ZiT thig to work and it gives me 1girls in less than 20 seconds on 12Gb, pretty good.
>>
File: file.png (1.8 MB, 2113x1058)
1.8 MB
1.8 MB PNG
so i tried training pixel art character lora for z-image but noticed something.
big discrepancy on running the lora on fp16 model vs fp8 model. i trained in fp8 and then when i use on fp16 it has worse adherence than genning it on fp8
picrel - same seed, left is fp16 genned, right is fp8 genned is closer to original dataset with soft shadows
this is counter intuitive because you'd think genning on fp16 would be better even if you train lora on fp8
>>
File: wan22__00002.mp4 (2.48 MB, 1024x768)
2.48 MB
2.48 MB MP4
trying that dasiwa wan you posted on the previous thread. Lightning and nsfw loras baked in
>>
>>107399147
What does this do in terms of inference times, I'd assume it is twice as much?
>>
>>107398964
Nice style, how was your genning process?
>>
>>107399203
it shouldn't be, it's still cfg 1
>>
When using ZIT, my GPU doesn't get as loud and hot as with other models, even though it's still at 100% usage. Anyone else notice that?
>>
>>107399203
>>107399214
it's making it a bit slower since it's doing the same thing like cfg but not quite, it's like 50% slower
>>
File: 1758047793931616.jpg (679 KB, 2048x1280)
679 KB
679 KB JPG
>>107399041
>>107399147
buh bye stupid bokeh lmao
>>
>>107399224
>>107399214
o-okay daddy...
>>
>>107399199
very creative, brety gud
>>
>>107398657
can you git pull afterwards tho
>>
File: 1734015524654685.png (295 KB, 3109x1623)
295 KB
295 KB PNG
>>107399233
>o-okay daddy...
https://chendaryen.github.io/NAG.github.io/
it depends on the architecture, for Z-image turbo it's 50% slower (from 22 sec to 36 sec here) >>107399041
>>
So how's the apple model which just dropped? Anyone tried? https://huggingface.co/apple/starflow
>>
>>107398999
Z-Image Turbo fans get weirdly defensive about it.
>>
File: ComfyUI_temp_yzfcx_00015_.jpg (451 KB, 1176x1776)
451 KB
451 KB JPG
ZIT is neat, but i think i'm gonna stick with Chroma cuz it just werks
>>
>>107399266
post the other 14 images full of body horror
>>
File: ComfyUI_00112_.png (1.18 MB, 1504x1024)
1.18 MB
1.18 MB PNG
zit can dooooit

but has some issues 8^)
>>
>>107399194
>this is counter intuitive because you'd think genning on fp16 would be better even if you train lora on fp8
its not because when training in fp16 the lora is learning and accounting for all of the extra information it has access to that suddenly gets zeroed out later and vice versa, especially for pixel loras where the smallest thing can throw pixel alignment off
>>
File: ComfyUI_temp_lhgbq_00012_.png (3.76 MB, 1072x1616)
3.76 MB
3.76 MB PNG
>>107399289
No idea what you mean :^)
>>
File: ComfyUI_01255_.png (1.67 MB, 1536x1152)
1.67 MB
1.67 MB PNG
>>
File: ComfyUI_00012_.mp4 (1.04 MB, 832x640)
1.04 MB
1.04 MB MP4
>>
>>107398353
This is supposed to be an attack on Zim Turbo but I've met Julia from Brisbane and that's her 100%.
>>
>>107399302
makes sense. lesson learned and will only train on fp16 from now on
>>
>>107399317
me
>>
As a Vramlet I can say ZiT is pleasant because I don't have to jump though any jewish hoops, even with offloading it just werks.
>>
>wake up
>still no base
>>
>>107399242
you can't, you're not in the main branch anymore
>>
File: images.jpg (7 KB, 168x300)
7 KB
7 KB JPG
>>107399306
inclusive dataset, nice to see
>>
File: Zurbo_00102_.jpg (533 KB, 3328x1792)
533 KB
533 KB JPG
>>107399136
>>107399323
Oh hey, it's you again. The guy quietly animating images.
>>
File: 1762537228540224.png (104 KB, 588x598)
104 KB
104 KB PNG
So did anyone train in the v2 adapter or knows someone who did? Verdict?
>>
Just found some apparent comfyui performance improvements, is anyone using these?

https://civitai.com/articles/23189/improving-comfyui-performance-with-new-extensions

https://github.com/SparknightLLC/ComfyUI-DisableJobHistory
https://github.com/SparknightLLC/ComfyUI-DisableBrowserLogs
https://github.com/SparknightLLC/ComfyUI-TempFileCleaner
>>
>>107399337
Even if you are offloading, BF16 is faster than Q8 gguf (which takes half as much memory). It's a good idea to use
>https://github.com/SeanScripts/ComfyUI-Unload-Model/
after last VAE Decode node to force memory clean up.
>>
File: ComfyUI_00116_.png (1.16 MB, 1504x1024)
1.16 MB
1.16 MB PNG
I figured out how to escape zit azn hooker, but now I'm getting this one.

>>107399266
Yeah, chroma has a place.
>>
>>107398353
Test Z image with noise strategy to increase variety. Though even then I don't think it'd be Flux.2 tier. Also this is why I'm suspicious about Z base, why haven't they given us the model yet given all the limitations of turbo distillation? If other one is only 6B, even if it takes a hit to speed (2 mins per gen) for variety and lack of distillation it'd be worth it.
>>
>>107399102
Yeah, sometimes I had to decrease age to 12 or so to make the girl actually look 18.
>>
File: ComfyUI_00115_.png (1.1 MB, 1504x1024)
1.1 MB
1.1 MB PNG
I now have a theory.
>>
>>107399199
Damn, genji sure got a few upgrades lately.
>>
>>107399358
false.

No model is any good at amputation.

Simple attempt: A woman who is amputated below the elbow.

notice I didn't say she has a prosthetic.

I don't think I should have to say there's a stump, but you can try that too.
>>
File: combined_0141.jpg (708 KB, 2040x3377)
708 KB
708 KB JPG
>>
>>107399416
kys pedo
>>
File: ComfyUI_00118_.png (1.11 MB, 1504x1024)
1.11 MB
1.11 MB PNG
>>107399429
>>
>>107399266
Based. Chroma is stupid fun. Just found a nice little realism checkpoint https://huggingface.co/dawncreates/UnCanny-Photorealism-Chroma-GGUF gonna try it out
>>
zit loras are so good. too bad, we have the same problem with cloned faces
>>
File: ComfyUI_00119_.png (1.1 MB, 1504x1024)
1.1 MB
1.1 MB PNG
>>107399466
This is a zit unlockable character, apparently.
>>
>>107399306
Just use HD Flash
>>
>>107398657
why not just use scottmudge's branch?
>>
>>107398673
>NEVER do this for linux anything
I dunno, it just werked for me installing comfy on cachyos and I never really used linux before.
>>
>>107398469
copium
>>
>>107399504
tell me more about the zit ARG
>>
File: ComfyUI_00121_.png (1.1 MB, 1504x1024)
1.1 MB
1.1 MB PNG
>>107399538
Flux2 is the same. it's just sd and chroma that show high variation.

but zit's azn hooker is a solved problem (for me) :^)
>>
File: qwen-image-edit_0001.jpg (450 KB, 1024x1024)
450 KB
450 KB JPG
Testing qwen image edit, prompt "The anime girl is holding a sign "Anime Diffusion in /ldg/""
Very happy with the results, looking forward to seeing Zeta's image edit model
>>
>>107398475
100% trained on ai slop, retarded fucking indians just cant help themselves
>>
File: ComfyUI_08383_.png (3.42 MB, 1280x2048)
3.42 MB
3.42 MB PNG
>>
File: 1748429533166989.png (223 KB, 1108x892)
223 KB
223 KB PNG
>furkan is doing more research than everyone in ldg combined because the thread is nothing but vramlet retards now
kek
https://github.com/ostris/ai-toolkit/issues/552
>>
>>107399520
because you also need the fix to make NAG not output errors on the new comfyui version, and it's not working if you only go for the Z-Image Turbo implementation PR, I tried it without success unfortunately
>>
>>107399581
>muh vramlets
enjoy one DOA model after another
>>
File: combined_0040.jpg (696 KB, 3869x2040)
696 KB
696 KB JPG
>>
>chang claims to have trained on only real images
>some gens do look more real than other models
>but vast majority of gens are your basic run of the mill ai slop look
why is this tho desu?
>>
>>107399593
>i cant goon on my latop 1060 3gb so the company wont earn any money from this model!
the iq of the brown matches the vram of the brown kek
>>
>>107399581
All of that turkish "research" is worth half of that shitty anon comparison between yume and whatever XL shitmix kek
>>
File: 4082243392.png (1.35 MB, 1216x832)
1.35 MB
1.35 MB PNG
>>
>>107399615
big model=DOA
>>
>>107399581
>furkan
Does that guy actually give good tips for training LoRAs? Seems like he always trains LoRAs of himself.
>>
Z-Image base will probably be interesting, I'm certainly going to try it and probably I'll have fun. I just expect it'll be a bit disappointing like Qwen Image; not my kind of model. Maybe not, we'll see.
>>
>>107399581
post your 8x h100 vramlet
>>
>>107399629
>brown npc repeats his line without being able to engage
cant make it up
>>
>>107399604
Whats this pc-captioned-by-qwen thing? And how do I run it? Goddamn its beautiful
>>
>>107399640
You can train z image turbo easily even with the shitty 16 gb but you cant even do that lmao, retard general
>>
>>107399642
why do you come to the brown thread if you're going to get mad?
>>
File: ComfyUI_01264_.png (1.99 MB, 1536x1152)
1.99 MB
1.99 MB PNG
>>
>>107399642
>muh browns
big model=DOA
>>
>>107399638
>Z-Image base
It's too unsafe to release.
>>
>>107399657
i accept your concession vramlet
>>
>>107399585
I got "commiter identity unknown" when doing "git merge pr-64". Is that what you meant by weird unix command?
>>
File: ComfyUI_00125_.png (1.37 MB, 1504x1024)
1.37 MB
1.37 MB PNG
>>107399550
modify the prompt text for asian culture
ie replace young girl with lady, in this case

add the prompt text "and a werewolf" to the prompt*

translate prompt text to chinese (simplified)

in the negative put "beautiful", or a chinese translation of this.

cfg 4+
euler (I haven't tried the others yet enough)
beta or linear_quadratic


*werewolf step optional, and sometimes zit will blow it off anyway.
>>
>>107399678
yeah, you have to write :wq and press enter
>>
File: Zurbo_00070_.jpg (895 KB, 2432x2432)
895 KB
895 KB JPG
Logically, I SHOULD stop genning 1girls.
But I can't stop.
>>
>>107399661
>no werewolf
>>
>>107399635
yeah he does, behind his patreon hehe
>>
>>107399704
it's her! zit girl!
>>
File: G2dh9tXbYAEsXd8x.jpg (392 KB, 2048x1536)
392 KB
392 KB JPG
dear /ldg/
does z-image have inpainting yet
that's literally all I care about
thanks
>>
>>107399604
Very nice anon
>>
>>107399662
poor vramlet thinks his brown opinion matters lol
>>
>>107399718
you can inpaint with every single image models what do you mean?
>>
File: zimg_0151.png (2.34 MB, 1024x1496)
2.34 MB
2.34 MB PNG
>>107399718
can't you inpaint with any model?
>>
>>107399661
Nice 3d anime texture how did you do it
>>
>>107399723
big model=no finetunes or community developent=DOA
>>
>>107399643
Unfortunately, it means it's an original picture that was captioned by Qwen VLM to produce prompts for the two image models
>>
Too bad chroma can't do nudes
>>
>>107399725
>>107399726
seems like I get really bad edges around the inpaint area every time I try
nothing seems to make those go away
it's as if it's starting from an entirely new image that gets smashed into the low denoising value
>>
File: Zurbo_00084_.jpg (695 KB, 1920x3072)
695 KB
695 KB JPG
>>107399715
Her name is totally Zelda. You know, because Z.
>>
File: ComfyUI_00128_.png (1.36 MB, 1504x1024)
1.36 MB
1.36 MB PNG
>>107399690
>>
>>107399740
But it can do mutants and let's face it that's all that matter for muh dick
>>
File: ComfyUI_01263_.jpg (490 KB, 1536x1152)
490 KB
490 KB JPG
>>107399730
testing my old wildcard generator it's just bunch of booru tags plus that funny quality tag gibberish, this is the prompt for the anime girl image

>art photograph, wide angle view, halo effect, courteous, downtown, volcano, alien planet surface, (anorexic male:1.2), (humanoid robot:1.2), (\rose (Isekai Maou to Shoukan Shoujo no Dorei Majutsu)\:1.2), long legs, (kneeling:1.1), embarrassed, light yellow spotted hair, orange eyes, pink collared shirt, hot pants, bubble skirt, boots, long pointy ears, ribbon-trimmed gloves, energy aura, antimatter gun, tiling pattern, hamster, costume,
>masterpiece, best quality, ultra-detailed, absurdres, intricate details, ultra high resolution, 8k, 4k, HDR, UHD, professional photography, sharp focus, extremely detailed, realistic, photorealistic, photorealism, hyperrealistic, hyperrealism, cinematic lighting, studio lighting, soft lighting, volumetric lighting, perfect lighting, award winning, finely detailed, high quality, ultra quality, extremely delicate and beautiful, stunningly beautiful, breathtaking, magnificent, spectacular, remarkable, fascinating, incredible, gorgeous, elegant, exquisite, flawless, perfect, immaculate, pristine, ultra clean, crisp, crystal clear, ultra sharp, razor sharp, tack sharp, highly detailed skin, detailed skin texture, realistic skin, pore level detail, subsurface scattering, smooth skin, no artifacts, clean render, perfect anatomy, ideal proportions, depth of field, bokeh, film grain, f/1.8, lens flare, chromatic aberration, color graded, post-processing, tone mapping, ray tracing, global illumination, god rays, dramatic atmosphere, moody, epic, majestic, sublime, transcendent, divine beauty, absolute perfection, ultimate quality, pinnacle of art, artistic genius, visually stunning, jaw dropping, mind blowing, awe inspiring, revolutionary, groundbreaking, legendary, iconic, timeless masterpiece
>>
why does Z image outputs look like low compressed jpegs
>>
File: 1745338739279649.jpg (160 KB, 1024x1024)
160 KB
160 KB JPG
>>107399416
this is trying 12, she looks more like anywhere 15-18

so far basically any >20 will look 40+
the model is really weird for non asian
>>
>>107399374
i have, it definitely works fine, but i haven't done comparison to see how much better it is
>>
>>107399766
qwen image edit?
>>
File: ComfyUI_00130_.png (1.35 MB, 1504x1024)
1.35 MB
1.35 MB PNG
>>107399750
I really have no idea...
>>
>>107399765 (me)
also, why is this thread full of transphobes? you know anyone can easily image edit any photo to become anything you want nowadays, right?
>>
>>107399033
>>107399102
>>107399766
Western women are washed.
>>
Filtering namefags makes 4chan so much better
>>
>>107399699
I'm retarded. where do I enter that? I was using CMD
>>
>>107399761
>>107399730
to add:
I think it looks like 3d render because the meme tag compilation has lots of cg related keywords
>global illumination, subsurface scattering, clean render..etc
Would be funny to pick up every computer graphics related word and see what it does then.
>>
File: combined_0061.jpg (590 KB, 3934x2040)
590 KB
590 KB JPG
>>
File: Comfy_UI_04.png (1.96 MB, 1216x832)
1.96 MB
1.96 MB PNG
>>
>>107399814
still on cmd, you literally write :wq here and you'll see it's been written
>>
>>107399809
it's better if you just filter all image posts
>>
>>107399217
Probably because the model fits entirely on your gpu so it's not swapping constantly
>>
>>107399737
oh, I see now. woops!
>>
File: 1754345439593118.jpg (1.5 MB, 1536x2048)
1.5 MB
1.5 MB JPG
>>
>>107399761
>wildcard generator
Based and /sdg/ pilled
>>
File: ComfyUI_00133_.png (1.38 MB, 1504x1024)
1.38 MB
1.38 MB PNG
>>107399750
lmao
>>
>>107399842
What's sdg?
>>
fag
>>
>>107399845
https://civitai.com/models/2184198
>>
>>107399825
Nice tests, could you test copyrighted anime characters to see how the models re interpret them?
>>
>>107399781
z-image
>>
>>107399800
>washed
What does that even mean.
>>
File: 00153-0923752345.png (1.78 MB, 1024x1544)
1.78 MB
1.78 MB PNG
>>
>>107399873
Take too many showers
It's a term used by dudes with a smell fetish
>>
>>107399033
Left is 20 years old? What the fuck did they feed the model with?
>>
File: Zurbo_00004_.jpg (711 KB, 3328x1792)
711 KB
711 KB JPG
>>107399890
>Smoke coming out of the top cigarette instead out of the butt or some other weird place
Finally. Qwen fucked that up constantly.
>>
>>107399860
neat.

I would also like to be able to do something with the early preview, sometimes.
>>
File: 4203438961.png (1.18 MB, 1152x896)
1.18 MB
1.18 MB PNG
>>
>>107399906
she looks like a horse thief. shoot her.
>>
>>107399923
This outfit doesn't suit Arue
>>
File: ComfyUI_01268_.jpg (474 KB, 1536x1152)
474 KB
474 KB JPG
>>
I'm getting ready to train a lora... something I attempted with mixed results ~2 years ago.
I'll use noobvpred as the base model, and have ~60 ref images to work with.
Is kohya_ss still the best application to use for this, or are there better options now?
>>
>>107398353
Z picked up on the fact that a girl in Australia would be Chinese
>>
>>107399923
zit?
>>
File: 1751253914496366.jpg (1.72 MB, 1536x2048)
1.72 MB
1.72 MB JPG
>>
Did no nut november anon end up cooming?
>>
File: 1754032025974956.png (242 KB, 426x315)
242 KB
242 KB PNG
>>107399923
Oh, and Arue has black tattoos all over her body.
>>
>>107399906
smoke coming out of the top of a cigarette and correctly drawn swords are one of those things that every model fucked up so far, ZiT seems pretty good when it comes to these things
>>
File: 1735543820346145.jpg (1.03 MB, 1248x1824)
1.03 MB
1.03 MB JPG
>>
File: combined_0093.jpg (559 KB, 3788x2040)
559 KB
559 KB JPG
>>107399862
Both generally know the most common characters. With Z, it will usually invoke the default art style for the character regardless of instructions.
>>
File: ComfyUI-ZiT-iPhone_00030_.png (1.84 MB, 1152x1152)
1.84 MB
1.84 MB PNG
>>107398266
>>107398065
v2 of my LoRA trained with the v2 adapter now. Forgot to turn on differential guidance (not sure if it was a meme) but let the results speak for themselves. Skin texture/stability wise this is the best I've got.
>>
File: 00153-0923734645.png (1.74 MB, 1024x1544)
1.74 MB
1.74 MB PNG
>>107399906
That's pretty consistent
The real challenge is getting a russian girl who doesn't look half asian for some freaking reason
>>
File: Zurbo_00005_.jpg (1.18 MB, 3328x1792)
1.18 MB
1.18 MB JPG
>>107399929
I would prefer not to.
>>
File: combined_0092.jpg (466 KB, 3916x2040)
466 KB
466 KB JPG
>>
>>107399945
zit defaults to azn hooker ho
>>
File: ComfyUI-ZiT-iPhone_00033_.png (1.78 MB, 1152x1152)
1.78 MB
1.78 MB PNG
>>107399984
>>
>>107399985
NAG + neg asian
>>
>>107399742
for masking, try the Inpaint crop n stitch node. it gens a bigger image based on 2x or more context and then pastes it back onto your original image, you can also adjust the output padding to make it blend in better. if it still looks like it was obviously edited, I'll upscale with low CFG/denoise on SDXL to unify the output under one model
>>
>>107399988
we can't allow horse thieves about.
>>
>>107399984
>>107399997
Definitely better but there's this weird sort of texture that I can't quite put my finger on. Trained on Chroma outputs, right?
>>
>>107399985
keyword:
>gopnik girl
>>
Zit is very bad at doing hebe+ consistently. It knows only hags and cunny. 12 is just the number that heavily leans to cunny. Anything higher defaults to hags.
>>
File: 3903984981.png (1.2 MB, 1152x896)
1.2 MB
1.2 MB PNG
>>107399933
>>107399962
Been a while, I forgot about that. It's not represented at all in my dataset.
>>107399947
Yes.
>>
File: ComfyUI_00134_.png (1.49 MB, 1504x1024)
1.49 MB
1.49 MB PNG
>>107399845
>>107399860
80 steps instead of 9...
>>
>>107399976
Interesting and all this happens also with contemporary characters like gacha girls or only with 80s 90s animes?
>>
>>107400022
true suffering
>>
>>107400022
meant for >>107399766
>>
File: ComfyUI_00075_.png (2.13 MB, 1920x1088)
2.13 MB
2.13 MB PNG
How do i install nag?
>>
>>107399997
why are there no blacks in the pool???
>>
File: ComfyUI-ZiT-iPhone_00032_.png (1.97 MB, 1152x1152)
1.97 MB
1.97 MB PNG
>>107399997
https://files.catbox.moe/mztuas.safetensors
>>
>>107400010
good recommendation
doesn't solve the horrendous blending but this took time to fix on other models
I imagine it's just an implementation issue
I'll go try another webui or something
>>
>>107400015
I kinda like the unrealistic water.
>>
>>107399976
>flux fingers

you had 1 job, flux
>>
File: 1755685815504819.png (565 KB, 1227x1554)
565 KB
565 KB PNG
https://xcancel.com/PrunaAI/status/1995524846948700495#m
https://www.youtube.com/watch?v=4R-79bPlplQ
>P-Im𝗮ge and P-Im𝗮ge-Edit
People in the comments sections say that it's a finetune of Z-Image base and Z-Image Edit lol?
>>
File: ComfyUI-ZiT-iPhone_00034_.png (2.07 MB, 1152x1152)
2.07 MB
2.07 MB PNG
>>107400015
>Trained on Chroma outputs
Nah, no synthetic data, just real iPhone 16 Pro images. Extremely varied, so there should be face variety. This is workflow I'm using https://files.catbox.moe/w5vrbg.png

I'm testing different strengths, but now the weird fake skin texture is much less than before.
>>
>>107399984
...a plastic lora?
>>
>>107400022
>>107400037
I just want my age prompt to mean something for z-image
so far it's like you said, anything between 10 and 20 is a random lottery, and anything above 20 is basically the same as 40
less the case for Asians
>>
>>107400041
>How do i install nag?
https://www.reddit.com/r/StableDiffusion/comments/1pbrbrt/nag_normalized_attention_guidance_works_on_zimage/
>>
>>107400070
Wow nice, then. In terms of getting it close to the kind of realism chroma outputs.
>>
File: ComfyUI-ZiT-iPhone_00035_.png (2.46 MB, 1152x1152)
2.46 MB
2.46 MB PNG
>>
>>107400065
any proof other than vibes and intuition?
>>
File: ComfyUI_00136_.png (1.29 MB, 1504x1024)
1.29 MB
1.29 MB PNG
maybe I went too far
>>
>>107400065
probably just jeets using zimage and qwen edit
>>
File: ComfyUI_09526_.png (2.04 MB, 1152x1152)
2.04 MB
2.04 MB PNG
>>107400085
I've been telling you guys Chroma is based on real smartphone images (which also look similar to other professional cameras). Dataset was just 41 images from Flickr
>>
File: 3153464623.png (25 KB, 498x498)
25 KB
25 KB PNG
zimage when I prompt blue eyes
>>
>>107400041
>>107399814
you just use this custom branch and it'll work
git clone https://github.com/scottmudge/ComfyUI-NAG
>>
>>107400106
i see you're a fellow cunny enjoyer as well
>>
>>107400117
but anon said
>>107399585
>>
I don't give a shit how many gallons of snake oil it has i'm not using comfy
>>
I tried making a style lora for z image with 700 images and 5000 steps. Do you think that would be enough steps? It's... ok. Just wondering if I should try again with like 10k steps.
>>
>>107400132
ok fuck off then
>>
>>107400131
I fucked it up, I was using the wrong branch, if you go for main you're good
>>
>>107400110
Anyways, model is still a little too prude compared to Chroma but this is a start. Z is very promising, all these images of course look way better right away and background more coherent than what Chroma would output.
>>
File: 133242_0.jpg (355 KB, 1536x1152)
355 KB
355 KB JPG
>>
>>107399147
>and now it's working on Z-Image Turbo
Was it even updated? the github was a month ago, even before zimage release.
>>
>>107400154
>Was it even updated?
nope, that's why you go for this alternative branch >>107400117
>>
>>107398353
a Flux employee made this
>>
>>107400158
kek
>>
File: 1886748799.png (1.18 MB, 1152x896)
1.18 MB
1.18 MB PNG
>>
>>107400155
Oh I see, thanks anon.
>>
>>107400158
no, it's a real problem we encounter with zit, because they seem to not have had absolutely any qc, at least no Westerners on qc.
>>
Hey guys, just wanted to update you on the status of the base model.
Still not out yet. Teehee and never will be teehee.
>>
>>107400183
thanks for the update
>>
File: ComfyUI-ZiT-iPhone_00037_.png (1.69 MB, 1152x1152)
1.69 MB
1.69 MB PNG
It's supposed to be showing her panties damn it
>>
File: Zurbo_00009_.jpg (414 KB, 3328x1792)
414 KB
414 KB JPG
>>107400183
I don't habeeb it.
>>
I want to skip even steps, and only do odd steps.

is there a way except chaining?
>>
>>107400183
thanks for the update anon, doing god's work here!
>>
>>107400208
So like you'd want to skip step 6, but not 7?
>>
File: ComfyUI_temp_ebakh_00124_.png (2.01 MB, 1472x1104)
2.01 MB
2.01 MB PNG
Trying to make realistic non-cosplay fantastical creatures doesn't seem very easy with zimage.
>>
File: ComfyUI-ZiT-iPhone_00039_.png (1.76 MB, 1152x1152)
1.76 MB
1.76 MB PNG
>>107400186
Even with a Chinese prompt I have consistently failed miserably on this one, in case anyone wants to take a crack at it

 一张业余摄影作品,捕捉到一位姿态端庄、充满魅力的年轻日本美女气象主播,她坐在现代化演播室中一把流线型的演播椅上,面前是大型互动气象屏幕,展现出专业的优雅、知性的魅力与温暖的镜头亲和力。她的内裤略微可见,手中拿着标有温度读数的气象提示卡,在正午直播栏目中对着镜头露出亲切的微笑。

她那柔顺的长深色秀发以轻柔的波浪垂落在肩上,微微侧分的刘海勾勒出她优雅的椭圆脸庞;淡雅的妆容令她的杏眼闪烁光彩,唇边带着温柔而真诚的笑意。她的坐姿放松却专注,一只手轻轻举着提示卡,另一只手自然放在腿上,突显出她纤细而自信的身形,将可信度与微妙的吸引力融入气象播报的氛围中。

她身穿一件挺括的白色长袖衬衫,领口整洁,并佩戴着精致的吊坠项链,下身搭配短款同色系裙装,整体造型展现出精致的播报气质与女性的优雅。配饰极为简约,使观众的注意力更多集中在她的信息传递上。

充满活力的演播室背景中,有一块发光的气象屏幕,上面显示着色彩鲜明的天气图、云层图示、诸如“70°”等温度高点与区域预报;周围是一张极简风格的桌面,上面摆有麦克风、纸条与笔记。明亮的顶灯将柔和的光线洒在光洁的地板和附近显示实时数据的监视器上,营造出高科技新闻间繁忙却聚焦的氛围。

整体画面散发着媒体力量、阳光般的乐观与富有吸引力的教育特质,将气象专业与视觉温暖融为一体,呈现出一位深受喜爱的气象主持人以她坐镇演播中心的姿态,在充满动感的播报空间中为观众带来愉悦与精彩的天气解读。
>>
File: come on apple.png (724 KB, 1142x616)
724 KB
724 KB PNG
>>107399263
>So how's the apple model which just dropped? Anyone tried?

>STARFlow (3B Parameters - Text-to-Image)
>Resolution: 256×256
>Text Encoder: T5-XL
>VAE: SD-VAE
Happy new year 2022!
>>
File: Zurbo_00011_.jpg (1.05 MB, 3328x1792)
1.05 MB
1.05 MB JPG
>>107400233
So, a phone model, basically?
Damn, that's useless.
>>
>>107400233
Apple is so far behind in AI it's insane for such a huge company.
>>
File: 1759297766122509.png (1.52 MB, 1024x1024)
1.52 MB
1.52 MB PNG
>>107400231
Tried your chink prompt + NAG, didn't work either :(
>>
File: 1734861560216143.jpg (346 KB, 1024x1024)
346 KB
346 KB JPG
installed NAG, prompted "woman", got this
I'm not sure what went wrong but it's interesting
>>
>>107399704
i love this gen. creative and fun
>>
>>107400262
show your workflow you probably messed something up lool
>>
>>107398778
What did you use to generate that? It reminds me of Flux 1, so I assume Flux 2?
>>
File: ComfyUI_00139_.png (1.65 MB, 1504x1024)
1.65 MB
1.65 MB PNG
so zit doesn't know what a werewolf is.
>>
>>107400293
looks like wolf in middle of image?
>>
File: 1763721362517880.jpg (298 KB, 1024x1024)
298 KB
298 KB JPG
>>107400271
it's ok I'll find out what, now testing the presenter prompt
>>
>>107400293
can you specify something like "a fusion between a wolf and a man, a wolf with humanoid figures such as: ..." and go from there?
>>
>>107400307
did you put the cfg to 1 on the nag ksampler?
>>
>>107400307
I know what's wrong, your neg_scale is too high, don't go over 3
>>
File: Zurbo_00010_.jpg (784 KB, 3328x1792)
784 KB
784 KB JPG
>>107400269
I choose to not interpret this as sarcasm.
>>
>>107400293
>people now realizing z is just flux fp8 with flash photo loras baked in

jej
>>
File: 1763003768049386.png (47 KB, 539x824)
47 KB
47 KB PNG
>>107400314
of course

>>107400315
that was it, thanks, I was just using the default, what are the recommended values for zimage?
>>
>>107400319
it wasn't sarcasm. i love the film going over her breasts, it inspired me to think of something similar to gen. i'd love to have the prompt, but was afraid you wouldn't share
>>
>>107400158
I thought the Flux 2 images turned out very poorly. Seemed to me the winner was Chroma, albeit with a strong showing by Qwen. SD3.5 medium excelled on this particular metric (variety) but you can tell that it has other problems which are pretty serious.
>>
>>107400337
>I was just using the default, what are the recommended values for zimage?
it's written on the top of the image >>107399041
>>
File: ComfyUI-ZiT-iPhone_00050_.png (1.71 MB, 1152x1152)
1.71 MB
1.71 MB PNG
>>107400252
Shame. I imagine when they add reasoning this model shall be much better at stuff like this.
>>
File: 3321543332-4091799438.png (1.38 MB, 832x1216)
1.38 MB
1.38 MB PNG
Did someon test tis ZiT anime lora? Looks promising
https://civitai.com/models/2117052/namako-mikan-style?modelVersionId=2460349
>>
>>107400355
I'm not sure, reasoning is just supposed to rewrite and enhance a prompt, you can already do that, and that still won't show a pantyshot.
>>
>>107400330
It's based on Lumina.
>>
>>107400366
All anime loras fail to fix proportions. Head is too small for anime.
>>
File: Zurbo_00016_.jpg (1.03 MB, 3328x1792)
1.03 MB
1.03 MB JPG
>>107400338
Well, thank you, then.

Here's the prompt:
Candid 35mm photograph capturing a young woman in a vintage darkroom, mid-movement as she reaches to adjust a strip of hanging film negatives. Her messy, slept-in waves are tousled around her shoulders, with sharp winged eyeliner framing her eyes and small silver hoop earrings catching the dim light against her pale, matte complexion. Her right arm is extended upward as her hand grasps the film strip. Another film strip is naturally crossing her breasts.

But it'll be very messy since I've been playing around with a lot of random noise and some of my own nodes. But you'll get there, for sure. Z will listen.
>>
>>107400410
>>107400410
>>107400410
>>107400410
>>
>>107399041
thanks anon
>>
File deleted.
>>107400186
>>
>>107400395
thanks for the prompt, i appreciate it!
>>
>>107400124
I am 100% lawful according to federal and state law of wherever I happen to reside without question.
>>
File: ComfyUI_00014_.mp4 (733 KB, 1280x720)
733 KB
733 KB MP4
>>107399372
>>
>>107400767
Video workflow please?
>>
I WANT Z-IMAGE QUALITY AND SPEED WITH QWEN EDIT ACCURACY AND PROMPT ADHERENCE!!! ... and pussies. Is that too much to ask???? Huh? Huh!?
>>
>>107400464
No problem!
>>
>used to only generate through Civitai since my PC was dogshit
>get a new, somewhat decent PC
>finally able to gen locally, come to this thread to start learning
>see that there's a website that archives stuff that was deleted Civitai
>it has what I thought was a long lost LoRA for my favorite character

Bless the internet! Time to plunge even further down into my degenearcy.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.