[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: collage.jpg (2.72 MB, 3210x3078)
2.72 MB
2.72 MB JPG
Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106696274

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
File: AniStudio_0109.jpg (302 KB, 768x1024)
302 KB
302 KB JPG
>>
File: 1756587203701534.png (74 KB, 1725x374)
74 KB
74 KB PNG
why do you need two text encoders in qwen image edit? in the example wf I see only one used
>>
Blessed thread of frenship
>>
so what are the best of da best of flux models for generating a lot more detail in real human characters? Like torn/worn clothes and certain period clothing for example.
in the midst of enjoying wan and crazy fast llm speeds i kinda forgot the part were i should really jump from epicrealism and realismengine kek
>>
>>106700489
you dont, only need one

qwen_2.5_vl_7b_fp8_scaled.safetensors
>>
>>106700474
ty for bake
>>
>>106700489
mmproj is needed if you use GGUF
>>
>>106700511
thanks anon
it's better than the mmproj one?
>>
>>106700482
He really needs to distance himself from all of the things in that picture if he no longer wants to be industry poison.
>>
>>106700528
I assume so, no issues with it
>>
File: ComfyUI_00022_ - Copy.png (697 KB, 768x768)
697 KB
697 KB PNG
>>106699330
Sorry for taking a while to respond. This is what I got with a basic bitch ass prompt. I'm using GNER but stock t5 should be fine too.
>>
>>106700525
how so?
as a second clip encoder? to do what?

>>106700548
ok thanks
>>
Friendship thread.
Long live to local models.
Don't go to the anime one, it's toxic.
Post anime here.
>>
File: 1747391077810497.png (1.05 MB, 1088x952)
1.05 MB
1.05 MB PNG
qwen edit: "he is holding a nintendo switch"

then with the back image of a switch 2 on that image: "the device the man is holding in image1 has the appearance of the device in image2".
>>
>>106700555
GGUFs don't have multimodal/vision by default in their quants/files, it's a separate mmproj file. It's automatically loaded by the GGUF nodes provided its in the same dir with the appropriate filename.
>>
>>106700556
Go back to your empty dump instead of trying to pit two better threads against each other you disabled faggot
>>
>>106700508
Examples of the sort of image you mean? Flux Krea looks more realistic by default than any Flux Dev merge or lora, anyways, IMO, and it has better prompt adherence. Likes a bit higher guidance than regular Flux also, I usually use Euler Beta with Guidance 4.5.
>>
>>106700571
fucking slopped to death, you better add some grain
>>
>>106700582
it's the 40s. images weren't as sharp then.
>>
>>106700531
?
>>
File: 1734413875551837.png (1.06 MB, 832x1248)
1.06 MB
1.06 MB PNG
the woman in image1 is holding a nintendo switch system, with the appearance of image2 on the back of it.

now this is advertising.
>>
>>106700585
they weren't as plastic either
>>
>>106700573
I actually use a gguf, so I should use that file and put it in the text_encoders folder as recommended by the screenshot and using the same name as q8 gguf I have, or instead put it in the gguf folder itself?
>>
>>106700592
>image1
>image2
so that's how the model "sees" the images? I was wondering if it was a stitch of them or if it "knew" the names of the inputs
>>
File: 00485-3159378300.png (3.31 MB, 1248x1824)
3.31 MB
3.31 MB PNG
>>106700580
literally picrel but not obviously sdxl level quality kek, and like i said actual weathered/dirty clothing and not some studio quality looking shit
there's a few flux amateur photoreal checkpoints im looking at that look pretty good but most people don't really do anything beyond the same concepts so i'm not sure. here's one example
https://civitai.com/models/978314/ultrareal-fine-tune?modelVersionId=1413133
>>
File: 1743005292851057.png (28 KB, 1135x325)
28 KB
28 KB PNG
>>106700599
like this (aka dont change the fucking name)
>>
Is Wan 2.2 5B model useless? I've never touched video AI stuff but I followed this workflow: https://docs.comfy.org/tutorials/video/wan/wan2_2
No matter what I do, there's like 0 action if I use image to video, and without the results are quite horrible
>>
>>106700610
with this version you can just describe the image elements, but you can also refer to them by node cause the new text node has image1/image2/image3 as inputs.
>>
>>106700616
alright, I'll do that, thanks anon
>>
>>106700571
Was the original Hitler actually a real photo?
>>
>>106700632
hitler's not real dude
>>
>>106700612
i thought that was nicki minaj from the thumbnail
>>
>>106700635
it was actually a woman
>>
>>106700623
it's way easier for what I want to do, thanks
>>
File: 1744626084116714.png (904 KB, 832x1248)
904 KB
904 KB PNG
I like how edit v2 also doesn't make people hobbits in edits.

Show a side profile of the woman who is standing up in the same room.
>>
File: 1740081901077936.png (3.3 MB, 2560x1440)
3.3 MB
3.3 MB PNG
>watching anime about how AI will destroy the world
>protag is using comfyui
cant make this shit up
>>
>>106700640
did you cume at least?
>>
>>106700673
no i dont like fat cotton planters
>>
>>106700679
Understandable.
>>
>>106700673
how could i cum when it wasnt nicky as an egyptian queen with a fat ass? what a silly question
>>
>>106700667
lmaooo
>>
>>106700667
>average civitai workflow
>>
File: 00021-324630917.png (3.25 MB, 1248x1824)
3.25 MB
3.25 MB PNG
>>
>>106700743
cringe plastic bitch
>>
>>106700747
you could be describing like 99% of flux gens i see here and on civitai honestly
>>
File: 1758269144680770.png (1.18 MB, 1176x888)
1.18 MB
1.18 MB PNG
>so mr altman apparently you thought people would pay $1000 a month for limited prompts over open source...a foolish move.
>>
File: 00023-871518679.png (2.62 MB, 1248x1824)
2.62 MB
2.62 MB PNG
>>
>>106700622
It's pretty useless. Any benefits from the reduced model size get erased by the hard coded resolution. If you're vram/ram limited it's better to just use a smaller quant.
>>
>>106700667
>comfyui created nodegraphs
You just did make it up.
>>
File: 1734927925453003.png (1.09 MB, 1152x904)
1.09 MB
1.09 MB PNG
>>
File: ComfyUI_00218.mp4 (1.98 MB, 608x960)
1.98 MB
1.98 MB MP4
Is the new Qwen edit uncensored or do I need a LoRA
>>
File: elf-hugger_00785_.png (3.02 MB, 1088x1920)
3.02 MB
3.02 MB PNG
so Flux pro is now in photoshop
>>
>>106700875
10/10 would take her home alongside my new copy of the limited special $150 edition of LOST SOUL ASIDE
>>
>>106700875
send this to joosten and tell her it's an unlockable scene in Kojima's new game.
>>
File: 00032-2936808018.png (2.99 MB, 1824x1248)
2.99 MB
2.99 MB PNG
>>
Has anyone made an offline AI image generator that can generate image prompts as good as Gemini? I want to make porn of my favourite anime and video game characters
>>
>>106700786
it actually looks like UE
>>
>>106700931
it looks like an abstract japanese representation of a nodegraph. it's unusable garbage is more like it and it's webshit too kek
>>
File: 1753194009425666.png (888 KB, 896x1160)
888 KB
888 KB PNG
the anime girl is typing on a computer on a desk, with a white CRT monitor and white tower. Keep her expression the same. she is wearing a black t-shirt. keep the text "kit-aura" in the image.
>tfw when new AI models
>>
File: 00038-806821494.png (2.89 MB, 1824x1248)
2.89 MB
2.89 MB PNG
>>
I want to train a character lora. The character wears a bunch of detailed accessories that the AI always gets wrong and makes it look like slop. How can I fix that? Should I train each accessory as a separate lora by itself? Inpainting to fix those details is unreliable and time consuming.
>>
File: 1747826054580758.png (890 KB, 896x1160)
890 KB
890 KB PNG
>>106700953
>>
can you use qwen image loras in qwen edit?
>>
File: 1732969123880653.png (1.82 MB, 1024x1536)
1.82 MB
1.82 MB PNG
>>106700982
2nd pass with qie destroys even further. qie pixelspace when?
>>
File: 1738656653365036.png (1.15 MB, 1360x768)
1.15 MB
1.15 MB PNG
the anime girl is pointing at a large neon sign with the text "LDG" in teal color text.

from mikudonalds ad
>>
File: 1753716307964165.png (1.18 MB, 1360x768)
1.18 MB
1.18 MB PNG
>>106701023
better neon sign:
>>
>>106700968
Include closeups in the dataset?
>>
bros is vpred worth it?
>>
File: 1757796893080558.png (856 KB, 1360x768)
856 KB
856 KB PNG
the anime girl is sitting on a black couch in front of a large neon sign with the text "LDG" in teal color text.

kinda neat it can figure out the 3d model proportions despite only a side profile as reference, didn't even say "miku hatsune".
>>
>>106701051
All signs point to yes
>>
>>106701051
All signs point to no
>>
>>106700772
Lul
>>
can flux even do nfsw without a billion loras?
>>
Maybe we should commision /3/ or /ic/ to design a unique character as /ldg/'s mascot. LDG tan. Then we can train a lora with it.
>>
It is literally impossible to make Wan understand that I want breasts to look natural, to jiggle and bounce and squish and behave as real breasts do. Nothing I put in the prompt window has any slightest effect on it. If the breasts in the starting image are ambiguous, it will choose to make them stiff and fake no matter what, 100% of the time.

I am so fucking tired.
>>
>>106701051
All signs point to maybe
>>
File: 1752683354072639.mp4 (1.56 MB, 720x1072)
1.56 MB
1.56 MB MP4
>>106700592
>>
>>106701234
You should batch a gen of 50 pics of the same girl, leave all the six fingers and scuff in, and train the lora with it.
>>
File: 1750754522233491.png (609 KB, 640x640)
609 KB
609 KB PNG
>>106697494
https://www.reddit.com/r/StableDiffusion/comments/1nqm5l0/images_from_the_huge_apple_model_allegedly/
here's some images on HunyuanImage 3.0, this shit is gigaslopped lmao
>>
>>106701256
we're so lucky to have all these neat tools like wan/qwen/noob/illustrious
>>
>>106701051
At this point (it's been around for years), if it was actually worth it, it would be the default

This is just furries and weebs (aka severe autists) who are obsessing over some imagined color improvement
>>
>>106701298
>This is just furries and weebs (aka severe autists)
aka 90% of the fags in this hobby
>>
File: 1744463994961662.mp4 (2.41 MB, 720x1040)
2.41 MB
2.41 MB MP4
>>106700612
>>
File: 1740268835112219.png (2.82 MB, 1728x1344)
2.82 MB
2.82 MB PNG
>>
>>106701297
2025 is a coomer's paradise.
>>
>>106701256
hpru shjet
>>
>>106701249
https://civitai.com/models/1852647?modelVersionId=2096600
works with i2v too
>>
>>106701345
and this is the worst it will ever be
>>
>>106701403
Honestly, this is enough for my needs

I'll gladly take improvements and they will obviously come, but the current state far exceeds any expectations I had going into local ai image/video gen
>>
File: 1741925909030886.png (2.73 MB, 1728x1344)
2.73 MB
2.73 MB PNG
>>106701332
>>
>>106701322
>Road to El Dorado live action remake could actually be great
>>
File: 1736292908559610.png (1.77 MB, 1368x1612)
1.77 MB
1.77 MB PNG
>>106701270
https://xcancel.com/cannn064/status/1970659710220349509#m
>>
>>106701298
>it's been around for years
kek
>>
>>106701453
>gpt anime sameface
DoA
>>
File: RA_NBCM_00024.jpg (737 KB, 1872x2736)
737 KB
737 KB JPG
>>
>>106701322
ÆÆÆÜÜÜGGH thank you for making me cume sir
>>
>>106701270
>here's some images on HunyuanImage 3.0, this shit is gigaslopped lmao
Yeah, hopefully it will take well to lora training

But everything except for Chroma is 'gigaslopped' these days since the rest primarily train on synthetic data and stock photo, again it usually can be fixed with lora / finetuning, but it also depends on how big the model is for it to be realistically viable
>>
>>106701453
They can keep it. Industry grade != benchmax. It's funny that these teams make waves in the llm space but can't capture the Dalle, 4o and Imagen audience. Hmm I wonder why...
>>
>>106701256
>>106701322
why they move in slow motion???
>>
>>106701514
they are not moving in slow motion, you have brain tum0r
>>
File: 1736909347118327.png (1.4 MB, 1176x880)
1.4 MB
1.4 MB PNG
replace the man in the black tracksuit with the anime girl in image2.

no offense paulie, it was just the easiest swap description
>>
>>106701512
they still have the llm way of thinking, synthetic data training = good mememarks, but for imagegen that's really useless, no one want to generate plastic humans even if the model can add a blue cube on the left and a red sphere on the right, sigh...
>>
File: 1736131520247800.png (1.08 MB, 744x801)
1.08 MB
1.08 MB PNG
>>106701520
and once again, anri as a test
>italian sausage
>>
>>106701512
Watch it being nearly identical to qwen just like the hunyan image model, kek.
>>
File: 1748647912312963.png (1.31 MB, 1176x880)
1.31 MB
1.31 MB PNG
>>106701554
>>
>>106701554
doesn't anri have way larger tits?
>>
>>106701554
damn this is bad, it's like they copy pasted the girl without considering the lightning of the original image
>>
>>106701562
well she has two phases. now she's giga milk mode. that was after she got pregnant.
>>
File: 1745363897850864.png (1013 KB, 1136x920)
1013 KB
1013 KB PNG
the blonde anime girl is wearing a gold crown, and on the seat of the car is a stack of rectangular boxes that say "Nvidia RTX 5090" with the Nvidia logo on the box. A champagne bottle is in a bucket of ice to the right.
>>
what's the best general way to use wan 2.1 loras (not light) with wan 2.2, full strength on low noise, split evenly between high and low, full strength on both high and low?
>>
>>106701641
high pass, 2.1 lora at 3 strength, low pass, 2.2 low lora at 1 strength

seems to work for me, when 2.2 came out the kijai workflow was using 2.1 for both, with high at 3, low at 1.
>>
https://github.com/comfyanonymous/ComfyUI/pull/9979
the memory leak fix has been merged
>>
>>106701641
>>106701657
>high pass, 2.1 lora at 3 strength, low pass, 2.2 low lora at 1 strength
I also add high pass 2.2 lora at 0.4 strengh to get less blurriness
>>
>>106701667
2.2 high can cause motion issues but at that strength it should be ok
>>
File: WANI2V__00206.mp4 (2.38 MB, 1112x824)
2.38 MB
2.38 MB MP4
>>106701561
>''Ay marone, these are some nice gabagools!''
>>
>>106701680
wan is really incredible desu, probably the only non meme local model so far
>>
>>106701657
>>106701667
I wasn't talking about lightning loras but thanks ill keep it in mind
>>
>>106701680
neat, boob grab lora or something like that?
>>
>>106701680
>''Ay marone, these are some nice gabagools!''
lmao
>>
>>106701700
>>106701398
and some simple prompts; the fat man in the dark blue shirt uses his hand to fondle the breasts of the woman wearing a green bikini.
>>
File: 1729030614309247.png (1.08 MB, 832x1248)
1.08 MB
1.08 MB PNG
the japanese woman is wearing the outfit of the man in image2, with a helmet, and a blue suit with golden shoulder armor, with a cleavage cutout. keep her expression the same.

meet judge dr- anri:
>>
File: 00072-1506818161.png (3.08 MB, 1248x1848)
3.08 MB
3.08 MB PNG
>>
File: 1740776309711008.png (1.66 MB, 1120x1440)
1.66 MB
1.66 MB PNG
>>106701772
what a coincidence, I'm genning asuka too
>>
File: 1738384016649579.png (1.24 MB, 832x1248)
1.24 MB
1.24 MB PNG
the japanese woman in image1 is wearing the outfit of the anime girl in image2 wearing a red bodysuit. keep her expression the same.

not bad. just need an edit to fix the number but that's pretty good overall.
>>
File: 1734685968814943.mp4 (3.71 MB, 1920x1080)
3.71 MB
3.71 MB MP4
https://xcancel.com/HaochengXiUCB/status/1971219731140182423#m
ready for another cope speedup?
>>
>>106701830
Woah! A real free lunch this time!
>>
>>106701830
Sparsity will unironically make a huge comeback.
>>
>>106701830
>wan2.1

where wan2.2
where node
>>
>>106701852
check under your foreskin
>>
File: 1746544525755321.png (1.78 MB, 1024x1024)
1.78 MB
1.78 MB PNG
>>106700474
what are some of the larger models i can throw in to 96gb of vram?
SDXL has been my go-to for a while but I feel like I could do some more interesting stuff now with all the extra space
sadly invokeai doesn't seem to support WAN models, kinda wanted to mess around with video stuff but i might need to find an alternative just for that
>>
File: pass.png (1.99 MB, 2561x847)
1.99 MB
1.99 MB PNG
>>106701830
it's still quite different to the original video, pass
>>
File: 88406020.mp4 (3.8 MB, 480x848)
3.8 MB
3.8 MB MP4
>>
File: RA_NBCM_00027.jpg (699 KB, 1872x2736)
699 KB
699 KB JPG
>>
>>106701867
Pretty sure https://huggingface.co/Qwen/Qwen-Image is the largest open model currently
>>
>>106701886
hehhh, that's pretty good! model?
>>
File: 1746504362410572.png (1.91 MB, 1120x1440)
1.91 MB
1.91 MB PNG
>>106701787
>>
>>106701658
for fuck's sake it's slower now
>>
>>106701867
>>106701888
the largest video model is step video (30b)
>>
File: 1738183956396187.png (1.17 MB, 832x1248)
1.17 MB
1.17 MB PNG
also pretty good
>>
>>106701892
I still get the memory leaks, each time I click gen it's rng if I get oom or not
>>
>>106701897
My b I meant largest image model
>>
>>106701886
neat
>>
File: KreaRefined.jpg (3.51 MB, 4096x2048)
3.51 MB
3.51 MB JPG
>>106701270
Yet another Chinese model I'll just denoise with Flux Krea then at least for photographic stuff lol
>>
>>106701919
looks like slop either way lol
>>
File: 1753927350372977.png (988 KB, 880x1176)
988 KB
988 KB PNG
>>106701898
diff girl, same psylocke but a diff pose

and this is why china > openAI
>>
>>106701919
are you using the unslop refiner for hunyuanImage?
https://github.com/comfyanonymous/ComfyUI/pull/9882
>>
>>106701886
that looks too good to be an open model, it's Seedream right?
>>
>>106701925
>this is why china > openAI
bruh it literally mixed her realistic face with an anime body
>>
>>106701959
can fix that with "make it realistic" in another prompt.
>>
>>106701888 >>106701897 >>106701903
hm, seems i'll prob have to use a different frontend to go outside the box I've been confined to
>>
>>106701889
NoobAI
>>
File: 1741501890424957.png (912 KB, 880x1176)
912 KB
912 KB PNG
killer bee:
>>
>>106701923
Well yeah a native Krea version is better. I had to denoise at 0.6 strength to even clean up the 2K Hunyuan output that much.
>>
File: 1733923744960094.png (1.09 MB, 832x1248)
1.09 MB
1.09 MB PNG
>>106701988
and laura from sf5, why not
>>
File: 1754447048482392.jpg (452 KB, 1932x1086)
452 KB
452 KB JPG
>>106702018
I wonder how the math works for edit models, or how it can take this and translate it to a figure/inpaint/etc. pretty cool even if I don't get how it works entirely.
>>
File: 1752096716523720.png (1.26 MB, 896x1160)
1.26 MB
1.26 MB PNG
holy shit, I prompted "make the man a n-word" and it worked. I assumed it might not work cause it's not the "proper" prompt.

china doesn't care I guess!
>>
File: file.png (137 KB, 498x280)
137 KB
137 KB PNG
>>106698483
"Bargaining phase."
THE CHANCE IS NEVER 0 BITCH
>>
>>106701830
Finally, wan getting some love once again, dont suppose there's any mention of running this in comfy?
>>
File: 1746936553847301.png (1.06 MB, 1448x720)
1.06 MB
1.06 MB PNG
>>
File: 1743996505739008.png (823 KB, 1288x808)
823 KB
823 KB PNG
>>
File: 1729432024982850.png (644 KB, 1288x808)
644 KB
644 KB PNG
>>106702106
>>
>>106702106
that one is pretty decent desu, I wished it worked as well on 2 characters
>>
File: 1728082938815358.png (635 KB, 1288x808)
635 KB
635 KB PNG
>>106702121
one more
>>
Is qwen 2509 slopped? Haven't tried either one but figured I 'd start with that one and it's already fucked up the first two prompts I tried.
>>
File: 1755771797441022.png (1.39 MB, 1024x1024)
1.39 MB
1.39 MB PNG
replace the girl with Miku Hatsune wearing the same outfit.

I like Chie but just a test.
>>
>>106702147
>I like Chie
she's the goat
https://youtu.be/w7lj9qI8VFc?t=227
>>
File: 1727899696188181.png (1.26 MB, 1024x1024)
1.26 MB
1.26 MB PNG
007: architects die another day
>>
Just got everything installed correctly i think, any guides to make good prompts?
>>
>>106702184
The best guide is your own two eyes
>>
File: 1752988361134815.png (1.3 MB, 1024x1024)
1.3 MB
1.3 MB PNG
>>106702172
>>
File: 1729803878865322.png (1.01 MB, 1176x880)
1.01 MB
1.01 MB PNG
the man has his arms at his side. the man is holding up a vanilla ice cream cone and giving the thumbs up.
>>
File: ComfyUI_00523_.png (2.44 MB, 1192x1744)
2.44 MB
2.44 MB PNG
>>106702142
Guess the official comfyui workflow doesn't work with qwen 2509? The original model seems to work fine.
>>
File: 1729349108099124.png (28 KB, 705x246)
28 KB
28 KB PNG
>>106702142
you need this node in place of the old one.
>>
>>106702142
>Is qwen 2509 slopped?
it's just a finetune of the older QIE, so yes, it's as slopped
>>
File: 1756424876948472.png (1023 KB, 1024x1024)
1023 KB
1023 KB PNG
the anime girl is wearing black pixel art sunglasses. at the bottom is a large subtitle saying "DEAL WITH IT" in white stylish text with a black outline.
>>
File: 00302.jpg (1.4 MB, 2896x1440)
1.4 MB
1.4 MB JPG
>>
>>106702142
>is this Chinese model slopped
yes? they're all are (the exception would be Seedream)
>>
File: settings.jpg (182 KB, 2469x867)
182 KB
182 KB JPG
Do these settings look good for my Chroma wf? I'm trying to get as much detail as I can. First is the loader config, second is txt2img, third is upscaling. Ignore the upscale_by 1, I prescale using a custom node to x2 size.
>>
>>106702238
Sweet, thanks anon.
>>
File: 1739305081936573.png (1.54 MB, 1024x1024)
1.54 MB
1.54 MB PNG
>>
File: pass.png (343 KB, 603x703)
343 KB
343 KB PNG
>>106702272
cfg 4 is enough desu. clip should be set to chroma and not SD. Tokenizer is a snake oil. For second pass I had weird result with beta schedulers when they tried to second pass themselves and it resulted in a weird shifted output. Res_2s is very heavy and will take forever to run so keep that in mind
>>
>>106700626
Nta, but you're talking about a gguf of the text encoder right? If you're just using a gguf of the model itself and not the text encoder the extra file is not necessary.
>>
File: 1729064006680232.png (770 KB, 832x1248)
770 KB
770 KB PNG
if wan/qwen were made by the saudis instead of china:
>>
are pickletensor files really dangerous?
>>
>>106702336
porn is banned in China lol
>>
>>106702337
They're dangerous in the same way an activated mine is dangerous vs an unactivated mine is.
>>
>>106702321
>Tokenizer is a snake oil.
You min min padding? I thought the guy who made Chroma recommended it himself? His own workflow has a custom padding removal node which does the same thing.
>I had weird result with beta schedulers when they tried to second pass themselves and it resulted in a weird shifted output
True, I've actually noticed that in a few upscaled gens but didn't know the scheduler was the cause, thanks.
I know res_2s is heavy, I've got a 4090 though and don't mind the wait given the quality increase that I observed.
>>
File: 1756005856572430.png (1.21 MB, 832x1248)
1.21 MB
1.21 MB PNG
>>106702336
but since it isn't

give the woman a green suit of armor from the videogame Halo, with cleavage.

>>106702349
lots of stuff is technically banned but it's a free for all there, 5090s are banned but they are freely bought.
>>
File: 1758341651193196.jpg (735 KB, 1792x2304)
735 KB
735 KB JPG
Saw those gens of the Major last thread and some days back.
Sexualizing the Major just never feels right to me.
>>
>>106702370
she did her job in a leotard and a jacket, showing off your body doesn't make you exploited.
>>
>>106702272
Why these nodes are not connected?
>>
File: 1740121415233926.png (1.18 MB, 768x1360)
1.18 MB
1.18 MB PNG
give the japanese woman a spotted white cow themed bikini and a visor with cow ears.
>>
File: 1739929780657494.png (497 KB, 832x1248)
497 KB
497 KB PNG
new yumi is neat
>>
File: 1732665099181953.png (1.25 MB, 1360x768)
1.25 MB
1.25 MB PNG
now this shows what qwen edit v2 can do.

the man is drinking a bottle of jack daniels which is half empty. the green poster on the left is ripped in half. The red bottles of soda on the right are empty. On the wall behind the man is "SHILL MAN" spray painted on the wall with black spray paint.
>>
File: ComfyUI_03378_.mp4 (1.8 MB, 1024x1024)
1.8 MB
1.8 MB MP4
>>106702280
Tits too small.
>>
>>106702397
Because they're three separate screenshots combined into one, showing only the nodes relevant to my question.
>>
>>106702351
>You min min padding?
IIRC the 'official' from the author was:

min_padding 1
min_length 3

But Comfy changed it to (because he thought it 'looked better'):

min_padding 0
min_length 3

In their example workflow, what workflow of his are you referring to ?
>>
>>106702426
im the butterfly
>>
File: 1731050310179655.png (1.23 MB, 1360x768)
1.23 MB
1.23 MB PNG
>>106702420
the man is holding a sign saying "man I LOVE Doritos!", while a black pistol is pointed at him from a man off camera. only their arm is visible holding the gun.
>>
>>106702383
fuck you, you know what I'm talking about
>>
File: y.jpg (162 KB, 1024x1024)
162 KB
162 KB JPG
>>
File: 1729839249965292.jpg (157 KB, 710x1000)
157 KB
157 KB JPG
>>106702448
official art did that all the time, motoko being hot and strong is part of her persona.
>>
>>106702420
>now this shows what qwen edit v2 can do.
making his skin as smooth as porcelaine?
>>
>>106702430
The one on his repo uses min padding 1, length 0, that's where I copied the initial settings from.

>https://huggingface.co/lodestones/Chroma/blob/main/ChromaSimpleWorkflow20250507.json
>https://huggingface.co/lodestones/Chroma/blob/main/ChromaSimpleWorkflow20250507_overview.png
>>
File: 1740528574319351.png (1.02 MB, 1416x2128)
1.02 MB
1.02 MB PNG
>>
>>106702430
>But Comfy changed it to (because he thought it 'looked better'):
>min_padding 0
>min_length 3
he changed the values (as a bandaid) because his implementation sucks ass and he's not bothered to fix it
https://github.com/comfyanonymous/ComfyUI/pull/7965
>>
>>106701332
i didn't even know she was sick
>>
>>106702448
The scene where she fights the tank has a closeup of her erect nipples in a body suit when she's struggling with the hatch on the tank. What do you think the point of that was?
>>
>>106700474
I started using sd.webui and now I can run it through a Cloudflare Tunnel to gen wherever, but the issue I'm facing now is that I sometimes need to restart it but can't.
Like, generation freezes at 100% and the only fix I can find is closing the terminal then running run.bat again...

Is there any fix to this through the UI itself? "Interrupt" doesn't work and neither does "Skip" when it happens
>>
File: 1735001710316058.png (1.26 MB, 1024x1024)
1.26 MB
1.26 MB PNG
>>
File: 00220-2406164603.png (3.16 MB, 1248x1848)
3.16 MB
3.16 MB PNG
>>
>>106702503
kek
>>
File: 1738646539792126.png (119 KB, 303x319)
119 KB
119 KB PNG
>>106702370
>>
>>106702481
>because his implementation sucks ass
That whole issue is about the min_padding

Are you retarded ?
>>
>>106702524
min_padding = 1 works fine if you use lodestone's implementation, not comfy's one (because that one is fucked up), right back at you retard? are you dumb or something? all he had to do is to do a 1:1 copy paste of his implementation, and he didn't, and now he's surprised why it doesn't work the way it should
>>
File: 1733544455719539.png (829 KB, 1176x880)
829 KB
829 KB PNG
frens?
>>
>>106702539
>not comfy's one
Comfy did not write the Chroma implementation you stupid fuck, it was submitted by a core Chroma contributor, silveroxides

The whole argument was the min_padding
>>
>>106702539
Which is one of the many reasons why I don't use his UI, I don't like overly opinionated spergs when it comes to basic functionality look at the disaster wayland was before valve stepped in.
>>
>>106702574
>Comfy did not write the Chroma implementation you stupid fuck, it was submitted by a core Chroma contributor, silveroxides
and? someone made a PR to fix that implementation and he didn't merge it, Comfy doesn't care about Chroma, he's more interested about making flawless API nodes
>>
>>106702574
>wrong
>doesn't fix it
To the other anon that's arguing with this dipshit just stop he's fucking retarded
>>
>>106702574
>it's not Comfy's fault that he's merging bad PR implementations
YES IT IS RETARD, IT IS HIS FUCKING JOB TO VERIFY IS WHAT GOES TO HIS OFFICIAL REPOSITORY IS LEGIT OR NOT, KYS
>>
File: mikuuu.png (990 KB, 896x1152)
990 KB
990 KB PNG
>>
File: 1742654669340052.png (2.9 MB, 1416x2128)
2.9 MB
2.9 MB PNG
swords could be worse i guess
>>
File: 1741631537258831.webm (159 KB, 640x480)
159 KB
159 KB WEBM
>>106702562
>>
File: 1756731974657690.png (1.31 MB, 864x1208)
1.31 MB
1.31 MB PNG
the blue hair anime girl is wearing a hot dog suit outfit.

I didn't expect this prompt to work but...it does in fact work. testing models with odd prompts is a good way to see what you can/can't do:
>>
>>106702591
There's nothing wrong with the implementation other than this setting which Comfy fucked with

Show me something else that is wrong

The PR has lodestones blessing, he and silveroxides are best buds, likely lovers

Dumbass retard
>>
Lots of amazing models coming to SaaS recently
>>
File: 1732636779362535.png (1.32 MB, 864x1208)
1.32 MB
1.32 MB PNG
>>106702608
better suit, now it has the original hoodie from the image.
>>
>>106702574
>i will not merge a bugfix thats been sitting for months because someone made an... error
dont post again on this site lil techlet jamal
>>
File: go fuck yourself.png (856 KB, 1080x402)
856 KB
856 KB PNG
>>106702610
>There's nothing wrong with the implementation other than this setting which Comfy fucked with
you are sooooo fucking retarded dude, you don't know what you're talking about, the 2 implementations are completly different, Comfy's one doesn't use the same tokenizer as lodestone's one, and that's why you get fried shit, this is the last time I respond to you, you seem completly braindead, lurk more you fucking faggot
>>
>>106702608
>mustard
disgusting
>>
>>106702632
>using half a year old versions
>>
>>106702632
>months old image
lol, can't run a v30 comparison yourself
>>
>>106702644
the arch of the model didnt change, jamal
any other low iq take to continue on with your public humiliation?
>>
>>106702644
>>106702646
>nooo, you don't understand, the implementation is the exact same but the image is different because... reasons...
(You)
>>
>>106702652
>>106702651
You get absolutely minimal diferrence on current chroma no matter if you use the padding or not. You are retarded.
>>
Deliver us from evil, A*******o.
>>
>>106702658
>You get absolutely minimal diferrence on current chroma
prove it (you won't)
>>
>>106702632
Show me a fucking image from a checkpoint that isn't ancient you dumb fuck

v28, are you insane ?

Go ahead, not even lodestones gives a shit about this PR
>>
>>106702664
>v28, are you insane ?
are you fucking retarded? the implementation was made during v28 that was why they used v28 to show that there was a problem, omfucking god why is there so many low IQ subhumans in this fucking thread???
>>
>>106702646
>lol, can't run a v30
Another ancient checkpoint with zero relevance

Stop, just stop
>>
File: 1731496567445259.jpg (652 KB, 1416x2128)
652 KB
652 KB JPG
>>
any advice on reordering sockets in a subgraph in comfy? why doesn't the right click menu have a move up/down option bro... I'm not even autistic and I thought of this QoL like literally immediately...
>>
File: 1734889195991498.png (866 KB, 1336x784)
866 KB
866 KB PNG
>>
>>106702658
>minimal diferrence
Thanks for conceeding.
>>
>>106702671
Show me this minimal difference actually manifesting in chroma1-hd or base
>>
>>106702684
what's there to concede? you claimed that there is minimal difference, you have therefore the burden of proof and you don't want to prove it, I'm the one accepting your concession
>>
>>106701298
retarded slopper
>>
Please... just a crumb of side by sides... a sliver even
>>
>>>/pol/517249748
>>
File: combined_image.jpg (889 KB, 2048x2408)
889 KB
889 KB JPG
kys all of you
>>
>>106702721
I made this. Feels good to still see it get reposted.
>>
File: 00018-3868416254.png (1.75 MB, 1024x1344)
1.75 MB
1.75 MB PNG
I tested chroma extensively and I have to say once we're getting full strength loras turning into irl gens without heavy weighting for against realistic if you decide to do certain actions, we have serious fucking problem with how this model was trained. I'm still trying to figure out why the fuck he decided to copy the pony creator and obscure so much shit, from what I can tell that alone has fucked the data to it's core, not only that it becomes mandatory to put the steps higher just for it to be cohesive in certain actions
Pic related random seed came out to 3D while all the others didn't because applies this type of shit to common actions
>>
>>106702729
>he's still missing the point
the problem isn't the min padding value, it is the inner code that you can't change on your node, that's why there's a PR to change that code and make it similar to the official Chroma's one, thanks for proving again you don't know what the problem really is
>>
>>106702729
... you need to use Padding Removal from Fluxmod.
https://github.com/lodestone-rock/ComfyUI_FluxMod
>>
File: 1743365382121169.png (332 KB, 3677x888)
332 KB
332 KB PNG
>>106702729
>>106702737
to be more clear, the issue is there
>Chroma's implementation by lodestone uses MOCHI's tokenizer, and for some reason Comfy's implementation uses PIXART's tokenizer, that difference is the reason you get different images (and fried shit on comfy's side) >>106702632
>>
>>106702749
>run through esoteric workflows and deprecated node hoops
Fuck off. You lost. Deal with it.
>>
sdxl won
>>
>>106702737
There is no problem, some minimal difference in an ancient epoch is inconsequential

If you could at the very least show same minimal difference in the actual final relelases you would at least have some claim, but you don't

Which makes me conclude there is no difference in the final releases

Also recently Chroma Radiance was merged, no complaints
>>
>>106702757
Is there a custom Load Clip node with the corrected tokenizer for CLIPType.Chroma?
>>
>>106702766
>some minimal difference in an ancient epoch is inconsequential
it's not minimal at all, do you have eyes? >>106702632
>>
I like pretending that Lodestone / his ilk don't post here. Makes it more fun.
>>
>>106702767
The multigpu nodes nodes have specifically Chroma as a clip type but idk if it includes it
>>
>>106702767
>Is there a custom Load Clip node with the corrected tokenizer for CLIPType.Chroma?
no, you have to change the inner code to get that result, that's why this PR exist >>106702481
>>
I really do loathe them. I have nothing wrong with Chroma itself. It's the fact they keep acting like it's more consequential than it actually is that bothers me. It's just a shitty flux fine tune.
>>
>>106702757
>and fried shit on comfy's side
Been using Chroma on Comfy AND Forge, neither are fried and there's no percetable quality difference

Enough with your bullshit lies
>>
File: ComfyUI_00275_.mp4 (677 KB, 720x720)
677 KB
677 KB MP4
>>106702735
mhmm, very nice anon, where are the new pancake girls?
>>
>>106702778
>there's no percetable quality difference
prove it (again you won't and I accept your concession in advance)
>>
>>106702777
>It's just a shitty flux fine tune.
To be fair you could say the same about Pony being a shitty XL fine tune. That thing was CARRIED by LoRAs.
>>
>>106702768
Give link to this PR
>>
frogslop avatar, ignored
>>
>>106702788
-> >>106702481
>>
File: 00025-507773844.jpg (450 KB, 2048x2688)
450 KB
450 KB JPG
I seriously feel bad for whoever has to finetune this mess to fix it.
>>106702759
I kind of agree simply because everyone trying to make the next model keeps shooting themselves in the fucking foot when all they have to do is not alter shit or do retarded shit like fuck with tags. We've been over this many times already once you start obscuring shit you basically kill a model, it happened to SD 3 and it happened to pony and it happened to chroma. Something as simple as protesting and selfie should not be so poorly weighed that it behaves like a token set to 2.0 strength by default. Also mutliple characters and interactions are childs play with any model I have shown this for YEARS, so when you tout it natively don't have your tokens so fucked that some characters will always duplicate at normal strength and only act normal at .5.
Also what the fuck is up with the banding on the base model during high res pass?
Waste of time making loras for this thing even the ones on civ will swing based on tag.

I rank this model XL 1.2 only because of the built in text after giving this a serious try
>>
File: esl.jpg (178 KB, 973x841)
178 KB
178 KB JPG
>I have nothing wrong with Chroma itself
>>
>>106702793
That shows this image:

Why are you lying ?
>>
File: 1432498179182.png (296 KB, 722x768)
296 KB
296 KB PNG
Does the PR even matter now that Chroma has it's own clip category and doesn't use pixart, moch or flux?
>>106702805
retard
>>
>>106702805
read the PR motherfucker, what does it say?
>Now both implementations make identical images:
it's showing the images are now the same if you apply this fix
>>
can we all just agree that comfy is a fucking retard
>>
>>106702810
only once your engine is up to snuff julien
>>
>>106702808
The insanely fried image you posted is not here

>>106702768

Where is it ? Nobody gets these images with Chroma, it's pure bullshit

Stop with your insane lying
>>
>>106702787
whataboutism.
>>
>>106702797
Bold to assume I'm ESL and not just tired. TIRED OF CHROMA.
>>
>>106702814
>The insanely fried image you posted is not here
again, are you retarded, the 2 images you showed are WITH THE PR FIX, so of course they look the same, they look the same if you apply the PR, that's the fucking goal of this PR, are you fucking retarded dude?
>>
>>106702810
I've been saying it all along
>>
>>106702766
Legitimately stop replying lol.
>>
File: output.webm (3.87 MB, 720x1280)
3.87 MB
3.87 MB WEBM
>>106702782
I got bored with the concept I guess. Here's another I did back then but didn't like it as much so I didn't bother sharing it.
>>
>>106702822
Whatever this problem was 5 months ago, it's no longer a problem since nobody is reporting anything remotely like the fried image you posted
>>
>>106702834
pancake sexo
>>
>>106702835
>it's no longer a problem
again, prove it, you have no idea how different both implementations are with the current chroma versions
>>
>>106702835
>nobody is reporting anything remotely like the fried image you posted
Chroma outputs garbage images though, who knows if it's because the model is genuinely bad or it's the fault of Comfy's implementation lol
>>
*yawn*
>>
>>106702845
No, you prove it

People use Chroma every day, tons of new images on the Chroma discord every day, people would have reported this problem if it still existed, nobody is

You don't even use Chroma, you're the anti-Chroma schizo
>>
>>106702835
>>106702858
>it's no longer a problem
>you prove it
this is your claim, this is your burden of proof
>>
>>106702834
AI was a mistake, is what I would say if this wasn't so peak
>>
>>106702865
Only one claiming it's a problem now is you, someone who doesn't even use Chroma and just wants to complain about Comfy

Get a life
>>
>>106702871
>Only one claiming it's a problem now is you
Only one claiming it's not a problem anymore is you
>>
File: 00007-2743784341.png (2.5 MB, 1824x1248)
2.5 MB
2.5 MB PNG
>>
>>106702871
>just wants to complain about Comfy
if he did his job proprely he wouldn't have valid criticism, get off his dick, he can mess up things like everyone else
>>
>>106702876
>People use Chroma daily, people who has used it since the very beginning, including the guy who made the model, they are not thinking there's any problem with the current Chroma implementation
No, it's just you
>>
How can you spot SEA ESL as apposed to other ESL? What are some things to look for?
>>
>>106702887
>just ignore the fried images bro, Comfy just changed the tokenizer for no reason and you have to trust him on that one, he's god after all
this is getting embarassing desu
>>
>>106702881
There's a LOT to complain about when it comes to Comfy, like the UI only becoming progressively worse, so much basic functionality missing forcing people to use third-party loras with all the security/compability issues that come with it etc.

But this retard has to invent problems, because the problems are irrelevant to him, he is just jumping around attacking different targets, truly a person with no life
>>
>>106702896
The creator of the model has no issue with it, actually he posts Comfy generated Chroma images on a daily basis

kys
>>
>>106702907
He deprecated fluxmod specifically because Chroma's implementation is nearly identical. There's no quality loss, it just looks like a slightly different seed.
>>
>>106702795
Is he telling the truth?
>>
>>106702912
>it just looks like a slightly different seed.
do you have eyes or something? >>106702768
>>
>he keeps posting images from ancient epochs
yawn
>>
File: kek.png (227 KB, 1331x1259)
227 KB
227 KB PNG
>>106702907
kek
>>
>5 months ago
>>
>>106702921
>5 month old bug report for v28
Give up you fucking retard
>>
>>106702940
>>5 month old bug
that's the worst part, that bug is this old and comfy still hasn't fixed it
>>
>>106702928
What is this supposed to show ? 5 month old post from some Flutter_ExoPlanet
>>
>>
File: chroma_comp.jpg (965 KB, 3208x1280)
965 KB
965 KB JPG
>>106702921
It doesn't do that anymore. The differences are there, but negligible. Like I said, it changes composition, not quality.
>>
>>106702951
>What is this supposed to show ?
sorry, I thought your IQ was over 70, let me explain that to you, comfy admits that his implementation is different from lodestone's, not only he admits that, but he also implies that HIS implementation is the superior one, yeah right... >>106702768
>>
>use chroma dc-2k
>add any well trained flux character and photorealism lora at low strength for consistency
>insert correct camera direction (this is where most promptlets fail)
>insert short character and scene description
>install booru tags copypaste from any of the sleazyforks
>go to e621 and find concept you're looking for
>copy paste and delete or replace all unwanted tags unless you want to yiff in hell
>end with extra camera direction
>??????
>photorealistic degeneracy

simple shit
>>
File: combined_image.jpg (918 KB, 2048x2408)
918 KB
918 KB JPG
This is after manually writing the PR fix into sd.py
>>
File: 1735831535775189.png (834 KB, 1403x407)
834 KB
834 KB PNG
>>106702967
it looks more saturated on the Comfy's one, look at the skin color, it's too uniform on the right
>>
>>106702967
no... the schizo was right..
>>
>>106702943
>that bug is this old and comfy still hasn't fixed it
It's clearly been fixed since this doesn't manifest itself

Nobody got around to close this PR yet, along with a ton of others
>>
>>106702979
>It's clearly been fixed since this doesn't manifest itself
uh oh... >>106702967
>>
Why the fuck does that fennec retard have so much fanboys here. Don't give a shit about chroma shit but an implementation of something should be absolutely be true to how the creator does it.
>>106702967
I think you need to get your eyes checked my man
>>
>>106702968
This was old news, we know it was different with the padding, we have been discussing this all thread, you absolute retard
>>
>>106702979
>Nobody got around to close this PR yet
the PR provided the exact catbox workflow so you can test that out and get the same exact fried Wario image
>>
>pixel 852692 is three lumens brighter than before
>>
>>106702987
>we know it was different with the padding
it's not the padding's problem you fucking mongoloid, it's a tokenizer problem, he's not using the same one >>106702757
>>
>>106702971
The irony is that's the only thing the model can do well when the target audience was 2d degenerates
>>106702986
Lots of astroturfing from a company without it's priorities in order both comfy and ani used to shill and run operations in the thread. Actual corporate interest from the start which makes people surprised over api implementation show how new they really are
>>
>>106702984
lel, where is this 'fried' look you complained about ?

they're practically identical
>>
>>106702971
proof?
>>
>>106702967
wait, I thought the image was supposed to be totally fried like the wario one, schizosisters what's going on?!
>>
>>106702975
bottom right is too cute pls post full img plox
>>
>>106702994
>>106702998
>see, we find cases where it's not fried, therefore we can conclude that it'll never be fried ever
(You)
>>
File: off_v2.png (1.79 MB, 1024x1024)
1.79 MB
1.79 MB PNG
>>106703001
>>
>>106703002
I accept your concession.
>>
>>106702976
>>106702967
Yeah, it's too bright on the right side, I wonder why comfy decided to not simply copy the original implementation? Did he give a reason on why he decided to do that?
>>
>>106702967
The main question is why it's different at all? Isn't it supposed to fully respect the original creator's implementation?
>>
>>106703002
So you concede that the bug reported 5 months ago has been fixed, good
>>
>>106703010
see >>106702810
>>
Member when both comfy and and his coworkers which included ani worked for stability and used to do thread raids shit talking other devs?
Member when ani was comfy's enforcer to false report the original forge dev?
Member when they both lied and acted smug over SD 3 only to back peddle and jump like rats even through anons told them that would happen?
I member
I member well
The irony is they all ended up snaking each other in some way or another
>>
File: ComfyUI_00007_.png (872 KB, 1024x1024)
872 KB
872 KB PNG
>>106703006
>>
>>106702203
kek
>>
>>106703020
Who said it was always fried? You're fighting ghosts here, that wasn't the claim. The claim was that sometimes you get completely messed up images because that idiot decided to make a yolo implementation for some reason.
>>
fyi, you get practically the same result by setting the tokenizer to mochi, same way lodestone did in his fluxmod implementation. No overexposure like with the pixart one you get if you set it to 'chroma'.
>>
>>106702967
>Lodestone
>slim "woman"
>normal neck height

>Comfy
>fat roastie
>suspiciously long neck

hmm...
>>
>>106703039
>fyi, you get practically the same result by setting the tokenizer to mochi
you don't get an error when you do that?
>>
>>106703056
>>106703056
>>106703056
>>106703056
>>106703056
>>
>>106703030
>The claim was that sometimes you get completely messed up images
Only evidence being one image posted 5 months ago for an ancient training epoch

Meanwhile people, including the model creator, are posting Comfy generated Chroma images every day, gee I wonder if they would have noticed there being a problem

Stop wasting time
>>
>>106703054
Nope.
>>
>>106703063
>Only evidence being one image posted 5 months ago for an ancient training epoch
the workflow is on the PR, feel free to run it with a newer version of chroma, if it's still fried, what will be your excuse this time?
>>
>>106703068
I don't need to, I generate hundreds of Chroma images per day with Comfy, I don't get fried images
>>
>>106703023
>underboob visible
ty anon



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.