Discussion of Free and Open Source Text-to-Image/Video Models and UI
Prev: >>106696274
https://rentry.org/ldg-lazy-getting-started-guide
>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows
>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe
>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video
>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46
>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/
>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/
>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage
>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg
>Local Text
>>>/g/lmg
>Maintain Thread Quality
https://rentry.org/debo
why do you need two text encoders in qwen image edit? in the example wf I see only one used
Blessed thread of frenship
so what are the best of da best of flux models for generating a lot more detail in real human characters? Like torn/worn clothes and certain period clothing for example.
in the midst of enjoying wan and crazy fast llm speeds i kinda forgot the part where i should really jump from epicrealism and realismengine kek
>>106700489
you don't, you only need one
qwen_2.5_vl_7b_fp8_scaled.safetensors
>>106700474ty for bake
>>106700489
mmproj is needed if you use GGUF
>>106700511
thanks anon
it's better than the mmproj one?
>>106700482He really needs to distance himself from all of the things in that picture if he no longer wants to be industry poison.
>>106700528
I assume so, no issues with it
>>106699330Sorry for taking a while to respond. This is what I got with a basic bitch ass prompt. I'm using GNER but stock t5 should be fine too.
>>106700525
how so?
as a second clip encoder? to do what?
>>106700548
ok thanks
Friendship thread. Long live to local models.Don't go to the anime one, it's toxic. Post anime here.
qwen edit: "he is holding a nintendo switch"
then with the back image of a switch 2 on that image: "the device the man is holding in image1 has the appearance of the device in image2".
>>106700555
GGUFs don't have multimodal/vision by default in their quants/files, it's a separate mmproj file. It's automatically loaded by the GGUF nodes provided it's in the same dir with the appropriate filename.
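If you want to sanity-check that an mmproj file is actually sitting next to your GGUF text encoder, here is a minimal sketch. The matching rule (any other .gguf in the same folder with "mmproj" in its name) is an assumption made for illustration, not the lookup the GGUF nodes actually perform, and `find_mmproj_candidates` is a made-up helper name. Check the node's README for the exact filename it expects.

```python
from pathlib import Path

def find_mmproj_candidates(model_path: str) -> list:
    """Return .gguf files next to model_path whose name contains 'mmproj'.

    Hypothetical helper: the real loader may require a stricter filename.
    """
    model = Path(model_path)
    return sorted(
        p for p in model.parent.glob("*.gguf")
        if "mmproj" in p.name.lower() and p.name != model.name
    )
```

An empty list means nothing in that folder even looks like an mmproj file, so the vision part of the encoder has nothing to load.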
>>106700556Go back to your empty dump instead of trying to pit two better threads against each other you disabled faggot
>>106700508
Examples of the sort of image you mean? Flux Krea looks more realistic by default than any Flux Dev merge or lora anyway, IMO, and it has better prompt adherence. It also likes a bit higher guidance than regular Flux; I usually use Euler Beta with Guidance 4.5.
>>106700571fucking slopped to death, you better add some grain
>>106700582it's the 40s. images weren't as sharp then.
>>106700531?
the woman in image1 is holding a nintendo switch system, with the appearance of image2 on the back of it.now this is advertising.
>>106700585they weren't as plastic either
>>106700573
I actually use a gguf, so should I take that file and put it in the text_encoders folder as recommended by the screenshot, using the same name as the q8 gguf I have, or put it in the gguf folder itself?
>>106700592
>image1
>image2
so that's how the model "sees" the images? I was wondering if it was a stitch of them or if it "knew" the names of the inputs
>>106700580
literally picrel but not obviously sdxl level quality kek, and like i said actual weathered/dirty clothing and not some studio quality looking shit
there's a few flux amateur photoreal checkpoints im looking at that look pretty good but most people don't really do anything beyond the same concepts so i'm not sure. here's one example
https://civitai.com/models/978314/ultrareal-fine-tune?modelVersionId=1413133
>>106700599
like this (aka dont change the fucking name)
Is the Wan 2.2 5B model useless? I've never touched video AI stuff but I followed this workflow: https://docs.comfy.org/tutorials/video/wan/wan2_2
No matter what I do, there's like 0 action if I use image to video, and without it the results are quite horrible
>>106700610
with this version you can just describe the image elements, but you can also refer to them by node cause the new text node has image1/image2/image3 as inputs.
>>106700616
alright, I'll do that, thanks anon
>>106700571Was the original Hitler actually a real photo?
>>106700632hitler's not real dude
>>106700612i thought that was nicki minaj from the thumbnail
>>106700635it was actually a woman
>>106700623it's way easier for what I want to do, thanks
I like how edit v2 also doesn't make people hobbits in edits.
Show a side profile of the woman who is standing up in the same room.
>watching anime about how AI will destroy the world>protag is using comfyuicant make this shit up
>>106700640did you cume at least?
>>106700673no i dont like fat cotton planters
>>106700679Understandable.
>>106700673how could i cum when it wasnt nicky as an egyptian queen with a fat ass? what a silly question
>>106700667lmaooo
>>106700667>average civitai workflow
>>106700743cringe plastic bitch
>>106700747you could be describing like 99% of flux gens i see here and on civitai honestly
>so mr altman apparently you thought people would pay $1000 a month for limited prompts over open source...a foolish move.
>>106700622
It's pretty useless. Any benefits from the reduced model size get erased by the hard-coded resolution. If you're vram/ram limited it's better to just use a smaller quant.
>>106700667>comfyui created nodegraphsYou just did make it up.
Is the new Qwen edit uncensored or do I need a LoRA
so Flux pro is now in photoshop
>>106700875
10/10 would take her home alongside my new copy of the limited special $150 edition of LOST SOUL ASIDE
>>106700875send this to joosten and tell her it's an unlockable scene in Kojima's new game.
Has anyone made an offline AI image generator that can generate image prompts as good as Gemini? I want to make porn of my favourite anime and video game characters
>>106700786it actually looks like UE
>>106700931it looks like an abstract japanese representation of a nodegraph. it's unusable garbage is more like it and it's webshit too kek
the anime girl is typing on a computer on a desk, with a white CRT monitor and white tower. Keep her expression the same. she is wearing a black t-shirt. keep the text "kit-aura" in the image.
>tfw when new AI models
I want to train a character lora. The character wears a bunch of detailed accessories that the AI always gets wrong and makes it look like slop. How can I fix that? Should I train each accessory as a separate lora by itself? Inpainting to fix those details is unreliable and time consuming.
>>106700953
can you use qwen image loras in qwen edit?
>>106700982
2nd pass with qie destroys even further. qie pixelspace when?
the anime girl is pointing at a large neon sign with the text "LDG" in teal color text.
from mikudonalds ad
>>106701023
better neon sign:
>>106700968
Include closeups in the dataset?
bros is vpred worth it?
the anime girl is sitting on a black couch in front of a large neon sign with the text "LDG" in teal color text.
kinda neat it can figure out the 3d model proportions despite only a side profile as reference, didn't even say "miku hatsune".
>>106701051All signs point to yes
>>106701051All signs point to no
>>106700772Lul
can flux even do nfsw without a billion loras?
Maybe we should commision /3/ or /ic/ to design a unique character as /ldg/'s mascot. LDG tan. Then we can train a lora with it.
It is literally impossible to make Wan understand that I want breasts to look natural, to jiggle and bounce and squish and behave as real breasts do. Nothing I put in the prompt window has any slightest effect on it. If the breasts in the starting image are ambiguous, it will choose to make them stiff and fake no matter what, 100% of the time.I am so fucking tired.
>>106701051All signs point to maybe
>>106700592
>>106701234You should batch a gen of 50 pics of the same girl, leave all the six fingers and scuff in, and train the lora with it.
>>106697494
https://www.reddit.com/r/StableDiffusion/comments/1nqm5l0/images_from_the_huge_apple_model_allegedly/
here's some images on HunyuanImage 3.0, this shit is gigaslopped lmao
>>106701256we're so lucky to have all these neat tools like wan/qwen/noob/illustrious
>>106701051At this point (it's been around for years), if it was actually worth it, it would be the defaultThis is just furries and weebs (aka severe autists) who are obsessing over some imagined color improvement
>>106701298>This is just furries and weebs (aka severe autists)aka 90% of the fags in this hobby
>>106700612
>>106701297
2025 is a coomer's paradise.
>>106701256hpru shjet
>>106701249
https://civitai.com/models/1852647?modelVersionId=2096600
works with i2v too
>>106701345
and this is the worst it will ever be
>>106701403
Honestly, this is enough for my needs
I'll gladly take improvements and they will obviously come, but the current state far exceeds any expectations I had going into local ai image/video gen
>>106701332
>>106701322>Road to El Dorado live action remake could actually be great
>>106701270https://xcancel.com/cannn064/status/1970659710220349509#m
>>106701298>it's been around for yearskek
>>106701453>gpt anime samefaceDoA
>>106701322ÆÆÆÜÜÜGGH thank you for making me cume sir
>>106701270
>here's some images on HunyuanImage 3.0, this shit is gigaslopped lmao
Yeah, hopefully it will take well to lora training
But everything except for Chroma is 'gigaslopped' these days since the rest primarily train on synthetic data and stock photos. Again, it can usually be fixed with lora / finetuning, but whether that's realistically viable also depends on how big the model is
>>106701453They can keep it. Industry grade != benchmax. It's funny that these teams make waves in the llm space but can't capture the Dalle, 4o and Imagen audience. Hmm I wonder why...
>>106701256>>106701322why they move in slow motion???
>>106701514they are not moving in slow motion, you have brain tum0r
replace the man in the black tracksuit with the anime girl in image2.
no offense paulie, it was just the easiest swap description
>>106701512
they still have the llm way of thinking, synthetic data training = good mememarks, but for imagegen that's really useless, no one wants to generate plastic humans even if the model can add a blue cube on the left and a red sphere on the right, sigh...
>>106701520and once again, anri as a test>italian sausage
>>106701512
Watch it being nearly identical to qwen just like the hunyuan image model, kek.
>>106701554
>>106701554doesn't anri have way larger tits?
>>106701554
damn this is bad, it's like they copy pasted the girl without considering the lighting of the original image
>>106701562well she has two phases. now she's giga milk mode. that was after she got pregnant.
the blonde anime girl is wearing a gold crown, and on the seat of the car is a stack of rectangular boxes that say "Nvidia RTX 5090" with the Nvidia logo on the box. A champagne bottle is in a bucket of ice to the right.
what's the best general way to use wan 2.1 loras (not light) with wan 2.2, full strength on low noise, split evenly between high and low, full strength on both high and low?
>>106701641
high pass, 2.1 lora at 3 strength, low pass, 2.2 low lora at 1 strength
seems to work for me, when 2.2 came out the kijai workflow was using 2.1 for both, with high at 3, low at 1.
https://github.com/comfyanonymous/ComfyUI/pull/9979
the memory leak fix has been merged
>>106701641
>>106701657
>high pass, 2.1 lora at 3 strength, low pass, 2.2 low lora at 1 strength
I also add high pass 2.2 lora at 0.4 strength to get less blurriness
>>106701667
2.2 high can cause motion issues but at that strength it should be ok
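Writing the strengths from the replies above down in one place, as a rough sketch. The numbers are exactly what the posts report; the dict layout, the key names and the `describe` helper are made up for illustration and don't correspond to any ComfyUI node or file format.

```python
# Strengths as reported in the posts above. Layout and names are hypothetical.
WAN22_LORA_PLAN = {
    "high_noise": [
        {"lora": "wan2.1 lora", "strength": 3.0},
        # optional extra mentioned in the follow-up reply, for less blurriness
        {"lora": "wan2.2 high lora", "strength": 0.4},
    ],
    "low_noise": [
        {"lora": "wan2.2 low lora", "strength": 1.0},
    ],
}

def describe(plan: dict) -> list:
    """Flatten the plan into readable 'stage: lora @ strength' lines."""
    return [
        f"{stage}: {entry['lora']} @ {entry['strength']}"
        for stage, entries in plan.items()
        for entry in entries
    ]

for line in describe(WAN22_LORA_PLAN):
    print(line)
```

Treat these as starting points to tweak per lora rather than fixed values; the posters only say it "seems to work" for them.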
>>106701561>''Ay marone, these are some nice gabagools!''
>>106701680wan is really incredible desu, probably the only non meme local model so far
>>106701657
>>106701667
I wasn't talking about lightning loras but thanks, I'll keep it in mind
>>106701680neat, boob grab lora or something like that?
>>106701680>''Ay marone, these are some nice gabagools!''lmao
>>106701700>>106701398and some simple prompts; the fat man in the dark blue shirt uses his hand to fondle the breasts of the woman wearing a green bikini.
the japanese woman is wearing the outfit of the man in image2, with a helmet, and a blue suit with golden shoulder armor, with a cleavage cutout. keep her expression the same.meet judge dr- anri:
>>106701772what a coincidence, I'm genning asuka too
the japanese woman in image1 is wearing the outfit of the anime girl in image2 wearing a red bodysuit. keep her expression the same.
not bad. just need an edit to fix the number but that's pretty good overall.
https://xcancel.com/HaochengXiUCB/status/1971219731140182423#m
ready for another cope speedup?
>>106701830Woah! A real free lunch this time!
>>106701830Sparsity will unironically make a huge comeback.
>>106701830
>wan2.1
where wan2.2
where node
>>106701852check under your foreskin
>>106700474
what are some of the larger models i can throw in to 96gb of vram?
SDXL has been my go-to for a while but I feel like I could do some more interesting stuff now with all the extra space
sadly invokeai doesn't seem to support WAN models, kinda wanted to mess around with video stuff but i might need to find an alternative just for that
>>106701830
it's still quite different to the original video, pass
>>106701867
Pretty sure https://huggingface.co/Qwen/Qwen-Image is the largest open model currently
>>106701886hehhh, that's pretty good! model?
>>106701787
>>106701658
for fuck's sake it's slower now
>>106701867
>>106701888
the largest video model is step video (30b)
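For a rough idea of what fits in 96gb, a back-of-the-envelope sketch of weight size by precision. This counts the weights only; text encoder, VAE, activations and any quantization overhead are left out, so real usage is higher. The 30B figure is the one quoted above, while the function name and the precision table are just illustration.

```python
def weight_size_gb(params_billion: float, bytes_per_param: float) -> float:
    """Approximate size of the weights alone, in decimal gigabytes."""
    # (params_billion * 1e9 params) * bytes_per_param / 1e9 bytes per GB
    return params_billion * bytes_per_param

for name, bytes_per_param in [("bf16/fp16", 2), ("fp8", 1), ("4-bit quant", 0.5)]:
    print(f"30B @ {name}: ~{weight_size_gb(30, bytes_per_param):.0f} GB")
```

So a 30b model at 16-bit is around 60 GB of weights before anything else is loaded, which leaves some headroom on a 96gb card but not a huge amount.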
also pretty good
>>106701892I still get the memory leaks, each time I click gen it's rng if I get oom or not
>>106701897My b I meant largest image model
>>106701886neat
>>106701270Yet another Chinese model I'll just denoise with Flux Krea then at least for photographic stuff lol
>>106701919looks like slop either way lol
>>106701898
diff girl, same psylocke but a diff pose
and this is why china > openAI
>>106701919
are you using the unslop refiner for hunyuanImage?
https://github.com/comfyanonymous/ComfyUI/pull/9882
>>106701886that looks too good to be an open model, it's Seedream right?
>>106701925>this is why china > openAIbruh it literally mixed her realistic face with an anime body
>>106701959can fix that with "make it realistic" in another prompt.
>>106701888 >>106701897 >>106701903hm, seems i'll prob have to use a different frontend to go outside the box I've been confined to
>>106701889NoobAI
killer bee:
>>106701923Well yeah a native Krea version is better. I had to denoise at 0.6 strength to even clean up the 2K Hunyuan output that much.
>>106701988and laura from sf5, why not
>>106702018I wonder how the math works for edit models, or how it can take this and translate it to a figure/inpaint/etc. pretty cool even if I don't get how it works entirely.
holy shit, I prompted "make the man a n-word" and it worked. I assumed it might not work cause it's not the "proper" prompt.china doesn't care I guess!
>>106698483"Bargaining phase."THE CHANCE IS NEVER 0 BITCH
>>106701830Finally, wan getting some love once again, dont suppose there's any mention of running this in comfy?
>>106702106
>>106702106that one is pretty decent desu, I wished it worked as well on 2 characters
>>106702121one more
Is qwen 2509 slopped? Haven't tried either one but figured I'd start with that one and it's already fucked up the first two prompts I tried.
replace the girl with Miku Hatsune wearing the same outfit.
I like Chie but just a test.
>>106702147
>I like Chie
she's the goat
https://youtu.be/w7lj9qI8VFc?t=227
007: architects die another day
Just got everything installed correctly i think, any guides to make good prompts?
>>106702184The best guide is your own two eyes
>>106702172
the man has his arms at his side. the man is holding up a vanilla ice cream cone and giving the thumbs up.
>>106702142
Guess the official comfyui workflow doesn't work with qwen 2509? The original model seems to work fine.
>>106702142
you need this node in place of the old one.
>>106702142
>Is qwen 2509 slopped?
it's just a finetune of the older QIE, so yes, it's as slopped
the anime girl is wearing black pixel art sunglasses. at the bottom is a large subtitle saying "DEAL WITH IT" in white stylish text with a black outline.
>>106702142
>is this Chinese model slopped
yes? they all are (the exception would be Seedream)
Do these settings look good for my Chroma wf? I'm trying to get as much detail as I can. First is the loader config, second is txt2img, third is upscaling. Ignore the upscale_by 1, I prescale using a custom node to x2 size.
>>106702238Sweet, thanks anon.
>>106702272
cfg 4 is enough desu. clip should be set to chroma and not SD. Tokenizer is a snake oil. For second pass I had weird result with beta schedulers when they tried to second pass themselves and it resulted in a weird shifted output. Res_2s is very heavy and will take forever to run so keep that in mind
>>106700626
Nta, but you're talking about a gguf of the text encoder right? If you're just using a gguf of the model itself and not the text encoder the extra file is not necessary.
if wan/qwen were made by the saudis instead of china:
are pickletensor files really dangerous?
>>106702336porn is banned in China lol
>>106702337They're dangerous in the same way an activated mine is dangerous vs an unactivated mine is.
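To make the mine analogy concrete: a pickle file is a set of instructions for rebuilding objects, and those instructions can call arbitrary functions at load time. The sketch below is a harmless demonstration of that mechanism using only the standard library; the `Payload` class is made up for illustration, and a real malicious checkpoint would call something far worse than a string concatenation.

```python
import pickle

class Payload:
    # pickle calls __reduce__ to learn how to rebuild the object; whatever
    # callable it returns is invoked with the given args at load time.
    def __reduce__(self):
        return (eval, ("'payload ' + 'executed'",))

blob = pickle.dumps(Payload())

# Loading does not give back a Payload instance. It runs eval(...) and
# returns whatever that call produced.
result = pickle.loads(blob)
print(result)  # payload executed
```

This is why .safetensors is preferred for sharing weights: it stores raw tensor data and does not go through pickle on load. If you have to open a pickle-based checkpoint from an untrusted source, treat it like running an unknown script.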
>>106702321
>Tokenizer is a snake oil.
You mean min padding? I thought the guy who made Chroma recommended it himself? His own workflow has a custom padding removal node which does the same thing.
>I had weird result with beta schedulers when they tried to second pass themselves and it resulted in a weird shifted output
True, I've actually noticed that in a few upscaled gens but didn't know the scheduler was the cause, thanks.
I know res_2s is heavy, I've got a 4090 though and don't mind the wait given the quality increase that I observed.
>>106702336
but since it isn't
give the woman a green suit of armor from the videogame Halo, with cleavage.
>>106702349
lots of stuff is technically banned but it's a free for all there, 5090s are banned but they are freely bought.
Saw those gens of the Major last thread and some days back.Sexualizing the Major just never feels right to me.
>>106702370she did her job in a leotard and a jacket, showing off your body doesn't make you exploited.
>>106702272
Why are these nodes not connected?
give the japanese woman a spotted white cow themed bikini and a visor with cow ears.
new yumi is neat
now this shows what qwen edit v2 can do.
the man is drinking a bottle of jack daniels which is half empty. the green poster on the left is ripped in half. The red bottles of soda on the right are empty. On the wall behind the man is "SHILL MAN" spray painted on the wall with black spray paint.
>>106702280Tits too small.
>>106702397Because they're three separate screenshots combined into one, showing only the nodes relevant to my question.
>>106702351
>You min min padding?
IIRC the 'official' setting from the author was:
min_padding 1
min_length 3
But Comfy changed it to (because he thought it 'looked better'):
min_padding 0
min_length 3
in their example workflow. What workflow of his are you referring to?
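For anyone lost on what the two knobs would even do, a toy sketch. It assumes min_padding means "always append this many pad tokens" and min_length means "then pad until the sequence has at least this many tokens". That reading is an assumption for illustration, not a copy of ComfyUI's or lodestone's tokenizer code, and real tokenizers also deal with end-of-sequence tokens and attention masks, which are left out here. `pad_tokens` and the token ids are made up.

```python
def pad_tokens(tokens, pad_id=0, min_padding=0, min_length=0):
    """Toy padding rule: append min_padding pads, then top up to min_length."""
    out = list(tokens)
    out.extend([pad_id] * min_padding)
    if len(out) < min_length:
        out.extend([pad_id] * (min_length - len(out)))
    return out

prompt = [5, 6, 7]  # three made-up token ids

print(pad_tokens(prompt, min_padding=1, min_length=3))  # [5, 6, 7, 0]
print(pad_tokens(prompt, min_padding=0, min_length=3))  # [5, 6, 7]
print(pad_tokens([5], min_padding=0, min_length=3))     # [5, 0, 0]
```

Under this reading the two settings only differ once the prompt is already at least min_length tokens long: with min_padding 1 the model always sees one trailing pad token, with min_padding 0 it sees none.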
>>106702426im the butterfly
>>106702420the man is holding a sign saying "man I LOVE Doritos!", while a black pistol is pointed at him from a man off camera. only their arm is visible holding the gun.
>>106702383fuck you, you know what I'm talking about
>>106702448official art did that all the time, motoko being hot and strong is part of her persona.
>>106702420
>now this shows what qwen edit v2 can do.
making his skin as smooth as porcelain?
>>106702430
The one on his repo uses min padding 1, length 0, that's where I copied the initial settings from.
>https://huggingface.co/lodestones/Chroma/blob/main/ChromaSimpleWorkflow20250507.json
>https://huggingface.co/lodestones/Chroma/blob/main/ChromaSimpleWorkflow20250507_overview.png
>>106702430
>But Comfy changed it to (because he thought it 'looked better'):
>min_padding 0
>min_length 3
he changed the values (as a bandaid) because his implementation sucks ass and he's not bothered to fix it
https://github.com/comfyanonymous/ComfyUI/pull/7965
>>106701332i didn't even know she was sick
>>106702448The scene where she fights the tank has a closeup of her erect nipples in a body suit when she's struggling with the hatch on the tank. What do you think the point of that was?
>>106700474
I started using sd.webui and now I can run it through a Cloudflare Tunnel to gen wherever, but the issue I'm facing now is that I sometimes need to restart it but can't.
Like, generation freezes at 100% and the only fix I can find is closing the terminal then running run.bat again...
Is there any fix to this through the UI itself? "Interrupt" doesn't work and neither does "Skip" when it happens
>>106702503kek
>>106702370
>>106702481>because his implementation sucks assThat whole issue is about the min_paddingAre you retarded ?
>>106702524min_padding = 1 works fine if you use lodestone's implementation, not comfy's one (because that one is fucked up), right back at you retard? are you dumb or something? all he had to do is to do a 1:1 copy paste of his implementation, and he didn't, and now he's surprised why it doesn't work the way it should
frens?
>>106702539>not comfy's oneComfy did not write the Chroma implementation you stupid fuck, it was submitted by a core Chroma contributor, silveroxidesThe whole argument was the min_padding
>>106702539Which is one of the many reasons why I don't use his UI, I don't like overly opinionated spergs when it comes to basic functionality look at the disaster wayland was before valve stepped in.
>>106702574>Comfy did not write the Chroma implementation you stupid fuck, it was submitted by a core Chroma contributor, silveroxidesand? someone made a PR to fix that implementation and he didn't merge it, Comfy doesn't care about Chroma, he's more interested about making flawless API nodes
>>106702574>wrong>doesn't fix itTo the other anon that's arguing with this dipshit just stop he's fucking retarded
>>106702574>it's not Comfy's fault that he's merging bad PR implementationsYES IT IS RETARD, IT IS HIS FUCKING JOB TO VERIFY IS WHAT GOES TO HIS OFFICIAL REPOSITORY IS LEGIT OR NOT, KYS
swords could be worse i guess
>>106702562
the blue hair anime girl is wearing a hot dog suit outfit.
I didn't expect this prompt to work but...it does in fact work. testing models with odd prompts is a good way to see what you can/can't do:
>>106702591There's nothing wrong with the implementation other than this setting which Comfy fucked withShow me something else that is wrongThe PR has lodestones blessing, he and silveroxides are best buds, likely loversDumbass retard
Lots of amazing models coming to SaaS recently
>>106702608better suit, now it has the original hoodie from the image.
>>106702574>i will not merge a bugfix thats been sitting for months because someone made an... errordont post again on this site lil techlet jamal
>>106702610>There's nothing wrong with the implementation other than this setting which Comfy fucked withyou are sooooo fucking retarded dude, you don't know what you're talking about, the 2 implementations are completly different, Comfy's one doesn't use the same tokenizer as lodestone's one, and that's why you get fried shit, this is the last time I respond to you, you seem completly braindead, lurk more you fucking faggot
>>106702608>mustarddisgusting
>>106702632>using half a year old versions
>>106702632>months old imagelol, can't run a v30 comparison yourself
>>106702644the arch of the model didnt change, jamalany other low iq take to continue on with your public humiliation?
>>106702644>>106702646>nooo, you don't understand, the implementation is the exact same but the image is different because... reasons...(You)
>>106702652>>106702651You get absolutely minimal diferrence on current chroma no matter if you use the padding or not. You are retarded.
Deliver us from evil, A*******o.
>>106702658>You get absolutely minimal diferrence on current chromaprove it (you won't)
>>106702632Show me a fucking image from a checkpoint that isn't ancient you dumb fuckv28, are you insane ?Go ahead, not even lodestones gives a shit about this PR
>>106702664>v28, are you insane ?are you fucking retarded? the implementation was made during v28 that was why they used v28 to show that there was a problem, omfucking god why is there so many low IQ subhumans in this fucking thread???
>>106702646>lol, can't run a v30Another ancient checkpoint with zero relevanceStop, just stop
any advice on reordering sockets in a subgraph in comfy? why doesn't the right click menu have a move up/down option bro... I'm not even autistic and I thought of this QoL like literally immediately...
>>106702658
>minimal diferrence
Thanks for conceding.
>>106702671Show me this minimal difference actually manifesting in chroma1-hd or base
>>106702684what's there to concede? you claimed that there is minimal difference, you have therefore the burden of proof and you don't want to prove it, I'm the one accepting your concession
>>106701298retarded slopper
Please... just a crumb of side by sides... a sliver even
>>>/pol/517249748
kys all of you
>>106702721I made this. Feels good to still see it get reposted.
I tested chroma extensively and I have to say once we're getting full strength loras turning into irl gens without heavy weighting for or against realistic if you decide to do certain actions, we have a serious fucking problem with how this model was trained. I'm still trying to figure out why the fuck he decided to copy the pony creator and obscure so much shit, from what I can tell that alone has fucked the data to its core, not only that it becomes mandatory to put the steps higher just for it to be cohesive in certain actions
Pic related: a random seed came out 3D while all the others didn't, because it applies this type of shit to common actions
>>106702729
>he's still missing the point
the problem isn't the min padding value, it's the inner code that you can't change from your node; that's why there's a PR to change that code and make it match the official Chroma one. thanks for proving again you don't know what the problem really is
>>106702729
... you need to use Padding Removal from Fluxmod.
https://github.com/lodestone-rock/ComfyUI_FluxMod
>>106702729
>>106702737
to be more clear, the issue is this:
>Chroma's implementation by lodestone uses MOCHI's tokenizer, and for some reason Comfy's implementation uses PIXART's tokenizer, that difference is the reason you get different images (and fried shit on comfy's side)
>>106702632
>>106702749>run through esoteric workflows and deprecated node hoopsFuck off. You lost. Deal with it.
sdxl won
>>106702737
There is no problem, some minimal difference in an ancient epoch is inconsequential
If you could at the very least show the same minimal difference in the actual final releases you would at least have some claim, but you don't
Which makes me conclude there is no difference in the final releases
Also recently Chroma Radiance was merged, no complaints
>>106702757
Is there a custom Load Clip node with the corrected tokenizer for CLIPType.Chroma?
>>106702766
>some minimal difference in an ancient epoch is inconsequential
it's not minimal at all, do you have eyes?
>>106702632
I like pretending that Lodestone / his ilk don't post here. Makes it more fun.
>>106702767
The multigpu nodes specifically have Chroma as a clip type but idk if it includes it
>>106702767
>Is there a custom Load Clip node with the corrected tokenizer for CLIPType.Chroma?
no, you have to change the inner code to get that result, that's why this PR exists
>>106702481
I really do loathe them. I have nothing wrong with Chroma itself. It's the fact they keep acting like it's more consequential than it actually is that bothers me. It's just a shitty flux fine tune.
>>106702757
>and fried shit on comfy's side
Been using Chroma on Comfy AND Forge, neither are fried and there's no perceptible quality difference
Enough with your bullshit lies
>>106702735mhmm, very nice anon, where are the new pancake girls?
>>106702778>there's no percetable quality differenceprove it (again you won't and I accept your concession in advance)
>>106702777>It's just a shitty flux fine tune.To be fair you could say the same about Pony being a shitty XL fine tune. That thing was CARRIED by LoRAs.
>>106702768Give link to this PR
frogslop avatar, ignored
>>106702788-> >>106702481
I seriously feel bad for whoever has to finetune this mess to fix it. >>106702759I kind of agree simply because everyone trying to make the next model keeps shooting themselves in the fucking foot when all they have to do is not alter shit or do retarded shit like fuck with tags. We've been over this many times already once you start obscuring shit you basically kill a model, it happened to SD 3 and it happened to pony and it happened to chroma. Something as simple as protesting and selfie should not be so poorly weighed that it behaves like a token set to 2.0 strength by default. Also mutliple characters and interactions are childs play with any model I have shown this for YEARS, so when you tout it natively don't have your tokens so fucked that some characters will always duplicate at normal strength and only act normal at .5. Also what the fuck is up with the banding on the base model during high res pass?Waste of time making loras for this thing even the ones on civ will swing based on tag.I rank this model XL 1.2 only because of the built in text after giving this a serious try
>I have nothing wrong with Chroma itself
>>106702793That shows this image:Why are you lying ?
Does the PR even matter now that Chroma has it's own clip category and doesn't use pixart, moch or flux?>>106702805retard
>>106702805read the PR motherfucker, what does it say?>Now both implementations make identical images:it's showing the images are now the same if you apply this fix
can we all just agree that comfy is a fucking retard
>>106702810only once your engine is up to snuff julien
>>106702808The insanely fried image you posted is not here>>106702768Where is it ? Nobody gets these images with Chroma, it's pure bullshitStop with your insane lying
>>106702787whataboutism.
>>106702797Bold to assume I'm ESL and not just tired. TIRED OF CHROMA.
>>106702814>The insanely fried image you posted is not hereagain, are you retarded, the 2 images you showed are WITH THE PR FIX, so of course they look the same, they look the same if you apply the PR, that's the fucking goal of this PR, are you fucking retarded dude?
>>106702810I've been saying it all along
>>106702766Legitimately stop replying lol.
>>106702782I got bored with the concept I guess. Here's another I did back then but didn't like it as much so I didn't bother sharing it.
>>106702822Whatever this problem was 5 months ago, it's no longer a problem since nobody is reporting anything remotely like the fried image you posted
>>106702834pancake sexo
>>106702835
>it's no longer a problem
again, prove it, you have no idea how different both implementations are with the current chroma versions
>>106702835
>nobody is reporting anything remotely like the fried image you posted
Chroma outputs garbage images though, who knows if it's because the model is genuinely bad or it's the fault of Comfy's implementation lol
*yawn*
>>106702845
No, you prove it
People use Chroma every day, tons of new images on the Chroma discord every day, people would have reported this problem if it still existed, nobody is
You don't even use Chroma, you're the anti-Chroma schizo
>>106702835
>>106702858
>it's no longer a problem
>you prove it
this is your claim, this is your burden of proof
>>106702834AI was a mistake, is what I would say if this wasn't so peak
>>106702865
Only one claiming it's a problem now is you, someone who doesn't even use Chroma and just wants to complain about Comfy
Get a life
>>106702871
>Only one claiming it's a problem now is you
Only one claiming it's not a problem anymore is you
>>106702871
>just wants to complain about Comfy
if he did his job properly there wouldn't be valid criticism, get off his dick, he can mess up things like everyone else
>>106702876
>People use Chroma daily, people who have used it since the very beginning, including the guy who made the model, they are not thinking there's any problem with the current Chroma implementation
No, it's just you
How can you spot SEA ESL as opposed to other ESL? What are some things to look for?
>>106702887
>just ignore the fried images bro, Comfy just changed the tokenizer for no reason and you have to trust him on that one, he's god after all
this is getting embarrassing desu
>>106702881
There's a LOT to complain about when it comes to Comfy, like the UI only becoming progressively worse, so much basic functionality missing forcing people to use third-party loras with all the security/compatibility issues that come with it etc.
But this retard has to invent problems, because the problems are irrelevant to him, he is just jumping around attacking different targets, truly a person with no life
>>106702896The creator of the model has no issue with it, actually he posts Comfy generated Chroma images on a daily basiskys
>>106702907He deprecated fluxmod specifically because Chroma's implementation is nearly identical. There's no quality loss, it just looks like a slightly different seed.
>>106702795Is he telling the truth?
>>106702912
>it just looks like a slightly different seed.
do you have eyes or something?
>>106702768
>he keeps posting images from ancient epochs
yawn
>>106702907kek
>5 months ago
>>106702921
>5 month old bug report for v28
Give up you fucking retard
>>106702940
>>5 month old bug
that's the worst part, that bug is this old and comfy still hasn't fixed it
>>106702928
What is this supposed to show? 5 month old post from some Flutter_ExoPlanet
>>106702921It doesn't do that anymore. The differences are there, but negligible. Like I said, it changes composition, not quality.
>>106702951
>What is this supposed to show ?
sorry, I thought your IQ was over 70, let me explain that to you: comfy admits that his implementation is different from lodestone's. Not only does he admit that, he also implies that HIS implementation is the superior one, yeah right...
>>106702768
>use chroma dc-2k
>add any well trained flux character and photorealism lora at low strength for consistency
>insert correct camera direction (this is where most promptlets fail)
>insert short character and scene description
>install booru tags copypaste from any of the sleazyforks
>go to e621 and find concept you're looking for
>copy paste and delete or replace all unwanted tags unless you want to yiff in hell
>end with extra camera direction
>??????
>photorealistic degeneracy
simple shit
This is after manually writing the PR fix into sd.py
>>106702967
it looks more saturated on Comfy's one, look at the skin color, it's too uniform on the right
>>106702967no... the schizo was right..
>>106702943
>that bug is this old and comfy still hasn't fixed it
It's clearly been fixed since this doesn't manifest itself
Nobody got around to closing this PR yet, along with a ton of others
>>106702979
>It's clearly been fixed since this doesn't manifest itself
uh oh...
>>106702967
Why the fuck does that fennec retard have so many fanboys here? Don't give a shit about chroma shit but an implementation of something should absolutely be true to how the creator does it.
>>106702967
I think you need to get your eyes checked my man
>>106702968This was old news, we know it was different with the padding, we have been discussing this all thread, you absolute retard
>>106702979
>Nobody got around to close this PR yet
the PR provided the exact catbox workflow so you can test that out and get the same exact fried Wario image
>pixel 852692 is three lumens brighter than before
>>106702987>we know it was different with the paddingit's not the padding's problem you fucking mongoloid, it's a tokenizer problem, he's not using the same one >>106702757
>>106702971
The irony is that's the only thing the model can do well when the target audience was 2d degenerates
>>106702986
Lots of astroturfing from a company without its priorities in order. Both comfy and ani used to shill and run operations in the thread. Actual corporate interest from the start, which makes people who are surprised over the api implementation show how new they really are
>>106702984
lel, where is this 'fried' look you complained about?
they're practically identical
>>106702971proof?
>>106702967wait, I thought the image was supposed to be totally fried like the wario one, schizosisters what's going on?!
>>106702975bottom right is too cute pls post full img plox
>>106702994
>>106702998
>see, we find cases where it's not fried, therefore we can conclude that it'll never be fried ever
(You)
>>106703001
>>106703002I accept your concession.
>>106702976
>>106702967
Yeah, it's too bright on the right side. I wonder why comfy decided not to simply copy the original implementation? Did he give a reason for doing that?
>>106702967The main question is why it's different at all? Isn't it supposed to fully respect the original creator's implementation?
>>106703002So you concede that the bug reported 5 months ago has been fixed, good
>>106703010see >>106702810
Member when both comfy and his coworkers, which included ani, worked for stability and used to do thread raids shit talking other devs?
Member when ani was comfy's enforcer to false report the original forge dev?
Member when they both lied and acted smug over SD 3 only to backpedal and jump like rats even though anons told them that would happen?
I member
I member well
The irony is they all ended up snaking each other in some way or another
>>106703006
>>106702203kek
>>106703020Who said it was always fried? You're fighting ghosts here, that wasn't the claim. The claim was that sometimes you get completely messed up images because that idiot decided to make a yolo implementation for some reason.
fyi, you get practically the same result by setting the tokenizer to mochi, same way lodestone did in his fluxmod implementation. No overexposure like with the pixart one you get if you set it to 'chroma'.
>>106702967>Lodestone>slim "woman">normal neck height>Comfy>fat roastie>suspiciously long neckhmm...
>>106703039
>fyi, you get practically the same result by setting the tokenizer to mochi
you don't get an error when you do that?
>>106703056>>106703056>>106703056>>106703056>>106703056
>>106703030
>The claim was that sometimes you get completely messed up images
Only evidence being one image posted 5 months ago for an ancient training epoch
Meanwhile people, including the model creator, are posting Comfy generated Chroma images every day, gee I wonder if they would have noticed there being a problem
Stop wasting time
>>106703054Nope.
>>106703063
>Only evidence being one image posted 5 months ago for an ancient training epoch
the workflow is on the PR, feel free to run it with a newer version of chroma, if it's still fried, what will be your excuse this time?
>>106703068I don't need to, I generate hundreds of Chroma images per day with Comfy, I don't get fried images
>>106703023>underboob visible ty anon