[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Obese Nerds Using My Images Edition

Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107017112

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Neta Yume (Lumina 2)
https://civitai.com/models/1790792?modelVersionId=2298660
https://gumgum10.github.io/gumgum.github.io/
https://neta-lumina-style.tz03.xyz/
https://huggingface.co/neta-art/Neta-Lumina

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
File: file.png (2.56 MB, 1024x1024)
2.56 MB
2.56 MB PNG
btw is there some good method people use to mix up qwen's low seed variety other than having a whole LLM create prompt variants instead?
>>
blessed thread of chroma :-)
>>
SONGBLOOM IS THE NEW KING OF LDG
>>
Has anything surpassed Illustrious yet
>>
File: 1745961318219695.png (2.43 MB, 1344x1728)
2.43 MB
2.43 MB PNG
>when u SHAFT look too hard
>>
File: file.png (10 KB, 734x134)
10 KB
10 KB PNG
this thread should be all about asses
>>
>>107019722
in regards to what
>>
>>107019711
>is there some good method people use to mix up qwen's low seed variety other than having a whole LLM create prompt variants instead?
Let me answer your question with another question: if a model has low seed variety, how else could you mix it up if not for varying the content/structure of the prompt?

Completely random thought I had: I wonder what It would take for diffusion models and their text encoders to be smart enough to understand Pig Latin. Is it really that much harder than understanding a non English language? You'd just need enough examples in the training data
>>
More and more people are realizing chroma is a shitty failbake, and it's glorious to see
>>
File: radiance.png (2.93 MB, 864x1488)
2.93 MB
2.93 MB PNG
>>107019722
in many aspects yes, but unfortunately not all of these aspects in one practical model

e.g. qwen utterly wrecks illustrious for multiple characters or text or such, but it has some aesthetic failings (includin with anime) and it can't do most anime characters.

you might say aesthetics are subjective but I don't thin most people particularly like qwen's
>>
>>107019737
it's more like we were convinced that chroma would end up being good, but it didn't improve much between v25ish and v50, it even got worse
>>
>>107019569
Chroma does black ladies very, also try different camera/lighting loras for different flavors

>>107019729
Yes
>>
The disillusionment I am under after realizing the repercussions of it being a wrapper cannot be exaggerated. What the fuck.
>>
File: radiance.png (3.12 MB, 864x1488)
3.12 MB
3.12 MB PNG
>>107019733
> if a model has low seed variety, how else could you mix it up if not for varying the content/structure of the prompt?
for example by messing with the initial noise? add some blotches with patterns or something.

or some additional thing to the model that does <something> at 50% of the steps... idk, segment and just move the discernible elements around

these may not be extremely clever ideas, but that's why I'm asking of some had a good idea
>>
>>107019722
noob. though it's basically just a variant of illustrious.

neta yume is about to beat noob though if they can train some more artist knowledge into it.
>>
>>107019768
I know understand why corsets were so popular in the 19th century, damn that looks good to my monke brain
>>
File: 1734804362739077.jpg (1016 KB, 1224x2184)
1016 KB
1016 KB JPG
>>
>>107019729
>this thread should be all about asses
I am on a genning hiatus to try and improve my life (and also I am burnt out and waiting for videos with sound) but one of my benchmarks for video models now is if it can actually generate an accurate body from behind for the age and height. Wan is really bad at this in terms of both prompt adherence and just composition in general once you try to do anything that isn't an Instagram model in her 20s-30s

Oh and you're thinking too much about cunny, I'm talking equally about being able to generate anatomically accurate 70 year old granny cheeks in thongs too. Both a 70 year old and a 07 year old are most likely not in the training data which means that it would have an actual understanding of human anatomy across the entirety of the aging process at this point

I hope that's enough ass discussion for you

>>107019755
>Chroma does black ladies very, also try different camera/lighting loras for different flavors
I think anon wanted anime/2.5d black women while keeping their facial features like a 90s anime or something like that, not photorealism
>>
File: Comfy_00033.png (1.03 MB, 720x1280)
1.03 MB
1.03 MB PNG
me on the right
>>
File: 1740937227086680.png (2.85 MB, 1248x1824)
2.85 MB
2.85 MB PNG
>>
>>107019775
>I now understand
Is this the first time you have learned about the hip-waist ratio? That's what corsets exaggerate, there's nothing inherently attractive about them. If anything it should be considered the original "shapewear" and is actually frauding and lying about the woman's bodies, but that sentiment doesn't exist because when corsets were a thing only rich people wore them unlike fat poor women at the gym wearing shapewear

Sometimes it surprises me how little people introspect especially about the things they're uniquely passionate for like their own sexuality to the point where a 1girl slop makes you realize something about yourself and about the world
>>
>>107019791
nice extra fingers, sd 1.5?
>>
>>107019790
It do be like that mr mice
>>
>>107019799
it's not that deep bro, women with corsets make my pp harder than without, end of the story
>>
File: 1748849052429854.png (2.95 MB, 1248x1824)
2.95 MB
2.95 MB PNG
>>107019800
Oops. Fixed. Maybe.
>>
>>107019800
>extra fingers
there's 5 fingers on each of her hands, what are you talking about?
>>
>>107019807
>it's not that deep bro
Yeah it is, retarded mysticist. There is an explanation for everything. Thanks for self reporting that my paragraph about your lack of sexual introspection struck a nerve, though
>>
>>107019819
you must be blind retard
>>107019818
>>
>>107019821
>struck a nerve
>retarded mysticist
Oh how I love irony.
>>
>>107019821
why do you need introspection on your preferences? you can't control what you like
>>
>>107019827
>my pp gets hard
>*insert autistic rambling how its because u monkey who lack introspection [LMAO]*
>yes im such a deep and intelligent individual unlike a mongrel like u
dont even bother replaying to that retard
>>
File: hmmm?.png (172 KB, 460x460)
172 KB
172 KB PNG
>>107019821
>>107019799
Usecase for sexual introspection?
>>
Local Diffusion?
>>
>>107019840
Thanks for self reporting that my post struck a nerve, though.
>>
File: file.png (3.37 MB, 864x1488)
3.37 MB
3.37 MB PNG
>>107019775
agreed. we'll certainly gen a lot of girls with corsets over time.

and they were quite popular in modern times too (goth girls, now social media)
>>
>>107019799
>Is this the first time you have learned about the hip-waist ratio
I know about the ratio obviously, I love the 8 shape, but I've mostly seen corsets from history books, and they kind of looked gross with only illustrations with stomachs so constrained it looked more alien than anything hot.
>>
>>107019955
>stomachs so constrained it looked more alien than anything hot
there is a reason it's a ratio and not "zero stomach", like anything if you go too far it goes quickly from hot to horrible
>>
File: ComfyUI_07991_.png (1.8 MB, 1152x1152)
1.8 MB
1.8 MB PNG
>>107019677
>https://www.reddit.com/r/StableDiffusion/comments/1ogx7j4/chroma_radiance_mid_training_but_the_most/

Radiance absolutely is looking insane and like it will be the endgame once Chroma is done.
>>
File: 1754469448134070.jpg (935 KB, 1336x2008)
935 KB
935 KB JPG
>>
>>107019721
https://vocaroo.com/186PM8O3j8W2
>>
>>107020025
the doll has a skin texture less plastic than the actual woman lool
>>
>>107019790
Oh look it's assholegolden!
>>
>>107020047
Bonus points if you can guess the style mp3
>>
>>107020056
Ahah so true, I must add perchance that orange man is bad! Take my upvote kind sir!
>>
Is it possible to block swap native wan gguf (any wan) to load more frames?
>>
File: sd1.5slop.png (1.54 MB, 1069x956)
1.54 MB
1.54 MB PNG
LOL
>>
>>107019684
>Open thread
>No feet sloppa
>Leave
>:/
>>
>>107020052
The woman is Korean so the plastic is realistic, notice it's only her face anyways.
>>
>>107020072
>The woman is Korean so the plastic is realistic
a sweaty korean woman that just went out from a sauna then lol
>>
>>107020025
lodestone should have given up on it as soon as qwen came out and switched.
>>
>>107020068
yes, the redditor is also shilling the model here, take a look at this
>>107018294
https://www.reddit.com/r/StableDiffusion/comments/1ogx7j4/comment/nlk1foz/?utm_source=share&utm_medium=web3x&utm_name=web3xcss&utm_term=1&utm_content=share_button
>>
>order ram
>its not compatible afterall
>order another pack
>gets delayed
>get new date
>get email that its out of stock
>find a new pack miraculously
>arrives today to postalbox at my apartment building
>get message that the box is full and the package will be redirected to a 10min drive away

FUCKING CURSED UPGRADE REEEEE
>>
>>107020086
this, if he finetunes Qwen Image Edit and makes it VAE-less then he'll be the true hero
>>
for me, it's all about the SongBloom.
>>
>>107020091
I'd feel bad for you but there are zero local models right now worth upgrading your ram for

Unless you went 16 -> 32 and I wouldn't even consider that an upgrade but more of actually buying the entry ticket to the amusement park

>>107020101
your name is so fucking gross bro I hate you
>>
File: 1745264814353293.jpg (719 KB, 1336x2008)
719 KB
719 KB JPG
>>
>>107020068
hobby is full of those expressing and exploring their aesthetic sensibilities for the very first time
>>
>>107020118
Actually I think burn-out where you can no longer see the slop is the bigger issue. Because now that I've stepped away from genning and I'm in the waiting phase of the cycle sdxl models are unpalatable to me. Actually they were equally unpalatable at the start of this year but still
>>
>>107020153
>where you can no longer see the slop
some simply cannot see it no matter how often they look, is the thing
>>107020153
>sdxl models are unpalatable to me. Actually they were equally unpalatable at the start of this year but still
i agree in that the only worthwhile models are the massive finetunes and not the barrage of cancermixes

you can really stretch your enjoyment finding new artists to train little loras on desu, short of something truly new and exciting
>>
>>107020107
that's some chroma-looking mesh there
>>
I spend all evening genning 1girls with the largest possible breasts and then I go to sleep dreaming about the girl I'm in love with irl who hardly even has breasts really. I see the contradiction, but I don't feel it.
>>
File: 1753845210544158.jpg (779 KB, 1336x2008)
779 KB
779 KB JPG
>>107020198
here it is with "fishnets, mesh" in the prompt as they were not before https://files.catbox.moe/w9wh1l.png
but i agree netayume's details could be a bit tighter
>>
>>107020241
dont fret it, fishnets are hard to do for a lot of models (sadly). Chroma is quite terrible at it too. Didn't actually try Qwen.
>>
>>107020153
I went on Artstation this weekend and was stunned by the quality of the art there. I've been staring at slop for so long...
>>
File: radiance.png (3.36 MB, 864x1488)
3.36 MB
3.36 MB PNG
>>107020025
It is a very cool model! So Reddit picked up on it and actually made nice gens - cool.

>>107020086
It takes ~3x+ as much to train. it seems wise to train radiance first actually unless pony or w/e provides the resources.

>>107020093
Qwen Image Edit also is a somewhat different training dataset than for Qwen Image.

Also not sure VAE-less will make it better, Qwen does seem to have a nice VAE. I guess you'd have to try?
>>
File: 1756153334834888.mp4 (3.86 MB, 1280x718)
3.86 MB
3.86 MB MP4
>>107020294
>Also not sure VAE-less will make it better
because it won't compress the image if you edit it, it'll just change the area you wanted to change, that's really important for professional work, you start with a png you don't want to end up with a low quality jpg because you did the edit iteration too many times
>>
File: 1748472626816717.png (40 KB, 761x391)
40 KB
40 KB PNG
>>
>>107020254
qwen and hunyuan can both do them pretty well
>>
>>107020294
>it seems wise to train radiance first actually unless pony or w/e provides the resources.
I think that continuing to train the flux base, especially having already fed it a bunch of 512x images that negate the benefit of no vae, is a waste of time. Sunk cost
>>
>>107020334
loool
>>
>>107020334
Poorfags should not be allowed into this hobby, and quants/speed loras were the worst thing that ever happened to local.
>>
File: 1756117263422441.jpg (1.86 MB, 4008x2008)
1.86 MB
1.86 MB JPG
>>
File: get it?.png (1.07 MB, 872x1025)
1.07 MB
1.07 MB PNG
>>107020025
>Radiance absolutely is looking insane
it's true that it's so shiny it's radiant
>>
>>107020239
>I spend all evening genning 1girls with the largest possible breasts and then I go to sleep dreaming about the girl I'm in love with irl who hardly even has breasts really. I see the contradiction, but I don't feel it.
I don't see any contradiction. I am interested in women with J cup breasts for 10 minutes at a time a couple of times a week at most. I am entirely disinterested in dealing with a large breasted woman beyond masturbating to her. I don't want to deal with the personality, society, upkeep etc required to have and keep one in real life. I am not interested in destroying the physical chemical pair bonding I have with my real life wife who I love by cheating on her with a IRL prostitute either. These are just a few reasons off the top of my head why AI generating a big titty woman is preferable for everyone
>>
>>107020364
>Poorfags should not be allowed into this hobby
I hope you can run HunyuanImage 3.0 anon, or else I will unfortunately have to call you a poorfag
>>
>>107020364
Poorfags are still not let into this hobby. If they were then these threads would be more active from 60 series shitters and laptop GPU zoomers

By the time "poorfags are let into this hobby", kids are gonna be running around with VR glasses creepshotting their mother's friends under the kitchen table
>>
>>107020370
I've had crushes on a lot of women in my life, but none of them ever had big tits. That's the part I don't understand. The women in my family all have DD's (and not from being fat), so you'd think I'd be genetically drawn to that, maybe I am, but as it happens I've never felt that kind of strong desire for anyone larger than a B-cup.

Other than my sister of course.
>>
File: ComfyUI_08006_.png (1.79 MB, 1152x1152)
1.79 MB
1.79 MB PNG
>>107020086
>Qwen
20B model, so overbaked it lacks variety, realism LoRAs are at best on par with Flux despite lack of distillation (realism tunes completely breaks anatomy among other things if you ask them to do anything novel outside of the training data).

It's good as edit model because BFL never gave us a proper Kontext Pro (which equivalent of Dev), but tuning it would pretty much yield worse or similar results to Chroma at a bigger cost.
>>
Reminder the 5000s series cards are still in poorfag tier
>>
>>107020453
Severe cope. Base qwen is already much more flexible than flux was when it was released (flux couldn't even do basic animu), and it's not an intentionally crippled distill like flux either.
>>
>>107020453
a lot of what you're talking about is solely the result of retards refusing to accept that they HAVE to caption Qwen with proper natural language or it won't work properly.

That said last time I checked translating my prompts into Chinese while running one of my loras (which was only captioned in English obviously) produced nearly identical results, so I don't think the multilingual support even has anything to do with captions. Meaning even if you captioned it in proper natural language Chinese it would probably still work.
>>
File: 1739864452614267.jpg (1.94 MB, 4008x2008)
1.94 MB
1.94 MB JPG
>>
>>107020475
>(flux couldn't even do basic animu),
To be clear it could but only in a very narrow range of style.
>>
File: ComfyUI_08023_.png (1.99 MB, 1152x1152)
1.99 MB
1.99 MB PNG
>>107020254
>Chroma is quite terrible at it too

It's all about the prompt you give it. Don't have a skill issue and it won't suck.
>>
>more boring asian woman slop
>>
>>107020606
>complainer is an eternal nogen
>>
File: file.png (2.82 MB, 832x1488)
2.82 MB
2.82 MB PNG
>>107020328
i think as long as AI is good about inserting a fitting thing into whatever area you change, it'll be ok either way?

>>107020348
slower training AND THEN 4x or 8x larger images? it's just getting harder to do for not very much benefit most of the time.

no, 512px images don't really "negate" the advantages of the pixnerd method? they may not be the most optimal thing but you'll find it difficult to train on 100mp images perfectly sampled from 400mp+ photos
>>
We are grateful that chromajeet attaches his same shitty gens to every post, otherwise i'd think there was more than one retard in this thread delusional enough to believe chroma is a anything more than a failbake
>>
File: radiance.png (1.48 MB, 832x1488)
1.48 MB
1.48 MB PNG
>>
Is there a single place left on the internet that isn't overrun with newfags who cannot figure out how to read text in command line, pajeets spamming pure sloppa or coomers that are satisfied with 1girl standing?
I miss 2022
>>
>>107020823
>2022
>>107020651
>We
>>
I wish we could go back to the days of Dall-E Mini
>>
File: ComfyUI_08045_.png (1.65 MB, 1152x1152)
1.65 MB
1.65 MB PNG
>>107020610
They are nogens for a reason.
>>
File: ComfyUI_temp_mqspd_00001_.png (1.41 MB, 1027x1027)
1.41 MB
1.41 MB PNG
>>107019768
>>107019711
I'm messing with this now, because, surprise, I dig qwen prompt adherence. And it's so very fast with lighting lora I can iterate almost instantly.
Results are inconclusive. Had images previewed instead of saved in Comfy, so nothing to cherrypick from.
All in all, the attack vector for now is splitting sigmas at first step and dropping first sigma to about 0.85-0.95 in a heavily front-loaded scheduler (say, beta with alpha 3: beta 20) so that first step denoise defines low frequencies. And then follow up with 3 steps of a well-behaved sampler which solidifies the intial composition (which is *occasionally* out-of-distribution) into the typical qwen look. Results are a mixed bag of typical qwen gens and worst imaginable murky brown chroma gens (but with qwen prompt following), with very little middle ground. This is understandable when one considers the nature of sigmas, but still annoying. I'm also trying to keep total gen time under 15 seconds for 512px on 3090 because I ain't got all day.
Adding external noise at the start is a compromise between qwen's RL'd "mean" image and low sigma brown bog, but rn it introduces trypophobic patterns I couldn't manage to contain... yet, or maybe never. I've seen it's source, though, it's the edges of DiT patches in the low contrast areas.
The next attack vector would be getting rid of the scheduler entirely and passing noise in hand-crafted dosages. Why bother scheduling when you're denoising just 4 steps? Not sure how different that would be.
>>
>>107020591
this is the only example of it from chroma that I've seen that wasn't severely artifacted, so I'm curious about what you did
>>
>>107019711
>>107019768
Mind spoonfeeding me a catbox or two for qwen? I'd like to start messing with it too, but lately have no time to keep up with various models and their quirks
>>
When will Chroma become good?
>>
File: i2741.jpg (281 KB, 1408x768)
281 KB
281 KB JPG
>>107021193
when pooma 2 is out
>>
Retard here, is there any way to run stable diffusion with vulkan, like koboldcpp vulkan support for llms?
>>
>>107021193
when he decides to stop this fluxenstein shit and finetune qwen instead
>>
>>107021304
>stop this shit and finetune worse shit instead
>>
>>107021118
that's zoomed in, try with a full body shot or with a fitter fishnet, models spaz out
>>
>>107021149
unlike the other anon i don't really have anything interesting. it's basically the template from comfyui but loading a gguf with the comfyui-multigpu loader + loading loras with the rgthree power loader for convenience.

i could share it later but really you might as well use the template.
>>
>>107021368
yeah, it looks like a very low frequency pattern picked to be too large to exhibit the issue.
>>
>>107021374
Sure, but what about prompting? Does it need huge paragraphs? How do you get anything stylistic or drawn? Those are mainly my issues with post-illust models.
>>
>>107021220
nice I'd like to see the worklow
>>
File: 1743347882618777.png (197 KB, 1284x1020)
197 KB
197 KB PNG
i this still how you upscale shit or should i be using a more up to date method?
>>
>>107021382
you don't need huge paragraphs, but you can use them and get more consistent results. it's easy to get generic "styles" like "anime" or "renaissance painting" or "pencil drawing" but you'll struggle to get more specific than that. if you request a subject associated with a particular style, it might override whatever you requested.
>>
>>107021478
Can you get consistent original characters and styles across different gens?
>>
>>107021496
sort of, it doesn't like to deviate too much so it will generally give you the same result for the same detailed prompt. you may alternatively want to generate the character in a neutral pose and then use qwen image edit to place it in different scenarios.
>>
File: 00043-3590784361.png (2.78 MB, 1824x1248)
2.78 MB
2.78 MB PNG
>>
File: 00008-2156361139.png (2 MB, 1824x1248)
2 MB
2 MB PNG
>>
>chroma + lanpaint
holy shit why didn't anon tell me this based combo
>>
>>107021681
please elaborate
>>
>>107021698
no
>>
>>107021698
you can inpaint realistic pussy for example
>>
nigga i think 4chan barely stays afloat as it is
>>
>>107021738
catbox please
>>
File: 00016-455749066.png (2.02 MB, 1824x1248)
2.02 MB
2.02 MB PNG
>>
᠏ ᠏
>>
File: 1747324981125768.png (137 KB, 1297x477)
137 KB
137 KB PNG
>>107020067
yes, with multigpu nodes. I only discovered this recently. been gimped by kijai this whole time AAAAAAAAAH
>>
>>107021382
>Does it need huge paragraphs?
I just describe things in sentences until it's about the thing you wanted or until I find out qwen can't do it. Most prompts give nearly identical images so you will have to wildcard and vary prompts a lot to get actually different images. I haven't figured out a particularly good way yet.

SDXL or Chroma or Neta Yume or Wan or others vary vastly more with just a different seed.

> How do you get anything stylistic or drawn?
2d digital drawing, pencil sketch, watercolor painting. cubist acrylic painting. statue, ... a lot of that type of thing works. some classical art styles work too.
>>
>>107019845
liquid death
lucid dreams
lethal dose
>>
>>107022014
add qwen and lumina to this.
I think lumina uses the flux vae maybe? i dont rememebr
>>
>>107022014
add qwen and lumina to this.
I think lumina uses the flux vae maybe? i dont rememebr
>>
>>107022014
add qwen and lumina to this.
I think lumina uses the flux vae maybe? i dont rememebr
>>
stop d*bo
>>
>>107022059
Are restarts a meme? I usually train with cosine with restarts and Adamw8 but I heard they are useless with adaptive optimizers. Anyone knows if this is true?
>>
>>107022059
Are restarts a meme? I usually train with cosine with restarts and Adamw8 but I heard they are useless with adaptive optimizers. Anyone knows if this is true?
>>
it's funny how Qwen Image Edit doesn't give a single fuck about the original image's style, it'll add the new element like you'd add something on paint and call it a day, that's soo lazyyy
>>
using a seed that was made from lightx2 wont work good. are seeds even still relevant for trying to get the same output with wan?
>>
that is indeed the issue
>>
>>107022081
they don't, the only other super-high-image-count finetunes ever done (starting from a literal base model as in SDXL) are like Animagine and BigASP. Starting from something like Chroma which has heavy concept knowledge already though is going to be a totally different story even if Lodestones insists it's itself a "base model". A high-diversity high-quality dataset could make it way more coherent with nowhere remotely close to millions of images.
>>
>>107021900
doesn't wan do this by default anyway?
>>
>>107022069
that's why I'm still using kontext
>>
>>107022087
I am skeptical of the claim but would be great if you can publish the lora that can do that.
>>
>>107022089
If you can't tell what model an output comes from you should just give up honestly
>>
hunyuan 3 is supposed to be like that, like an LLM that can generate images (why it's so big)
>>
>>107022090
i mean that for example the native workflow already block swaps to ram and paging file without the kijai block swapping node or the distorch node, no lora needed
>>
>>107022066
nta, Adaptive optimizers set their own internal learning rate. Using anything other than a fixed lr with them is a bad idea all around
>>
File: 00059-1737324999.png (2.76 MB, 1824x1248)
2.76 MB
2.76 MB PNG
>>
>>107022116
I see what you mean.
I actually haven't tried specific styles with chroma, mainly just realism stuff.
Shame that he also did this stupid BS.
>>
>>107022117
>notice how he never posts gens
that sentence is /sdg/ coded btw
>>
>>107022118
If you haven't noticed Chorma can't do artist tags despite having a dataset that includes various artist and styles, this is a conscious effort done by the creator and has historically lead to massive issues with models as we saw with SD3 being dumbed down and the latest victim being pony V7. It always seems to do massive damage to models because concepts and other things that are often put into separate tags or related to some tags become completely fucked and you get stuff like inconsistent styles or broken anatomy like SD3 not being able to produce an image of a woman lying in the grass because they decided to obfuscate anything close to a sex act.
>>
>>107022119
Show me a model that does as well as consistently as chroma at as many styles and is not qwen which is style locked
>>
so the shitter is just reposting posts from the last thread

report
>>107022129
>>107022100
>>107022096
>>107022074
>>
>>107022126
>Flux creators give training guidelines
>decide to ignore them when the goal was to make a flux model without restrictions
Also since you're pointing out single gens it's clear to me you're not at a level for this discussion, if a model is not consistent then it's worthless especially when you have to put so much time investment into creating chroma gens compared to other models even on top end hardware.
I think we're done here because you fail to realize that 1 out of 5 gens is not acceptable when you're trying to maintain a look or style especially at the dog shit speeds chroma runs at
>>
Hopefully they start fully supporting python 3.14 my generations are 2x to 3x faster with it but it's extremely unstable.
>>
>>107022140
Or deliberately p-hacked BS that is useless in practice.
So many fucking papers on arxiv that does not yield promised results when trying the experiment on different data/params.
>>
>>107022116
nice try diddy
>>
>>107022145
Hypothetically if I were to train one where would I even upload it at this point?
Can't register to that broken piece of shit seaart at all and tensorart asks for glowing email.
>>
>>107022148
no, pixnerd doesn't improve the training speed and works on the pixel space, that one works on something different than pixel space and latent space
>>
melty
>>
Use a non realism lora and post 5 back to back seeds using the same prompt, I'm not fucking spoon feeding you
>>
id rather have 99% perfect details with crazy good aesthetics and prompt following AND style flexibility and then just fix small details with adetailer over having nothing but the same image with better small details due to being overfit to death on a single style

Its one or the other, at least until the vae issue is fixed
>>
What is going on
>>
>>107022182
Him getting in Loadstone's ear to make a equally shitty model. The irony is this piece of shit is worse than chroma and instead of following THE FUCKING DOCUMENTATION he decided to listen to pony fag to make a stillborn model that can't even work with loras.
Fuck both of them for burning so much time and money
>>
>>107022182
see
>>107022140
>>
>>107022185
only low iq browns cant extract the 95% of good from something and focus on the fixable 5% bad
>>
Hey guys, I need a little bit of help.

Instead of generating videos locally I wanna use an API that uses wan or other models. I'm trying Pollo API, i can manage to make it work but the generated videos are midly NSFW.

Is there a way to directly use an existing API (could be another one than Pollo) to generate full NSFW (porn) content using image to video?
>>
>>107022191
show me a image remotely similar both style wise and quality wise from qwen or sdxl or illustrious at the same res, also that is the point of chroma radience which is training, all models have that issue with tiny details
>>
>>107021440
there are native comfy nodes. what is even that monstrosity? WAS?
with ESRGAN or related upscalers (like yours), you DONT really decide the upscale ratio, it's baked in the model. how does the upscale_by even work here? these kitchen sink nodes are fucking garbage.
There are other ways to do upscaling, one is controlnet tile upscale (model dependant), another is LATENT upscaling, and finally we have the new entry on the block, seedvr2 upscaler model.
DESU I mostly do seedvr2 upscaling nowadays (I've got the vram, it's fucking HUNGRY), for the simple reason that with CNET tile and LATENT upscale you need to do the upscaling the moment you're generating if you have a complex workflow.
With esrgan or seedvr2, you don't care, so you can just handpick any image you have and shove it in there.
>>
>>107022217
NTA and I have no experience but as far as I understand the most you can do is keeping separate parts of the model (TE, UNET, VAE) on different GPUs. This can provide a moderate speed up as opposed to constant loading and unloading (especially on lower VRAM systems) but it's not even close to what you think can be done.
>>
Is it true you can stack multiple cards or is vid gen more nuanced~?
>>
i have trained pony loras in the past, is the process the same for illustrious? can i follow the same guide kohya guide i used for pony?
>>
JANNIES
>>
>>107022226
I mean I can run it on 12gb VRAM but it runs like shit.
>>
>>107022219
I'm not talking about VRAM limits or how to offload what to where
>I have no experience
>it's not even close to what you think can be done
getting mixed signals here. I've just given you a list of different upscaling methodologies, yes it's not exhaustive but I'm not sure what you're asking for then.
>>
>>107022230
Where's your songbloom, bitch?
(shoves you)
>>
>>107022238
Isn't there a node that lets you offload vram to another card? So instead of going to system ram, it goes to the 2nd gpu. This wouldn't speed up video gens, but atleast you'd be able to use higher parameter models without worrying about OOM'ing. Also great for higher res videos
>>
>>107022217
> how does the upscale_by even work here?
it downscales, keep or upscales upscaler output, is it so hard to assume?
>>
>>107022248
You can, but probably not with comfyui. Most newer models support multi-gpu in their reference implementation.
>>
>>107022248
how is it downscaling? area? lanczos?
>>
>>107022256
sdcpp can but pootorch is just jank
>>
File: media_1761091264.png (1.28 MB, 768x1280)
1.28 MB
1.28 MB PNG
When I see your pictures I'm comming to the conclusion that online models are better than local. PixAI is superior.
>>
I mean, on /gif/ I can see full-blown NSFW content that I can’t seem to generate using the Pollo API.
I’ve tried with WANx models, Kling, and the Pollo model itself.
Is it a matter of the model training, or does the Pollo API apply some kind of moderation/filtering?
>>
>>107022217
> with CNET tile and LATENT upscale you need to do the upscaling the moment you're generating if you have a complex workflow
elaborate
>>
>>107022261
>posts a terrible plastic slop
>>
>>107022261
not sure what you mean because of how you are asking the question but character loras work just fine with wan 2.2, you would need to train them for wan 2.2 and then use them as you use loras normally for any mode
>>
>>107022242
yes there are multiple (KJ and MultiGPU) but I never pointed out OOM issues in my post
wait are we fucking getting the thread botted? what the fuck is happening?
>>
>>107022263
this is one of the saddest posts I've ever seen on 4chan, why are you so passionate about defending this model?
>>
>>
>>107022264
As someone who is only interested in text to video (and is burnt out of that and waiting for a local sora 2 so I can generate teens acting bratty with sound before going back to genning),

The fact that local models are in such a state that you even wrote a three-paragraph persuasive essay is top grim. I don't think it could get any grimmer than this. At least time will heal all wounds and it's just a matter of surviving and waiting for things to get better.
>>
File: 1760046818798437.jpg (1.36 MB, 1248x1824)
1.36 MB
1.36 MB JPG
>>
>>107022265
>another wall of text
this guy is seriously mentaly ill
>>
>>
>>107022226
> i have trained pony loras in the past, is the process the same for illustrious? can i follow the same guide kohya guide i used for pony?
yes

it's scatPICKLER is spamming the thread for sure
>>
>>107022268
I used diffusion pipe's defaults. LR 1-e4, 68 epochs, over 500 images, took me about 3 days
>>
>>107022282
that sounds great. i tried ace step and it was crap it couldnt even follow a consistent beat.
>>
>>107022270
rank 64 seems kinda high DESU, at least if you mean that in Kohya scaling where it's gonna thus be > 500 MB
what training settings? was it trained at 1024x1024? If not that's your problem lol
>>
>>107022256
> how is it downscaling? area? lanczos?
look at that screenshot again
>>
>>
>>107022269
I have now idea what you're saying buddy. I just want to generate NSFW content lol, i'm trying to find the best solution, could be pollo or anything else I dont care
>>
>>107022273
it's funny how Qwen Image Edit doesn't give a single fuck about the original image's style, it'll add the new element like you'd add something on paint and call it a day, that's soo lazyyy
>>
>>107022293
can you fuck off nigger
>>
>>107022292
Same trend I am seeing as I go down my list of VAEs and eyeballing the error image brightness. Flux and Hunyuan are best for the random 3 anime images I tested.
>>
>>107022300
What does it do anyway? It's compression?
>>
how the fuck do you search while banning tags in civitai?
>>
>>107022395
Don't ask me to find logic in that.
>>
File: chroma___0006.png (1.4 MB, 1152x896)
1.4 MB
1.4 MB PNG
>>107022297
born to coom, forced to troll
>>
>>107022413
now this looks like chorma
>>
>>107022425
i don't get
>>
>>107022447
now this looks like shit
you understand now?
>>
File: chroma___0002.png (1.63 MB, 1152x896)
1.63 MB
1.63 MB PNG
>>107022425
>>
>>107022447
nigga claimed previous goth thots gens were chroma, but they were qwen or wan
>>
>>107022457
based fellow satanic sitcom coomsumer
>>
Why does /ldg/ have the most abhorrent taste in women?
>>
File: ComfyUI_temp_dbeyl_00048_.png (2.28 MB, 1440x1120)
2.28 MB
2.28 MB PNG
>>107022467
we are different anons
>>
>>107022478
post your 1girl. I'll wait
>>
and this >>107022472>>107022482 is wan or most likely qwen
>>
File: QwenImg_00008_.png (1.87 MB, 1440x1152)
1.87 MB
1.87 MB PNG
>>107022478
you'd think I'd marry my gens or something? what does it matter
>>
>>107022478
I expect it to be banned the same way nude loras for qwen image edit systematically get banned.
>>
>>107022467
This is what chroma radient is upposed to avoid.
>>
>>107022502
Hunyuan 2.1 has a 32ch VAE
Actually, I don't remember if it uses it for the full process or only the refiner
>>
>>107022489
Yep.
Despite being still 4 channel the SDXL vae was a lot better than the turbo garbage in SD 1.5 (for which you wouldn't even need comparison node to see how much it destroys the image)
Hence why it was still a noticeable improvement over its predecessors in term of fine detail.
>>
>>107022413
ramtorch will save us unironically
>>
>>107022454
it's VRAM usage. the vae is for converting to and from latents. openai already did this for years because of the compute they have access to. it's faster but the entire point is sd1.x having a vae in the first place was to fit on consumer gpus
>>
>>107022529
I have now idea what you're saying buddy. I just want to generate NSFW content lol, i'm trying to find the best solution, could be pollo or anything else I dont care
>>
>>107022489
its chroma lol, I tried qwen before but it seemed to me like it gave me the same image over and over but when I see images like >>107022502 it makes me wanna try it more, but I feel that qwen is stiff AF when it comes to image generation
>>
>>107022551
My bad. I didn't realize more modest SDXL finetunes below pony/illust level still trained on millions.
>>
>>107022551
try lowering the shift, might get duds but loses that porcellain look
>>
>>107022566
posted wrong image at first:

pony is better than originally thought (still not good) but it has major issues, he did not do uncond dropout

To somewhat fix it spam tons of random nonsense tokens into the negative

It is also somewhat locked to long prompts = good image
>>
>>107022478
because most are reddit tourists and dont belong on 4ch
>>
>>107022574
I think a lot of the initial tests disregarded the prompting instructions and tried to a-b test models with exactly the same prompt, getting terrible results.

pony 7 still doesn't look great, but not as bad as the first mutagenic horrors people initally posted.
>>
Why aren't jannies doing anything?
This spam is actually really fucking disruptive.
>>
>>107022582
No it's the same faggot that griefs this thread daily, sora is now irrelevant and the api bullshit gets no traction so he's cycling through his autistic bullshit.
I do think chroma is good for some memes if you can cope with the art constantly being inconsistent between seeds
>>
File: ComfyUI_00024_.png (1.35 MB, 1152x896)
1.35 MB
1.35 MB PNG
>>107022551
i think it might actually be chroma lol
>>
Reporting you for trash content and furry pedophilia, and then blocking you.
>>
>>107022607
Is that the first model? So can we expect a distilled/smaller model in the future? I want to interact it like a LLM model with LM Studio. Kinda annoying trying to install billion different dependencies to generate image with python for every new model.
>>
>>107022609
You're too stupid for this conversation please go back to your containment thread, no other model besides chroma is unable to maintain the same consistent art style even with the aid of loras
>>
>>107022566
Does it fix the composition stiffness? or I was just being worked by the 1girl qwen (probably overtrained) lora, because it always gave me the similar photo over and over, for example I generated this selfie but no matter if I changed the prompt, If I had mirror selfie in the prompt I would get this exact type of image, the little image variation is what killed my interest for qwen
>>
>>107022623
shoot me a prompt, I'll see
>>
>>107022623
I see what you mean.
I actually haven't tried specific styles with chroma, mainly just realism stuff.
Shame that he also did this stupid BS.
>>
>>107022642
Now post 5 images with the same prompt within the same batch so I can laugh at you
>>
>>107022642
If you haven't noticed Chorma can't do artist tags despite having a dataset that includes various artist and styles, this is a conscious effort done by the creator and has historically lead to massive issues with models as we saw with SD3 being dumbed down and the latest victim being pony V7. It always seems to do massive damage to models because concepts and other things that are often put into separate tags or related to some tags become completely fucked and you get stuff like inconsistent styles or broken anatomy like SD3 not being able to produce an image of a woman lying in the grass because they decided to obfuscate anything close to a sex act.
>>
ignore the anti chroma troll, notice how he never posts gens
>>
File: flux__004.png (1.24 MB, 1152x896)
1.24 MB
1.24 MB PNG
>>107022665
i'm anti-chroma, here's a gen
>>
>>107022674
Show me a model that does as well as consistently as chroma at as many styles and is not qwen which is style locked
>>
>>107022674
actually this survey cannot be reproduced
>>
>>107022651
>>107022659
Idiots, we are talking about Qwen here, not Chroma.

Chroma doesn't have image variations issues like Qwen does
>>
>>107022683
It's a known issue and one of the reason details like hands or feet look bad.
>>
>>107022683
It's a bot.
>>
>>107022687
Create a torrent, port forward the client port, and post the magnet link on the civarchive page too as another mirror
>>
>>107022683
there's a spambot bombarding the thread if you havent noticed, jannies are apparently asleep as I'm doing my due diligence.
>>
how do i specify i want the female wearing sandals and not the male with XL?
>>
>>107022691
thats on the usual initial toy model size, seeing if it actually scales is what kills papers
>>
File: krea__002.png (1.66 MB, 1152x896)
1.66 MB
1.66 MB PNG
>>107022675
i'm not interested in your debate. use whatever model you like. i do not like chroma though.
>>
>>107022695
again, I tried this bullshit and the output was slopped as fuck and the background was grainy. 480p high, latent upscaled 1.5x to 720
>>
>>107022698
I'm a local fag but bored with muted videos. just paid for a month at a discount test the backlog of 1girl images i have on my output folder.
>>
>>107022577
>>107022700
this responses have nothing to do with what im talking about. can a janny get rid of this bot already?
>>
>>107022709
And what are you gonna do different from the 6 gorrillian upscale models that already exist
>>
File: ComfyUI_temp_mpreu_00030_.png (1.93 MB, 1320x1024)
1.93 MB
1.93 MB PNG
damn so the dead internet theory is real and I've replying to bots this whole damn time
>>
>>107022721
I know I need to, but I'm lazy... What you got nigga?
>>
>>107022721
The schizo is trying everything in their power to kill /ldg/. It's actually funny.
>>
>>107022730
> you are fucking degenerate learn how comfyui works and treats workflows in particular first mr i know how's better
>>
the Any Switch nodes in rgthree already support dynamically typed inputs and outputs, you're a fucking retard. You can dynamically type the output depending on the input you're connecting it to, likewise for inputs.
kys nocoder retard
>>
>>107022730
it's baffling as this is the fastest moving thread on /g/, but I guess jannies are too busy erping in IRC to do anything about it.
>>
>>107022747
because it generalized to understand that the only purpoes for a digital anus is for it to be fucked, equating ti to a vagina
>>
>>107022747
polydactyl
>>
>>107022775
you are fucking degenerate learn how comfyui works and treats workflows in particular first mr i know how's better
>>
File: dmmg_0013.png (1.5 MB, 896x1152)
1.5 MB
1.5 MB PNG
>thinking you can out-spam 1girl
>>
>>107022747
it's only the fastest moving because of the schizo, but then again i guess that every fast moving thread on 4chan has schizos bloating them.
>>
>>107022788
I think it's great the average normie is so clueless, so the anti-AI mob calms down a bit while open source flourishes. Though Qwen definitely has a bit of yellow tint (Alibaba trains on synthetic data unfortunately).
>>
>>107022790
another retard that doesnt understand how python works.
you can have outputs be dynamically typed (or having a dropdown for casting to your desired type), but everything is mathable together in python, makes absolutely no sense to have 09593421906234 nodes all to do casting and implementing math operations for each fucking type, there's absolutely no fucking need retard. Go learn python fucking monkey retard, kys
>>
>>107022790
it's actually always fastest moving, along with /aicg/ followed by /lmg/
>>
Bros, I'm home.
>>
>>107022805
All this extra shit just for latent upscale and you're stuck with 1.5x, nothing else works or you get the mismatch error.
>>
>>107022808
Top right and bottom right are very much my shit.
Hnnnng
>>
>>107022808
>38 CL
>probably 4 sticks with 2 IMC cpu
*vomits*
>>
>>107022816
don't be mean anon,
>>
>>107022808
you don't even need more than 64GB ram. you did with WAN2.1 because torch compile and such, but WAN2.2 doesn't need those optimizations. I haven't gone over 96GB ram usage in months
>>
Hello, this is the Chroma anon from last thread that made the rank 64 lora.

Please test it and report back. I would like to get feedback on what are the most glaring issues Chroma still has.

https://huggingface.co/chromanon/goldenchroma
>>
>>107022818
it's a shame because for 'general' speed it's going to be a big downgrade to regular CL30 6000mhz ddr5 ram (which you can get in consumer platforms at 96gb currently, im not sure if there are any 128gb 2x kits capable of doing this).
This particularly matters for LLMs, not so much videogen.
>>
>>107022293
>Pixai
This is a local general, why you contribute nothing but a closed service?
>>
>>107022815
wat

>>107022816
>>107022832
I still have my 64gb 6000mhz lying around if I need it for something in particular.

>>107022825
You very much do, I can't run loras when doing 720p. And now I can keep doing other stuff while genning and not ooming.
>>
Just imagine how pathetic his life is THO kek
>>
File: ComfyUI_temp_fuhfo_00013_.png (2.02 MB, 1088x1920)
2.02 MB
2.02 MB PNG
>>
File: psxpolaroid_0001.png (1.18 MB, 896x1152)
1.18 MB
1.18 MB PNG
>>
>>107022829
I think it increases gen times, i've noticed when i forgot to disable them, i had only genned 81 frames but i had the context on and set to 41 frame context size. And yeah they gen i'm doing is taking way too long, its been stuck at 33% for a while now... and it just jumped to 67% thank fuck, i thought it was having a fit. but yeah its effecting the gen time a lot, so only use the context window when you need it.
>>
>>107022854
how bad is your autism? they look pretty close
>>
>>107022928
scarlett johansson looking rough these days
>>
File: dmmg_0037.png (1.43 MB, 896x1152)
1.43 MB
1.43 MB PNG
>>107022947
>(scarjo x margot x kbell)
keen eye anon
>>
>>107022857
I have a feeling we are gonna be eating so good in the new year or Christmas coming.
>>
>>107022857
I have a feeling we are gonna be eating so good in the new year or Christmas coming.
>>
>>107022976
never got into wan and alike.

so this does not work well on faces?
it is clearly different when it gets animated
>>
how do you know if your lora is shit due to the images you used, the captions, or the training settings themselves?
>>
>>107023058
kind of, yeah... Its actually meant for doing longer videos but it stop's oom's and it works very good. Its really underrated or just over looked desu because its a new beta node guess.
>>
>>107023058
Context Window is actually a frame batch node
>>
File: 1761132801891331.png (1.28 MB, 768x1280)
1.28 MB
1.28 MB PNG
>>
107023073
107023078
wat
>>
>>107023096
Describe what you want
>>
>>107023095
I wish there was more feet sloppa...
>>
>>107023096
bot is shitting up the thread by reposting old posts
>>
File: dmmg_0024.png (1.66 MB, 896x1152)
1.66 MB
1.66 MB PNG
>posting through the pain
>jannies wru?
>>
>>107023102
The example is a single prompt for everything, but I'm not sure if it will do well with a sequence of different events. It looks like it should be possible to prompt each segment differently, but it might get weird when it fuses them together.
>>
>>107023104
I got you anon just just tweaking to hit the right spot and i'm getting close to the sweet spot. this i think was only 4 steps using old wan2.1 lora rank 64 iv2 in high at 5.00 strength and old wan2.2 low in low at 1.0 strength. But I'm using unconventional sampler settings that some anons here think is retarded because it uses cfg but honestly the gen time is not all that much more time and you get high quality with prompt adherence so its balance I seek and not just the fastest speed.
>>
might as well cool on a new bread anyway since all those posts will get wiped
>>
>>107023150
the car doesn't have any windows anon, pay more attention
>>
>>107023150
>since all those posts will get wiped
yeah in two more weeks at this rate
>>
most if not all flux loras work in chroma so try them.
>>
>>107023171
you don't understand when you install requirements.txt it will check and re download most probably.
>>
>>107023172
speed up loras are for losers
>>
File: image_00002_.jpg (792 KB, 1240x1672)
792 KB
792 KB JPG
spooky door

>>107023058
Bad dataset is the usual reason for bad results. Use AdamW or something else that just works.
>>
>>107023058
What model architecture are we talking about here?
>>
>>107023199
can you go back to posting women?
>>
>>107022829
I was the guy you were talking to, I'll have to see how this compares to my own Chroma experiments
>>
>>107022791
Qwen is ridiculously 4o-maxxed specifically at 1024x1024 yeah. Not really at 1328x1328 or the other weird standard Qwen resolutions though
>>
File: 1761497463611920.png (898 KB, 720x960)
898 KB
898 KB PNG
>literally change nothing about my workflow
>regression into sparkly bullshit
cmon dawg
>>
>>107022691
I've seen this kind of shit before, they seem to just randomly repeat recent comments that other people already made, nothing they say is ever original. Not really sure what the intent is by whoever is deploying them, maybe just trolling
>>
>>107022829
Better stick to one type of dataset if you want to improve anatomy with lora. I would remove cgi stuff and just stick to photos
>>
>>107023309
A lot of things are redundant old versions that won't be needed anymore or from nodes that were deleted after testing
>>
>>107023267
why would hair fly in the car?
>>
Is the 5b model of wan2.2 useless, or am I doing something terribly wrong?
>>
File: image_00007_.jpg (723 KB, 1240x1672)
723 KB
723 KB JPG
>>
>>107023340
why? Is Chroma against rules too? Did Flux krauts do this?
>>
>>107023343
There's several good loras for Chroma just civitai are cucks and take them down almost as immediately as they get uploaded.
>>
>>107023340
your question is useless
>>
>>107023354
Most of its community still thinks Flux is SOTA. And then you have all the Chinks who have successfully shilled Qwen.
>>
Discord trannies having a melty
>>
>>107023363
I'm talking Wan btw
>>
>>107023345
>>107023354
>>107023357
I'm not comparing wan to anything, just asking if the 5b model is supposed to look like the will smith spaghetti gif because of it's size, or if I fucked up the setup
>>
>>107023379
I agree that Chroma has its issues but to be fair to Chroma that's not the correct resolution for 1.75 ratio in a 1024p model.
>>
>>107023382
Sdg will never be popular :[
>>
Pathetic
>>
>>107023404
its not just here its everywhere in ai
>>
>>107023405
and i will make eat you shit you just posted within a few minutes once i adjust the cfg from 4 to 3.5 and load the q8 and up the resolution to full 720p then you can eat shit and die
>>
File: image_00008_.jpg (719 KB, 1240x1672)
719 KB
719 KB JPG
>>
>>107023451
>Its pretty good once you figure it out how to use it
>posts a gen with the long torso issue
>>
File: 1706367111.png (1.81 MB, 896x1152)
1.81 MB
1.81 MB PNG
>>
>>107023490
>letting your dog lick chocolate syrup
fucking retarded kid
>>
I will now proceed to post 4 more variations of my gen
>>
>>107023497
Kind of annoying how the lightx2v ruins videos with end frames. It distorts right at the ending but without the lora it works fine
>>
>>107023503
>>107023503
>>
File: radiance.png (2.82 MB, 1024x1024)
2.82 MB
2.82 MB PNG
>>107023340
it depends on subject (and arguably your expectations)

sure enough, the bigger wan is noticeably more competent, so generally you want to try that if 5b doesn't work [well enough]



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.