/g/ - Technology


Thread archived.
You cannot reply anymore.




File: collage.jpg (2.53 MB, 2389x3463)
Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107019684

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Neta Yume (Lumina 2)
https://civitai.com/models/1790792?modelVersionId=2298660
https://gumgum10.github.io/gumgum.github.io/
https://neta-lumina-style.tz03.xyz/
https://huggingface.co/neta-art/Neta-Lumina

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
Death to Worthless Jannies Edition
>>
I fucking love 1goth so much bros
>>
>>107023518
I even got warned for my 'false' reports, these fucking jannies are fucking retarded. 1 look a the archive to see they're 1:1 copies of old comments, but I guess it's above their paygrade with their sub80 iq indian brain
>>
File: 00096-1231330843.png (2.2 MB, 1824x1248)
>>
TOTAL API VICTORY
>>
File: image_00011_.jpg (655 KB, 1240x1672)
>>
>>107023530
https://github.com/kijai/ComfyUI-WanVideoWrapper/issues/1519#issuecomment-3440759925

I tried telling anons this when it first dropped, it can't be as easy as just 1 lastframe because base wan does not have any concept of the previous genned video. It treats each as a new video, so to me it looks more like how wananimate works in taking 5 previous frames to continue the motion since 1 frame isn't enough to continue motion with.

People were making videos with it but those would have been placebo gens.
>>
>>107023547
I just wish wan could handle longer prompts for more actions. It hardly ever works for me even when using context windows and 161+ frames at 81 frame chunks with overlap.
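
For anyone wondering what the chunking looks like, here is a rough sketch, assuming plain sliding windows with a fixed overlap. The function name and the 16-frame overlap are made up for illustration; this is not what any particular wrapper actually does internally.

```python
def context_windows(total_frames, window=81, overlap=16):
    """Return (start, end) frame ranges that cover total_frames.

    Assumption: simple sliding windows; the overlap value is illustrative only.
    """
    if overlap >= window:
        raise ValueError("overlap must be smaller than window")
    stride = window - overlap
    windows = []
    start = 0
    while True:
        # Clamp the last window so it never runs past the end of the video.
        end = min(start + window, total_frames)
        windows.append((start, end))
        if end >= total_frames:
            break
        start += stride
    return windows


print(context_windows(161))
```

With these assumed numbers the last window comes out shorter than the others, which might be part of why the tail of long gens behaves differently.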
>>
File: 1730944079673496.mp4 (1.22 MB, 832x480)
new kijai high lora (MoE) + 2.2 low seems pretty good for motion
>>
>>107023530
jannies lack the nuance required to understand trolling in ai threads unless its blatantly obvious to even the newest of fags
>>
>>107023592
what if the second thing is waiting, then the actual second thing becomes the third thing
>>
>>107023595
With the way things are going, I doubt it.
Well they will, but you'll get the less powerful stuff.
>>
i like how lilbotbro immediately switched to the new bread hmmm
pathetic rlly
>>
File: 1756081406286310.mp4 (1.34 MB, 832x480)
>>107023592
https://huggingface.co/Kijai/WanVideo_comfy/blob/main/LoRAs/Wan22_Lightx2v/Wan_2_2_I2V_A14B_HIGH_lightx2v_MoE_distill_lora_rank_64_bf16.safetensors

low uses the old 2.2 lightning which is fine, the old high lora caused some slomo in general. this one is better.
>>
>>107023608
any VLM model will use cuda, if you're gonna figure out how to make it work for one then it may as well be joycap
>>
Blessed thread of frenship
>>
>>107023610
yeah I guess kek
>>
>>107023616
Can't wait to load my sdxl models super fast!
>>
File: image_00014_.jpg (688 KB, 1240x1672)
>>
>>107023626
Discussion of free and open source models, faggot
>>
>>107023503
So, and ignoring video as its own thing, can we finally say local AI is stagnating at the moment compared with the previous years? Or are there still avenues of research that could yield something innovative for PC architecture and size?
>>
>>107023637
Nice, did anyone try these SVI loras with wan2.2? What weight did you use? How did you make it work for longer videos?
>>
>>107023637
it's not something you can mess up by mistake
>>
>>107023637
What is the basis for your post
>>
File: ChromaLora_00071_.png (1.28 MB, 1152x896)
Reminder to please test this, compare vs vanilla Chroma with the same settings and report back:

https://huggingface.co/chromanon/goldenchroma
>>
>>107023654
if he released the artist ids, I bet people would even overlook the massive flaws
>>
Why is there a jeet bot randomly replying to unrelated posts?
>>
>>107023610
>new
that's from two weeks ago bud
>>
>>107023655
the author has a strange heretic perversion to releasing models to the public which can be prompted with artist names. his own secret versions however do not have this problem
what a faggot
>>
>>107023663
You must have knowledge of the turd to appreciate the beauty of better models like Pixart Sigma.
>>
>>107023668
Chroma is a base model. Why should a base model have ridiculous style tags?

Chroma can be tuned by anyone for any purpose. That's what makes it special.
>>
>>107023663
Nah, it's kids from discord
>>
Complete meltdown
>>
>>107023677
what the fuck? I assumed that was the whole point of clusters. Maybe I should actually read the docs.
>>
>>107023680
i assume the clusters do not allow for prompting individual artists which is a huge fucking kick in the nuts for no reason other than muh morals
>>
>>107023663
>debo/other retards from special diffusion general
>jullien/his pet retards
>anti clankers
>payed saas shills
take your pick
>>
File: 00100-740583725.png (2.07 MB, 1248x1824)
>>107023586
prompt adherence and animation quality for wan2.2 is terrible and was barely an upgrade from wan 2.1. The ovi finetune for wan2.2 is just absolutely terrible with the audio and animation quality. Grok is far better to use but it's annoyingly censored and blocks naughty prompts 75% of the time. Don't have to autistically be descriptive with the prompts on image to video compared to wan2.2. Made some grok videos on the wsg sora thread. >>6012247
>>
>>107023704
I finally succeeded. I was uhh only trying to accomplish no earrings, nothing else in particular
>>
>>107023706
>pivoting straight to another grift
this, as to why anyone would support him after v7, the amount of retards far outweigh people with common sense.
>>
File: dmmg_0015.png (1.67 MB, 896x1152)
>try to gen realism
>get american woman body types
oops
>>
File: 1751453756075991.mp4 (2.43 MB, 640x640)
the 4 characters jump off a cliff into the ocean.

kek
>>
>>107023654
Mostly? I haven't seen anything surpass flux in the same way flux surpassed SDXL, or XL surpassed SD1. To me the new models are slowly tapering out on the increments, chroma (a flux-level model with poorer finer details) remains without checkpoints and Illustrious, a model almost a year out remains unbeaten in its very niche use.

Compare the environment two years ago, with up to 4 promising architectures being developed in parallel and people waiting for releases continuously every few months; this is simply not that same speed. Incremental, at best, on architectures limited by hardware first and foremost.
>>
>>107023717
Well I know there are plenty of elf images that don't have earrings like that. And other models like the SDXL-based ones can make earring-less elves easily enough. It might be that I am passing in a cartoon style image in the workflow that does not itself have earrings, but this nudges the model as well. Perhaps it can do realistic pointy ears or other styles just fine without sticking earrings on. Or perhaps if I fed it an empty latent image it wouldn't have the problem. Haven't tested it. Just noticed it was strange. At any rate, I have added frieren to negatives (along with earrings) but I don't start losing the earrings until the cfg gets all the way up to around 5.0, at which point it is looking rather fried, so I guess it is not going to fix the problem.
>>
>>107023718
Pony v6 was simply a fluke.
When he announced v7 and what was going on with it that already was a clear sign that the model is going to be a failure.
>>
what was the asmongold gen made with? lora?
>>
>>107023719
Same reason Princess Peach always has an Iron Man core embedded in her chest no matter what clothes she's wearing, if almost 100% of the examples of a given object have that particular feature then as far as that model is concerned it is an inherent aspect of that object. It's not like these models make any inherent distinction between clothing and body parts.
>>
>>107023725
Do you mean Miku Hatsune?
*ducks*
>>
>>107023719
>Illustrious, a model almost a year out remains unbeaten in its very niche use.
nah its been usurped. noob however remains but not for long
>>
>>107023737
I'll maybe wait for another kind anon to do the usual MATRIX of CFG x SAMPLERS.
I thought that pony only worked with NL, that's what the official images are using, I'll try a round with booru prompting, but later.
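
If anyone wants to build the matrix themselves, a minimal sketch of the grid is below. The sampler names and CFG values are just examples I picked, and the function name is made up; plug in whatever your UI actually exposes.

```python
from itertools import product


def cfg_sampler_grid(cfgs, samplers):
    """Build one settings dict per (sampler, cfg) pair, grouped by sampler."""
    return [{"cfg": cfg, "sampler": sampler} for sampler, cfg in product(samplers, cfgs)]


# Example values only; not a recommendation for any particular model.
for settings in cfg_sampler_grid([3.0, 5.0], ["euler", "dpmpp_2m"]):
    print(settings)
```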
>>
File: ChromaLora_00074_.png (1.2 MB, 1152x896)
>>107023655
Note to self: Chroma doesn't seem to do "underboob" consistently, need to add data
>>
>>107023748
nodes are kinda shit when it comes to videos. where is a UI that has sequencers and timelines? is that too much for techbros to handle? all this node kikery is a waste of my fucking time
>>
>>107023719
> I haven't seen anything surpass flux in the same way flux surpassed SDXL, or XL surpassed SD1.
Qwen Image does to a degree but admittedly not as much. It's close though. Edit models in general are a pretty new thing.
>>
>>107023753
Did he overtrain the model, choose the wrong parameters, or is Auraflow just that shit no matter what you do?
>>
>style_cluster_1610, score_9, rating_safe, human girl Iwakura Lain from Serial Experiments Lain. She is wearing a sexy halloween witch dress with a witch hat, holding a pumpkin hallowen basket in one hand and putting her other hand behind her head. She has a mischevious evil grin looking at the viewer. She's standing in front of the viewer's house's door, behind her a faintly lit road in a suburb. Cowboy shot. The atmosphere is eerie and supernatural
nailed the character this time, and adjusted some of the prompt to make it simpler to understand where she is. Also I added the word pumpkin for the next gen. Tbh it looks a bit undercooked, I'll try adding more steps, maybe that'll fix it
>>
How many of the 100% of users are responsible for the internet being so cluttered with AI slop? 5%?
Not counting commercial spammers?
I've never uploaded a picture, and I've been around longer than SD 1.5.
It used to be nice to see a few funny AI slops. But now it's gotten to the point where it's really ruining the internet for me.
The technology is great, it's just the people who use it that suck.
Sad life
>>
hes trying SO hard to slide the fuck out of this long dick general kek
>>
>>107023766
I think the problem might be with the style cluster? the default one was for pony fuckers I guess but on the model card in HF I see no mention at all of where these fucking styles are.
but first error I see that I did was this:

>When referring to characters use pattern: <species> <gender> <name> from <source>
>For example "Anthro bunny female Lola Bunny from Space Jam".
something that no other model has required before lol, I'll try by changing some of the prompt around too.
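
To stop myself from getting the order wrong again, a tiny helper that just fills in the pattern quoted above. The function name is my own, nothing from the model card beyond the pattern itself.

```python
def character_tag(species, gender, name, source):
    """Fill the quoted pattern: <species> <gender> <name> from <source>."""
    return f"{species} {gender} {name} from {source}"


print(character_tag("Anthro bunny", "female", "Lola Bunny", "Space Jam"))
```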
>>
>>107023767
you can try to find out what models that app uses and google or ask chatgpt how to run them
>>
File: image_00020_.jpg (712 KB, 1240x1672)
>>107023748
It can probably do it, just needs longer description
>>
>>107023781
Linux Mint is probably the most approachable distro in terms of matching Windows' usability but even then it's a clusterfuck of issues.
It's a-okay but goddamn do I hate linux already. Endless stream of dependencies etc.
>>
So what pissed him off this time?
>>
>>107023797
No, I will post the finished part later.
Now I will go sleep like a baby.
>>
File: ChromaLora_00077_.png (1.37 MB, 1152x896)
>>107023781
Sometimes it does a boob window like picrel
>>
Fuck, I've been choosing spamming/flooding instead of spambot, technically the same I guess.
Good rest in bed while doing it, time to get back up and check my set of 8 genned 720p videos.
>>
>>107023820
double click on the input and connect to both ksamplers
messy but it's cumfart ui get used to
>>
>>107023719
>Compare the environment two years ago, with up to 4 promising architectures being developed in parallel
which were?
>>
>>107023821
I see it's a dense model. Realistically, how long would it take to gen 2 min videos on a 3090?
>>
>>107023825
dunno why schizo is so anti anistudio. I've been asking for an exe since 2022 and finally someone is working on it. fuck python
>>
stable-diffusion.cpp & vision.cpp
>>
>>107023836
It came out 10 hours ago, come on dude, this is ridiculous.
>>
with lightx2v even if the loras are 4 step, dont you generally get better outputs with 6 steps or more?
>>
>>107023825
Flux, SD3, and there were smaller ones such as pixart and that was parallel to famous checkpoints such as pony (I think that was before illustrious?) and controlnets were relatively new. Point is, more things to hope for as they cooked, the same way it is for video now.
>>
>>107023904
yeah I do 6 at least
>>
>>107023840
wdym?
>>
>>107023924
he wraps the main application that's in diffusers but comfy has a vendetta for making diffusers as abrasive to use as possible to use his slower implementations
>>
>>107023928
I have trouble telling wan to do anything with the camera at all other than zoom or close up
>>
>>107023929
At this point you deserve to never get what you want.
You fucking retard can't even bother to learn the very basics of prompting with wan.
I know what it is but I hope nobody else spoon feeds your jeet ass.
>>
File: image_00026_.jpg (639 KB, 1240x1672)
>>107023820
I use 'clothing that exposes part x of her body'.
>>
>>107023929
spambot. report
>>
>>107023924
most were not rooting for SD3 or pixart kek
>>107023929
some bot is reposting stuff from previous threads just ignore all the ones that make zero sense
>>
>>107023948
I can write down stuff later for sure. Hll anon used LION to create a huge multiple concept lora and only trained Text Encoder too, so I think there's lots of undocumented stuff that works really well.
>>
>>107023950
luck of the draw. Try a different prompt, add stuff that gets out of frame to the description for it to still show. I don't think it's wan's fault, it's the light lora's fault in my case at least
>>
>>107023952
fixed camera
>>
>>107023924
There is a ton of hope behind Qwen, Edit models, VAEless models, that lumina 2 tune, etc
>>
>>107023988
ask devs to contribute. hell, ask the nunchaku devs to make a sdcpp implementation. nobody does shit unless they know it's what people want
>>
File: image_00028_.jpg (555 KB, 1240x1672)
>>
>>107024021
"girl with only 2 arms and 2 legs"
>>
So what model is everyone using?
still chroma 27? qwen?
or is there something new?
also are there any good loras for chroma that make it better or something?
>>
>>107024045
Know how to use plain English to describe what you want.
>>
File: image_00031_.jpg (684 KB, 1240x1672)
>>
>>107024085
My gens with this image aren't as creative or as safe for work
>>
File: ComfyUI_00032.png (1.55 MB, 896x1152)
>>107023836
did not realize we could post transparencies
>>
>>107024091
At this point why not just partner up and go all in on Chroma? They "sponsored" Chroma, but a full blown partnership would be better. Pony v7.1 is Chroma, then a tune of that is Pony v7.5
>>
>>107023503
Why is AniStudio not in OP?
>>
>>107024103
chroma and leto are good doe
>>
>>107024103
a tranny lolcow that freaks out at any mention of it
>>
>>107024108
The worst thing is that v7 could have saved SD3.5 the same way v6 saved SDXL, if he were to train on Medium. Small size, faster training, but all the benefits of 3.5M namely 16ch VAE, T5XXL and native 1.4MP out of the box. We could have had v7 as early as like the first quarter of 2025, and today would have been swimming in loras and merges.
>>
>>107023722
I think Pony V7 would have turned out probably fine DESU had it been captioned via the same process Lodestones used for Chroma instead of whatever was actually done. AuraFlow 0.2 was obviously unfinished and not really that great overall but it had quite good prompt adherence already and did text quite well.
>>
Rate these noodles.
>>
>>107024113
>masturbate to horses
>pour tens of thousands of dollars into horse porn generator
>the horse porn is subpar
>>
>>107024114
>krea video
no GGUFs
>light x2v lora
suffers from ghosting and lip flapping
>>
the bot is broken. fix your shit anon
>>
>>107024121
im actually downloading because at the end of the day, it doesnt hurt to try really
>>
>jannie warned me for my legit reports
fuck you
>>
>>107024130
no its synth slopped, but somehow in a far more retarded way than flux/qwen
>>
File: ComfyUI_00034.png (1.45 MB, 896x1152)
>>107024117
>>
>>107024138
dude on the right wishes he was home watching youtube
>>
>>107024130
What if the jannies are in on it to increase their report card at the end of the month for more mone- oh wait, lol.
>>
>>107024111
Medium was pretty weird to train though DESU, I eventually got good results from it myself but only from Doras, never regular Loras, and only with the CAME optimizer
>>
>>107024142
I haven't tried it yet but this seems to occur for very short prompts because the model was trained with long and detailed ones
>>
>>107024144
I'm completely fine with that
>>
Hello? Anyone real here left? Dead internet theory is the depressing reality now?
>>
File: ComfyUI_03772_.png (1.04 MB, 832x1216)
Friendly reminder that there was a time /pcbg/ was borderline unusable for MONTHS due to ban evading spambot that raped the general and jannies couldn't do shit about it.
These useless faggots will sit on their asses policing fun and deleting whatever slightly inconveniences them but when you actually need them for once they won't lift a fucking finger.
>>
>>107024166
blender isn't and "of" is the trigger
>>
>>107024169
could've taken even lower quality picture
>>
File: 1734628898286324.mp4 (951 KB, 640x640)
>>
>>107024181
qwen image edit, 8 steps, 1 megapixel images, rtx 3090
first gen: 156 secs
second gen, same image and prompt: 49 secs
change image: 91 secs
change prompt: 62 secs
disable 8 step lora, 20 steps: 95 secs
>>
>>107024169
Would let her murder me
>>
>>107024195
It's noticeably shit when comparing to not using it
>>
>he's trying to shit up the blessed thread again
Because that worked so well in the past
>>
>>107024130
>warned me too

Jesus fucking christ.
>>
>>107024205
bruh literally just type wan2.2_i2v_A14b_high_noise_lora_rank64_lightx2v_4step_1022 in google
>>
>>107024169
they did the whole email confirmation thing, then I guess it was rolled back after the hack?
>>
File: ComfyUI_00036.png (1.19 MB, 896x1152)
>>107024212
>>
>>107024212
Prompt nodes don't have image inputs links, correct?
>>
>>107024216
based turk working hard
>>
>>107024217
youre talking to a motion designer. literally no one waits 10 mins for imagen. youre joking
>>
why has progress on models slowed down so much? pony6 is still king of non anime nsfw
>>
>>107024229
I dont think anyone waits 10mins for an image
>>
>>107024197
Lol thanks for the insight Mr Spambot
>>
>>107024236
maybe he meant to say hes poor. but qwen's problem is not its size (can still fit in 16gb with some offload at Q8 or completely at 24gb). The results are almost always GOOD meaning you dont need to re-roll your gens as much, but even fully fitting in a GPU, genning is slower (due to genning at a high 1.3MP size) and it's slopped and has bad styles knowledge/no artists
>>
>>107024229
Pony V6 is nowhere close to as good as Chroma for that. Really, 3Dish kinda stuff that isn't COMPLETELY realistic but sort of is, is where Chroma is strongest
>>
>>107024259
so does sd1.5, but because it's so small it can't compete now. is neta yume lumina's quality still good compared to the big ones?
>>
>>107024216
I could say how it was theorized that he kept ban evading, but I don't want to give the current troll here any ideas.
>>
File: image_00037_.jpg (347 KB, 1264x912)
>>
>>107024269
It does you just have to write it in the most verbose way that makes it think you're doing something artsy
>>
>>107024271
small models are garbage sadly.
You could try nemo instruct, or a recent gemma abliterated.
If you're asking for prompting techniques, then you'll have to play around with samplers: the more randomness you want, the higher the temperature. There are some samplers that help keep the bot coherent at high temp (but I forgot the names, I usually use llms for work and low temp), I'd suggest you ask chatgpt or lmg for this.
For prompting itself, it usually works better if you give the chatbot a list to choose from (but at that point it would be the same as using wildcard substitution), and the prompting technique GREATLY varies between models, so there's not a general way to do it
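
For reference, wildcard substitution is basically just this. A minimal sketch, assuming a `__name__` style token; the option lists and the function name are placeholders, and real wildcard nodes do more than this.

```python
import random
import re


def expand_wildcards(prompt, options, seed=None):
    """Replace each __name__ token with a random entry from options[name]."""
    rng = random.Random(seed)

    def pick(match):
        key = match.group(1)
        # Unknown wildcards are left untouched instead of raising.
        if key not in options:
            return match.group(0)
        return rng.choice(options[key])

    return re.sub(r"__([A-Za-z0-9_-]+?)__", pick, prompt)


# Placeholder lists, purely for illustration.
options = {"hair_color": ["red hair", "black hair"], "place": ["forest", "city street"]}
print(expand_wildcards("1girl, __hair_color__, standing in a __place__", options, seed=0))
```

Passing a fixed seed makes the picks repeatable, which helps when comparing against a non-wildcard gen.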
>>
File: image_00038_.jpg (361 KB, 1264x912)
>>
what's with the spam
>>
>>107024269
you have to be a complete fucking newfag tourist to not know, especially on /g/ where retards shill the website all the time.
>>
>>107024282
yeah or just use impact wildcards, this is my current setup
you can see how the normal prompt comes out and the augmented prompt.
SADLY tipo creates a trash augmented prompt. I just randomize artists really
>>
>>107024283
i'm not sure what you mean by "not repeating prompts" but you can probably set up ollama and find some way to call it
>>
>>107024286
*heavy trap bass beat starts playing*
>>
>>107024283
Jeet is acting pissy and released his spambots, anyone reporting them gets a warning.
>>
>>107024286
it's better, but I'm kinda bored of wan now, I want sound :(
>>
now it's pulling posts from /lmg/
>>
>>107024293
how would i do that? what node or whatever would make that happen?
>>
>>107024301
why don't u reduce the resolution then slowly increase until u oom?
>>
>>107024301
It's just sdg trannies, not a bot
>>
Update to >>107023272 no one cares about:
The issue was the rank 256 lightning lora. I don't have issues with the rank 64 Lora but just switching to the 256 version fucks things. I'm assuming this is because I am cutting it really really close to my memory limit and the difference between 64 and 256 pushes it over the edge

I still have not seen any actual evidence that 256 is better than 64 in any way beyond the theoretical considerations, so it doesn't matter too much for me and my 1cutes

>>107024216
Email confirmation was rolled back because it destroyed the site for real users. It costs 1/25th of a cent for a real Gmail. Remember that SIM farm they captured with 40 million accounts a few weeks ago?
>>
>>107024316
careful with those words bro, I got some 2girls
>>
>>107024318
Depends, not sure how much good it will do with just say 4 steps, but if you use more steps it can be beneficial (the more the better). It can prevent "wasting" steps on the high noise model (overall composition, layout, and fundamental semantics) and leave more steps for the low noise model (details, textures, and video coherence) for the whole denoising process. Just gen videos and compare them side by side with something like GridPlayer and see the difference for yourself.
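
A minimal sketch of what dividing a step budget between the two models looks like, assuming a single switch point. The function name and the fractions are made up; where the switch should actually sit is exactly the thing to test by comparing gens.

```python
def split_steps(total_steps, high_fraction=0.5):
    """Return ((high_start, high_end), (low_start, low_end)) step ranges.

    Assumption: one switch point, high noise model first, low noise model after.
    """
    if total_steps < 1:
        raise ValueError("total_steps must be at least 1")
    if not 0.0 <= high_fraction <= 1.0:
        raise ValueError("high_fraction must be between 0 and 1")
    switch = round(total_steps * high_fraction)
    return (0, switch), (switch, total_steps)


# Fewer steps on the high noise model leaves more for the low noise model.
print(split_steps(8, high_fraction=0.25))
```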
>>
File: image_00039_.jpg (573 KB, 1240x1672)
>>
>>107024329
ok this looks real.. aside from their angular faces i don't think i'd suspect this being generated
>>
>>107024316
I don't think they're THAT mentally ill to manually do this for hours
>>
>>107024341
fucking CAMM man
consumer boards will be lucky to get 1 cuck slot at max
I hate this gay brown world
>>
File: 00031-1039261054.png (2.78 MB, 1824x1248)
wtf is going on in this thread? are 80% of posts here are now just bots? the replies are stupid and look inorganic.
>>
>>107024318
>It costs 1/25th of a cent for a real Gmail.
Do you know where one gets their hands on that? Not for botting 4chan, I prefer hand crafted trolling and shitposting, but I can use it for other purposes.
>>
>>107024344
on anything other than SDXL and SD 1.5 you definitely want to use natural language captions that actually just say "performing fellatio" and other normal words, those kind of triggers won't do shit
>>
>>107024344
Feel free to fuck off.
>>
>>107024347
the base model has learned some of the "normal" words at some point, so it's better to go for something it has never seen to start from scratch and not start with something the model has learned wrong
>>
>>107024344
spambot is reposting old posts. jannies are warning people who report
>>
>>107024344
It's getting spam botted yes.
>>
>>107024354
Me on the left
>>
>>107024269
>I could say how it was theorized that he kept ban evading, but I don't want to give the current troll here any ideas.
No problem I'll just talk about how I did it before dot g@y stole all of my sources for res!dental proxies and I was forced to use the zigger zoopedo site (not complaining that much, I am a happy customer of theirs and have gotten my money's worth multiple times over by this point)

Getting fake emails was always easy with the dot trick. You just searched up something like "mail t!cker or emailt!ck" on Google and use their services. Eventually mods cracked down on the popular ones but setting up one yourself is relatively easy for a /g/ user, especially if you're not committing serious thoughtcrime you could probably just buy a single sim make 5 Google accounts off of that and just use that for a whole week
>>
>>107024359
It absolutely is omegaslopped, plastic skin and image, low details, fuzz where small details are like the background tall grass and net, looks like one of the earliest overcooked realism loras from years ago
>>
>>107024361
you forgot
>still isn't comfy at all
>>
>>107024367
this. the tattoo epidemic is really annoying
>>
>>107024259
im not looking for 2.5d exactly, i want detailed western 2d art like furry artists create, especially something that can create ethnicities.
>>
>>107024375
you can see the benefits of flowmatching there though, normal SDXL models don't have that kind of dynamic range, not even v-pred ones
>>
>>107024367
Dont you have to send him your dick pics?
>>
>>107024388
The beauty of AI image gen is you can gen whatever you want. The girls don't have to have tattoos like they do in real life. You can just gen girls who don't have them.
>>
File: dmmg_0039.png (1.58 MB, 896x1152)
very surreal to post today and get quoted something i posted to someone else yesterday
>>
>>107024367
>"mail t!cker or emailt!ck"
Thanks
>just buy a single sim make 5 Google accounts off of that
Sims are needlessly expensive in my shithole. I am not that committed to anything.
Also, just out of curiosity, what's your grievance for spamming this thread now?
>>
>>107024403
What's your upscaling process, does it work well with a low denoise pass i2i like SDXL? Original NetaLumina didn't, second pass pics always were ruined for me.
>>
>>107024407
Thanks for your help I am a Lora hoarder and checkpoint hoarder this really helped!
And the Gallery button, do you have it or how do you manage your images to select them and import them or copy their metadata quickly. I miss the Gallery button so much.
>>
>>107024349
Wow I'm honored, the bot is copying me here lol
>>
>>107024329
>u mad, landboi?
>>
>>107024415
>Also, that custom node for Lora and Checkpoint thumbnails do you remember it's name?
Theres lora manager, idk the names of plugins that give you nodes that allow you to see the gallery of loras inside a node as i dont use them
>>
>>107024415
mouthbreathing retard identified
>>
>>107024417
Literally just a particular seed because of some token. Almost none of my gens have that, and the other times were just me intentionally prompting for some kind of effect.
>>
why didnt pony guy use illustrious? noobai/wai is the standard for anime.
>>
>>107024432
seedreamsissies... our response?
>>
>>107024432
wasn't out at the time and the retard is known for doubling down retardation
>>
File: 1754367026128139.mp4 (1.02 MB, 640x640)
>>
>>107024439
to be fair, these gens were considered good back in 2023 when local was still competitive. but now saas is so far ahead that these gens look like shit in comparison
>>
>>107024442
yeah, changing that number will just re-run the entire workflow (apart from model loading which is cached) X number of times
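
If you'd rather script the overnight queue than click, here is a rough sketch of the same idea over the HTTP API. Assumptions: the workflow was exported in API format, the server is on the default local address, and the sampler node id plus the `seed` input name match your export (both are placeholders here). As far as I know the endpoint and payload shape follow the API example script that ships with ComfyUI, but check your version.

```python
import copy
import json
import urllib.request


def build_jobs(workflow, sampler_node_id, seeds):
    """Make one payload per seed without modifying the original workflow dict."""
    jobs = []
    for seed in seeds:
        job = copy.deepcopy(workflow)
        # Assumption: the sampler node exposes a "seed" input.
        job[sampler_node_id]["inputs"]["seed"] = seed
        jobs.append({"prompt": job})
    return jobs


def queue_jobs(jobs, url="http://127.0.0.1:8188/prompt"):
    """POST each payload to a running server (assumed default address)."""
    for payload in jobs:
        data = json.dumps(payload).encode("utf-8")
        request = urllib.request.Request(
            url, data=data, headers={"Content-Type": "application/json"}
        )
        urllib.request.urlopen(request).close()
```

Each queued job is a separate run at batch size 1, so VRAM use stays the same as a single gen.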
>>
>>107024347
>Do you know where one gets their hands on that? Not for botting 4chan, I prefer hand crafted trolling and shitposting, but I can use it for other purposes
You can message the admin that run the evasion platform. They seem to be willing to talk about how their evasion stack works. I'm assuming getting emails and phone numbers is just about knowing the right forums or telegram channels to go to. Dark net markets are always an option but they're probably scams and/or overpriced idk man I'm not as cool of a hacker as I should be honestly

>>107024372
I hate tattoos so much mr spambot I agree with you so much

>>107024388
>Dont you have to send him your dick pics?
No that's a different site than the zigger zoopedo one. I don't know how the organization works but the dicker is associated with the ban site and codes the captcha solvers and stuff but there are different sites using the same backend im assuming , with different priority queues for dick pic users, gold users, and normal users

>>107024407
>Also, just out of curiosity, what's your grievance for spamming this thread now?
I am not the one spamming, I just use the evasion site to share beauty with the world
>>
>>107024450
has that distinct Chroma Chromatic Aberration, pun not intended lol
>>
>>107024392
Bold of you to assume the girls I gen have tattoos IRL.
>>
>>107024432
He was never going to do another SDXL model. Also Pony isn't even really supposed to be a proper "anime model" the way Illustrious and NetaYume are
>>
>>107024471
>this battle again...
>>
File: image_00044_.jpg (570 KB, 1240x1672)
>>107024417
>>
>>107024476
Sigh, I think that’s the issue. I’ve only got 6GB of VRAM, kek every image in a simple workflow takes about 120 seconds, but when I set the batch size to 15, it takes around 4 hours instead of 30 minutes.

Does changing this number just make it run multiple times instead? I just want to queue up a bunch of generations to run overnight.
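
For reference, here is the arithmetic behind that 30-minute expectation, plus a rough sketch of queueing runs one at a time instead of as one big batch. Everything named here is an assumption for illustration: the tiny workflow dict, the node id "3", and the helper names are hypothetical. The payload shape ({"prompt": workflow}) is what ComfyUI's /prompt endpoint is generally understood to accept for API-format workflows. The sketch only builds the payloads and does not send anything.

```python
import copy


def estimate_minutes(num_images, seconds_per_image):
    """Expected wall time if every image costs the same as a single gen."""
    return num_images * seconds_per_image / 60


def build_queue_payloads(workflow, sampler_node_id, runs, base_seed=0):
    """Build one queue payload per run, each with batch size left alone
    and only the seed changed, so peak VRAM matches a single gen."""
    payloads = []
    for i in range(runs):
        wf = copy.deepcopy(workflow)  # never mutate the caller's workflow
        wf[sampler_node_id]["inputs"]["seed"] = base_seed + i
        payloads.append({"prompt": wf})
    return payloads


# Hypothetical, heavily trimmed API-format workflow: a single sampler node.
example_workflow = {
    "3": {"class_type": "KSampler", "inputs": {"seed": 0, "steps": 20}},
}

if __name__ == "__main__":
    print(estimate_minutes(15, 120), "minutes for 15 sequential gens")
    for payload in build_queue_payloads(example_workflow, "3", 3, base_seed=100):
        print(payload["prompt"]["3"]["inputs"]["seed"])
```

Each payload would then be submitted to the local ComfyUI server in turn. Sequential runs should scale roughly linearly with the number of images, whereas a batch of 15 has to fit all 15 latents in memory at once, which on 6GB is a plausible reason for the 4 hour result.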
>>
>>107024432
>>107024439
He could have aborted training and switched to something actually promising like Illust, dedistilled flux or hell even SD 3.5 and gotten far better results without sinking a fortune on a hopeless garbage model but yes he is a complete fucking imbecile who just lucked out with v6.
>>
>>107024482
fresh OS install, fresh comfy install, try again. problem solved?
>>
>>107024486
He didn't indicate that he was the one doing the finetuning, but maybe you're right, we'll see
>>
*yawn*
>>
>>107024493
Pretty sure he’s just saying that he finetuned both tencent and qwen models and will be sharing the results. I don’t think he’s talking about a new base model release
>>
>>107024486
No SDXL-based finetune will ever be interesting ever again IMO, the prompt adherence is just too shit by today's standards
>>
>>107024513
these look like someone ran them through a "sharpen" filter about 20x
>>
>>107024517
kek
>>
>>107024513
For me, it's the 4ch vae that really dates it.
>>
>>107024450
Thanks for the tips man.
I know about the dot G but no clue what "zigger zoopedo website" refers to.
I guess I can lurk the party until I come across what you are referring to.
>>
>>107024513
Rouwei guy is trying to rig an LLM for SDXL prompting.
Maybe that can become interesting when/if it matures.
I agree that there is not too much headroom left in SDXL if someone can't find a way to ditch the CLIP garbage.
>>
Do you think comfy did stop to post here because of the whole trani drama situation and his vendetta against comfy?
>>
File: image_00047_.jpg (768 KB, 1240x1672)
>>
>>
>>107024486
Main problem was assuming text should be focused on over aesthetics. Text is important, but it should be an afterthought as long as no model exists that is good at both text and aesthetics. The most aesthetic anime base model that knew the most styles by far was HunyuanDiT. Pony HunyuanDiT would've been glorious.
>>
>>
>>
>>107024599
As for Flux, ease of training would've excused the Pony devs. But they couldn't in their right mind say HunyuanDiT was shit. Heck, FreewayHunyuan, the only NSFW and the only non large scale finetune Hunyuan ever saw is still better than Pony v7. That should speak volumes.
>>
File: image_00050_.jpg (745 KB, 1240x1672)
>>
Is there a "state of the art" background removal tool for an animation with a static background? From what I can tell segment anything is considered the best and I tried it and while it does a pretty good job I still have to manually clean up every single frame which is gonna take forever. Rembg and other models of its kind do a pretty terrible job.
>>
>>107024552
I don't think even the best case would be better than NetaYume ultimately TBQH
>>
File: dmmg_0073.png (2.94 MB, 1254x1613)
>>107024580
careful bub, you're getting real close to hentai
>>
>>107024628
is this flux?
>>
>>
what if i want to make an ugly person?
>>
>>107024617
>https://huggingface.co/Laxhar/Freeway_Animation_HunYuan_Demo

>a domestic dual-element model with superior performance and greater potential compared to previous SDXL models. Keywords: Improved human body expression, flexible Chinese and English keyword input methods, vivid composition, diverse styles, and lower training costs. Main motivations: More flexible keyword input and stable body structure for vivid compositions to facilitate novice user usage: providing multiple keyword input rules to output usable images based on user habits; more stable limb performance for better stability in large actions and multi-person interactive scenes. Maintaining high aesthetic standards for various anime art styles while keeping output appealing to general users. Contains extensive knowledge, eliminating the need for using characters/styles/artists' lore

>Specifically, in Freeway Animation HunYuan, we recommend using the following four sequence combinations:

>Compared to SDXL models, the HunYuan model excels in maintaining human body rationality in complex compositions, with a significant improvement in its ability to understand and respond to prompts.

>One noteworthy point is that the HunYuan model has development potential in multi-person interactions not inferior to Novel AI v3, with much lower development complexity than SDXL, which is why many enthusiasts choose the HunYuan model in the first place (laughs).

>https://nx9nemngdhk.feishu.cn/docx/XNMDdCOkvoWlvVxVqfVceLXEnoh

Based Chinks always pour their resources into something that matters, recognizing its strengths and weaknesses and being way ahead of Westerners. It's an IQ thing I guess.
>>
File: AnimateDiff_00001.mp4 (1.78 MB, 720x576)
Is it finally over?
Can I post again?
>>
>>107024694
knock up your sister
>>
File: image_00052_.jpg (693 KB, 1240x1672)
>>107024639
just playful pinup poses

>>107024644
chroma
>>
>>107024707
fill me up with turbo slop changdaddy
>>
>>107024712
Comfy can be a little finicky sometimes. Could also be a RAM issue if it only occurs after a while of usage.
>>
File: image_00053_.jpg (517 KB, 1240x1672)
>>
>>107024743
If you are the anon who genned that pic then I'll say I'm pretty sure comfy tag weighting works like (tag:1.2) and not just (tag), the latter being how AUTO and forks do it. But the source is that the text encoder is not t5 lol. Test it again and you'll see it does not work.
>>
>>107024706
looks like it is not over
>>
Is the spambot done yet?
>>
>>107024763
I mean there's not a specific different "rough" version of the paper texture tag AFAIK, that's why I just weighted the first three tags more, wasn't sure exactly what you meant by that
>>
>>107024766
Thanks for coming back. It follows water colour but the texture is an issue.
It's not bad.
I remember testing various water colour gens and one of the best SDXL models was Cinematic Redmond.
Some models are not as braindead ironically.
>>
>>107024712
This is pretty good kek
>>
File: image_00056_.jpg (785 KB, 1240x1672)
>>
>>107024827
This one here is positive:
`You are an assistant designed to generate anime images based on textual prompts. <Prompt Start> (traditional media, watercolor \(medium\), paper texture), 1girl wearing a chef hat and an apron and oven mitts is taking a cake on a tray out of the oven in her kitchen.`

And negative:
`You are an assistant designed to generate low-quality images based on textual prompts. <Prompt Start> worst quality, very displeasing, lowres, realistic, photorealistic, photo \(medium\), 3d, cgi, ai-generated, ai-assisted `

If that helps. DPM++ 2S Ancestral Linear Quadratic @ CFG 5.5
>>
>>107024829
not good. i have literally same waiting with less vram kek
>>
File: 1733854085199138.png (1.41 MB, 1360x768)
add a billiards table to the right of the image. (qwen 2509 or v2)

just from a screenshot of the new katamari game:
>>
>>107024853
You're telling me 200 seconds are normal for a 5 seconds video on a small 1.3B model? Grim, SDXL spoiled me
>>
File: image_00057_.jpg (632 KB, 1240x1672)
>>
>>107024829
gm
>>
Aren't netayume and lumina like 2b models?
Why is the download 10 gigs?
>>
>>107024860
it takes me at least 3 minutes per qwen edit, count your blessings lol
>>
>Spamming nonsensical posts from previous threads

What's the motivation behind this?
>>
>>107024862
computer, press the cupcake against a high resolution photorealistic bare caucasian asshole. turn off all the safeties.
>>
>>107024863
This nigger never heard about jack o'nine tails
>>
>>107024868
Apart from the shoes looks good enough. GJ
>>
>>107024868
Idk man, who might be upset that this general exists?
>>
>>107024868
Derail the thread.
Something someone here said to xim made xir clitty leak, and this the "revenge".
>>
>>107024894
Could you catbox that for us, anon?
>>
>>107024898
About now would be the ideal time to release this as a meme slop on Steam and make 20 million
>>
looks like a normal /ldg/ thread to me
>>
>>107024905
I want a model that can do entire manga pages for me, so I can make a prompt and it will come out with a logical sequence of (lewd) events, or at least create many similar images, but having to play with creativity and prompts to achieve this is way too time consuming
>>
File: image_00058_.jpg (673 KB, 1240x1240)
>>
who made Julien pop xir axewound this time?
>>
>>107024914
Everyone has skill issues with Wan, because it takes forever to gen.
>>
File: 1746795456830991.png (1.71 MB, 1360x768)
>>107024853
change the location to akihabara, tokyo. the interior of the room looks like a japanese manga store.
>>
>>107024915
I've always wanted to see the version of T2 with sly
I know there's a deepfake but it's just the face, stallone's face on arnie's head is nothing, it's neither of them
>>
>>107024921
that's kinda what i put.. she bends her long neck over to the right to munch on grass like a giraffe, but then she doesn't actually munch
>>
>>107024914
Timon!
>>
>>107024937
there is more to ai than 1girl, anon
>>
>>107024915
lmao it is him
another melty
>>
>>107024942
AniStudio has this built in. Just saying.
>>
File: 00014-612237380.jpg (239 KB, 1824x1248)
what's with the stupid fucking bots? isn't the captcha supposed to prevent shit like this happening?
>>
Once again proving how important this blessed thread is since some feel the need to troll here to such a degree.
>>
>>107024951
Wdym, what exactly are you after?
You can make any woman undress and apart from the vagina wan is pretty good with bodies. You may need loras for more absurd stuff like huge tits but everything else can be prompted.
>>
don't care no-ones using that
>>
>>107024955
just shota gens from what i remember
>>
>>107024958
Why would you if it can do it natively?
The only lora people are hoarding for themselves are good pussy ones, all of the civitai ones suck ass.
>>
>>107024915
Think he actually went to a lawyer to sue "schizo anon" but that guy also just made fun of him like everyone else
>>
>>107024967
You come here to grovel for help? Pathetic. At least attach your avatar.
>>
>>107024951
Bots can do captchas easily.
More accurately than humans even.
>>
>>107024967
No way
Any proof links?
>>
>>107024978
I have, it's mine, not yours
>>
>it instantly replied to the trani mention
yep
thank god for making literal retards so easy to bait
>>
>>107024982
Haven't seen anything like this and I use external firewall.
I could be wrong of course but maybe provide some more substance to your claims than just being hyperbolic by default.
To be honest I don't care because it's obvious where this software is headed towards.
>>
>>107024987
>bothering and shilling in our cute sister general
grim
>>
>https://civitai.com/models/1836040?modelVersionId=2352622
GGUF when

>>107024951
Jannies are supposed to prevent it, but I have a feeling that our resident janny ignores the reports
>>
Why is there always some intense drama bubbling under the surface of ldg that random anons know all about but is somehow completely invisible to me when I look at the thread? It's all just meta-commentary on some happening that I can't actually see happening, 24/7, every day
>>
>>107025003
i think this may be the first wan gen i've ever done that wasn't in full slo-mo
>>
>>107025014
that's why you block outbound connections by default for every application you run.. opensnitch, simplewall, whatever it takes
>>
>>107024951
Possibly just a bunch of autistic/schizophrenic manual spamposting. Notice targeted responses to those talking about the spam.
>>
>>107025026
I'm inclined to agree with you
>>
File: 1732796043558155.png (1.26 MB, 1360x768)
>>
>>107025029
Sounds like the lora might just be dogshit. Probably trained by a tagnigger who captioned it with fucking WD-TAGGER-V3 or some shit
>>
File: image_00061_.jpg (558 KB, 1240x1672)
cool otters
>>
>>107025026
The real question is: what does the spammer not want us to see, which he is trying to hide in a sea of spam?
>>
>>107025014
Small community of people who've been keeping tabs on things for years. Very small community.
>>
>>107025038
i did not know about that but now i do
>>
>julien buttmad again
Lolcow
>>
>>107025039
If I don't put small Wan just makes her being ejaculated on by a fucking whale
>>
>>107025041
>404
Thanks for outing yourself as the most newniggerish of the nigger retards
>>
>>107025045
Excuse me saar, these highlights are missing kino posts. Oh Vishnu curse this bangladeshi bitch basterd
>>
>>107025041
I mean I've always been vaguely aware of the characters involved but every day there's some new "Uh oh! Julien is having a melty!" and I can never find what post they're referring to
>>
>julien
>>
>>107025039
Underaged and thinks it's funny.
>>
>>107025021
>>107025026
>>107025031
>>107025043
>>107025047
you are a worthless trannoid and no one like you lol
imagine being so repulsive and subhuman that not even anonimity can mask it
>>
>>107025056
Anon, stop being delusional, we don't even have open-source T2I models with dalle3's levels of pop culture knowledge, so for video it's a given that it's "never ever" in that regard.
Chinks don't care about having a video model with trillions of parameters that "knows everything", they just want a model that performs well enough on benchmarks while being small enough to run on their gpu-embargoed datacenters
>>
retarded schizo spammers are so tiresome.
>>
>>107025039
He just hates ldg, I think that's that
>>
>>107025063
whiter than you post hand
>>
>>107025064
They don't have leeway to do anything. US/ClosedAI could do it, but not them, plus copyright holders would attempt to charge them double the tax.
>>
>>107025065
and that's a good thing, i'd hate to be a minority like a nigger
>>
>>107025067
apparently it's either the light or the fusion loras that add the movement, without them camera stays static, but quality becomes ASS
>>
someone should make some gen some troonliens
>>
>>107025068
there is nothing special with yume
>>
>>107025092
he did in previous
>>
>>107024863
I can't tell if this one is a bot but if not, the 10GB all-in-one has everything in it, like including the Gemma text encoder and the VAE, you don't need anything else
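
A back-of-the-envelope check of why a "2B-class" model can still be a roughly 10GB download. The parameter counts below are rough assumptions for illustration and are not taken from this thread: about 2.6B for the diffusion model, about 2.6B for a Gemma 2B-class text encoder, and under 0.1B for the VAE, all stored at 2 bytes per parameter (bf16/fp16). The helper name is hypothetical.

```python
def checkpoint_size_gb(params_billion, bytes_per_param=2):
    """Approximate on-disk size in GB for a given parameter count."""
    return params_billion * 1e9 * bytes_per_param / 1e9


# Assumed, approximate parameter counts in billions (not from the thread).
components = {
    "diffusion model": 2.6,
    "text encoder (Gemma 2B class)": 2.6,
    "VAE": 0.08,
}

sizes = {name: checkpoint_size_gb(p) for name, p in components.items()}
total_gb = sum(sizes.values())

if __name__ == "__main__":
    for name, gb in sizes.items():
        print(f"{name}: ~{gb:.2f} GB")
    print(f"total: ~{total_gb:.2f} GB")
```

Under those assumptions the text encoder alone accounts for about half of the file, which would be consistent with the all-in-one being far bigger than the image model by itself.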
>>
>>107025109
it's comfyui manager and rgthree that give these warnings
>>
what is the prompt to consistently remove all people in the scene while keeping the viewpoint unchanged in wan i2v?
>>
>>107025115
not really. the VRAM helps fit models but the speed isn't much better than a 4090
>>
How are all the [SONGBLOOMERS] doing?
>>
>>107025118
if you are thinking of a 5090 in terms of "value" then no. Like all high-end hardware (speakers/cameras/headphones/whatever) it isn't about value, it is about how much you enjoy owning high-end shit and seeing the marginal advantages.
>>
>>107025112
Yeah ok I guess bot only makes responses, not standalone comments, this one happened almost immediately
>>
>>107025130
(samefag) woops I should have mentioned also, around CFG 4.5 to 5.5 is best.
>>
>>107024921
>it's chinese 7-11
>>
why is chroma full of shitty gay loras, it's sad
>>
>>107025130
Not a bot, discord raid
>>
>>107025140
Looking forward to it
>>
>>107025145
thanks for the idea kind anon, ill make some kino zapping 1girls!
>>
>>107025140
https://www.youtube.com/watch?v=E5GoihDxS44
>>
>>107025157
civitai pony v7 comment section replies
>>
>>107025136
Lol bot me replying to real me here
>>
>>107025188
better than cumfart at least
>>
>>107025185
>>107025185
>>
im having a very complex problem. with chroma, how do i make the girl deepthroat the fucking penis. she always only sucks the tip. i want it all the way in her mouth. i've tried everything
>>
>>107025195
Did not work, trying another gen without any loras to see if there's any weird interaction fucking up.
Or maybe I suck at prompting
>>
File: file.png (35 KB, 483x177)
I'm trying out wan2.2 for the first time. Is it normal for comfyui to be stuck here for a long time?
>>
>>107025196
Hopefully it's not another dead project that'll never release their model. Speaking of released models, wonder if Kijai or anyone knows that Rolling Forcing is already out https://huggingface.co/TencentARC/RollingForcing/tree/main/checkpoints
>>
>>107025204
i've never seen "attempting to release mmap". is your vram maxed out?
>>
>>107025003
What does this add to the table over V1?
I use the nunchaku version of V1. Better NSFW than any other Flux but it is closer to Krea than Chroma.
I hope it gets better.
>>
>>107025038
This one looks cool.
>>107025056
It's honestly not brag worthy to waste enough time here that you understand the drama.
Just be happy that you are not a no lifer.
>>
File: ComfyUI_00736_.mp4 (1.19 MB, 768x1352)
>>107024582
>>
File: EeLWEZAXkAAQldn.png (185 KB, 417x350)
>>107025418
>>
File: 1730613651826236.mp4 (1.59 MB, 960x720)
>>107025115
>the four women on the left move quickly to the left out of view. the text on the bottom dissapears. static camera
>>
>>107025288
yeah, you're probably right. I've changed the UnetLoader to UnetloaderGGUFDisTorchMultiGPU and set virtual_vram_gb to 4.0, but I still got an OOM error. I use the Q6 models. 12GB VRAM should be enough with virtual_vram_gb set to 4, no?
>>
>>107025508
I dunno but you should have 1gb vram free or else it will essentially freeze. try increasing the virtual vram setting. there's a distorch2 version of that node that is supposed to be better I think
>>
>tfw no 32gb vram gpu
how do you cope bros?
>>
>>107025623
i'm working extra hours to buy one and feed the capitalist machine and jensen's yachts
>>
>>107025708
steal one instead
>>
>>107025473
so fucking good
I love these fake leather skirts and pants
>>
File: 1680154040.png (3.12 MB, 1248x1824)
>>
Just here to check for SongBloom gens.


