[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: collage.jpg (1.54 MB, 3531x1896)
1.54 MB
1.54 MB JPG
Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107030058

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Neta Yume (Lumina 2)
https://civitai.com/models/1790792?modelVersionId=2298660
https://gumgum10.github.io/gumgum.github.io/
https://neta-lumina-style.tz03.xyz/
https://huggingface.co/neta-art/Neta-Lumina

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
>two of my failgens made it into the collage

In comfy, when you need to mask and inpaint, you have to manually draw a mask in a different software and plug it back in?
The GUI in forge is too potent for for the inpainting for me to make the switch.
>>
comfy should be dragged out on the street and shot
>>
File: 00025-3063647883.png (2.21 MB, 1824x1248)
2.21 MB
2.21 MB PNG
after testing img2vid on ltx2 and grok imagine and hearing audio with video. I can't go back to using wan anymore. Ovi is just a trash and pure copium.
>>
>only 23 images in the last thread while it had 320 replies
grim
>>
>>107032625
real schizo hours
>>
Total SaaS Victory
>>
File: 00099-3665430901.png (2.06 MB, 1824x1248)
2.06 MB
2.06 MB PNG
>>
>>107032568
right click an Image Load node
"Open in MaskEditor"
>>
we will fucking dismantle cunts faces in the next 6 month, do you have a problem with this? people think they are fucking funny, these people will soon find out
>>
blessed pit of spamshit
>>
>>107032716
yes but it also doubles gen time because it has to generate a negative video. you might as well increase cfg if you're gonna use it because 2.0 or 3.0 vs 1.1 is not gonna make a difference in gen time but setting it to 1.0 will
>CFG on low noise in Wan2.2
using cfg on high noise would make more sense because high noise is responsible for establishing the base motion of the video. low noise just fills in the details
>>
fire fox completely

you are enemy combaants
>>
existential angst in a hostile universe.
>>
>Why doing open and verifiable research at all? Closed SaaS labs mogs anyway bro
Retard
>>
>>107032706
Excellent, thanks.
>>
cute freckles
>>
>>107032538
This was really the best the previous thread had to offer?
>>
File: image_00136_.jpg (747 KB, 1264x1656)
747 KB
747 KB JPG
>>107032421
https://files.catbox.moe/vai8bu.png
>>
>window open and it's 12c outside..
what gpu are you genning with? 4x5090?
just opening the windows when it's cold should be enough
>>
>>107032805
Powerlimit to 75%. Open case.
>>
>>107032706
not that anon, but working with inpainting in comfy is something that is beyond me. In forge I could just do the thing and it would work. In comfy (it seems like) I have to use a different model specifically for inpainting, plug a bunch of nodes and shit. I don't get it. I just wanted it to rerun the area I inpainted with the same model I'm already working with...
>>
stop posting scat in a blue general
>>
>>107032623
now give her socks
>>
>>107032807
>post locally genned video
>janny warns me

i guess it was too realistic?
>>
I hate going through these papers. Sure, you can make even a small dick look huge on paper if you measure from the asshole.
>>
>>107032819
>cog
I forgot about that one, need to add it to my failbake list alongside hidream
>>
What the fuck happened here?
>>
>>107032830
babe wake up, bytedance released a finetune of wan
https://huggingface.co/ByteDance/Video-As-Prompt-CogVideoX-5B
>Video-As-Prompt
that's... interesting...
>>
>>107032830
20% boredom, 40% stagnation, 15% shitty software, 5% humiliation ritual and 500% schizophrenia
>>
>>107032830
The spambot started posting way more frequently than it is yesterday lol
>>
>>107032854
Discord raid, not a bot
>>
>>107032847
so true unfortunately
>>
>>107032854
no need. if you can't understand what version is the most used you don't need to know.
>>
File: image_00138_.jpg (673 KB, 1264x1656)
673 KB
673 KB JPG
>>
you have one last warning before i get really nasty
>>
>>107032847
Checkpoints have different uses for me.
When I make an image, I can use up to 4-5 different ones.
Upscaling and inpainting is where it's at.
>>
File: image_00140_.jpg (816 KB, 1264x1656)
816 KB
816 KB JPG
>>
>>107032918
You can rent indians locally on fiverr
>>
So buck broken him and his lot sieges the thread daily
>>
>>107032863
I don't think so, the responses in particular seem too immediate not to be automated
>>
File: image_00144_.jpg (820 KB, 1264x1656)
820 KB
820 KB JPG
>>
>>107032830
There is a small group of faggots that hate this general. They seethe because they are not accepted so they decided to dedicate every possible moment into griefing the thread.
>>
Yeah bro don't trust your eyes, trust the paper
>>
PLASMA LATENTS DUDE! it was proven that rescale cfg, cfg++ and other garbage are worse than normal cfg (there was a paper about this), but since it produces DIFFERENT results they think they hit it big, the AUTEURS that we have here LMAO, fucking copers
>>
File: image_00146_.jpg (866 KB, 1264x1656)
866 KB
866 KB JPG
>>
I'm probably just retarded but isn't chroma flash supposed to be faster? With the recommended settings I get the same gen times as with other chroma checkpoints
>>
File: 1759379124871160.png (20 KB, 612x242)
20 KB
20 KB PNG
>>
>>107033109
majority of local copium (ipadapter, regional prompter, rescale cfg) is complete snakeoil trash.
>>
have a fucking happy life /ldg/ bu my country calls and i have to serve. Do no worry in the slightest you will be ok.
>>
>>107033135
nta but i could never get the piece of shit to work properly according to the creators video's and following everything exactly. It seemed very limited compared with just copy paste image into gimp, make edit then copy paste back into comfy with out having to fuck around with some tool. maybe i just installed it when it was bugged or something.
>>
>>107033109
>With the recommended settings I get the same gen times as with other chroma checkpoints
its extremely fast compared to normal chroma, especially if you combine it with Chroma1-HD-fp8_scaled_original_hybrid_large_rev2 and combine it with one of the flash loras instead.

>chroma-flash-heun_r32-fp32-pruned.safetensors
>chroma-flash-heun_r256-fp32-pruned.safetensors

i'm generating resolutions that took me 2 minutes in 30 seconds, and they look better as well. also make sure you're using 1.0 cfg, that's the entire point
>>
>>107033143
You could maybe work something with the Krita extension since it uses layers.
>>
File: ChromaHD_Output_265222626.png (755 KB, 1496x1024)
755 KB
755 KB PNG
>>
>>107033162
is there some way to generate additional information on top of an image, similar to the way a detailer works? i'm not talking about masking with denoise to redraw parts of the image. i mean more like drawing on top of a layer without changing what's underneath, but still using the image underneath as context. A completely random example would be, say a face, where i want to put white liquid on top of it. Masking with a detailer will require me to put a high denoise, which will redraw the face underneath. This is just a completely random example, and this specific problem probably has a specific solution, like combining a face lora with a white liquid lora, but i'm looking for something broader that can cover a ton of different cases. Maybe stuff like flex does this? But i would prefer not to put an additional unet into my workflow.
>>
File: 1749197332141868.png (43 KB, 1063x285)
43 KB
43 KB PNG
>>107033147
this is extremely freaky. i got this exact response a couple of days ago, relating to another question regarding in painting on top of an image instead of replacing what's in the mask. how is this even possible?
>>
File: image_00150_.jpg (882 KB, 1264x1656)
882 KB
882 KB JPG
>>
>>107032830
https://s o y jakwiki.org/Project_F.A.E.
>>
>>107033181
it's a shame wan fucked up the lettering but nice otherwise
>>
>>107033182
aw hell yeah
>>
>>107032821
When there'a increased mod activity on these threads they are trigger happy about /g/'s offtopic rules for even anons that are simply posting within the thread.
>>
File: FluxKrea_Output_262526.png (1.63 MB, 1024x1496)
1.63 MB
1.63 MB PNG
>>107033166
>>
>>107033228
the only negative is very slightly slower gens, it only increases quality when you have a negative prompt with stuff like blurry, low res and stuff in it, not using NAG if using CFG 1 is just retarded
>>
>>107033181
The spambots and / or human spammers are just scraping old /ldg/ threads and reposting verbatim comments randomly
>>
>>107033246
last I read up on it he was working on making it work for flux nunchaku
>>
>>107033246
yeah i figured. but it's crazy that i got the same reply twice. i dont even post here that often.
>>
>>107033279
he's probably using the lightning loras so he's at cfg 1 (and therefore can't use negative prompts unless he activates NAG)
>>
>>107033279
Some anons think it's some rogue mod spamming this place. Who knows.
>>
>>107033294
this lora is fucking garbage tho
>>
File: image_00156_.jpg (730 KB, 1264x1656)
730 KB
730 KB JPG
Frank Miller style
>>
>>107033302
only 3 weeks? impressive, took that furry fag 6 months to finish chroma's finetune with 5 millions images
>>
>>107033246
Our favorite schizo does that quite often with his foes
>>
>>107033311
>being this new
do you really just look at the ugly spaghetti when gening
>>
>>107033294
Mods would have revoked their access hours ago if it was some rogue janny or mod. They just don't give a shit.
>>
>>107033322
Correct, the baker has a skill issue where the model somehow got worse over time. Perhaps try contacting him about it
>>
>>107033322
The that does this has a long history of VPN abuse. This isn't the first time this has happened. There was spam this bad for multiple days during the first pastebin split.
>>
>>107033356
Chroma can’t do realism either, only blurry meltyslop
>>
i just started training a character chroma lora with 220 images for science. wish me luck
>>
>>107033363
their whole thing is anime which it is the best at
>>
>>107033356
Is the spam 4chan wide? Or is it just /ldg/? We could check if it's manual by just omitting thread title and starting new thread
>>
>>107033373
>muh coom
the only cope of localkeks
>>
>>107033377
replacing a character in a video with another character you give images of
>>
This is a white flag from the small group of faggots that get rejected from this general, lolcows and literal ERPing tranny faggots that want to feel special. They eat bans and evade and this is the last cope
Fuck debo
Fuck trani
Fuck Iluni
Fuck PW
Fuck /sdg/
>>
>>107033386
flux was also a bit overcooked, that was a large part of what chroma fixed, it destroyed the lack of variety, of course it also destroyed the aesthetic training but I would rather have the more flexible model
>>
>>107033391
this, qwen needs loras or every image looks exactly the same, and then you gota train a fucking lora for any little thing, so qwen is useless cept for super specific shit
>>
>>107033377
Several human spammers with 4chan accounts, no captcha. Janny is either incompetent or one of them since reports arent forwarded.
>>
>>107033405
And have you tried stringing together gens before? this is night and day better, and this is not implemented right
>>
>>107033373
/lmg/ had this exact same spam a few months back. An earlier version just reposted comments from the same thread. Whoever it is, or whatever discord it is, just seems to pick one ai general at a time to focus on shitting up.
>>
>>107033415
so about 4 - 5 strength in high then? Thanks for testing it on 2.2, did you compare outputs with and with out the lora? It really needs to be tested with longer and more complex videos.
>>
>>107033426
kek I never said anything about being able to do unlimited/20 minutes, either way, I dont care, feel free to try it out your self
>>
File: image_00160_.jpg (844 KB, 1264x1656)
844 KB
844 KB JPG
>>
>>107033435
I'm not convinced it would be that easy. do you understand what unlimited video with a 20 minute test with no drift means in terms of wan? You're not gonna be doing 20 minutes of video on your 16GB vram card. Unless it truly is just using the last frame as input for each 5 seconds of video, but that don't make sense when pic related. Why would they go out of their way to make so many versions?
>>
We have a faggot who only exist to grief this thread and has been doing it for years. Is it really far fetched for him or one of his friends to do this even though they attempt to derail the general almost every week?
Some of them have worked professionally in the AI field before spiraling so a bot would make sense.
>>
>>107033449
how do you set that up? couldn't figure it out for the life of me.
>>
>>107033405
Don't jannies have IRC? There's no way to reach them?
>>
>>107033460
and where did they mention that? Or are you just talking out your ass? I don't see them providing a workflow...
>>
>>107033463
No, it's pretty cool to not have color and brightness issues, it's a major pain in the ass.
But it'll be unusable for most people unless they go back to wan2.1 or use a shitty 5B model.
At least until the team makes a wan 2.2 14B version lora, which they didn't promise.

I hope they don't do a nunchaku and just disappear or do random models after the 5B one.
>>
File: image_00163_.jpg (780 KB, 1264x1656)
780 KB
780 KB JPG
>>107033460
>Don't jannies have IRC? There's no way to reach them?
There should be irc channel
>>
>>107033463
They couldn't spam, change the OP or shill trani studio so now they are just fucking with us.
>>
>>107033460
>Don't jannies have IRC? There's no way to reach them?
i'm assuming since this has been going on for days on /ldg/ and allegedly in the past on /lmg/ that the administration is aware of the issue and there is nothing they can currently do about it.

i'm not going to pretend like AI hasn't given me the best orgasms of my life, but it's unfortunate that it has also re-written the social contract of the internet and destroyed the human parts of it almost entirely
>>
>they will surely all come back to my containment general if i shit up the blessed thread
>>
>>107033449
It's most likely just a kid with too much time on their hands with a grudge due to being mocked for being a vramlet. Writing a spambot that uses proxies is easily doable in a weekend especially now that LLMs can assist.
>>
>>107033484
If someone does go to irc please tell the mods about debo and ani they keep trying to either spam or hijack the thread they do it constantly I'm fucking tired
>>
>>107033489
there was a mentally ill highschooler in the early days of /degen/ who spammed clowns, gore, scat etc for 10 hours a day. mods finally banned him after 2 weeks
a few days of text only replyposts isn't really that much in comparison. it does almost completely destroy technical discussion, but on the bright side there's nothing to really discuss right now.

>>107033508
bro the mods dont care about your thread personalities and you shouldn't either.
>>
what does any of this have to do with anistudio? ranfaggot is spinning some yarn here but I don't think she's the bot (too tech illiterate and retarded to do so)
>>
op will take place all such website will be shut down forever. we will and we have enough anons to shut down any website. we will not be bullied or pressured, we will respond absolute, no shitty web site takes us down I am warning you now. On the note we be doxing them all they better not have something to hide because we will find it.
>>
>>107033478
There is, but they kick you immediately for complaining about moderation.
>>
File: image_00165_.jpg (925 KB, 1264x1656)
925 KB
925 KB JPG
>>
>>107033512
They are directly responsible for 90% garbage that happens in this thread. Both of them are avatarfags that wanted to be popular and because nobody likes them they do what we already discussed in the previous post. It's nearly every single fucking week.
>>107033506
No it's the bitter /sdg/ faggots
>>
>>107033512
>it does almost completely destroy technical discussion
hurr duur technical discussion doesn't belong on a tranime shitposting forum
>>
>>107033537
you are the cause of the thread being shitty because you can't stop screaming about boogeyman during your melty
>>
>>107033547
>boogymen
What a odd thing to say
>>
>>107033537
>No it's the bitter /sdg/ faggots
That wouldn't explain why /lmg/ was attacked the same way.
>>
>mention the right names
>spam stops
Curious
>>
it's literally the sharty. nothing else to it
>>
>>107033543
>hurr duur technical discussion doesn't belong on a tranime shitposting forum
i mean sure, but this IS the technology section of the tranime shitposting forum. i'd fully agree with you if we were on any other board, even /ai/ (and the fact that technical discussion on AI would be split between /ai/ and /g/ always, and it fundamentally wouldnt be possible to contain technical discussion of AI to just one board since its so prevalent for both topics is one of the biggest reasons I think /ai/ will just not happen, other than traffic)

>>107033560
I wish I didn't live in FVEY so I could just host my own generative AI imageboard and properly moderate it
>>
>>107033560
If they were behind it, wouldn't they spam more to try to slide it? Unless the spammers are trying to frame them.
>>107033569
This is the most likely.
>>
>>107033577
You underestimate how fucking stupid they really are we have a rentry for a reason and the other one doxxed himself to own the haters.
>>
File: image_00168_.jpg (1022 KB, 1264x1656)
1022 KB
1022 KB JPG
>>
>>107033577
>Unless the spammers are trying to frame them.
what would the spammers get from framing them? why would they care if we know who is spamming or who is not

>>107033590
>doxxed himself to own the haters
ok kek, but i totally understand this because i have had to fight back the urge to prove my 6 foot tall whiteness blue-eyedness a few times when someone accused me of being a brown poo and i KNOW they're the ones who are actually brown
>>
>>107033590
please just kill yourself already niggerjak. nobody cares about your fucking drama posting or shitty art. we tell you this all the fucking time but your stupid nigger brain keeps doubling down
>>
love watchin schizojeets melt down and point fingers over a spambot. local really is dead if this is all they have to discuss
>>
>spam stops
>now targeted seething
This is why you retards are always rejected too prideful and mentally ill to function
>>
>>107033629
>local really is dead if this is all they have to discuss
i don't think anyone is denying this. the last thing to discuss was a slight update to an I2V lightning lora and that was a week ago already

it's annoying that 4chan's backend hasn't improved in like 11 years because I just KNOW that when the first video+audio model comes out, it's going to be so much extra friction sharing gens with sound on /g/, to the point where maybe /gif/ (does /wsg support sound?) will become the main place to discuss that model
>>
File: qwen_00042_.png (1.04 MB, 768x1344)
1.04 MB
1.04 MB PNG
>>
LEAVE THIS PLAE IT IS HOSTILE, USE DARK NET TO FIND OTHER BOARDS AVOID ALL THE EVIL SHIT.

THEY ARE ACTIVELY CENSORING
>>
>disabled nigga thought he could get away with this shit
Your caretaker should beat you for gooning with the tranny too
>>
>>107033661
why do all image and video models suck at high angle shots? the legs are always fuckywucky

>>107033683
most mentally stable bong
>>
>>107033569
I went to their brainrot hovel and don't see any mentions of either /g/ or /ldg/ on their /raid/ board.
>>
LEAVE THIS PLACE NOW MY POT GOT TROUGH ONLY BECAUSE IT WAS NO CRIT OF WHAT WE TALK ABOUT. LEAVE THIS PLACE NOW!

START WITH TOR AND SET JAVA TO DISABLED

IT THEY DO NOT WANT TO TALK WE WILL FUCKING MOVE TO WERE THEY CAN'T SEE US...
>>
>SET JAVA TO DISABLED
WHY DID YOU REDEEM
>>
>>107033705
He's trying to falseflag
Notice how quiet things got once the right names were mentioned followed by the autistic vendetta post.
>>
I WILL GET ON TOR AND POST LINKS THEY THINK THIS IS A GAME.
>>
but it will oom after so many frames... they say it requires frames from previous? Anyway I'm currently building a test workflow using imagetovideo for first sampler then wanAnimate node and i will select from range the last 5 frames and feed into continue motion. But again its not wan doesn't treat each as a new video, it has no context.
>>
i read that in the do not redeem voice
>>
File: 509945118.png (1.13 MB, 1536x640)
1.13 MB
1.13 MB PNG
>>
/sdg/ faggots in shambles
>>
>>107033831
What? I'm still in the middle of testing.
>>
Is there a point in upgrading from python 3.12 to 3.13?
>>
File: FluxKrea_Output_2626626.png (3.75 MB, 2496x1664)
3.75 MB
3.75 MB PNG
>>107033634
I don't think it's stopped DESU, the most recent comments don't necessarily make sense
>>
war is breaking out now and we are al useless because we focus on meme ai.

>>107033831
let me guess you re better but never offer an alternative. ok retard i will see fucking this, i am, a fucking trained killer and that was my fucking job and i didn't like it and it fucked up my head i can find you any time...
>>
bye bye microsoft safety researcher
>>
File: image_00179_.jpg (993 KB, 1264x1656)
993 KB
993 KB JPG
>>
>>107033896
Any ComfyUI-ers around that could please advise on how to do inpainting without it taking 7 hours? I looked into some custom workflows as well but they ass
>>
>>107033831
Honestly I checked it out during the initial spam here yesterday and they kinda have far more actual gens posted than this thread, most of generally high quality
>>
>>107033902
do you even know what is being discussed?
>>
two ldg threads
LFG
>>107033651
>the last thing to discuss was a slight update to an I2V lightning lora and that was a week ago already
Anime meta has completely changed if you hadn't noticed. Huge news and fags need to step up and start migrating already
>>
File: 4218414007.png (1.13 MB, 768x1344)
1.13 MB
1.13 MB PNG
>>
>>107033900
>Any ComfyUI-ers around that could please advise on how to do inpainting without it taking 7 hours
no
>>
Logical next step. You guys should be making cartoons and bankrupting Hollywood.
>>
>>107033920
the anime girl is typing on her computer keyboard, while using a word processor on the computer.

(new lightx2v loras from today)
>>
>>107033921
>making cartoons
in comfyui? no that just sounds fucking aweful

>bankrupting Hollywood
they already do a good enough job themselves
>>
>>107033930
kek, he's not wrong, Qwen and Wan have so little differences between seed
>>
File: image_00181_.jpg (805 KB, 1264x1656)
805 KB
805 KB JPG
>>
if they want me to warn you, you can't hide from /pol/ we are everywhere!

but yeah we are everywhere even where you are now... the act you are so insulting is not only amusing its probably no a good fucking idea eh?

Trust me we are everywhere even the guy that serves you might be one of us.

Goto bed! Or stfu or you will find out you dick head!
>>
>>107033610
>>107033590
This is the level of retardation in these threads.
>>
>>107033900
If you find out how, let me know. Inpainting is a pain in the ass.
>>
>>107033900
If you find out how, let me know. Inpainting is a pain in the ass.
>>
>>107033449
ranfaggot is seething again
please take your medicine every day
general public wants this
>>
>>107033961
If you find out how, let me know. Inpainting is a pain in the ass.
>>
>107033537
>Ran is hallucinating
>>
File: image_00182_.jpg (669 KB, 1264x1656)
669 KB
669 KB JPG
>>
File: 1735287637065893.png (1.05 MB, 1728x1344)
1.05 MB
1.05 MB PNG
>>107030053
>SD ultimate upscale, disable the tiling or make tiles as big as the image
what's the point of using USDU if you disable tiling? just do a normal upscale with controlnet like it was a highres fix
>>
>>107033952
i warned anon. but i did not link
>>107033952
i lost the link, but regardless stfu no one cars bout it. do not share your life no one fucking gives a shit.
>>
I kinda regret posting that Ani dox, when was that, 2 years ago? He's a cunt, but still doesn't feel like he deserved the hate he gets. I was just bored and feeling shitty.
>>
i have things to say the will help for that anon, it would take me a lot of time to locate.

so i might just say here.

your fucking state is amazing but you like everyone loses it.

yeah i will tell you how to be real homeless
>>
homeless is a great opportunity, there is not other thing that ever gave me such feeling.
>>
>>107034035
i don't think ani really cared that much about it but ranfaggot has become the usual thread derailing avatarfag in any other general. if anyone should be doxxed, it's that faggot
>>
File: image.png (490 KB, 1665x1478)
490 KB
490 KB PNG
Is there a way to finetune a LoRa only for fine detail (textures, later stages of the denoising process) and leave the high level structural part (first stages of the diffusion process) alone?
>>
the only thing you need to care about is you!

you still think the devil is real don't you? qwel so o these other tards here. what if you understood it for only 10 minutes how would you face the world then?

i'm giving you real advice here but you might not like it real like this but that is how...
>>
File: qwen_00062_.png (1.33 MB, 1280x960)
1.33 MB
1.33 MB PNG
>>
>>107033912
>Anime meta has completely changed if you hadn't noticed. Huge news and fags need to step up and start migrating already
Lumina 2? it has shitty loras and no NSFW content yet. no one is moving over.
>>
the only thing is you! start being that person, trust me.
>>
>>107034062
Gonna cry?
Stop making up fan fiction
>>
>>107034088
No "detail" lora actually works like how you think it would. The only real solution to "more or better details" is to increase your resolution by means of upscaling and a second pass or using a model with a better VAE.
>>
>>107034104
why is there an eldritch nightmare outside her window?
>>
File: image_00187_.jpg (878 KB, 1264x1656)
878 KB
878 KB JPG
>>107034088
I don't think you need finetune for this. Just enable the lora during later steps of the generation. There should be several extensions/nodes that let's you control lora power per step.
>>
>>107034125
You are the real cancer in these threads, ranfaggot. Just stay in your discord and everything is fine.
>>
>>107034151
this but please leave the discord as well
>>
>>107034120
>no NSFW content
look at the examples on the yume page. plenty of NSFW.
>has shitty loras
because barely anyone knows how to and faggots are stuck to their XL shitmixes just like how they were stuck on 1.5 when XL dropped
>no one is moving over.
those who are tired of XL anime already did
>>
>>107034126
Why? Can't we finetune a model to be specialized at finalizing details? It could even be specialized for in-painting different kinds of textures. For example you could have a human skin expert LoRa. But I guess if you train it for inpainting you don't even have to care about level of detail because it will only ever care about fine grained detail (beside some global aspects like shadows or human shapes).
>>
is LTX-2 local yet?
>>
>>107034135
But the LoRa would work better for fine detail if it was trained specifically for the later stages and not trained to be a generalist.
>>
>>107034088
maybe something like t-lora
>>
>>107034164
>look at the examples on the yume page. plenty of NSFW.
im saying there are no NSFW loras

>because barely anyone knows how to and faggots are stuck to their XL shitmixes just like how they were stuck on 1.5 when XL dropped
yeah, so im saying until the majority move over, it's not really worth paying attention to.

>those who are tired of XL anime already did
but no one is really tired of XL. 99% of AI slop I see posted on 4chan is from XL. Absolutely no one is using Lumina aside from /ldg/'s lumina shill.
>>
if a gen takes more than 10 seconds to complete, model is too big
if the model looks like AOM slop, it's a shitty model
if a model can't into NSFW or artist styles, it's safetyslop
>>
>>107034183
Why are you here if you're not interested in the cutting edge of image diffusion kek
>>
>>107034183
> posted on 4chan is from XL
nai
>>
>>107034173
The only thing that rivals the nothing burger status of LTX is Pony v7
>>
>>107034280
everyone knew it was going to be an abortion so there wasn't really a letdown
>>
>>107033162
I think we might have to go image-only for communication if this continues. Even writing a response to someone's post the text should be in the image.
>>
>>107034183
(I am a real not-bot person and also not the same person you were just replying to FYI) Is there even a specific actual thing you immediately want / need a lora for, though? Like what do you mean by "NSFW loras" in terms of ones that actually serve a purpose and aren't wholly redundant the way tons of Pony ones and Illu ones were / are?
>>
>>107034203
>Why are you here if you're not interested in the cutting edge of image diffusion kek
NTA but i'm only here because I'm interested in the cutting edge of video (and audio I guess) diffusion

and you should be too, because the future of image models is video models run at 1 frame. video models fundamentally just understand the world more, there's no way a pure image model will always be the best one going forward
>>
>>107033921
>Logical next step. You guys should be making cartoons and bankrupting Hollywood.
i made a few music videos. storyboarding and character consistency isn't there yet.

unironically only 10 years until i make a feature-length adaptation of Lolita in first person though
>>
>>107034190
>if a gen takes more than 10 seconds to complete, model is too big
This but I have 8GB vram in a 5 year old gpu.
>>
>>107034190
I routinely wait 2hours for my 5 second wan gens and have no problem.

this new generation has no patience. completely brain rotted by quick dopamine hits from tiktok. I pity you lot.
>>
>>107034346
>I routinely wait 2hours for my 5 second wan gens and have no problem.
except for the fact that because of opportunity cost you can't test slight variations to your prompt to see if something works better
>>
>>107034346
I am speaking of imagen. vidgen isn't really mature yet
>>
>>
>>107034333
>NTA but i'm only here because I'm interested in the cutting edge of video (and audio I guess) diffusion
I feel that was implied but perhaps not.
>because the future of image models is video models run at 1 frame.
Probably. But as it stands now it's not like wan is SOTA for image generation. Its not bad to be clear but it's not like there's any momentum for imagebros to switch to it.
>>
>>107034357
use lightx2 to test variations/loras.
>>
File: screenshot.1761670545.jpg (44 KB, 344x290)
44 KB
44 KB JPG
I have a 3090 and have created 1,700 wan videos since release.
>>
>>107034190
10 seconds at what resolution and on what hardware though
This metric will never make sense unless it's like some 80B Hunyuan 3.0 situation
>>
>>107034365
>clover tattoo
i would be very surprised if it can't do paw print tattoos

>>107034377
>use lightx2 to test variations/loras.
this makes no sense to me
if lightx2v output is close enough to test variations why not use it.
if lightx2v output is not close enough then how do you know that the prompt variation you're testing is going to work on the full step version
>>
>>107033629
The library is full of interesting books. But when there's a rambling homeless man stinking up a half-mile radius around him camped in the middle of that library, it's the only thing anyone can pay attention to. It's not the books' fault.
>>
>>107034280
>>107034295
why
>>
>>107034410
I didn't ask for a paw one specifically TBQH, the original image prompt used for the input image for the vid just said "tattoo" and it was always a clover
>>
>>107034410
>if lightx2v output is close enough to test variations why not use it.
Treat Lightx2 like a watered down version of the real output.

>if lightx2v output is not close enough then how do you know that the prompt variation you're testing is going to work on the full step version
The motion is mostly there, it's just more stiff. Removing the lora will improve basically every aspect of the animation.
>>
>>107034407
at least 1024x1024. 4k is a meme since upscalers exist
>>
For any anon using the res/bongmath combo with multiple chained samplers (like on wan22), an update on the nodes corrected a bug that made output worse. The results are now way better on less steps.
My chained 5/5 lightx2v finally looks very good on it.
>>
>>107034437
oh its i2v okay


>>107034443
but it still changes significantly and you risk 2 hours on it. i guess i just can't believe you because of the fundamentals but thanks for trying to explain
>>
>>107034456
It doesn't become something entirely different from the lightx2 lora. It's just better more fleshed out motion. While it's true there is some risk the video looks like shit despite the lightx2 looking fine, it's a risk i'm willing to take. In most cases that doesn't happen though.
>>
>>107034476
i would love to see an example if you care enough to put one together, but you don't have to because I'm never going to spend more than 15 minutes running a video ever anyways
>>
>>107033610
I had an anon absolutely adamantly insist I "sounded ESL" once despite the fact that I definitely 100% type the same way every other early-to-mid-30s white North American guy who spent a lot of time on traditional PHPBB / VBulletin forums as a kid does lol
>>
>>107034506
that's nothing, I had a group of people convinced that I was the one spamming /aicg/ and they found my LinkedIn and I was some Indian man. you can't make this shit up
>>
>>107034395
I have 12gb card and have created 7000+ wan videos since June.
>>
File: image_00203_.jpg (738 KB, 1264x1656)
738 KB
738 KB JPG
>>
>>107034506
100% that was some projecting spic or jeet
>>
>>107034528
Kek
If anons were always right we'd all be transgender U.S. democrat voters who are somehow simultaneously Indian and actually live in India
>>
>>107034551
Yeah and they're probably 4step lightx2 SHIT. I use 80 steps. I want MAX quality therefore my shit is automatically better.
>>
>>107034551
Wait, actually it's 11000+.
>>
>>107034572
even on a 5090 it should be 1h+
>>
>>107034572
> 4step lightx2
4-8 steps.

> I use 80 steps. I want MAX quality therefore my shit is automatically better.
No, my videos are better, because I've tried all variations of loras and their strengths, prompts, etc.
>>
>>107034551
>>107034574
i2v? of the same image?
>>
File: sss.jpg (925 KB, 3099x882)
925 KB
925 KB JPG
>>107034600
> I've tried all variations of loras and their strengths, prompts, etc.
so have I. Show me your workflow. I bet it doesn't compare to my perfection.
>>
>>107034607
Mostly i2v, about input 1000 images.

>>107034625
My workflows are simple, all the work is in python scripts.

> pic
I hope you did not made this and tweak by hand.
>>
>>107034625
all of that for wan?
what the hell are you doing?
>>
>>107034665
/lmg/
>>
>>107032185
it offloads, just have enough paging file
>>
>>107034680
snake oils
>>
>>107034649
Simplicity is subjective. I have written infinitely more complex things in C++ so much so that this workflow doesn't even register on my radar as complex.

>>107034680
It's an all-in-one workflow.
TV2
I2V
Single/Batch loading images
Interpolation
Upscaling(though I do that in a separate workflow now)
Post processing(color match/film grain)
Sampler switching(uni_pc for anime/deis for realism)
Lightx2 lora switching
Template prompts for commonly used prompts
Mobile notifications when a gen is complete(gotify)

it's really simple stuff. The bulk was done in maybe a few hours?
>>
>>107034719
>Single/Batch loading images
I'm interested by this, is there a specific node or did you do it with multiple ones?
>>
>>107034719
>It's an all-in-one workflow
reddit alert. only jeet retards shove everything into one workflow
>>
>>107034719
>I have written infinitely more complex things in C++
highly doubt since you never wrote a PR to anistudio so we don't have to use the poothon spaghetti anymore
>>
>>107034747
>write my tranny software for me!
>>
>>107034719
> Simplicity is subjective
No, it is not. It's like perfection, but when there is nothing to simplify:
> Antoine de Saint-Exupéry — 'Perfection is achieved, not when there is nothing more to add, but when there is nothing left to take away.'

> I have written infinitely more complex things in C++
Makes sense.
>>
>>107034747
>anon never contributed to ani's project so that means no one here knows c++ despite it being a language every cs grad knows
LOL this guy is legit insane
>>
>107034747
that's some vegan level of bringing up a subject you obsess with
>>
>>107034752
>write for cumfart instead!
fuck off
>>
If you apply the fix suggested here, longcat-video will run in 48GB, and maybe even 32GB if you lower the frames per pass and image size, and skip the refining step (which doesn't seem to do much other than upscale to 720p and fuck things up). I'm referring to their run_demo_long_video.py t2v script for making 1 minute videos.
>>
It's literally just a wrapper. God fucking damnit Ani you massive faggot.
>>
call me when longcat has porn loras
>>
>>107034769
dont get him riled up. he's going to activate the spam bot again
>>
>>107034765
Forgot link https://github.com/meituan-longcat/LongCat-Video/issues/7
>>
>>107034757
>language every cs grad knows
I've met MIT, Stanford, UCLA and NYU CS grads that never touched C or C++. they don't teach it because instructors are fucking retarded nowadays
>>
I don't see why it's an excuse to not contribute to ani's project
>>
I don't see why anons don't just contribute to sdcpp
>>
The SVI loras, are they meant to be used alongside context window? I don't understand how they are meant to produce longer videos.
>>
>>107034822
I honestly just want this so torch can fuck off from local entirely. we are so far behind when it comes to having just werks binaries on the diffusion side
>>
>>107034830
If you try to make longer videos from short ones by reusing their ending frames, you'll notice significant quality drop over time. SVI tries to minimize that quality drop.
>>
ani will save us, as soon as a competent dev contributes!
>>
>>107034884
too bad you suck at everything otherwise it could have been you
>>
>>107034894
he would have never added it if there wasn't that reddit post talking about a custom node making the canceling faster btw
>>
>>107034830
>>107034862
sadly only for wan 2.1
>>
>>107034907
it's because you click run too quickly while gooning and forgot to add the thing to the prompt
>>
>>107034822
People can't be bothered to learn pytorch but yeah, contributing to some bespoke cnile implementation is important
>>
>>107034914
I lurk on /pol/ so I'm kinda used of that aggressivness, feels like home kek, and desu I prefer an angry place over "omg your pronouns are xe/xir, that's awesome!" reddit forced positiveness
>>
>he's mad
lmao
>>
>>107034920
holy MOTHER OF TRVKE
>>
>>107034915
please stay in /lmg/ with your nemo or whatever bot
>>
>>107034862
Ah ok. I do tend to gen around 125frames.

>>107034907
Makes sense as to why I didn't see anything good come out of it when testing.
>>
>>107034822
Why don't you?
>>
>>107034930
>btfos the entire 5000 series
Only if you buy enough for a cluster with nvlink. But (You) wouldn't know that.
>>
>>107034932
that was never the claim, just that it was better than Q8 which is true
>>
>>107034765
>>107034779
Cant remember where I read but arent these guys going to release a block swap node or something to allow for more frames? I could be mistaking it with other devs.

Also Kijai released a refined 2gb+ version, havnt had chance to test it out in case anyone else can https://huggingface.co/Kijai/LongCat-Video_comfy/tree/main

>>107034830
Seemed to produce somewhat better quality with the context nodes but I havnt done a huge amount of testing. I did noticed however with or without context nodes after 15 seconds, the svi loras tend to repeat anyway, similar to context nodes. Here's a 10 second one that I still have saved, may do more tests if I get time this weekend.
>>
>>107034781
>I've met MIT, Stanford, UCLA and NYU CS grads that never touched C or C++. they don't teach it because instructors are fucking retarded nowadays
no you haven't. you can easily prove this post wrong by just checking the current curriculums of the schools you mentioned
>>
why does every general i'm interested in have a fucking psychotic weirdo ruining it fuck
>>
>>107034914
I don't want 10x the amount of bloat. if my only dep is ggml I'd be extremely happy I don't have to juggle pip dependencies almost every update. python is such a fucking shitty thing to scale
>>
>>107034933
>the same outdated image
an objectively right image can't be outdated, it's like saying we shouldn't use einstein relative equations because they are 100 years old, that's retarded as fuck, if it provides something valuable and objectively right, it doesn't matter when it was made, do I really have to explain this simple concept to you, fucking retarded fuck
>>
>>107034940
>it looks completly identical as fp16 anon, you wouldn't see a single difference!
>w-well, it looks completly different but the quality is here!
pick one subhuman
>>
>>107034944
>You're comparing apples to oranges, of course is not the same
is this guy retarded or something? why would you care the model it's being applied on, if it doesn't work on flux it won't work on wan or qwen, the nunchaky guys never said "our quant only work on one model", how do you survive with such a small head anon?
>>
>>107034932
>Ah ok. I do tend to gen around 125frames.
125? why not 129?

>Makes sense as to why I didn't see anything good come out of it when testing
they released the training code, so maybe someone will do a wan 2.2 i2v version
right now they chose the 5b version as their next goal, which is retarded
>>
>>107034945
it's 4x smaller in size but "many of its weights are still fp16", all right I think that's an elaborate bait at this point, take this last (You) saar
>>
>>107034946
you are retarded. I said from the get go that it looks closer than Q8 does to FP16 and it literally looks almost exactly the same, just small variances that would be made from a different seed, no quality loss at all
>>
welcum back spambot :-)
>>
>>107034951
Never said anyone of that you weird cunt. Also where's your gens?
>>
>>107034963
they are different models, that chart is for flux idiot, not qwen, how dumb are you
>>
suffa trani. you'll always be just a f-list lolcow and your program will go nowhere. suffa bitch
>>
>>107034975
also I could just do more steps with nunchuku for more detail and still be faster, but this is side by side, it is nearly identical to FP16, far closer than Q8 is
>>
>>107034977
>that image is like 1 year old
and?
>>
>>107034940
Which lora goes where. high/low noise?

>>107034963
I think I've managed to do around 200 frames before oom.
>5b
What the fuck.
>>
>>107032538
test
>>
I checkboxed all the spam posts and then realized there's no mass report so I guess no reports then.
>>
>>107034940
>https://huggingface.co/Kijai/LongCat-Video_comfy/tree/main
wtf, a lora for wan2.2? did you try it?

>>107034977
it's a spambot anon, ignore
>>
>>107034945
>why does every general i'm interested in have a fucking psychotic weirdo ruining it fuck
anonymity gives, and anonymity takes away

>>107034995
doesnt matter, you'd get warned for reporting it.
>>
>>107034995
jannie will just warn you
>>
>>107034982
Too much leg. This content violates our content policies.
>>
>>107035007
lets remove that burka and give her a nice suit.
>>
>>107034993
>I think I've managed to do around 200 frames before oom.
sure but 129 is basically 8 seconds of gen @16fps (16x8+1), so might as well do that instead of 125
>>
>>107035016
no it is not, nunchku's quants for instance are closer to fp16 than q8 is and look better than q8. They keep the important bits at fp16
>>
ldg always prevails
>>
>>107035034
damn haven't been there in a while thanks for reminding me about it
>>
>>107034995
it's not hard to ignore, so just do that and let the retard waste his life on these baby tantrums
>>
you don't even need a LLM, if you just curate the spambot to only have yes-y and no-y responses like this one >>107035020 you can waste anons time with literally every reply because you need to read more than a single sentence to see if the reply is relevant or not
>>
>>107033415
And how long did it last in lmg? Methinks not as long as what's happened here (over a day straight at this point)
>>
File: 1716575713105840.jpg (37 KB, 948x699)
37 KB
37 KB JPG
>i hate ldg, so im going to spam it, thus keeping it permanently at the top of /g/ and therefore bringing more attention to it
>>
>>107035043
Turns out blur was a few bad seeds on regular heun sampler. The sampler is a bit schizo and messes with prompt following so I'll just go back to default.
>>
>>107035050
then don't use them I guess? that is all wan-chatter is full of, people replacing characters
>>
>>107034993
It's 2.1, just load in the lora as you normally would.

>>107034998
>wtf, a lora for wan2.2? did you try it?
>it's a spambot anon, ignore

Havnt had the chance to, only just seen it today. Still got a lot of svi testing to do, then going to test 2.2 and will move on to longcat, probably by the weekend. Been keeping an eye on this thread for it https://github.com/kijai/ComfyUI-WanVideoWrapper/issues/1570

Also, fair enough about the spam. Just noticed the repeated replies, kek
>>
>I hate ani for being better than me in every way so I'm going to humiliate myself by constantly spamming the same info on him nobody cares about. that will show him!
>>
>>107035058
we definitely need a vae-less edit model, going for a vae destroys the color
>>
>>107035059
nice except the stiff face
>>
>>107035050
>every reply because you need to read more than a single sentence to see if the reply is relevant or not
or you could remember it from a previous thread as this bot is simply reposting old replies
>>
>>107035072
works better than I expected, thanks anon
>>
>>107035077
good luck finding a GPU that can do it. latents are used because it actually fits on consumer hardware
>>
>>107035076
yeah but it's probably the source image isn't anything to sneeze at
>>
>>107035087
>zooms in the image
nothing personal kid
>>
File: 00049-3589327685.jpg (865 KB, 2048x2688)
865 KB
865 KB JPG
Looks like the brain damaged retard is doing his weekly seethe sesh
Learning yume now
>>
>>107035091
If you're not White, lower your tone while speaking on /ldg/.
>>
2538
Add the last four digits of op number to prove you're not a bot.
>>
>>107035104
>Looks like the brain damaged retard is doing his weekly seethe sesh
not sure why niggerjak has the need to waste her time here
>>
>>107035104
his stars doubled and this will keep happening. the people that find it are greybeards and some anons here. if anything that is the sure sign it's autistic approved
>>
>>107035118
based bot
>>
>spambot is unironically ani upset that no one here wants to use his wrapper
HOLLLLLYYY KEEEEKKKKKKKKK
>>
>retard replies to himself while phone posting
I think what's even sadder is you're doing this manually and pulled this in the past kek
>>
>>107035132
nah, you are just schizo niggerjak
>>
Move
>>107032422
>>107032422
>>107032422
Move
>>
>>107035087
>or you could remember it from a previous thread
I have severely deficient autobiographical memory (SDAM) as a result of my aphantasia (can't visualize an apple, which is why i love visual generative ai so much) so that's not really possible for me
>>
>>107035142
keke but also :( im sorry anon
>>
>>107035104
You could've easily made this image with XL. Why would you use Yume just to make the most basic 1girl image ever? The entire point in experimenting with it is to see what it can do that XL cannot.
>>
>>107035172
it's niggerjak, aka the dumbest and maddest retard in /ldg/
>>
tes



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.