Discussion of Free and Open Source Text-to-Image/Video ModelsPrev: >>107030058https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/sd-scripts/tree/sd3https://github.com/derrian-distro/LoRA_Easy_Training_Scriptshttps://github.com/tdrussell/diffusion-pipe>WanXhttps://comfyanonymous.github.io/ComfyUI_examples/wan22/https://github.com/Wan-Video>Neta Yume (Lumina 2)https://civitai.com/models/1790792?modelVersionId=2298660https://gumgum10.github.io/gumgum.github.io/https://neta-lumina-style.tz03.xyz/https://huggingface.co/neta-art/Neta-Lumina>Chromahttps://huggingface.co/lodestones/Chroma1-BaseTraining: https://rentry.org/mvu52t46>Illustrious1girl and Beyond: https://rentry.org/comfyui_guide_1girlTag Explorer: https://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/b/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debo
>two of my failgens made it into the collageIn comfy, when you need to mask and inpaint, you have to manually draw a mask in a different software and plug it back in?The GUI in forge is too potent for for the inpainting for me to make the switch.
comfy should be dragged out on the street and shot
after testing img2vid on ltx2 and grok imagine and hearing audio with video. I can't go back to using wan anymore. Ovi is just a trash and pure copium.
>only 23 images in the last thread while it had 320 repliesgrim
>>107032625real schizo hours
Total SaaS Victory
>>107032568right click an Image Load node"Open in MaskEditor"
we will fucking dismantle cunts faces in the next 6 month, do you have a problem with this? people think they are fucking funny, these people will soon find out
blessed pit of spamshit
>>107032716yes but it also doubles gen time because it has to generate a negative video. you might as well increase cfg if you're gonna use it because 2.0 or 3.0 vs 1.1 is not gonna make a difference in gen time but setting it to 1.0 will>CFG on low noise in Wan2.2using cfg on high noise would make more sense because high noise is responsible for establishing the base motion of the video. low noise just fills in the details
fire fox completelyyou are enemy combaants
existential angst in a hostile universe.
>Why doing open and verifiable research at all? Closed SaaS labs mogs anyway broRetard
>>107032706Excellent, thanks.
cute freckles
>>107032538This was really the best the previous thread had to offer?
>>107032421https://files.catbox.moe/vai8bu.png
>window open and it's 12c outside..what gpu are you genning with? 4x5090?just opening the windows when it's cold should be enough
>>107032805Powerlimit to 75%. Open case.
>>107032706not that anon, but working with inpainting in comfy is something that is beyond me. In forge I could just do the thing and it would work. In comfy (it seems like) I have to use a different model specifically for inpainting, plug a bunch of nodes and shit. I don't get it. I just wanted it to rerun the area I inpainted with the same model I'm already working with...
stop posting scat in a blue general
>>107032623now give her socks
>>107032807>post locally genned video>janny warns mei guess it was too realistic?
I hate going through these papers. Sure, you can make even a small dick look huge on paper if you measure from the asshole.
>>107032819>cogI forgot about that one, need to add it to my failbake list alongside hidream
What the fuck happened here?
>>107032830babe wake up, bytedance released a finetune of wanhttps://huggingface.co/ByteDance/Video-As-Prompt-CogVideoX-5B>Video-As-Promptthat's... interesting...
>>10703283020% boredom, 40% stagnation, 15% shitty software, 5% humiliation ritual and 500% schizophrenia
>>107032830The spambot started posting way more frequently than it is yesterday lol
>>107032854Discord raid, not a bot
>>107032847so true unfortunately
>>107032854no need. if you can't understand what version is the most used you don't need to know.
you have one last warning before i get really nasty
>>107032847Checkpoints have different uses for me.When I make an image, I can use up to 4-5 different ones.Upscaling and inpainting is where it's at.
>>107032918You can rent indians locally on fiverr
So buck broken him and his lot sieges the thread daily
>>107032863I don't think so, the responses in particular seem too immediate not to be automated
>>107032830There is a small group of faggots that hate this general. They seethe because they are not accepted so they decided to dedicate every possible moment into griefing the thread.
Yeah bro don't trust your eyes, trust the paper
PLASMA LATENTS DUDE! it was proven that rescale cfg, cfg++ and other garbage are worse than normal cfg (there was a paper about this), but since it produces DIFFERENT results they think they hit it big, the AUTEURS that we have here LMAO, fucking copers
I'm probably just retarded but isn't chroma flash supposed to be faster? With the recommended settings I get the same gen times as with other chroma checkpoints
>>107033109majority of local copium (ipadapter, regional prompter, rescale cfg) is complete snakeoil trash.
have a fucking happy life /ldg/ bu my country calls and i have to serve. Do no worry in the slightest you will be ok.
>>107033135nta but i could never get the piece of shit to work properly according to the creators video's and following everything exactly. It seemed very limited compared with just copy paste image into gimp, make edit then copy paste back into comfy with out having to fuck around with some tool. maybe i just installed it when it was bugged or something.
>>107033109>With the recommended settings I get the same gen times as with other chroma checkpointsits extremely fast compared to normal chroma, especially if you combine it with Chroma1-HD-fp8_scaled_original_hybrid_large_rev2 and combine it with one of the flash loras instead. >chroma-flash-heun_r32-fp32-pruned.safetensors>chroma-flash-heun_r256-fp32-pruned.safetensorsi'm generating resolutions that took me 2 minutes in 30 seconds, and they look better as well. also make sure you're using 1.0 cfg, that's the entire point
>>107033143You could maybe work something with the Krita extension since it uses layers.
>>107033162is there some way to generate additional information on top of an image, similar to the way a detailer works? i'm not talking about masking with denoise to redraw parts of the image. i mean more like drawing on top of a layer without changing what's underneath, but still using the image underneath as context. A completely random example would be, say a face, where i want to put white liquid on top of it. Masking with a detailer will require me to put a high denoise, which will redraw the face underneath. This is just a completely random example, and this specific problem probably has a specific solution, like combining a face lora with a white liquid lora, but i'm looking for something broader that can cover a ton of different cases. Maybe stuff like flex does this? But i would prefer not to put an additional unet into my workflow.
>>107033147this is extremely freaky. i got this exact response a couple of days ago, relating to another question regarding in painting on top of an image instead of replacing what's in the mask. how is this even possible?
>>107032830https://s o y jakwiki.org/Project_F.A.E.
>>107033181it's a shame wan fucked up the lettering but nice otherwise
>>107033182aw hell yeah
>>107032821When there'a increased mod activity on these threads they are trigger happy about /g/'s offtopic rules for even anons that are simply posting within the thread.
>>107033166
>>107033228the only negative is very slightly slower gens, it only increases quality when you have a negative prompt with stuff like blurry, low res and stuff in it, not using NAG if using CFG 1 is just retarded
>>107033181The spambots and / or human spammers are just scraping old /ldg/ threads and reposting verbatim comments randomly
>>107033246last I read up on it he was working on making it work for flux nunchaku
>>107033246yeah i figured. but it's crazy that i got the same reply twice. i dont even post here that often.
>>107033279he's probably using the lightning loras so he's at cfg 1 (and therefore can't use negative prompts unless he activates NAG)
>>107033279Some anons think it's some rogue mod spamming this place. Who knows.
>>107033294this lora is fucking garbage tho
Frank Miller style
>>107033302only 3 weeks? impressive, took that furry fag 6 months to finish chroma's finetune with 5 millions images
>>107033246Our favorite schizo does that quite often with his foes
>>107033311>being this newdo you really just look at the ugly spaghetti when gening
>>107033294Mods would have revoked their access hours ago if it was some rogue janny or mod. They just don't give a shit.
>>107033322Correct, the baker has a skill issue where the model somehow got worse over time. Perhaps try contacting him about it
>>107033322The that does this has a long history of VPN abuse. This isn't the first time this has happened. There was spam this bad for multiple days during the first pastebin split.
>>107033356Chroma can’t do realism either, only blurry meltyslop
i just started training a character chroma lora with 220 images for science. wish me luck
>>107033363their whole thing is anime which it is the best at
>>107033356Is the spam 4chan wide? Or is it just /ldg/? We could check if it's manual by just omitting thread title and starting new thread
>>107033373>muh coomthe only cope of localkeks
>>107033377replacing a character in a video with another character you give images of
This is a white flag from the small group of faggots that get rejected from this general, lolcows and literal ERPing tranny faggots that want to feel special. They eat bans and evade and this is the last copeFuck deboFuck traniFuck IluniFuck PWFuck /sdg/
>>107033386flux was also a bit overcooked, that was a large part of what chroma fixed, it destroyed the lack of variety, of course it also destroyed the aesthetic training but I would rather have the more flexible model
>>107033391this, qwen needs loras or every image looks exactly the same, and then you gota train a fucking lora for any little thing, so qwen is useless cept for super specific shit
>>107033377Several human spammers with 4chan accounts, no captcha. Janny is either incompetent or one of them since reports arent forwarded.
>>107033405And have you tried stringing together gens before? this is night and day better, and this is not implemented right
>>107033373/lmg/ had this exact same spam a few months back. An earlier version just reposted comments from the same thread. Whoever it is, or whatever discord it is, just seems to pick one ai general at a time to focus on shitting up.
>>107033415so about 4 - 5 strength in high then? Thanks for testing it on 2.2, did you compare outputs with and with out the lora? It really needs to be tested with longer and more complex videos.
>>107033426kek I never said anything about being able to do unlimited/20 minutes, either way, I dont care, feel free to try it out your self
>>107033435I'm not convinced it would be that easy. do you understand what unlimited video with a 20 minute test with no drift means in terms of wan? You're not gonna be doing 20 minutes of video on your 16GB vram card. Unless it truly is just using the last frame as input for each 5 seconds of video, but that don't make sense when pic related. Why would they go out of their way to make so many versions?
We have a faggot who only exist to grief this thread and has been doing it for years. Is it really far fetched for him or one of his friends to do this even though they attempt to derail the general almost every week?Some of them have worked professionally in the AI field before spiraling so a bot would make sense.
>>107033449how do you set that up? couldn't figure it out for the life of me.
>>107033405Don't jannies have IRC? There's no way to reach them?
>>107033460and where did they mention that? Or are you just talking out your ass? I don't see them providing a workflow...
>>107033463No, it's pretty cool to not have color and brightness issues, it's a major pain in the ass.But it'll be unusable for most people unless they go back to wan2.1 or use a shitty 5B model.At least until the team makes a wan 2.2 14B version lora, which they didn't promise.I hope they don't do a nunchaku and just disappear or do random models after the 5B one.
>>107033460>Don't jannies have IRC? There's no way to reach them?There should be irc channel
>>107033463They couldn't spam, change the OP or shill trani studio so now they are just fucking with us.
>>107033460>Don't jannies have IRC? There's no way to reach them?i'm assuming since this has been going on for days on /ldg/ and allegedly in the past on /lmg/ that the administration is aware of the issue and there is nothing they can currently do about it.i'm not going to pretend like AI hasn't given me the best orgasms of my life, but it's unfortunate that it has also re-written the social contract of the internet and destroyed the human parts of it almost entirely
>they will surely all come back to my containment general if i shit up the blessed thread
>>107033449It's most likely just a kid with too much time on their hands with a grudge due to being mocked for being a vramlet. Writing a spambot that uses proxies is easily doable in a weekend especially now that LLMs can assist.
>>107033484If someone does go to irc please tell the mods about debo and ani they keep trying to either spam or hijack the thread they do it constantly I'm fucking tired
>>107033489there was a mentally ill highschooler in the early days of /degen/ who spammed clowns, gore, scat etc for 10 hours a day. mods finally banned him after 2 weeksa few days of text only replyposts isn't really that much in comparison. it does almost completely destroy technical discussion, but on the bright side there's nothing to really discuss right now.>>107033508bro the mods dont care about your thread personalities and you shouldn't either.
what does any of this have to do with anistudio? ranfaggot is spinning some yarn here but I don't think she's the bot (too tech illiterate and retarded to do so)
op will take place all such website will be shut down forever. we will and we have enough anons to shut down any website. we will not be bullied or pressured, we will respond absolute, no shitty web site takes us down I am warning you now. On the note we be doxing them all they better not have something to hide because we will find it.
>>107033478There is, but they kick you immediately for complaining about moderation.
>>107033512They are directly responsible for 90% garbage that happens in this thread. Both of them are avatarfags that wanted to be popular and because nobody likes them they do what we already discussed in the previous post. It's nearly every single fucking week.>>107033506No it's the bitter /sdg/ faggots
>>107033512>it does almost completely destroy technical discussionhurr duur technical discussion doesn't belong on a tranime shitposting forum
>>107033537you are the cause of the thread being shitty because you can't stop screaming about boogeyman during your melty
>>107033547>boogymenWhat a odd thing to say
>>107033537>No it's the bitter /sdg/ faggotsThat wouldn't explain why /lmg/ was attacked the same way.
>mention the right names>spam stopsCurious
it's literally the sharty. nothing else to it
>>107033543>hurr duur technical discussion doesn't belong on a tranime shitposting forumi mean sure, but this IS the technology section of the tranime shitposting forum. i'd fully agree with you if we were on any other board, even /ai/ (and the fact that technical discussion on AI would be split between /ai/ and /g/ always, and it fundamentally wouldnt be possible to contain technical discussion of AI to just one board since its so prevalent for both topics is one of the biggest reasons I think /ai/ will just not happen, other than traffic)>>107033560I wish I didn't live in FVEY so I could just host my own generative AI imageboard and properly moderate it
>>107033560If they were behind it, wouldn't they spam more to try to slide it? Unless the spammers are trying to frame them.>>107033569This is the most likely.
>>107033577You underestimate how fucking stupid they really are we have a rentry for a reason and the other one doxxed himself to own the haters.
>>107033577>Unless the spammers are trying to frame them.what would the spammers get from framing them? why would they care if we know who is spamming or who is not>>107033590>doxxed himself to own the hatersok kek, but i totally understand this because i have had to fight back the urge to prove my 6 foot tall whiteness blue-eyedness a few times when someone accused me of being a brown poo and i KNOW they're the ones who are actually brown
>>107033590please just kill yourself already niggerjak. nobody cares about your fucking drama posting or shitty art. we tell you this all the fucking time but your stupid nigger brain keeps doubling down
love watchin schizojeets melt down and point fingers over a spambot. local really is dead if this is all they have to discuss
>spam stops>now targeted seethingThis is why you retards are always rejected too prideful and mentally ill to function
>>107033629>local really is dead if this is all they have to discussi don't think anyone is denying this. the last thing to discuss was a slight update to an I2V lightning lora and that was a week ago alreadyit's annoying that 4chan's backend hasn't improved in like 11 years because I just KNOW that when the first video+audio model comes out, it's going to be so much extra friction sharing gens with sound on /g/, to the point where maybe /gif/ (does /wsg support sound?) will become the main place to discuss that model
LEAVE THIS PLAE IT IS HOSTILE, USE DARK NET TO FIND OTHER BOARDS AVOID ALL THE EVIL SHIT.THEY ARE ACTIVELY CENSORING
>disabled nigga thought he could get away with this shitYour caretaker should beat you for gooning with the tranny too
>>107033661why do all image and video models suck at high angle shots? the legs are always fuckywucky>>107033683most mentally stable bong
>>107033569I went to their brainrot hovel and don't see any mentions of either /g/ or /ldg/ on their /raid/ board.
LEAVE THIS PLACE NOW MY POT GOT TROUGH ONLY BECAUSE IT WAS NO CRIT OF WHAT WE TALK ABOUT. LEAVE THIS PLACE NOW!START WITH TOR AND SET JAVA TO DISABLEDIT THEY DO NOT WANT TO TALK WE WILL FUCKING MOVE TO WERE THEY CAN'T SEE US...
>SET JAVA TO DISABLEDWHY DID YOU REDEEM
>>107033705He's trying to falseflagNotice how quiet things got once the right names were mentioned followed by the autistic vendetta post.
I WILL GET ON TOR AND POST LINKS THEY THINK THIS IS A GAME.
but it will oom after so many frames... they say it requires frames from previous? Anyway I'm currently building a test workflow using imagetovideo for first sampler then wanAnimate node and i will select from range the last 5 frames and feed into continue motion. But again its not wan doesn't treat each as a new video, it has no context.
i read that in the do not redeem voice
/sdg/ faggots in shambles
>>107033831What? I'm still in the middle of testing.
Is there a point in upgrading from python 3.12 to 3.13?
>>107033634I don't think it's stopped DESU, the most recent comments don't necessarily make sense
war is breaking out now and we are al useless because we focus on meme ai.>>107033831let me guess you re better but never offer an alternative. ok retard i will see fucking this, i am, a fucking trained killer and that was my fucking job and i didn't like it and it fucked up my head i can find you any time...
bye bye microsoft safety researcher
>>107033896Any ComfyUI-ers around that could please advise on how to do inpainting without it taking 7 hours? I looked into some custom workflows as well but they ass
>>107033831Honestly I checked it out during the initial spam here yesterday and they kinda have far more actual gens posted than this thread, most of generally high quality
>>107033902do you even know what is being discussed?
two ldg threadsLFG>>107033651>the last thing to discuss was a slight update to an I2V lightning lora and that was a week ago already Anime meta has completely changed if you hadn't noticed. Huge news and fags need to step up and start migrating already
>>107033900>Any ComfyUI-ers around that could please advise on how to do inpainting without it taking 7 hoursno
Logical next step. You guys should be making cartoons and bankrupting Hollywood.
>>107033920the anime girl is typing on her computer keyboard, while using a word processor on the computer.(new lightx2v loras from today)
>>107033921>making cartoonsin comfyui? no that just sounds fucking aweful>bankrupting Hollywoodthey already do a good enough job themselves
>>107033930kek, he's not wrong, Qwen and Wan have so little differences between seed
if they want me to warn you, you can't hide from /pol/ we are everywhere!but yeah we are everywhere even where you are now... the act you are so insulting is not only amusing its probably no a good fucking idea eh?Trust me we are everywhere even the guy that serves you might be one of us.Goto bed! Or stfu or you will find out you dick head!
>>107033610>>107033590This is the level of retardation in these threads.
>>107033900If you find out how, let me know. Inpainting is a pain in the ass.
>>107033449ranfaggot is seething againplease take your medicine every daygeneral public wants this
>>107033961If you find out how, let me know. Inpainting is a pain in the ass.
>107033537>Ran is hallucinating
>>107030053>SD ultimate upscale, disable the tiling or make tiles as big as the imagewhat's the point of using USDU if you disable tiling? just do a normal upscale with controlnet like it was a highres fix
>>107033952i warned anon. but i did not link >>107033952i lost the link, but regardless stfu no one cars bout it. do not share your life no one fucking gives a shit.
I kinda regret posting that Ani dox, when was that, 2 years ago? He's a cunt, but still doesn't feel like he deserved the hate he gets. I was just bored and feeling shitty.
i have things to say the will help for that anon, it would take me a lot of time to locate.so i might just say here. your fucking state is amazing but you like everyone loses it.yeah i will tell you how to be real homeless
homeless is a great opportunity, there is not other thing that ever gave me such feeling.
>>107034035i don't think ani really cared that much about it but ranfaggot has become the usual thread derailing avatarfag in any other general. if anyone should be doxxed, it's that faggot
Is there a way to finetune a LoRa only for fine detail (textures, later stages of the denoising process) and leave the high level structural part (first stages of the diffusion process) alone?
the only thing you need to care about is you!you still think the devil is real don't you? qwel so o these other tards here. what if you understood it for only 10 minutes how would you face the world then?i'm giving you real advice here but you might not like it real like this but that is how...
>>107033912>Anime meta has completely changed if you hadn't noticed. Huge news and fags need to step up and start migrating alreadyLumina 2? it has shitty loras and no NSFW content yet. no one is moving over.
the only thing is you! start being that person, trust me.
>>107034062Gonna cry?Stop making up fan fiction
>>107034088No "detail" lora actually works like how you think it would. The only real solution to "more or better details" is to increase your resolution by means of upscaling and a second pass or using a model with a better VAE.
>>107034104why is there an eldritch nightmare outside her window?
>>107034088I don't think you need finetune for this. Just enable the lora during later steps of the generation. There should be several extensions/nodes that let's you control lora power per step.
>>107034125You are the real cancer in these threads, ranfaggot. Just stay in your discord and everything is fine.
>>107034151this but please leave the discord as well
>>107034120>no NSFW contentlook at the examples on the yume page. plenty of NSFW.>has shitty lorasbecause barely anyone knows how to and faggots are stuck to their XL shitmixes just like how they were stuck on 1.5 when XL dropped >no one is moving over.those who are tired of XL anime already did
>>107034126Why? Can't we finetune a model to be specialized at finalizing details? It could even be specialized for in-painting different kinds of textures. For example you could have a human skin expert LoRa. But I guess if you train it for inpainting you don't even have to care about level of detail because it will only ever care about fine grained detail (beside some global aspects like shadows or human shapes).
is LTX-2 local yet?
>>107034135But the LoRa would work better for fine detail if it was trained specifically for the later stages and not trained to be a generalist.
>>107034088maybe something like t-lora
>>107034164>look at the examples on the yume page. plenty of NSFW.im saying there are no NSFW loras>because barely anyone knows how to and faggots are stuck to their XL shitmixes just like how they were stuck on 1.5 when XL dropped yeah, so im saying until the majority move over, it's not really worth paying attention to.>those who are tired of XL anime already didbut no one is really tired of XL. 99% of AI slop I see posted on 4chan is from XL. Absolutely no one is using Lumina aside from /ldg/'s lumina shill.
if a gen takes more than 10 seconds to complete, model is too bigif the model looks like AOM slop, it's a shitty modelif a model can't into NSFW or artist styles, it's safetyslop
>>107034183Why are you here if you're not interested in the cutting edge of image diffusion kek
>>107034183> posted on 4chan is from XLnai
>>107034173The only thing that rivals the nothing burger status of LTX is Pony v7
>>107034280everyone knew it was going to be an abortion so there wasn't really a letdown
>>107033162I think we might have to go image-only for communication if this continues. Even writing a response to someone's post the text should be in the image.
>>107034183(I am a real not-bot person and also not the same person you were just replying to FYI) Is there even a specific actual thing you immediately want / need a lora for, though? Like what do you mean by "NSFW loras" in terms of ones that actually serve a purpose and aren't wholly redundant the way tons of Pony ones and Illu ones were / are?
>>107034203>Why are you here if you're not interested in the cutting edge of image diffusion kekNTA but i'm only here because I'm interested in the cutting edge of video (and audio I guess) diffusionand you should be too, because the future of image models is video models run at 1 frame. video models fundamentally just understand the world more, there's no way a pure image model will always be the best one going forward
>>107033921>Logical next step. You guys should be making cartoons and bankrupting Hollywood.i made a few music videos. storyboarding and character consistency isn't there yet. unironically only 10 years until i make a feature-length adaptation of Lolita in first person though
>>107034190>if a gen takes more than 10 seconds to complete, model is too bigThis but I have 8GB vram in a 5 year old gpu.
>>107034190I routinely wait 2hours for my 5 second wan gens and have no problem.this new generation has no patience. completely brain rotted by quick dopamine hits from tiktok. I pity you lot.
>>107034346>I routinely wait 2hours for my 5 second wan gens and have no problem.except for the fact that because of opportunity cost you can't test slight variations to your prompt to see if something works better
>>107034346I am speaking of imagen. vidgen isn't really mature yet
>>107034333>NTA but i'm only here because I'm interested in the cutting edge of video (and audio I guess) diffusion I feel that was implied but perhaps not. >because the future of image models is video models run at 1 frame.Probably. But as it stands now it's not like wan is SOTA for image generation. Its not bad to be clear but it's not like there's any momentum for imagebros to switch to it.
>>107034357use lightx2 to test variations/loras.
I have a 3090 and have created 1,700 wan videos since release.
>>10703419010 seconds at what resolution and on what hardware thoughThis metric will never make sense unless it's like some 80B Hunyuan 3.0 situation
>>107034365>clover tattooi would be very surprised if it can't do paw print tattoos>>107034377>use lightx2 to test variations/loras.this makes no sense to me if lightx2v output is close enough to test variations why not use it. if lightx2v output is not close enough then how do you know that the prompt variation you're testing is going to work on the full step version
>>107033629The library is full of interesting books. But when there's a rambling homeless man stinking up a half-mile radius around him camped in the middle of that library, it's the only thing anyone can pay attention to. It's not the books' fault.
>>107034280>>107034295why
>>107034410I didn't ask for a paw one specifically TBQH, the original image prompt used for the input image for the vid just said "tattoo" and it was always a clover
>>107034410>if lightx2v output is close enough to test variations why not use it. Treat Lightx2 like a watered down version of the real output. >if lightx2v output is not close enough then how do you know that the prompt variation you're testing is going to work on the full step versionThe motion is mostly there, it's just more stiff. Removing the lora will improve basically every aspect of the animation.
>>107034407at least 1024x1024. 4k is a meme since upscalers exist
For any anon using the res/bongmath combo with multiple chained samplers (like on wan22), an update on the nodes corrected a bug that made output worse. The results are now way better on less steps.My chained 5/5 lightx2v finally looks very good on it.
>>107034437oh its i2v okay>>107034443but it still changes significantly and you risk 2 hours on it. i guess i just can't believe you because of the fundamentals but thanks for trying to explain
>>107034456It doesn't become something entirely different from the lightx2 lora. It's just better more fleshed out motion. While it's true there is some risk the video looks like shit despite the lightx2 looking fine, it's a risk i'm willing to take. In most cases that doesn't happen though.
>>107034476i would love to see an example if you care enough to put one together, but you don't have to because I'm never going to spend more than 15 minutes running a video ever anyways
>>107033610I had an anon absolutely adamantly insist I "sounded ESL" once despite the fact that I definitely 100% type the same way every other early-to-mid-30s white North American guy who spent a lot of time on traditional PHPBB / VBulletin forums as a kid does lol
>>107034506that's nothing, I had a group of people convinced that I was the one spamming /aicg/ and they found my LinkedIn and I was some Indian man. you can't make this shit up
>>107034395I have 12gb card and have created 7000+ wan videos since June.
>>107034506100% that was some projecting spic or jeet
>>107034528KekIf anons were always right we'd all be transgender U.S. democrat voters who are somehow simultaneously Indian and actually live in India
>>107034551Yeah and they're probably 4step lightx2 SHIT. I use 80 steps. I want MAX quality therefore my shit is automatically better.
>>107034551Wait, actually it's 11000+.
>>107034572even on a 5090 it should be 1h+
>>107034572> 4step lightx2 4-8 steps.> I use 80 steps. I want MAX quality therefore my shit is automatically better.No, my videos are better, because I've tried all variations of loras and their strengths, prompts, etc.
>>107034551>>107034574i2v? of the same image?
>>107034600> I've tried all variations of loras and their strengths, prompts, etc.so have I. Show me your workflow. I bet it doesn't compare to my perfection.
>>107034607Mostly i2v, about input 1000 images.>>107034625My workflows are simple, all the work is in python scripts.> picI hope you did not made this and tweak by hand.
>>107034625all of that for wan? what the hell are you doing?
>>107034665/lmg/
>>107032185it offloads, just have enough paging file
>>107034680snake oils
>>107034649Simplicity is subjective. I have written infinitely more complex things in C++ so much so that this workflow doesn't even register on my radar as complex. >>107034680It's an all-in-one workflow.TV2I2VSingle/Batch loading imagesInterpolationUpscaling(though I do that in a separate workflow now)Post processing(color match/film grain)Sampler switching(uni_pc for anime/deis for realism)Lightx2 lora switchingTemplate prompts for commonly used promptsMobile notifications when a gen is complete(gotify)it's really simple stuff. The bulk was done in maybe a few hours?
>>107034719>Single/Batch loading imagesI'm interested by this, is there a specific node or did you do it with multiple ones?
>>107034719>It's an all-in-one workflowreddit alert. only jeet retards shove everything into one workflow
>>107034719>I have written infinitely more complex things in C++highly doubt since you never wrote a PR to anistudio so we don't have to use the poothon spaghetti anymore
>>107034747>write my tranny software for me!
>>107034719> Simplicity is subjectiveNo, it is not. It's like perfection, but when there is nothing to simplify:> Antoine de Saint-Exupéry — 'Perfection is achieved, not when there is nothing more to add, but when there is nothing left to take away.'> I have written infinitely more complex things in C++Makes sense.
>>107034747>anon never contributed to ani's project so that means no one here knows c++ despite it being a language every cs grad knowsLOL this guy is legit insane
>107034747that's some vegan level of bringing up a subject you obsess with
>>107034752>write for cumfart instead!fuck off
If you apply the fix suggested here, longcat-video will run in 48GB, and maybe even 32GB if you lower the frames per pass and image size, and skip the refining step (which doesn't seem to do much other than upscale to 720p and fuck things up). I'm referring to their run_demo_long_video.py t2v script for making 1 minute videos.
It's literally just a wrapper. God fucking damnit Ani you massive faggot.
call me when longcat has porn loras
>>107034769dont get him riled up. he's going to activate the spam bot again
>>107034765Forgot link https://github.com/meituan-longcat/LongCat-Video/issues/7
>>107034757>language every cs grad knowsI've met MIT, Stanford, UCLA and NYU CS grads that never touched C or C++. they don't teach it because instructors are fucking retarded nowadays
I don't see why it's an excuse to not contribute to ani's project
I don't see why anons don't just contribute to sdcpp
The SVI loras, are they meant to be used alongside context window? I don't understand how they are meant to produce longer videos.
>>107034822I honestly just want this so torch can fuck off from local entirely. we are so far behind when it comes to having just werks binaries on the diffusion side
>>107034830If you try to make longer videos from short ones by reusing their ending frames, you'll notice significant quality drop over time. SVI tries to minimize that quality drop.
ani will save us, as soon as a competent dev contributes!
>>107034884too bad you suck at everything otherwise it could have been you
>>107034894he would have never added it if there wasn't that reddit post talking about a custom node making the canceling faster btw
>>107034830>>107034862sadly only for wan 2.1
>>107034907it's because you click run too quickly while gooning and forgot to add the thing to the prompt
>>107034822People can't be bothered to learn pytorch but yeah, contributing to some bespoke cnile implementation is important
>>107034914I lurk on /pol/ so I'm kinda used of that aggressivness, feels like home kek, and desu I prefer an angry place over "omg your pronouns are xe/xir, that's awesome!" reddit forced positiveness
>he's madlmao
>>107034920holy MOTHER OF TRVKE
>>107034915please stay in /lmg/ with your nemo or whatever bot
>>107034862Ah ok. I do tend to gen around 125frames.>>107034907Makes sense as to why I didn't see anything good come out of it when testing.
>>107034822Why don't you?
>>107034930>btfos the entire 5000 seriesOnly if you buy enough for a cluster with nvlink. But (You) wouldn't know that.
>>107034932that was never the claim, just that it was better than Q8 which is true
>>107034765>>107034779Cant remember where I read but arent these guys going to release a block swap node or something to allow for more frames? I could be mistaking it with other devs.Also Kijai released a refined 2gb+ version, havnt had chance to test it out in case anyone else can https://huggingface.co/Kijai/LongCat-Video_comfy/tree/main>>107034830Seemed to produce somewhat better quality with the context nodes but I havnt done a huge amount of testing. I did noticed however with or without context nodes after 15 seconds, the svi loras tend to repeat anyway, similar to context nodes. Here's a 10 second one that I still have saved, may do more tests if I get time this weekend.
>>107034781>I've met MIT, Stanford, UCLA and NYU CS grads that never touched C or C++. they don't teach it because instructors are fucking retarded nowadaysno you haven't. you can easily prove this post wrong by just checking the current curriculums of the schools you mentioned
why does every general i'm interested in have a fucking psychotic weirdo ruining it fuck
>>107034914I don't want 10x the amount of bloat. if my only dep is ggml I'd be extremely happy I don't have to juggle pip dependencies almost every update. python is such a fucking shitty thing to scale
>>107034933>the same outdated imagean objectively right image can't be outdated, it's like saying we shouldn't use einstein relative equations because they are 100 years old, that's retarded as fuck, if it provides something valuable and objectively right, it doesn't matter when it was made, do I really have to explain this simple concept to you, fucking retarded fuck
>>107034940>it looks completly identical as fp16 anon, you wouldn't see a single difference!>w-well, it looks completly different but the quality is here!pick one subhuman
>>107034944>You're comparing apples to oranges, of course is not the sameis this guy retarded or something? why would you care the model it's being applied on, if it doesn't work on flux it won't work on wan or qwen, the nunchaky guys never said "our quant only work on one model", how do you survive with such a small head anon?
>>107034932>Ah ok. I do tend to gen around 125frames.125? why not 129?>Makes sense as to why I didn't see anything good come out of it when testingthey released the training code, so maybe someone will do a wan 2.2 i2v versionright now they chose the 5b version as their next goal, which is retarded
>>107034945it's 4x smaller in size but "many of its weights are still fp16", all right I think that's an elaborate bait at this point, take this last (You) saar
>>107034946you are retarded. I said from the get go that it looks closer than Q8 does to FP16 and it literally looks almost exactly the same, just small variances that would be made from a different seed, no quality loss at all
welcum back spambot :-)
>>107034951Never said anyone of that you weird cunt. Also where's your gens?
>>107034963they are different models, that chart is for flux idiot, not qwen, how dumb are you
suffa trani. you'll always be just a f-list lolcow and your program will go nowhere. suffa bitch
>>107034975also I could just do more steps with nunchuku for more detail and still be faster, but this is side by side, it is nearly identical to FP16, far closer than Q8 is
>>107034977>that image is like 1 year oldand?
>>107034940Which lora goes where. high/low noise?>>107034963I think I've managed to do around 200 frames before oom.>5bWhat the fuck.
>>107032538test
I checkboxed all the spam posts and then realized there's no mass report so I guess no reports then.
>>107034940>https://huggingface.co/Kijai/LongCat-Video_comfy/tree/mainwtf, a lora for wan2.2? did you try it?>>107034977it's a spambot anon, ignore
>>107034945>why does every general i'm interested in have a fucking psychotic weirdo ruining it fuckanonymity gives, and anonymity takes away>>107034995doesnt matter, you'd get warned for reporting it.
>>107034995jannie will just warn you
>>107034982Too much leg. This content violates our content policies.
>>107035007lets remove that burka and give her a nice suit.
>>107034993>I think I've managed to do around 200 frames before oom.sure but 129 is basically 8 seconds of gen @16fps (16x8+1), so might as well do that instead of 125
>>107035016no it is not, nunchku's quants for instance are closer to fp16 than q8 is and look better than q8. They keep the important bits at fp16
ldg always prevails
>>107035034damn haven't been there in a while thanks for reminding me about it
>>107034995it's not hard to ignore, so just do that and let the retard waste his life on these baby tantrums
you don't even need a LLM, if you just curate the spambot to only have yes-y and no-y responses like this one >>107035020 you can waste anons time with literally every reply because you need to read more than a single sentence to see if the reply is relevant or not
>>107033415And how long did it last in lmg? Methinks not as long as what's happened here (over a day straight at this point)
>i hate ldg, so im going to spam it, thus keeping it permanently at the top of /g/ and therefore bringing more attention to it
>>107035043Turns out blur was a few bad seeds on regular heun sampler. The sampler is a bit schizo and messes with prompt following so I'll just go back to default.
>>107035050then don't use them I guess? that is all wan-chatter is full of, people replacing characters
>>107034993It's 2.1, just load in the lora as you normally would.>>107034998>wtf, a lora for wan2.2? did you try it?>it's a spambot anon, ignoreHavnt had the chance to, only just seen it today. Still got a lot of svi testing to do, then going to test 2.2 and will move on to longcat, probably by the weekend. Been keeping an eye on this thread for it https://github.com/kijai/ComfyUI-WanVideoWrapper/issues/1570Also, fair enough about the spam. Just noticed the repeated replies, kek
>I hate ani for being better than me in every way so I'm going to humiliate myself by constantly spamming the same info on him nobody cares about. that will show him!
>>107035058we definitely need a vae-less edit model, going for a vae destroys the color
>>107035059nice except the stiff face
>>107035050>every reply because you need to read more than a single sentence to see if the reply is relevant or notor you could remember it from a previous thread as this bot is simply reposting old replies
>>107035072works better than I expected, thanks anon
>>107035077good luck finding a GPU that can do it. latents are used because it actually fits on consumer hardware
>>107035076yeah but it's probably the source image isn't anything to sneeze at
>>107035087>zooms in the imagenothing personal kid
Looks like the brain damaged retard is doing his weekly seethe seshLearning yume now
>>107035091If you're not White, lower your tone while speaking on /ldg/.
2538Add the last four digits of op number to prove you're not a bot.
>>107035104>Looks like the brain damaged retard is doing his weekly seethe seshnot sure why niggerjak has the need to waste her time here
>>107035104his stars doubled and this will keep happening. the people that find it are greybeards and some anons here. if anything that is the sure sign it's autistic approved
>>107035118based bot
>spambot is unironically ani upset that no one here wants to use his wrapper HOLLLLLYYY KEEEEKKKKKKKKK
>retard replies to himself while phone postingI think what's even sadder is you're doing this manually and pulled this in the past kek
>>107035132nah, you are just schizo niggerjak
Move>>107032422>>107032422>>107032422Move
>>107035087>or you could remember it from a previous threadI have severely deficient autobiographical memory (SDAM) as a result of my aphantasia (can't visualize an apple, which is why i love visual generative ai so much) so that's not really possible for me
>>107035142keke but also :( im sorry anon
>>107035104You could've easily made this image with XL. Why would you use Yume just to make the most basic 1girl image ever? The entire point in experimenting with it is to see what it can do that XL cannot.
>>107035172it's niggerjak, aka the dumbest and maddest retard in /ldg/
tes