Discussion of Free and Open Source Text-to-Image/Video ModelsPrev: >>107023503https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/sd-scripts/tree/sd3https://github.com/derrian-distro/LoRA_Easy_Training_Scriptshttps://github.com/tdrussell/diffusion-pipe>WanXhttps://comfyanonymous.github.io/ComfyUI_examples/wan22/https://github.com/Wan-Video>Neta Yume (Lumina 2)https://civitai.com/models/1790792?modelVersionId=2298660https://gumgum10.github.io/gumgum.github.io/https://neta-lumina-style.tz03.xyz/https://huggingface.co/neta-art/Neta-Lumina>Chromahttps://huggingface.co/lodestones/Chroma1-BaseTraining: https://rentry.org/mvu52t46>Illustrious1girl and Beyond: https://rentry.org/comfyui_guide_1girlTag Explorer: https://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/b/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debo
>3 hour threads on /g/that's faster than some /vg/ generals with natural posters. this spam is getting out of hand.
wow the local scene sure is bussling. so much discussion happening, I can't keep up!
>>107025208Perhaps janny should forward reports?
Blessed thread of frenship
>copy-pasting posts from archives>copy-pasting posts from reddit>samefagging>falseflagging>splitting the general>using up the image limitshow did /ldg/ make this guy so butthurt
>>107025229by simply existing kek
>>107025222janny's are too lazy to actually read why something is being reported so nothing ever happens. only NSFW content gets deleted immediately
1girl extravaganza
>>107025229so wtf is that schizo faggots problem?why cant this moron just fuck off?
>>107025241you can't write anything down though, only select from a drop-down list?
Whoever the anon was that mentioned LanPaint a couple of threads ago, thank you.
>>107025257there's spamming/flooding/low quality posts, but that requires the janny to sit in the thread and investigate. dealt with the same shit in the sonic general in the past. always some autist doing shit like this
>>107025241Not necessarily true. On some boards you get banned right away even for arbitrary rules.
>>107025276How's chroma with it? same gen speed? looks nice
>>107025276kino gen anon
>>107025295Did not work, trying another gen without any loras to see if there's any weird interaction fucking up.Or maybe I suck at prompting
>>107025302Hopefully its not another dead project that'll never release their model. Speaking of released models,wonder if Kijai or anyone know that Rolling Forcing is already out https://huggingface.co/TencentARC/RollingForcing/tree/main/checkpoints
where can I ask questions about TTS?
>>107025329Don't know if you're trolling me or not but will try it out lol
>>107025109I actually wasn't a bot so thank you for the explanation anon.
>>107025335what do you mean? you want to hear my question?
>>107025341on sdxl, hands are very difficult to get right especially when prompting for complex poses with foreshortening and combat involved. adetailer can't fix all the aspects of a bad hands and fingers.
>>107025344i have no idea how to set this up. i just set up a text-to-image generator once using A1111
spacker
>>107025351Idk about that model specifically but if you have a decent GPU you can just try cloning their repo and running it locally. Lots of those example apps on huggingface can just be cloned and run locally.
>>107025346well, i'm using vibevoice with comfyui and it works pretty well/fast considering i only have a 8gb card. but i'm wondering how to improve my results...
>>107025352>he doesn't have 8 (6) fingers on his right hand.
>>107025357i2v does a lot of the heavy lifting for getting a satisfactory gen though, it's understandable though not desirable.
>>107025351prompt?
>>107025378Man I am not even trying to be a "hater" but can you look at your gens for longer than 2 seconds before posting them here?She has like 8 fingers in her right hand.
>>107025351look at this niggas head lmao
>>107025390just check the default templates, retard.do you know how to breathe?
It finally reposted a post of my own kek.
>>1070253513.6 roentgens?
>>107025401Either jump on the API train or get run over by it, API is the future.
>>107025403OpenAI did push the overtron window, but I don't believe this was their intention, they just wanted hype by showing their model could do Will Smith playing ping pong against 2pac, ff7 style and shit, they know what people like, so they bait by letting them do copyright shit for like one week and then switch to stay safe, they did this on 4o and dalle3 as well, I'm NOOOTICING the pattern at this pointbut hey, everything that pushes the overton window in the right direction is welcome, even if it's not being done intentionally
>>107025401*Julien
>>107025421When mentioning the censorship i meant its not censored against nsfw, given that it so easily learns any nsfw concept in lora training.With IP characters, it doesn't learn them as fast and they are not the same type of data to expect the model to be able to generate compared to nsfw because there is a difference for a company to train heavily on youtube and when asked about IP rights say "oh well we trained on everything like sora 2 did" versus them training on a huge dataset of literal porn.And im not saying a model should be limited in the data its trained on even when it comes to porn given the anatomy benefits, its just that training on porn for a model company is not something that we can almost ever really expect to happen, meaning when it comes to the discussion of censorship, what that really means is that as long as the model is not specifically trained against genning nsfw or it gets lobotomized to the tier of sd3 so it cant even generate women, then that is good enough of a sign that the model's core wasnt "censored"/lobotomized.
>>107025295The inpainting is unfortunately slow. A gen still takes me about 35 seconds and when I want to fix hands/fingers, I just switch to inpainting using grouped nodes within the wf and it takes about ~200 seconds, but the result is good. No more fucked up fingers/hands.
me on the right
>>107025437What matters is releasing good models, and wan 2.1 was a huge jump they contributed that wont be matched any time soon that they didnt need to release, there was no pressure in that space from anyone else, hunyuan was okish but still very much a toy
>>107025441meh, I saw some ltx 2 videos, the sound is atrocious
>>107025437at least it works, could be worse
>>107025475Oh ignore that one, I wrote in a later post that I went back to the original one and it's working, but it's not quite as good for last frame, weird things happening.
>>107025495No it's working for me. The quality is equivalent to going 720p on low noise, but the motion is enhanced.
>>107025341No problem
>>107025473>sewer k-poplul
>>107025504Yw
>>107025381>he thinks counting fingers is a merit
>>107025620Kek
>>107025651>replying to botis it over btw?
>>107025684?
>>107025351This nigga lookin' ZESTY, this nigga lookin' MOIST, he's got sugar in his tank, he's light on his feet, he's a Iil' bit fruity, he plays for the other team, he dances at the other end of the ballroom, this nigga theatrical, this nigga good with colors, this nigga gonna coordinate yo curtains wit you cushions and that shit gonna look good! This nigga lifts shirts, this nigga on the down low, this nigga be a tollet trader, this nigga gardens uphill, this nigga packs fudge, he's a friend of Dorothy, he feels the love that dare not speak Its name, he loves to dance, he's of the Uranian brotherhood, he indulges in the French vice, he has an antipathic sexual Instinct, he's fluent in Polari, he's a refugee from Sodom, he's on the wrong bus, he bats for the other team, he's temperamental, he's one of em if you catch my drift.
Ran has destroyed this thread.
>>107025276im here bro, glad you liked it and are getting mileage out of it.
>ranfom seething about $namefag
>>107025620Aww poor girl.
>>107025973she's built different
so how 2 train yume loras?
>"we need to go out more!">take her to a new place>she hates it
Why do my videos come out so blurry?https://files.catbox.moe/9nlzj0.mp4
>>107026198Because you're only using 4 steps without using the lightx2 lora.
>>107026198Because you are a faggot.
>>107018205>>107018254cщьaĐ½ Ñ„a
>>107026219oh man, i am retarded. thanks man
>>107026219>>107026198Also your CFG is set to 1. Again, i don't see the lightx2 lora in your workflow.
>>107026219this. also set fps to 16 or you'll have a double speed video. if you want better quality set steps to 6 in both samplers, end_at_step 3 on high, start_at_step_3 on low
>weather is cold enough to gen 24/7 without needing to turn the A/C on>keeps room comfygod how i missed cold weather. please never go away
>>107026246>i don't see the lightx2 lora in your workflowntaThere have been so many update of lightx2 lora recently that I'm confused which one to use
>>107025736Funny
>>107026157>Let's drink gravy!lol
give the man an outfit like a white wizard and a staff that is firing lightning at the dog on the left side of the room.
>>107026293https://huggingface.co/Kijai/WanVideo_comfy/blob/main/LoRAs/Wan22_Lightx2v/Wan_2_2_I2V_A14B_HIGH_lightx2v_MoE_distill_lora_rank_64_bf16.safetensorshttps://huggingface.co/Kijai/WanVideo_comfy/blob/main/LoRAs/Wan22-Lightning/old/Wan2.2-Lightning_I2V-A14B-4steps-lora_LOW_fp16.safetensors
>>107026342better:>>107026347this works well, the MoE one fixed all the slowmo issues the old 2.2 had.
>>107026347https://huggingface.co/lightx2v/Wan2.2-Distill-Loras/tree/main
what sampler and what cfg do you use for chroma flash
>>107026531I do 12 steps DEIS simple, 0.85 cfg (1.0 is intended but you get a little more output variety going under that without ruining it too much)
>>107026246what is cfg supposed to be at for video gen?>>107026219I've added LoRA loaders, but not ComfyUI crashes while trying to VAE decode the latent video. The terminal just says "pause" and prompts me to press any key to continue.
>>107026547THX
>>107026531Heun/beta 8 steps with cfg 1 is what is intended.
>>107026567>what is cfg supposed to be at for video gen?1 if you're using the light loras. around 3.5 if you're not. are you maxing out your vram when decoding? you might have to use tiled decode
>>107026567Stick to a decent workflow form civit instead of doing random shit you don't understand.You have I2V and you are using T2V loras.CFG should be 1 for the lora. 3.5-5 without it, I think.Set length to 81 and fps to 16.If you are still getting crashes, perhaps due to a bug, you can use comfyui unload model extension to unload the low noise model before loading the vae.
>>107026614I don't like this kind of faux-edgy lazy-blasphemy Madonna-style use of crossesotherwise cute
>>107026328lmao
>>107026695Doing random shit and failing is how you learn. Your "advice" is designed to keep him helpless, it's advice for slaves
qwen edit is a fun model
man im completely drunk, can anyone post some 1 girls laughing at viewer? thanks
is the repost bot done yet?
>>107026760Can I be put in the next OP?
>>107026736ok I'll start the 1girl laughing machine up>>107026241translation please?
>>107026792thanks but tits too big. can we make em sexier too?
>WAI
>>107026584What other basic settings apart from that, so far it's spitting out barely discernible garbage
Are there any AI that can edit anime/video game characters as well as Gemini? I want to edit anime/video game characters to be naked but Gemini doesn't allow that
the man jumps off a cliff into a swimming pool.new MoE high lora by kijai is pretty smooth
>>107026866Qwen Edit 2509 with a nudity lora (dunno what is the one for anime, look up civitarchive)
https://huggingface.co/spacepxl/Wan2.1-VAE-upscale2x
>>107026792
>>107026815Oh dear oh dear. Busted.
>>107026527I agree, let's do grugs
davinci resolve is really cool for editing
lets roll, JC
why are local so humiliated? you guys spend at least $2,000, and ai companies don't even respect you. only ai and video games have such a shitty situation. in a $100 restaurant, I'm a prince lol
>>107027550also, have you ever thought about how local users are like dalit compared to brahmins like some of us? lol
saar...
Detective...
>>107026975
>>107027695
>>107027550>pay $100>here's your wholesome ai cat video sir
>>107027550
>>107027550We cum inside of you for free
reeeeeeeeeeeeeeeeeeeeeeeeeeei need more frames 81 is not enough reeeeeeee
>>107027202nice
ltx save us
>>107027476noice
ani save us
>>107028077ywn baw
HOW'S EVERYONE'S SONGBLOOMS GOIN ALONG
>>107028122AceStep 1.5 is about to be released and it's competitive to even the latest Suno, why would anyone bother with this garbage?
>>107026708his arm is interesting.
>>107028134any more info on this?
>>107028151They post updates on their discord server and some samples from the current training runVocals are already arguably better than Suno
With Illustrious, we need to increase float precision so models are 12.92GB instead of 6.46GBI think it would still be twice as fast as Flux
>>107028134>it's competitive to even the latest Suno,Got a sample to back that up?I still wait for what Qwen devs are working on.
>>107028443Especially given first ACE Step was not even competitive to early versions of Suno.
>>107028060thanks
>be me>100 percent nocturnalim always stuck here with the nips
>>107028258Post the sample then anon-kun. Also curious about multi lingual/Japanese performance. I need some convincing that it's not just good at Chinese rap again. I mean, not just vocals, but composition too. YuE was better at that. Also the SOTA is Udio.
>>107028406Why do that when there are better models with better VAEs and better prompting and better everything
>>107028483>here with the nipsits mostly brownoids tho unfortunately
>>107027476>>107028468Model? Really cool.
total saassies meltdown on /gif/ lmao
I am genning the best song ever created.It will be the best song.There will be epic poems to this song.This song will be engraved on the moon.
>>107025185Does anybody have a guide for how to use WAN i2v with the turbo Loras? I haven't been able to find any
>>107028852sure:set cfg to 1use turbo lora
>>107028859No I mean the light ones with high noise/low noise
>>107028864a guide for what? if you're using comply it comes with a template
i installeda bunch of stuff in comfy that i dont use like chatter box, when i start up comfy they have to load, am i taking a hit on my vram having these installed when i gen a wan video?
what even started the sharty vs 4chan kerfuffle?
>>107028881no, those are scripts you dummy
>>107028483The Japanese very rarely use foreign internet. 95+% Japanese flags you see on int and pol are (s)expats too.
>>107028709id visit more often if it wasnt turboslop: the thread
>>107028406why
>>107028709KekKnew it was inevitable that it would get censored so didn't even bother with grok video lol.
My gens suck. Do I need different models? I want to make anime pr0n.
>>107028676Chroma with a lora called "Retro_Celestial_Scifi" on civitai
>>107024541>Thanks for the tips man.cute anon>I know about the dot G but no clue what "zigger zoopedo website" refers to.>I guess I can lurk the party until I come across what you are referring to.No, dot G is the site I'm taking about. It's run by zoophiles that tolerate pedo, and at least a couple of the devs are from Russia. But that's different than dicker tech, but I'm pretty sure they use the same backend since at the very least they use the same captcha s0lver>>1070289059/10s in the uk
>>107028979Post workflow/catbox with metadata no idea what model this is, SD 1.5? Lol.
>>107028979Found this... https://stable-diffusion-art.com/beginners-guide/
what can a 6gb vram gpu do?
>>107029092I think so... sd-v1-5. I'm using EasyDiffusion so a lot of details are hidden behind the GUI. I'm reading https://stable-diffusion-art.com/beginners-guide now.
>>107029101keep you warm
>>107029101make images and play vidya
>>107029111>make imagesI don't see how
what can a 12gb vram gpu do?
>>107029102SD 1.5 is ancient garbage at this point. You are not going to make anything similar to what you see here with it.Read the rentry links in the OP.Not clicking that link, probably ChatGPT generated crap.Get on SDXL at least, Flux/Qwen if whatever GPU you have can handle them.
>>107029101>>107029116what can a 80gb vram gpu do?
>>107029116make images and play vidya at the same time
>>107029115put on your spectacles
Great general nsfw lora for wan WAN DR34ML4Y - All-In-One NSFWhttps://civitai.com/models/1811313
IT NEARS
>>107029122>Not clicking that link, probably ChatGPT generated crap.FWIW it doesn't read like GPT. It's informative and there's some humor to it.
>>107029184don't need an nsfw lora if you're genning SongBloom - totally uncensored.
>>107026347is MOE better than wan2.2_i2v_A14b_high_noise_lora_rank64_lightx2v_4step_1022 ?
ai is too powerful
>>107029236yes
>>107029215It is still a surface level of intro that is 2 years out of date.You should read the OP.
How do you get IL upscales to not just remove all the detail from the image? I've tried numerous detail loras and none of them really seem to solve the problem.
>>107029184Not this one but I got some nasty body horror from T2V NSFW loras. I2V ones seem to work better. At least result in funnier gens when they fuck up.
https://vocaroo.com/16tpZIis6HP5PERFECTIONERFECTION
>>107029313Noob can do low denoising without blurring everything when upscaling.You can also throw controlnet into the mix for further original image adherence.
>>107029326
>>107029324oh, is noob a [SONGBLOOM] finetune?
>>107026198I'm literally sick of telling you people, 4 steps is not enough, try 6 steps split of High and Low, 3 steps each. also eular/simple is not good enough imo. https://files.catbox.moe/2i6qkl.mp4This might not have been perfect, ensure high noise cfg is 3.00 and low noise is 1.7Realistically you're never gonna get a good video using 4 steps 2/2 and 1 cfg, cfg improves detail massively. I hate 1 cfg snake oil bullshit, yes it will take longer about 32 minutes sometimes less on 12 GB 3060. But I'm willing to wait that extra 20 minutes for something that looks better and follows prompt more. But honestly the best would be no lightx lora's with at least 50 steps at 1280 x 720 but it takes ages obviously, so it is best to strike a balance. The same settings in the workflow embedded in the video works with i2v perfectly, you just need to switch over to using the i2v models and include the wanimagetovideo node and link in your image and clipvision output. Its best to either keep proportion within 720 x 720 or just pad with black borders while keeping proportion and wan should normally fill in the black border.Use native nodes with --use-sage-attention --lowvram --reserve-vram 1.0 (1.0 = 1GB) That will do all the swapping behind the scene and prevent oom, using context window node will also prevent oom. Ignore every stupid shit in this thread, they are trolling or completely retarded and never actually tested this model properly, they just point you to some bloaty workflow using kjnodes using some cookie cutter fucktard settings on every jeet youtube channel with the same bullshit workflows. With style lora's just use the low models in both high and low, and if you don't want lora's effecting characters likeness so much then also consider using only the low versions. There are body slider and even and age slider lora for wan2.2 on civitai if it hasn't been deleted already...T2V is good when you know what you are doing, its very good.
what can a 16gb vram gpu do?
>>107029240>>107029348>4 steps is not enoughSkill issuehttps://files.catbox.moe/5i9niz.mp4
>>107029355wan, my 12GB can do wan with only 16GB system ram, i use ssd swap partition on arch btw. >>107029348>Use native nodes with --use-sage-attention --lowvram --reserve-vram 1.0 (1.0 = 1GB)those launch settings work --reserve-vram 0.3 can also work if not doing much o your machine while genning. 0.3 would be 300MB vram reserved for other applications, it just prevents oom when the system does not have enough that comfy thinks is available when its already taken up by youtube or something using vram. You know if someone would bother their ass like muh we could have an easy link that just fucking explains all this shit.
Someone inpaint and replace one of the debos with the songbloom schizo please.
>>107029368i've tested other peoples settings and its all fucking trash? Unless you think your machine is any for different like it has some magic ability to not make a 1 CFG image look like a washed out pair of jeans... Same shit with every other 1 cfg shit model in existence. I makes me laugh when you have retards using such models on a card that clearly can gen just fine at 30 steps. What a total brain drain. Oh its not following my prompts.. Then turn the fucking cfg up, oh it burns the image well durrr. With high noise you can get away with higher cfg for the motion when you use the right settings.
>>107029399>Unless you think your machine is any for different like it has some magic ability to not make a 1 CFG image look like a washed out pair of jeansI posted an example>>107029368
>>107029416and its blurry, this was 4 steps my settings. I'm taking a break from wan but I will improve on this, mainly looking to see if I can get more quality and speed. I'm running out of electricity, wan eats electricity very fast... So testing is hard as it takes a lot of trial and error and each test is using electricity >.<I'm thinking I could find much better settings with persistent tests. But I will never go back to using 1 cfg every again because I hate talking to the model like its retarded.Some janny is annoyed because suddenly the board keeps eating my posts. Aye go fuck your self you just a dumb retard.
>>107029437My gen loses a small amount of detail because the initial image is pretty high in detail already, especially compared to your image which already doesn't have much detail, doesn't have detailed human faces or human faces at all, and is already somewhat fucked, picrelAlso my entire video has a lot movement, fast movement, taking up the entirety of the image, you have some vague ass rubbing movement that takes 1/10th of the scene and doesn't require as much complex motionUse Q8, and what someone posted beforehttps://civitai.com/models/1818841/wan-22-workflow-t2v-i2v-t2i-kijai-wrapperNew HIGH:https://huggingface.co/Kijai/WanVideo_comfy/blob/main/LoRAs/Wan22_Lightx2v/Wan_2_2_I2V_A14B_HIGH_lightx2v_MoE_distill_lora_rank_64_bf16.safetensorsOld LOW:https://huggingface.co/Kijai/WanVideo_comfy/blob/main/LoRAs/Wan22-Lightning/old/Wan2.2-Lightning_I2V-A14B-4steps-lora_LOW_fp16.safetensors4 steps, cfg 1, unipc
>>107029437it do be like that mr stancil
>>107027550>>107028709
I havent been keeping up. Currently it seems like Qwen and Qwen image edit is absolutely the best model to use
>>107029476It sho do
>>107029476Qwen edit is the best model for inpainting/edits, yes. definitely not for anime/nsfw porn/etc. every model has their pros and cons. to say one model is 'the absolute best' at everything is fucking stupid.fuck off
>>107025278the jannies are half the people doing it. i asked a question about a model in one thread, got called a faggot, told the guy to fuck off and got a three day ban right away. could probably dig up the screenshots.it's the same on every board. all the jannies and mods are discord trannies and cods and are primarily responsible for all the shitty content, spamming and troll threads. hiro doesn't fucking care so nothing changes.
>>107029471But... I look so cute in it.
>>107029522probably because you format your posts like a huge faggot
>>107029011I'm so glad you made this :)
>>107029520I prefer SDXL for inpainting. Faster and more versatile
I prefer SaaS for inpainting. 4k support with nano banana and seedream 4
>>107029574does it inpaint audio?
>>107029594gtfo then
>>107029394https://vocaroo.com/1gZIMGIUeV5X
>>107029457I tried you settings the other day, I disagree with only 4 steps. You should use 8 or 6 a the least other wise everything should be ok if using higher resolution. Still looks washed out though and body horror happens more... No amount of cope is every going to change that.
>>107029534separating paragraphs is gay? fuck off nigger. you weren't even alive when the reddit spacing meme was coined.
>>107029631???did you forget the vocaroo or smth cuz it's not showing on my terminal.
>>107029457uni pc isn't even optimal, eular/simple at 50 steps is as specified by the people that created the model ffs. Our settings are vain attempts to get some quality in much less time. It don't matter though because all testing is good and I don't even think we've really found the holy grail yet.
>>107029644i didnt realize one or two sentences = a paragraph for todays youth
>>107029673you don't even know what a fucking paragraph is because you are today's youth. fuck off nigger. go aschually somewhere else.
>>107029680are you going to continue to be a fag retard or will you take a breath
i did not expect there to be this much variation.>prompt: a smiling woman stands in a form fitting maxi dress, reddish brown hair, scottish, in a brightly lit room
you should do some nsfw gens at 1280 x 720 with out speed up lora's and 50 steps you will see what I mean providing your prompt is good and not half assed effort. Wan likes basic who what where style prompts but it also likes the extra details about characters and background, the more the better but the model does get confused sometimes when mentioning characters epecially with lora's that might trigger a specific sense because you mentioned she has dark hair or something then you might get an extra dude walk in and completely ruin and otherwise great scene. But that is the problem when genning at such a resolution for 50 steps you will not know until about 50 minutes passed then have to wait through the humiliation ritual of it not stopping right away for another 15 - 20 minutes before the damn thing stops. Scenes can be way more complex when you remove the lightx lora's and start using higher cfg at the expense of time to gen and its a huge amount of time on something like a RTX 3060 So i am hesitant to change to some rando's settings when it does not give me what I want in terms of colour and overall style and composition.
>>107029692i'm sorryyou seem to bequite madlet me guessyou're a trannitorand you're trying to work yourself upto ban me whiletrying to convince yourselfthat you're justified, after you got told exactly who you aredon't worry you miserable little homosexual. 50% means your pain won't last for too much longer.
>>107029709i see you chose the former option kek
>>107029534Structures sentences are often better than paragraphs and annoy fewer people faggot. I'm not the best at typing but I could talk in ways that would make you feel completely inept in just about everything because I'm not a zoomer kid.My real issue is my mental state and drink to cope, but that has no baring on actual intelligence. I have lived, you barely got out of your daddies nut sack and think you know shit.
>>107029715yeah, looking like you act like the thread police because you are the thread police, just like i said. and you are one of those guys fucking up the site so you can have your little hormone therapy power trip on.you niggers really do need to go back to r*ddit.
I challenge zoomers to actually pick up a paper and just read and when ever you find a word that is triggering because you've never read it before google its meaning. then finish the paper right to end, you little retards have no patience what so ever and that is the crux of the problem. If you truly want to understand then that is what you have to do.
you can't just fix retarded with a magic wand, no AI isn't going to make you smarter it will make you dumber! Sometimes you really just have to sit with it till you understand it, that goes for most things tech related.
>>107029761and if that means you have to lose a lot of skills in other areas such as social skills then you have to ask your self...if you put in the effort you will be rewarded and if not you will be a failure.
>>107029534Nice job upsetting the village idiot, retard. Now he's going to be crying and pissing himself for hours.
>>107029786>HoloCine16 seconds is hype. I'm staying optimistic until I run this myselfI'm already bored of video without audio now though
Gen millennial is all over on this shit because this is we feel our last chance for a world we were promised. So we take it to the extreme seeking for the best results using these models. All I want is something that can take prompt per 5 second clip and build scenes just like editing a movie with sound and speech. Is that much to ask?Then we will get right into producing the next entertainment experiences, that is what we want. The nsfw stuff is a means to an end, it provides us with the drive to continue. Each gen is fappable, each gen gets better and more complex. Unironically it will be the gooners that spawn the next thing with their persistence and so any model that goes out of its way to self censor will die and be forgotten. I think this assumption will be accurate, if a new people does not allow for freedom it will die. It will determine whether you have washed out yellow tinted synthetic slop or glorious full featured movies.
>>107029861he didn't say when it will be released?
>>107029861By the time you're able to make your full AI-generated movies, everyone else will be able to make their full-length feature AI pornography. They will not be watching your movie.
the perfect model will deliver on its word the full range of capabilities free and open source. The rest will be history as that model will define all others that come after it. We know this, they know this but who will take the rap for it? It will be chinese most certainly and they will build new gpu that really challenge nvidia's position.
>>107029874I just ran it through qwen edit no loras because you had me curious.Prompt:adjust the color of the image to a realistic photo
>>107029875It’s also a western model, and western models are better quality than chinese slop. It’s just we rarely get weights without bullshit attached
>>107029874true anon but then we will be living in and entirely different world. I was an 80's kid and i watched this world change so fast in almost the same way with the internet and faster and faster CPU's and GPU'sit has been interesting.btw i don't care if no one watches my movies, i'd just be happy with what i can do hopefully in one evening and sit and enjoy watching it.
>>107029889Yes, with the clothes remover lora.Go back a few threads for a link.Hopefully it still works.
>>107029313>>107029324The "trick" is latent upscale with controlnet before the high 0.7 denoise second pass. It's more difficult than using an image upscaler, and some anon get filtered by this, but it's well worth it.
>>107029776>n-n-no, you're crying!this place if fucking hilarious, not gonna lie. even if it's unintentional.
>>107029892the homer one isnt ai its just an old thing someone by the name of pixeloo made, they called it untoons
>>107029894I tried qwen image edit but the workflow says it needs more than 16gb vram and indeed it did not work
>>107029891what are you talking about anon? i have that and much more at my disposal but i'm bored of nsfw now, I've coomed my fucking brains out so much.Now I want actual content and not just the same positions in porn... I crave matrix movie remakes and new movies all from prompts, this i know will be possible within maybe 12 months.
>>107029900Of course it is, they upgraded to SaaS for Wan2.5 which is why they were able to compete
>>107029894To be fair look at how he's sperging out. Granted it's about something else but anon was correct.>>107029861>>107029875>>107029889Now he's getting fucked with by the anon who spent the last two threads reposting replies from older threads which is pretty funny.
>>107029908That example looks untrustworthy. Qwen-E is good at preserving text style and combining images but a restoration like that seems out of its reach.
>>107029889I think as the means to create this stuff gets more powerful, and content becomes easier to create, the pressure will be toward wasting as little of other people's time as possible. In fact that's largely already happened, as longer formats are being displaced by the tiktok, the short, whatever Instagram's thing is, etc.I think that anything beyond 30 seconds is not likely to matter because it's going to be very hard to ask that much of anyone's attention for anything made with AI. "Full length movies" will be a purely personal indulgence, pornography for an audience of one. Publicly, AI will be used mostly just to make memes. But let's also not forget what it means if people get the power to make AI movies. It means you may someday be forced, out of politeness, to watch a "movie" your friend (or your friend's kid) "made" with AI.
>>107029897qwen is tricky, the old edit model is taking more vram that is all i know. I got it working using --reserve-vram 1.0 on my card of 12GB and using swapfile. reserve-vram 0.3 did not work it was not enough, so tweak it slightly till it works, don't do too much though or it will be slower. Q4 full old edit model, new edit model and text to image worked with settings i said. Consider more steps for quality. Its a damn slow model but it is a good one, it will do as you command and with lora's its amazing. You have to disable sage attention though otherwise you just get a black image. I think it requires some compiling to use sage attention, not sure which module but lets be clear it risks breaking shit so don't bother.
>>107029931I went back to the original workflow and hooked up one single thing and it just works..I shouldn't be doing these things after waking up and desperately needing to take a shit.
>>107029935Compared to just using first frame. Stuff actually happens and the latent upscale is working.
I've tested a metric shitton of different i2v settings for the past month on 3090, both with speed loras and up to 60 total steps without, on different ksamplers, samplers, schedulers, with combos that always reached the 0.9 sigma boundary.While high steps for the high pass always led to better motion, the difference was minimal enough compared to the speed loras that I would only use the high steps version if I was trying to create a once-a-week godlike gen. The speed loras already do an amazing job in my opinion. The best combo I use now 99.9% of the time with speed loras is clownshark ksampler with bongmath turned on, 8 steps total, 4 for each pass, with euler / beta. None of the other samplers had as good visual results in this step range for me. There was no noticeable difference between simple / beta / beta57 / bong_tangent with speed loras, although beta should technically yield the best results as it allocates the most steps out of the 4 of them to the high pass, which is the most important. Just wanted to report my findings, not saying these are the absolute best settings for everyone, but they are for me.>>107029698By the way, since you like running high steps on 3060, maybe you know this already, but you don't have to run both passes.Do just the high pass and check the motion and prompt adherence. If it looks good, then run the low pass. If not, you save half the gen time.And try running the low pass with just euler if you're using res_2s or some other exponential sampler for the high pass. It'll save you time and can fill in the details just fine at that number of steps.
>>107029939Can't tell if trolling or legitimately braindead.
boring
>>107029950>gen 10 minute vidoe>prompt completely fails 6 minutes in5 seconds will always be superior
post a pic if you're not a bot
>>107029955qwen image is boring and has terrible seed rng variety.
>>107029939>clownshark ksampler with bongmath turned on, 8 steps total, 4 for each pass,I've tried something like that but not really getting what i wanted, also i read some things about that method that i did not like. Like its always trying to solve, a bit like ancestral eular? I read that somewhere and i was like well nope, and i stopped caring.
>>107029963i'll try, but share places are getting retarded...
>>107029939>euler / betaburns too much imo, starts to make things look like plastic.
>>107029971Shit. Well I got it to not error by doing pic related. But the genned result is just a static image.
>Keeps models loaded onto vram even after closing>Logs prompts and send them for """telemetry"'" purposes>Will soon be closed sourceTell me again why Comfy is good?
>>107029958thats why i mask it with sdxl
>>107029975Oh, I was talking only about images. No idea for videos.
Is Qwen image edit the best current solution for anime to photo-real? It's very good at keeping the character extremely consistent but it doesn't push the realism 'enough', and qwen is still kind of shit with nsfw. Wonder if any of you guys have this use-case with some success.I used to do it with illustrious, but illustrious usually changes hair/clothes too much and it's shit at hands/eyes/small details like that
>>107029981finally... untooned if it was good
Take us to the next one, anon.
>>107029939>By the way, since you like running high steps on 3060, maybe you know this already, but you don't have to run both passes.>Do just the high pass and check the motion and prompt adherence. If it looks good, then run the low pass. If not, you save half the gen time.yeah i know what you mean anon. I sometimes do that using different seed and prompt. wan 2.2 is so versatile like that :)Might not always work but can turn cowgirl position that failed in high into blowjob with not much effort in low all is not lost ;D
>>107029983Local caught up around Flux. That's when its LoRAs were really up there.For realism, MJ is currently not that good. Pic rel are four MJ gens made not too long ago. SDXL tier crap (though you could argue it's better than SDXL all you want, it's still not Flux tier).
>>107029987Post a chroma guitar with 6 strings and 6 pegs
>>107029988highest param open image model ever released
>>107029991You can use the analog lora and pretend its chroma to trick anons into thinking chroma is actually good
>>107029992Even if the loss is minimal, the logical approach is to minimize it as much as possible as in not translating at all. It is admittedly less now with models like Flux, Lumina, and other modern arch compared to the shit that is XL's. But still, it's there.For past models, it was most apparent in the colors and high noise details. Even with a suped up external VAE.
>>107029999Use case for latent upscaling?(I jumped into this conversation just to help jog your memory i have no idea whats going on im running on 2% brainpower but i wanna see were this goes)i ran some upscales with latent bicubic antialiased and it looks really good
>>107030000I think often the problem lies in ones cnet settings and prompt. It's a bitch to dial in (especially with some models) but once one does, it's like magic.
>The spammer is still not permabannedI guess no mods moderate /g/ or /ldg/ becaude who wants to work for free huh?
>>107030002>pixel space has always fucked the outputshasn't been the case in my experience. if anything latent upscale often introduced more artifacts for me.
>>107030006im using comfy right now. resizing the whole image beforehand works, however it seems to mess with bbox detection.
>>107030008>I wish Comfy had that aliased latent upscale that whatever Forge fork has.this?
>>107030010Hunyuan 3. Yeah, about those benchmark rankings...
>>107029988yes low does have that much power, which i think most anon forget. Its tricky to get lora's good. I tend to stick to low lora first for both high and low just to get the overall composition and style then i will use high if i don't got the motion but much lower like 0.3 at the start. low noise is the key, high noise lora will change the character too much i feel and like qwen they are very strong.
>>107030011You're relying on intuition. Empirically it's better to upscale the raw image, not the latents. Yes it requires one extra pass through the vae, but that isn't as lossy as you think
>>107030016>needing to translate to and from pixel space isn't lossynta but the absolute best upscaling workflow imo would be training DAT but exclusively on VAE degradation. traditional latent upscaling methods aren't great because they're, like the other anon said, using dumb algos like bicubic, nearest, etc. with the very low resolution of the actual latents, this often hurts details more than it helps. the true endgame would be using a model similar to DAT but in latent space, but this would require a much much more powerful arch due to the very low resolution of the latents.
>>107030018Are you using comfy or some variation of forge/webui/etc? What you described is the inpaint behavior in webui if you have "masked area only" set. It scales the area to whatever resolution your have appreciated and you can specify a padding to bring in more context around the inpainted region
>>107030021For one, needing to translate to and from pixel space isn't lossy... so just based on that it's better. Also the use of cnets allows the user more control over how the second pass holds to or departs from the original image. Subjectively, any denoise lower than 0.4 is pointless anyway.>DATDesu the best out of the bunch but still not as good as latent when doing comparisons.
its a bot isn't it? Either that or something just had to much awesome tonight and is now relying to me.
>>107030006It's interesting that he's replying to himself now. I wonder what upset him this time (again).
>>107030026latent upscale is just using traditional dumb algos to increase the size before you move into the next KSampler, it's not superior in any way to using a purpose trained ESRGAN / DAT / etc model to do the exact same thing, really it's worse by all accounts. I frankly don't understand what you mean.
>>107030027> top of any benchmark chartlike hynuan 3.0?
>>107030028SDXL came out in late June 2023, Flux came out in August 2024.
>>107030029There are no valid charts for image models
>>107030032It has, MidJourney isn't even close to the top of any benchmark chart that exists anywhere
Where to discuss image generation with real people and not this bot?
>>107030035You can thank T5 for that. The thing wants extremely literal prompts or else it'll misinterpret it.
>>107030035He'll tire himself out eventually
>>107030040>Could depend on your artist tags though possiblyWithout a doubt, now that I think about it. Still I would love cnets so I can at least gen at a lower res in order to choose which I throw through a second pass. Like other high(er than XL) res models, it's a cool feature but I much prefer "highres fix"ing instead.
>>107030042IDK about Neta Lumina 1.0 but Yume is supposed to have been mult-res trained at between 768 and 1536.
>>107030043not really, I gen at 1536x1536 with it all the time. Even higher every now and then. Could depend on your artist tags though possibly
>>107029892Yeah, I get that much, but it seems like IL just refuses to make higher detail images at higher resolutions, unless you use specific artstyle loras that probably have specific training. Using generic detail loras on the 0.7 controlnet pass doesn't seem to help, everything still looks very simple.
>>107030046Well yeah, clearly it’s not trained above that resolution. Happens with SD1.5 above 768 and SDXL above 1200
>>107030049yeah the realism that must be from base Lumina isn't that degraded at all, you can bring it back pretty easily with boomer prompts
>>107030018>absolute best upscaling workflowListen to me very carefully. SD ultimate upscale, disable the tiling or make tiles as big as the image, so 1024 x 1024 at 2x upscale, set tile size to 2048 x 2048, send image from decode from first or second stage sampler into a control net using SDXL union controlnet. Set controlnet strength to about 80 - 90& so yeah 0.8 - 0.9 or however you want, less means more change fro original. Set sd upscale to 1.00 denoise 100% yes? then watch the magic.for pro put face detailer nodes for face, hands and person before that also using controlnet if you need real consistency.you won't get no better comfyui upscale than that.
>>107030053>that's not gonna happen lmao, it would take an enormously huge amount of degradation given the text encoder itself is far superior to CLIPtbqh i think it's saying something considering the model still retains a lot of its original knowledge even after extensive training on anime
>>107030055oh yes I fucking love grainy analog y2k look, slop me more bra
huh
there are no normies here, go use your crap bot on facebook they will be much easier.
New>>107030058>>107030058>>107030058>>107030058>>107030058New