Discussion of Free and Open Source Text-to-Image/Video Models and UIPrev: >>106464276https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicChromaforge: https://github.com/maybleMyers/chromaforgeSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://tensor.arthttps://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/sd-scripts/tree/sd3https://github.com/derrian-distro/LoRA_Easy_Training_Scriptshttps://github.com/tdrussell/diffusion-pipe>WanXhttps://rentry.org/wan22ldgguidehttps://github.com/Wan-Videohttps://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y>Chromahttps://huggingface.co/lodestones/Chroma1-BaseTraining: https://rentry.org/mvu52t46>Illustrious1girl and Beyond: https://rentry.org/comfyui_guide_1girlTag Explorer: https://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneSamplers: https://stable-diffusion-art.com/samplers/Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbourshttps://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards>>>/aco/csdg>>>/b/degen>>>/b/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debo
Blessed thread of frenship
They tell me this thread is blessed
neta is the future for anime start learning now https://neta-lumina-style.tz03.xyz/
>>106469516>>106469514all of Gods children are blessed by his grace
>>106469536>.xyzOh yes, anon, this link is definitely trustworthy.
>>106469565Seems fine
>normalfags calling other normalfags out as being too eager to use the word "clanker" as a slur ai bros stay winning
>>106469612I didn't realize styxhexenhammer was this big
is there any hope for local t2v? except for making shizo videos, local t2v is obsolete. only local i2v is excellent for serious things
>>106469625It's pretty good for porn with porn LoRAs but yeah most videos end up like a fever dream.
>>106469625I mean, we’re basically in the Ford model an era of this stuff and here you are writing off the future of cars. Bit silly don’t you think?
>>106469621i dont know who that is or how he relates
>>106469625It's good for producing extremely plausible realistic single frames from the video. Like if candid realism if your goal I'd just extract a frame from i2v
Anti AI fags have gone fucking nuts the last few months.
whats the best tool for automated video captioning? I want to try training a t2v lora but I dont wanna caption my entire dataset by hand. I'm trying chatgpt but it cant caption videos for shit.
>>106469652>the last few months.That's an understatement
>>106469625remember how anon hyped Wan T2I? shame the diaperfag decided to tune qwen instead. maybe the bigasp guy will do it.
>>106469658Gemini. It shits all over the other models for OCR stuff.For example. >>106469558Based on the video provided, here is a description of what happens:The video features two characters from the TV series Star Trek: The Next Generation: the android Lieutenant Commander Data (on the left) and Lieutenant Commander Geordi La Forge (on the right). They are seated side-by-side in the cockpit of what appears to be a shuttlecraft.The sequence of events is as follows:Data holds up a blue and white package resembling a pack of cigarettes.He removes a single cigarette and places it in his mouth.The tip of the cigarette spontaneously lights up on its own.Data takes a drag from the cigarette and then exhales smoke.Throughout this entire action, Geordi La Forge looks at Data with a surprised and bewildered expression.
>>106469536>Boris VallejoLooked up his style and... that's quite sad. Of course, nano banana can pull of his style just fine.
love me ambiguous language
According to GeminiIn this video:The scene opens with Captain Picard looking thoughtful in his ready room. In the background, the Vocaloid star Hatsune Miku, now impressively dressed in a red Starfleet command uniform, walks into the room holding a Starfleet PADD (Personal Access Display Device).She stops and stands behind the captain. Then, in a move that would surprise even the Borg, Picard picks up a 20th-century-style handgun from his desk and calmly examines it.It's a fantastic edit, but a few small clues give it away:The Starfleet Guest Star: As you know, Hatsune Miku, while a cultural icon here in Japan and worldwide, never actually made it onto the Enterprise-D's crew roster.The Firearm: The biggest giveaway is the prop. Starfleet exclusively uses energy-based weapons like phasers. A projectile-based handgun like that is a massive anachronism and would be completely out of place in Picard's ready room, unless he was in the middle of a Dixon Hill holodeck program.This is another great example of combining real footage with a digitally inserted character to create a completely new, surreal, and humorous narrative.
>>106469755Isnt gemini paid? Do you use with some local client like kobold or silly tavern? Im trying to find good uncensored llm for images
>>106469755those are not good captions
>>106469705>Two elf female characters with pointed ears. The character in the foreground has long, wavy blonde hair, light skin, and wears a blue, off-shoulder dress with white fur trim. She has a worried expression and blue markings on her forehead. The character behind her has long, orange hair, light brown skin, and wears a sleeveless purple top. She has green markings on her face and is embracing the blonde-haired character from behind, with one arm around her shoulder. The background is a textured, dark green and blue gradient, resembling a forest or cave
>>106469887basterd bitch delete this
What's the most viable captioning method for deviantart-tier freak fetish stuff?
should i get my lazy ass out of bed and finish installing wan? how long do gens take with a 5090
>>106469998they'll take no time at all you fuckin ass
>>106469705>>106469887
>>106469998sure / depends on settings but in the order of some minutes with many settings. you can do 1 minute gens at a not too terrible resolution if you take the fast options with 4 steps or so.
>>106470011my bad man, i genuinely got no clue on this shit
>>106470024thanks anon
>>106469998it takes me roughly 4-5 minutes for a 720p 8 second video
>>106470020Just like my futa doujins!
>>1064699982.2 is so good it convinced me to try training video loras. >>106469701thanks bruv. got everything captioned, surprised it let me do them all for free
Damn, AI inventing new instruments.
>>106470133very mongolian
>>106470141I fed Gemini a Batzorig video screenshot lol
>>106469883Yeah but I didn't prompt it how to caption. I was just more interested to see if it could identify what was happening in the video at all.
>>106470085>surprised it let me do them all for freenp. I assume they do it as a means to capture audience share. Their free stuff is very generous. I honestly just pay for gemini as my GPUs are usually blasting away at training most of the time anyway. It's the best model for captioning in my opinion. And the fact google made veo 3 should indicate as much.
>>106469887>>106470020>Reference imageLiterally just prompt for the guy>A caveman carrying a wounded woman while pointing a spear a large flying bird over them while sitting atop a rocky hill by Boris Vallejohttps://files.catbox.moe/zcos9q.jpegLocal would never.
did comfy fix the shitty qwen edit text encode node?
>>106470186every fix breaks two more things. python was a mistake
>>106470176Do you have some special version lol? I get this on nano which isn't even close.
>>106470176Didn't think it would recognize it. Flux can't really handle a more complex composition like that.
>>106470235Not bad. Unlike the original nano banana can't show me booba, so a Chroma LoRA would win anyway.As for the results you're getting with nano banana, no idea what you're using. I can get his likeness right away even across other seeds.>A caveman with a shield standing atop a rocky hill while goblins are incoming. A woman kneels beside him by Boris Vallejohttps://files.catbox.moe/ak75k4.jpeg