Discussion of Free and Open Source Text-to-Image/Video ModelsPrev: >>107006468https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/sd-scripts/tree/sd3https://github.com/derrian-distro/LoRA_Easy_Training_Scriptshttps://github.com/tdrussell/diffusion-pipe>WanXhttps://comfyanonymous.github.io/ComfyUI_examples/wan22/https://github.com/Wan-Video>Neta Luminahttps://civitai.com/models/1790792?modelVersionId=2298660https://gumgum10.github.io/gumgum.github.io/https://neta-lumina-style.tz03.xyz/https://huggingface.co/neta-art/Neta-Lumina>Chromahttps://huggingface.co/lodestones/Chroma1-BaseTraining: https://rentry.org/mvu52t46>Illustrious1girl and Beyond: https://rentry.org/comfyui_guide_1girlTag Explorer: https://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/b/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debo
>>107010364>4 pics of the same girlkys faggot
can someone make a better model than chroma already? I'm tired of tard wrangling this shit
Didn't manage to get titties on ltx 2 site with t2v, let's hope is just in site censorship and not the model itself, it would be such a shame because the model feels and looks so fucking great.
give me a new style and girl to prompt
Output from the LongCat Video demo script first stage. 1:04 long! It took about 3 hours. The compression to attach it here is killing it, so here's the original file: https://files.catbox.moe/2gjaqx.mp4Note that this is only the first pass out of 3 (the longest though). Unfortunately, the inference crashed shortly after beginning the second stage due to a bug in the provided code. I fixed it and and will try another test.
>>107010393surestyle_cluster_33iwakura lain
>>107010404cool. how does longcat work? do you chain multiple prompts or you gotta do all in 1 prompt?
>>107010331not so fast anon see here>>107010391and video is latest test, workflow here https://files.catbox.moe/2i6qkl.mp4
>>107010393Gen a scene girl
>>107010393gen a photoreal obese black woman
>>107010408The example is a single prompt for everything, but I'm not sure if it will do well with a sequence of different events. It looks like it should be possible to prompt each segment differently, but it might get weird when it fuses them together.
>>107010410do note its 81 frames but at 32 fps because i think that is the base fps for wan2.2 however when you use most peoples lora's they will be at 16fps so adjust accordingly. Also can bypass those context windows, they are useful for when you wish to do longer videos. For longer videos just increase the total frames and the context window can be adjusted to more frames, think of them like chunks. It still does all frames at once but it won't oom which is an added bonus meaning you can do 1280 x 720 full resolution without much issues.
>>107010364I wish there was more feet sloppa...
>>107010441Describe what you want
without slop twerk lora
>>107010404Could you do a quick short clip nsfw test?
>>107010440Context Window is actually a frame batch node?!
>>107010514kind of, yeah... Its actually meant for doing longer videos but it stop's oom's and it works very good. Its really underrated or just over looked desu because its a new beta node guess.
>>107010536I have had it hooked up before but stopped using it for some reason.This totally coincides with me starting to oom more often.
>>107010510Yeah I second this.>>107010404have it do drop down and zoom up a ladies skirt, i want to see the brappers it can gen (0.0)
Does Chroma like simple sentences or long paragraphs?
I have a feeling we are gonna be eating so good in the new year or Christmas coming.
how do I make comfyui run a long job in the background while I create new jobs in the foreground? as in, one long job and one short job both running at the same time
>>107010479never got into wan and alike.so this does not work well on faces?it is clearly different when it gets animated
>>107010584I think it increases gen times, i've noticed when i forgot to disable them, i had only genned 81 frames but i had the context on and set to 41 frame context size. And yeah they gen i'm doing is taking way too long, its been stuck at 33% for a while now... and it just jumped to 67% thank fuck, i thought it was having a fit. but yeah its effecting the gen time a lot, so only use the context window when you need it.
>>107010626how bad is your autism? they look pretty close
>>107010653>closenot at all if you wish to supplement your low count dataset for lora training - with precision
>>107010406>>107010416>>107010418any other ideas
>>107010696Post your gen and I'll decide
a n o n s how's it going?>>107010615sentences, yeah. long paragraphs, nope.
https://civitai.com/models/2073885?modelVersionId=2346721OYYYYYYYYYYYYYY
>>107010719My fetish
>>107010719oy vey shut it down!
whys qwen edit create these scan line looking bands across the output?
>>107010794GPU failure
>>107010805don't be mean anon,
>>107010833>generic white girl #4634566356yawn
>>107010841>whiteoy vey...
So far it's impossible to get anything animu-looking out of Longcat T2I. It always does some kind of 3DCG thing when I tell it "anime" or "animation" in the prompt.
>>107010866does it have i2v?
>>107010866What is your prompt?
>>107010872Yes, guessing it will work better based on how Wan responds. Going to try after a couple more T2I attempts.This deviantart looking thing is the "best" I've gotten so far.
>>107010898normal wan has the tendency to CGI my anime stills, I guess they trained it mostly on 3dcgi and realism
>>107010881Some variation of this, swapping out terms to try to push it:>prompt = "high-contrast anime of a female space marine wearing a form-fitting robotic exoskeleton. she has a black bob haircut and narrow blue glasses studded with blinking LEDs. her armored suit is gray and blue, with signs of wear and battle damage. over the suit, she wears a long, ragged khaki-colored scarf. she is in a desert with sparse rocky outcroppings and a brown haze in the air. she climps to the top of a jutting pile of rocks, and looks into the distance. far away in the distance along the horizon, a city of domed buildings is visible, clustered together. at the center of the buildings is the cable of a space elevator, a single vertical line rising straight up into the sky. the lighting is harsh, tinted by the atmospheric haze.">negative_prompt = "Bright tones, overexposed, static, blurred details, subtitles, works, paintings, images, static, overall gray, worst quality, low quality, JPEG compression residue, ugly, incomplete, extra fingers, poorly drawn hands, poorly drawn faces, deformed, disfigured, misshapen limbs, fused fingers, still picture, messy background, three legs, many people in the background, walking backwards, cg, 3d, photograph"Also, the 1st stage low-detail pass has generally looked better than the last pass for this prompt, ignoring that it's the wrong medium.
How do i generate slop?
>>107008788> KJ was wrong to assume it needed no codes changes, we gonna need a new node.It need changes for native nodes, his implementation is correct and works, there are example gens in that github thread.
>use the latent upscale method>it can't do anything other than a 1.5x upscaleGod, open source sucks ass.
>>107010364Top right and bottom right are very much my shit.Hnnnng
>>107010933>retard doesnt know how models are trained
>>107010933literately upscale dummy
>>107010933Something else must be fucked up
>>107009152>>107009129wan?
Any reliable way to do POV deep penetration with wan i2v? GF been complaining my OC dick insertion shenanigans aren't genning deep thrusts.I looked for some wan loras to address this but couldn't find any. There are SDXL loras for this, I may try some first-last frame workaround.
>>107010940>>107010950>>107010963It's because of the uneven pixels it's pissy about. The upscale by x node was supposed to fix that, upscaling with manual numbers is impossible to match so you don't get the error.It's literally dogshit coding.
>>107011018> There are SDXL loras for this, I may try some first-last frame workaround.Yes, or vace with deep/shallow frames.
>>107011022Can you illustrate what you mean with examples? I haven't run into this.
>>107011053I didn't get around playing with vace yet. By the sounds of it seems to have a lot of potential to do wild v2v stuff.
>>107011073But it runs 1.5x slower.
>>107011060All this extra shit just for latent upscale and you're stuck with 1.5x, nothing else works or you get the mismatch error.
I love the InternetIt's lovely having the ability read the thoughts of the brightest minds in history
>>107011000Pony v7
Longcat can cook anime-style noodles! Sort of
Got wicked drunk and passed out last night, woke up just now with a throbbing headache. Last I was at my computer I was genning titty elves with Chroma Flash 12 steps DEIS, 30s per gen, so if that was 10 hours ago then there should be 1200 new titty elf pictures on my hard drive now
>>107011094>int to float>float multiply>flot to intholy fuck have these retarded node devs ever heard of duck typing (prominet feature of python)???? holy fucking shit
>>107011146Proof
>>107011106Making a model gravitate toward a specific color was always a thing. It seemed like leaning toward yellow was the "default" in most models but Sora goes too far for some reason.https://github.com/hako-mikan/sd-webui-supermerger?tab=readme-ov-file#adjust
God damn, 60s.>>107011149Open source, bro.
>>107011146post your top 5 picks
>>107011146>there should be 1200 new titty elf pictures on my hard drive nowI kneel
>>107011163I'm not at my PC I'm lying in bed feeling like complete dogshit
>>107011161>just make it yourselfI don't need the nodes, my workflows are pretty basic, but unlike these hacks I'm a competent programmer, so when I see garbage nodes such as the ones you posted, I can't help but criticize.This prompted me to check why there are no basic math operation nodes planned in native comfy and VOILA:https://github.com/comfyanonymous/ComfyUI/pull/8024The only other competent person in this discussion says the same thing I'm saying.
>>107011149nigger retard you can't connect int and float just like that
>>107011212another retard that doesnt understand how python works.you can have outputs be dynamically typed (or having a dropdown for casting to your desired type), but everything is mathable together in python, makes absolutely no sense to have 09593421906234 nodes all to do casting and implementing math operations for each fucking type, there's absolutely no fucking need retard. Go learn python fucking monkey retard, kys
anis will saveis
>>107011230no one will save you
why can't chroma tell the difference between an anus and a vagina
>>107011272it's a failbake
goth cart
>>107011106I think it's great the average normie is so clueless, so the anti-AI mob calms down a bit while open source flourishes. Though Qwen definitely has a bit of yellow tint (Alibaba trains on synthetic data unfortunately).
>>107011223lo>>107011247ra?
>>107011220you are fucking degenerate learn how comfyui works and treats workflows in particular first mr i know how's better
I mean there already are nodes where you can do all the math and conversions you need (and more)
>>107011272because it generalized to understand that the only purpoes for a digital anus is for it to be fucked, equating ti to a vagina
>>107011317the Any Switch nodes in rgthree already support dynamically typed inputs and outputs, you're a fucking retard. You can dynamically type the output depending on the input you're connecting it to, likewise for inputs.kys nocoder retard
>>107011499> you are fucking degenerate learn how comfyui works and treats workflows in particular first mr i know how's better
>>107011531>>107011338retard
For video, going with a less elongated aspect ratio is good because you can push the resolution more. So for a square, you can get away with 1080p.
>>107011541>>107011538
>>107011546>>107011538>>107011541>>107011531gay
vidgen was a mistake
There's no "best practices" in comfyui, you just have to carve out your own little path through the mountain of feces to get to where you need to go
can local video gen add audio and lip sync like grok? I've been having alot of fun with grok but the daily limit is annoying.
ok I got the money ($3k). which gpu should I buy?
Could Qwen edit be use to change the style of something like make anime real or 3D render realer looking?
>>107011558no. local is very far behind in all fields currently.
>>107011561Nvidia H100
Longcat T2I has produced something anime-like!
>>107011561Preorder 4 Huawei Styx 1k40s
Where is criticanon? Did he go back to Blazblue? Did anyone make an OC for him?
>>107011599>chinese shitbe serious
>>107011602>96GB vram vs MAYBE 32GB if you're luckyI am serious
>>107011601<7288clear your output
>>107008372I want to train a super resolution model.I have a collection of images, with the same style and colours, but some are low resolution because I don't have the original images.My idea is to recreate the high resolution images by training a model. The idea is to tell a model: hey, this is a low resolution image and this is the original high resolution image, and the idea is to guide the model with that information so then I give it the low resolution image and it gives me a high resolution image. I know I will never get exactly the original, but because all the images share the same color palettes it should be more accurate than using an untrained model.
>>107011611I know I need to, but I'm lazy... What you got nigga?
>>107011617And what are you gonna do different from the 6 gorrillian upscale models that already exist
>>107011626Give it a cool demo.
>>107011296https://civitai.com/models/1942216/chroma-the-creepy-the-unsettling-and-the-ugly
>>107011626>Asks for X>But why do you want X bruh???Read my comment again, please>>107011617I think it was called GAN? I don't remember the exact AI term.I basically want to train the model with pairings, but I don't know what model to train or how, please someone help me.
>>107011617You'll need thousands h100 hours and millions images for that.
>>107011621what a fucking shitty gen, why upscale this garbage with broken hands? kys
wait what? I thought it was a Wan finetune but it's its own model??https://huggingface.co/meituan-longcat/LongCat-Video
>>107011589looks inferior to wan 2.2 unfortunately
>>107011671Because it was a joke to make fun of a guy in a thread in /v/ so I didn't much time on it. I honestky forgot I posted it here.
>>107010364But of a newb question right off the bat, ignore if you want: Can you have a local language model somehow?
>>107011698Jesus I typed so fast I seem like a "saar".
>>107011709go to /lmg/
>>107011558currently we are waiting for ltx 2 to release weights in november but that's allhttps://ltx.video/blog/introducing-ltx-2
Is my GPU dying? Often when I start my computer or wake it from sleep, GPU fails to initialize and I get a black screen. I have to reset multiple times, it takes several tries like revving up a chainsaw but eventually it does work, but it's annoying and worrying as hell. Does open source genning put undue strain on a GPU? Perhaps leaving Flux running for 10 hours generating batches while I'm away at work was too much for my aging 3080
>>107011664Can't I make a lora?
>>107011844Just unplug it and blow on the slot
Are there any IPadapters for Chroma, or do you just use the Flux ones?
>>107011844mine did the blackscreen thing and then it died a few weeks later. i also left it genning so you might have to get ready to buy a new one soon
>>107011223>>107011247can you post your workflow? my chroma gens always look blurry or messy
i really love ltx2 video just for the mere audio support and 8sec coherence. can't go back to generating muted slopped video from wan 2.2. want to try grok out but redditors are crying about the excessive censorship.https://files.catbox.moe/80u7jc.mp4
>>107011984uhmmm LOCAL video gen? go to /wsg/ or /gif/ sora threads if you want to post your NON local videogens, no one fucking cares here.
>>107011984uhmmm stay but less talking and more bouncing
migu
>>107012162I'm a local fag but bored with muted videos. just paid for a month at a discount test the backlog of 1girl images i have on my output folder.
>>107011094again, I tried this bullshit and the output was slopped as fuck and the background was grainy. 480p high, latent upscaled 1.5x to 720
>>107011115why you lie?
>>107012372again, fuck off, you're off topic.
>>107012372again, stay, but more bouncing
>>107012445you fuck off, thread shitter
don't know why it tinted green. triple ksampler, unipc/simple
>>107012474that immediately killed the sitcom aesthetic..or did it? I haven't watched tv in decades, god knows what they show now
>>107012457talk about local in the local diffusion thread, faggotron
>>107012457>posts off-topic shit>thread shitterliterally kys, your gens are garbage 3dcgi anyway they FUCKIGN SUCK bro, either do anime or realism, not this fucking pixar shit it fucking blows. then it's all cowtits shit too, NO RANGE at all, you're a literal brown subhuman with your fucking bigporn whatever illustrious shitty mix youre using, fucking garbage. literally kys.>>107012513browns dont understand that we have multiple threads for everything, I understand not wanting to go to /adt/ or /sdg/ (theyre tripfag/avatar infested that do the daily HI X HOW R U X3 absolute cancer thing, on top of having faggystudio in the op), but there are pretty 100% servicable /de3/ threads here for cloud image, and sora threads in /wsg/but no, they can't fathom frequenting multiple threads, they think they're regulars here so they shit up the thread with their useless shit nobody wants to see (save for another retarded brown that won't understand WHY someone doesnt want to see non-local shit in local threads).It's the same thing in /lmg/ btw, you have absolute brownoid retards discussing diffusion/comfy shit sometimes, but they're ''''regulars'''' there so they feel entitled to shit up the thread. browns shouldve never been allowed internet access.
>>107011247average romanian after night out>>107011294heheh cute>>107012474so fuckable My Lord Allmighty, she's so fuckable fuckk.OOF. This is the type of girl that you wake up mid night and get to fucking while she protests but succumbs to her body quickly. God fucking damn it everything damn fuck all.Young bratty bitches God bless em.She would not have time to do anything around the house. If I saw her cooking Id fuck her. Cleaning? Id ra_e her on the floor. Chilling on her phone? Take phone ram in the wall throatfuck her until shes wet. Life doesnt get much better than a bratty bitch cocksleeve on the cock daily. Imagine the freedom and relief of living such a balls emptied frequently, life. Oh yeah, and the girl aint bad either.
>>107012549post hands
>>107012544based, fuck them.
>>107012544I'm not that anon. anon was testing a model that will be open source in a month (supposedly) and you have a sperg fit. YOU kys
>>107012578>open source in a monthit is currently API only, it doesnt fucking matter. Also they can still rugpull and what they release as open might not correspond 1:1 to what they're serving via API. Kindly fuck off.
>>107012586just fucking end your life retard
>>107012558ladies first
I love women nobody has ever loved women more than me ever in history of the Universe across all races and planets and everything.Woman is the perfect form I adore forever and ever.I love women have loved women will love women and always will beWomen are the perfect expression of the Universe and only I am to breed them all.If you are a woman you belong to me, simple as.I love women forever and ever more than anything.All women belong to me. I am the ONLY MAN to have infinite children.All other men are to tend to my women and my children.That is the Truth of This World.Have a nice day.
>>107012684Never going to get a woman talking like that
>>107011161am i doing it right?
>>107012684If you are 10/10 woman, you belong to me.If you are 9/10 woman, you belong to me.If you are 8/10 woman, you belong to me.If you are 7/10 woman, you belong to me.If you are 6/10 woman, I may be persuaded.If you are 0-5/10 woman, unless a friend, you will be shipped to Australia with all other <5/10s to fight the local animals and other women in butch games for pleasure of my Kingdom.Survivor will be allowed to work as security. If you are a "man" you are a servant to me and my women (if you behave nicely you earn "woman viewing priviledges" where you can glance from distance. Otherwise you live in the mines and mine lithium for AI friend (only friend I indulge except God).My word is LAW. Under God, me and my offspring and my glorious women, nobody else.I have been born with this message, more ancient than time.SO it be done.
>>107011969https://files.catbox.moe/6fv9gj.json
>>107012656>subtle cameltoewan is based
>>107012544what an absolute faggot. i was testing the ltx2 video model on the official site because there is a serious drought and the devs are not releasing the weights until the very end of November. its a open source model so its far from off topic.https://files.catbox.moe/b55zog.mp4https://files.catbox.moe/u3lup5.mp4
>>107012910stop posting your cgi garbage retard, you're still posting OFFTOPIC api shit despite all the mental gymnastic you're doing. kys
>>107012887Anon do you have a good i2i workflow for Chroma to share?
I continued genning ponyv7, iterating styles.Using simple/euler/40steps/4cfg randomized style clusters and.It's fucking garbage. WHILE some of the gens might look nice, the details are absolutely shit. It's like the model is undercooked, and ALL gens have big defects, either in the anatomy (mostly eyes/hands, didnt try FEET gens but I cant imagine them being better) and/or in the image composition. These aren't even hard prompts AND MAN it just fucking sucks.Posting a realism gen I got when randomizing the styles. Anyways, that's it from me for pony. It's a shame because it had some promise. You can gen at 2MP (which is nice) without needing to upscale, 100~ seconds per gen, but the results are... just not worth it. Back to lumina/qwen/chroma with me.