Discussion and Development of Local Image, Video, and Music ModelsPrevious: >>108987212https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, & Upscalershttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.info>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/tdrussell/diffusion-pipehttps://github.com/kohya-ss/sd-scriptshttps://github.com/kohya-ss/musubi-tuner>Zhttps://huggingface.co/Tongyi-MAI/Z-Image>Animahttps://huggingface.co/circlestone-labs/Animahttps://tagexplorer.github.io/https://animadex.net>Qwenhttps://huggingface.co/collections/Qwen/qwen-image>Kleinhttps://huggingface.co/collections/black-forest-labs/flux2>Wanhttps://github.com/Wan-Video/Wan2.2>LTX-2.3https://huggingface.co/collections/Lightricks/ltx-23>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkCollage: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/b/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debohttps://rentry.org/animanon
Blessed thread of frenship
>>108990836no. GRRR!!!!!!! *farts in your general direction*
>>108990835Can this like body swap the girl in porn videos?
>>108990754>The average ZIT gen will have better realism than average ZIM gen.>>108990796>it's the output generally of zib which is less realistic.Anon, it's easy to understand. The average ZiT output will be better than the average ZiB output but the best you will see out of either will come from ZiB.
HOLY SHIT COMFY ACTUALLY LISTENED! Finally they're deprecating local. Most users want the full power of the cloud, it's stated right in the installer. Nobody needs shitty kLatentEmbedder nodes or whatever junk, it just clogs up the node list.
>>108990880upload to HF?
>>108990909Upload what, the image? Why would I upload the image to huggingface?
>>108990850i think it only works well with solo subject.
is it possible to hack Google and download the checkpoint for nano banana pro?
>>108990974Yes, i'm using it locally
>>108990980don't know if joking or not but share it frend
>>108990983Nah you can do it yourself mate, very simple
>>108991044workflow?
Weekend artist how are the masterpieces coming along? Wagecuck hat goes back in two days...
>>108990891but to finalize the image, to fix the anatomy problems and such, you'll have to put it through zit, so it's a moot point - anything can be used for the 1st pass, even zit with loras that add seed variance. What else are you going to do? Gen 1000 images hoping you get lucky with the perfect one?
>>108990933the lora? none of the shadman ones i see are as good as that desu
>>108991060>Wagecuckwho?
i remember anon suggesting to generate batches of images instead of individual ones. good idea. i save 1 second per generation
>>108991044the original looks hotter
https://civitai.red/articles/30980>T minus 40 hours until civitjeets are reminded of their GPU poor status
>>108991083There is no lora. Just @shadman with Anima, model's own knowledge.
>>108991101>>T minus 40more like 20
>>108991145Very nice work, anon. Is this an OC? You seem like quite a unique individual.
>>108991101sir, where will I train my new zit lora with my pony data now, sir?
>>108991145kinda nonsense innit
>>108991205this is so cool, what's your setup like hardware wise?
>>108991048>>108991214nyo
>decide to train some lora>'charmap' codec can't encode characters in position 33-42: character maps to <undefined>alright then
>>108991219tranny
>>108991214>intel 13900k>rtx5090 >64gb ddr5heres a failed one
>>108991240would 40 gigs be enough for video?
>>108991232what jankass encoding are you using for your captions?
>>108991255it triggers the error even without captions
>>108991240Boobs and ass jiggling as they teleport unlike the source video is interesting.It seems to understand the physics of human anatomy on a more fundamental level.Interesting toy. But probably beyond capabilities of my humble hardware and I assume workflow for this shit is also a nightmare.
>>108991262did you mess up the config file then? which trainer is it
>>108991232What are the characters in position 33-42 of this caption file?Either you wrote some monstrosity or the tool you are using uses something else besides utf-8 for some reason.
>>108991262It will say line x in file blabla in the traceback somewhere.Look at the culprit file.
>>108991101> crypto mining> 2026it this true?
I just want to make various poses with a consistent character. Why is this literally impossible to do locally? Should I just give up on img2img and use SD with a lora instead?
>>108991319use klein 9b
>>108991301Some shitcoin came out and people mined it, that is true.Whether it got rugpulled already or if it is still profitable to mine, that's a different matter.
>>108991319> Why is this literally impossible to do locally? what are you on about. even right in this bread anon did videos like that.
https://files.catbox.moe/gp4vz4.wav
interpolation is so fast now, what changed?
>>108991417GPU?I don't bother with interpolation but if it got a huge speed boost, I might reconsider.
>>108991409this is what english sounds like to foreigners
AI can't create something it has never seen beforeSAD
HelloI'm new hereWhat's the best guide to learn ComfyUI from scratch?Any playlist on youtube to learn creating workflows and controlnet?I mainly want to turn rough doodles/stick figure art into imageslike if I make a stick figure art of two people riding a bike and prompt the rider as x and pillion as y and it'll generate it
>>108991508it's never seen your mum before but I can prompt for whales easily
>your mom jokes in almost 2027
>>108991508>AI can't create something it has never seen beforeYou aren't the first to say that. Also, AI can certainly merge concepts in ways humans have never seen before. Most human thoughts are largely unoriginal. Ones that are, are usually combinations of other previous ideas.Hang in there anon. It'll get better some day (if you want it to)
>frog>cringe>x in <current year>
>>108991240Good lord. Just imagine new DLSS versions having this quality and having all that jiggle and quality on every NPC in a game. I can't even contain myself thinking about it.
>>108991202Shirow had a very autistic way of thinking out the mechanics, so it probably all fits
models cannot into left and right.Doesn't matter if SAAS or local, saying something like "their right eye closed" always seems to have them close the eye on the right of the screen (characters left) closed
'rog 'eb'iteFACT!!!!!!!!!!!!!!!!!!!!!!!!!
I'm a die-hard Flux fan, but Ideogram wins by a landslide, hands down. I'm switching over until the next Flux
>>108991403Yeah, I remember when it took them months to fix random seeds not working inside of subgraphs.>>108991729No. It's really touchy about resolution. It seems like a popular model though, so maybe genius or two might create some fixes.>>108991851>The model this prompt is for understands "left" and "right" as horizontally mirrored, so be sure to correct those directions in the final prompt.I added this to my LMS Sys Prompt and it improved things a lot (this is for Z-Image btw).
>>108991508it even pretty much always does, thanks to random noiseit does remember and mix up concepts and patterns else it'd be random noise, but that's art (minus the "art" that is paint splotches thrown at a wall, that one is just human doing anything random + physics)
>>108992205what does it win? text?
is it true that you cant even use the ideogram outputs commercially?
>>108992269you can use all model outputs commercially if you really think about it
>>108992205can it be run on on comfy?
>>108992205does ideogram let you put as many reference images as you want? that's what i like about klein
>>108991851I'm pretty sure this stems from the fact that people themselves can't decide on if they're talking about left/right of the picture or left/right from the perspective of the person in the image. Like if you say their left hand, is it your left or their left? People probably flip flop on which "left" they're talking about in the captions and it confuses the model.
>>108992331nothing runs on comfy when it's broken
tfw SDXL with CN is better and more consistent for face transfer than Klein 9B edit
>>108992447examples?
>>108992457Just try it yourself.
>>108992467cool. can you share a catbix with metadata?
>>108991354it's still profitable if you already have the hardware for it. not worth invest new gpu for it though so i don't expect this to cause gpu shortage like the last time
>>108992435yeah, amazing AI model bro
>>108992208you can always count on the highest quality gens in /ldg/ to be of jenny
how big are your files?
>>108991262xister, copypaste the entire traceback into a chatbot and it will guide you like a dog guides a disabled person
>>108992695~5 megs
https://github.com/Comfy-Org/ComfyUI/pull/14182Answer the man you Comfucker so I can use Anima in OneTrainer already.
Are people skipping the unconditional model for Ideogram, or what? How are you keeping below 24gb VRAM use?
>>108992695mostly something like 1.5MiB/10sec. if i ever have something that really needs better quality I'll probably also change codec, not just increase bitrate.
>>108991813Shirow did but not even gpt-image-2 would have learned that from being trained in his work
>>108992777whenever I run it iirc it reads something from disk on comfy so it isnt keeping everything on vram
I NEED more Anima loras.Someone go tell everyone Illustrious is over already!!
>>108992811the ai hasn't a clue what everything's for
>>108992869make them yourself
>>108992907nobody ever taught meI usually ask AI for help but AI can't help with new subjects
>>108992921prepare the training data for something reasonably popular and people here or on civitai (bounty system?) or elsewhere might even give it a shot
>>108992867Yeah, somehow it seems to have sorted itself out and it doesn't go over anymore. Not sure what happened.
>>108992869>Someone go tell everyone Illustrious is over already!!you'll have an easier time convincing the people still on sd1.5 kek
>>108992359Does it even allow reference images? I've been looking around to see if anyone has a workflow for it but the most I can find are for region prompting but you can't connect more than one image.
>take a pic of your dick>remove background>put it near the face of your 1girl gen>prompt it to suck your cock using WanALL I'VE EVER WANTEDWAS AN ANIME GIRLTO SUCK MY DICKIT'S WITHIN MY HANDSI MUST NOT PROMPTI MUST NOT PROMPT
>>108992975this is nice, i like it. like a cheesy 90s music video.>>108992991problem here; i have to look at my dick outside of JO times.
>>108992975Looks like the video for Paramore's Brick by Boring Brick.
Shitmix progress
>>108992988>Does it even allow reference images?klein? yes
>>108993068I meant ideogram
>>108993048>>108992570Hi Catjack!
>>108993074oh. i don't know, i'm asking the same thing. you can't replace flux without reference images
>>108993097Stop trying to get your subhuman avatartranny crush to notice you, Julien
>>108993142that's catjak and you are responding to yourself in yet another retarded false flag attempt. happy pride month faggot
>>108993157Sure it was, JulienSure it was
>>108993139NTA but I don't think it is going to replace flux in any capacity other than high effort generations. I don't think it does reference images or any kind of editing. I think it may do a little better with inpainting because of how you can explicitly target an area, but I haven't really tried it yet.The json prompting turns even a low-effort prompt into a medium-effort prompt which kind of sucks. But the images are really good and the control is amazing if you actually want to take the time to play with it. I really like how it seems to handle realistic fine art styles. It's just not something you can queue up a bunch of wildcards and play gatcha with.
>>108993210i checked their website and it does look like it has editing. i wonder if it lets you plug multiple images in
prolefeed
how do you get z image turbo to be mor dynamic with poses and angles? everything comes out as a typical flat picture from a magazine
>>108993349You use z image base then pass it through turbo
>>108993331catbox? workflow? hog many GBs of VRAM do I need for this?
3 merges in
how to avoid flux2 klein 9b edit color shift?
Looking to add image gen to my llm server. Currently running with 4 v620s for text, and am planning on buying a gpu for images. What's the best card for image/video under $800? Is cuda still king? I'm a bit wary of newer amd cards because I've heard they all have the reset bug. And intel software is horrible in the llm space, idk how they are with image generation.
>Anima>Ideogram>Klein>LTXwhat happened to chink shills? all of the best open weight models recently have been released by the west while china copes with underperforming API.
>>108993555Nobody uses anything except njudea except some poor few souls that got dicked with amd. absolutely no idea about intel because nobody uses that shit
>>108993581all saas now. the narrative flipped
How good is SD Forge - Neo with Txt2Video and Image2Video , should I attempt it or go with ComfyUI
>>108993715I think for video Comfy is the easier option. Although I see some anons here recommend Wan2GPVideo didn't work for me in Forge Neo
Hi, I'm using ZIT with qwen 3 4b text encoder, I've been running this setup since January. Is it outdated now? (eg, is there something better either in terms of model or TE?)
>>108993715use wan2gp
>>108993822There are zit model finetunes that may be of interest.I don't recall any reason to change TE.
>>108993421>>108991240https://files.catbox.moe/t8qgoq.mp4
when ideogramm train? the fck this guys doing?
>>108993911learn english.
>>108993971its called lear faggot
>>108993828I was scammed. Subbed and then it told me it doesnt do nsfw
>>108993447Nice anon
cozy breasd
>>108993984great bait post, really activates the almonds, makes one ponder "what the fuck does he mean by subbed"
It's interesting to me that Wan2GP is advertised for the "GPU poor" but the like single anon who uses it has a 4090 or something
>>108990891post your ZIB workflow for max quality then
Does comfy has something similar to BREAK in A1111?
anyone got max quality i2v ltx 2.3 workflow or does nobody gen videos anymore?
is it worth using smooth mix wan 2.2 over the original wan2.2 for porn?
If I pull hard enough will my penis grow in length?
>>108994151as long as you enable LFS before git pulling your cock
stop asking so many questions anon
>>108994062it will be great
>>108994172BAASSED
>just noticed a consistent anatomy error in my training data
>>108994299more like anatomy feature
Spent some quality time trying to figure out what the fuck ComfyUI did with their latest update. My gens went from 90~150 seconds to anywhere between 400~600 seconds. If they're gonna try to push dynamic vram out in this state they're out of their fucking minds.
>>108994368more money, more bugs
>the models are getting better, fast on the historical scale, but slow as fuck if you're actually paying attentiongrim. when i saw seedance 2.0 first teaser, i felt more annoyed than anything, since it just reminded me that from that moment onwards i will have to wait for 2 years to actually get that locally.
werks on my machine
>>108994477whats the tag for the motion effect? afterimage?want to test how my own realism lokr handles it
>Klein 9b image editingDoes anyone know if it's possible to set the strength of the loaded image akin to denoise somehow? using a reference image gets me exactly what i want, but it affects the image quality negatively. so i'd want to try and balance it.
>>1089945192000's grainy film photo, dynamic motion still of an intense battle scene outdoors in a grassy field under dramatic cloudy sky, golden hour lighting with strong rim light, heavy film grain, motion blur on background and hair, action freeze frame. 1girl, solo, extremely voluptuous enhanced Marie Rose, (massive huge breasts:1.4), extreme heavy underboob, prominent underboob cutout, deep cleavage, wearing a very short, battle-damaged black and white gothic lolita maid dress with extreme underboob cutout, frills torn in places, black thighhighs with garter belts, black gloves, white apron piece barely holding on, blonde twintails with black ribbons flowing wildly in motion, round tinted glasses slightly crooked, fierce yet playful grin, looking at viewer, dynamic fighting pose, mid-action, one leg raised high in a powerful kick, body twisted, breasts bouncing heavily with motion, fabric straining, thick thighs, wide hips, curvy waist, dramatic action scene, score_7, safe but very suggestive, highly detailed, 2000s analog film photography, cinematic
>>108994522found this:https://github.com/shootthesound/comfyui-ReferenceLatentPlus/blob/main/screenshot.png
>>108994532neat, thanks
>>108994497>2 yearsanon is high on hopiumltx just fired half the company. local video is absolutely dead.
>>108994532>(massive huge breasts:1.4)based beyond belief
>>108994655
>>108994701looks worse than the previous ones in terms of texturehave you tried the new pid models for qwenimage vae yet? they came out this week and newest comfy release added support as well. from some tests i did earlier this week they're pretty good for realistic styles on anima
>>108994754including the foreground
>>108994765Another snarky comment that wont be seen in sdg ;]
>>108994780>snarkyWhat?"Everything's fucked.""Literally."
I see the junk (that's still background) but I meant subject rather than foreground, in another sense.
im all gooned out bros. dont even feel like genning. i left a couple projects just sitting there half done
man, the fucking spaces-instead-of-underscores requirement for Anima is a killer, it's going to hurt its popularity so much.I bet like half the people who try Anima and get shit results and then go back to illustrious is because they used underscores in their tags.
>>108994299I have a YEAH ANATOMY folder where I save both ai and non-ai quirky anatomy although most of it is nsfw
>>108994754>prebog megyn
>>108994831Retards who don't research (basic reading and asking questions) a new model before using it deserve what they get.
It is a mystery why snarktranny doesnt comment deboshit ;]
> >108994846fuck off
One boat to goontown please
>>108994831>man, the fucking spaces-instead-of-underscores requirement for AnimaHas it not been this way since the original NAI leak
show bob and vagene
I don't see the fucking point of making porn/lewd gens. They're useless, I look back on them after a few months and think to myself 'why?' So when I look at all of them in this thread, its like watching indians shit in the street. With that being said, I wonder if indians walk across the shit they made in the street and feel bad.
>>108995188I think the real power move is to make gens that can be used in datasets
When are local models going to actually be able to think? Reve 2.0 is nuts, it seems to be able to look up characters automatically. There is genuinely zero need for character loras on a model like this, it gets their entire outfits correct>A cute watercolor painting of Eirika and Lyn from Fire Emblem having coffee together in a cafenot even anima would come close to this in a one-shot gen
I've been training style loras for anima and I'm trying to select the best epoch. I'm generating an image using the same seed on all of them and comparing the output. Should I consider it sufficiently trained when the images start looking the same, at least as a loose heuristic?
Where are all the NEW celeb LoRAs ever since civitai went to shit? civarchive only has the old ones that are already like 2 years old.>>108993864>https://files.catbox.moe/t8qgoq.mp4Bottom left video is another of yours or somebody else's?
>>108991417How fast are we talking?
QRD on conditional vs unconditional? And why it needs both Gemma 4 and Qwen3?
>>108995323This shit sucks ass.
>>108995458wait so this thing is apparently ACTUALLY honest to god censored in the way no model before ever has been, like it returns legit image blocked outputs INHERENTLY based on training? Really? Why would I bother then? It can't possibly be that good
>>108995547every other company that's released an image model is currently drowning in lawsuits. ideogram might be the first image model that doesn't open the parent company to litigation.
>>108995557wat? please tell me all about e.g. BFL drowning in lawsuits, which surely is real and not total BS
>>108995460Reve 2.0 is a meme, every image looks like a nearest-neighbour SDXL gen at full size, they're forcing stupidly huge awful-looking images for no reason, IDK why they don't offer at least 2K or something, their model clearly cannot properly do 4K at all
why the fuck LTX and SULF/EROS whatever based on LTX keep melting anatomy? like tongue and lips, dick and mouth ... DOESN'T THIS MODEL understand how humans are made??? D:< WHY IS IT LOOKING LIKE SHIET
"realistic"
Any uncensor for ideogram yet?
>>108995412
>>108995758Is she the gymnast from P5?
>>108995363dam how did you make this? really cool
>>108995846still not fast enough to be used for real time video playback.
>>108995771please keep the slop out of /ldg/
>>108995869
kino alert
>>108995384I assume the community moved on to some telegram channel. No idea which.
>>108995890>SeaDance
>>108995384https://huggingface.co/malcolmreyhttps://huggingface.co/SDim1973/Z-Image-Lorasthere were also some upload folders maintained in the /r/ thread before it was kill, though i cant find the linkkeep in mind all of these are shit, then again so was what was on civitaino idea where that community moved to
>>108995384>>108995979>there were also some upload folders maintained in the /r/ thread before it was kill, though i cant find the linki think here https://rentry.org/ldg-lazy-getting-started-guide#defunct-rrealisticparody
I swear I'm going to crash out if Krea 2 gets a lobotomized release.I can't take another Flux or Qwen slopped model.Don't even get me started on Ideogram. Jfc.>picrel is bfl
Did trellis make mainline comfy yet?
>>108995997>the models they name as 'high risk' are all chinese API-only now>bragging about making the local cloud models more censored than the cloud oneslocal is pathetic
>>108996034Answered my own q. no. Trellis requires custom nodes.If you are on rdna2, like me, you can't run trellis at all.As of last month, rdna3 support emerged:https://github.com/CalebisGross/TRELLIS-AMD(an amd fork of Microsoft's code)And apparently rdna4 *does* handle Trellis. I guess HIP supports it. I don't have an rdna4 card to try it out on.
>>108995997Krea 2 Large API version isn't impressive as is, it can't do realistic architecture and shit nearly as well as the original Flux Krea could, I don't think it'd be anything to write home about. Also yeah Ideogram 4.0 is a bit of a joke, IDK why the community is suddenly fine with like, ACTUAL censorship that really exists in a way it didn't actually ever before in other models
>>108996150I wish I could fee like this
By not releasing the 5080 Super at 24gb of vram, nvidia is helping AMD dump their 24gb 7900xtx at a much higher price than would otherwise be possible.
>>108996142The censorship can be easily bypassed with the prompt builder node now. The problem is that ideogram 4 isn't meant for i2i. How the fuck are you making a model aimed at creating posters and product ads but can't even insert the product you're advertising?
>>108996142It's more about how expressive Krea 2 is.Every damn local image model released has this rigid slop look to it. Even ZIT and Z-Image.All of the local models we've been getting are for benchmaxxx scores, not sovl.We need a model to where we can just prompt wild and crazy shit.That gives local a base model to do crazy finetunes like we had in the SD1.5 days.
>>108996241We need a model that supports detailed descriptions of the face and body. Then, we need an llm that's light but can turn simple prompts into the format.We need the same thing for what you call "crazy". In other words, we need way more detailed training, but an llm to help make it easy for us.I have made it a game to try and find the pointy chins that ZIT and every other ai image creator everywhere prefers to make.Did you know Moot has an "ai chin"? But his wife doesn't.
>>108996213There was a cope rumour recently that Super refresh will still be coming this year. I don't believe it, but who knows.Honestly pc hardware is fucked until 2030 at least, perhaps forever imo. I have no idea how I am going to upgrade from my 3060 + 32gb ram.
What's a good model for touching up photographs? I've taken a bunch of concert photos over the years that would be otherwise great except for blur because my dumb ass keeps forgetting to set it correctly.
>>108996262I dunno try your luck with Klein.
>>108996155
>>108996262AI models don't fix blurry images, it just fills in the missing data with whatever it thinks it should be.
>>108991508>AI can't create something it has never seen before
>>108996313Yes gremlins already exist
I dunno why it is struggling with biting own lip and giving me the weird vampire teeth.
Anima really isn't that smart... it needed a better base and better captions
>>108996341There isn't a single local Anime base model.They are all finetunes of another model.
>>108996341it's pretty fucking good for a model smaller than SDXL even when including the text encoder
>>108996262i have successfully used klein to turn my noisy RAW files into clean photos
>>108996377it is much slower than sdxl tho
>>108996407you will get the image you want much faster than with SDXL though since you won't have to deal with controlnets, inpainting or adding text with an image editor.
>>108996377It's slightly better, but it kind of creates messy crap. The prompt adherence is roughly stable cascade level.
jordach status?
>>108995758box?
>>108993864Thanks but how do I get the workflow out of this?It's only opening a load video node where I drag it
>>108991508But what hasn't it seen?
>>108996451https://files.catbox.moe/3h3vc3.png
>>108996546I will ask google to turn this into a song :^)
>>108996235why should I care though, like please explain how this model is somehow so good that it's actually fine it takes censorship ten times farther than any model ever did before it
>>108996241is this thread just shills now? Krea 2 looks like unispired bullshit just like most other recent models, it's not bad but not great either, wtf is this nonsense
>>108996559localkeks are absolutely buckbroken, it's stockholm syndrome at this point. they're still coping with flux klein. heckin based uncensored china sold them all out to comfycloud API and now they have to lick western corporate boot and beg for censored scraps
>>108996422>The prompt adherence is roughly stable cascade level.what the fuck are you talking about you absolute moron? This has to be bait. Anima can understand lengthy natural language prompts perfectly, Cascade (which DID NOT EVEN HAVE BETTER ADHERENCE THAN XL, YOU FUCKING RETARD, IT WAS STILL ON CLIP) cannot.
>>108996559Because you can direct scenes with it. But it's still kind of useless without the ability to insert image references or i2i. I don't expect the general's local schizos to understand why being able to control the image is a good thing though.
>>108996570stop taking bait retard
>>108996572wtf does "direct scene" mean in a direct, practical sense, though?
>>108996582It means not just using it as a 1girl gacha
>>108996583so, you mean there's absolutely nothing remotely interestin about it compare to numerous models that already exist? Got it. How much is Kekgram paying you to shill their faggot stop-sign riddled nonsense BTW?
>>108996591I get maybe ₹95 every 100 posts
>>108996563>is this thread just shills now?no i simply stopped replying to them
>>108996559The censorship is there is no censorship, you just have to follow the stupid json rules (or use the KJnodes node), and it'll happily generate whatever you ask.Like check this out:https://files.catbox.moe/ghv0gz.pngI deliberately phrased the prompt in a way that would trigger any censorship filters, and it generated it happily.It can't do genitals, just generates blank skin, but that's a training data issue and puts it in exactly the same position as every other model on launch.It's fast, you can gen high resolution, and the regional prompting is a feature I haven't seen on any other model.
>>108996474works on my machine. or u just open the old fashion way
is there a way to automatically copy all of the catbox links when you upload multiple files?
>>108996664ask claude to slop up a userscript
>>108996694nah, just wanted to know if there was a native button i was missing
>>108996639>https://github.com/Comfy-Org/ComfyUI/pull/14216Could you try this PR?I'm curious to see if you get better results. It doesn't need pose conditions and stuff.
y local ideogram is so grainy/greasy?
>>108996778If you are using the comfy default workflow it is shit and fries the image. Override to lower cfg earlier like around 70%.
Maybe also caused by quantization I dunno I wouldn't put it past them to intentionally gimp the fp8.
>>108996396>>108996266I'll try that.>>108996288Here's an example of the kind of stuff I'm trying to unfuck. Shitty camera plus poor settings. Good show, incidentally.
>json prompting is too difficult!!just let llm write it for yousays a lot about the technical know how of the image gen "community"
>>108996546ai gem free wrotted this songhttps://files.catbox.moe/gyzm8b.mp3genned on my own electricityI think it's as good as any Disney slop. hilarious it read a descriptor "cinematic".
>>108996866You don't even need this. Just use the kj prompt builder node and give basic regional instructions.
>>108996878that node is great for control but too much effort for low effort prompts
at least ideogram is better than microsoft lens.
>>108996810This is practically unsalvageable especially since your phone's shitty filter messed it up even more. It's not going to know what that guy looks like.
>>108996896i imagine reference images could help. plug his face in. plug the band logo in. the whole image will look different but no one will know it was edited
Personally, instead of taking photos, I just memorize what I see and keep the prompt.So, instead of taking a woman's photo, say one who is jogging or whatever, I type in all the things she is wearing and descriptors. sometimes I use an llm to figure out what those things are called. then, I gen it.
>>108996910At that point just generate new ones from scratch, they will probably look better. Add Miku up on stage with them while you're at it.
>>108996775i tried it few days ago. it just stuck in rendering sampler forever when i increase resolution to 720p. so im stuck in low res. less breasts jiggle, poor color (probably due to low res) and longer gen time.maybe i try again when they updated new workflow and nodes
>>108996927>>108996927
>>108991319same. Anima has theoretical image input capabilities, but this topic is largely underexplored https://github.com/Mirumo0u0/ComfyUI-Cosmos-Reference