/g/ - Technology




File: collage.jpg (2.83 MB, 3965x3800)
Discussion and Development of Local Image, Video, and Music Models

Previous: >>108987212

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
https://animadex.net

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>Wan
https://github.com/Wan-Video/Wan2.2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
Blessed thread of frenship
>>
>>108990836
no. GRRR!!!!!!! *farts in your general direction*
>>
>>108990835
Can this like body swap the girl in porn videos?
>>
File: jail girl.png (877 KB, 1152x864)
>>
File: Flux2-Klein-4b-base.png (1.88 MB, 1024x1024)
>>108990754
>The average ZIT gen will have better realism than average ZIM gen.
>>108990796
>it's the output generally of zib which is less realistic.
Anon, it's easy to understand. The average ZiT output will be better than the average ZiB output but the best you will see out of either will come from ZiB.
>>
File: deprecated.png (712 KB, 1282x834)
HOLY SHIT COMFY ACTUALLY LISTENED! Finally they're deprecating local. Most users want the full power of the cloud, it's stated right in the installer. Nobody needs shitty kLatentEmbedder nodes or whatever junk, it just clogs up the node list.
>>
>>108990880
upload to HF?
>>
>>108990909
Upload what, the image? Why would I upload the image to huggingface?
>>
File: Wanimate_00147.mp4 (1.51 MB, 922x1280)
>>108990850
i think it only works well with solo subject.
>>
File: 1766185207778076.jpg (40 KB, 403x392)
is it possible to hack Google and download the checkpoint for nano banana pro?
>>
>>108990974
Yes, i'm using it locally
>>
>>108990980
don't know if joking or not but share it frend
>>
>>108990983
Nah you can do it yourself mate, very simple
>>
File: ComfyUI_00054_.png (1.67 MB, 880x1168)
>>
File: Wanimate_00153.mp4 (2.09 MB, 922x1280)
>>
>>108991044
workflow?
>>
Weekend artist how are the masterpieces coming along?
Wagecuck hat goes back in two days...
>>
>>108990891
but to finalize the image, to fix the anatomy problems and such, you'll have to put it through zit, so it's a moot point - anything can be used for the 1st pass, even zit with loras that add seed variance.
What else are you going to do? Gen 1000 images hoping you get lucky with the perfect one?
>>
>>108990933
the lora? none of the shadman ones i see are as good as that desu
>>
>>108991060
>Wagecuck
who?
>>
i remember anon suggesting to generate batches of images instead of individual ones. good idea. i save 1 second per generation
>>
>>108991044
the original looks hotter
>>
File: GPUlets BTFO.png (842 KB, 1148x868)
https://civitai.red/articles/30980
>T minus 40 hours until civitjeets are reminded of their GPU poor status
>>
>>108991083
There is no lora. Just @shadman with Anima, model's own knowledge.
>>
>>108991101
>>T minus 40
more like 20
>>
File: Flux2-Klein_00142_.jpg (343 KB, 1552x832)
>>
>>108991145
Very nice work, anon. Is this an OC? You seem like quite a unique individual.
>>
>>108991101
sir, where will I train my new zit lora with my pony data now, sir?
>>
>>108991145
kinda nonsense innit
>>
File: Wanimate_00155.mp4 (3.36 MB, 928x1280)
>>
>>108991205
this is so cool, what's your setup like hardware wise?
>>
>>108991048
>>108991214
nyo
>>
File: 1752849324218875.png (2.73 MB, 1536x1984)
>>
>decide to train some lora
>'charmap' codec can't encode characters in position 33-42: character maps to <undefined>

alright then
>>
>>108991219
tranny
>>
File: Wanimate_00149.mp4 (1.27 MB, 820x1152)
>>108991214
>intel 13900k
>rtx5090
>64gb ddr5

heres a failed one
>>
>>108991240
would 40 gigs be enough for video?
>>
>>108991232
what jankass encoding are you using for your captions?
>>
>>108991255
it triggers the error even without captions
>>
>>108991240
Boobs and ass jiggling as they teleport unlike the source video is interesting.
It seems to understand the physics of human anatomy on a more fundamental level.
Interesting toy. But probably beyond capabilities of my humble hardware and I assume workflow for this shit is also a nightmare.
>>
>>108991262
did you mess up the config file then? which trainer is it
>>
>>108991232
What are the characters in position 33-42 of this caption file?
Either you wrote some monstrosity or the tool you are using uses something else besides utf-8 for some reason.
>>
>>108991262
It will say line x in file blabla in the traceback somewhere.
Look at the culprit file.
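
For reference, that error usually means Python fell back to a legacy Windows code page (cp1252 or similar) instead of UTF-8 when writing or printing text. A minimal sketch of the failure and the usual fix, assuming the trainer is Python and the offending text contains non-Latin characters (the sample caption and file name below are made up for illustration):

```python
import os
import tempfile

# Hypothetical caption containing characters outside the Windows-1252 range.
caption = "1girl, 初音ミク, solo"

# Reproduce the failure: cp1252 is a "charmap" codec and cannot encode kana/kanji.
try:
    caption.encode("cp1252")
    failed = False
    message = ""
except UnicodeEncodeError as err:
    failed = True
    message = str(err)  # mentions the 'charmap' codec and the character positions

# The usual fix: pass an explicit encoding whenever a text file is opened.
path = os.path.join(tempfile.mkdtemp(), "caption.txt")
with open(path, "w", encoding="utf-8") as f:
    f.write(caption)
with open(path, "r", encoding="utf-8") as f:
    roundtrip = f.read()
```

If the error shows up even without captions, the culprit may be a log or console write rather than a dataset file. Setting the environment variable `PYTHONUTF8=1` (Python's UTF-8 mode) before launching the trainer is a common workaround, though whether it helps depends on the trainer in question.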
>>
>>108991101
> crypto mining
> 2026
is this true?
>>
File: kms.jpg (5 KB, 250x176)
I just want to make various poses with a consistent character. Why is this literally impossible to do locally? Should I just give up on img2img and use SD with a lora instead?
>>
>>108991319
use klein 9b
>>
>>108991301
Some shitcoin came out and people mined it, that is true.
Whether it got rugpulled already or if it is still profitable to mine, that's a different matter.
>>
>>108991319
> Why is this literally impossible to do locally?
what are you on about. even right in this bread anon did videos like that.
>>
File: 35745.gif (1.97 MB, 224x128)
https://files.catbox.moe/gp4vz4.wav
>>
interpolation is so fast now, what changed?
>>
>>108991417
GPU?
I don't bother with interpolation but if it got a huge speed boost, I might reconsider.
>>
>>108991409
this is what english sounds like to foreigners
>>
AI can't create something it has never seen before
SAD
>>
File: Wanimate_00159.mp4 (2.67 MB, 1480x720)
>>
File: ComfyUI_00001_.png (942 KB, 1152x896)
Hello
I'm new here
What's the best guide to learn ComfyUI from scratch?
Any playlist on youtube to learn creating workflows and controlnet?
I mainly want to turn rough doodles/stick figure art into images
like if I make a stick figure art of two people riding a bike and prompt the rider as x and pillion as y and it'll generate it
>>
>>108991508
it's never seen your mum before but I can prompt for whales easily
>>
File: cringe.png (114 KB, 1000x700)
>your mom jokes in almost 2027
>>
>>108991508
>AI can't create something it has never seen before
You aren't the first to say that. Also, AI can certainly merge concepts in ways humans have never seen before. Most human thoughts are largely unoriginal. Ones that are, are usually combinations of other previous ideas.

Hang in there anon. It'll get better some day (if you want it to)
>>
>frog
>cringe
>x in <current year>
>>
>>108991240
Good lord. Just imagine new DLSS versions having this quality and having all that jiggle and quality on every NPC in a game. I can't even contain myself thinking about it.
>>
>>108991202
Shirow had a very autistic way of thinking out the mechanics, so it probably all fits
>>
models cannot into left and right.

Doesn't matter if SAAS or local, saying something like "their right eye closed" always seems to close the eye on the right of the screen (the character's left)
>>
File: fact.jpg (57 KB, 716x687)
'rog 'eb'ite
FACT!!!!!!!!!!!!!!!!!!!!!!!!!
>>
File: ComfyUI_00738_.png (1.06 MB, 1152x896)
>>
File: btgt66.png (1.28 MB, 1216x832)
>>
I'm a die-hard Flux fan, but Ideogram wins by a landslide, hands down. I'm switching over until the next Flux
>>
File: ComfyUI_00045.png (3.84 MB, 1500x1920)
>>108991403
Yeah, I remember when it took them months to fix random seeds not working inside of subgraphs.

>>108991729
No. It's really touchy about resolution. It seems like a popular model though, so maybe a genius or two might create some fixes.

>>108991851
>The model this prompt is for understands "left" and "right" as horizontally mirrored, so be sure to correct those directions in the final prompt.
I added this to my LMS Sys Prompt and it improved things a lot (this is for Z-Image btw).
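
If you'd rather not rely on the LLM to do the flip, the same correction can be done as a dumb text pass before the prompt hits the model. A rough sketch, assuming the model really does treat directions as viewer-relative; `mirror_directions` is a made-up helper name, not part of any UI:

```python
import re

_SWAP = {"left": "right", "right": "left"}


def mirror_directions(prompt: str) -> str:
    """Swap the whole words 'left' and 'right', preserving capitalisation."""

    def repl(match):
        word = match.group(0)
        swapped = _SWAP[word.lower()]
        if word.isupper():
            return swapped.upper()
        if word[0].isupper():
            return swapped.capitalize()
        return swapped

    # \b keeps words such as "upright" or "copyright" untouched.
    return re.sub(r"\b(left|right)\b", repl, prompt, flags=re.IGNORECASE)


example = mirror_directions("her right eye closed, Left hand raised")
```

Caveat: this also flips non-directional uses like "right away", so it only makes sense for tag-style or otherwise simple prompts.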
>>
File: k9.jpg (128 KB, 1024x1024)
>>108991508
it even pretty much always does, thanks to random noise

it does remember and mix up concepts and patterns else it'd be random noise, but that's art (minus the "art" that is paint splotches thrown at a wall, that one is just human doing anything random + physics)
>>
File: z.jpg (233 KB, 1024x1536)
>>108992205
what does it win? text?
>>
is it true that you cant even use the ideogram outputs commercially?
>>
>>108992269
you can use all model outputs commercially if you really think about it
>>
>>108992205
can it be run on comfy?
>>
>>108992205
does ideogram let you put as many reference images as you want? that's what i like about klein
>>
>>108991851
I'm pretty sure this stems from the fact that people themselves can't decide on if they're talking about left/right of the picture or left/right from the perspective of the person in the image. Like if you say their left hand, is it your left or their left? People probably flip flop on which "left" they're talking about in the captions and it confuses the model.
>>
File: 081219CUI_00001_.png (1.33 MB, 1152x1536)
>>
>>108992331
nothing runs on comfy when it's broken
>>
tfw SDXL with CN is better and more consistent for face transfer than Klein 9B edit
>>
>>108992447
examples?
>>
>>108992457
Just try it yourself.
>>
>>108992467
cool. can you share a catbox with metadata?
>>
>>108991354
it's still profitable if you already have the hardware for it. not worth investing in a new gpu for it though, so i don't expect this to cause a gpu shortage like the last time
>>
File: file.png (327 KB, 1547x1247)
>>108992435
yeah, amazing AI model bro
>>
File: 144617CUI_00002_.png (1.34 MB, 1536x1152)
>>
File: 1777139007041328.jpg (102 KB, 1920x1080)
>>108992208
you can always count on the highest quality gens in /ldg/ to be of jenny
>>
File: 57954.png (1.41 MB, 1365x1838)
how big are your files?
>>
>>108991262
xister, copypaste the entire traceback into a chatbot and it will guide you like a dog guides a disabled person
>>
>>108992695
~5 megs
>>
File: 9.png (603 KB, 512x728)
>>
https://github.com/Comfy-Org/ComfyUI/pull/14182
Answer the man you Comfucker so I can use Anima in OneTrainer already.
>>
Are people skipping the unconditional model for Ideogram, or what? How are you keeping below 24gb VRAM use?
>>
>>108992695
mostly something like 1.5MiB/10sec. if i ever have something that really needs better quality I'll probably also change codec, not just increase bitrate.
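
For anyone comparing against encoder settings, that figure works out to roughly 1.26 Mbit/s average. A quick sketch of the arithmetic (the function name is just for illustration):

```python
def average_bitrate_kbps(size_mib: float, duration_s: float) -> float:
    """Average bitrate in kilobits per second for a file of size_mib MiB."""
    bits = size_mib * 1024 * 1024 * 8  # MiB -> bytes -> bits
    return bits / duration_s / 1000    # bits per second -> kbit/s


rate = average_bitrate_kbps(1.5, 10)  # about 1258 kbit/s
```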
>>
>>108991813
Shirow did but not even gpt-image-2 would have learned that from being trained on his work
>>
File: ba.jpg (134 KB, 1024x1024)
>>
>>108992777
whenever I run it iirc it reads something from disk on comfy so it isnt keeping everything on vram
>>
I NEED more Anima loras.

Someone go tell everyone Illustrious is over already!!
>>
File: Clue.png (1.62 MB, 1039x1136)
>>108992811
the ai hasn't a clue what everything's for
>>
>>108992869
make them yourself
>>
>>108992907
nobody ever taught me

I usually ask AI for help but AI can't help with new subjects
>>
>>108992921
prepare the training data for something reasonably popular and people here or on civitai (bounty system?) or elsewhere might even give it a shot
>>
>>108992867
Yeah, somehow it seems to have sorted itself out and it doesn't go over anymore. Not sure what happened.
>>
>>108992869
>Someone go tell everyone Illustrious is over already!!

you'll have an easier time convincing the people still on sd1.5 kek
>>
File: 1856.webm (3.11 MB, 768x576)
>>
>>108992359
Does it even allow reference images? I've been looking around to see if anyone has a workflow for it but the most I can find are for region prompting but you can't connect more than one image.
>>
>take a pic of your dick
>remove background
>put it near the face of your 1girl gen
>prompt it to suck your cock using Wan
ALL I'VE EVER WANTED
WAS AN ANIME GIRL
TO SUCK MY DICK
IT'S WITHIN MY HANDS
I MUST NOT PROMPT
I MUST NOT PROMPT
>>
>>108992975
this is nice, i like it. like a cheesy 90s music video.

>>108992991
problem here; i have to look at my dick outside of JO times.
>>
File: 155650CUI_00002_.png (943 KB, 1536x1152)
>>108992975
Looks like the video for Paramore's Brick by Boring Brick.
>>
File: _Anima_00012_.jpg (384 KB, 1152x1592)
Shitmix progress
>>
>>108992988
>Does it even allow reference images?
klein? yes
>>
>>108993068
I meant ideogram
>>
>>108993048
>>108992570
Hi Catjack!
>>
File: 12466.webm (3.89 MB, 768x576)
>>108993074
oh. i don't know, i'm asking the same thing. you can't replace flux without reference images
>>
>>108993097
Stop trying to get your subhuman avatartranny crush to notice you, Julien
>>
>>108993142
that's catjak and you are responding to yourself in yet another retarded false flag attempt. happy pride month faggot
>>
File: 49b4zn.png (1.29 MB, 1024x1024)
>>
>>108993157
Sure it was, Julien
Sure it was
>>
>>108993139
NTA but I don't think it is going to replace flux in any capacity other than high effort generations. I don't think it does reference images or any kind of editing. I think it may do a little better with inpainting because of how you can explicitly target an area, but I haven't really tried it yet.

The json prompting turns even a low-effort prompt into a medium-effort prompt which kind of sucks. But the images are really good and the control is amazing if you actually want to take the time to play with it. I really like how it seems to handle realistic fine art styles. It's just not something you can queue up a bunch of wildcards and play gatcha with.
>>
>>108993210
i checked their website and it does look like it has editing. i wonder if it lets you plug multiple images in
>>
prolefeed
>>
File: 294.jpg (661 KB, 1312x1600)
>>
File: 56.jpg (921 KB, 1664x2136)
>>
File: Wanimate_00181.mp4 (2.82 MB, 932x1280)
>>
how do you get z image turbo to be more dynamic with poses and angles? everything comes out as a typical flat picture from a magazine
>>
>>108993349
You use z image base then pass it through turbo
>>
>>108993331
catbox? workflow? how many GBs of VRAM do I need for this?
>>
File: _Anima_00069_.jpg (581 KB, 1152x1720)
>>
File: _Anima_00070_.jpg (483 KB, 1152x1720)
>>
File: _Anima_00073_.jpg (496 KB, 1152x1720)
3 merges in
>>
how to avoid flux2 klein 9b edit color shift?
>>
Looking to add image gen to my llm server. Currently running with 4 v620s for text, and am planning on buying a gpu for images. What's the best card for image/video under $800? Is cuda still king? I'm a bit wary of newer amd cards because I've heard they all have the reset bug. And intel software is horrible in the llm space, idk how they are with image generation.
>>
File: _Anima_00092_.jpg (407 KB, 1152x1720)
>>
>Anima
>Ideogram
>Klein
>LTX
what happened to chink shills? all of the best open weight models recently have been released by the west while china copes with underperforming API.
>>
>>108993555
Nobody uses anything except njudea except some poor few souls that got dicked with amd. absolutely no idea about intel because nobody uses that shit
>>
>>108993581
all saas now. the narrative flipped
>>
File: _Anima_00129_.jpg (537 KB, 1152x1720)
>>
How good is SD Forge - Neo with Txt2Video and Image2Video? Should I attempt it or go with ComfyUI?
>>
File: _Anima_00139_.jpg (695 KB, 1152x1720)
>>
>>108993715
I think for video Comfy is the easier option. Although I see some anons here recommend Wan2GP
Video didn't work for me in Forge Neo
>>
Hi, I'm using ZIT with qwen 3 4b text encoder, I've been running this setup since January. Is it outdated now? (eg, is there something better either in terms of model or TE?)
>>
>>108993715
use wan2gp
>>
>>108993822
There are zit model finetunes that may be of interest.

I don't recall any reason to change TE.
>>
>>108993421
>>108991240
https://files.catbox.moe/t8qgoq.mp4
>>
File: _Anima_00159_.jpg (350 KB, 1152x1720)
>>
when ideogramm train? the fck this guys doing?
>>
>>108993911
learn english.
>>
>>108993971
its called lear faggot
>>
>>108993828
I was scammed. Subbed and then it told me it doesnt do nsfw
>>
File: 5a4ovu.png (1.55 MB, 1216x832)
>>
>>108993447
Nice anon
>>
cozy breasd
>>
>>108993984
great bait post, really activates the almonds, makes one ponder "what the fuck does he mean by subbed"
>>
File: file.png (39 KB, 1213x293)
It's interesting to me that Wan2GP is advertised for the "GPU poor" but the like single anon who uses it has a 4090 or something
>>
>>108990891
post your ZIB workflow for max quality then
>>
Does comfy have something similar to BREAK in A1111?
>>
anyone got max quality i2v ltx 2.3 workflow or does nobody gen videos anymore?
>>
is it worth using smooth mix wan 2.2 over the original wan2.2 for porn?
>>
If I pull hard enough will my penis grow in length?
>>
File: Ernie-Turbo-PID_00011_.jpg (2.5 MB, 4096x4096)
>>
>>108994151
as long as you enable LFS before git pulling your cock
>>
stop asking so many questions anon
>>
File: _Anima_00217_.jpg (389 KB, 1152x1720)
>>108994062
it will be great
>>
File: Ernie-Turbo-PID_00015_.jpg (2.65 MB, 4096x4096)
>>
>>108994172
BAASSED
>>
>just noticed a consistent anatomy error in my training data
>>
File: _Anima_00234_.jpg (413 KB, 1152x1720)
>>108994299
more like anatomy feature
>>
Spent some quality time trying to figure out what the fuck ComfyUI did with their latest update. My gens went from 90~150 seconds to anywhere between 400~600 seconds. If they're gonna try to push dynamic vram out in this state they're out of their fucking minds.
>>
>>108994368
more money, more bugs
>>
File: _Anima_00266_.jpg (448 KB, 1152x1720)
>>
>the models are getting better, fast on the historical scale, but slow as fuck if you're actually paying attention
grim. when i saw seedance 2.0 first teaser, i felt more annoyed than anything, since it just reminded me that from that moment onwards i will have to wait for 2 years to actually get that locally.
>>
werks on my machine
>>
>>108994477
whats the tag for the motion effect? afterimage?
want to test how my own realism lokr handles it
>>
File: 1767376298387819.png (78 KB, 1118x667)
>Klein 9b image editing
Does anyone know if it's possible to set the strength of the loaded image akin to denoise somehow? using a reference image gets me exactly what i want, but it affects the image quality negatively. so i'd want to try and balance it.
>>
>>108994519
2000's grainy film photo, dynamic motion still of an intense battle scene outdoors in a grassy field under dramatic cloudy sky, golden hour lighting with strong rim light, heavy film grain, motion blur on background and hair, action freeze frame.

1girl, solo, extremely voluptuous enhanced Marie Rose, (massive huge breasts:1.4), extreme heavy underboob, prominent underboob cutout, deep cleavage,

wearing a very short, battle-damaged black and white gothic lolita maid dress with extreme underboob cutout, frills torn in places, black thighhighs with garter belts, black gloves, white apron piece barely holding on,

blonde twintails with black ribbons flowing wildly in motion, round tinted glasses slightly crooked, fierce yet playful grin, looking at viewer,

dynamic fighting pose, mid-action, one leg raised high in a powerful kick, body twisted, breasts bouncing heavily with motion, fabric straining,

thick thighs, wide hips, curvy waist, dramatic action scene, score_7, safe but very suggestive, highly detailed, 2000s analog film photography, cinematic
>>
File: 1774932292566311.png (796 KB, 1310x787)
>>108994522
found this:
https://github.com/shootthesound/comfyui-ReferenceLatentPlus/blob/main/screenshot.png
>>
>>
File: 1758084977276952.png (2.72 MB, 1088x1928)
>>108994532
neat, thanks
>>
>>
>>
>>108994497
>2 years
anon is high on hopium
ltx just fired half the company. local video is absolutely dead.
>>
File: debo_i_fia_00004_.png (2.86 MB, 1433x1792)
>>
>>108994532
>(massive huge breasts:1.4)
based beyond belief
>>
>>
File: _Anima_00296_.jpg (475 KB, 1152x1720)
>>108994655
>>
File: gch2i5.png (1.83 MB, 1216x832)
>>
File: zImageturbo_00253_.jpg (487 KB, 1840x1152)
>>
>>108994701
looks worse than the previous ones in terms of texture
have you tried the new pid models for qwenimage vae yet? they came out this week and newest comfy release added support as well. from some tests i did earlier this week they're pretty good for realistic styles on anima
>>
>>108994754
including the foreground
>>
>>108994765
Another snarky comment that wont be seen in sdg ;]
>>
>>108994780
>snarky
What?
"Everything's fucked."
"Literally."
>>
I see the junk (that's still background) but I meant subject rather than foreground, in another sense.
>>
im all gooned out bros. dont even feel like genning. i left a couple projects just sitting there half done
>>
File: zImageturbo_00267_.jpg (392 KB, 1152x1840)
>>
man, the fucking spaces-instead-of-underscores requirement for Anima is a killer, it's going to hurt its popularity so much.

I bet like half the people who try Anima and get shit results and then go back to illustrious is because they used underscores in their tags.
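
If you have old booru-style prompts lying around, converting them is a one-liner. A minimal sketch, assuming a plain comma-separated tag list; `booru_to_spaced` and its `keep` parameter are hypothetical names, and whether any Anima tags need to keep their underscores is something I haven't verified:

```python
def booru_to_spaced(prompt: str, keep: frozenset = frozenset()) -> str:
    """Replace underscores with spaces in each comma-separated tag.

    Tags listed in `keep` are passed through unchanged (hypothetical escape
    hatch for any tag that should retain its underscore).
    """
    out = []
    for tag in prompt.split(","):
        tag = tag.strip()
        if not tag:
            continue  # skip empty entries from stray commas
        out.append(tag if tag in keep else tag.replace("_", " "))
    return ", ".join(out)


converted = booru_to_spaced("1girl, long_hair, looking_at_viewer")
```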
>>
File: slip.png (647 KB, 748x1173)
>>108994299
I have a YEAH ANATOMY folder where I save both ai and non-ai quirky anatomy although most of it is nsfw
>>
>>108994754
>prebog megyn
>>
File: debo_i_fia_00012_.png (3.53 MB, 1433x1792)
>>
>>108994831
Retards who don't research (basic reading and asking questions) a new model before using it deserve what they get.
>>
It is a mystery why snarktranny doesnt comment deboshit ;]
>>
> >108994846
fuck off
>>
File: zImageturbo_00295_.jpg (500 KB, 1152x1840)
One boat to goontown please
>>
>>108994831
>man, the fucking spaces-instead-of-underscores requirement for Anima
Has it not been this way since the original NAI leak
>>
File: zImageturbo_00309_.jpg (546 KB, 1152x1840)
>>
show bob and vagene
>>
File: zImageturbo_00326_.jpg (494 KB, 1152x1840)
>>
File: zImageturbo_00345_.jpg (470 KB, 1152x1840)
>>
File: 1754565477502208.png (3.35 MB, 1152x1375)
I don't see the fucking point of making porn/lewd gens.

They're useless, I look back on them after a few months and think to myself 'why?'

So when I look at all of them in this thread, its like watching indians shit in the street.

With that being said, I wonder if indians walk across the shit they made in the street and feel bad.
>>
File: debo_i_fia_00014_.png (2.69 MB, 1433x1792)
>>
>>108995188
I think the real power move is to make gens that can be used in datasets
>>
File: Reve Won.jpg (3.44 MB, 2256x1856)
When are local models going to actually be able to think? Reve 2.0 is nuts, it seems to be able to look up characters automatically. There is genuinely zero need for character loras on a model like this, it gets their entire outfits correct
>A cute watercolor painting of Eirika and Lyn from Fire Emblem having coffee together in a cafe
not even anima would come close to this in a one-shot gen
>>
>>
I've been training style loras for anima and I'm trying to select the best epoch. I'm generating an image using the same seed on all of them and comparing the output. Should I consider it sufficiently trained when the images start looking the same, at least as a loose heuristic?
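
The "images stop changing" check can at least be made less eyeball-dependent. A rough sketch, assuming each epoch's same-seed output has been loaded as raw pixel bytes of identical size (e.g. via Pillow's `Image.tobytes()`); the function names and the threshold are arbitrary placeholders. It only measures when outputs converge, not whether that epoch is actually the best one:

```python
def mean_abs_diff(a: bytes, b: bytes) -> float:
    """Mean absolute per-byte difference between two equally sized images."""
    if len(a) != len(b) or not a:
        raise ValueError("images must be non-empty and the same size")
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def first_stable_epoch(images, threshold=2.0):
    """Index of the first epoch whose output differs from the previous
    epoch's output by less than `threshold`, or None if none does."""
    for i in range(1, len(images)):
        if mean_abs_diff(images[i - 1], images[i]) < threshold:
            return i
    return None


# Toy data standing in for four epochs of 4-pixel grayscale images.
epochs = [bytes([0, 0, 0, 0]), bytes([100, 100, 100, 100]),
          bytes([101, 100, 100, 100]), bytes([101, 100, 100, 100])]
stable = first_stable_epoch(epochs)
```

Outputs can also converge because the LoRA has overfit, so it is worth checking the candidate epoch on a few other seeds and prompts too.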
>>
File: teutonic_knight.png (2.58 MB, 1824x1216)
>>
File: crap.jpg (491 KB, 1160x1696)
Where are all the NEW celeb LoRAs ever since civitai went to shit? civarchive only has the old ones that are already like 2 years old.

>>108993864
>https://files.catbox.moe/t8qgoq.mp4
Bottom left video is another of yours or somebody else's?
>>
File: put2uv.png (2.06 MB, 1216x832)
>>
>>108991417
How fast are we talking?
>>
File: wat.png (52 KB, 678x848)
QRD on conditional vs unconditional? And why does it need both Gemma 4 and Qwen3?
>>
File: 1778661292260467.jpg (3.45 MB, 3328x4864)
>>108995323
This shit sucks ass.
>>
>>
>>108995458
wait so this thing is apparently ACTUALLY honest to god censored in the way no model before ever has been, like it returns legit image blocked outputs INHERENTLY based on training? Really? Why would I bother then? It can't possibly be that good
>>
>>108995547
every other company that's released an image model is currently drowning in lawsuits. ideogram might be the first image model that doesn't open the parent company to litigation.
>>
>>108995557
wat? please tell me all about e.g. BFL drowning in lawsuits, which surely is real and not total BS
>>
>>108995460
Reve 2.0 is a meme, every image looks like a nearest-neighbour SDXL gen at full size, they're forcing stupidly huge awful-looking images for no reason, IDK why they don't offer at least 2K or something, their model clearly cannot properly do 4K at all
>>
File: Flux2-Klein_00152_.png (2.66 MB, 1280x1648)
why the fuck LTX and SULF/EROS whatever based on LTX keep melting anatomy? like tongue and lips, dick and mouth ... DOESN'T THIS MODEL understand how humans are made???
D:<

WHY IS IT LOOKING LIKE SHIET
>>
File: debo_i_fia_00016_.png (3.41 MB, 1433x1792)
>>
"realistic"
>>
File: q_l1spcp.png (1.51 MB, 1344x960)
>>
Any uncensor for ideogram yet?
>>
>>
File: debo_i_fia_00020_.png (2.6 MB, 1433x1792)
>>
File: 235805CUI_00002_.png (854 KB, 1536x1152)
>>
File: 000732CUI_00002_.png (709 KB, 1536x1152)
>>
>>108995412
>>
>>108995758
Is she the gymnast from P5?
>>
>>108995363
dam how did you make this? really cool
>>
>>108995846
still not fast enough to be used for real time video playback.
>>
>>108995771
please keep the slop out of /ldg/
>>
File: debo_i_fia_00026_.png (2.88 MB, 1433x1792)
>>108995869
>>
File: 4575.png (416 KB, 2201x1139)
kino alert
>>
>>108995384
I assume the community moved on to some telegram channel. No idea which.
>>
>>108995890
>SeaDance
>>
>>108995384
https://huggingface.co/malcolmrey
https://huggingface.co/SDim1973/Z-Image-Loras
there were also some upload folders maintained in the /r/ thread before it was kill, though i cant find the link
keep in mind all of these are shit, then again so was what was on civitai
no idea where that community moved to
>>
>>108995384
>>108995979
>there were also some upload folders maintained in the /r/ thread before it was kill, though i cant find the link
i think here https://rentry.org/ldg-lazy-getting-started-guide#defunct-rrealisticparody
>>
File: keeping-you-safe.png (346 KB, 1569x1663)
I swear I'm going to crash out if Krea 2 gets a lobotomized release.
I can't take another Flux or Qwen slopped model.
Don't even get me started on Ideogram. Jfc.
>picrel is bfl
>>
Did trellis make mainline comfy yet?
>>
>>
>>108995997
>the models they name as 'high risk' are all chinese API-only now
>bragging about making the local cloud models more censored than the cloud ones
local is pathetic
>>
>>108996034
Answered my own q. no. Trellis requires custom nodes.

If you are on rdna2, like me, you can't run trellis at all.

As of last month, rdna3 support emerged:
https://github.com/CalebisGross/TRELLIS-AMD
(an amd fork of Microsoft's code)

And apparently rdna4 *does* handle Trellis. I guess HIP supports it. I don't have an rdna4 card to try it out on.
>>
>>108995997
Krea 2 Large API version isn't impressive as is, it can't do realistic architecture and shit nearly as well as the original Flux Krea could, I don't think it'd be anything to write home about. Also yeah Ideogram 4.0 is a bit of a joke, IDK why the community is suddenly fine with like, ACTUAL censorship that really exists in a way it didn't actually ever before in other models
>>
File: 1770780150561527.png (3.73 MB, 2176x1216)
>>
>>108996150
I wish I could feel like this
>>
By not releasing the 5080 Super at 24gb of vram, nvidia is helping AMD dump their 24gb 7900xtx at a much higher price than would otherwise be possible.
>>
>>108996142
The censorship can be easily bypassed with the prompt builder node now. The problem is that ideogram 4 isn't meant for i2i. How the fuck are you making a model aimed at creating posters and product ads but can't even insert the product you're advertising?
>>
>>108996142
It's more about how expressive Krea 2 is.
Every damn local image model released has this rigid slop look to it. Even ZIT and Z-Image.
All of the local models we've been getting are for benchmaxxx scores, not sovl.
We need a model to where we can just prompt wild and crazy shit.
That gives local a base model to do crazy finetunes like we had in the SD1.5 days.
>>
>>108996241
We need a model that supports detailed descriptions of the face and body. Then, we need an llm that's light but can turn simple prompts into the format.

We need the same thing for what you call "crazy". In other words, we need way more detailed training, but an llm to help make it easy for us.

I have made it a game to try and find the pointy chins that ZIT and every other ai image creator everywhere prefers to make.

Did you know Moot has an "ai chin"? But his wife doesn't.
>>
>>108996213
There was a cope rumour recently that Super refresh will still be coming this year. I don't believe it, but who knows.
Honestly pc hardware is fucked until 2030 at least, perhaps forever imo. I have no idea how I am going to upgrade from my 3060 + 32gb ram.
>>
What's a good model for touching up photographs? I've taken a bunch of concert photos over the years that would be otherwise great except for blur because my dumb ass keeps forgetting to set it correctly.
>>
>>108996262
I dunno try your luck with Klein.
>>
File: 1760361011055195.png (3.43 MB, 1984x1344)
>>108996155
>>
>>
>>108996262
AI models don't fix blurry images, they just fill in the missing data with whatever they think it should be.
>>
File: 1768592367309443.png (1.93 MB, 1451x1084)
>>108991508
>AI can't create something it has never seen before
>>
>>108996313
Yes gremlins already exist
>>
File: groping brazilian miku.png (1.07 MB, 896x1152)
I dunno why it is struggling with biting own lip and giving me the weird vampire teeth.
>>
Anima really isn't that smart... it needed a better base and better captions
>>
>>108996341
There isn't a single local Anime base model.
They are all finetunes of another model.
>>
>>108996341
it's pretty fucking good for a model smaller than SDXL even when including the text encoder
>>
>>108996262
i have successfully used klein to turn my noisy RAW files into clean photos
>>
>>108996377
it is much slower than sdxl tho
>>
>>108996407
you will get the image you want much faster than with SDXL though since you won't have to deal with controlnets, inpainting or adding text with an image editor.
>>
>>108996377
It's slightly better, but it kind of creates messy crap. The prompt adherence is roughly stable cascade level.
>>
jordach status?
>>
>>108995758
box?
>>
>>108993864
Thanks but how do I get the workflow out of this?
It's only opening a load video node where I drag it
>>
>>108991508
But what hasn't it seen?
>>
>>108996451
https://files.catbox.moe/3h3vc3.png
>>
>>108996546
I will ask google to turn this into a song :^)
>>
>>108996235
why should I care though, like please explain how this model is somehow so good that it's actually fine it takes censorship ten times farther than any model ever did before it
>>
>>108996241
is this thread just shills now? Krea 2 looks like uninspired bullshit just like most other recent models, it's not bad but not great either, wtf is this nonsense
>>
File: ComfyUI_temp_bpmck_00005_.jpg (585 KB, 1936x1088)
>>
>>108996559
localkeks are absolutely buckbroken, it's stockholm syndrome at this point. they're still coping with flux klein. heckin based uncensored china sold them all out to comfycloud API and now they have to lick western corporate boot and beg for censored scraps
>>
>>108996422
>The prompt adherence is roughly stable cascade level.

what the fuck are you talking about you absolute moron? This has to be bait. Anima can understand lengthy natural language prompts perfectly, Cascade (which DID NOT EVEN HAVE BETTER ADHERENCE THAN XL, YOU FUCKING RETARD, IT WAS STILL ON CLIP) cannot.
>>
>>108996559
Because you can direct scenes with it. But it's still kind of useless without the ability to insert image references or i2i. I don't expect the general's local schizos to understand why being able to control the image is a good thing though.
>>
>>108996570
stop taking bait retard
>>
>>108996572
wtf does "direct scene" mean in a direct, practical sense, though?
>>
>>108996582
It means not just using it as a 1girl gacha
>>
>>108996583
so, you mean there's absolutely nothing remotely interesting about it compared to numerous models that already exist? Got it. How much is Kekgram paying you to shill their faggot stop-sign riddled nonsense BTW?
>>
>>108996591
I get maybe ₹95 every 100 posts
>>
>>108996563
>is this thread just shills now?
no i simply stopped replying to them
>>
>>108996559
The thing about the censorship is that there is no censorship, you just have to follow the stupid json rules (or use the KJnodes node), and it'll happily generate whatever you ask.
Like check this out:
https://files.catbox.moe/ghv0gz.png
I deliberately phrased the prompt in a way that would trigger any censorship filters, and it generated it happily.
It can't do genitals, just generates blank skin, but that's a training data issue and puts it in exactly the same position as every other model on launch.
It's fast, you can gen high resolution, and the regional prompting is a feature I haven't seen on any other model.
>>
>>108996474
works on my machine. or just open it the old-fashioned way
>>
File: Wanimate_00185.mp4 (1.79 MB, 928x1280)
>>
File: 1779669669825899.png (3.74 MB, 2176x1216)
>>
is there a way to automatically copy all of the catbox links when you upload multiple files?
>>
>>
File: 1757329823443051.jpg (767 KB, 2176x1216)
>>
>>108996664
ask claude to slop up a userscript
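If a browser userscript is overkill, a rough local sketch of the same idea is to upload the files yourself and join the returned links. Everything named here is an assumption for illustration: `collect_links` and `catbox_upload` are made-up helpers, and the endpoint and form fields (`reqtype=fileupload`, `fileToUpload`) are what the catbox upload API is commonly described as accepting, so check before relying on it.

```python
from typing import Callable, Iterable

# Assumed upload endpoint; verify against catbox's own tools page.
CATBOX_API = "https://catbox.moe/user/api.php"


def catbox_upload(path: str) -> str:
    """Upload one file and return the link text from the response (hypothetical helper)."""
    import requests  # third-party, imported lazily so the rest works without it

    with open(path, "rb") as fh:
        resp = requests.post(
            CATBOX_API,
            data={"reqtype": "fileupload"},
            files={"fileToUpload": fh},
            timeout=120,
        )
    resp.raise_for_status()
    return resp.text


def collect_links(paths: Iterable[str], upload: Callable[[str], str]) -> str:
    """Upload every path with `upload` and return all links, one per line."""
    links = [upload(p).strip() for p in paths]
    return "\n".join(links)
```

Calling `print(collect_links(["a.png", "b.png"], catbox_upload))` would give one block of links ready to paste. The uploader is passed in as an argument so it can be swapped for litterbox or a dry-run stub.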
>>
>>108996694
nah, just wanted to know if there was a native button i was missing
>>
File: 1772666650067875.jpg (738 KB, 2176x1216)
>>
>>108996639
>https://github.com/Comfy-Org/ComfyUI/pull/14216
Could you try this PR?
I'm curious to see if you get better results. It doesn't need pose conditions and stuff.
>>
File: 48.png (1.87 MB, 896x1184)
why is local ideogram so grainy/greasy?
>>
>>108996778
If you are using the comfy default workflow it is shit and fries the image. Override to lower cfg earlier like around 70%.
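One way to read "lower cfg around 70%" is a step schedule: full guidance for the first ~70% of the sampling steps, then a lower value for the rest. A minimal sketch of that reading, not tied to any ComfyUI node; `cfg_schedule` and the default values are hypothetical and only illustrate the idea.

```python
def cfg_schedule(total_steps, high_cfg=4.0, low_cfg=1.0, switch_frac=0.7):
    """Return one CFG value per step: high_cfg up to switch_frac of the run, low_cfg after."""
    if total_steps <= 0:
        raise ValueError("total_steps must be positive")
    if not 0.0 <= switch_frac <= 1.0:
        raise ValueError("switch_frac must be between 0 and 1")
    # Index of the first step that uses the lower guidance value.
    switch_step = round(total_steps * switch_frac)
    return [high_cfg if i < switch_step else low_cfg for i in range(total_steps)]
```

With 10 steps and the defaults above, the first 7 steps use the high value and the last 3 use the low one. How you feed a per-step value into the sampler depends on the UI and nodes you use.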
>>
Maybe also caused by quantization I dunno I wouldn't put it past them to intentionally gimp the fp8.
>>
File: 1.jpg (1.51 MB, 3264x2448)
>>108996396
>>108996266
I'll try that.

>>108996288
Here's an example of the kind of stuff I'm trying to unfuck. Shitty camera plus poor settings. Good show, incidentally.
>>
>>
>json prompting is too difficult!!
just let llm write it for you
says a lot about the technical know-how of the image gen "community"
>>
>>108996546
ai gem free wrotted this song

https://files.catbox.moe/gyzm8b.mp3

genned on my own electricity

I think it's as good as any Disney slop. hilarious it read a descriptor "cinematic".
>>
>>108996866
You don't even need this. Just use the kj prompt builder node and give basic regional instructions.
>>
>>108996878
that node is great for control but too much effort for low effort prompts
>>
at least ideogram is better than microsoft lens.
>>
>>108996810
This is practically unsalvageable especially since your phone's shitty filter messed it up even more. It's not going to know what that guy looks like.
>>
>>108996896
i imagine reference images could help. plug his face in. plug the band logo in. the whole image will look different but no one will know it was edited
>>
Personally, instead of taking photos, I just memorize what I see and keep the prompt.

So, instead of taking a woman's photo, say one who is jogging or whatever, I type in all the things she is wearing and descriptors. sometimes I use an llm to figure out what those things are called. then, I gen it.
>>
>>108996910
At that point just generate new ones from scratch, they will probably look better. Add Miku up on stage with them while you're at it.
>>
>>108996775
i tried it a few days ago. it just got stuck in the sampler forever when i increased the resolution to 720p, so im stuck at low res.
less breast jiggle, poor color (probably due to low res) and longer gen time.
maybe i'll try again when they've updated the workflow and nodes
>>
>>108996927
>>108996927
>>
>>108991319
same. Anima has theoretical image input capabilities, but this topic is largely underexplored https://github.com/Mirumo0u0/ComfyUI-Cosmos-Reference


