[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


Beg More Fag Edition

Discussion and Development of Local Image and Video Models

Previous: >>108711911

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
Damn jeets stoled my job
>>
File: 00020-2077476732.png (1.95 MB, 896x1152)
1.95 MB PNG
>>
>>108718153
No. And yes Gemini 3 Flash uncritically can perfectly caption hardcore NSFW including bestiality.
>>108717558
It likes very very long prompts more than short ones, can do most styles. Most people sort of underuse Klein T2I IMO.
>>
>>108718245
woops *unironically
>>
>>108718114
He could have at least done 512x512 like the original Chroma instead of fucking 256x256, which had literally no chance whatsoever of producing worthwhile results from day one
>>
webp really is incredible
it's insane that 4chan still doesn't support it
>>
i discovered hard cuts for wan i feel like spielberg
>>
>>108718304
based kinosmith
>>
>>108718295
It has good compression efficiency but most CDNs serve webp images at absolutely fucking dogshit quality for some reason which makes me hate it.
>>
>mfw Resource news

04/29/2026

>Z-Anime | Full Anime Fine-Tune on Z-Image Base
https://huggingface.co/SeeSee21/Z-Anime

>QuantVideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization
https://github.com/svg-project/Quant-VideoGen

>World-R1: Reinforcing 3D Constraints for Text-to-Video Generation
https://github.com/microsoft/World-R1

>Benchmarking Layout-Guided Diffusion Models through Unified Semantic-Spatial Evaluation in Closed and Open Settings
https://github.com/lparolari/cobench

>VibeToken: Scaling 1D Image Tokenizers and Autoregressive Models for Dynamic Resolution Generations
https://github.com/SonyResearch/VibeToken

>OmniVTG: A Large-Scale Dataset and Training Paradigm for Open-World Video Temporal Grounding
https://github.com/oceanflowlab/OmniVTG

>Refinement via Regeneration: Enlarging Modification Space Boosts Image Refinement in Unified Multimodal Models
https://github.com/LeapLabTHU/RvR

>SketchVLM: Vision language models can annotate images to explain thoughts and guide users
https://sketchvlm.github.io

>Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation
https://tuna-ai.org/tuna-2

>Prefill-Time Intervention for Mitigating Hallucination in Large Vision-Language Models
https://github.com/huaiyi66/PTI

04/28/2026

>Illustrious XL & NoobAI-XL Style Explorer
https://github.com/ThetaCursed/Illustrious-NoobAI-Style-Explorer

>LTX Desktop 1.0.5
https://github.com/Lightricks/LTX-Desktop/releases/tag/v1.0.5

>Meta-CoT: Enhancing Granularity and Generalization in Image Editing
https://shiyi-zh0408.github.io/projectpages/Meta-CoT

04/27/2026

>PixlStash 1.1.0 Update
https://pixlstash.dev/whatsnew.html

>AURA AI Studio Vault: One-stop management app for models, images and more
https://github.com/TheGho7t/AURA-AI-Studio-Vault

>UniGeo: Unifying Geometric Guidance for Camera-Controllable Image Editing via Video Models
https://mo230761.github.io/UniGeo.github.io
>>
>mfw Research news

04/29/2026

>Golden RPG: Confidence-Adaptive Region-Aware Noise for Compositional Text-to-Image Generation
https://arxiv.org/abs/2604.25314

>A Systematic Post-Train Framework for Video Generation
https://arxiv.org/abs/2604.25427

>ResetEdit: Precise Text-guided Editing of Generated Image via Resettable Starting Latent
https://arxiv.org/abs/2604.25128

>ViPO: Visual Preference Optimization at Scale
https://liming-ai.github.io/ViPO

>GramSR: Visual Feature Conditioning for Diffusion-Based Super-Resolution
https://github.com/aimagelab/GramSR

>Exploring Time Conditioning in Diffusion Generative Models from Disjoint Noisy Data Manifolds
https://arxiv.org/abs/2604.25289

>The Thinking Pixel: Recursive Sparse Reasoning in Multimodal Diffusion Latents
https://arxiv.org/abs/2604.25299

>DDA-Thinker: Decoupled Dual-Atomic Reinforcement Learning for Reasoning-Driven Image Editing
https://arxiv.org/abs/2604.25477

>Mutual Forcing: Dual-Mode Self-Evolution for Fast Autoregressive Audio-Video Character Generation
https://mutualforcing.github.io

>Learning Illumination Control in Diffusion Models
https://nishitanand.github.io/relighting-diffusion-website

>Learning from Noisy Preferences: A Semi-Supervised Learning Approach to Direct Preference Optimization
https://arxiv.org/abs/2604.24952

>Improving Diversity in Black-box Few-shot Knowledge Distillation
https://arxiv.org/abs/2604.25795

>QFlash: Bridging Quantization and Memory Efficiency in Vision Transformer Attention
https://arxiv.org/abs/2604.25306

>When the Forger Is the Judge: GPT-Image-2 Cannot Recognize Its Own Faked Documents
https://arxiv.org/abs/2604.25213

>The Forensic Cost of Watermark Removal
https://arxiv.org/abs/2604.25491

>GPT-Image-2 in the Wild: A Twitter Dataset of Self-Reported AI-Generated Images from the First Week of Deployment
https://arxiv.org/abs/2604.25370

>Can We Change the Stroke Size for Easier Diffusion?
https://arxiv.org/abs/2603.26783
>>
File: 1746979048377051.png (1.09 MB, 1080x731)
1.09 MB PNG
>>
>>
File: 6908.png (527 KB, 640x928)
527 KB PNG
>>
So this thing came out:
https://civitai.red/models/2585622/ultrareal-fine-tune-anima?modelVersionId=2904690

Havin' a hard time genning good images with it, though.
>>
>>108719007
>Anima1
I assume he means preview1. Odd decision.
>>
>>108719088
Pretty sure he fucked up. Worked with my anima preview 3 loras just fine.
>>
File: 1768125419259229.png (1013 KB, 896x1152)
1013 KB PNG
>>108719007
we are regressing, this is pure slop
>>
i been generating kinos for 18 hours straight
>>
>>108719209
post some
>>
>>108719088
Yeah it is odd. I asked in the discussions and he deleted my message pretty quickly so I assume it is Anima Preview 1 if he's taken offense to me asking a simple question.
>>
>>108719007
I think the lenovo lora works better.
>>108719273
Fucking kek, what a retard.
>>
>>108718410
>>108718420
Thanks
>>
> >108718410
> >108718420
Fuck off
>>
i'll get banned
>>
File: 595912070197977.png (1.61 MB, 1152x1600)
1.61 MB PNG
>>
No discussions just slops sirs,
>>
File: 129936321467773.png (1 MB, 832x1216)
1 MB PNG
>>
File: 759449603795249.png (1.53 MB, 1152x1600)
1.53 MB PNG
>>
ltx is pretty good at fixing limbs. i make a 1 second clip with the broken reference image, prompt what the intended pose was, and then take the last frame
>>
>>108719684
Show a in/out ?
>>
>>108719684
why are you telling everyone my secret?
can you stfu please?
>>
File: 944906051179993.png (2.6 MB, 1344x1728)
2.6 MB PNG
>>
File: datatset(1).png (3.89 MB, 4766x7310)
3.89 MB PNG
https://github.com/zty0304/Anime-layers
Babe, babe wake up! New model just released that can take apart any anime illustration like a Photoshop PSD file, line art, flat colors, shadows, all separate layers!
>>
>>108719917
based now i can troll artists with it
>>
I NEED GPU
I NED GPU
I NEED GPU
INEED GPU
>>
>>108719917
im running it right now
>>
>>
>>108720045
catbox?
>>
>>108720045
helo beauty baby i am give kissing on youre bobs
>>
File: 454620313742262.png (2.25 MB, 1152x1600)
2.25 MB PNG
>>
File: AnimaOutput_124356151.png (1.39 MB, 832x1248)
1.39 MB PNG
>>108719007

this is just regular Anima Preview 3 with the Turbo lora, same prompt as my Klein 9B one a bit earlier lol
>>
File: 1766309393962459.png (2.31 MB, 1448x1086)
2.31 MB PNG
>>
>>108720135
It's Preview 1. Apparently it was just civitai fucking up but he did reply to my message.
>>
>>108720135
yeah i was just pointing out that Anima normally with Turbo lora already looks like my picrel if you use DPM++ 2S Ancestral Simple
>>
File: 1769784750163886.png (2.76 MB, 1086x1448)
2.76 MB PNG
>>
>>108719684
Thank you for letting us know.
>>
>>108719917
wow this is cool. i've always thought this would be the ideal strategy for an anime model. is it open weights?
>>
>>108719917
Do the "layers" have transparency? If not this is worthless.
>>
>>
>>108719917
Nice
>>
File: h8058s.png (1.71 MB, 1280x768)
1.71 MB PNG
>>
Need MayLi making out with Lois Griffin in Stonetoss style
>>
Sarah Peterson desu...
>>
File: Untitled.png (1.07 MB, 1024x1024)
1.07 MB PNG
>>
>>108720436
This is some next level quality, did you achieve that with Anima alone?
>>
i tuned ltx2.3 on music videos, time to start a 30 hour 4k generation and see what happens
>>
Ive made a little dream of mine come true.

- a tablet mounted on the sofa that serves solely as a display
- a room microphone
- on my first 3090, a own custom finetune of flux2klein 9b SNOFS, Cohere STT with FireRedVAD
- on my second 3090: Qwen3.6 35b Heretic, fine-tuned Qwen3-TTS
- a custom-developed pipeline

now I can lie on the couch, scratch my balls while letting the system generate images. this really makes the hobby so much more fun.
>>
>>108720698
Post pic of setup
>>
>>108720721
butcept for the scatching.
>>
>>108720698
I'm too uptight, I must be at the computer. I don't even like using my phone.
>>
>>108720574
Anima and just a basic hires fix.
https://files.catbox.moe/jfpkoa.png
>>
Is it a skill issue that I can't make Flux generations better than Pony ones?
Given, I really only use them to make portraits for chatbot character cards.
I just thought with models that are twice as big as pony models, they'd look a little better at least.
>>
So why isn't stability matrix shilled in the OP?
Is it some kind of Chinese malware?
I tried it the other day and it was actually pretty damn nice
>>
>>108720789
It's fine I just use comfy directly so I don't bother. Stability Matrix is good for noobs.
>>
File: ComfyUI_22993.png (3.54 MB, 1920x1080)
3.54 MB PNG
>>108720755
Flux.1 Dev? That needs a lot of setup and a more "pruned" style of NL prompting to get the best out of it. Compared to newer models, it is fairly limited.
>>
>>108720698
>a room microphone
so you yell out "computer, generate bobs and vagan"?
>>
File: Untitled.png (2.01 MB, 1280x768)
2.01 MB PNG
>>
>>
File: _AnimaPreview3_00615_.jpg (501 KB, 1248x1608)
501 KB JPG
>>
File: cover1.jpg (229 KB, 1920x1080)
229 KB JPG
>browsing loras on civitai
>half of them are tranny porn
is this really the ultimate desire for men when they have no restrictions?
>>
File: monke.jpg (104 KB, 450x700)
104 KB JPG
>>108720838
The eyes remind me of pic related.
>>
>>108720879
>>half of them are tranny porn
The internet is a mirror
>>
south park style for anima when
>>
>>108720879
Must be your recommendations.
>>
>>108720504
forced unfunny meme
>>
File: _AnimaPreview3_00633_.jpg (424 KB, 1248x1608)
424 KB JPG
>>
>>108720879
It's like three niggas making 90% of them
>>
>>108720890
Poorfag can't train?
>>
>>108720879
>is this really the ultimate desire for men when they have no restrictions?
>no restrictions
8^)

There are... *no*... restrictions?
>>
>>108720894
>recommendations
no i'm typing my model name into the search bar
>>
>>
>>
>obtained images
>no longer wish to clean, caption, and train
its not even coom pics :(
>>
>>
File: 1777139007041328.jpg (102 KB, 1920x1080)
102 KB JPG
>>108720829
jebby a CUTE!
>>
>>108721055
have your ai do it
>>
>>108721105
the best bakers go through manually to ensure each caption is perfect and each image is pristine after the computer does its job
>>
>>108721116
>ai cant compare to the heart and soul of a human
you sound like a drawfag
>>
>>108721131
ok
>>
>>108721055
>clean
Not really needed unless you have trash full of watermarks. Maybe bucketing but it's not a dealbreaker
>caption
>what are captioner apps
>train
Go to sleep and wake up to lora retard
>>108721131
>believing ai and leaving his captions without proofreading
retard
>>
>>
>>108721143
>captioner apps
please, i have my own script
>>
>>108721144
>can do arm warmers
>can't do realistic amputees
>>
>:^(

I'm installing ace step cpp.

lish me ruck
>>
hol up, dcw isn't in the cpp app? lol
>>
>>108721293
found it.
>>
fyi ace-step.cpp with Vulkan (with my AMD rdna2 card) is way faster than comfy, for some reason, idk why.
>>
>>108721305
What backend does Comfy use with Rdna2?
>>
>>108721305
It's also way faster on Nvidia, and it's magnitudes faster than the official Gradio or any other pytorch implementation as well.
>>
>>
>>
Is joycaption still considered the best captioning model?
>>
>>108721393
rocm. mine is gfx1030, and it's really hard to look this up, and I have long since forgotten which part of amd's driver stuff it is, but one of them was dropped, but cdna2 is still built (can't be used with mine).
>>
>>108721435
no its either qwen or gemma
>>
File: 1770093852172549.jpg (510 KB, 832x1216)
510 KB JPG
Anima still can't do toes but the arm and leg spaghetti here is kind of blowing my mind. Pretty good.
>>
>>
>>108721435
It's a retarded meme.
Just grab abliterated Qwen 3.6 MoE or Gemma 4 MoE and write a decent system prompt.
Can go with dense variants too if you have 24gb vram.
>>
>>
https://files.catbox.moe/d3jwcw.mp3
It turned out alright imo.

>>108721416
What settings do you use?

in comfy I was using exp_heun_2_x0_sde, and tan 2 for the scheduler. and often 500 steps, actually.
>>
>>108720168
not bad, but not as squishy as a real kid's drawing.

real kids use dead reckoning to draw. They start on a part. Then then hook it together with another one. And another. then uh oops, not quite coming together, various compromises are implemented. ai could do this, but nobody wants to, they want "pro" art, which begins by squinting, and working in layers (very conceptually similar to frequency, actually).
>>
>>108721507
>>108721475
Nice, set up a script to work with ollama/gemma. Thanks
>>
>>
cozy breas
>>
File: _AnimaPreview3_00644_.jpg (526 KB, 1248x1608)
526 KB JPG
>>
File: kono.png (507 KB, 512x720)
507 KB PNG
>>
File: comfy_.jpg (573 KB, 775x1050)
573 KB JPG
>>
>>108721779
What do those numbers mean
>>
>>108719917
>demo coming soon
>files updated one month ago
>>
>>108721829
https://danbooru.donmai.us/wiki_pages/twitter_cutting_game
>>
File: file.png (1.87 MB, 1199x1312)
1.87 MB PNG
local will never reach this level of capability
>>
>>108721874
thank god
>>
>>108721874
Ok I am impressed that this one didn't get filtered at least.
>>
>>108721874
informative. up-it-two-thumbs
>>
>>108721874
see if it can summarize https://www.bbc.com/news/world-us-canada-12994248

if it does, be sure to notify any and all news outlets.
>>
>>108721813
A small treat (wild berries) may help.
>>
>>108719007
can somebody extract a lora of this from preview1? so i can use this shit on preview3?
>>
what's the best way to train a lora for ace step?
>>
File: 1750229276648919.mp4 (3.22 MB, 1264x1872)
3.22 MB MP4
>>
File: ss_04-30-2026_003.png (152 KB, 1238x730)
152 KB PNG
Hello saars I code the ollama image tagger app. Yes good?
>>
File: Flannery_Strive_039.png (926 KB, 1152x896)
926 KB PNG
>>
File: 54654.jpg (452 KB, 2517x1268)
452 KB JPG
>>108722117
I made this. I also made video version which doesn't have auto-captioning yet but you can crop, cut videos and caption manually with it.
>>
>>108722148
Nice, mines just a single python script
>>
File: _AnimaPreview3_00684_.jpg (473 KB, 1248x1608)
473 KB JPG
>>108722117
>>108722148
nice
>>
>>108722154
Mine started from couple of batch files then bloated into whole app.
>>
>>108721874
lmao
>>
ANIMA PREVIEW 4 WHEN???
>>
I need an izzat llm that monitors everything I do and tells me my izzat.
>>
>>108722191
-500 izzat
>>
File: May_Nude00361.jpg (223 KB, 1152x896)
223 KB JPG
>>
I still don't understand the diff between /sdg/ and /ldg/.
>>
File: _AnimaPreview3_00048_.png (1.78 MB, 1536x1024)
1.78 MB PNG
>>
>>108722218
Containment general for a discordfag who used to cause drama there.
It kinda works but this general has developed its own dramas after the split.
A few other anons schizopost there too sometimes.
>>
File: ComfyUI_00495_.png (1.77 MB, 1500x844)
1.77 MB PNG
>>
File: Dawn_00250_peep.jpg (161 KB, 1152x896)
161 KB JPG
>>
>>108722221
brown
>>
File: ComfyUI_00002_.jpg (441 KB, 1024x1024)
441 KB JPG
original prompt do not steal
>>
>>108722221
Catbox or LoRA... Please explain anon
>>
>>108722985
piss filter and melted fingers, it's chatgpt.
>>
>>108722767
uwhoah is this real
>>
>https://github.com/muooon/EmoSens
Anyone tried this shit?
>>
I still got a few kinos in me after I thought I'd run out
>>
>>108722985
More than one named character doing something besides standing, it's API
>>
File: 00009-2765899386.png (2.43 MB, 1248x1824)
2.43 MB PNG
>>
File: 00010-2867639613.png (2.56 MB, 1248x1824)
2.56 MB PNG
>>
File: 00024-601912342.jpg (558 KB, 1536x2304)
558 KB JPG
>>
File: Shaunagatari.png (846 KB, 1152x896)
846 KB PNG
>>108723124
>>
>>108723124
Is janny brown pokemon fan ximself or something?
Lol @ removing that but leaving barely censored pedo porn.
>>
File: 1755924730015712.png (137 KB, 1166x1195)
137 KB PNG
why did my post get removed? especially considering it was objectively true
>>108723389
I wish you the best of luck with your suicide, faggot
>>
File: 1673413841964530.png (266 KB, 785x1000)
266 KB PNG
>>108723606
>I wish you the best of luck with your suicide, faggot
>>
>>108723606
Those fucking Japanese ruining the internet!
>>
File: 00043-1854806733.png (1.97 MB, 1024x1536)
1.97 MB PNG
ernie base has some good potential. just needs a fine-tuning of photorealistic non synthetic images.
>>
>>108723606
What's up with Peru of all places?
>>
>>108723651
Cool, liked your earlier gens
>>
>>108723608
lol no wonder you're a friendless retard that looks for validation in fucking 4chan
>>108723629
Nips look at you and feel revulsion
>>
>>108723651
I wrote it off for now but I am interested in taking a crack at it if I see evidence that it responds well to training.
One anon here posted a meme lora with very poor facial likeness, which was not very encouraging to say at least.
>>
File: 1575712385869.png (73 KB, 1029x1040)
73 KB PNG
>>108723665
>lol no wonder you're a friendless retard that looks for validation in fucking 4chan
>>
>>108723665
>nips
>feel
uh okay good one
>>
>>108722221
Anima 3... For sure
>>
>>108721527
>500 steps

Which model? 500 steps is too much for Turbo, anyways on Turbo XL not too many fancy settings are needed. Just Scragnog custom VAE, DCW Double 0.05 for both, 8 steps. I master my gens with Matchering 2 to improve the sound quality even further. The XL SFT model isn't as creative and doesn't use as many instruments as Turbo XL which means it's also significantly worse at prompt following so I don't use it or its merges anymore.
>>
File: 1752551173518241.png (120 KB, 722x667)
120 KB PNG
>wanschizo's a petrol-sniffing abbo
I mean, that explains him having the brain damage required to be obsessed with a TV show for 5-year-olds
>>
File: 1573930113221.jpg (35 KB, 231x590)
35 KB JPG
>>108723712
>I mean, that explains him having the brain damage required to be obsessed with a TV show for 5-year-olds
>>
>>108723723
nice selfie of you when you see an unattended jerrycan
>>
>Only one person uses this very popular reaction image/meme/jak
I wish these wannabe detective schizos could understand how ludicrous they look.
>>
>>108723747
>nice selfie of you when you see an unattended jerrycan
Why haven't you taken your meds yet little man?
>>
>>108723757
>>108723764
>subhuman retard doesn't understand how 4chan works
yeah bro the fact that all those pics have the exact "randomized" filename is nothing but a coincidence
what a fucking mongrel
>>
>>108723785
But seriously, why haven't you taken your meds?
>>
FYI, this is the kind of "contribution" the lobotomite abbo makes to this site
https://desuarchive.org/a/thread/280916078
>>
why are we fighting again
>>
>>108723983
>we
It's just an archive schizo having a meltdown.
>>
>1boy, male focus
>get a man with a vagina
ebin
>>
>>108724000
shoulda put cuntboy in the negatives
>>
>>108724000
which model? lmao
>>
>>108724000
chroma has a habit of giving women thick penises even when you don't ask for it.
>>
>>108723048
What tf is that readme
>>
when can local do this?
https://files.catbox.moe/mr1or8.mp4
>>
>>108724075
What having millions of furry futa porn images in the dataset does to a motherfucker's model.
Serious talk though, it's probably the captioning LLM being unable to tell it when the "woman" has a penis and describing how her vagina gets penetrated.
>>
>>108724094
lel I recognized him before he turned around.
>>
>>108724102
Yeah, I usually have to describe the woman's vagina or mention the clitoris or it usually gives everybody a penis if you mention it once.
>>
>>108724094
it can already do absolutely boring SFW videos

but yes this is longer and with better audio than average

BUT is this using lower end hardware? If you had a high end nvdia gpu farm server thing you could already have done more with hunyuanvideo like a year ago
>>
>>108723989
I mean, yeah, there is only one person involved in that discussion, since the other party is a subhuman monkey
>>
>>108724094
>he is not flashing his dick overflowing with cum
>the girls are not showing boucing tities
excuse-me but whats the usecase for local being able to make boring videos? indian scamming?
>>
is insectfucker anon in?
>>
File: a-yume04_00154_.png (893 KB, 832x1296)
893 KB PNG
>>
>>108724146
Anon, take your meds. You're clearly mentally unwell :(
>>
>>108724241
another certified wanschizo classic
all that petrol has left your brain so shriveled you can't even come up with new retorts
>>
>>108724276
Take your meds.
>>
let's ALL take our meds
>>
>>108724276
You definitely need your meds bro, like urgently.
>>
>>108724276
Nta but if everyone else is "a schizo" for you you might want to ask yourself if you aren't the schizo in the room
Faggot
>>
ok i think i have done everything AI can do and i am now bored.
>>
>>108724435
Everything? Show me your BEST cute fart gen anon
>>
>>108724435
come back when insectfucker is here, he'll show you some shit
>>
>fartschizo is also back
of course
>>
File: 1488721134554.png (744 KB, 742x885)
744 KB PNG
Everytime I do batch captioning, the later captions get lower quality than the earlier - missing punctuation, general wall-of-textiness, no capitalisation. Do I have to set it up to flush context after every image or something like that?
>>
>>
>>108724492
unless you can think of a reason why the context from different images is at all useful, then I would say yes
>>
>>108724492
>I am poisoning the context with irrelevant shit, why does the LLM perform worse?
>>
>>108724492
Yeah typically unfortunately that happens if it's not free for each image
>>
>>108724427
>it thinks it's a person
Lmaooooooooooo hope you overdose on gas, you cocksucking ape
>>
>>
>>108724555
this an edit of a real painting or you prompted it?
>>
>>108724775
prooompted it

It was:

Style: 17th-century Dutch Golden Age oil painting
Prompt: A lavish still-life arrangement on a dark velvet tablecloth. A peeled lemon with a long, spiraling rind drapes over the edge of a silver platter. Beside it, a glass triangular prism splits a beam of sunlight into a vivid rainbow that lands exactly on a cluster of green grapes. Leaning against a crystal goblet is a small, weathered piece of parchment with the handwritten ink text "MEMENTO MORI." A tiny, iridescent beetle is perched on the lemon rind.
>>
>>108724555
>memento mori
>no death or decay
baka my head but otherwise kewl gen



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.