[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


File: 00019-1684567118.jpg (1.12 MB, 2432x1664)
1.12 MB
1.12 MB JPG
Previous /sdg/ thread : >>107766236

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Z-Image Turbo
https://comfyanonymous.github.io/ComfyUI_examples/z_image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://huggingface.co/jayn7/Z-Image-Turbo-GGUF

>Flux.2 Dev
https://comfyanonymous.github.io/ComfyUI_examples/flux2
https://huggingface.co/black-forest-labs/FLUX.2-dev
https://huggingface.co/city96/FLUX.2-dev-gguf

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image
https://huggingface.co/QuantStack/Qwen-Image-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Distill-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Edit-2509-GGUF

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2
https://huggingface.co/QuantStack/Wan2.2-TI2V-5B-GGUF
https://huggingface.co/QuantStack/Wan2.2-T2V-A14B-GGUF
https://huggingface.co/QuantStack/Wan2.2-I2V-A14B-GGUF

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://github.com/maybleMyers/chromaforge
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/tg/slop
>>>/trash/sdg
>>>/vp/napt
>>>/r/realistic+parody
>>
File: 1767729961576.jpg (163 KB, 1152x768)
163 KB
163 KB JPG
Thanks for baking anon.
>>
File: 00029-4059726390.jpg (1.83 MB, 3200x1920)
1.83 MB
1.83 MB JPG
>>107785867
you're welcome. i am sorry, i had bad timing, didn't mean to take away the baking opportunity
>>
File: IMG_0948.png (1.37 MB, 768x1152)
1.37 MB
1.37 MB PNG
>>107785902
It's okay :)
>>
File: 10101101.jpg (208 KB, 1536x2048)
208 KB
208 KB JPG
>>
File: 35301353.jpg (279 KB, 2048x1536)
279 KB
279 KB JPG
>>
File: 00031-3873121437.jpg (1.92 MB, 3200x1920)
1.92 MB
1.92 MB JPG
>>
>>
>mfw Resource news

01/06/2026

>LTX-2: DiT-based audio-video foundation model
https://github.com/Lightricks/LTX-2

>DreamID-V:Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer
https://guoxu1233.github.io/DreamID-V

>Diffusion Toolkit v1.10
https://github.com/RupertAvery/DiffusionToolkit/releases/tag/v1.10

>NVIDIA RTX Accelerates 4K AI Video Generation on PC With LTX-2 and ComfyUI Upgrades
https://blogs.nvidia.com/blog/rtx-ai-garage-ces-2026-open-models-video-generation

>fp8_e4m3fn conversion of Gemma 3 12b it text encoder
https://huggingface.co/GitMylo/LTX-2-comfy_gemma_fp8_e4m3fn

>Diffuse: Windows desktop UI for Huggingface Diffusers
https://github.com/TensorStack-AI/Diffuse

>SwinIFS: Landmark Guided Swin Transformer For Identity Preserving Face Super Resolution
https://github.com/Habiba123-stack/SwinIFS

>PartImageNet++ Dataset: Enhancing Visual Models with High-Quality Part Annotations
https://github.com/LixiaoTHU/PartImageNetPP

>E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Models
https://github.com/shengjun-zhang/VisualGRPO

>A Comprehensive Dataset for Human vs. AI Generated Image Detection
https://huggingface.co/datasets/Rajarshi-Roy-research/Defactify_Image_Dataset

>MagicFight: Personalized Martial Arts Combat Video Generation
https://MingfuYAN.github.io/MagicFight

01/04/2026

>Invoke AI 6.10 - now supports Z-Image Turbo
https://github.com/invoke-ai/InvokeAI/releases/tag/v6.10.0rc2

>ComfyUI Wan VACE Video Joiner
https://github.com/stuttlepress/ComfyUI-Wan-VACE-Video-Joiner

>UltraShape 1.0: High-Fidelity 3D Shape Generation via Scalable Geometric Refinement
https://pku-yuangroup.github.io/UltraShape-1.0

>OpenVINO AI Plugins for GIMP
https://github.com/intel/openvino-ai-plugins-gimp/releases/tag/3.2.0

>Comfyui-GeminiWeb
https://github.com/Koko-boya/Comfyui-GeminiWeb
>>
>mfw Research news

01/06/2026

>NextFlow: Unified Sequential Modeling Activates Multimodal Understanding and Generation
https://arxiv.org/abs/2601.02204

>Unraveling MMDiT Blocks: Training-free Analysis and Enhancement of Text-conditioned Diffusion
https://arxiv.org/abs/2601.02211

>BiPrompt: Bilateral Prompt Optimization for Visual and Textual Debiasing in Vision-Language Models
https://arxiv.org/abs/2601.02147

>VIBE: Visual Instruction Based Editor
https://arxiv.org/abs/2601.02242

>ExposeAnyone: Personalized Audio-to-Expression Diffusion Models Are Robust Zero-Shot Face Forgery Detectors
https://mapooon.github.io/ExposeAnyonePage

>Agentic Retoucher for Text-To-Image Generation
https://arxiv.org/abs/2601.02046

>HyperCLOVA X 8B Omni
https://arxiv.org/abs/2601.01792

>Forget Less by Learning from Parents Through Hierarchical Relationships
https://arxiv.org/abs/2601.01892

>TalkPhoto: A Versatile Training-Free Conversational Assistant for Intelligent Image Editing
https://arxiv.org/abs/2601.01915

>MotionAdapter: Video Motion Transfer via Content-Aware Attention Customization
https://arxiv.org/abs/2601.01955

>AFTER: Mitigating the Object Hallucination of LVLM via Adaptive Factual-Guided Activation Editing
https://arxiv.org/abs/2601.01957

>GDRO: Group-level Reward Post-training Suitable for Diffusion Models
https://arxiv.org/abs/2601.02036

>VINO: A Unified Visual Generator with Interleaved OmniModal Context
https://sotamak1r.github.io/VINO-web

>DatBench: Discriminative, Faithful, and Efficient VLM Evaluations
https://arxiv.org/abs/2601.02316

>A Comparative Study of Custom CNNs, Pre-trained Models, and Transfer Learning Across Multiple Visual Datasets
https://arxiv.org/abs/2601.02246

>VAR RL Done Right: Tackling Asynchronous Policy Conflicts in Visual Autoregressive Generation
https://arxiv.org/abs/2601.02256

>DeepInv: A Novel Self-supervised Learning Approach for Fast and Accurate Diffusion Inversion
https://arxiv.org/abs/2601.01487
>>
>mfw MORE Research news

>FFP-300K: Scaling First-Frame Propagation for Generalizable Video Editing
https://arxiv.org/abs/2601.01720

>Improving Flexible Image Tokenizers for Autoregressive Image Generation
https://arxiv.org/abs/2601.01535

>Unified Generation and Self-Verification for Vision-Language Models via Advantage Decoupled Preference Optimization
https://arxiv.org/abs/2601.01483

>Guiding Token-Sparse Diffusion Models
https://arxiv.org/abs/2601.01608

>Image Synthesis Using Spintronic Deep Convolutional Generative Adversarial Network
https://arxiv.org/abs/2601.01441

>Slot-ID: Identity-Preserving Video Generation from Reference Videos via Slot-Based Temporal Identity Encoding
https://arxiv.org/abs/2601.01352

>Luminark: Training-free, Probabilistically-Certified Watermarking for General Vision Generative Models
https://arxiv.org/abs/2601.01085

>Improved Object-Centric Diffusion Learning with Registers and Contrastive Alignment
https://arxiv.org/abs/2601.01224

>Evolving CNN Architectures: From Custom Designs to Deep Residual Models for Diverse Image Classification and Detection Tasks
https://arxiv.org/abs/2601.01099

>YODA: Yet Another One-step Diffusion-based Video Compressor
https://arxiv.org/abs/2601.01141

>NarrativeTrack: Evaluating Video Language Models Beyond the Frame
https://arxiv.org/abs/2601.01095

>CRoPS: A Training-Free Hallucination Mitigation Framework for Vision-Language Models
https://arxiv.org/abs/2601.00659

>TimeColor: Flexible Reference Colorization via Temporal Concatenation
https://bconstantine.github.io/TimeColor

>DynaDrag: Dynamic Drag-Style Image Editing by Motion Prediction
https://arxiv.org/abs/2601.00542

>FreeText: Training-Free Text Rendering in Diffusion Transformers via Attention Localization and Spectral Glyph Injection
https://arxiv.org/abs/2601.00535
>>
>>
File: 453463465465.png (2.93 MB, 1152x1920)
2.93 MB
2.93 MB PNG
desuarchive.org/g/thread/103782080
1 year ago
>>
>>107786576
RIP dance anon
>>
File: deED_zi_00025_.png (2.73 MB, 2048x1280)
2.73 MB
2.73 MB PNG
>>107786576
>First for schizo containment general
appears this anon was finally defeated
>some comfy posts
maybe the last we'd ever get
>>
>>107786690
we lost all the schizos except the one
>>
File: 00033-1071472141.jpg (1.07 MB, 1536x2560)
1.07 MB
1.07 MB JPG
>>
>>
>>
File: deED_zi_00026_.png (2.86 MB, 2048x1280)
2.86 MB
2.86 MB PNG
>>
File: 00035-1799700658.jpg (825 KB, 1536x2560)
825 KB
825 KB JPG
>>
>>
File: deED_zi_00027_.png (2.5 MB, 2048x1280)
2.5 MB
2.5 MB PNG
>>
File: 00036-681521761.jpg (989 KB, 1536x2560)
989 KB
989 KB JPG
>>
File: deED_zi_00028_.png (2.55 MB, 2048x1280)
2.55 MB
2.55 MB PNG
>>
>>107786781
???
>>
File: 657657567566.jpg (248 KB, 832x1216)
248 KB
248 KB JPG
>>
File: 00099-3201643795.jpg (1.13 MB, 2160x1728)
1.13 MB
1.13 MB JPG
>>
File: 554756756.jpg (206 KB, 1216x832)
206 KB
206 KB JPG
>>
File: 00100-3179641390.jpg (2.95 MB, 3040x2080)
2.95 MB
2.95 MB JPG
>>
File: deCL_zi_00042_.png (2.79 MB, 2048x1280)
2.79 MB
2.79 MB PNG
>>
File: 00101-3836762617.jpg (1.98 MB, 3040x2080)
1.98 MB
1.98 MB JPG
gn
>>
why comfy github repo returning 403 on fetch
>>
File: deCL_zi_00043_.png (2.75 MB, 2048x1280)
2.75 MB
2.75 MB PNG
>>107790572
gn

>>107790581
he moved it to the org
https://github.com/Comfy-Org/ComfyUI
>>
File: deCL_zi_00044_.png (2.39 MB, 2048x1280)
2.39 MB
2.39 MB PNG
>>
File: pixlr_20260107020458280.jpg (987 KB, 6799x3676)
987 KB
987 KB JPG
>>
File: deCL_zi_00047_.png (2.7 MB, 2048x1280)
2.7 MB
2.7 MB PNG
>>
File: 5801600586.jpg (312 KB, 2691x2097)
312 KB
312 KB JPG
There is much untruth in a dream
>>
i miss schizo anon
>>
File: pixlr.jpg (121 KB, 1080x530)
121 KB
121 KB JPG
>>
File: winter night.webm (3.9 MB, 1920x640)
3.9 MB
3.9 MB WEBM
>>
>>107790664
It's working now but when I posted that the org was "disabled", so said the remote(GitHub).
Never seen that before except when repos get DMCA requests.
>>
File: winter night 2.webm (3.91 MB, 1920x640)
3.91 MB
3.91 MB WEBM
>>
File: 907272267.png (1.01 MB, 768x1024)
1.01 MB
1.01 MB PNG
Cakes on the griddle
>>
File: image.png (1.69 MB, 984x726)
1.69 MB
1.69 MB PNG
I am using Chatgpt to try to create this weird tales art style like the one on the left.

I was trying a prompt like this

Prompt: 1930s Weird Tales pulp magazine cover art, commercial gouache illustration style. Flat opaque color application with simplified modeling, poster-aesthetic with naturalistic forms. Bold color zones with controlled edges, limited soft transitions only for form modeling. Strong value contrast, theatrical stage lighting, hard-edged cast shadows. Vintage four-color printing look, slight registration offset. Illustrative realism with posterized color planes.

A chaotic, night-time battle scene at a bandit camp. Sir Gottfried, wounded and bleeding, is a "whirlwind of silver and death," fighting multiple bandits simultaneously with his sword and shield. His pose is one of desperate, exaggerated heroism. In the background, there are burning tents and a crossbowman reloading in the shadows. The moon is a "sliver of bone," casting eerie light, while the fires create a warm, dramatic glow on the armor and faces.

Saturated dramatic lighting. 1930s adventure pulp, heroic fantasy illustration, magazine cover composition

NOT / Negative Prompt: NOT: oil painting, visible brushstrokes, impasto, heavy texture, painterly, blended edges, soft focus, atmospheric, chiaroscuro, tenebrism, academic painting, fine art, canvas texture, alla prima, glazing, scumbling, loose brushwork, impressionist, expressionist, textured surface, thick paint application, Rembrandt lighting, naturalistic rendering, photorealistic, digital art, concept art, artstation, modern illustration, 3D render, airbrushed, gradient mesh

but it produced photo on the right, What am I doing wrong? I imagine overprompting but not sure what to remove and what to include
>>
File: 00004-3298743327.jpg (872 KB, 1664x2432)
872 KB
872 KB JPG
>>
File: deED_zi_00032_.png (2.73 MB, 2048x1280)
2.73 MB
2.73 MB PNG
more new captcha puzzles
3 tries to submit
>>
File: 00006-2309528666.jpg (2.7 MB, 3040x2080)
2.7 MB
2.7 MB JPG
>>
Morning anons
Qwen made a huge quokka dog lmao.
>>
File: deED_zi_00035_.png (2.72 MB, 2048x1280)
2.72 MB
2.72 MB PNG
>>107795929
happy work anniversary, office quokka dog
>>
File: 00007-3678956485.jpg (1.34 MB, 2432x1664)
1.34 MB
1.34 MB JPG
think i am gonna try to train, with civitai, a koff3 lora for flux. idk if flux is dead but whatever
>>107795929
morning
>>
>>107795941
Train The Ninth Gate lora... from the book images...
>>
File: deED_zi_00036_.png (2.96 MB, 2048x1280)
2.96 MB
2.96 MB PNG
>>107795941
could always try a z-image lora if you're worried about model relevance
>>
>>107795940
That was back in December, today is just regular birthday
>>107795941
Nice pomni,
morning
>>
File: deED_zi_00037_.png (2.46 MB, 2048x1280)
2.46 MB
2.46 MB PNG
>>107796195
oh, happy birthday office quokka dog
>>
>>107796233
Thank you Debo.
>>
File: 00008-432222307.jpg (1.02 MB, 1664x2432)
1.02 MB
1.02 MB JPG
>>107795977
meh, i'm going to hold off anyway, the civitai site is annoying me. i guess there will be a base z-image released soon?, though i will be unable to run it locally anyway
>>
File: deCS_zi_00062_.png (3.66 MB, 2176x1152)
3.66 MB
3.66 MB PNG
>>107796274
>there will be a base z-image released soon?
no idea. there's been mixed messages about the base model release cuz they've suggested its almost ready but then have been mostly silent otherwise
>>
File: deCS_zi_00065_.png (3.31 MB, 2176x1152)
3.31 MB
3.31 MB PNG
>>107796274
>though i will be unable to run it locally anyway
z-image is pretty small. have you tried the ggufs to see if they'd work locally? surely something can fit

model:
https://huggingface.co/jayn7/Z-Image-Turbo-GGUF/tree/main
encoder:
https://huggingface.co/unsloth/Qwen3-4B-GGUF/tree/main
>>
>>107796274
If you have 6GB and at least Ampere card you can run fp16 (or the non guff) one with 16gb shared ram. It's not a big deal.
GGUF Q8 takes even less memory but it is slightly slower as it is not hardware accelerated.
If I can run it, you can do it too.
Flux is way heavier.
>>
File: 00009-3000184703.jpg (1.2 MB, 2432x1664)
1.2 MB
1.2 MB JPG
>>107796400
>>107796474
guess i'll start the process of installing and trying to use comfyui, again
>>
>>107796507
New comfyui is somewhat bad experience. I don't have a cutting edge version but it has memory management issues. But this means it will oom after every few gens. It's not unusably bad but slightly so.
>>
File: deCS_zi_00066_.png (3.2 MB, 2176x1152)
3.2 MB
3.2 MB PNG
>>107796641
I'm not pulling as long as possible
hopefully someone forks a ComfyClassic and unfucks a lot of their bad decisions
>>
File: deCS_zi_00067_.png (3.36 MB, 2176x1152)
3.36 MB
3.36 MB PNG
>>
File: komfyui_00002_.png (1.19 MB, 896x1152)
1.19 MB
1.19 MB PNG
well.. got it working, though i don't yet understand how to get the results i want
https://files.catbox.moe/jnbti2.jpg
rate the flow/offer guidance as far as the settings i should be using
>>
done some performance optimizations, nice one is fused geglu, also added flash attention, be ready for merge soon but it's tested and working
>UNet2DConditionModel runwayml/stable-diffusion-v1-5 @ 512x512 with batch 2 (batch 1 CFG) on 3090
>22.8ms
>~43.86 it/s
very nice
i think i can get faster, haven't fused qkv yet, groupnorm and layernorm kernels aren't the best, maybe some more gemm/conv2d fusions are possible or maybe more tile description tuning, and try tuning concat and elementwise kernels more
>>
File: file.jpg (226 KB, 1024x1024)
226 KB
226 KB JPG
>>107797148
forgot an image
>>
File: 1574148441580.jpg (9 KB, 315x338)
9 KB
9 KB JPG
>>107796768
>>107797067
>[Elton John's "imagine" piano melody playing]
>*dum**dum**dum**dum**dum**dum**dum**dum*
>*dururururumdum*
>Imagine there's weird industrial machinery in every roof
>*dururururumdum*
>>
Lunch
>>
File: komfyui_00005_.png (881 KB, 1152x896)
881 KB
881 KB PNG
>>
File: deED_zi_00038_.png (2.75 MB, 2048x1280)
2.75 MB
2.75 MB PNG
>>107797095
here's one of my most recent workflows for reference:
https://files.catbox.moe/56hg3d.png the auraflow node with shift param [supposedly] improves encoding; people typically a range between 5-7

>>107797148
>>22.8ms
>>~43.86 it/s
how does this compare with the non-modular models?

>>107797191
>>Imagine there's weird industrial machinery in every roof
this is the future according to AI. its not like we're using roofs for anything else
>>
>>107796768
It will be a long project. I wish I was a dev but I can barely read books.
Now would be the time to capture the codebase I think.
>>
>>107797365
Before the git gets erased or closed.
>>
>>107797338
I just found it funny
>>
>>107797338
>the non-modular models
wdym
>>
File: deED_zi_00039_.png (2.49 MB, 2048x1280)
2.49 MB
2.49 MB PNG
>>107797487
dino transforms models into optimized standalone models, right? I'm always wondering about the benchmarks, non-optimized model vs optimized dino-module
>>
File: k_00004_.png (945 KB, 1152x896)
945 KB
945 KB PNG
>>
>>107797525
ohh
>diffusers 23.05 it/s
>diffusers with torch.compile max-autotune fullgraph 29.29 it/s
>>
File: k_00011_.png (968 KB, 768x1280)
968 KB
968 KB PNG
>>
File: k_00014_.png (1.02 MB, 768x1280)
1.02 MB
1.02 MB PNG
>>
>update comfy
>workflow that worked before now ooms
wowzer!
>>
>>107798245
Most of the 3rd party nodes were not using apis' memory management anyway.
It was more and more obvious with chinese animation models.
>>
>>107798280
eg. They are wrappers.
>>
File: Systems_01494.jpg (359 KB, 1418x1778)
359 KB
359 KB JPG
>>
File: Systems_01456.jpg (1.05 MB, 2304x1792)
1.05 MB
1.05 MB JPG
>>
File: halopsx3.jpg (450 KB, 3687x2765)
450 KB
450 KB JPG
>>
File: halopsx2.jpg (433 KB, 3687x2765)
433 KB
433 KB JPG
>>
File: deED_zi_00040_.png (2.43 MB, 2048x1280)
2.43 MB
2.43 MB PNG
see the stargs. these text goof-ups humor me

>>107798245
>he pulled
oh no

>>107798330
ah, my stocks seem to be doing well

>>107798488
very cool. would play (the pc port)
>>
File: k_00017_.png (292 KB, 1280x768)
292 KB
292 KB PNG
>>
File: k_00019_.png (1.27 MB, 1280x768)
1.27 MB
1.27 MB PNG
>>
File: deED_zi_00044_.png (2.94 MB, 2048x1280)
2.94 MB
2.94 MB PNG
>>
File: k_00020_.png (1.14 MB, 1280x768)
1.14 MB
1.14 MB PNG
>>
File: k_00021_.png (1.28 MB, 768x1280)
1.28 MB
1.28 MB PNG
>>
File: k_00022_.png (2.97 MB, 1248x1824)
2.97 MB
2.97 MB PNG
>>
File: k_00023_.png (2.93 MB, 1248x1824)
2.93 MB
2.93 MB PNG
>>
File: deUC_zi_00001_.png (2.38 MB, 2048x1280)
2.38 MB
2.38 MB PNG
>>
File: Jennifer_01.jpg (258 KB, 1489x1117)
258 KB
258 KB JPG
>>
File: ComfyUI_01528_.png (1.99 MB, 2000x1152)
1.99 MB
1.99 MB PNG
Entombed left Hand Path.
https://www.youtube.com/watch?v=weyYzWU-FNI
>>
>>107799099
Yewtube has decided that my current song is this:<
>>
>>107799158
https://www.youtube.com/watch?v=hdp7snFv7YI
>>
https://www.youtube.com/watch?v=qeeOc8naPIg
>>
File: k_00025_.png (2.43 MB, 1824x1248)
2.43 MB
2.43 MB PNG
https://suno.com/s/cqz2js53aL29kxGX
nice when it ignores my dumb lyrics and instead generates a decent 8 minute instrumental
>>
File: slame meat.jpg (106 KB, 1024x1024)
106 KB
106 KB JPG
>>107799340
idk why but the drum beat gives me early 90s vibes. fun energy
also, I'd like 4lbs of slame meat please
>>
File: k_00028_.png (2.37 MB, 1920x1080)
2.37 MB
2.37 MB PNG
>>
File: k_00029_.png (2.24 MB, 1920x1080)
2.24 MB
2.24 MB PNG
>>
File: k_00031_.png (2.2 MB, 1344x1728)
2.2 MB
2.2 MB PNG
>>
File: deUC_zi_00004_.png (2.29 MB, 2048x1280)
2.29 MB
2.29 MB PNG
>>
File: k_00032_.png (2.33 MB, 1344x1728)
2.33 MB
2.33 MB PNG
>>
File: deUC_zi_00008_.png (2.31 MB, 2048x1280)
2.31 MB
2.31 MB PNG
#wisdom
>>
File: k_00034_.png (2.08 MB, 1920x1080)
2.08 MB
2.08 MB PNG
>>
File: k_00038_.png (1.56 MB, 1152x896)
1.56 MB
1.56 MB PNG
>>
File: k_00040_.png (2.74 MB, 1920x1080)
2.74 MB
2.74 MB PNG
>>
File: deED_zi_00047_.png (2.71 MB, 2048x1280)
2.71 MB
2.71 MB PNG
>>107799980
I approve this gen
>>
>>107785765
catbox?
>>
File: 1801700187.jpg (346 KB, 2097x2691)
346 KB
346 KB JPG
>>
>>107801484
Sent you a pm.
>>
File: IO_SE_TART_4.jpg (1.14 MB, 3532x4396)
1.14 MB
1.14 MB JPG
>>
File: IO_SE_TART_5.jpg (1.11 MB, 4608x3584)
1.11 MB
1.11 MB JPG
>>
File: deme_00213_.jpg (494 KB, 1024x1024)
494 KB
494 KB JPG
>>107801730
i have this in my files but i dont remember why i made it
>>
>>107801956
I don't think you have ever announced a specific reason for your 24/7 spam.
>>
File: deUC_zi_00009_.png (2.09 MB, 2048x1280)
2.09 MB
2.09 MB PNG
>>107801968
ok
>>
File: IO_SE_TART_2.jpg (1.3 MB, 3584x4608)
1.3 MB
1.3 MB JPG
>>107801956
Thats.... Interesting.
>>
>>107802011
>ran took everything from me
>>
File: deUC_zi_00011_.png (2.12 MB, 2048x1280)
2.12 MB
2.12 MB PNG
>>107802196
>Interesting
EYE thought so too, heh heh heh
>>
i miss schizo anon
>>
File: IO_SE_TART_3.jpg (1.08 MB, 3488x4300)
1.08 MB
1.08 MB JPG
>>107802248
EYE SEE what you did there lol. What's good man. Very ominous gen.
>>
File: deUC_zi_00013_.png (2.27 MB, 2048x1280)
2.27 MB
2.27 MB PNG
>>107802259
nm, just makin it through the week. slowly but surely. are you back in your groove?
>Very ominous gen.
I grabbed an interesting lora earleir the week that I've been having fun with. it adds a neat mood and interpretation to gens
>>
>>107802253
he tried to warn us...
>>
File: IO_SE_TART_7.jpg (1.31 MB, 3584x4436)
1.31 MB
1.31 MB JPG
>>107802354
Yep, I'm back in my groove. Had a very productive few days at work so far. I'm one of these people who can't stand to be idle. I don't understand how people can sit in front of a television and binge watch episodes of a TV show for hours on end. I have to be up and moving. Anyway, yeah, those are some pretty cool gens. What exactly is the Lora theme?
>>
>>107802405
Thank you for letting us know.
>>
File: IO_SE_TART_1.jpg (1.2 MB, 3584x4420)
1.2 MB
1.2 MB JPG
>>107802411
You're welcome. Didn't mean to strike a nerve if you're one of these people who can watch long bouts of television shows or movies or anything. I didn't mean to imply that those people were lazy. Just that I don't have the capacity for that.
>>
File: deUC_zi_00015_.png (2.71 MB, 2048x1280)
2.71 MB
2.71 MB PNG
>>107802405
>I'm one of these people who can't stand to be idle.
good for you. I'm always kind of envious of people who always keep moving. my natural state is more sedentary
>What exactly is the Lora theme?
its suppose to be 'cassette fururism' or something like that. I didn't quite get the original aesthetic I was aiming for out of it but have been getting lots of other cool stuff. did you see the space gens earlier in the thread? I thought those were super cool
>>
File: PH_SE_ANIMALSNACK_06.jpg (1.38 MB, 4608x3584)
1.38 MB
1.38 MB JPG
>>107802468
Ah, I see. That's a cool idea for a Lora. Anyway, you talk frequently about going to the gym, so you're at least combating your sedentary nature with some healthy habits.
>>
File: deUC_zi_00018_.png (2.48 MB, 2048x1280)
2.48 MB
2.48 MB PNG
>>107802504
I do stuff, but there's always a barrier in front of doing stuff. doing stuff is more of a duty than a desire. I wish I had that innate pull towards motion. if I did, I prob would have conquered the world by now
but for now, I'm being innately pulled towards my pillows. gn
>>
File: PH_SE_ANIMALSNACK_01.jpg (914 KB, 4408x2988)
914 KB
914 KB JPG
>>107802523
I get it, but you should embrace who you are and the positive qualities that you have. Everyone has negative qualities, but wishing they were different is just to harp on yourself in a negative manner. I'm sure you have a ton of good qualities that you can espouse and be proud of. Anyway, have a good night's sleep.
>>
fuckin rip
https://files.catbox.moe/ycwd3z.jpg
>>
File: PH_SE_ANIMALSNACK_17.jpg (1.66 MB, 4608x3584)
1.66 MB
1.66 MB JPG
>>
File: winter night 3.webm (3.94 MB, 1920x640)
3.94 MB
3.94 MB WEBM
>>
File: 000000_51433_.png (2.95 MB, 1440x1120)
2.95 MB
2.95 MB PNG
G'mornin Anons, have a great day!
>>
>>107803942
Needs more jpeg
>>
File: 1756767728924055.jpg (1.42 MB, 2776x2160)
1.42 MB
1.42 MB JPG
>>107803942
she's now a lizard
>>
>>
yo niggas what's the go to realistic 1girl sloppa now
i used pony a year ago and updated and it's worse now, not interested in video gen (only image) but never messed with the funny qwen and flux and chinese models and all
>>
>>107804603
ZIT is based but you have to go a little out of your way to avoid sameface + samebackground + sameangle. It also doesn't change much if you randomize seeds if you use it raw
>>
>>107804603
>>107804617
zit and chroma are best for realism
>>
>>
>>107804617
damn. sameeverything is always annyoing

>>107804628
realism or actual photorealism? seems like everyone forgot how to train skin tones that don't look like a doll in the past year. goddamn chink datasets
>>
>>107804628
I like z-image but can't do lora easily.
>>
qwen is also nice.
>>
File: deED_zi_00049_.png (2.89 MB, 2048x1280)
2.89 MB
2.89 MB PNG
the more elaborate the captchas are, the more intelligent we will become. no longer are captchas being used to train AI, they're being used to train ourselves
>>
>>107805685
i'm still quite unable to just glance and see the differences in the star patterns
it's pretty annoying
mobile is even worse
>>
File: 540151152512.jpg (279 KB, 1874x2499)
279 KB
279 KB JPG
>>
File: deED_zi_00050_.png (2.68 MB, 2048x1280)
2.68 MB
2.68 MB PNG
>>107805753
one of my laptops only has to do 1 captcha puzzle and its the easiest one. it seems like maybe theres some cookie or some useragent that can trigger easier captchas.....

>>107805825
I'm more of a jameson guy if I had to go with big brands, though I've never met a whiskey I didn't like
>>
File: 470152154524.jpg (299 KB, 1874x2499)
299 KB
299 KB JPG
>>107805887
I didn't ask.
>>
>>107805887
I don't know I'm speculating it is Cloudflare who calculates your IP's risk factor plus it is based on the age of the cookies too. We are Amazon turk workers now...
>>
File: k_00047_.png (1.71 MB, 1920x1080)
1.71 MB
1.71 MB PNG
gm
using fal . ai, queued up a lora, trained on a few punk rock album covers, for zit, will see if it works
>>
File: 540151151511.jpg (292 KB, 1874x2499)
292 KB
292 KB JPG
>>107805994
Cute
>>
File: k_00048_.png (1.85 MB, 1920x1080)
1.85 MB
1.85 MB PNG
>>
File: 436656457656.jpg (216 KB, 1216x647)
216 KB
216 KB JPG
>>
File: deED_zi_00052_.png (2.65 MB, 2048x1280)
2.65 MB
2.65 MB PNG
>>107805994
cool, hope it turns out well!

>>107806027
>budget keanu
that pizza looks good. I want a pizza...
>>
File: k_00049_.png (1.9 MB, 1920x1080)
1.9 MB
1.9 MB PNG
prompting like 'there is ___, there is ___, there is ___' random things
>>
>>107806154
can you try '___ is occluded by a ___ in the foreground'?
>>
File: k_00050_.png (1.98 MB, 1920x1080)
1.98 MB
1.98 MB PNG
>>107806264
i'll try
>>
File: deED_zi_00053_.png (2.86 MB, 2048x1280)
2.86 MB
2.86 MB PNG
>>
File: k_00052_.png (2.12 MB, 1344x1344)
2.12 MB
2.12 MB PNG
>>
File: deED_zi_00055_.png (2.53 MB, 2048x1280)
2.53 MB
2.53 MB PNG
>>
Morning anons
It was a fun birthday yesterday :)
>>
File: deCS_zi_00068_.png (3.9 MB, 2176x1152)
3.9 MB
3.9 MB PNG
>>107806513
gm
>>
File: dePR_zi_00047_.png (3.13 MB, 1920x1152)
3.13 MB
3.13 MB PNG
>>
File: k_00053_.png (2.79 MB, 1488x1488)
2.79 MB
2.79 MB PNG
lora possibly turned out alright
>>
File: deSG_cHD_00065_.png (3.84 MB, 2016x1165)
3.84 MB
3.84 MB PNG
>>107806581
nice. that seemed fast
>>
>>107785765
which one of those lets me make porn and also I want Lara Croft Legend porn so bad
>>
File: deAA_cHD_00063_.png (2.5 MB, 1728x1075)
2.5 MB
2.5 MB PNG
>>
File: k_00054_.png (3.01 MB, 1488x1488)
3.01 MB
3.01 MB PNG
>>107806621
fal . ai seems to run pretty quick, much fast than civitai, which is an all-day process of waiting
>>
File: k_00057_.png (3.26 MB, 1488x1488)
3.26 MB
3.26 MB PNG
>>
File: deAA_cHD_00064_.png (2.21 MB, 1728x1075)
2.21 MB
2.21 MB PNG
>>107807035
what are your thoughts about z-image so far?
>>
File: k_00058_.png (3.15 MB, 1488x1488)
3.15 MB
3.15 MB PNG
>>107807081
thoughts are limited to: upgrade from flux, loras seem essential
>>
File: file.png (764 KB, 631x877)
764 KB
764 KB PNG
hey anons, sorry if this isn't the place to ask (pls le me know where instead):
what's the best way to make myself a "live" 2d/3d avatar for youtube videos where i'm just talking with different backgrounds like gaming and/or text/articles?
imagine a design as simple as imu (pic related) that's just mimicking myself talking to the camera during the video, is that explanation good enough?
i know people have been doing with live2d and/or the free 3d thing but it's between mega expensive shit and super ugly
thank you and have an awesome day/night
>>
File: k_00059_.png (3.08 MB, 1488x1488)
3.08 MB
3.08 MB PNG
>>107807234
unless there is a better place on here, you will probably be better off having chatgpt help you set that up.
>>
File: deAA_cHD_00065_.png (3.13 MB, 1728x1075)
3.13 MB
3.13 MB PNG
>>107807234
you want to make an avatar of yourself and use it to generate youtube content?
>>
File: deCS_zi_00069_.png (3.52 MB, 2176x1152)
3.52 MB
3.52 MB PNG
>>107807234
avatar generation is a very well-traversed topic, but its not something much talked about here so I can't tell you what is "the best way". what you can try is checking out avatar spaces on huggingface and seeing if any of the popular/active projects meet your needs

https://huggingface.co/spaces?q=avatar

otherwise, here are a few of the more recent avatar tools I have links for. again, unsure which perform best:
https://huggingface.co/meituan-longcat/LongCat-Video-Avatar
https://liveavatar.github.io
https://github.com/AA-Factory/aafactory
>>
File: k_00061_.png (3.22 MB, 1488x1488)
3.22 MB
3.22 MB PNG
>>
File: k_00062_.png (2.87 MB, 1488x1488)
2.87 MB
2.87 MB PNG
ought eye bake
>>
>>107807180
>Anal Cunt: I just Saw The Gayest Guy On Earth
>>
>>107807627
>>107807627
>>107807627
>>
>>107807632
god damnit i forgot subject
>>
File: peanut.png (273 KB, 532x500)
273 KB
273 KB PNG
>>107807370
yes, it wont look like me at all though, the whole persona will be more of a shadow thing, think TheBurntPeanut but less zoomer (i also considered his Snapchat filter method but it only looks good on streams where the avatar can be wonky 24/7)
>>107807428
thanks, i'm very new at the whole AI creation thing (only have made 2-3 static anime images with comfyui like months ago) but aren't most of these models trying to mimic humans? my intent is more of an object and/or thing like a blob or a slime or a cat/bear without a background
Thanks!
>>
File: k_00065_.png (491 KB, 512x512)
491 KB
491 KB PNG
>>
File: k_00069_.png (573 KB, 512x512)
573 KB
573 KB PNG
>>
File: k_00070_.png (474 KB, 512x512)
474 KB
474 KB PNG
>>
File: deCS_zi_00070_.jpg (1 MB, 2176x1152)
1 MB
1 MB JPG
>>
File: 1748639817369188.png (3.01 MB, 1344x1712)
3.01 MB
3.01 MB PNG
>>
File: 1.jpg (171 KB, 1632x1104)
171 KB
171 KB JPG



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.