[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: 00063-995181695.png (3.39 MB, 1344x1728)
3.39 MB PNG
Previous /sdg/ thread : >>109081759

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Z-Image
https://comfyanonymous.github.io/ComfyUI_examples/z_image
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Flux.2 Dev/Klein
https://comfyanonymous.github.io/ComfyUI_examples/flux2
https://huggingface.co/black-forest-labs/FLUX.2-dev
https://huggingface.co/black-forest-labs/FLUX.2-klein-4B
https://huggingface.co/black-forest-labs/FLUX.2-klein-9B

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Anima
https://huggingface.co/circlestone-labs/Anima

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/aco/csdg/
>>>/b/degen
>>>/d/ddg
>>>/e/edg
>>>/gif/vdg
>>>/h/hdg
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vp/napt
>>>/vt/vtai

OP https://rentry.co/twkuk8tz
>>
File: 00064-2817934012.jpg (452 KB, 1728x1344)
452 KB JPG
>>
>containment general
>>
>mfw Resource news

06/19/2026

>FreeStyle: Free Control of Style-Content Dual-Reference Generation from Community LoRA Mining
https://github.com/Blue2Giant/FreeStyle

>JanusMesh: Fast and Zero-Shot 3D Visual Illusion Generation via Cross-Space Denoising
https://siang1105.github.io/JanusMesh.github.io

>Linear Recurrent Unit with Semantic Modulation for Image Super-Resolution
https://github.com/MingyuChoi-run/LSM

>LEAP: Layer-skipping Efficiency via Adaptive Progression for Vision Transformer Distillation
https://github.com/KevinZ0217/LEAP

>StylisticBias: A Few Human Visual Cues Drive Most Social Biases in MLLMs
https://hf.co/datasets/shaghayegh/stylistic-bias-dataset

>musubi-tuner adds support for ideogram 4 lora training
https://github.com/kohya-ss/musubi-tuner/blob/dev/docs/ideogram4.md

>KupkaProd Music Video Pipeline
https://github.com/Matticusnicholas/KupkaProd-Music-Video-Pipeline

>Midjourney goes from generating cat images to full-body ultrasound scans
https://www.theverge.com/ai-artificial-intelligence/952011/midjourney-medical-ai-ultrasound-scan

>TeleStyle V2: Beyond Content-Preserving Style Transfer with Self-Distillation and Distribution-Matching-Distillation
https://github.com/Tele-AI/TeleStyleV2

06/18/2026

>UniTemp: Unlocking Video Generation in Any Temporal Order via Bidirectional Distillation
https://lzhangbj.github.io/projects/unitemp

>Reasoning as Intersection: Consensus-Frame Alignment for Visual Focus in Video-MLLMs
https://github.com/1Pansy/VideoCFR

>Moebius: 0.2B Lightweight Image Inpainting Framework with 10B-Level Performance
https://hustvl.github.io/Moebius

>From Bounding Boxes to Visual Reasoning: An On-Policy Data Annotation Tool for Vision-Language Models
https://github.com/WnQinm/Annotator

>Boogu-Image-0.1-Edit GGUF
https://huggingface.co/realrebelai/Boogu-Image-Edit_GGUFs

>FlexAvatar: Learning Complete 3D Head Avatars with Partial Supervision
https://tobias-kirschstein.github.io/flexavatar
>>
>mfw Research news

06/19/2026

>SketchKeyAnime: Reference-anchored Sparse Key-Sketch Animation Synthesis
https://arxiv.org/abs/2606.19958

>BAFIS: Dataset + Framework to assess occupational Bias and Human Preference in modern T2I Models
https://arxiv.org/abs/2606.20241

>Cinematic Compositing Using Character-Environment-Harmonized Video Generation Models
https://arxiv.org/abs/2606.20233

>NAMESAKES: Probing Identity Memorization in T2I Models
https://arxiv.org/abs/2606.20155

>Through the PRISM: Preference Representation in Intermediate States of Video Diffusion Models
https://arxiv.org/abs/2606.20310

>ImageWAM: Do World Action Models Really Need Video Generation, or Just Image Editing?
https://zhangwenyao1.github.io/ImageWAM

>WeGenBench: A Multidimensional Diagnostic Benchmark towards T2I Model Optimization
https://arxiv.org/abs/2606.20100

>On the Redundancy of Timestep Embeddings in Diffusion Models
https://arxiv.org/abs/2606.20416

>The FID Lottery: Quantifying Hidden Randomness in Generative-Model Evaluation
https://kyutai.org/fid-lottery

>Learning When to Denoise: Optimizing Asynchronous Schedules for Latent Diffusion
https://arxiv.org/abs/2606.19662

>Variable-Length Tokenization via Learnable Global Merging for Diffusion Transformers
https://arxiv.org/abs/2606.20076

>SSD: Spatially Speculative Decoding Accelerates Autoregressive Image Generation
https://arxiv.org/abs/2606.20543

>LooseControlVideo: Directorial Video Control using Spatial Blocking
https://shariqfarooq123.github.io/LooseControlVideo

>Timage: A Generative T2I Paradigm for Fine-Tuning Vision-Language Models
https://arxiv.org/abs/2606.19944

>CrossFlow: One-Step Generation Across Latent and Pixel Spaces
https://arxiv.org/abs/2606.19970

>HY-WU (Part I): An Extensible Functional Neural Memory Framework and An Instantiation in Text-Guided Image Editing
https://arxiv.org/abs/2603.07236

>VideoSketcher: Sequential Sketch Generation Using Video Model Priors
https://arxiv.org/abs/2602.15819
>>
File: 9k.png (571 KB, 768x512)
571 KB PNG
>>
File: debo_csa_ay_00007_.png (2.59 MB, 1792x977)
2.59 MB PNG
>>
File: 00007-4015616768.png (3.98 MB, 1344x1728)
3.98 MB PNG
>>
>>
>>
>>
File: debo_csa_ay_00011_.png (2.29 MB, 1792x977)
2.29 MB PNG
what have you done, frog knight
>>
File: 00151-4086857861.jpg (411 KB, 1536x1536)
411 KB JPG
>>
>>109098247
So desperate for attention
>>
File: 000000_75300_.png (3.31 MB, 1440x1120)
3.31 MB PNG
>>
>>109098247
it's an homage
>>
File: output.mp4 (3.38 MB, 640x512)
3.38 MB
3.38 MB MP4
>>
File: debo_csa_ay_00013_.png (2.12 MB, 1792x977)
2.12 MB PNG
>>
Morning anons
>>
File: 00226-541971053.png (2.96 MB, 1920x1080)
2.96 MB PNG
>>109098888
morning, nice #s
>>
>>
File: debo_csa_ay_00017_.png (1.88 MB, 1792x977)
1.88 MB PNG
>>109098888
checked and gm
>>
>gm
>>
File: 00249-957641729.png (2.22 MB, 1024x1024)
2.22 MB PNG
youtu.be/szRWa3AznvY
>>
File: debo_csa_ay_00023_.png (2.39 MB, 1792x977)
2.39 MB PNG
>>109099089
groovy
>>
File: 00290-3742888640.png (2.87 MB, 1088x1088)
2.87 MB PNG
>>
File: 00301-1743364541.png (1.74 MB, 1088x1088)
1.74 MB PNG
>>
File: debo_csa_ay_00041_.png (2.03 MB, 1792x977)
2.03 MB PNG
>>
File: 000000_75315_.png (3.4 MB, 960x1679)
3.4 MB PNG
>>
File: debo_csa_ay_00042_.png (2.46 MB, 1792x977)
2.46 MB PNG
>>
>>
File: debo_csa_ay_00050_.png (2.34 MB, 1792x977)
2.34 MB PNG
>>109099862
and they said AI couldn't create true art
>>
File: 00351-975376971.png (2.63 MB, 1024x1024)
2.63 MB PNG
>>
File: debo_csa_ay_00055_.png (2.38 MB, 1792x977)
2.38 MB PNG
>>
File: 00410-2605825548.png (1.73 MB, 896x1152)
1.73 MB PNG
>>
File: output.mp4 (3.57 MB, 640x512)
3.57 MB
3.57 MB MP4
>>
File: debo_csa_ay_00061_.png (2.43 MB, 1792x977)
2.43 MB PNG
>>
File: 00492-2386208182.png (1.45 MB, 768x1280)
1.45 MB PNG
>>
>>109097685
>>109097691
Spam/Malware!! Anon be safe!
>>
File: 00513-4214040947.png (1.56 MB, 896x1152)
1.56 MB PNG
>>
File: debo_csa_fia_00044_.png (2.87 MB, 1792x977)
2.87 MB PNG
>>
File: 00552-527149712.png (2.1 MB, 896x1152)
2.1 MB PNG
>>
>>
File: debo_csa_fia_00055_.png (2.13 MB, 1792x977)
2.13 MB PNG
>>
>>
File: file.png (526 KB, 1076x690)
526 KB PNG
kool model bro
>>
File: debo_csa_ay_00077_.png (2.32 MB, 1792x977)
2.32 MB PNG
>>109102057
this has a very "social commentary on excesses" vibe

>>109102260
try being safer
>>
>>109097585
every time you retards make one of these the price of ram goes up.
>>
>>109102260
Your prompt was unnecessary and unsafe. Probably had some weird topics like illuminati (not safe).
>>
>>109102267
thats how she rolls
>>
File: file.png (57 KB, 365x653)
57 KB PNG
0 for 10. this model is fucking cursed
>>
>>109102359
yah, there are ways to trick it and prevent that safety filter, but really all you need is the json format
>>
>>109102364
i'm doing json format! maybe it's malformed somehow, like do they really want it to be pretty printed? can i try your rewrite prompt?
>>
it must have been the rewriter. this is garbage but it's progress lol
>>
我已经不知道谁在勾结谁了。
>>
File: debo_csa_ay_00078_.png (2.57 MB, 1792x977)
2.57 MB PNG
>>109102470
pretty cool actually

>>109102499
>mage blacked
>>
>>109102511
i feel sorry for the interns that have to read my chat sessions
>>
File: debo_csa_fia_00060_.png (1.8 MB, 1792x977)
1.8 MB PNG
>>109102538
lmao
>>
>>109102552
i'm kind of surprised 5.5 went along with this nonsense
>>
>>109102552
Have you archived all of your gens? I deleted mine and instantly felt regret but also relief.
>>
enough of all that nonsense
>>
File: 0615_020212.jpg (536 KB, 2900x1947)
536 KB JPG
>>
>>109102613
i wonder what productanon is doin now? i'm pourin some nuclear surfboard out for him. just in case.
>>
File: debo_csa_fia_00065_.png (2.5 MB, 1792x977)
2.5 MB PNG
>>109102580
gpt is into it

>>109102583
I've never deleted any of my gens, though I don't have them backed up either. if a drive exploded, I'd be sad but I'd move on
the nice thing about all your free space, you can start a whole new life of gens
>>
>>109102628
LLMs have taken all of my time, it's been ages since generating any image. It's probably hardware fatigue, would need a faster setup or something.
>>
>>109102664
what do you talk to most of the time? or are you a merely code slave?
>>
>>109102686
I made couple of hobby projects and learned some programming but I'm not naturally gifted in logic puzzles. Also enjoy d&d scenarios but that gets old fast. New Gemma is a significant step up and I sound like a shill but it crosses that usability/enjoyment barrier at least to me.
>>
>>109102628
idk maybe... that seems more like a claude thing

>>109102711
i see. i haven't talked to gemma4 much... i did get it setup for local inference, but personally i'd rather cozy up to the frontier models. do you ever ask it random questions or just project stuff?
>>
>>109102721
Of course I toy around with it. But you can't publicly admit that!
I did program my own tool calls but I'm so lazy, I can do simple web searches and news fetching but in order to implement multiple looping tool calls I would need to refactor my client and that's a pain because even after a two weeks of absence I didn't understand my own logic (it works though but sometimes I ask wtf).
>>
i miss schizo anon
>>
File: file.png (30 KB, 967x322)
30 KB PNG
>>109102735
that's cool you're goin' low level DIY. i don't have the patience for all that. just outsourcing thought, i can quit any time btw
>>
>>109102753
Text completion client is basically a basic bitch programming tutorial it's just about string management. Biggest issue is about how to manage tags and keeping them in order but it's basically a string hell.
>>
>>109102783
yeah strings sound simple up front but they can be filled up with any kind of nonsense! what language are you building this in?
>>
>>
>>
File: 000000_75372_.png (2.71 MB, 1310x1250)
2.71 MB PNG
>For me!
G'mornin Anons,
>>
>>109102993
awake in the after hours... grim portents
>>
>>109102822
Began with Python and converted it to C. I like C more because structure helps me to understand it better. I didn't have any idea about Python especially AI generated examples are horrific because variable names are nonsensical etc
>>
>>109102999
Checked, I messed up my sleep sched a month ago..or my empath abilities are trying to tell me something.
>>
>>109103011
now u got digits!. if i'm not mistaken... this means whatever u say is right! the possibilities are endless.
>>
File: 000000_75375_.png (3.39 MB, 1199x1359)
3.39 MB PNG
>>109103026
We need to go back to 13 hour days and a 13 month calendar..

>got sage_attention 3 working after a month..omg that was dumb.
>>
>>109103055
you're probably right, i suppose.
>>
>>
File: 00011-1071626082.png (535 KB, 512x512)
535 KB PNG
>>
>gm
>>
File: 00056-1249974005.png (2.38 MB, 1024x1024)
2.38 MB PNG
>>
File: 00077-1274944731.png (1.6 MB, 768x1280)
1.6 MB PNG
>>
>>109097685
>mfw resource news drops and i still can't make a decent lora
>>
>>109097691
>mfw i can't even keep up with the arxiv firehose anymore

just pick one and pray it's not another latent space rebranding
>>
File: 7878978879.gif (1.91 MB, 576x960)
1.91 MB GIF
>>
File: 00104-4049544950.png (1.88 MB, 1280x768)
1.88 MB PNG
>>
File: 00159-1071570607.png (665 KB, 512x640)
665 KB PNG
>>
>>109102422
sorry i crashed after posting
i fed the prompt guide to gemma4
https://github.com/ideogram-oss/ideogram4/blob/main/docs/prompting.md
and told her to provide a system prompt to output the fomatted JSON
it doesnt need the bbox stuff all that much, just the highlevel, style and elements stuff
>>
>>
I just want to generate photorealistic softcore porn, but now ChatGPT is a prude again so it's almost impossible to use pornstars.
>>
File: 00231-1561143963.png (1.82 MB, 1920x1080)
1.82 MB PNG
>>109104507
nice
>>
>>109104514
So sad. Maybe use local models if you have some spare $

gm
>>
>>
File: 00253-953562525.png (1.7 MB, 1280x768)
1.7 MB PNG
>>109104603
gm
>>
>>109104543
thx
>>109104514
>>109104603
biglust runs on low end hardware doesnt it
>>
File: debo_csa_ay_00003_.png (2.12 MB, 1792x977)
2.12 MB PNG
>>
>>
File: 00271-2231673035.png (1.98 MB, 896x1152)
1.98 MB PNG
>>
File: 00293-3515121269.png (667 KB, 1152x896)
667 KB PNG
>>
File: debo_csa_ay_00005_.png (2.5 MB, 1792x977)
2.5 MB PNG
>>109105079
bulbacat
>>
>>
File: 000000_754334_.png (2.79 MB, 1249x1230)
2.79 MB PNG
>>109104949
Nice. /saves
>>
File: debo_csa_ay_00006_.png (2.44 MB, 1792x977)
2.44 MB PNG
>>
File: debo_csa_ay_00012_.png (2.55 MB, 1792x977)
2.55 MB PNG
>>
File: output.mp4 (3.87 MB, 512x512)
3.87 MB
3.87 MB MP4
100 sd 1.5 hooters
>>
File: 00574-2285763470.png (588 KB, 512x512)
588 KB PNG
mfw
>>
File: 00592-2860239896.png (592 KB, 512x512)
592 KB PNG
>>
File: pixel-0000-3383736815.png (127 KB, 2048x2048)
127 KB PNG
>>
>>
Morning anons
>>
>gm
>>
File: pixel-0005-2712580293.png (293 KB, 2048x2048)
293 KB PNG
>>109105971
morning
>>
File: 1782062662337835.jpg (1.13 MB, 1580x1832)
1.13 MB JPG
I just got recently informed about this and I'm wondering what projects have you guys been hiding away from us?

https://docs.google.com/document/d/1zmcEuocbSMAPH0Cdu0neR7qOYrOad6dB_PXEnPRM24Y/edit?tab=t.0
>>
>>109106142
my projects are not for public consumption
>>
File: debo_csa_ay_00014_.png (2.24 MB, 1792x977)
2.24 MB PNG
>>109105642
what music do you imagine when you look at this

>>109105797
saved

>>109105971
gm(a)

>>109106142
I've been working on a platform that I hope can be a place where people will share and discuss ai projects they're working on. idk if anyone will use it, but the idea is there
>>
>>
>>
File: 00738-2437845078.png (2.45 MB, 1024x1024)
2.45 MB PNG
>>109106328
youtu.be/uR2Jfreo0jg
'his every word is grandly astute
the only thing he says is ‘hoot’'
>>
File: 00746-2734052235.png (1.51 MB, 1024x1024)
1.51 MB PNG
>>
>>
File: debo_csa_ay_00015_.png (2.22 MB, 1792x977)
2.22 MB PNG
>>109106333
what kind of twisted science is ideogramgirl up to

>>109106417
I looped and fullscreened the mp4 to this. its pretty perfect, honestly
>>
>>109106509
the dino-human hybrid experiments will continue until morale improves
>>
>>
File: debo_csa_ay_00019_.png (2.2 MB, 1792x977)
2.2 MB PNG
>>
>>
File: debo_csa_ay_00022_.png (2.48 MB, 1792x977)
2.48 MB PNG
>>
>>
What's the deal with civitai now? I have to pay in crypto? Is anyone actually doing this or are there better options now?
>>
>>
>>
File: debo_csa_ay_00023_.png (2.45 MB, 1792x977)
2.45 MB PNG
>>109107190
you're just trying to do image gen? could just use nai if you want a saas
>>
File: comfyui_00001_.png (1.37 MB, 1024x1024)
1.37 MB PNG
tried z image base, fp8 version of model and text encoder
took 22 minutes.. might not be for me, or my pitiful gpu
>>
File: comfyui_00002_.png (394 KB, 512x512)
394 KB PNG
>>
>>109107456
yeah looks like shit
>>
File: 00878-4147062323.png (1.54 MB, 1152x896)
1.54 MB PNG
>>109107489
ah, was just a minimal prompt of 'penguin, fiery background'
>>
File: debo_csa_ay_00025_.png (2.5 MB, 1792x977)
2.5 MB PNG
>>109107456
turbo is a better option. its low step and the distillation typically makes better gens
>>
File: comfyui_00004_.png (1.14 MB, 1152x896)
1.14 MB PNG
>>109107509
guess i'll stick with turbo
>>
File: debo_csa_ay_00026_.png (2.5 MB, 1792x977)
2.5 MB PNG
>>
File: 00923-306813060.png (1.49 MB, 1152x896)
1.49 MB PNG
>>
File: debo_csa_ay_00027_.png (2.54 MB, 1792x977)
2.54 MB PNG
>>
File: 00937-2566655451.png (1.3 MB, 1280x768)
1.3 MB PNG
>>
File: debo_csa_ay_00029_.png (2.53 MB, 1792x977)
2.53 MB PNG
>>
File: 00994-3788640611.png (2.49 MB, 1024x1024)
2.49 MB PNG
>>
File: debo_csa_ay_00032_.png (2.86 MB, 1792x977)
2.86 MB PNG
>>
>>
>>
File: debo_csa_ay_00041_.png (2.42 MB, 1792x977)
2.42 MB PNG
>>
>>
>>
File: debo_csa_ay_00046_.png (2.54 MB, 1792x977)
2.54 MB PNG
>>109108578
>>109108628
>chibis
its a lie
>>
>>109108677
i skip a lot bc they creep me out sometimes
>>
File: debo_csa_ay_00047_.png (2.39 MB, 1792x977)
2.39 MB PNG
>>109108688
yea you're on a razers edge
>>
>>109108701
i could probably fork it no chibs, but it does so many "normal" gens i haven't bothered.
>>
File: debo_csa_ay_00048_.png (2.18 MB, 1792x977)
2.18 MB PNG
>>109108725
search+replace 'chibis' with 'aliens'
>>
>>
>>
>>
File: mysterious witch.jpg (254 KB, 1280x960)
254 KB JPG
>>109108434
cute!
>>
File: mecha daughter.jpg (261 KB, 1344x768)
261 KB JPG
>>
File: cute witch.jpg (375 KB, 1280x960)
375 KB JPG
>>
File: chaotic aprentice.jpg (301 KB, 1280x960)
301 KB JPG
>>
File: gap moe witch.jpg (249 KB, 1024x1024)
249 KB JPG
>>
File: cute nymph.jpg (272 KB, 960x1280)
272 KB JPG
>>
File: whimsy gloomy witch.jpg (429 KB, 1280x960)
429 KB JPG
>>
i miss schizo anon
>>
File: 00006-858031262.png (1.68 MB, 1280x768)
1.68 MB PNG
>>
File: 00076-3517357252.png (1.35 MB, 1152x896)
1.35 MB PNG
>>
File: 00088-4012046184.png (1.4 MB, 1152x896)
1.4 MB PNG
>>
>>109110061
>>109110061
>>109110061
>>
File: 00030-335884356.png (588 KB, 512x512)
588 KB PNG
>>
File: 00062-423979492.png (563 KB, 512x512)
563 KB PNG



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.