[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


🎉 Happy Birthday 4chan! 🎉


[Advertise on 4chan]


File: PW.webm (1.95 MB, 480x576)
1.95 MB
1.95 MB WEBM
Previous /sdg/ thread : >>106737217

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Early Preview UI
AniStudio: https://github.com/FizzleDorf/AniStudio

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image
https://huggingface.co/QuantStack/Qwen-Image-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Distill-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Edit-2509-GGUF

>Flux.1 Krea
https://docs.comfy.org/tutorials/flux/flux1-krea-dev
https://huggingface.co/black-forest-labs/FLUX.1-Krea-dev
https://huggingface.co/QuantStack/FLUX.1-Krea-dev-GGUF

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2
https://huggingface.co/QuantStack/Wan2.2-TI2V-5B-GGUF
https://huggingface.co/QuantStack/Wan2.2-T2V-A14B-GGUF
https://huggingface.co/QuantStack/Wan2.2-I2V-A14B-GGUF

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://github.com/maybleMyers/chromaforge
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Models, LoRAs & upscaling
https://civitai.com
https://tensor.art
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/tg/slop
>>>/trash/sdg
>>>/vp/napt
>>
File: 00019-1540331261.png (1.87 MB, 1536x864)
1.87 MB
1.87 MB PNG
forgive him baker, for he knows not what he does.
>>
>mfw Resource news

09/30/2025

>Kandinsky 5.0: A family of diffusion models for Video & Image generation
https://github.com/ai-forever/Kandinsky-5

>Wan-Alpha: High-Quality Text-to-Video Generation with Alpha Channel
https://donghaotian123.github.io/Wan-Alpha

>CharGen: Fast and Fluent Portrait Modification
https://chargen.jdihlmann.com

>Visual Jigsaw Post-Training Improves MLLMs
https://penghao-wu.github.io/visual_jigsaw

>DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space
https://github.com/dc-ai-projects/DC-Gen

>Mitigating Hallucination in Multimodal LLMs with Layer Contrastive Decoding
https://github.com/maifoundations/LayerCD

>LayerD: Decomposing Raster Graphic Designs into Layers
https://cyberagentailab.github.io/LayerD

>UniLat3D: Geometry-Appearance Unified Latents for Single-Stage 3D Generation
https://unilat3d.github.io

>STAGE: Stable and Generalizable GRPO for Autoregressive Image Generation
https://github.com/krennic999/STAGE

>AutoPrune: Each Complexity Deserves a Pruning Policy
https://github.com/AutoLab-SAI-SJTU/AutoPrune

>EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling
https://github.com/VectorSpaceLab/EditScore

>GenView++: Unifying Adaptive View Generation and Quality-Driven Supervision for Contrastive Representation Learning
https://github.com/xiaojieli0903/GenViewPlusPlus

>Image MetaHub: Desktop application for browsing, searching, and organizing AI-generated images
https://github.com/LuqP2/Image-MetaHub

>Wan-Alpha; High-Quality Text-to-Video Generation with Alpha Channel
https://donghaotian123.github.io/Wan-Alpha

>California Governor Newsom signs landmark AI safety bill SB 53
https://techcrunch.com/2025/09/29/california-governor-newsom-signs-landmark-ai-safety-bill-sb-53
>>
File: GmdeV_37.jpg (335 KB, 2458x2458)
335 KB
335 KB JPG
>>
>mfw Research news

09/30/2025

>Score-based Membership Inference on Diffusion Models
https://arxiv.org/abs/2509.25003

>PanoWorld-X: Generating Explorable Panoramic Worlds via Sphere-Aware Video Diffusion
https://yuyangyin.github.io/PanoWorld-X

>Scalable GANs with Transformers
https://arxiv.org/abs/2509.24935

>OpenGPT-4o-Image: A Comprehensive Dataset for Advanced Image Generation and Editing
https://arxiv.org/abs/2509.24900

>VAGUEGAN: Stealthy Poisoning and Backdoor Attacks on Image Generative Pipelines
https://arxiv.org/abs/2509.24891

>Training-Free Token Pruning via Zeroth-Order Gradient Estimation in VLMs
https://arxiv.org/abs/2509.24837

>Causal-Adapter: Taming T2I Diffusion for Faithful Counterfactual Generation
https://arxiv.org/abs/2509.24798

>Inducing Dyslexia in VLMs
https://arxiv.org/abs/2509.24597

>SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer
https://arxiv.org/abs/2509.24695

>TokenSwap: Backdoor Attack on the Compositional Understanding of Large VLMs
https://arxiv.org/abs/2509.24566

>Instruction Guided Multi Object Image Editing with Quantity and Layout Consistency
https://arxiv.org/abs/2509.24514

>UI2V-Bench: An Understanding-based I2V Generation Benchmark
https://arxiv.org/abs/2509.24427

>CLQ: Cross-Layer Guided Orthogonal-based Quantization for Diffusion Transformers
https://arxiv.org/abs/2509.24416

>TraitSpaces: Towards Interpretable Visual Creativity for Human-AI Co-Creation
https://arxiv.org/abs/2509.24326

>Hyperspherical Latents Improve Continuous-Token Autoregressive Generation
https://arxiv.org/abs/2509.24335

>SVGThinker: Instruction-Aligned and Reasoning-Driven Text-to-SVG Generation
https://arxiv.org/abs/2509.24299

>Light-SQ: Structure-aware Shape Abstraction with Superquadrics for Generated Meshes
https://johann.wang/Light-SQ

>FlashI2V: Fourier-Guided Latent Shifting Prevents Conditional Image Leakage in I2V Generation
https://pku-yuangroup.github.io/FlashI2V
>>
>mfw MORE Research news

>DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder
https://arxiv.org/abs/2509.25182

>GHOST: Hallucination-Inducing Image Generation for Multimodal LLMs
https://arxiv.org/abs/2509.25178

>Personalized Vision via Visual In-Context Learning
https://yuxinn-j.github.io/projects/PICO

>Aligning Visual Foundation Encoders to Tokenizers for Diffusion Models
https://aligntok.github.io

>Rolling Forcing: Autoregressive Long Video Diffusion in Real Time
https://kunhao-liu.github.io/Rolling_Forcing_Webpage

>Score Distillation of Flow Matching Models
https://arxiv.org/abs/2509.25127

>Not All Tokens are Guided Equal: Improving Guidance in Visual Autoregressive Models
https://arxiv.org/abs/2509.23876

>Towards Redundancy Reduction in Diffusion Models for Efficient Video Super-Resolution
https://arxiv.org/abs/2509.23980

>HunyuanImage 3.0 Technical Report
https://arxiv.org/abs/2509.23951

>GANji: A Framework for Introductory AI Image Generation
https://arxiv.org/abs/2509.24128

>Autoregressive Video Generation beyond Next Frames Prediction
https://arxiv.org/abs/2509.24081

>SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention
https://arxiv.org/abs/2509.24006

>Token Painter: Training-Free Text-Guided Image Inpainting via Mask Autoregressive Models
https://arxiv.org/abs/2509.23919

>Towards Fine-Grained Text-to-3D Quality Assessment: A Benchmark and A Two-Stage Rank-Learning Metric
https://cbysjtu.github.io/Rank2Score

>Texture Vector-Quantization and Reconstruction Aware Prediction for Generative Super-Resolution
https://arxiv.org/abs/2509.23774

>Semantic Editing with Coupled Stochastic Differential Equations
https://arxiv.org/abs/2509.24223

>Asymmetric VAE for One-Step Video Super-Resolution Acceleration
https://arxiv.org/abs/2509.24142

>Origins of Creativity in Attention-Based Diffusion Models
https://arxiv.org/abs/2506.17324
>>
File: deUN_cHD_00063_.png (2.12 MB, 1310x1310)
2.12 MB
2.12 MB PNG
>>
File: PW_143312_.png (2.02 MB, 1280x1800)
2.02 MB
2.02 MB PNG
I had to run off for a bit haha
Gonna go to the store too! Brb again
>>
File: 00027-4148845319.png (1.56 MB, 1536x864)
1.56 MB
1.56 MB PNG
>>106754839
pick me up a labatt tallboy and a pack of lucky red 100s
>>
File: deUN_cHD_00066_.png (2.29 MB, 1310x1310)
2.29 MB
2.29 MB PNG
>>106754850
gonna go "grab a pack of smokes" and never come back :(
>>
File: till_death.jpg (372 KB, 1920x1080)
372 KB
372 KB JPG
till death
https://youtu.be/LQiSEHDRzg8
>>
File: file.png (29 KB, 917x417)
29 KB
29 KB PNG
i need to be careful, this is the response for that Panic! at the disco rip off i posted the other day. fucking normals.
>>
>>106754941
I don't understand.
>>
File: deUN_cHD_00067_.png (3.13 MB, 1843x1147)
3.13 MB
3.13 MB PNG
>>106754941
line go up in a nice feeling
>>
File: 00024-4148845316.png (1.66 MB, 1536x864)
1.66 MB
1.66 MB PNG
>>106754949
it's the view graph for this https://youtu.be/trRoY0ngQjE
you be the judge. it was completely ignored here lmao, which wasn't unexpected.
>>
File: 00025-4148845317.png (1.76 MB, 1536x864)
1.76 MB
1.76 MB PNG
>>
File: deTF_cHD_00004_.png (3.11 MB, 2081x1190)
3.11 MB
3.11 MB PNG
>>
>>106755031
I think you might have something here. It's a pretty good song, it has structure and the chorus has the right hooks. I mean it's AI generated but you made a hit song of sorts.
>>
File: PW_143404_.png (2.94 MB, 1280x1800)
2.94 MB
2.94 MB PNG
>>106754850
You got it!
>>106754905
I love this song! I've listened to it quite a few times hahaha
>>106755031
I like it!
>>
File: 00035-1246292130.png (2.62 MB, 1536x864)
2.62 MB
2.62 MB PNG
>>106755087
suno excels at making "the median song," like it will never do anything really great but with some work it makes some pretty listenable tracks.
>>
>>106755108
Yeah this is what people like - it's catchy, somewhat progressive but still not too complicated and has a clear structure.
You could use Ableton or Cubase (or whatever) to edit these and switch around parts etc..
>>
File: 00037-3392621873.png (2.01 MB, 1536x864)
2.01 MB
2.01 MB PNG
>>106755118
supposedly the new studio thing lets you do that, if they're they're still doing the sale when i run out of credits i might just do upgrade. pw has access to it, idk if he's done anything yet (he hasn't). i do know how to use FL (more less), supposedly you can spit a song out as midi but idk what it is, can't imagine it's got a full suite of effects and vsts and automation, but who knows.
>>
File: deTF_cHD_00007_.png (3 MB, 2081x1190)
3 MB
3 MB PNG
>>
File: PW_143414_.png (2.88 MB, 1280x1800)
2.88 MB
2.88 MB PNG
>>106755144
The new studio thing is fun!
I've done a couple songs LOL but nothing I was gonna share yet haha mostly testing it out
I wanna make something really good and new in v5 first then really get into it!
>>
File: curse_night.jpg (352 KB, 1920x1080)
352 KB
352 KB JPG
WHAT A TERRIBLE NIGHT TO HAVE A CURSE!

curse night
https://suno.com/s/075oGhgpHlRgqoBI
>>
File: BL_SEK_BURG02.jpg (1.1 MB, 4608x3584)
1.1 MB
1.1 MB JPG
>>106755166
>>106755146
Hey a twofer!. What's good?
>>
>>106755144
I would find out how to generate individual audio tracks, then produce something on my own in a proper software. Then make a demo.
It's a long path of course especially if you are totally new.
>>
File: deTF_cHD_00012_.png (3.13 MB, 2081x1190)
3.13 MB
3.13 MB PNG
>>106755172
hey, sorry I missed you earlier today. you caught me during lunch. unusual time for you to pop in!
>>
File: JI_SE_LAYS_8.jpg (222 KB, 2048x2048)
222 KB
222 KB JPG
>>106755184
Hey man its no biggie. What's up?
>>
File: 00041-3940773168.png (2.32 MB, 1536x864)
2.32 MB
2.32 MB PNG
>>106755177
i can operate FL with some proficiency, my problem was always coming up with melody and variety. i'll give it a whirl at some point. i could see it being great for bolting vocals on a track tho. they need more voices, not sure how it works exactly because they do have some variety but not a lot
>>
File: PW_143443_.png (2.73 MB, 1280x1800)
2.73 MB
2.73 MB PNG
>>106755171
Nice!
>>106755172
Heyyy! It's great to see you again :]
Just relaxing at home! It was a great relaxing day haha
How are you?
>>
File: RL_SE_CT_COOK_2.jpg (1.01 MB, 4096x3940)
1.01 MB
1.01 MB JPG
>>106755253
Sounds nice. You had off work I suppose?
>>
File: 57906.png (3.52 MB, 1440x3120)
3.52 MB
3.52 MB PNG
>>106755253
thinking about doing a series of halloween stuff, not sure when to start. last year was fun
>>
File: 57914.png (3.81 MB, 1440x3120)
3.81 MB
3.81 MB PNG
while we're old posting, let me just say
>>
File: 57840.png (3.05 MB, 1440x3120)
3.05 MB
3.05 MB PNG
>>
File: 57852.jpg (328 KB, 1440x3120)
328 KB
328 KB JPG
we used to be a real country
>>
File: 15297-1760922233.png (961 KB, 576x1248)
961 KB
961 KB PNG
>>
File: PQ_SE_N_SODA_5.jpg (169 KB, 1792x2304)
169 KB
169 KB JPG
>>
File: 00064-3909172629.png (1.84 MB, 1536x864)
1.84 MB
1.84 MB PNG
>>
Maintain thread quality
https://rentry.org/debo
>>
File: 00069-4227813072.png (2 MB, 1536x864)
2 MB
2 MB PNG
>>
File: 00073-1993612533.png (1.79 MB, 1536x864)
1.79 MB
1.79 MB PNG



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.