[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


🎉 Happy Birthday 4chan! 🎉


[Advertise on 4chan]


File: PW.webm (1.95 MB, 480x576)
1.95 MB
1.95 MB WEBM
Previous /sdg/ thread : >>106737217

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Early Preview UI
AniStudio: https://github.com/FizzleDorf/AniStudio

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image
https://huggingface.co/QuantStack/Qwen-Image-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Distill-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Edit-2509-GGUF

>Flux.1 Krea
https://docs.comfy.org/tutorials/flux/flux1-krea-dev
https://huggingface.co/black-forest-labs/FLUX.1-Krea-dev
https://huggingface.co/QuantStack/FLUX.1-Krea-dev-GGUF

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2
https://huggingface.co/QuantStack/Wan2.2-TI2V-5B-GGUF
https://huggingface.co/QuantStack/Wan2.2-T2V-A14B-GGUF
https://huggingface.co/QuantStack/Wan2.2-I2V-A14B-GGUF

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://github.com/maybleMyers/chromaforge
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Models, LoRAs & upscaling
https://civitai.com
https://tensor.art
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/tg/slop
>>>/trash/sdg
>>>/vp/napt
>>
File: 00019-1540331261.png (1.87 MB, 1536x864)
1.87 MB
1.87 MB PNG
forgive him baker, for he knows not what he does.
>>
>mfw Resource news

09/30/2025

>Kandinsky 5.0: A family of diffusion models for Video & Image generation
https://github.com/ai-forever/Kandinsky-5

>Wan-Alpha: High-Quality Text-to-Video Generation with Alpha Channel
https://donghaotian123.github.io/Wan-Alpha

>CharGen: Fast and Fluent Portrait Modification
https://chargen.jdihlmann.com

>Visual Jigsaw Post-Training Improves MLLMs
https://penghao-wu.github.io/visual_jigsaw

>DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space
https://github.com/dc-ai-projects/DC-Gen

>Mitigating Hallucination in Multimodal LLMs with Layer Contrastive Decoding
https://github.com/maifoundations/LayerCD

>LayerD: Decomposing Raster Graphic Designs into Layers
https://cyberagentailab.github.io/LayerD

>UniLat3D: Geometry-Appearance Unified Latents for Single-Stage 3D Generation
https://unilat3d.github.io

>STAGE: Stable and Generalizable GRPO for Autoregressive Image Generation
https://github.com/krennic999/STAGE

>AutoPrune: Each Complexity Deserves a Pruning Policy
https://github.com/AutoLab-SAI-SJTU/AutoPrune

>EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling
https://github.com/VectorSpaceLab/EditScore

>GenView++: Unifying Adaptive View Generation and Quality-Driven Supervision for Contrastive Representation Learning
https://github.com/xiaojieli0903/GenViewPlusPlus

>Image MetaHub: Desktop application for browsing, searching, and organizing AI-generated images
https://github.com/LuqP2/Image-MetaHub

>Wan-Alpha; High-Quality Text-to-Video Generation with Alpha Channel
https://donghaotian123.github.io/Wan-Alpha

>California Governor Newsom signs landmark AI safety bill SB 53
https://techcrunch.com/2025/09/29/california-governor-newsom-signs-landmark-ai-safety-bill-sb-53
>>
File: GmdeV_37.jpg (335 KB, 2458x2458)
335 KB
335 KB JPG
>>
>mfw Research news

09/30/2025

>Score-based Membership Inference on Diffusion Models
https://arxiv.org/abs/2509.25003

>PanoWorld-X: Generating Explorable Panoramic Worlds via Sphere-Aware Video Diffusion
https://yuyangyin.github.io/PanoWorld-X

>Scalable GANs with Transformers
https://arxiv.org/abs/2509.24935

>OpenGPT-4o-Image: A Comprehensive Dataset for Advanced Image Generation and Editing
https://arxiv.org/abs/2509.24900

>VAGUEGAN: Stealthy Poisoning and Backdoor Attacks on Image Generative Pipelines
https://arxiv.org/abs/2509.24891

>Training-Free Token Pruning via Zeroth-Order Gradient Estimation in VLMs
https://arxiv.org/abs/2509.24837

>Causal-Adapter: Taming T2I Diffusion for Faithful Counterfactual Generation
https://arxiv.org/abs/2509.24798

>Inducing Dyslexia in VLMs
https://arxiv.org/abs/2509.24597

>SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer
https://arxiv.org/abs/2509.24695

>TokenSwap: Backdoor Attack on the Compositional Understanding of Large VLMs
https://arxiv.org/abs/2509.24566

>Instruction Guided Multi Object Image Editing with Quantity and Layout Consistency
https://arxiv.org/abs/2509.24514

>UI2V-Bench: An Understanding-based I2V Generation Benchmark
https://arxiv.org/abs/2509.24427

>CLQ: Cross-Layer Guided Orthogonal-based Quantization for Diffusion Transformers
https://arxiv.org/abs/2509.24416

>TraitSpaces: Towards Interpretable Visual Creativity for Human-AI Co-Creation
https://arxiv.org/abs/2509.24326

>Hyperspherical Latents Improve Continuous-Token Autoregressive Generation
https://arxiv.org/abs/2509.24335

>SVGThinker: Instruction-Aligned and Reasoning-Driven Text-to-SVG Generation
https://arxiv.org/abs/2509.24299

>Light-SQ: Structure-aware Shape Abstraction with Superquadrics for Generated Meshes
https://johann.wang/Light-SQ

>FlashI2V: Fourier-Guided Latent Shifting Prevents Conditional Image Leakage in I2V Generation
https://pku-yuangroup.github.io/FlashI2V
>>
>mfw MORE Research news

>DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder
https://arxiv.org/abs/2509.25182

>GHOST: Hallucination-Inducing Image Generation for Multimodal LLMs
https://arxiv.org/abs/2509.25178

>Personalized Vision via Visual In-Context Learning
https://yuxinn-j.github.io/projects/PICO

>Aligning Visual Foundation Encoders to Tokenizers for Diffusion Models
https://aligntok.github.io

>Rolling Forcing: Autoregressive Long Video Diffusion in Real Time
https://kunhao-liu.github.io/Rolling_Forcing_Webpage

>Score Distillation of Flow Matching Models
https://arxiv.org/abs/2509.25127

>Not All Tokens are Guided Equal: Improving Guidance in Visual Autoregressive Models
https://arxiv.org/abs/2509.23876

>Towards Redundancy Reduction in Diffusion Models for Efficient Video Super-Resolution
https://arxiv.org/abs/2509.23980

>HunyuanImage 3.0 Technical Report
https://arxiv.org/abs/2509.23951

>GANji: A Framework for Introductory AI Image Generation
https://arxiv.org/abs/2509.24128

>Autoregressive Video Generation beyond Next Frames Prediction
https://arxiv.org/abs/2509.24081

>SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention
https://arxiv.org/abs/2509.24006

>Token Painter: Training-Free Text-Guided Image Inpainting via Mask Autoregressive Models
https://arxiv.org/abs/2509.23919

>Towards Fine-Grained Text-to-3D Quality Assessment: A Benchmark and A Two-Stage Rank-Learning Metric
https://cbysjtu.github.io/Rank2Score

>Texture Vector-Quantization and Reconstruction Aware Prediction for Generative Super-Resolution
https://arxiv.org/abs/2509.23774

>Semantic Editing with Coupled Stochastic Differential Equations
https://arxiv.org/abs/2509.24223

>Asymmetric VAE for One-Step Video Super-Resolution Acceleration
https://arxiv.org/abs/2509.24142

>Origins of Creativity in Attention-Based Diffusion Models
https://arxiv.org/abs/2506.17324
>>
File: deUN_cHD_00063_.png (2.12 MB, 1310x1310)
2.12 MB
2.12 MB PNG
>>
File: PW_143312_.png (2.02 MB, 1280x1800)
2.02 MB
2.02 MB PNG
I had to run off for a bit haha
Gonna go to the store too! Brb again
>>
File: 00027-4148845319.png (1.56 MB, 1536x864)
1.56 MB
1.56 MB PNG
>>106754839
pick me up a labatt tallboy and a pack of lucky red 100s
>>
File: deUN_cHD_00066_.png (2.29 MB, 1310x1310)
2.29 MB
2.29 MB PNG
>>106754850
gonna go "grab a pack of smokes" and never come back :(
>>
File: till_death.jpg (372 KB, 1920x1080)
372 KB
372 KB JPG
till death
https://youtu.be/LQiSEHDRzg8
>>
File: file.png (29 KB, 917x417)
29 KB
29 KB PNG
i need to be careful, this is the response for that Panic! at the disco rip off i posted the other day. fucking normals.
>>
>>106754941
I don't understand.
>>
File: deUN_cHD_00067_.png (3.13 MB, 1843x1147)
3.13 MB
3.13 MB PNG
>>106754941
line go up in a nice feeling
>>
File: 00024-4148845316.png (1.66 MB, 1536x864)
1.66 MB
1.66 MB PNG
>>106754949
it's the view graph for this https://youtu.be/trRoY0ngQjE
you be the judge. it was completely ignored here lmao, which wasn't unexpected.
>>
File: 00025-4148845317.png (1.76 MB, 1536x864)
1.76 MB
1.76 MB PNG
>>
File: deTF_cHD_00004_.png (3.11 MB, 2081x1190)
3.11 MB
3.11 MB PNG
>>
>>106755031
I think you might have something here. It's a pretty good song, it has structure and the chorus has the right hooks. I mean it's AI generated but you made a hit song of sorts.
>>
File: PW_143404_.png (2.94 MB, 1280x1800)
2.94 MB
2.94 MB PNG
>>106754850
You got it!
>>106754905
I love this song! I've listened to it quite a few times hahaha
>>106755031
I like it!
>>
File: 00035-1246292130.png (2.62 MB, 1536x864)
2.62 MB
2.62 MB PNG
>>106755087
suno excels at making "the median song," like it will never do anything really great but with some work it makes some pretty listenable tracks.
>>
>>106755108
Yeah this is what people like - it's catchy, somewhat progressive but still not too complicated and has a clear structure.
You could use Ableton or Cubase (or whatever) to edit these and switch around parts etc..
>>
File: 00037-3392621873.png (2.01 MB, 1536x864)
2.01 MB
2.01 MB PNG
>>106755118
supposedly the new studio thing lets you do that, if they're they're still doing the sale when i run out of credits i might just do upgrade. pw has access to it, idk if he's done anything yet (he hasn't). i do know how to use FL (more less), supposedly you can spit a song out as midi but idk what it is, can't imagine it's got a full suite of effects and vsts and automation, but who knows.
>>
File: deTF_cHD_00007_.png (3 MB, 2081x1190)
3 MB
3 MB PNG
>>
File: PW_143414_.png (2.88 MB, 1280x1800)
2.88 MB
2.88 MB PNG
>>106755144
The new studio thing is fun!
I've done a couple songs LOL but nothing I was gonna share yet haha mostly testing it out
I wanna make something really good and new in v5 first then really get into it!
>>
File: curse_night.jpg (352 KB, 1920x1080)
352 KB
352 KB JPG
WHAT A TERRIBLE NIGHT TO HAVE A CURSE!

curse night
https://suno.com/s/075oGhgpHlRgqoBI
>>
File: BL_SEK_BURG02.jpg (1.1 MB, 4608x3584)
1.1 MB
1.1 MB JPG
>>106755166
>>106755146
Hey a twofer!. What's good?
>>
>>106755144
I would find out how to generate individual audio tracks, then produce something on my own in a proper software. Then make a demo.
It's a long path of course especially if you are totally new.
>>
File: deTF_cHD_00012_.png (3.13 MB, 2081x1190)
3.13 MB
3.13 MB PNG
>>106755172
hey, sorry I missed you earlier today. you caught me during lunch. unusual time for you to pop in!
>>
File: JI_SE_LAYS_8.jpg (222 KB, 2048x2048)
222 KB
222 KB JPG
>>106755184
Hey man its no biggie. What's up?
>>
File: 00041-3940773168.png (2.32 MB, 1536x864)
2.32 MB
2.32 MB PNG
>>106755177
i can operate FL with some proficiency, my problem was always coming up with melody and variety. i'll give it a whirl at some point. i could see it being great for bolting vocals on a track tho. they need more voices, not sure how it works exactly because they do have some variety but not a lot
>>
File: PW_143443_.png (2.73 MB, 1280x1800)
2.73 MB
2.73 MB PNG
>>106755171
Nice!
>>106755172
Heyyy! It's great to see you again :]
Just relaxing at home! It was a great relaxing day haha
How are you?
>>
File: RL_SE_CT_COOK_2.jpg (1.01 MB, 4096x3940)
1.01 MB
1.01 MB JPG
>>106755253
Sounds nice. You had off work I suppose?
>>
File: 57906.png (3.52 MB, 1440x3120)
3.52 MB
3.52 MB PNG
>>106755253
thinking about doing a series of halloween stuff, not sure when to start. last year was fun
>>
File: 57914.png (3.81 MB, 1440x3120)
3.81 MB
3.81 MB PNG
while we're old posting, let me just say
>>
File: 57840.png (3.05 MB, 1440x3120)
3.05 MB
3.05 MB PNG
>>
File: 57852.jpg (328 KB, 1440x3120)
328 KB
328 KB JPG
we used to be a real country
>>
File: 15297-1760922233.png (961 KB, 576x1248)
961 KB
961 KB PNG
>>
File: PQ_SE_N_SODA_5.jpg (169 KB, 1792x2304)
169 KB
169 KB JPG
>>
File: 00064-3909172629.png (1.84 MB, 1536x864)
1.84 MB
1.84 MB PNG
>>
Maintain thread quality
https://rentry.org/debo
>>
File: 00069-4227813072.png (2 MB, 1536x864)
2 MB
2 MB PNG
>>
File: 00073-1993612533.png (1.79 MB, 1536x864)
1.79 MB
1.79 MB PNG
>>
>>106755209
I'm pretty sure some real musicians are using AI and then they'll just improvise on top to change the melodies. At least this is what I would do lol
>>
File: 00079-3996206602.png (2.02 MB, 1536x864)
2.02 MB
2.02 MB PNG
>>106755510
no doubt, and with post-processing almost no one will be able to tell. c'est la vie!

not sure what's going on with the hair coloring, it's weird. shit's been coming out with eyes there recently, and i'm starting to get concerned.
>>
File: 00077-3996206600.png (1.73 MB, 1536x864)
1.73 MB
1.73 MB PNG
>>
File: 00080-1794878236.png (1.67 MB, 1536x864)
1.67 MB
1.67 MB PNG
>>
i miss schizo anon
>>
File: PW_143439_.png (2.7 MB, 1280x1800)
2.7 MB
2.7 MB PNG
>>106755258
Yeah I had today off!! It's been a pretty good day so far! :]
>>106755289
Ohhh Halloween stuff sounds fun! It's about that time haha
Both music and gens!

Had to step out for a bit haha
>>
File: 00085-2177212171.png (1.76 MB, 1536x864)
1.76 MB
1.76 MB PNG
>>106755611
oh you did, did you?
>>
File: 00084-2177212170.png (1.66 MB, 1536x864)
1.66 MB
1.66 MB PNG
>>
File: waves.webm (3.66 MB, 1920x960)
3.66 MB
3.66 MB WEBM
https://www.youtube.com/watch?v=5II-WnW9OJo
>>
File: 00088-2098996753.png (2 MB, 1536x864)
2 MB
2 MB PNG
>>
File: PW_143419_.png (2.73 MB, 1280x1800)
2.73 MB
2.73 MB PNG
>>106755629
LOL I did!!
I like these halloween gens!
>>106755670
Nice anim!
>>
File: 00095-3031671821.png (1.86 MB, 1536x864)
1.86 MB
1.86 MB PNG
>>
File: ComfyUI_00032_.png (1.25 MB, 832x1216)
1.25 MB
1.25 MB PNG
>>
File: ComfyUI_00037_.png (1.26 MB, 832x1216)
1.26 MB
1.26 MB PNG
>>
File: 00099-919691088.png (1.86 MB, 1536x864)
1.86 MB
1.86 MB PNG
>>
File: 00098-919691087.png (1.88 MB, 1536x864)
1.88 MB
1.88 MB PNG
>>
File: ComfyUI_00038_.png (1.58 MB, 832x1216)
1.58 MB
1.58 MB PNG
>>
File: PW_143479_.png (3.05 MB, 1280x1800)
3.05 MB
3.05 MB PNG
>>
File: 00101-2225624989.png (1.74 MB, 1536x864)
1.74 MB
1.74 MB PNG
goodbye
>>
File: PW_143474_.png (3.2 MB, 1280x1800)
3.2 MB
3.2 MB PNG
>>106755801
Good night, Lumi! Sleep well :]
>>
File: waves 2.webm (3.71 MB, 1920x960)
3.71 MB
3.71 MB WEBM
>>106755692
Thanks! Nice to see you again.
>>106755801
Great song, and cool Halloween gens. Goodnight.
>>
File: PW_143504_.png (2.95 MB, 1280x1800)
2.95 MB
2.95 MB PNG
>>106756059
You as well :]
>>
File: PW_143509_.png (3.02 MB, 1280x1800)
3.02 MB
3.02 MB PNG
>>
Gm! Bot status? ;3



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.