[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


File: 1718932867913191.jpg (289 KB, 1024x1296)
289 KB
289 KB JPG
Previous /sdg/ thread : >>102825816

>Beginner UI local install
EasyDiffusion: https://easydiffusion.github.io
Metastable: https://metastable.studio
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Local install
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SD.Next: https://github.com/vladmandic/automatic
InvokeAI: https://github.com/invoke-ai/InvokeAI
AMD GPU: https://rentry.org/sdg-link#amd-gpu
Intel GPU: https://rentry.org/sdg-link#intel-gpu

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Run cloud hosted instance
https://rentry.org/sdg-link#run-cloud-hosted-instance

>Try online without registration
flux-dev: https://huggingface.co/spaces/black-forest-labs/FLUX.1-dev
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://aitracker.art
https://openmodeldb.info

>Black Forest Labs: Flux
https://huggingface.co/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>View and submit GPU performance data
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html

>Share image prompt info
4chan removes prompt info from images, share them with the following guide/site...
https://rentry.org/hdgcb
https://catbox.moe

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/c/kdg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/tg/slop
>>>/trash/sdg
>>
File: fSDG_News_00063_.jpg (404 KB, 896x512)
404 KB
404 KB JPG
>mfw Resource news

10/15/2024

>CtrLoRA: An Extensible and Efficient Framework for Controllable Image Generation
https://github.com/xyfJASON/ctrlora

>molmo-based image captioner for flux dev lora training
https://huggingface.co/quarterturn/molmo-flux-captioner

>Triton 3 wheels published for Windows
https://github.com/woct0rdho/triton-windows/releases

>Tex4D: Zero-shot 4D Scene Texturing with Video Diffusion Models
https://tex4d.github.io

>LVD-2M: A Long-take Video Dataset with Temporally Dense Captions
https://silentview.github.io/LVD-2M

>Depth Any Video with Scalable Synthetic Data
https://depthanyvideo.github.io/

>HART: Efficient Visual Generation with Hybrid Autoregressive Transformer
https://github.com/mit-han-lab/hart

>UniMatch V2: Pushing the Limit of Semi-Supervised Semantic Segmentation
https://github.com/LiheYoung/UniMatch-V2

>TurboReelGPT: Text to Youtube Shorts/ Tiktok Videos
https://github.com/TacosyHorchata/TurboReelGPT

>CLI tool for Flux model on Apple Silicon
https://github.com/mzbac/flux.swift.cli/releases

>EfficientViT: Multi-Scale Linear Attention for High-Resolution Dense Prediction
https://github.com/mit-han-lab/efficientvit

>Google to buy nuclear power for AI datacentres
https://www.theguardian.com/technology/2024/oct/15/google-buy-nuclear-power-ai-datacentres-kairos-power

>Free Video-LLM: Prompt-guided Visual Perception for Efficient Training-free Video LLMs
https://github.com/contrastive/FreeVideoLLM

>LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language Models
https://github.com/hanqiu-hq/LongHalQA

>Adobe’s AI video model is here, and it’s already inside Premiere Pro
https://www.theverge.com/2024/10/14/24268695/adobe-ai-video-generation-firefly-model-premiere-pro

>LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models
https://opendatalab.github.io/LOKI
>>
>mfw Research news

10/15/2024

>TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models
https://arxiv.org/abs/2410.10818

>Boosting Camera Motion Control for Video Diffusion Transformers
https://arxiv.org/abs/2410.10802

>Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations
https://arxiv.org/abs/2410.10792

>Sitcom-Crafter: A Plot-Driven Human Motion Generation System in 3D Scenes
https://arxiv.org/abs/2410.10790

>Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention
https://ir1d.github.io/Cavia

>DragEntity: Trajectory Guided Video Generation using Entity and Positional Relationships
https://arxiv.org/abs/2410.10751

>FlexGen: Flexible Multi-View Generation from Text and Image Inputs
https://xxu068.github.io/flexgen.github.io

>SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers
https://nvlabs.github.io/Sana

>Customize Your Visual Autoregressive Recipe with Set Autoregressive Modeling
https://arxiv.org/abs/2410.10511

>Vision-guided and Mask-enhanced Adaptive Denoising for Prompt-based Image Editing
https://arxiv.org/abs/2410.10496

>Towards Reliable Verification of Unauthorized Data Usage in Personalized Text-to-Image Diffusion Models
https://arxiv.org/abs/2410.10437

>FasterDiT: Towards Faster Diffusion Transformers Training without Architecture Modification
https://arxiv.org/abs/2410.10356

>Animate-X: Universal Character Image Animation with Enhanced Motion Representation
https://arxiv.org/abs/2410.10306

>Saliency Guided Optimization of Diffusion Latents
https://arxiv.org/abs/2410.10257

>LOBG:Less Overfitting for Better Generalization in Vision-Language Model
https://arxiv.org/abs/2410.10247

>MagicEraser: Erasing Any Objects via Semantics-Aware Control
https://arxiv.org/abs/2410.10207

>Identity-Focused Inference and Extraction Attacks on Diffusion Models
https://arxiv.org/abs/2410.10177
>>
>mfw MORE research news

>Will the Inclusion of Generated Data Amplify Bias Across Generations in Future Image Classification Models?
https://arxiv.org/abs/2410.10160

>Learning to Customize Text-to-Image Diffusion In Diverse Context
https://arxiv.org/abs/2410.10058

>TULIP: Token-length Upgraded CLIP
https://arxiv.org/abs/2410.10034

>NARAIM: Native Aspect Ratio Autoregressive Image Models
https://arxiv.org/abs/2410.10012

>Multi class activity classification in videos using Motion History Image generation
https://arxiv.org/abs/2410.09902

>Intermediate Representations for Enhanced Text-To-Image Generation Using Diffusion Models
https://arxiv.org/abs/2410.09792

>DuoDiff: Accelerating Diffusion Models with a Dual-Backbone Approach
https://arxiv.org/abs/2410.09633

>ControLRM: Fast and Controllable 3D Generation via Large Reconstruction Model
https://toughstonex.github.io/controlrm.github.io/

>Bridging Text and Image for Artist Style Transfer via Contrastive Learning
https://arxiv.org/abs/2410.09566

>TD-Paint: Faster Diffusion Inpainting Through Time Aware Pixel Conditioning
https://arxiv.org/abs/2410.09306

>ExpGest: Expressive Speaker Generation Using Diffusion Model and Hybrid Audio-Text Guidance
https://arxiv.org/abs/2410.09396

>Debiasing VLMs with Text-Only Training
https://arxiv.org/abs/2410.09365

>AM-SAM: Automated Prompting and Mask Calibration for SAM
https://arxiv.org/abs/2410.09714

>Toward Defining an Efficient and Expandable File Format for AI-Generated Contents
https://arxiv.org/abs/2410.09834

>Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy
https://arxiv.org/abs/2410.09873

>Understanding Robustness of Parameter-Efficient Tuning for Image Classification
https://arxiv.org/abs/2410.09845

>Messaging-based Intelligent Processing Unit (m-IPU) for next generation AI computing
https://arxiv.org/abs/2410.09961

>Mixture of Experts Made Personalized: Federated Prompt Learning for VLMs
https://arxiv.org/abs/2410.10114
>>
>>
File: 1703344833379542.jpg (1.46 MB, 2016x2592)
1.46 MB
1.46 MB JPG
>>
File: heh kiddo.jpg (160 KB, 768x1024)
160 KB
160 KB JPG
>>
File: output.webm (152 KB, 380x380)
152 KB
152 KB WEBM
>>
oops wrong one postd
>>
File: output.webm (171 KB, 380x380)
171 KB
171 KB WEBM
>>
>>
>>
File: output.webm (138 KB, 380x380)
138 KB
138 KB WEBM
>>
File: dena_00019_.png (2.59 MB, 1728x1344)
2.59 MB
2.59 MB PNG
>>
>>
so what is the point of posting the same images on here every single day?
>>
File: output.webm (171 KB, 380x380)
171 KB
171 KB WEBM
>>
>>102836295
no one posts the same images tho
>>
>>102836347
you do though
>>
>>102836360
prove it
>>
>>102836370
no need to, your reaction already did
>>
>>
File: dena_00018_.png (2.61 MB, 1728x1344)
2.61 MB
2.61 MB PNG
>>102836295
whats the point of coming to the general purely to try to start arguments?
>>
>>
>>102836429
I have a couple of theories but most are just miserable people that feel the need to spread their misery to others to feel a moment of whatever in their hate filled lives. and some people think it's le ebin humor to be shitlords.
>>
File: 0.jpg (263 KB, 1344x768)
263 KB
263 KB JPG
>>
File: 00103-855400565.jpg (791 KB, 1728x2160)
791 KB
791 KB JPG
got o.p. pic again.. so neat
>>
File: output.webm (165 KB, 380x380)
165 KB
165 KB WEBM
gn frens
>>
>>102836561
they are nice gens
>>102836569
gn
>>
File: dena_00016_.png (2.88 MB, 1728x1344)
2.88 MB
2.88 MB PNG
>>102836561
lucky. I think I'm forever banned from OP consideration

>>102836569
gn
>>
File: awoo2.jpg (281 KB, 1152x1536)
281 KB
281 KB JPG
>so what is the point of posting the same images on here every single day?
>>
>>
File: 00108-1397779172.jpg (452 KB, 1280x1720)
452 KB
452 KB JPG
https://files.catbox.moe/m9s38j.jpg
box du chat for op pic
>>
File: dena_00015_.png (2.54 MB, 1728x1344)
2.54 MB
2.54 MB PNG
>>
File: 2024-10-15_23-41-30_6186.png (1.18 MB, 896x1152)
1.18 MB
1.18 MB PNG
>>102835728
>>102835934
pretty good, how did you animate this?
>>102835868
>>102836067
nice
>>102836111
kino
>>
File: awoo3.jpg (315 KB, 1152x1536)
315 KB
315 KB JPG
>>
>>102835331
Curious how this MagicEraser compares to Lama cleaner.
>>
File: file.jpg (513 KB, 2048x1024)
513 KB
513 KB JPG
yeah idk. it's not really a sleeping cat. ive gone over everything multiple times, i must be missing something or rf-inversion just doesn't actually work, why can't they just release the code instead of their silly paper
>>
>>102837380
thx
>>
File: 0.jpg (291 KB, 1344x768)
291 KB
291 KB JPG
>>
File: dena_00013_.png (2.29 MB, 1728x1344)
2.29 MB
2.29 MB PNG
>>102837557
>why can't they just release the code instead of their silly paper
the story of so many promising ideas. "code soon!" and then the project disappears forever
>>
>>102837644
coding is hard
>>
File: file.png (639 KB, 1634x1236)
639 KB
639 KB PNG
>>102837644
honestly i've been working on this all day, gone over everything with chatgpt and it just doesn't seem to do the editing that it's suppose to, but i'm actually really stupid so what do i know
maybe ill make an issue on diffusers and ask them
but then its pretty easy to fake the results with other methods for whatever nefarious reasons someone could have for doing such a thing
>>
>>
File: dena_00012_.png (2.31 MB, 1728x1344)
2.31 MB
2.31 MB PNG
>>102837663
I'm not saying it isn't, I've just always assumed the code must have been working for these guys to write and submit papers on whatever they've written

>>102837687
>i'm actually really stupid
so relatable
>>
File: awoo5.jpg (476 KB, 1152x1536)
476 KB
476 KB JPG
>>
>>
so /ldg/ hates us because we like 1girls?
>>
>>102837862
/ldg/ doesn't hate us, they just talk about different stuff and aren't infested with avatarfags
>>
File: file.jpg (115 KB, 1024x1024)
115 KB
115 KB JPG
>>102837732
>the code must have been working for these guys to write and submit papers on whatever they've written
nah it's easy. just write some bullshit, use ip-adapter or whatever to make the demo results and don't release the code, then nobody can outright disprove your alleged results
i'd like to believe that's not what they've done, but there are so many papers with big claims and no code to back it up
like i'll keep working on it tomorrow but idk what else i can do different, as far as i can tell i've implemented what the paper says, i'm using the same hyperparameters from the paper, idk who to ask other than an issue on diffusers
>>
>>102837963
see >>102837592
>>
>>102837979
>SDG is full of a bunch of literal children (and their groomers) who are incapable of change and just want to spam 1girls
couldn't have summed it up better
>>
>>102837997
are you one of the groomers or children?
>>
>>102838000
Neither
>>
>>102838000
he was once the child but now the groomer
>>
>>102838000
I'm a groommy
>>
>>
>>102837997
>>102838000
>>102838016
>>102838017
>>102838048
samefag
>>
>>
>>102837862
You are the resident troll spammer but /ldg/ was originally deviced because /sdg/ was full of Hunyan shills and they started to complain about how this thread only 'endorses' stabilityAI and b.s. like that. I suppose even this was just mongrel from discord or something but maybe they were real shills. The archives are always there, please feel free to examine some of the threads. Now fuck off.
>>
>>102838143
why you mad tho
>>
>>102838159
I'm not "mad" or anything else.
Fuck off was just a friendly reminder.
>>
>>102838185
you sound like a groomer
>>
>>102838194
For the love of god, find some irl hobbies. You could be a master genner at this point.
>>
File: dena_00026_.png (2.97 MB, 1728x1344)
2.97 MB
2.97 MB PNG
>>102838064
its an ldg poster. the telltale signs:
- nogen
- "why is ldg better than sdg?"
- "why is sdg all pedophiles?"
it got worse when ldg died because he has literally nothing else to do but try to bait drama now
>>
>>
I'm so glad debo, the most trustworthy poster (this is proven by the pastebin), is here to set us straight!
>>
>>102838295
your obsession with him is weird
>>
>singular anon cope
>>
File: camera_02.jpg (342 KB, 1152x1536)
342 KB
342 KB JPG
>>
>>
File: 00119-172519130.jpg (631 KB, 2046x1376)
631 KB
631 KB JPG
>>
File: 00001-2386042304.png (2.03 MB, 896x1088)
2.03 MB
2.03 MB PNG
>>
>>
>>102838877
nice



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.