[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: 1723436212243220.png (1.22 MB, 896x1152)
1.22 MB
1.22 MB PNG
Previous /sdg/ thread : >>101882520

>Beginner UI local install
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Local install
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
SD.Next: https://github.com/vladmandic/automatic
AMD GPU: https://rentry.org/sdg-link#amd-gpu
Intel GPU: https://rentry.org/sdg-link#intel-gpu

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Run cloud hosted instance
https://rentry.org/sdg-link#run-cloud-hosted-instance

>Try online without registration
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://aitracker.art
https://openmodeldb.info

>Black Forest Labs: Flux
https://huggingface.co/black-forest-labs/FLUX.1-schnell
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>View and submit GPU performance data
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Share image prompt info
4chan removes prompt info from images, share them with the following guide/site...
https://rentry.org/hdgcb
https://catbox.moe

>Discord
6wUwtcJsr2

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg
>>
File: fSDG_News_00009_.jpg (550 KB, 1344x960)
550 KB
550 KB JPG
>mfw Resource news

08/14/2024

>Flux.1-Dev NF4 Quant v2: flux1-dev-bnb-nf4-v2.safetensors
https://github.com/lllyasviel/stable-diffusion-webui-forge/discussions/1079

>ComfyUI-Lumina-mGPT-Wrapper
https://github.com/Excidos/ComfyUI-Lumina-mGPT-Wrapper

>bigdata-pw-Dataception: Dataset of datasets
https://huggingface.co/datasets/bigdata-pw/Dataception

>InstantX / FLUX.1-dev-Controlnet-Union-alpha
https://huggingface.co/InstantX/FLUX.1-dev-Controlnet-Union-alpha

>Judge Advances Copyright Lawsuit by Artists Against AI Art Generators
https://www.hollywoodreporter.com/business/business-news/artists-score-major-win-copyright-case-against-ai-art-generators-1235973601/

>Integrating Saliency Ranking and Reinforcement Learning for Enhanced Object Detection
https://github.com/mbar0075/SaRLVision

>ComfyUI nodes for ControlNext-SVD v2
https://github.com/kijai/ComfyUI-ControlNeXt-SVD

>ComfyUi UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation
https://github.com/Isi-dev/ComfyUI-UniAnimate-W

>Stable Audio ControlNet
https://github.com/EmilianPostolache/stable-audio-controlnet

>ComfyUI_NAIDGenerator
https://github.com/bedovyy/ComfyUI_NAIDGenerator

08/13/2024

>Kohya training to enable Flux training with 12GB VRAM GPUs
https://github.com/kohya-ss/sd-scripts/pull/1374/files

>InstantX / FLUX.1-dev-Controlnet-Canny Updated
https://huggingface.co/InstantX/FLUX.1-dev-Controlnet-Canny/tree/main

>ClickAttention: Click Region Similarity Guided Interactive Segmentation
https://github.com/hahamyt/ClickAttention

>A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models
https://github.com/taehong-moon/ee-diffusion

>ComfyUI Dwpose TensorRT
https://github.com/yuvraj108c/ComfyUI-Dwpose-Tensorrt

>SSL: A Self-similarity Loss for Improving Generative Image Super-resolution
https://github.com/ChrisDud0257/SSL

>CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
https://github.com/THUDM/CogVideo
>>
>mfw Research news

08/14/2024

>Imagen 3
https://arxiv.org/abs/2408.07009

>Low-Bitwidth Floating Point Quantization for Efficient High-Quality Diffusion Models
https://arxiv.org/abs/2408.06995

>Prompt-Based Segmentation at Multiple Resolutions and Lighting Conditions using Segment Anything Model 2
https://arxiv.org/abs/2408.06970

>SceneGPT: A Language Model for 3D Scene Understanding
https://arxiv.org/abs/2408.06926

>Dynamic and Compressive Adaptation of Transformers From Images to Videos
https://arxiv.org/abs/2408.06840

>Token Compensator: Altering Inference Cost of Vision Transformer without Re-Tuning
https://arxiv.org/abs/2408.06798

>Improving Synthetic Image Detection Towards Generalization: An Image Transformation Perspective
https://arxiv.org/abs/2408.06741

>DiffLoRA: Generating Personalized Low-Rank Adaptation Weights with Diffusion
https://arxiv.org/abs/2408.06740

>DC3DO: Diffusion Classifier for 3D Objects
https://arxiv.org/abs/2408.06693

>Masked Image Modeling: A Survey
https://arxiv.org/abs/2408.06687

>Hybrid SD: Edge-Cloud Collaborative Inference for Stable Diffusion Models
https://arxiv.org/abs/2408.06646

>EditScribe: Non-Visual Image Editing with Natural Language Verification Loops
https://arxiv.org/abs/2408.06632

>Prompt Recovery for Image Generation Models: A Comparative Study of Discrete Optimizers
https://arxiv.org/abs/2408.06502

>Synthetic Photography Detection: A Visual Guidance for Identifying Synthetic Images Created by AI
https://arxiv.org/abs/2408.06398

>Response Wide Shut: Surprising Observations in Basic Vision Language Model Capabilities
https://arxiv.org/abs/2408.06721

>Breaking Class Barriers: Efficient Dataset Distillation via Inter-Class Feature Compensator
https://arxiv.org/abs/2408.06927

>Do Vision-Language Foundational models show Robust Visual Perception?
https://arxiv.org/abs/2408.06781

>ViMo: Generating Motions from Casual Videos
https://arxiv.org/abs/2408.06614
>>
File: 👨🏿---.jpg (125 KB, 1024x1024)
125 KB
125 KB JPG
>mfw Fag Resource news

08/13/2024

>Doghya training to enable Flux training with 1GB VRAM GPUs
https://github.com/kohya-ss/sd-scripts/pull/1374/files

>InstantX / FLUX.1-dev-Controlnet-Canny Updated
https://huggingface.co/InstantX/FLUX.1-dev-Controlnet-Canny/tree/main

>ClickAttention: Click Region Similarity Guided Interactive Segmentation
https://github.com/hahamyt/ClickAttention

>A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models
https://github.com/taehong-moon/ee-diffusion

>ComfyUI Dwpose TensorRT
https://github.com/yuvraj108c/ComfyUI-Dwpose-Tensorrt

>SSL: A Self-similarity Loss for Improving Generative Image Super-resolution
https://github.com/ChrisDud0257/SSL

>ZePo: Zero-Shot Portrait Stylization with Faster Sampling
https://github.com/liujin112/ZePo

>CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
https://github.com/THUDM/CogVideo

08/12/2024

>Linux Foundation Welcomes the Open Model Initiative to Promote Openly Licensed AI Models
https://www.linuxfoundation.org/press/linux-foundation-welcomes-the-open-model-initiative-to-promote-openly-licensed-ai-models

>PreciseControl: Enhancing Text-to-Image Diffusion Models with Fine-Grained Attribute Control
https://rishubhpar.github.io/PreciseControl.home/

>ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation
https://github.com/mc-lan/ProxyCLIP

>CSAM TURBO: Bonus oRthogonAl Token for Architecture Agnostic Textual Inversion
https://github.com/jamesBaker361/tex_inv_plus

>CumfyUI-RefUNet: nodes to use Reference UNets
https://github.com/logtd/ComfyUI-RefUNet

08/11/2024

>Plush-for-ComfyUI: Mixture of Agents and other Agentic Systems Workflows
https://github.com/glibsonoran/Plush-for-ComfyUI

>CmfyUI-SEGAttention: Smoothed Energy Guidance Node
https://github.com/lgtd/ComfyUI-SEGAttention

>SimpleTuner v0.9.8.1 - much-improved flux training
https://github.com/bghira/SimpleTuner/releases/tag/v0.9.8.1
>>
First for debo is the thread schizo
>>
>>
>>101891466
SD eyes
>>
no one cares about the paper spam called "news"
>>
>>101891521
given how dead these threads are you wonder why he bothers posting them anymore
>>
>>101891491
who cares about eyes tho
>>
2 years dude
think about that
>>
File: ComfyUI_00475_.png (1.98 MB, 1536x1152)
1.98 MB
1.98 MB PNG
>>
>>101891625
whom'st've am i talking to?
>>
>nigbo
>>
>>
>>
why is there a discord in op?
>>
https://rentry.org/debo
>>
>>101891580
Catjak created /ldg/
>>
>>101892040
based
>>
I like hlky's avatar, I think her lines are neat and the tattoos are better than ordinary tramp stamps
>>
>>
>>101892125
Hi hlky
>>
>>101892143
no I'm not tho
>>
File: images.jpg (11 KB, 188x268)
11 KB
11 KB JPG
>>101891440
>CSAM TURBO
>>
>>
File: file.jpg (207 KB, 1792x1024)
207 KB
207 KB JPG
too annoyed to nap desu. i'll just have another coffee
>>101892125
thanks. she only has tattoos sometimes, they're temporary
>>101892143
hi
>>
>>
>>101892040
please, call me ranfag
>>
>>
>>101892262
ywc
the lines are nice and fine, I like that about it
2 point for house big data
>>
File: schizo-thread.jpg (234 KB, 1024x768)
234 KB
234 KB JPG
>>
File: ComfyUI_00478_.png (1.72 MB, 1536x1152)
1.72 MB
1.72 MB PNG
>>
>>101892287
He won
>>
>>
File: 284655328.jpg (2.58 MB, 1792x2304)
2.58 MB
2.58 MB JPG
>>
>>101892385
Crazy, is this flux
>>
File: 1955254857.png (734 KB, 1024x1024)
734 KB
734 KB PNG
>>101892396
Yeah, it's very good, if only it didn't take 7 minutes to generate
>>
>>101891440
>https://github.com/lgtd/ComfyUI-SEGAttention
This vanished off the face of the earth, but an alternative implementation exists with https://github.com/pamparamm/sd-perturbed-attention which I have been using. Yes, it slows down generation but it actually does look better.
Additionally, I've been looking back at some older methods for reducing VRAM usage and came across Token Downsampling again, from February this year.
https://github.com/ethansmith2000/comfy-todo
This did work with Flux but the math behind the downsampling is really confusing, I don't understand how you can get a ratio of 0.75 tokens merged or eliminated with a downscale factor of 4 and etc. Regardless, it is working better than ToMe in ComfyUI that is built in. I would make a patch to officially include it but all the code is open source with no license so fuck that.
>>
>>101892419
Is this done locally? 7 minutes is way too long.
>>
>>101892385
the background faces are impressive in their sharpness and quality, I think
>>
>>101892425
CumUI
>>
File: 1452916299.png (1.12 MB, 1024x1024)
1.12 MB
1.12 MB PNG
>>101892435
Yeah, it's done on a measly 3060 so that's why.
>>101892438
I agree, no inpainting or anything.
>>
>>101892419
What gear are you using? Is this with comfy or forge?
>>
>>101892483
>Yeah, it's done on a measly 3060 so that's why.
Do you have catbox? I have a 4090 and want to recreate it.
>>
>>101892359
>got doxxed
>incessantly bullied for his furfag/loli fetishes
>goes to sleep/wakes up thinking about Debo/PW
he just can't stop winning
>>
/sdg/ is healing
>>
>>
>model draws ear too big
>inpaint it
>ear gets 15% bigger
...
>>
File: flux.png (11 KB, 577x150)
11 KB
11 KB PNG
Excuse the noob question, but where might i find low bit flux models?
NF4 are on civit but fp8e4/fp8e5 do not seem to be there.
Tried comfy site links but they crash.
I have more than enough VRAM but for some reason i can't get it to work.
>>
File: 2000474601.jpg (2.6 MB, 1792x2304)
2.6 MB
2.6 MB JPG
>>101892494
It's an ultimate sd upcale with 4 tiles of 960x1216, euler 20 steps, probably take ~1 min on a 4090.
>>
>>101892577
you set fp8e4/fp8e5 with the weight-dtype dropdown, it's not a separate model like nf4
>>
>>101892615
>ultimate sd upcale
How about tiled diffusion? Ultimate SD Upscale from my experience is extremely slow.
>>
>>101892615
where is little Margaret Thatcher from the previous gen?
>>
>>
File: ComfyUI_00379_.png (1.36 MB, 1024x1024)
1.36 MB
1.36 MB PNG
i'm trying the bnb nf4 thing and it's using the modelsamplingflux node which has max_shift and base_shift params, wtf are these and is there any reason to care about them?
>>
hello
>>
>>101892425
I thought token downsampling was more for secondary upscaling/detailing passes where granularity was less important. won't the downsampling impact adherence?
>>
>>101892755
howdy
>>
>>101892724
>wtf are these
no one knows
>>
File: grid-0062.jpg (394 KB, 1536x2688)
394 KB
394 KB JPG
>>
>>101892349
Cozy
>>
File: 5e4cmajnemid1.jpg (292 KB, 2048x2048)
292 KB
292 KB JPG
>>
>>101892758
You can do it for primary generation for the reduced resource usage, and I have been feeling my hardware is limiting what I can do with Flux resolution-wise. You are correct though as it's basically token merging improved.
It's available also for A111 as well for those wondering.
https://github.com/feffy380/sd-webui-token-downsampling
>>
File: ComfyUI_00485_.png (1.95 MB, 1408x1408)
1.95 MB
1.95 MB PNG
>>101892862
ty
>>
>>
File: grid-0064.jpg (346 KB, 1536x2688)
346 KB
346 KB JPG
>>
>****
>>
It's been a while since I've used SD, what's a good model to save for the future that might get banned soon? I have SD1.4, SD1.5 pruned and wd-v1-2-full-ema-pruned, can't even remember what the last one is.
>>
really reminds me of debo's multiday spergout when the basedbin was added to op
only with an older Debo who simply has no more power and has quite a bit of brain damage by now
>>
do you ever look in a mirror and ask yourself what you are doing thread schizo?
>>
File: 780267741.jpg (2.6 MB, 1792x2304)
2.6 MB
2.6 MB JPG
>>101892489
forge, nf4-v2 with t5 fp16
>>101892658
It's just the limit of the gpu, also xformers doesn't work currently so that doesn't help either.
>>
File: ComfyUI_00394_.png (1.16 MB, 1024x1024)
1.16 MB
1.16 MB PNG
>>101893048
those aren't good models and they're still widely available loll
>>
>>101893293
Yeah that's why I'm asking anon.
>>
>>101893048
Now that you mentioned it, i wonder why no one made a Bstaber XL or Based64 Pony.
>>
>>
pathetic
>>
File: file.jpg (418 KB, 1792x1024)
418 KB
418 KB JPG
probably drop what i have for flickr tomorrow with a note that the aim for the next version is 500m, should have like 250m+ by the morning
hopefully big drop will finally start some recognition, diffusion1b is big but limited usage compared to flickr
for now i think maccies and x-files will cheer me up
>>
like we need a blogger baka
>>
File: 1girl_1_31.jpg (872 KB, 2048x2048)
872 KB
872 KB JPG
>>101893337
it really depends on what you want to do, vanilla sdxl and sdxl turbo, as well as flux and flux schnell are all really good to have for multi-step workflows as they're the best at prompt following and creativity, if you just want to 1girl some pony model + lora to taste will be your best bet, i like confetti and wai ani nsw for anime and pony realsim for realism, there are various other mixes probably worth trying

this is done with juggernaut as an upscale pass
>>
>>101893359
Fucking hell I know I'm a lazy fag but when I fiddled a little bit with SD, all I needed to do was install the GUI and run the SD1.4 model for it to work.
I'm downloading CyberRealistic_V5_FP32.safetensors now but it says checkpoint model for SD1.5. Does that mean I use it only together with SD1.5?
>>
File: 1girl_1_32.jpg (967 KB, 2048x2048)
967 KB
967 KB JPG
>>101893554
no that's a full checkpoint, you don't need anything else
>>
>>101893552
>vanilla sdxl and sdxl turbo, as well as flux and flux schnell
So if I understand you correctly, those are general purpose models and the other stuff is I download a model with a specific woman already generated and create images from that?
I didn't get very far probably a year ago but I remember generating the same woman with different clothes, leaving all settings as is except for the clothing keywords.
>>
File: file.jpg (404 KB, 1792x1024)
404 KB
404 KB JPG
>>101893519
we don't need nogens complaining or spamming their schizophrenic obsessions with other posters yet here you are
>>
touchy hlky this is not /soc/
>>
>it was hlky all along
>>
>>101893607
Ok thanks. I think I'll just download a few models and start to get into the topic again.
It's been so long that I've never even seen half of AUTOMATIC1111s current options.
>>
thread schizo
>>
Anybody got a good pinup pose on simple background?
>>
>>101893650
this one goes into the hlky folder
>>
File: ComfyUI_temp_geekj_00003_.png (2.64 MB, 1280x1280)
2.64 MB
2.64 MB PNG
first flux depth controlnet test... don't like that you have to use their sampler node
>>
din jävla mamma
>>
Horunge Bög
>>
>schizophrenic
>>
so /ldg/ only exists because anons wanted to get away from the thread schizo?
>>
>>101894299
right, and said thread schizo is currently spamming CP on there
>>
>>101894299
no, it started because pixart chang wanted a safespace and was co-opted as ranfag's blog to bitchpost about debo and pw
>>
>>101894299
>>101894322
>>101894335
/ldg/ is 4chan's version of a reddit hugspace
>captcha: THIS
>>
so its debo, gotcha
>>
>>101894428
I honestly can't imagine how sad that subhuman's life is though. I feel like one of these days he's going to disappear because he offed himself irl.
>>
File: 1girl_1_48.jpg (759 KB, 1536x2048)
759 KB
759 KB JPG
>>
>>101894335
Baking your cp on proxy debo?
>>
File: file.jpg (834 KB, 1792x1024)
834 KB
834 KB JPG
maccies was nice except for the lunatic driver who got mad at me for stopping at crossing and almost ran over the guy crossing the road. wish i could live somewhere else desu
>>101893913
not sure how to feel about this
>>
>>
>>101894475
nope, just a normal anon that thinks debo and ran are both massive faggots
>>
it's no surprise that you are so lonely Debo
just look at what you are doing
>>
Mods please ban debo post on sight and he will leave /g/ for good
>>
File: ComfyUI_00399_.png (1.56 MB, 1024x1024)
1.56 MB
1.56 MB PNG
can someone link the 25% faster flux workflow that sets the CFG to 1 for the last half of the gen or whatever, i'm trying the dynamic thresholding workflow and it's def a bit better at prompt following but the gens are slow and things are coming out kinda overbaked

(this is with the same settings as the "Hatsune Miku, 80's anime drawing style" example)
>>
>>101894476
many are not sure, but you're in good company
I am not some malicious person, I just like pictures
>>
>>
File: delux_flebo_00013_.png (1.17 MB, 1216x832)
1.17 MB
1.17 MB PNG
>>101894428
it cannot be understated how important and influential I am
>>
imagine having a pastebin about your past actions on fucking 4chan AND anons creating a new general just to get away from you
LMAO
>>
>>101894679
I unironically hope you get a job working with immgen
>>
>>101894720
>job
lol
>nonslop ai gens
lmao
>>
>>
Mind the thread schizo
https://rentry.org/debo
>>
>>101894679
Is there a reason you are spamming cp on /ldg/?
>>
File: delux_ra_00025_.png (1.92 MB, 1536x1152)
1.92 MB
1.92 MB PNG
>>101894720
that'd be cool. suno woulda been my dream job but they left me on read
>>
>>101894757
can we cross reference the post timings? looks sus af
>>
>>101894770
why do they have such a long neck?
>>
File: file.png (85 KB, 1173x960)
85 KB
85 KB PNG
>>101894770
openai have my dream job. i haven't applied for it though. i need more big data drops first desu
>>
File: 1girl_1_52.jpg (944 KB, 1536x2048)
944 KB
944 KB JPG
i wish flux was better at making girls thicc but the vibes are great
>>
File: 2548701152.jpg (3.1 MB, 2688x1536)
3.1 MB
3.1 MB JPG
>>
>the thread schizo containment general works
>>
>>101894878
not really, he's spamming cp on /ldg/
>>
What's the distilled CFG for?
No one knows?
>>
>>101891401
>train lora on flux output, no nudity except inpainted
>the nipples start to disappear in some of the outputs

am I being schizo
>>
>>101894906
I wish debo was here
He could explain the situation and everything around it
>>
File: 1girl_1_55.jpg (1019 KB, 1536x2048)
1019 KB
1019 KB JPG
>>101894861
lol the flux gens always with the nails
>>
File: delux_ra_00026_.png (2.33 MB, 1536x1152)
2.33 MB
2.33 MB PNG
>>101894838
the official big data gang. I wonder how many dataset likes you need before openai is willing to look at a resume

>>101894893
>every poster is debo!!
real schizo behavior

>>101894906
>distilled CFG
all I know is what illy posted in his update write-up:
>Flux-dev is a distilled model. It is recommended to set CFG=1 and then do not use negative prompts. Using “Distilled CFG Guidance” instead. The default value is 3.5. Note that if CFG=1, the UI of negative prompt will be greyed out.
>>
File: 0.jpg (508 KB, 2048x1024)
508 KB
508 KB JPG
>>
>>101894861
nice
>>
>>101894965
finally run out of ips?
>>
>>101894965
Thread schizo
>>
File: file.png (244 KB, 1087x1044)
244 KB
244 KB PNG
>>101894965
probably a lot more than i have atm desu
>>
File: 00733-3857767347.jpg (444 KB, 1248x1656)
444 KB
444 KB JPG
https://voca.ro/19HHTMCLdSYW
took a midday depression nap, and i am now refreshed and ready to .. discuss technology
>>
>>
>>101894835
that's so they can take it deeper
>>
File: delux_ci_00019_.png (1.63 MB, 1536x968)
1.63 MB
1.63 MB PNG
>>101894988
no. I'm omnipotent. I'm all knowing and all seeing. I'm every poster. I'm in your walls. I'm the last thing you see before you sleep and the first thing you see when you wake. I am your entire reason for existing. I am your God.
>>
>>101894770
yeah but whát exactly would you be doing there? surely not clicking 'gen' but coding for music generation?
If you wanted to make it with the music one can still create an ai album and release it, or remaster that irl imhodesu
>>
overcooked
>>
>>101895089
How does it feel to have catjak take everything from you?
>>
>>101894679
what exactly have you done besides rot in this thread all day every day for the past year?
>>
>>101895051
I read 'repression nap' and had to think of Britain
poor Britain, havin' a normal one
>>
>>101895089
>2 years of schizo posting
>>
>>101895051
>took a midday depression nap,
nobody cares
>>
nick fe ultimate cringe
>>
File: delux_ra_00028_.png (1.42 MB, 1536x1152)
1.42 MB
1.42 MB PNG
>>101895125
I was very well-suited for several of their swe openings. the intersection between music and AI is what would make working with them super cool but my actual work would be software. unless they have a "click gen all day" opening, then I'd prefer that
>>
>>101895190
The pastebin says otherwise
>>
>>101895190
I would take that position too, the click one
well there's bound to be competitors, one could pre-emptively market ones selves as suited for the future position of swe
>>
>bimbo lips
gross
>>
>>101891418
>>InstantX / FLUX.1-dev-Controlnet-Union-alpha
>https://huggingface.co/InstantX/FLUX.1-dev-Controlnet-Union-alpha
Nice
>>
File: 00083.png (1.25 MB, 1024x1024)
1.25 MB
1.25 MB PNG
>>101895190
>I was very well-suited for several of their swe openings.
lmao sure you were bud... sure you were...
>>
>>101895215
pastebin?
>>
>>101895215
does it have archives of his posts where he admitted the whole reason he huffs comfy's farts is because he's because he can't into math or coding himself?
>>
>>
File: 1girl_1_66.jpg (491 KB, 1536x2048)
491 KB
491 KB JPG
aight i'm gonna go on a walk in the forest but a casual suggestion, what if instead of every mentioning debo ever again y'all just let him be

you retards constantly talking about him are way more annoying than he could ever hope to be, ofc he's taking the piss w/ shit like this

> no. I'm omnipotent. I'm all knowing and all seeing. I'm every poster. I'm in your walls. I'm the last thing you see before you sleep and the first thing you see when you wake. I am your entire reason for existing. I am your God.

i would be too if people dedicated so much of their brains to me
>>
>>101895345
Holy newfag
>>
>>101895282
Yes
Also the rentry in /ldg/ as well
>>
>>
File: delux_ra_00029_.png (2.57 MB, 1536x1152)
2.57 MB
2.57 MB PNG
>>101895282
I can code fine, just not math in python. I couldn't write a stochastic solver if my life depended on it

>>101895345
you can't simply ask schizos to stop being mentally ill, but I appreciate you trying. enjoy the forest
>>
>this is the place i want to be
>a thread full of people that hate me
>>
>>101895421
>I can code fine
you literally can't lmao
>>
>>
File: 1882495562.jpg (2.42 MB, 1536x2688)
2.42 MB
2.42 MB JPG
>>
File: 000000_16299_.png (2.28 MB, 1075x1434)
2.28 MB
2.28 MB PNG
>any new LoRas for Flux today?
>>
File: lazypepe.png (2.12 MB, 1018x1018)
2.12 MB
2.12 MB PNG
>>
File: 00747-4261048446.jpg (740 KB, 1560x2064)
740 KB
740 KB JPG
https://youtu.be/LQhX8PbNUWI?si=UKT6cMduAWX1_IkV
>>
>>
>>101895421
>I can code fine
post github
>>
>>101894965
>Flux-dev is a distilled model. It is recommended to set CFG=1
Thanks. That's a good starting point.
>>
File: delux_ci_00021_.png (1.59 MB, 1536x968)
1.59 MB
1.59 MB PNG
>>101895676
you have to be joking
>>
>>
>>101895713
This you?
https://desuarchive.org/g/thread/96474271/#q96477123
Please read the rentry/pastebin before engaging newfriends
>>
File: orc3.png (2.07 MB, 1024x1024)
2.07 MB
2.07 MB PNG
>>
>>
>>101895275
all those proxies just to spam report an insult on 4chan
>>
File: file.jpg (218 KB, 1792x1024)
218 KB
218 KB JPG
>>
>>101895888
>he's wearing a different girl on his shirt
lel
>>
>>101895842
no need to whine about rules (occasionally) being enforced.
>>
>>101895842
Oh he's doing it again?
That's why we know he's the cp spammer
>>
It's getting sad at this point
>>
>>101895912
don't make it too obvious
>>
File: PW_82425_.png (895 KB, 1024x768)
895 KB
895 KB PNG
Hello again, anons! Totally passed out haha
>>
>>
File: delux_flebo_00009_.jpg (522 KB, 1344x960)
522 KB
522 KB JPG
>>101895974
hello
>>
File: file.jpg (235 KB, 1792x1024)
235 KB
235 KB JPG
>>101895907
she was not happy desu
>>
File: PW_82427_.png (900 KB, 1024x768)
900 KB
900 KB PNG
>>101896048
Hey again, Debo! You were right hahaha that was way too early for me to be up
That gen is so cute! :]
>>
>GIVE ME BACK MY PLUSHIE YA SLOOT
>>
Flux's hand gen is great, the face and skin is weird
>>
I know anons are angry about what happened to /sdg/ but please don't bother yelling at them.
>>
>>101896134
No one is yelling here, what are you talking about? /ldg/ brought it on themselves. This is a safe space now.
>>
>>
File: file.jpg (348 KB, 1792x1024)
348 KB
348 KB JPG
>>
what is happeing today, this AM, this site was down, now github.
>>
>>101896121
Think I'll stick with SD1.5 until I read up more on how to get Flux to gen better esthetics.
>>
File: file.jpg (318 KB, 1792x1024)
318 KB
318 KB JPG
she made me swap purple witch plushie for blue dino plushie
>>
File: PW_82416_.png (892 KB, 1024x768)
892 KB
892 KB PNG
>>101896186
Cute witch in the back haha! Oh yea I wanted to ask you, is that an easy way to interrogate hella images at once? It's so tedious the way i'm doing it LOL
>>101896266
Hahaha cute!!
>>
>>101896235
you should just jump right in and play with it and figure out as you go
even sdxl had models with great aesthetics superior to humu
>>
File: PW_82361_.png (824 KB, 1024x768)
824 KB
824 KB PNG
>>101896299
there* whoops haha still half asleep
Time to get more coffee
>>
>>
>>101895751
>no reply
Kek
>>
File: cfg_mode_00001_.png (1.2 MB, 768x1024)
1.2 MB
1.2 MB PNG
>>101894587
srsly does anyone have this, i remember it was posted a few threads ago
>>
>>101894587
>>101896450
cant help you there, anon, no idea what you're talking about
>>
File: maid.jpg (601 KB, 1248x1824)
601 KB
601 KB JPG
>civitai down once again
>>
File: file.jpg (348 KB, 1792x1024)
348 KB
348 KB JPG
>>101896299
clip interrogate? what are you using? it's fairly simple to put together yourself with batch support if there isn't already and caching/reusing the text embeds if it's not doing that. the library i used to develop could do it fast desu
clip interrogate isn't great anyway, the standard method depends on the content of the list, like it returns the most similar out of that list but the text may not actually be similar to the image. it's a bit better if you get the actual cos sim value and use thresholds but the appropriate threshold depends on which clip model and it's still hit and miss. this is why the stable diffusion safety checker doesn't work properly
you'd be better off using something like florence-2-large for real images, ive liked the detailed_caption mode so far. for anime i think there are caption models based on danbooru tags
>>
File: cfg_mode_00002_.png (1.21 MB, 768x1024)
1.21 MB
1.21 MB PNG
>>101896556
flux is 2x slower if the CFG isn't set to 1, but there's "one weird trick" that makes it better at following prompts that works by setting the CFG to ~6 and the negative guidance to 10 (but still leaving it blank)

some anon figured out that you can set the cfg to 1 for the later steps when it doesn't really change the output but it makes the overall generation faster because you're not getting the speed penalty of non-one cfg for some of the steps
>>
>>101895751
Debo sisters?
>>
File: ahhhhh2.png (8 KB, 1092x153)
8 KB
8 KB PNG
>>101896616
>Something is going on behind the scenes today, gut says more consolidation.
>>
>>101896615
>>101896598
do you really think this is how it works? how distorted is your mind?
>>
>>101896630
maybe do a search on one of the archive sites
>>
>>101896630
you want adaptiveguider
>>
>>101892892
Is that flux? because if so it looks really nice
>>
File: PW_82455_.png (799 KB, 1024x768)
799 KB
799 KB PNG
>>101896617
Yeah! Uhh something on huggingface
It works well enough, but it's super slow so it's kind of annoying and totally makes me not wanna do a huge thing with hella images to train
Thanks so much! I think I have Florence already downloaded too :]
>>
nice placement lel
>>
File: florence-large.png (3.21 MB, 1718x3130)
3.21 MB
3.21 MB PNG
>>101896730
there's https://github.com/rom1504/clip-retrieval made by one of the laion guys, it's a bit overkill and not really designed for clip interrogation, the clip-inference part would let you embed images fast which you could then run interrogation on
but yeah better off using a caption model
>>
>>
>>101896852
>caption model
Which one is recommended?
>>
>>
File: PW_82461_.png (781 KB, 1024x768)
781 KB
781 KB PNG
>>101896852
Thanks so much!! I'll give that a look :]
>>
File: ITSALIVE.png (45 KB, 752x583)
45 KB
45 KB PNG
>>101896642
Anon, we get it, you're a jelly Artist that can't into local. POST A GEN.

>IT'S ALIVE!!
>>
>>101896937
What does that have to do with him lying?
>>
>>101896307
>humu
Humu is just a great merge which yields excellent skin texture. SDXL has some great models as well but it's very fragile outside of specific parameters, reverting to that blurry plain skin look.
Perhaps you can prompt for better skin texture. I'll try that on flux.
>>
>>101896952
what lie
>>
>>101896952
This isn't High School Anon, we don't give a @#%.. Post gens.. kek
>>
File: 00769-3761061215.jpg (529 KB, 1152x1544)
529 KB
529 KB JPG
>>
>>101896957
if you're on a111/forge, i found using a bit of dynamic threshholding brings out more details like skin on sdxl, like a cfg of around 10-12 and scaling down to whatever the model likes
i kind of get it tho, i had one model i liked on 1.5 (i think it was yafm or azovyaphotoreal) and didnt bother switching to sdxl til i found something comparable
you cant really beat flux with its benefits tho, but i do expect checkpoints and other things based on it will start popping up soon
>>
>>101896971
Read the chain
>>101896973
We did and debo harassed everyone to /ldg/
>>
>>101896875
i've tested a few others recently, i was out for lunch using hf spaces on my phone so i can't remember the names other than moondream, the others were all too verbose and moondream made stuff up. i'd recommend florence. it supports other useful tasks like bounding boxes around objects and ocr, base is like 0.2b and large is 0.7b so it runs really fast
>>101896926
i'd stick with clip-L desu, or even use clip-B. H, g and bigG are just too slow for the image part. i did a lot of image embedding before for some project and i remember getting 13 image/sec with H when L was like 100+
>>
>>101897010
oh and you can use a bit of dynamic threshhold on flux as well and another extension called detail demon
i know comfy has dynthresh nodes which are kind of similar (they seem less effective than forge's), and i'm sure there's a bazillion other nodes you can play with
guess i just dont want to have to keep testing models and things when flux does 80-90% of what i want as of now so i'm good with just playing with the new hotness for now
>>
>>101897031
>florence
You mean this one?
https://huggingface.co/microsoft/Florence-2-large

Is there any workflow on how to properly use it?
>>
>>101897014
>We did and debo harassed everyone to /ldg/
I've been here since the start and splitting the thread is dumb. Just have a Latent Diffusion General, encompassing all models etc. I dunno..
>>
>>101897055
this seems reasonable desu, but i came here after the threads had already split. just seems like discord drama that's unnecessary for 4chan
>>
>>101897010
>checkpoints and other things based on it will start popping up soon
Interesting. Would be great if that was allowed. I'm not sure it is. Didn't read the licence. Would be great to have the hands of flux and the textures of humu.
Prompting skin texture doesn't give me what I want. Nice composition with flux.
>>
>>101897010
>>101897099
>checkpoints and other things based on it will start popping up soon

https://civitai.com/models/645943/fluxunchained-artful-nsfw-capable-fluxd-tuned-model-by-socalguitarist
>>
File: florence-base.png (3.22 MB, 1718x3147)
3.22 MB
3.22 MB PNG
>>101897054
yeah, or the base version, see pic related for base examples and above for large, the short caption is better in some cases with base but it misses details like text on detailed_caption, i'd say detailed_caption on large is the best
if you mean ui integration i'm not sure if there is yet, there's example code on the hf repo
>>
File: 00085-3388037189.jpg (192 KB, 1024x1376)
192 KB
192 KB JPG
>>
>>101897099
not the pro version or whatever of course, but the dev and schnell are apache license i believe

>>101897106
>Total training was about 5k images spread across SFW cinematic stills, art photography, LaION art-pop, about 1500ish explicit and artful nudes about 80% photography, 20% AI/illustrative nudes.
seems like a good start
>>
>>101897108
Alright thanks anon
>>
File: 0.jpg (458 KB, 2048x1024)
458 KB
458 KB JPG
>>
File: file.png (1.08 MB, 1257x466)
1.08 MB
1.08 MB PNG
>>101897132
Same guy who created the VisionXL models so I thought it's probably a good shout
>>
>>101897163
just wait
>two more weeks
and you'll see some serious shit, just based on how fast sdxl models came out

>20+ gb per model
welp time to free up some more space on my drive i guess
>>
File: file.png (1.96 MB, 1522x1640)
1.96 MB
1.96 MB PNG
>>101897139
here's what i mean about moondream, ive pasted the original image in because gradio hides the corner with the label
some of the details are nice like 'vintage' and 'cobblestone street', not sure on turquoise vs green but if it's going to make stuff up like 'windows in the background' all the captions would need reviewing/editing. at least with florence everything seems to be accurate
one of the other models i tested with this image said the one of the building doors was white, or that the car rims were black
>>
File: 00055-140332475.jpg (92 KB, 1120x1640)
92 KB
92 KB JPG
>>
>>
>>
File: 00071-1086714175.jpg (118 KB, 832x1216)
118 KB
118 KB JPG
>>
>>101896616
catbox please dearest anon
>>
File: PW_82446_.png (866 KB, 1024x768)
866 KB
866 KB PNG
>>101897031
I just found this thing, and it seems to work out pretty good! It loads up the text in 1 second haha
>https://github.com/Hangover3832/ComfyUI-Hangover-Nodes
Thanks for the help! I really appreciate it :]
>>
>>
>>101897448
nta but there is something else to look out for
https://huggingface.co/spaces/fancyfeast/joy-caption-pre-alpha
It's yet a little jank but it actually captions erotic scenes, hopefully that guy will really polish it. you can just grab the weits from the files section and run locally.
>>
>>
File: RetroXL0004.jpg (195 KB, 1320x1320)
195 KB
195 KB JPG
>>
File: SDXL_3.jpg (517 KB, 1232x1528)
517 KB
517 KB JPG
>>101897404
Love this any chance of a catbox? Use any Lora's or anything?
>>
>>101891401
can you combine loras, eg angalina jolie punches brad pit?
>>
>>
i switched to kolors and flux based on /g/'s recomendations and XL is way better
the fuck is wrong with you sluts
too much cock in your pussies?
you gone crazy?
need a lil foot rub to fix your brains?
>>
/o/ retard here /aicg/ sent me here. I feel like an absolute boomer asking but how or where do I generate all those AI pictures that are everywhere on the internet, I'm guessing there are text to image generators but which ones are the most popular/accurate. I have seen some images on /pol/ of indans riding rockets made out of shit and stuff like that. How the fuck you generate something like that, please explain sorry for being a retard. Just checked the sticky and metastable looks like the most user friendly but I can´t see the requirements. I'm using a laptop with 8 GB ramn ryzen 5. I don´t need anything fancy just want to generate random images for fun. Any hope for me or should I go back to ooga booga wrenching
>>
>>101897638
You're better off using online services instead. Head over to this general >>101896094
>>
>>101897638
just use bing's image generator, it's for casuals and apparently decent enough
>>
File: PW_82433_.png (1.07 MB, 1024x768)
1.07 MB
1.07 MB PNG
>>101897540
Whoa, this one is super detailed! Unfortunately it took quite some time tho, and i'm too lazy to wait hella long hahaha!
I might try to run it locally tho!
Thanks, anon! :]
>>101897581
You can add both LoRAs and then proompt for the punching haha
>>101897638
Welcome to /sdg/, anon! I used to have 8gb and it worked just fine
A bit of a wait tho haha
I would recommend using Comfy for your UI
>https://rentry.org/comfyui
there are examples of how things work too :]
>https://comfyanonymous.github.io/ComfyUI_examples/
>>
>>101897703
>haha
pls just dont type it, you are are sounding like a creep
>>
>>101897719
haha
>>
>>
>>101895635
What a disturbing picture, damn!
>>
>>101897719
had to do it, anon
>>101897777
isnt it? nice quads
>>
>>
>>101897677
>>101897682
stupid shir requires to log in with a microsoft account I think I will try >>101897703
>>
File: PW_80831_.png (812 KB, 1440x512)
812 KB
812 KB PNG
>>101897719
LOL get over it, anon!
>>
File: 1717172253410341.jpg (35 KB, 640x640)
35 KB
35 KB JPG
>>101891401
hey anons, does anyone know what model https://www.pixiv.net/users/8786609 (some nsfw warning) is using?
peeps over at /h said it was novelai, but since when can novelAI pull such styles?
>>
>>101897629
>breath taking graphics
kek, sonovabitch
>>
>>101897879
i think you can do like 3 for free per day without account, no? i never used it
with your specs, you wont be making but one image every 10+mins so not much difference
>>
>>101897949
>novelAI
I mean NAI does have one of the better anime models. Their latest one destroyed sdxl in terms of prompt comprehension and style. More than likely Flux will be able to compete once anime finetunes/loras begin to mature more
>>
File: 000000_16309_.png (2.24 MB, 1075x1434)
2.24 MB
2.24 MB PNG
>>
File: 00788-3702993693.jpg (985 KB, 1560x2064)
985 KB
985 KB JPG
'haha' makes some ppl very upset
>>
flux just gave up on realism for me lel
>>
File: 121402770_p0.jpg (995 KB, 1216x832)
995 KB
995 KB JPG
>>101897997
Hope so, because image related is a cool style, love the whole 3d with 2d thingy, and haven't been able to mix a model that can do that at all, unless cheating with the 3d tag but that makes everything ugly. Is creating lora's on AI images still a no-no? Could I get away on training a lora on his gens?
>>
>>101897997
You sound a lot like a shill.
>>
>>101898058
I'm a Jew who hunts the souls of Muslims

I'm a shilligami
>>
File: PW_82473_.png (724 KB, 1024x768)
724 KB
724 KB PNG
New Thread!!!
>>101897973
>>101897973
>>101897973
>>
Filled!
>>
>>101895998
worrisome toes



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.