[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: 1716613427736965.jpg (482 KB, 1344x768)
482 KB
482 KB JPG
Previous /sdg/ thread : >>101693167

>Beginner UI local install
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Local install
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SD.Next: https://github.com/vladmandic/automatic
AMD GPU: https://rentry.org/sdg-link#amd-gpu
Intel GPU: https://rentry.org/sdg-link#intel-gpu

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Run cloud hosted instance
https://rentry.org/sdg-link#run-cloud-hosted-instance

>Try online without registration
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://aitracker.art
https://openmodeldb.info

>Black Forest Labs: Flux
https://huggingface.co/black-forest-labs/FLUX.1-schnell
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>View and submit GPU performance data
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Share image prompt info
4chan removes prompt info from images, share them with the following guide/site...
https://rentry.org/hdgcb
https://catbox.moe

>Discord
6wUwtcJsr2

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg
>>
File: 00018-3312531984.jpg (369 KB, 1336x1336)
369 KB
369 KB JPG
Pbbbbbbbbt
>>
>>101697538
A greenhouse train? That looks really cool.
>>
File: ComfyUI_30506_.png (939 KB, 768x1024)
939 KB
939 KB PNG
>>
File: FDG_News_00004_.jpg (807 KB, 1344x768)
807 KB
807 KB JPG
>mfw Resource news

08/02/2024

>Optimizing Diffusion Models for Joint Trajectory Prediction and Controllable Generation
https://yixiaowang7.github.io/OptTrajDiff_Page

>UniTalker: Scaling up Audio-Driven 3D Facial Animation through A Unified Model
https://github.com/X-niper/UniTalker

>Smoothed Energy Guidance for SDXL
https://github.com/SusungHong/SEG-SDXL

>Mitigating Multilingual Hallucination in Large Vision-Language Models
https://github.com/ssmisya/MHR

>GalleryGPT: Analyzing Paintings with Large Multimodal Models
https://github.com/steven640pixel/GalleryGPT

>The Manga Whisperer: Automatically Generating Transcriptions for Comics
https://github.com/ragavsachdeva/magi

08/01/2024

>Stable Fast 3D: Rapid 3D Asset Generation From Single Images
https://stability.ai/news/introducing-stable-fast-3d

>Announcing Black Forest Labs
https://blackforestlabs.ai/announcing-black-forest-labs

>Flux: The Next Leap in Text-to-Image Models
https://blog.fal.ai/flux-the-largest-open-sourced-text2img-model-now-available-on-fal

>ComfyUI: Basic Flux Schnell and Dev model implementation
https://github.com/comfyanonymous/ComfyUI/commit/1589b5

>Kolors ipadapter FaceID Plus
https://github.com/Kwai-Kolors/Kolors/tree/master/ipadapter_FaceID

>The EU’s AI Act is now in force
https://techcrunch.com/2024/08/01/the-eus-ai-act-is-now-in-force

>Video game performers picket over AI protections
https://apnews.com/article/sagaftra-strike-video-games-ai-f3f18ad01c5b8f4d525a836aeb531447

>Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs
https://lalbj.github.io/projects/PAI

>Detecting, Explaining, and Mitigating Memorization in Diffusion Models
https://github.com/YuxinWenRick/diffusion_memorization

>Forgedit: Text Guided Image Editing via Learning and Forgetting
https://github.com/witcherofresearch/Forgedit/

>ControlMLLM: Training-Free Visual Prompt Learning for Multimodal LLMs
https://github.com/mrwu-mac/ControlMLLM
>>
>mfw Research news

08/02/2024

>MM-Vet v2: Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities
https://arxiv.org/abs/2408.00765

>Text-Guided Video Masked Autoencoder
https://arxiv.org/abs/2408.00759

>TurboEdit: Text-Based Image Editing Using Few-Step Diffusion Models
https://turboedit-paper.github.io

>SAM 2: Segment Anything in Images and Videos
https://arxiv.org/abs/2408.00714

>MotionFix: Text-Driven 3D Human Motion Editing
https://arxiv.org/abs/2408.00712

>Synthetic dual image generation for reduction of labeling efforts in semantic segmentation of micrographs with a customized metric function
https://arxiv.org/abs/2408.00707

>Scaling Backwards: Minimal Synthetic Pre-training?
https://arxiv.org/abs/2408.00677

>SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement
https://arxiv.org/abs/2408.00653

>Are Bigger Encoders Always Better in VLMs?
https://arxiv.org/abs/2408.00620

>Alleviating Hallucination in Large VLMs with Active Retrieval Augmentation
https://arxiv.org/abs/2408.00555

>Illustrating Classic Brazilian Books using a T2I Diffusion Model
https://arxiv.org/abs/2408.00544

>Reenact Anything: Semantic Video Motion Transfer Using Motion-Textual Inversion
https://arxiv.org/abs/2408.00458

>Towards Reliable Advertising Image Generation Using Human Feedback
https://arxiv.org/abs/2408.00418

>A Simple Background Augmentation Method for Object Detection with Diffusion Model
https://arxiv.org/abs/2408.00350

>ADBM: Adversarial diffusion bridge model for reliable adversarial purification
https://arxiv.org/abs/2408.00315

>Navigating T2I Generative Bias across Indic Languages
https://arxiv.org/abs/2408.00283

>WAS: Dataset and Methods for Artistic Text Segmentation
https://arxiv.org/abs/2408.00106

>Replication in Visual Diffusion Models: A Survey and Outlook
https://arxiv.org/abs/2408.00001

>EmoTalk3D: High-Fidelity Free-View Synthesis of Emotional 3D Talking Head
https://arxiv.org/abs/2408.00297
>>
File: tmp31i_j5aj.png (787 KB, 768x768)
787 KB
787 KB PNG
>>
File: delux_mtg_00025_.png (1.74 MB, 896x1152)
1.74 MB
1.74 MB PNG
>>
File: PW_78963_.png (711 KB, 1024x1024)
711 KB
711 KB PNG
Good evening, anons! I hope everyone is doing well :]
>>
File: delux_mtg_00027_.png (1.4 MB, 896x1152)
1.4 MB
1.4 MB PNG
>>101697908
hello pw, nice to see you. I bet you've been thinking about flux all day
>>
File: tmpxq8rzcqv.png (945 KB, 768x1024)
945 KB
945 KB PNG
>>
>>101697925
Where can you get the actual flux model? I only saw a links for the playground demo
>>
File: PW_79009_.png (1.42 MB, 1024x1280)
1.42 MB
1.42 MB PNG
>>101697925
Heya, Debo!! It's so good to see you!!
I have hahaha! I wanted to get home at the usual time, but today was super busy so I was there for 10 hours LOL
Glad to be home tho! :]
>>
cant think of a good outfit besides maid, or the striped polo shirt and shorts. no idea why i like the striped polo shirt look, it must be imprinted from some girl i saw that i have forgotten
>>
File: delux_mtg_00029_.png (1.79 MB, 896x1152)
1.79 MB
1.79 MB PNG
>>101697966
you can get everything off huggingface
https://huggingface.co/black-forest-labs/FLUX.1-dev
more info here:
https://comfyanonymous.github.io/ComfyUI_examples/flux/
>>
File: BMP_10195.jpg (1.25 MB, 1640x1640)
1.25 MB
1.25 MB JPG
>>101697908
Just scrapped out my first parts today after like 4 months of new jorb because of like 4 chain reactions of unfortunate coincidences, two of them other people's doings. I swear the multiverse was out to get me today. Yeah long days suck though I feel for you.
https://suno.com/song/95f79072-1d57-4c14-8330-73829867c085
>>
File: 1702990897235877.png (999 KB, 1024x1024)
999 KB
999 KB PNG
>>
File: FLUX__00063_.png (830 KB, 1024x768)
830 KB
830 KB PNG
>>
File: BMP_01697__cleanup.png (2.83 MB, 1432x1432)
2.83 MB
2.83 MB PNG
Wtf is Flux btw? Is this the new SD3?
>>
File: 1704375482217570.png (1.09 MB, 1024x1024)
1.09 MB
1.09 MB PNG
the text outputs are pretty neat
>>
File: 1712947453669586.png (1.16 MB, 1024x1024)
1.16 MB
1.16 MB PNG
>>101698114
better:
>>
>>101698038
Thanks, I'll check it out. 23.8 gb, ouch
>>
File: test.webm (337 KB, 1152x832)
337 KB
337 KB WEBM
>>101697908
hello PW! <3 good to see you! I was about to head to bed after I tested some stuff with the conditioning in flux (it started with fixing a dumb push from forever ago)
>>
File: FLUX__00066_.png (1.01 MB, 1024x1024)
1.01 MB
1.01 MB PNG
>>
>>101698173
I saw some really fucking good 8/16 bit gens whatever you want to call it from anons today. I wish I was cool and kept up with tech like you guys.
>>
File: 1701486416400383.png (50 KB, 1024x1024)
50 KB
50 KB PNG
>>101698233
if you make a gen and use the pixelization extension in auto1111 or comfy you can give anything an 8 bit look:
>>
>>101698254
that's not an 8 bit look, that's a pixelized picture
>>
>>101698173
despite the background shifting this is a convincing fighting game background character loop
>>
>Use SDXL
>anything that does an img2img (even face restore) completely shits the bed and freezes it up, making it do nothing and locks the GPU's cuda cores until the computer is reset
the fuck is wrong with this shit?
>>
>>101698264
well you can use it to make nice sprite art, still need to prompt it to look retro though (simple palette, etc)
>>
File: delux_mtg_00031_.png (1.68 MB, 896x1152)
1.68 MB
1.68 MB PNG
>>101698264
here we go with this shit again
>>
Is this thing going to be faster, or do I need to return to GPU renting again?
>>
File: 1719652143810260.png (997 KB, 1024x1024)
997 KB
997 KB PNG
text output is pretty good, and it there is good variety (perspective text, etc)
>>
File: FD_00332_.png (1.31 MB, 1024x1024)
1.31 MB
1.31 MB PNG
>>101698111
It's a different model made by the people who originally made Stable Diffusion. It's like an uncucked SD3.
So instead of being dogshit, it's good.
>>
File: BMP_28709_.png (2.37 MB, 1384x1384)
2.37 MB
2.37 MB PNG
>>101698254
I was messing around with it about a month ago but with just prompts and gimp, stuff I saw today was leagues better though. Hats off to you tech pioneers.
>>
>>101697538
Is anyone here willing to answer questions about making Lora? Im new so I have some questions. Any help would be appreciated


More specific question:

Would a source like this be a good place to get a dataset from? Are the pictures good enough?

https://x.com/ZooeyDonk?t=5NG5SZz9mXcuKcCMM_pqMw&s=09
>>
File: delux_mtg_00032_.png (1.74 MB, 896x1152)
1.74 MB
1.74 MB PNG
>>101698290
its a transformers model so it might be quantizable. there may be an aitemplate implementation for it too. seems like there's room for improvement but who knows how far it'll go
>>
>>101697538
how do I set up flux with ComfyUI?
>>
File: ExAm8-jW8AY_0rS.jpg (873 KB, 1536x2048)
873 KB
873 KB JPG
>>101698283
of course, because "8-bit look" is a VERY specific thing to refer to (same for "16-bit")
>>
File: ComfyUI_01577_.png (829 KB, 1152x896)
829 KB
829 KB PNG
>>101698233
I was doing a lot today admittedly but it's one of the few style prompts we found already that's pretty good. I wonder if anon hit a good spot for Picasso

>>101698268
ty anon

>>101698283
last time was over "what is pixel art" not the specifics of it if I remember corectly

>>101698306
yeah it's all raw dog kek. I could just run it through the script but the frames take forever to get through already
>>
File: ComfyUI_30512_.png (668 KB, 1024x768)
668 KB
668 KB PNG
>>
>>101698341
the example page in the repo
>>
File: delux_mtg_00034_.png (1.49 MB, 896x1152)
1.49 MB
1.49 MB PNG
>>101698341
https://comfyanonymous.github.io/ComfyUI_examples/flux/
>>
File: FD_00343_.png (849 KB, 1024x1024)
849 KB
849 KB PNG
>>101698310
>>
>>101698382
so flux dev requires a bit more involvement than schnell? schnell has a download but dev has a bunch of files. Haven't used Comfy in forever so my apologies, some hand holding would be appreciated
>>
>>101698362
I'm still the lewdest mouse poster though until proven otherwise.
https://files.catbox.moe/a9fl62.png
>>
File: FD_00351_.png (1 MB, 1024x1024)
1 MB
1 MB PNG
>>101698414
No, they are the same thing. Schnell is a "fast" model and gens in 4 steps, but is less accurate. Dev is the main open source model. Both need the same VRAM and setups.
>>
>>101698436
>Dev is the main open source model.
It has restrictive license though.
(also schnell is open weights, not open source, but that's technicalities)
>>
File: 1719344171549391.png (1.25 MB, 1024x1024)
1.25 MB
1.25 MB PNG
one more with same prompt: I like that the text can also be cursive
>>
File: FLUX__00071_.png (1012 KB, 1024x1024)
1012 KB
1012 KB PNG
>>
File: tmpcqqyz1os.png (827 KB, 768x1024)
827 KB
827 KB PNG
>>
File: ComfyUI_01570_.png (1.22 MB, 896x1152)
1.22 MB
1.22 MB PNG
>>101698421
don't tempt me I have to go to bed! (nice gen tho very hot!)

>>101698457
>It has restrictive license though
not really. you just email them if you want to use it commercially. otherwise it's all good to finetune and share for free or keep it to yourself
>>
>>101698436
gotchya

>>101698457
>restrictive license

I don't give much of a crap using this publicly just cute things and lewd things. I'm redownloading comfy now so now I gotta remember how to mess with this damn thing
>>
File: FD_00354_.png (1.03 MB, 1024x1024)
1.03 MB
1.03 MB PNG
>>101698457
>>
File: ComfyUI_30513_.png (935 KB, 1024x768)
935 KB
935 KB PNG
>>
File: FLUX__00072_.png (1.1 MB, 1024x1024)
1.1 MB
1.1 MB PNG
>>
>>101698490
any sense of what training looks like? I saw a few comments saying its "untrainable"
>>
>>101698490
>you just email them if you want to use it commercially
>you MUST request a license from Company, which Company MAY grant to you in Company’s sole discretion and which additional use may be subject to a fee, royalty or other revenue share.

>>101698493
>>101698503
You don't, it's not for you. Finetuners and commercial services do. This clause is a non-starter for most.
>>
File: 00021-906552293.jpg (898 KB, 1560x2064)
898 KB
898 KB JPG
staying up late cette nuit
>>
>>101698518
why is there a portrait of blonde Elon Musk
>>
File: ComfyUI_01630_.png (803 KB, 1152x896)
803 KB
803 KB PNG
>>
File: ComfyUI_30515_.png (926 KB, 1024x768)
926 KB
926 KB PNG
>>
>>101698533
it looks nothing like elon musk
>>
>>101698473
neat, how did you prompt for isometric/pixel art?
>>
Catboxing this because butthole

https://litter.catbox.moe/31few3.png
>>
File: delux_mtg_00036_.png (1.58 MB, 896x1152)
1.58 MB
1.58 MB PNG
>>
>>101698577
adorbs ani. good night
>>
Who here wants to spoonfeed an absolute retard? I want to train my own shit on specific artists works to get results that I like instead of going through places like promptchan to hope I can roll the dice on a style I like. I know exactly nothing about ai generated images though outside of places like that.
>>
>>101698597
N64, apparently
>>
File: tmpv3zsa8lz.png (864 KB, 768x768)
864 KB
864 KB PNG
>>
File: delux_mtg_00039_.png (1.7 MB, 896x1152)
1.7 MB
1.7 MB PNG
>>
File: ComfyUI_30517_.png (875 KB, 1024x768)
875 KB
875 KB PNG
>>
>>101698606
You want to train a style lora, Check the op
>>
while I'm waiting for flux to download, tell me anons is it better or on par than Dall-E 3?
>>
File: ComfyUI_30518_.png (933 KB, 1024x768)
933 KB
933 KB PNG
>>
File: 1701737873211359.png (1.11 MB, 1024x1024)
1.11 MB
1.11 MB PNG
>>101698677
sdxl/ponyXL can make really good characters faster, but this handles certain things a lot better, like text or logos, im new but it seems to have good prompt understanding in general.

I can't do this text with either of those for example. for that i'd have to shoop it.
>>
File: ComfyUI_30519_.png (857 KB, 1024x768)
857 KB
857 KB PNG
>>
File: FD_00040_.png (1.43 MB, 1024x1024)
1.43 MB
1.43 MB PNG
>>101698677
It's not as good as DallE3 overall because it's harder to prompt style on it.
We need some autists to uncuck it a bit then it will exceed DallE3,
>>
File: 1708101448030454.png (1018 KB, 1024x1024)
1018 KB
1018 KB PNG
>>
>>101698724
>>101698742
so two cucked slightly better models? or is flux decent and do lewds and nudes?
>>
File: tmp1k4tstyi.png (749 KB, 768x768)
749 KB
749 KB PNG
>>
File: delux_mtg_00044_.png (1.89 MB, 896x1152)
1.89 MB
1.89 MB PNG
>>101698677
prompt adherence is superior to sd but not quite at dalle level. does a great job with scene construction and text. will be interesting to see whether community contributors can progress it further, and by how much
>>
so this new flux thing... it's only for the 4090 fags yeah? or worse, hosted a100 pay piggies?
>>
>>101698677
Dall-E 3 feels less restricted than flux pro at prompt following because the dataset is less pozzed, but has terrible quality compared to flux pro.
smaller open-weights flux versions are less coherent than pro
>>
is there a way to basically do: upscale x1.25 (or whatever) at .75 (or whatever) denoise, then taking the output and repeating the upscale process, but using like .60, then repeating etc like 5 or six times--- in one button press, or msybe there is an extension that does this, instead of manually doing all this
>>
File: FD_00140_.png (1.14 MB, 1024x1024)
1.14 MB
1.14 MB PNG
>>101698778
it can't nudes. But, it knows where nipples go, unlike SD3.
>>
>>101698778
it's already fun to play with and apparently people got img2img working with it as well, so it will get better and better as more people use it
>>
>>101698816
comfy can do all that. won't be a single click (it will, in fact be a few thousand clicks to build the stupid fucking thing), but you can surely orchestrate all that nonsense.
>>
>>101698742
>>101698790
it mainly needs controlnets and zero-shot adapters to augment prompt following, as this level is good enough when combined with controlnets
>>
>>101698778
It's vastly better, not slightly. Finetunes and tooling (controlnets) will make it a beast.
>>
File: delux_mtg_00050_.png (1.7 MB, 896x1152)
1.7 MB
1.7 MB PNG
>>101698803
I'm running it on 12gb. seen people say it can fit on 8gb cards too. its kind of limiting but its not off-limits

>>101698846
I'm sure we'll see stuff like that coming out pretty quickly. kolors got ipadapter really fast and there wasn't even more hype around that model
>>
File: 00036-1668210438.jpg (160 KB, 1024x1536)
160 KB
160 KB JPG
>>101698842
darn, reckon ill keep doing it the oldfashioned way
>>
File: de_fl_00103_.jpg (796 KB, 1344x960)
796 KB
796 KB JPG
>>
>>101698881
doing weird things with comfy becomes more palatable with extensions like Workspace Manager and some others stuff. at least that way you can switch between workflows and keep clipspace. it's still a huge pain in the ballsack, don't get me wrong
>>
>>101698902
is that suppose to be President Brat? or is the God Emperor just beating up on random females (as well he should)?
>>
File: ComfyUI_30523_.png (659 KB, 1024x768)
659 KB
659 KB PNG
>>
File: PW_79226_.png (1.73 MB, 1440x1024)
1.73 MB
1.73 MB PNG
>>101698173
Animanon!! Hello!! Sorry I ran off for a bit hahaha!
Cute! Are you doing animation with Flux?
It's so great to see you! Flux is so fun hahah
>>101698051
Ughh yea!! I was sposed to get out at 3 but got out at 8 LOL
>>
File: de_fl_00104_.jpg (1.08 MB, 1344x960)
1.08 MB
1.08 MB JPG
>>101698911
>kamala harris punches donald trump through the tournament floor at the tenkaichi budokai, dutch angle anime style with a sense of motion and impact, in the style of dragon by drawn by akira toriyama, kamala harris is wearing a in a pantsuit
>>
File: 57637.jpg (914 KB, 1440x3120)
914 KB
914 KB JPG
>>101698943
should be "kamala harris, on her third bar of xanax, attempting to speak standard english. colorized, 2024"
>>
File: 1715743234306421.png (1.3 MB, 1024x1024)
1.3 MB
1.3 MB PNG
neat, can do styles too. I copied a pixar prompt I found on lebbit.

>Display the title "The Adventure of Hatsune Miku" in bold and playful text at the top or center of the poster. Depict a dynamic and charismatic Hatsune Miku in a heroic pose. Include a colorful and exciting backdrop with elements like a electronic music concert with neon lights to hint at various adventures. Ensure the background is vibrant and engaging. Incorporate the Pixar logo at the bottom or top of the poster to establish it as an official Pixar movie. Include a tagline that reads: "a vocaloid adventure" prominently on the poster. Ensure the overall visual style is consistent with Pixar’s signature animation look – bright colors, expressive characters, and a touch of whimsy.
>>
File: ComfyUI_30524_.png (667 KB, 1024x768)
667 KB
667 KB PNG
>>
>>101698967
cute, how did you get that size? chibi prompt?
>>
File: de_fl_00107_.jpg (1.08 MB, 1344x960)
1.08 MB
1.08 MB JPG
I think goku is a trump supporter. I didn't even prompt for him but he jumped into the fight all on his own

>>101698958
kamala isn't cool enough to do drugs.
>>
>>101698975
>group of little cute kawaii loli anime girls having a tea party with plush animals, ribbons, many cute objects
>>
>>101698987
it's xanax or booze. my money is on big pharma. i believe she isn't a natural politician (i.e., an effortlessly gregarious, glad handing extrovert alpha) so she smooths things out with xannies but sometimes takes one too many and you end up with weird rants about coconut trees and whatnot.
>>
>>101698990
at first I thought it was one of the genshin impact chibi art pieces, looks just like it
>>
>>101699013
in otherwords, the perfect president. the deep state couldn't ask for more
>>
File: 1697334653822517.png (1.2 MB, 1024x1024)
1.2 MB
1.2 MB PNG
lmao, it's like uncensored dall-e 3
>>
File: 57638.jpg (245 KB, 1440x3120)
245 KB
245 KB JPG
i wake up in the bed you made,
the one where you're supposed to lay with me
>>
File: 1707440728297512.png (1.22 MB, 1024x1024)
1.22 MB
1.22 MB PNG
>>101699032
>>
File: delux_mtg_00051_.png (1.55 MB, 896x1152)
1.55 MB
1.55 MB PNG
>>101699013
coconut trees was bars for sure but not xanax
>>
>>
File: 1712743939893514.png (1.24 MB, 1024x1024)
1.24 MB
1.24 MB PNG
>>
File: FD_00430_.png (1.31 MB, 1024x1024)
1.31 MB
1.31 MB PNG
>>101699047
>>
File: 57639.jpg (400 KB, 1440x3120)
400 KB
400 KB JPG
>>101699074
that makes it worse, ya know? it implies she's just naturally insane. which, i suppose, could be true. i'm tempted to vote kamala, i think she very well could be the death of the Regime.

on the heels of the great steal for Joe "Most popular votes ever" "but also pudding brain" Biden I don't think there's any way the Regime can maintain legitimacy with a Harris win. She'll do more and more insane things to try and prove her mandate and it will all slip through their fingers.

>"let's finally find out if the 2a crowd will chimp out when we seize their guns or not"
LFG
>>
Revisiting some gens from pixart/etc, flux is pretty nice so far
>>
>>101699107
>>101699135
is this shit Dalle-3 or flux? it looks like standard dalle slop. ancestor cry.
>>
>>
File: 1710122655153894.png (1.33 MB, 1024x1024)
1.33 MB
1.33 MB PNG
>>101699150
I am learning the ways of flux, so far i'm very impressed, still think sdxl/ponyxl makes the best characters faster, but this is good: this is day 1 btw.
>>
File: delux_mtg_00052_.png (1.5 MB, 896x1152)
1.5 MB
1.5 MB PNG
>>101699143
you're so deep in the rabbit hole that you're not even gonna see any bunny girls down there
>>
File: FD_00436_.png (1.26 MB, 1024x1024)
1.26 MB
1.26 MB PNG
Prompt: Donald Trump having a rap battle with Adolf Hitler
>>
>>
File: delux_mtg_00054_.png (1.55 MB, 896x1152)
1.55 MB
1.55 MB PNG
>>101699179
this is deep
>>
File: 1707862464148662.png (1 MB, 1024x1024)
1 MB
1 MB PNG
this is a goldmine, the story of kamala: (will iterate again for better text)
>>
File: 57640.jpg (1015 KB, 1440x3120)
1015 KB
1015 KB JPG
>>101699177
>no bunny girls
that's fine, i've moved on to cat girls. fingers crossed for lots of chubby cat girls in the AGI matrix.

>>101699172
so flux is just open source dalle. let the slopfest commence!
>>
>>
File: sample (1).jpg (451 KB, 1024x1280)
451 KB
451 KB JPG
>>101699181
interesting
>>
>>101699211
ok swap "pixar" and whatever other embarassing shit you have in that prompt for "by Piet Mondrian,H R Giger,Kazimir Malevich,Mark Rothko"
>>
>>
>>101699225
no u
>>
>>101699225
to be specific, the pixar look is ass, and you need to stop.
>>
File: 1856687854.jpg (156 KB, 1536x1536)
156 KB
156 KB JPG
>>
File: 1710502259722089.png (987 KB, 1024x1024)
987 KB
987 KB PNG
>>101699225
ok this time the text came out perfect. now ill try that.
>>
>>101699235
*going down in 2024
kek
>>
File: FLUX__00092_.png (1.28 MB, 1024x1024)
1.28 MB
1.28 MB PNG
>>
File: fs_0010.jpg (73 KB, 1280x424)
73 KB
73 KB JPG
got unlazy and fixed my output naming
>>
File: fs_0014.jpg (120 KB, 1280x800)
120 KB
120 KB JPG
>>
File: fs_0022.jpg (148 KB, 800x1280)
148 KB
148 KB JPG
>>
File: 1714870701850459.png (1.18 MB, 1024x1024)
1.18 MB
1.18 MB PNG
yep, it's open source DALL-E
>>
File: fs_0048.jpg (183 KB, 1280x568)
183 KB
183 KB JPG
This one had a lot of WIP, was trying to get a rabbit sneaking into a castle sewer, pixart wasn't having it back then

> a somber morbid gritty extremely detailed b&w pencil, by Kentaro Miura,
long distance landscape shot,
a rabbit knight in armor from far away sneaking into a broken grate of a castle sewer,
evil castle walls, sewer grates draining into a castle moat,
>>
File: fs_0064.jpg (232 KB, 1024x1024)
232 KB
232 KB JPG
>>
File: 57641.png (3.48 MB, 1440x3120)
3.48 MB
3.48 MB PNG
>>101699254
can this thing not do non-square image sizes or what? 1:1 is the worst there is. it's 100% shit, in each and every case. squares are never, ever good.

>>101699291
god help us. please do anything creative with it and not just pure dalle slop! I"M BEGGING YOU... LITERALLY ON MY KNEES
>>
File: tmp8a7pjjlm.png (1019 KB, 768x1024)
1019 KB
1019 KB PNG
>>
>>101699342
I don't have the vram to up the resolution
and I'll make what I want
>>
>>101699342
I know, but you have to test the obvious stuff first, then get more creative.
>>
File: ComfyUI_30530_.png (1.17 MB, 1280x768)
1.17 MB
1.17 MB PNG
>>
File: 1707179324045166.png (1.38 MB, 1024x1024)
1.38 MB
1.38 MB PNG
told it to use japanese manga style at the end of the prompt:
>>
>>101699355
yeah and what you want is shit. enjoy the scat, anon.
>>
File: fs_0100.jpg (383 KB, 1280x1000)
383 KB
383 KB JPG
> a morbid dark fantasy extremely detailed black and white pencil sketch by Kentaro Miura, black and white b&w, an extreme macro closeup of a rabbit knight's eye, extreme closeup of rabbit eye reflecting a horde of demons
>>
>>101699172
How much vram do you have? I ran out of memory.
>>
>>101699342
>can this thing not do non-square image sizes or what?
yes it can, look at all the images debo posted
>>
>>101699369
beg some more
>>
>>101699374
you want to use the fp8 clip file, only a 4090 can use the 16, but you still get good results. I have a 4080 but anything 12gb and up will work fine.
>>
File: tmpdst942yi.png (1.1 MB, 768x1024)
1.1 MB
1.1 MB PNG
>>
File: 57642.png (3.51 MB, 1440x3120)
3.51 MB
3.51 MB PNG
>>101699378
i'm down on my knees! please anon, i beg of you, make a single good gen. i believe in you!
>>
File: 1699300244988019.png (1.24 MB, 1024x1024)
1.24 MB
1.24 MB PNG
this is pretty funny, you can prompt basically anything and any style, so it's essentially open source dall-e in terms of what it can/can't do.
>>
>>101699441
it does tits too
>>
>>101699441
dalle is shit. this cartoony crap is pure AIDS.
>>
we are all pozzed by the dalle cartoon pixar faggot on this blessed day!
>>
>>101699451
it's only cartoony cause I specifically said "pixar animation style." you can make cool shit like >>101699373 too
>>
>>101699466
ok, then you can simply stop! it sounds crazy, but it's true!
>>
File: fs_0140.jpg (94 KB, 840x1280)
94 KB
94 KB JPG
>>
take the dalle slop to /v/, but make sure is princess peach with large breasts suckling the star bitch from the GC mario or whatever. they'll eat it up
>>
File: 1707136048956920.png (1.02 MB, 1024x1024)
1.02 MB
1.02 MB PNG
>>101699475
it also knows trigger discipline!
>>
File: F_00001_.jpg (188 KB, 1024x1024)
188 KB
188 KB JPG
>>101699387
I only have 10gb, but got it to work. Had to reinstall comfy, there was some node causing an error
>>
>>101699481
sorry, the star bitch is from galaxy, on whatever faggot nintendo system that was. i refuse to keep track
>>
>>101699486
better. squares are still awful
>>
File: FluxyUI__00001_.png (1.24 MB, 960x1088)
1.24 MB
1.24 MB PNG
>>
File: 00052-2804180620.jpg (832 KB, 1560x2064)
832 KB
832 KB JPG
>>
>>101699507
you fags do realize 1024x1024 can be restated in a different aspect ratio, right?
>>
File: 1719327459875966.png (1.34 MB, 1024x1024)
1.34 MB
1.34 MB PNG
neat, prompted manga panels and got kanji/katakana (I cant read japanese)
>>
>>101699518
spoonfeed
>>
>>101699517
where the fuck did you get a 4090? don't tell me they give them away at the insane asylum? if so, sign me the fuck up!
>>
File: FLUX__00099_.png (633 KB, 896x546)
633 KB
633 KB PNG
>>
>>101699524
no. the math isn't even remotely difficult. hell, just ask ChatGPT if your brain is too small
>>
>>101699527
silly billy, i have a mere 1080
>>
File: F_00002_.jpg (172 KB, 1024x1024)
172 KB
172 KB JPG
>>101699488
Rosalina. It couldn't make her, only Peach.
>>
>>101699547
oh i guess i got carried away with assuming everything was richbitch flux dalle sloppa. my bad.
>>
File: 1715773743013728.png (1.35 MB, 1024x1024)
1.35 MB
1.35 MB PNG
>>
>>101699549
that's /v/ worthy.
>>
File: FluxyUI__00002_.png (1.36 MB, 1024x1024)
1.36 MB
1.36 MB PNG
>>101699534
>t. nogen
okay, retard
>>
File: 1708571521785573.png (1.5 MB, 1024x1024)
1.5 MB
1.5 MB PNG
>>
so the schnell model apparantly makes good results in 5 steps? have people compared the two? im using the default dev model.
>>
File: FLUX__00100_.png (932 KB, 1024x1024)
932 KB
932 KB PNG
>>
File: 1695202137905061.png (1.16 MB, 1024x1024)
1.16 MB
1.16 MB PNG
the text working in gens opens up so many possibilities, in SDXL/ponyXL you have to inpaint or shop text in, normally.
>>
File: fs_0306.jpg (90 KB, 1280x776)
90 KB
90 KB JPG
>>
>>101699641
sd3 was/is good at text, shame everything else sucks
>>
File: 1692667596764792.png (1.18 MB, 1024x1024)
1.18 MB
1.18 MB PNG
>>101699590
my god, the possibilities are endless.
>>
File: fs_0332.jpg (87 KB, 1280x832)
87 KB
87 KB JPG
>>
File: FLUX__00101_.png (1.17 MB, 896x1152)
1.17 MB
1.17 MB PNG
>>
File: FluxyUI__00003_.png (948 KB, 1024x1024)
948 KB
948 KB PNG
goo night
>>
>>101699702
cool robot, this model does really well with details
>>
>>101699476
underrated
>>
File: 1702624976462523.png (829 KB, 1024x1024)
829 KB
829 KB PNG
>>
File: 1693353719543193.png (1.37 MB, 1024x1024)
1.37 MB
1.37 MB PNG
>a large congregation of chibi anime foxgirls
did not expect that many
>>
File: FLUX__00106_.png (1.11 MB, 896x1152)
1.11 MB
1.11 MB PNG
>>101699517
didn't mean to, but I stole your shtick
>>
File: 1716674567115558.png (1.37 MB, 1024x1024)
1.37 MB
1.37 MB PNG
>>101699796
I specified a smaller number to better results:
>>
File: FD_00386_.png (1.44 MB, 1024x1024)
1.44 MB
1.44 MB PNG
>>
File: Flux_00126_.png (1.03 MB, 1024x1024)
1.03 MB
1.03 MB PNG
>>
>>101699796
That's a lot of chibi foxgirls. Kind of scary.
>>
File: 1704835329234044.png (883 KB, 1024x1024)
883 KB
883 KB PNG
>>
>>101699519
from what I can tell it's mostly gibberish, but at least most of the glyphs are legit
>>
File: 1693472186085361.png (869 KB, 1024x1024)
869 KB
869 KB PNG
>Donald Trump in the style of Dragonball Z, dressed as Goku, manga artstyle
>>
File: FLUX__00109_.png (1.09 MB, 896x1152)
1.09 MB
1.09 MB PNG
>>
File: ComfyUI_01170_.png (1.55 MB, 1080x1344)
1.55 MB
1.55 MB PNG
>>
reminder the schnell model can give good results with even 4 steps, dev model with 20 steps is better but more time of course.
>>
File: fs_0643.jpg (82 KB, 1608x456)
82 KB
82 KB JPG
>>
File: PW_79359_.png (1.41 MB, 1024x1280)
1.41 MB
1.41 MB PNG
>>
>>101700105
creepy
>>
File: 00066-3419725008.jpg (680 KB, 2328x1560)
680 KB
680 KB JPG
>>101699830
neat
>>101700105
nice
>>
File: PW_79367_.png (1.62 MB, 1024x1280)
1.62 MB
1.62 MB PNG
>>101700114
Thanks! :]
>>101700132
Thanks so much! :D
Yours looks really cool too!
>>
File: FLUX__00117_.png (907 KB, 896x1152)
907 KB
907 KB PNG
>>
File: F_00007_.jpg (356 KB, 1024x1024)
356 KB
356 KB JPG
>>101700149
very cute
>>
>>101699517
based cuneiform poster
>>
File: 1720009411421846.png (1.27 MB, 1024x1024)
1.27 MB
1.27 MB PNG
>>
File: PW_79386_.png (1.34 MB, 1024x1280)
1.34 MB
1.34 MB PNG
>>101700247
Thanks so much, anon! :]
Yours looks really cool!
>>
File: FD_00456_.png (1.02 MB, 1024x1024)
1.02 MB
1.02 MB PNG
>>
File: 1694587467270103.png (1.27 MB, 1024x1024)
1.27 MB
1.27 MB PNG
>dalle wont let you do this
open source always wins, baby
>>
File: 2024-08-03_00030_.png (2.05 MB, 1536x1536)
2.05 MB
2.05 MB PNG
can't believe that FLUX is the first software tech in decades that actually is making waves internationally that came out of Germany ..
>>
File: F_00009_.png (1.52 MB, 1024x1024)
1.52 MB
1.52 MB PNG
When I said plants growing inside helmet, I was thinking at the bottom. But okay.
>>
Okay, but where controlnets and pony fine tune?
>>
>>101700371
https://civitai.com/models/618792/nepotism-fux
>>
>>101700352
nipple access holes for when I get thirsty
>>
File: 1701035793905556.png (222 KB, 1228x1118)
222 KB
222 KB PNG
>>101700371
it's day 1, patience anon

also these settings seem to work pretty well, seems faster after changing weight_dtype
>>
File: 1721814193566882.png (1.4 MB, 1024x1024)
1.4 MB
1.4 MB PNG
>>101700340
and a more cheerful sequel, for contrast:
>>
File: 2024-08-03_00032_.png (2.25 MB, 1536x1536)
2.25 MB
2.25 MB PNG
>>101700378
they heck how did they do this? downloading
>>
>>101700378
huh... does this imply controlnets might go over as well?
>>
>>101700398
I'm not sure they did, but feel free to try
>>
File: FD_00462_.png (1.45 MB, 1024x1024)
1.45 MB
1.45 MB PNG
>>
>>101700398
looks like an attempt at a shitmix based on a comment
>The merge is effectively flux's unet on top of the NepotismXL models data, it did not retain its original architecture, the only similarity with its SDXL/PONY counterpart is the dataset."
this reply made me jej
>flux doesn't even have a unet. this model merge does nothing. please there's still time left to delete it.
>>
>>101700378
poster in comments claims the XL model didn't actually merge
>flux doesn't even have a unet. this model merge does nothing. please there's still time left to delete it.
>>
File: fs_1015.jpg (65 KB, 1024x784)
65 KB
65 KB JPG
>>
>>101700436
I thought as much.. the model architecture is different.. FLUX is a DiT model ..
>>
File: PW_79389_.png (1.19 MB, 1024x1280)
1.19 MB
1.19 MB PNG
New Thread!!!
>>101700396
>>101700396
>>101700396
>>
File: image(262).png (652 KB, 1024x1024)
652 KB
652 KB PNG
first scribble
>>
File: PW_79387_.png (1.51 MB, 1024x1280)
1.51 MB
1.51 MB PNG
>>
File: FLUX_SCHNELL_00002_.png (1.41 MB, 1024x1024)
1.41 MB
1.41 MB PNG
>>101699796
they even have their own red fox ears symbol how cute
>>
File: PW_79388_.png (1.4 MB, 1024x1280)
1.4 MB
1.4 MB PNG
>>
>>101700492
nice
>>
>>101700458
>no other images found
wtf is this ai?
>>
I was once using DALLE on bing and it hallucinated a green skyrim-esque giant tearing a guy apart. Can any chinks who have actually studied AI/ML explain how this happens?



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.