[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107646172

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2485296
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
File: 04smpb.jpg (53 KB, 1048x654)
53 KB
53 KB JPG
>>107648831
>>Maintain Thread Quality
>https://rentry.org/debo
>https://rentry.org/animanon
is there a reason why these are still in the OP?
https://rentry.org/ranfaggot
>>
File: 1752342696048166.png (868 KB, 2384x1157)
868 KB
868 KB PNG
For those trying the new QiE, the workflow is kinda different to the old ones, you have this new
"Edit Model Reference Method" node thing to add
>>
>>107648853
I have to pull?
*shudders
>>
>>107648847
cranky Internet troon bakes the threads
>>
>>107648847
lmfao
https://rentry.org/ranfaggot#the-poll-melty
>>
>>107648847
Ani do you not have any family? it's christmas time and you're spending 16 hours a day trying to derail ldg threads. it's kind of sad desu i hope you get help
>>
>>107648864
try to see if you can find that node with your current commit first I guess
>>
you botted the poll LOL
>>
>>107648853
>20 steps
Doesn't it need like 50?
>>
File: 1765814453579586.png (1.13 MB, 1216x832)
1.13 MB
1.13 MB PNG
>you botted the poll LOL
>>
>>107648907
you can get away with 20 but if it doesn't work try to increase yeah
>>
Blessed thread of frenship
>>
File: 2511.jpg (2.01 MB, 8978x1844)
2.01 MB
2.01 MB JPG
>He cannot run QiE 25/11 even though it works with exactly the same workflows as previous versions of QiE.
>This smells like a serious skill issue. I wouldn't want to see the opinions on the quality of a model by such a retard desu.

>can't troubleshot a basic QiE workflow
>still thinks he's not humiliating himself by showing his noskill to the public
>must be nice being this dumb and having this lack of self awarness desu

>you dont know what you're doing. I can use Qwen 511 Q8 just fine. Literally just replace 2509 with 2511, what the fuck?

>just swap 2509 with 2511

>you don't need to change anything anon, just change the 2509 modeo to the 2511 model and you're good to go

Hahahahaha
>>
>>107648896
The fact that he responded to this without any actual justification for why all the yes votes were from france means you're right keeeeek
He will continue to seethe
>>
>>107648949
let's hate ran together!!!! :D
>>
>>107648951
tl;dr
>>
just give him what he wants so what if the rentries get removed
>>
>>107648831
Thank you for baking this thread, anon
>>107648949
Thank you for blessing this thread, anon
>>
>>107648969
that isn't what tran wants and then she will be seen as a weak pussy if she doesn't kick and scream to keep them in
>>
>>107648984
how many trannies are involved in this bs
>>
>>107648853
How do you do those wiring cables, I hate the formless Comfy spaghetti
>>
>>107648951
>let's pretend it was about this and not about the fact I got noise on the regular workflow and I couldn't troubleshot it
revisionism is a hell of a drug
>>
>>107648831
>>107648949
>>107648980
the samefagging is getting annoying. I remember you were trying to pin ani for thanking himself for baking and yet here you are doing something you think is bad
>>
File: hmm?.png (157 KB, 250x350)
157 KB
157 KB PNG
>>107648969
>just give him what he wants
Do you think you deserve that? Have you been a good boy this last month?
>>
>>107648996
>>107647689
>I find this really hilarious. You retards don't even know about the node you're supposed to pipe your conditioning through. You just tried switching the model, saw that it could generate a vaguely coherent image, and decided that you were doing everything correct.
2 hours 11 minutes ago btw. I could have spared you all the trouble :^)
>>
File: 3d_vs_25d.jpg (1.7 MB, 2816x2788)
1.7 MB
1.7 MB JPG
Man, 3dpd is so ugly compared to godly 2.5d. Also cloud vs local.
>>
>>107649024
We didn't tell you the fix to get rid of the noise, you didn't give us the conditioning solution, looks like it's a draw... but we finally fixed our problem, did you fix yours anon? the lack of images suggest that you're still having trouble with it, sucks to be you I guess
>>
>>107649041
I did fix it but I'm training in ai-toolkit right now so I can't gen
>>
>>107649053
Sure.
>>
the problem with chinks making models is the lack of body shape
its all - stick figure
low test people shouldn't be allowed to make models
>>
>>107649040
Eww socks...
>>
>>107648853
had to go to reddit to find out that the name of that node changed since i couldnt find it, /g/ yet again two steps behind.
>>
>>107649092
im actually considering going to reddit for this since it's such a shithole here
>>
>>107648831
What model was used for that pic at the bottom? I can never get good 3d
>>
fugg
latest comfyui updated pytorch and now I can't run inference on my gtx 1080 because it requires a later version of cuda that my gpu doesn't support

guess I'm gonna have to manually downgrade
>>
>>107649125
DRAGGED AND SHOT
>>
File: file.png (350 KB, 519x368)
350 KB
350 KB PNG
>>107649125
>he pulled
>>
>>107649125
1000 series cards are getting butt fucked right now by nvidia. glad I got a 3060
>>
>>107649146
the 30 series already starting to feel outdated, but i can't justify paying 3k dollars to go from 24gb to 32gb vram
>>
>>107649146
it's only a matter of time before the 30xx series has the same fate
>>
>>107649073
At least use them before saying shit that's completely wrong
>>
>>107649167
yeah in like 7 years. I'm not really worried about that, the entire market will look different by then.
>>
>>107648853
it's still zooming in the image, even with that new workflow, I thought they said they fixed it
>>
>>107649125
god fucking damn and all this time I've been holding out for the RTX 5000 series super cards just to find out they're delayed til at least Q3 next year

the VRAM jew has me by my balls
>>
>>107649184
I envy your optimism lol
>>
File: 1751609199670929.png (915 KB, 992x1048)
915 KB
915 KB PNG
The anime girl is holding dual silver pistols.

2511 with new lightning lora, but with 8 steps. worked better than 2509 which had pistols with the melee weapon.
>>
>>107649184
what market? you will own nothing and be happy
>>
File: 1740429608426743.png (926 KB, 992x1048)
926 KB
926 KB PNG
>>107649226
>>
>>107649202
Mitigated it, not fixed I think. Doubt it's possible to fix completely.
>>
File: 1750841432849012.png (1.14 MB, 992x1048)
1.14 MB
1.14 MB PNG
>>107649239
The anime girl is sitting at a table eating a Mcdonalds cheeseburger.

cute!
>>
>>107649169
gen me a thick momma in z-image
ill wait
>>
>>107649242
>Doubt it's possible to fix completely.
Kontext dev never had this issue, I guess they have to find what was their secret
>>
File: 1736814814953587.png (1.24 MB, 992x1048)
1.24 MB
1.24 MB PNG
make a realistic version of the anime girl as a photorealistic japanese woman.
>>
Weird. Onetrainer gave me black screens and cuda errors for a while now, and now I've disabled autocast cache and everything seems to be normal again, and training speed doesn't seem any different even it's supposed to be slower with autocast cache diabled.
>>
>>107649261
>Kontext dev never had this issue
yes it did...
>>
>>107649223
people shat themselves over the gpu shortage and we bounced right back. the universe will heal we just need to get rid of every indian ceo.

>>107649228
nothing ever happens or you will own nothing. if it's the latter I'm gonna blow something up
>>
>>107649302
no it didn't...
>>
>>107649293
Torch compile a shit, big shit
>>
https://github.com/huggingface/diffusers/pull/12857#pullrequestreview-3608898033
best case scenario this gets merged tommorow and we'll get base on christmas
>>
>>107649308
>we bounced right back
wut. you couldn't buy GPUs for years and the only thing that changed was that they were no longer needed for crypto, i think
>>
File: 1f7-3005716917.jpg (53 KB, 685x567)
53 KB
53 KB JPG
>>107649319
I'm a techlet and have no fucking idea what I'm doing but I'm using exactly the same setup as in the past otherwise so I assume that autocast cache thing has some kind of bug that was introduced with a recent update
>>
File: ComfyUI_Image_00037_.jpg (379 KB, 1456x1840)
379 KB
379 KB JPG
>>107649259
>>
File: 1745049047979965.png (1.15 MB, 824x1264)
1.15 MB
1.15 MB PNG
replace the police officer in blue kneeling on a black man in image1 with the anime girl in image2. keep her expression the same.

kino! it's even better than before.
>>
>>107649125
Your days are numbered, gpulet
>>
>>107649331
>you couldn't buy GPUs for years and the only thing that changed was that they were no longer needed for crypto, i think
a few years isn't that bad. the bubble popped. once all these major corporations finish stocking up on their infinity ram demand will stabilize. I give it 2-3 years.
>>
>>107649377
also, note it was just a cropped face of OG miku. but it still works well.
>>
File: 1744489295196354.png (928 KB, 824x1264)
928 KB
928 KB PNG
the black man lying face down on the floor is holding a sign saying "#1 fent user of the year"
>>
File: 1746375203762066.png (1.96 MB, 1678x1024)
1.96 MB
1.96 MB PNG
The face consistency is better but it's still the same old QiE, slopped ass model unfortunately
>>
>>107648853
anyone worked out how to get the fp8 version working? It just does static for me
>>
>>107649464
try that one anon
https://huggingface.co/xms991/Qwen-Image-Edit-2511-fp8-e4m3fn/tree/main
>>
File: wan_00002.mp4 (1.48 MB, 880x576)
1.48 MB
1.48 MB MP4
new Nolan movie looks fucking wild
>>
>>107649498
looks more like Zack Snyder desu
>>
File: 1754941177091547.png (1.96 MB, 1895x864)
1.96 MB
1.96 MB PNG
>Image 1 must have the same style as image 2.
it's not bad, like it understood it has to be 3d and cell shaded
>>
>>107649125
There's no reason for this to happen lol, why are you replacing everything like that.
>>
File: 1762884817562297.png (2.56 MB, 2066x768)
2.56 MB
2.56 MB PNG
>>107649529
>>
>>107649355
That's too skinny. She looks like she's been starving for months.
>>
>>107649137
Git pulling Comfy doesn't add or update Python packages
>>
>>107649437
workflow? lora or no lora, and how many steps?
>>
>>107649498
lore accurate

>>107649544
>>107649529
kino
>>
Qwen Image Edit..??
*inhales*
THIS LOOKS LIKE THE PERFECT BASE FOR FINETUNING!!!!!!!!!!! TWOOO MORE WEEKS AND THE PLASTIC WILL BE GONE!
>>
>>107649437
fix the prompt, say "eating plates of tube shaped mud covered in sticky liquid"
>>
>>107649558
>workflow?
https://files.catbox.moe/psq302.json
lora and 4 steps
https://huggingface.co/lightx2v/Qwen-Image-Edit-2511-Lightning
>>
ho ho ho

here comes gennerclaus

who wants some nice SONGBLOOM xmas music?
>>
>>107649572
should the q8 work with the lora? or does it work with the fp8 one only?
>>
>>107649562
>TWOOO MORE WEEKS AND THE PLASTIC WILL BE GONE!
It's more likely we'll get Z-image edit than them unslopping QiE lol
>>107649616
the lora works with everything, Q8, fp8, bf16...
>>
>>107649612
i want some, post pls
>>
>>107649437
Just upscale with Z
>>
>>107649562
THE FINETOOONER
>>
>>107649572
is there a multi image workflow yet? or they need a new node I guess?
>>
File: 1762922477730562.png (1.4 MB, 1000x1048)
1.4 MB
1.4 MB PNG
sheeeeit
>>
>>107649687
why are you obsessed with this nigger? it's really bizarre
>>
File: 32GB RAM face.jpg (170 KB, 900x685)
170 KB
170 KB JPG
crazy how a single image can poison an entire dataset, tried to train a celeb lora with the person coming out looking like a ghoul every time then it finally occured to me that the culprit is a single closeup selfie that made the face look weirdly elongated and gaunt, despite being detailed and of otherwise good quality
>>
>>107649683
take that one
https://files.catbox.moe/r0cqkl.json
>>
https://www.reddit.com/r/comfyui/comments/1pu2wgp/how_to_use_qie_2511_correctly_in_comfyui/

index_timestep_zero

try this fix
>>
>>107649687
he took so much fent his skin got as plastic as his syringue kek
>>
>>107649696
it's a test case, like hello world.
>>
File: 1739001229408851.png (1.43 MB, 1000x1048)
1.43 MB
1.43 MB PNG
>>107649727
with fix/node:
>>
>>107649716
Been there. The worst is when dataset is fine, but there are few incorrect captions that ruin small details
>>
File: 1737803713513049.jpg (1.45 MB, 2016x1152)
1.45 MB
1.45 MB JPG
>>
https://cdn.discordapp.com/attachments/1315755467749986375/1453073924152623207/ComfyUI_278397_.png?ex=694c200c&is=694ace8c&hm=0287fefe5a4146fa73ce1025c09dd3c07853926d480279a29073166822dad83a&

new workflow from comfy discord
>>
>>107649808
dragged and shot
>>
File: 1761112888719155.png (1.37 MB, 1024x1024)
1.37 MB
1.37 MB PNG
>>107649808
that seems to do the trick, and there are some diff shift values and other stuff. with lora:

saint floyd lives. in AI form.
>>
File: 1764082447238819.png (1 MB, 1024x1024)
1 MB
1 MB PNG
the anime girl is holding a christmas present with a label saying "LDG general". she is waving hello.

was holding a snowflake axe before.
>>
File: 1761157470212544.png (1007 KB, 1024x1024)
1007 KB
1007 KB PNG
>>107649873
the anime girl is wearing a Christmas elf costume and is holding a candy cane.

KINO. 4 steps, lora works, new workflow from >>107649808 works.
>>
why is it easier to find workflows randomly on reddit/4chan/etc than through the actual workflow dbs like civtai and openart?
>>
File: wan_00005.mp4 (900 KB, 608x416)
900 KB
900 KB MP4
where's the genned christmas song that anon promised. i'm waiting.
>>
File: 1742228087965601.jpg (180 KB, 1024x1536)
180 KB
180 KB JPG
>>
>>107649696
>>107649687
We went through a whole year of that stupid shit. Return the problem people to their source countries.
>>
>[verse] . Y UW1 B EH1 T ER0 N AA1 T F L AH1 SH . , Y UW1 B EH1 T ER0 N AA1 T W AY1 P . , Y UW1 B EH1 T ER0 N AA1 T S IH1 T AY1 M T EH1 L IH0 NG Y UW1 W AY1 . , S T R IY1 T SH IH1 T S AE1 N T AH0 V IH1 SH N UW0 Z IH0 N T AW1 N . , [chorus] . HH IY1 S IY1 Z Y UW1 W EH1 N Y UH1 R S L IY1 P IH0 NG . , HH IY1 N OW1 Z W EH1 N Y UH1 R AH0 W EY1 K . , HH IY1 N OW1 Z W AH1 T CH AA1 R JH IH0 Z Y UW1 P UH1 T AA1 N DH AH0 K AA1 R D . , HH IY1 R OW1 T DH AH0 S AO1 F T W EH2 R F AO1 R DH AH0 B AE1 NG K . , K ER1 S V IH1 SH N UW0 EH1 V R IY0 B AA2 D IY0 . D AE1 M V IH1 SH N UW0 .'
>>
>>107649941
well in the case of floyd he sent himself back to where he belonged. wish it were that easy!

>>107649954
what in the god damn
>>
>The "Edit Model Reference Method" nodes above are not needed if you use our files but may be needed if you use repackaged ones from other people.
what does that mean?
what repackaged ones from other people?
>>
>>107649932
https://vocaroo.com/17fGZTPxTJw3
>>
>>107649963
PICKLED
>>
>>107649963
>what repackaged ones from other people?
they mean bad quants maybe?

>>107649983
>implying ANYONE is pickling qwen
..right?
>>
File: 1758968253490357.png (913 KB, 744x1392)
913 KB
913 KB PNG
the anime girl is wearing a Christmas elf costume and is holding a candy cane.

okay it works, i've finished my degree in quantum physics and can make it work in comfy.

workflow: https://files.catbox.moe/8x2301.json
>>
>>107649954
mods: this is phonetics, not nonsense :^)

I'll post it no matter how it sounds. I forgot the wf was set to 80 steps.

>>107649932
voila!

https://vocaroo.com/164GoUKeZrZP
>>
>>107649959
>what in the god damn
phonetics. it does that with SongBloom (the software does it without using ai, I guess).

Y UW1
you

B EH1 T ER0
better

N AA1 T
not

F L AH1 SH .
flush.
>>
>>107649808
>discord
catbox for everyone who doesnt want to dirty their browser with opening a trooncord url
https://files.catbox.moe/hwh8vz.png
>>
>>107649716
The reason for your problem isn't that particular image but the way the trainer works. There will be bias towards certain images in the dataset due to the RNG. Increasing the epochs while lowering steps should mitigate the problem.
>>
File: 1763019658033397.png (922 KB, 656x1584)
922 KB
922 KB PNG
>>107649997
the anime girl is wearing a Christmas elf costume and is holding a candy cane.

christmas teto!
>>
File: Terezi_Stand.png (3 KB, 300x300)
3 KB
3 KB PNG
>>107649954
>t.
>>
>>107649963
>what does that mean?
GGUFs need this, that's for sure
>>
File: 1763954719246108.png (111 KB, 423x781)
111 KB
111 KB PNG
>>107650043
I found the link in a thread, didnt join the discord.

also if you want the output to scale to the image use this instead of the 1024x1024 node:
>>
File: 1755642471307107.jpg (281 KB, 1536x1536)
281 KB
281 KB JPG
ah sweet the whole schizo gang is here its a christmas miracle

>>107649941
>Return the problem people to their source countries.
400 years of microevolution for brute strength and low intelligence is your grandpa's fault and now your responsibility. it's like african honey bees but in reverse
>>
>>107649954
>>107650066
I don't get it, I ran it and this is what I got
https://vocaroo.com/1lNK1eIGRqEN
>>
>>107650083
I'm not a propositional person. I want my kind and my God.
>>
File: 1761454507950322.png (1.26 MB, 656x1584)
1.26 MB
1.26 MB PNG
the anime girl in image1 is standing beside the anime girl in image2, and they are wearing a Christmas elf costume and are holding a candy cane.

yay, new workflow works fine.
>>
File: 1752362030131860.png (1.46 MB, 1000x1048)
1.46 MB
1.46 MB PNG
>>
>>107650083
nigger iq in america is 10 points higher than in africa
>>
File: ComfyUI_00001_.png (1.56 MB, 1024x1024)
1.56 MB
1.56 MB PNG
>waited 3 minutes 26 seconds for picrel

gotta say, qwen edit 2511 really isn't much of a step up from 2509.
>>
whats the diff between gguf and safetensors for comfy?
>>
>>107650159
use the lora if you don't want to wait that long
>>107650159
>qwen edit 2511 really isn't much of a step up from 2509.
I feel the improvement but I also feel it's changing the image even more than before, with 2509 you had a good chance of not having some zoom in, with this one it almost never happens, and they said they improved on that my fucking ass lol
>>
>>107649941
Shouldn't have brought them over in the first place but white people are lazy cunts who couldn't do their own work. They're doing the same thing with Mexicans and Indians now and still getting mad at their own actions. Also responsible for turning China into the next world power too.
>>
>>107650174
goofs are quantized (compressed in a way) and smaller in size. there are different quants with Q8 usually being comparable to the original FP16, while the smaller quants like Q6 have degraded quality
>>
>>107650159
workflow needs a fix, try https://files.catbox.moe/hwh8vz.png

pretty sure there will be a new template soon.
>>
>>107649971
>>107650005
pretty cool, thanks. i'm gonna try downloading and running soundbloom
>>
File: 1750295642828066.png (1.11 MB, 824x1264)
1.11 MB
1.11 MB PNG
lmao

the police officer in blue is wearing a santa claus outfit and has a white beard. replace the car on the left with a santa sleigh. there is snow on the road.

merry christmas.
>>
>>107650195
thank
>>
File: ComfyUI_00002_.png (2.95 MB, 1088x1920)
2.95 MB
2.95 MB PNG
>>107650180
i mean you don't try a brand new model with a lightning lora to get an idea of if its better or not kek.
that said >>107650202 Not surprised! There's never good day 1 workflows. takes time to work out kinks. Thanks, ill give it a try.

never forget z-image's day 1 workflow.

>>107650212
lmao
>>
>>107650217
>i mean you don't try a brand new model with a lightning lora to get an idea of if its better or not kek.
I never said I only used the lora, I tried without it too
>>
>>107650212
also this is 2511 with the fixed workflow, and 4 step lora. working better than 2509 with 8 step lora! (so far)
>>
File: 1761851621617401.png (1.08 MB, 824x1264)
1.08 MB
1.08 MB PNG
>>107650212
one more, merry fent-mas.
>>
>>107650256
Don't you mean merry-methmas
>>
File: file.png (26 KB, 631x211)
26 KB
26 KB PNG
>>107650043
>>107650076
>>107649808
I can't find this model anywhere, at least not with this specific name.
>>
File: 1756423500043874.png (15 KB, 567x92)
15 KB
15 KB PNG
>>
>>107650265
just load the one you downloaded....
>>
>>107650276
>....
im a girl and you are bullying me
>>
anyone know where I can find pytorch_model.safetensors?
>>
>>107650265
press r and it'll refresh your list, you'll be able to chose your own file (if you have put it on the folder that is)
>>
>>107650265
swap that with the node for q8 or whatever you downloaded
>>
>>107650290
yeah
>>
>>107650089
Well, first if you use the melbands thing you are using a voice remover. It's pretty good, but it sounds to me like it failed lol.

My experience is the vocal sucks if we don't let it have a voice sample, but idk, I only have had say like... idk <500 gens with it? including lots of fails. maybe 200 gens. The worst part is it seems like there's no preview like vae preview so you can cancel if it's going off the rails.

Here's my wf
https://pastebin.com/Zrvgqard
it's a .json file. And if you prefer, a pic, picrel, rearranged for a screenshot.
NOTE: I am outputing mp3 for ease. pick FLAC for drag and drop functionality. mp3s don't contain the wf.

SongBloom isn't really quite like udio or suno. It's true that it can make a song out of nothing, but it's bad at that. Really, it's a sampler somehow. So, if you want to monetize it, the sample needs to be one you own the *correct* rights to, whatever those are for ai, I have no idea.

Here's the gen from it:
(SongBloom takes varying amounts of time, the previous song took 10 minutes to make. this one took 19 minutes for me)
https://vocaroo.com/1cExekokcZs5

too many words = way worse lol, plus it finally just gives up.
>>
File: 1752998584199589.png (1.17 MB, 1248x832)
1.17 MB
1.17 MB PNG
>>
there are those that gen 1girl and those that want to be the 1girl
>>
>>107650265
https://huggingface.co/unsloth/Qwen-Image-Edit-2511-GGUF/tree/main
https://huggingface.co/lightx2v/Qwen-Image-Edit-2511-Lightning/tree/main

q8 is better than fp8, so download the gguf version.
>>
File: 1753255035492386.png (1.19 MB, 1248x832)
1.19 MB
1.19 MB PNG
>>
File: file.png (1.64 MB, 1024x1024)
1.64 MB
1.64 MB PNG
hmm
>>
>>107650211
>>107650320
I want to add that I haven't genned xy, really don't have a fast enough card. cfg_coef is the trickiest. higher and it sounds less like the source. lower and it will be much more likely to include outright samples of the source. But all the values are strange, and I have a lot to learn about them. Certain settings result in just noise.

IMPORTANT!!!!

[verse]
Songbloom expects.
likes like this.
bla bla bla no commas.
no exclamations etc just like this.

[chorus]
and also you can have.
a bridge and that's really all I think.
but this is de weh.
to do it.

[bridge]
If you add a comma it means like a song structure break thing iirc
>>
>>107650180
what lora? im not on the transcoord
>>
File: ComfyUI_00006_.png (2.73 MB, 1088x1920)
2.73 MB
2.73 MB PNG
>>107650256
MERRY FENTMAS.

>>107650337
i was gonna ask for "source on that opinion" but then the fp8 just spat picrel out at me after running the lightning so, i guess i'll go grab the q8 gguf instead..
>>
File: 1751015156539424.png (853 KB, 744x1392)
853 KB
853 KB PNG
the anime girl in image1 is wearing the outfit of the anime girl in image2.

was just a miku face picture, pretty neat result desu:
>>
File: 1740056091801857.jpg (318 KB, 1536x1536)
318 KB
318 KB JPG
>>107650158
>nigger iq in america is 10 points higher than in africa
now take a look african-african-american IQ versus african-american IQ
oh wait you can't, and that should be a real skullthumper, (unless you have been paying attention since nixon took the US off the gold standard of course)
>>
File: 1747270025058644.png (1.61 MB, 1360x752)
1.61 MB
1.61 MB PNG
I have this weird horizontal line on the bottom of the image Idk if that happens to you as well on QiE 2511
>>
>>107650385
doesnt happen to me.
>>
File: 1745448733076053.png (831 KB, 744x1392)
831 KB
831 KB PNG
>>107650375
bocchi jacket:
>>
>>107650385
Looks like it's cutting off the top part of the image and putting it on the bottom for some reason
>>
File: alu_3.jpg (47 KB, 550x441)
47 KB
47 KB JPG
>>107649226

is fine
now do i get cunny
>>
File: 1755594311030223.png (1.29 MB, 1000x1048)
1.29 MB
1.29 MB PNG
the black man in image1 is holding a baseball bat in the pose of the man in image2.

george...costanza
>>
File: huh.png (718 KB, 609x850)
718 KB
718 KB PNG
>>107650337
>>107650368
Am I missing something?
>>
>>107650426
yea you're retarded
>>
>>107649807
Wizard drip is awesome.
>>
>>107650430
Explain
>>
>>107650426
change the weight_dtype
>>
>>107650426
licherelly just run the workflow linked here multiple times, dont change anything except like your model paths.
>>
File: file.png (413 KB, 1362x719)
413 KB
413 KB PNG
>>107650462
It is the workflow from here

>>107650448
I did and it still shits out noise
>>
>>107650482
turn off --use-sage-attention or --fast if you have them on. One of them fucks with it I forget which.
>>
>>107650426
the rest of the screenshot, show us your 9 year old child prompt anon
>>
>>107650493
the prompt is right there though, get glasses
>>
>>107650482
>downloaded the lightning checkpoint
>doesn't use the lightning settings
*rings the triangle* RETARD ALERT

>>107650491
neither, i use sage attention just fine.
>>
>>107650482
the model you're using has lightx2v baked in, you have to do cfg 1.0 and 8 steps or whatever
>>
>>107650503
>doesn't use the lightning settings
How the hell would I know that, I'm new I asked how to get the model, got a link to the models and told any would do.
>>
so now that cumfart isn't even reliable for implementing new models, what is the goto diffusers UI?
>>
File: uncanny medic.jpg (69 KB, 1242x1186)
69 KB
69 KB JPG
>>107650518
people like you are why videogames have yellow paint in them now (and why us americans weren't allowed to have kinder eggs for a while)
>>
>>107650526
nano banana
>>
>>107650532
is it free and local?
>>
>>107650526
comfyui is still fine.
> inb4 it has been hours and everything doesn't even work pefectly
>>
>>107650537
locally displayed and diffused via API nodes.
>>
>>107650526
ani's usual tactics again
>>
>>107650526
stay tuned for the new UI I'm developing, CozyUI. I even got a mascot already, a brown-haired mouse girl
>>
>>107650548
a different UI implies no API nodes
>>
>>107649634
Curious how that would work. Also it would make more sense to upscale with Chroma.
>>
>chromafags are back now that they're losing attention again
>>
File: 1759173449245297.jpg (468 KB, 1920x1080)
468 KB
468 KB JPG
you're leaving gains on the table if you're not doing collages with Z turbo
>>
>dogshit comfyui cant automatically disable sage if it detect you running qwen image family of models which dont support it and silently output noise image
>>
what's the use case for qwen edit
i make anime porn
>>
>>107650638
making shitty, slopped memes
>>
>>107650616
But sage attention works with Qwen using KJ's node.
>>
File: 1764473539867842.jpg (57 KB, 672x645)
57 KB
57 KB JPG
so the gift was qwen edit 2...
>>
>>107650638
/pol/ memes. it's not good for anime.
>>
File: 1752241271063706.png (1.1 MB, 1000x1048)
1.1 MB
1.1 MB PNG
the black man in image1 is standing with the men in image2. change the background of image1 to the background of image2.

ah sheeeeeeeit!
>>
>>107650655
>qwen edit 2
*3, it's the 3rd iteration kek

don't worry anon, Image base on christmas, trust the plan
>>
>>107650655
>2
not even. more like 1.2
>>
>>107650648
Um custom nodes are NOT SUPPORTED CHUD
>>
>>107650606
baysed
>>
>>107650417
>>107650660
why does it have this weird glossy glowy blur to it?
>>
File: 1739456081281113.mp4 (2.43 MB, 1248x720)
2.43 MB
2.43 MB MP4
Thanks for the MultiGPU fix BigStation anon.
>>107649807
>>
File: ComfyUI_00007_.png (2.9 MB, 1088x1920)
2.9 MB
2.9 MB PNG
yeah i still don't know if they'll release base "on" christmas. Do the chinese really care about our holidays?
>>
File: 1736044087955616.png (3.39 MB, 2896x720)
3.39 MB
3.39 MB PNG
>>107650664
>more like 1.2
I think it's better at keeping the style (still not great though), but it's still zooming in and they still don't have the balls to go pixels only edit (Radiance), like that model can't be use for professional settings since it always compress the image, come on Alibaba, you can do better than this
>>
File: 1760000349857837.png (1.06 MB, 1375x786)
1.06 MB
1.06 MB PNG
>>107650685
trust the plan
>>
cozy bread
>>
>>107650685
there's a reason they released QiE so close to christmas, it's like the appetizer before the big thing (Z-image base)
>>
>>107650692
Not even close. That Miku looks nothing like how Hirohiko Araki draws his females.
>>
>>107650385
Looks like you're using an unsupported vertical resolution. Try a multiple of 32 or 64 (768).
>>
shoot comfy in drag
>>
smootch comfy and plap
>>
>>107650700
Why did labda not work but psi worked?
>>
>>107650712
>unsupported vertical resolution
there's a list of resolutions you should aim for?
>>
>they trained 2511 on nudity so now you can actually get real nudity
>but making anything nude removes accuracy to the character
fuck. what was the "retain accuracy" prompt people use again? funny it was better at nudifying when it had less nudity training.
>>
>>107650728
There probably is a list of resolutions the model was trained on, but in your case, the resolution just doesn't fit properly into matrices used internally. All diffusion models work only with multiple-of-2 resolutions. Common patterns are 16, 32, 64. I guess, qwen doesn't support 16 (752 is a multiple of 16).
>>
Who are the best genners in this general?
>>
>>107650786
the guy that randomly animates people's gens.
>>
>>107650786
me
>>
>>107650711
qie doesnt retain style you'd have to make a style lora for that
>>
>>107650786
Ran and Debo.
>>
>>107650786
the kinosovl poster of course
>>
>>107650554
yeah nano banana pro
>>
File: 1763070265469147.png (741 KB, 936x1112)
741 KB
741 KB PNG
the man in image1 is sitting in a McDonalds and is holding a cheeseburger, on a table with several Mcdonalds brown bags. keep his appearance the same, in a pixelized style like a retro PC game.

8 bit JC
>>
>>107650878
>several
>2
really makes you think
>>
>>107650886
you're not seeing the whole table, there's probably at least 2 more bags.
>>
File: 1757599559395146.png (915 KB, 1024x1024)
915 KB
915 KB PNG
>>107650878
Mikudonalds:
>>
>>107650359
thx anon, got it working. i feel like it's made for chinese, especially the way it tokenizes the text based on written pronunciation, which doesn't work for english so you have to write out lyrics phonetically
managed to gen something that sounds like a shitty system of a down cover:

https://vocaroo.com/1okGYUoVAdsE
>>
File: 1759432306483292.png (1019 KB, 1000x1048)
1019 KB
1019 KB PNG
>>107650891
and of course, we cant forget fent man:
>>
File: 00531-4067431132.png (1.57 MB, 832x1248)
1.57 MB
1.57 MB PNG
I genned this a year and one day ago. back when I was using shitty a1111 or forge (idr)
>>
File: 8_.mp4 (637 KB, 1280x720)
637 KB
637 KB MP4
>>
>>107650927
wha da
>>
>>107650897
Neat. Remember SongBloom is unlike everything you've ever used. It's sampling scheme (they didn't invent pingpong, it's ancestral, apparently) is crazy weird.

I like it, I'm having fun with it.

The exact selected sample plays a huge role. the different settings do too. I have a lot to learn. It's hard to know what exactly does what.

The SongBloom team teased having a prompt input, but as usual they lied. With the chinese, they dump something and move on to totally different projects after that.
>>
>>107650903
now that's definitely several bags
>>
File: file.png (1.16 MB, 1024x1024)
1.16 MB
1.16 MB PNG
>>107650903
>>
File: ComfyUI_278454_.png (1.22 MB, 1024x1024)
1.22 MB
1.22 MB PNG
>>107650786
Me.
>>
>>107650978
B A S E D
A
A
E
D
>>
>>107650978
is this z omni
>>
>>107650786
Your shitty anime general sure doesn't.
>>
>>107650978
what model is this?
>>
File: 1748776253629439.png (1.22 MB, 992x1048)
1.22 MB
1.22 MB PNG
so the 2509 clothes remover lora is still effective with the new edit, it seems.

remove the red shirt from the japanese woman, she is wearing a christmas themed bikini top.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.