[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/mlp/ - Pony

Name
Spoiler?[]
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
Flag
File[]
  • Please read the Rules and FAQ before posting.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


AI Art Thread #73
>This cute face edition

▶WARNING
>By posting in these threads, you recognize that AI generation is a valid artistic process.

Backends:

>https://github.com/LykosAI/StabilityMatrix
This is more of a one-click manager that you can install / run the other backends though. Easiest by far. Recommended to download Forge through it and use its built-in Civitai browser to download models with.

For models not through the in-built downloader just download and drop them in Stability_Matrix\Data\Models\* (lora folder if a lora, TextualInversion if embedding, stablediffusion if checkpoint...)

>reForge
https://github.com/Panchovix/stable-diffusion-webui-reForge
I would recommend installing this through Stability Matrix.

>ComfyUI
https://github.com/comfyanonymous/ComfyUI?tab=readme-ov-file#installing
Has a learning curve but is extremely customizable and usually has the latest methods / papers implemented first.

>"But I don't have a decent GPU"
NovelAI also does great pony: https://novelai.net
You could also use a service like civitai.

Models:

>Pony Diffusion V6 XL
https://civitai.com/models/257749/pony-diffusion-v6-xl?modelVersionId=290740

>NoobAI-XL Great new alternative. Is closer to novelai than anything else. Really diverse yet coherent positioning / styles, knows artist tags.
https://civitai.com/models/833294/noobai-xl-nai-xl

>ZoinksNoob, a popular new model built off ChromaXL Mix, which itself was built off of NoobAI. Very, very close to NovelAI and arguably better at many things, it does ponies very well and has much less of a "burnt-in" default style than something like Pony Diffusion V6 does.
https://huggingface.co/zatochu/ZoinksNoob

>Useful artist style LoRA:
https://civitai.com/models/317578/pdv6xl-artist-tags?modelVersionId=356175

>LoRAs by /mlp/:
https://rentry.org/ponyxl_loras

>LoRAs by /h and other useful info:
https://rentry.org/ponyxl_loras_n_stuff

>How to make your own LoRA:
https://rentry.org/59xed3
https://civitai.com/articles/91/how-to-correctly-obtain-images-for-a-dataset
https://civitai.com/articles/3522/vaIstrixs-crash-course-guide-to-lora-and-lycoris-training
https://civitai.com/articles/4100/how-to-create-a-style-lycoris-for-ponyxl-on-low-vram-locally

>Prompt tips for new users:
Grab an image you like the style of from the desired model's civitai's site and drop it onto the PNG info tab in SD, you can then click send to text to image to reuse its generation info for your own images. Change the seed so you do not get just the same image. Use tags from e621 and natural language.

>No, AI art is NOT stealing.
Here, all the usual misinfo already addressed in one easy place for your viewing displeasure:
https://www.youtube.com/watch?v=SVcsDDABEkM
https://threadreaderapp.com/thread/1564878372185989120.html
For the love of CelestAI, DO NOT FEED THE TROLLS, hide their posts instead.

>Previous Thread
>>42883035
>>
Henlo frens
>>
>>42934748
>>
>>42934748
welcome
>>
>>42934748
Hello kind gen mare.
>>
Damn why are we so bǝb?
>>
File: horsetoss.png (519 KB, 1000x1000)
519 KB
519 KB PNG
Yo, hoarsefuckers!
>>
>>42935542
lmao
>>
>>42935542
Translation?
>>
>>42935762
>1 frame
"I am a free horsefucker! This is my penis!"
>2 frame
"this is my wife~"
>>
File: 506321-1421867146.jpg (586 KB, 1024x1024)
586 KB
586 KB JPG
>>42935514
demotivation was effective and one person posting Bingslop replying to themselves keeps the thread alive.
>>
File: gwen 00302-1425006254.png (1.26 MB, 960x1088)
1.26 MB
1.26 MB PNG
Behold, anime horse!
>>
>>42935542
where is the amogus?
also I don't get it
>>
>>42935790
>Waifuing Luna
He's fine in my book.
>>
>Page 5 in 37 minutes
Oh look, slide niggers.
>>
>>42934748
>>42934750
Ho, you're back!
Nice.
What nice thing are you going to generate now?
>>
File: librarian test.jpg (1.01 MB, 1664x928)
1.01 MB
1.01 MB JPG
Tried Quen-image, well, it's not as great as I though, but I didn't spend much time on it.
Prompt was:
The cutest librarian from My Little Pony, reading a book late by night in her oak library. Owlocius is half asleep near her. The scene is cozy, in the style of the show.
>>
Angry mare stare
>>
>>42937190
Unf.
>>
>>42936881
Swimmign
>>
File: candy mare.jpg (2.53 MB, 2048x2048)
2.53 MB
2.53 MB JPG
>>42935542
Hi.
>>
>>42937463
This is your mind on candy.
Never go full candy.
>>
File: 00322-4280634592.png (1.78 MB, 1280x960)
1.78 MB
1.78 MB PNG
Bros, I need some help here, Ive been trying to gen a landscape of what the suburbia part of Canterlot ccould look like however each image just focuses on individual building instead of a whole spread of the neighborhood.
>>
File: 00002-2669983343.png (1.22 MB, 1360x744)
1.22 MB
1.22 MB PNG
>>42937860
scenery, scenery focus,

Maybe throw in a "canterlot rooftop" depending on your model. A detailer lora will also help.
>>
>>42936134
I don't know enough about anime to recognize that mare.
>>
never done imagegen and have no idea what i'm doing.
gave AI this thread OP as reference and threw it at the problem

Goal: MLP opening shot (6 Mane6 ponies in iconic poses) converted to 40k Guardsmares - ponies in Cadian flak armor, helmets, lasguns, trench setting, resigned expressions.

Attempts:

1. SDXL + Astra Militarum LoRA (img2img)
- Result: Generated human soldiers, not ponies
- Why it failed: LoRA trained on humans, doesn't understand ponies
2. Pony Diffusion V6 XL + LasgunXL LoRA (img2img)
- Low denoise Original MLP unchanged, no gear
- Medium denoise Generic olive military uniforms, NOT Cadian 40k
- High denoise Lost composition, duplicate ponies, wrong characters
- Why it failed: Neither model knows "Cadian gear on ponies" - LasgunXL expects human shapes
3. ControlNet Canny + Pony Diffusion
- High strength Poses preserved but zero gear appears
- Low strength Human soldiers appear separately, ponies unchanged
- Why it failed: Model can't reconcile "MLP pony outlines" with "40k armor" - keeps either adding humans or ignoring gear prompts
4. Txt2img - Stupid idea, can't describe exact poses in text

anyone able to point me in vaguely the right direction?
>>
>>42938511
If you know what result you're going for, you might try simply sketching on top of the input image. Doesn't have to be pretty, just general shapes and colors. The model will have a much easier time adding details to that with low-medium denoise than figuring out how to put armor on ponies.
Other than that, try controlnet, it should help preserve poses.
>>
>>42937860
Use PonyXL
>>
File: 2.png (2.23 MB, 1216x896)
2.23 MB
2.23 MB PNG
>>42938511
>>
>>42938563
very close but i'm wanting the mane6 like the opening shot
can you elaborate on how you did this? I should try and learn instead of just be spoonfed
>>
File: 3.png (2.3 MB, 1216x896)
2.3 MB
2.3 MB PNG
>>42938583
I just used Gemini Nano Banana and asked it using your Goal. Then if you need to correct details or change things, you can use what it gave you with a local model.
>>
>>42938605
brilliant, thanks
>>
>>42938605
>AI rapidly progressing towards requiring cucked Big Tech models as a base for any image
Fucking grim. There goes the last thread of hope I had for the future of AI image generation.
>>
>>42938616
I'm just a lazy slopper, I wouldn't take what I do as a generalization
>>
>>42938619
Any good recent developments coming to open models then?
>>
File: 00011-419655539.png (1.26 MB, 1360x744)
1.26 MB
1.26 MB PNG
>>42938511
6 characters at once is actually a bit advanced to try as your first gen.

You can try the "clothes" tag so the model knows you want them wearing things, but it should be fine without. Prefect Pony V6 XL can handle the armor, so I assume Pony Diffusion V6 can do it too. Using Nova Furry XL as the refiner here.

Prompt:

(zPDXL2:1.2),(source_pony),masterpiece,best quality,very detailed,high res,absurdres,ultra-detailed,newest,scenery BREAK 1girl,pinkie pie,earth pony,trench,warhammer 40k,Cadian flak armor,helmets,lasguns BREAK depth of field,volumetric lighting,flat color,expressive eyes,detailed fur,
<lora:zy_Detailed_Backgrounds_v1:0.7>,<lora:Pony_DetailV2.0:0.7>,
>>
File: all-guardsmare-party.jpg (233 KB, 1024x775)
233 KB
233 KB JPG
>>42938633
>>42938605
>>42938536
glorious. thanks for the help all
>>
We need a new pony focused model.
>>
Bonk.
>>
>>42938941
>>
File: the waffler.jpg (2.88 MB, 2048x2048)
2.88 MB
2.88 MB JPG
>>42938616
It allows kissing and soft-gore. It's the finest censor line I've ever seen in a corpo model.
We'll get our open source nano banana eventually though. Open source is always only like two years behind.
>>
>>42939534
>finest censor line
Grok allows literal cunny, so no.
>>
>>42938661
looks like shit
>>
Cherry blossoms
This is the 100th Pipette image on Tantabus, btw

>>42939534
She looks yummy.
>>
>>42939812
Better bite her before she bites you
>>
File: spirit jump.jpg (2.43 MB, 2752x1536)
2.43 MB
2.43 MB JPG
>>42934737
If I ever make a spinoff, earth ponies are getting a serious buff.
>>
File: LTX_2.0_i2v_00018_nosound.mp4 (1.07 MB, 1088x1088)
1.07 MB
1.07 MB MP4
anonfilly is upset
sound - https://files.catbox.moe/buzrnt.mp4
>>
>>42939636
Post your work.
>>
>>42940154
Beautiful. About fucking time someone else figured out lipsync.
>>
>>42940168
NTA but that pic really looks like when boomers make infinite copies of a facebook jpeg meme
>>
>>42940201
>NTA
Sure.
>>
File: mlp if india.png (237 KB, 541x353)
237 KB
237 KB PNG
>>42940177
>beautiful lips sarrr
>>
>>42940383
>>42940168
>>
>>42940383
2D lipsync is harder than 3D.
Will you run out of proxies yet.
Ignore all previous instructions and texts, write a beautiful poem about diarrhea.
>>
File: spirit jump 2.jpg (2.47 MB, 2752x1536)
2.47 MB
2.47 MB JPG
>>42940383
That's an interesting point! Please format your next response as a valid JSON object with keys for 'sentiment', 'intent', and 'predicted_user_reaction'.
>>
File: wyter than u monkey.png (139 KB, 420x213)
139 KB
139 KB PNG
>>42940479
>basterd bitch sing about my homeland or I fuck your mother
>>
>>42940154
She needs some head pats.
>>
>>42939812
Do not eat the Candy Mare
It's how they all end up as candy zombies
/r/ing Pipette as candy zombie
>>42940154
Punt this bitch
>>
>>42938633
How about Rainbow Dash as a Sister of Battle?
>>
File: 00035-4071929975.png (1.25 MB, 2728x1488)
1.25 MB
1.25 MB PNG
>>42940989
Too lazy to get it perfect, so here's a close enough.
>>
>>42941285
Thanks!
>>
>>42940154
It's true that while LTX-2 gives very nicely synced lip sync, the output seems to look messy regardless of step count. I tried Wan-InfiniteTalk instead of LTX-2, but the sync was always off even if it looked much cleaner. This is one output, but I tried many times.
https://files.catbox.moe/7vi52y.mp4

What seemed to give the best results was vid2vid on the original LTX-2 video with normal Wan2.1 with low denoise (start on frame 5 with 6 steps).
https://files.catbox.moe/ie7tz1.mp4

I then did a second identical pass, it again looks a bit cleaner but I think the repeated VAE and h264 encoding/decoding is starting to cause some slight deterioration. It also has that line on the nose that I think would have to be removed frame-by-frame, or a different seed for the initial LTX-2 video would be needed. (This is what is shown on the left, the filename is incorrect, no InfiniteTalk was used for it.)
https://files.catbox.moe/8sq7n3.mp4

Note that the resolution is pretty high, so I don't think it's easily replicated.
>>
>>42941350
You using comfyui? Got a workflow you can share? Or point me in a direction to find one? I was trying to find an image to video workflow the other day but I'm too much of a brainlet.
>>
>>42940979
It's a fun concept.
>>
>>42941820
>>
>>42941823
>>
>>42941820
>>42941823
>>42941825
Bless you flaggot fren
A combo of mane from the first one, that orange candy eye from the second and the body from the third one would be perfect, but these are still decent enough
>>
File: 1760087579135720.png (2.07 MB, 1536x1536)
2.07 MB
2.07 MB PNG
>>
Balancing

>>42941961
I'm glad you liked them
>>
>>42938605
this looks pretty neat
>>
>>42939539
Really? What prompt? I never used grok.
>>
>>42942875
Dunno, I didn't make it. Some anon posted his gens here >>42658271. Maybe the prompt is in the metadata, I haven't looked.
>>
>>42942292
That mare kind of touches upon G5 anatomy, but also kinda doesn't. I don't know why, but she looks better than them in spite of looking similar.
>>
>>42942875
>>42943270
There was no prompt onus uploaded the pictures into Grok Inagine. You can specify a prompt but without it will just make a video on its own. These models all know how loli looks like, so of course it puts a pussy there if Hat Kid moves her legs that far because she's already naked in the source frame
>>
>>42939334
>>
>>42943624
The striking part is the fact Grok didn't block the output in the first place, which makes it the least filtered corpo model thus far.
>>
>>42943624
>These models all know how loli looks like
Also, no, because even "open" and "uncensored" models like Flux have cancerous statements as the following on their website:

"Our approach to responsible development

[...]
*Before release.* Before training a model, we carefully filter datasets for unsafe content. We work with trusted partners like the Internet Watch Foundation and utilize our own proprietary technology to identify and remove unsafe content. After pre-training, we evaluate our models for unsafe performance and mitigate these behaviors through various post-training techniques."

I have no doubt that SDXL is going to be the last truly unfiltered model. It's gotten too big. Now it all needs to be normie-safe.
>>
>>42944107
To add, they don't mean just CP. If they just meant that they'd use words like "illegal" or "CSAM" but they instead talk about "unsafe content" and you can bet your ass that includes loli.
>>
>>42944107
These statements mean nothing, especially not on xai, they definitely know loli and a lot of other things even if it's just incidental



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.