/g/ - Technology


File: file.png (284 KB, 592x442)
How do I become a devtools menace feared by JAV website owners? What's the K&R of scraping and javascript?
>>
you dont, maybe one jav star to follow, thats it
>>
>>101556764
Man I remember busting gallons to this scene
What was the code again?
my dick may not work as well as it did back in those times but that scene will surely bring back memories ingrained in my head
>>
>>101556764
JAV site owners /fear/ the web-scraping coomer.
>>
>>101556902
answering my question will surely bring back memories of the code in my head
>>
>>101556957
How so?
>>
>>101557494
from experience, most jav sites use hosts/players like doodstream, ustream and many others. The quick and dirty way is to find a way to extract the player/host link and use whatever tool/lib exists to download videos from that host. Most likely yt-dlp will handle many if not all of the popular hosts they use
Basically, your scraper should do this:
>open video
>extract host link
>use yt-dlp (or whatever) to download video
>repeat
Though, the actual smart way to go about it is just to use the JAV code and go to onejav.com, so you find a high quality torrent. The good shit will almost always have seeders
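The open/extract/download loop above can be sketched roughly like this. It is a minimal sketch that assumes the player is embedded through a plain `<iframe>` or `<source>` tag in the static HTML, which many sites do not do (players injected by JavaScript will not show up here). The names `EmbedFinder`, `extract_host_links` and `ytdlp_command` and the example.com URLs are made up for illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class EmbedFinder(HTMLParser):
    """Collect the src of every <iframe> and <source> tag on a page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag in ("iframe", "source"):
            src = dict(attrs).get("src")
            if src:
                # Resolve relative and protocol-relative URLs against the page URL.
                self.links.append(urljoin(self.base_url, src))


def extract_host_links(html, base_url):
    """Return the embed/host URLs found in an already-fetched HTML page."""
    finder = EmbedFinder(base_url)
    finder.feed(html)
    return finder.links


def ytdlp_command(url, out_dir="."):
    """Build (but do not run) a yt-dlp invocation; -P sets the download directory."""
    return ["yt-dlp", "-P", out_dir, url]


# Hypothetical page snippet, standing in for HTML you fetched yourself.
page = '<div><iframe src="//player.example.net/e/abc123"></iframe></div>'
for link in extract_host_links(page, "https://example.com/watch/1"):
    # Hand the list to subprocess.run(..., check=True) to actually download.
    print(ytdlp_command(link))
```

Keeping the extraction step separate from the download step means you can swap in a different extractor per site and leave the yt-dlp call alone.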
>>
>>101556764
why would you want to hoard low resolution porn videos?

>>101556902
MIDV-064
>>
>>101557832
>The quick and dirty way is to find a way to extract the player/host link
i usually try to find the m3u8 link and download it with yt-dlp, but that's not possible a lot of the time, and often websites just reload or redirect when you open devtools
i know there are probably extensions to do that but i want to understand it
another example: i wanted to extract the measurement data from this website but i couldn't figure out how to circumvent the encryption, i know it's possible
https://crinacle.com/graphs/headphones/graphtool/
i want to understand how that works, not just for porn but in general, a lot of the time i find galleries from some weird websites or maybe music playlists and i can't figure out how to scrape it
>>101557868
as long as it's >=720p i dont care
>>
>>101557925
>i usually try to find m3u8 link and download it with ytdlp
just skip the middleman and use ffmpeg directly
alternatively, use mpv with the dump-cache command to watch the stream while also writing it to disk (useful for something like a livestream with no VOD, for example chaturbate)
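For the "use ffmpeg directly" route, a rough sketch of the call wrapped in Python. It assumes ffmpeg is on your PATH and that the playlist is a plain, unencrypted HLS stream. The function names are made up, and the optional `headers` argument is only there because some hosts refuse requests without a Referer.

```python
import subprocess


def ffmpeg_hls_command(playlist_url, out_path, headers=None):
    """Build an ffmpeg command that remuxes an HLS playlist into one file."""
    cmd = ["ffmpeg"]
    if headers:
        # Input options must come before -i; header lines are CRLF-terminated.
        cmd += ["-headers", "".join(f"{k}: {v}\r\n" for k, v in headers.items())]
    # -c copy copies the audio/video streams as-is instead of re-encoding them.
    cmd += ["-i", playlist_url, "-c", "copy", out_path]
    return cmd


def download_hls(playlist_url, out_path, headers=None):
    """Run the command; raises CalledProcessError if ffmpeg exits non-zero."""
    subprocess.run(ffmpeg_hls_command(playlist_url, out_path, headers), check=True)


# Hypothetical URL, shown only to illustrate the resulting argument list.
print(ffmpeg_hls_command("https://cdn.example.com/v/index.m3u8", "out.mkv"))
```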
>i want to understand how that works, not just for porn but in general, a lot of the time i find galleries from some weird websites or maybe music playlists and i can't figure out how to scrape it
there's no real general answer. Every website or app may use completely different methods depending on how it works, what it does and how committed it is to security or anti-scraping measures.
For starters, having good webdev knowledge helps a lot, especially backend and server-oriented stuff. Knowing protocols (HTTP, HLS, TLS, websockets, etc.) and data/file formats (JSON, M3U8, XML, etc.) helps a lot too. You gotta get familiar with reverse engineering webapps through things like devtools, especially network inspection and the source itself. In some cases even using the debugger may be useful. It's really a broad topic since every webapp is a different challenge, but having all that knowledge at least gives you a "feel" for things so you can start probing for a way to scrape.
Good luck on the highway, man
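As one concrete case of "knowing the formats": an HLS master playlist is just text, and you can read the available qualities out of it yourself. A minimal sketch, assuming a standard master playlist where each `#EXT-X-STREAM-INF` tag is followed by the variant URI. The sample playlist, URLs and function names are invented for illustration.

```python
import re
from urllib.parse import urljoin

# KEY=value pairs; quoted values may contain commas (e.g. CODECS).
ATTR_RE = re.compile(r'([A-Z0-9-]+)=("[^"]*"|[^,]*)')

SAMPLE = """#EXTM3U
#EXT-X-STREAM-INF:BANDWIDTH=800000,RESOLUTION=640x360,CODECS="avc1.4d401e,mp4a.40.2"
low/index.m3u8
#EXT-X-STREAM-INF:BANDWIDTH=2800000,RESOLUTION=1280x720,CODECS="avc1.64001f,mp4a.40.2"
hd/index.m3u8
"""


def parse_master_playlist(text, base_url):
    """Return one dict per variant stream listed in a master playlist."""
    variants = []
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    for i, line in enumerate(lines):
        if not line.startswith("#EXT-X-STREAM-INF:"):
            continue
        attrs = {k: v.strip('"') for k, v in ATTR_RE.findall(line.split(":", 1)[1])}
        # The variant URI is the next line that is not a tag or comment.
        uri = next((ln for ln in lines[i + 1:] if not ln.startswith("#")), None)
        if uri is None:
            continue
        variants.append({
            "bandwidth": int(attrs.get("BANDWIDTH", 0)),
            "resolution": attrs.get("RESOLUTION"),
            "url": urljoin(base_url, uri),
        })
    return variants


def best_variant(variants):
    """Pick the variant with the highest advertised bandwidth."""
    return max(variants, key=lambda v: v["bandwidth"])


print(best_variant(parse_master_playlist(SAMPLE, "https://cdn.example.com/v/master.m3u8")))
```

Tools like yt-dlp and ffmpeg do this selection for you, but reading the playlist by hand is a quick way to check whether a "1080p" label is real.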
>>
>>101558400
>in case of something like a livestream with no VOD, for example chaturbate
yt-dlp just calls ffmpeg for chaturbate and you can watch the files as they are written to with mpv
>For starters, having good webdev knowledge helps a lot, especially backend and server-oriented stuff. Knowing protocols (HTTP, HLS, TLS, websockets, etc.) and data/file formats (JSON, M3U8, XML, etc.) helps a lot too. You gotta get familiar with reverse engineering webapps through things like devtools, especially network inspection and the source itself. In some cases even using the debugger may be useful. It's really a broad topic since every webapp is a different challenge, but having all that knowledge at least gives you a "feel" for things so you can start probing for a way to scrape.
>Good luck on the highway, man
just give me a fucking book or a website with some useful documentation or something
>>
>>101558486
>yt-dlp just calls ffmpeg for chaturbate and you can watch the files as they are written to with mpv
really? good to know. Back when I was big into chaturbate, yt-dl didn't support it so I just wrote a script that extracted the m3u8 and plopped that bad boy into mpv directly.
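A script like that can be as small as a regex over the page source. This is only a guess at the general shape, not the actual script: it assumes the playlist URL appears literally somewhere in the HTML or an inline JSON blob, and the names `find_m3u8_urls` and `mpv_command` are made up.

```python
import re

# Absolute http(s) URLs containing ".m3u8", optionally followed by a query string.
M3U8_RE = re.compile(r'https?://[^\s"\'<>\\]+\.m3u8[^\s"\'<>\\]*')


def find_m3u8_urls(page_source):
    """Return the distinct .m3u8 URLs found in a page, in order of appearance."""
    # Inline JSON often escapes slashes as \/, so undo that first.
    text = page_source.replace("\\/", "/")
    seen = []
    for url in M3U8_RE.findall(text):
        if url not in seen:
            seen.append(url)
    return seen


def mpv_command(url):
    """Build (but do not run) the mpv invocation for a playlist URL."""
    return ["mpv", url]


# Hypothetical page snippet with a JSON-escaped playlist URL.
page = '<script>var cfg = {"hls":"https:\\/\\/cdn.example.com\\/live\\/playlist.m3u8?token=abc"};</script>'
for url in find_m3u8_urls(page):
    print(mpv_command(url))
```

If the URL is only built at runtime by JavaScript, this finds nothing, and the network tab in devtools is the place to look instead.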
>just give me a fucking book or a website with some useful documentation or something
there's no magical book that will teach you the entirety of what you need to know my man
it's a journey
The best I could tell you is to learn web development concepts (especially backend, since it's the complex part most of the time) and something like python or go to write your scrapers in.
>>
>>101557868
>MID
it really was, huh
>>
>>101558629
>it's a journey
i'm asking you where to start the journey you fucking mongoloid
>>
>find jav on streaming site
>open devtools
>forces debugger to start running
why are the yellow jew like this
>>
>torrent jav
>claims to be 1080p
>looks like absolute fucking shit
if i can't see skin pores it might as well be 720p
>>
>>101556764
Hana himesaki
>>
who
>>
>>101556764
AIKA is my fav. she looks like she really enjoys it, not just acting.


