[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: 1713732097544615.jpg (83 KB, 636x942)
83 KB
83 KB JPG
>Privacy respecting search engines
>... with own crawler
duckduckgo.com - Includes Bing results
search.brave.com
www.mojeek.com
>... without own crawler
searx.space - List of instances
www.startpage.com
metager.org

>Focus on non-commercial and classic sites
wiby.me
search.marginalia.nu

>Special purpose
boardreader.com - For forum posts
sites.google.com/view/l33tech/tools/pasteskimmer - For pastebins
tlgs.one/search - Gemini search (web mirror)

>File searches
www.eyeofjustice.com/od/ - Open directory search
lendx.org - Similar to above, but a slightly different query
filepursuit.com - OD search with its own index
odcrawler.xyz - Same as above
www.mmnt.ru/int - FTP Search
www.dedigger.com - Google Drive search
drodigger.com - Dropbox search
s3digger.com - Amazon S3 search
yadigger.com - Yandex Disk search
shortdigger.com - Search for files shared with url shorteners
fidigger.com - Search file sharing services
uploadedtrend.com - Same as above, with seemingly better results

>Collection of Google Dorks
www.exploit-db.com/google-hacking-database

>Comparison of various search engines in security and privacy matters
searchengine.party

Previous: >>100141815
>>
>>100148818
Tried to include the 4get instances site but 4chan thinks it's spam.
You have some enemies among the admins, 4get anon.
>>
bump
>>100148862
interesting
>>
Kagi gang
>>
>>100149437
is that actually good enough to justify paying money?
>>
any that dont censor loli hentai
>>
What is a good search engine for high quality images? (Particularly images of anime girls)
Serious question btw
>>
>>100149437
>kagi shilling is allowed
>4get shilling isn't
i dont get it
>>
>>100150412
try yep.com
>>
>>100150721
any booru site
safebooru
gelbooru
sankaku
yande.re
>>
File: 1713912984079.jpg (935 KB, 2237x1654)
935 KB
935 KB JPG
>>100148818
Sexo
>>
>>100150306
You can try it out. First 100 searches are free.
>>
>>100151851
kagi sucks, it's basically just a metasearch for google, bing and marginalia
>>
>>100148818
>"1488"

Now this is a gem.
>>
neck yourself poltard
>>
>>100148818
You should add these sites to the sticky:
https://www.searchenginemap.com/
https://leta.mullvad.net/
https://mwmbl.org/
https://feedle.world/
https://stract.com/
https://yep.com/.
https://sr.ht/~benbusby/whoogle-search/
There's a new open source, indieweb focused one I found on Hacker News called OpenOrb. Its decentralized and revolves around making your own instance unlike wiby and marginalia. Link to main instance: https://openorb.idiot.sh/search . Its basically a search engine that is curated from RSS/Atom feeds. It basically allows you surf blogs and articles from someone else's catalog. Also, DuckDuckGo is pozzed. Its 95% Bing, 5% their own results from their shitty webcrawler (they used to also include Yandex especially when used without JS but dropped them because of the Russia Ukraine war) and they have been caught violating their stance on privacy ( https://lemmy.ml/post/31321 & https://www.wired.com/story/duckduckgo-microsoft-twitter-ft-bush-assassination-whatsapp/ ).
>>
>>100154528
Thanks for the links, will be looking into them
>I found on Hacker News called OpenOrb
Also saw that on HN yesterday lol. But it's not really a search engine for the web. It's just searching all the RSS feeds you subscribed to (if you selfhost). If you use someone else's instance you search their subscriptions basically. And I doubt that the index of any instance is bigger than marginalia. The marginalia dev created an export of all the domains upon request from the HN thread
downloads . marginalia . nu/exports/
(fuck off 4chan with their garbage spam detection)
>>
>>100154528
>feedle
never heard about that one, thanks for the share anon

my recommendation: crowdview is good for programming questions
>>
>>100148818
Imagine humping that thing every single night
>>
I wrote my own pasta using previous knowledge and after reading the posts here. I heavily use surfraw to launch searches from anywhere, btw.

/set/ - Search Engine Thread


>Why not just stick with Google?
Search engines are a biased window into the web. We want to filter out SEO and discover actual interesting webpages.
>SHIT tier alternatives
duckduckgo.com

>Deep tier (best first)
search.marginalia.nu
stract.com
mojeek.com
yep.com
rightdao.com
boardreader.com
infotiger.com
ultra.gondola.pics (near fulltext 4chan archive)

>"Specialized" tier
alexandria.org
wiby.me
searchmysite.net
feedle.world
grep.app
searchcode.com
search.pullpush.io (reddit fulltext)

>Premium
kagi.com
leta.mullvad.net

>Proxies to Google or DDG, if you must
https://4get%2Eca
searx.space
sr.ht/~benbusby/whoogle-search/#public-instances

>Privacy and security comparison of some search engines
searchengine.party

>Choose from hundreds of search engines from the command line, courtesy of Julian Assange (real)
pacman -S surfraw



Get comfortable with using multiple search engines.
>>
>>100154528
mwbl is the reddit of search engines
>>
>>100155251
Crowdview seems down right now. It was also broken a few weeks ago when I tried it
>>
>>100156676
Why no file searches?
And what does "deep tier" mean?
I also don't think that marginalia is "better" than mojeek, or that they should be in the same category.
>>
>>100157435
I didn't have anything to add to file searches. The next OP can insert the file searches list. It's an open source text.
I could've written Great tier instead, I just meant Deep because a good search engine finds deep links.
Marginalia (in my experience!) has more interesting links than Mojeek. Putting Mojeek below fledgling Stract is perhaps unfair though
>>
>>100156676
I also want to add that my view is heavily borrowed from this article: https://seirdy.one/2021/03/10/search-engines-with-own-indexes.html
I forgot to but I would include Yandex
>>
>>100157602
>I forgot to but I would include Yandex
Google tier data collection
https://yandex.com/legal/confidential/
>>
>>100156676
Good links, will be including some of them in the next thread
>>
How do you objectively know a search engine is good? Are there specific search terms you always test?
>>
>>100158667
>Are there specific search terms you always test?
Yeah. I always search for something very specific to the uni I went and which is rather obscure. Some don't find it at all, the bigger ones all find it. Getting that as a top result is always a sign for me that the search engine has a big index.
>>
>>100158667
I usually try searching things that are outside the mainstream (corporate websites, legacy media, & social media/platforms). I usually go with an edgy site (4chan, kiwifarms, agoraroad, random forums and altchans) to see if it has censorship or an indieweb site I found in some webring. Then I try to be specific by going through articles in my bookmarks or rss feed and see if the search engine gives it as one of its first websites.
>>
Thought ddg was a meme but have been trying it lately and it werks.
>>
>>100148818
That is a child.
>>
>>100158667
any programming question
>>
>>100159915
See the smug look on her face. She's old enough to know what she's doing.
>>
>>100159915
>>100160278
Whatever. Being attracted to a japanese equivalent of a barbie doll or discussing whether it is old enough to be sexualized are mental illnesses in themselves
>>
File: hello reddit2.jpg (437 KB, 1508x1493)
437 KB
437 KB JPG
>>100162064
>Being attracted to a japanese equivalent of a barbie doll



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.