[Boards: 3 / a / aco / adv / an / asp / b / biz / c / cgl / ck / cm / co / d / diy / e / fa / fit / g / gd / gif / h / hc / his / hm / hr / i / ic / int / jp / k / lgbt / lit / m / mlp / mu / n / news / o / out / p / po / pol / qa / r / r9k / s / s4s / sci / soc / sp / t / tg / toy / trash / trv / tv / u / v / vg / vp / vr / w / wg / wsg / wsr / x / y / ] [Home]
4chanarchives logo

What programming language/server/frameworks would you use for a web crawler if you


Thread replies: 12
Thread images: 2

File: 111.jpg (16KB, 240x200px) Image search: [Google] [Yandex] [Bing]
111.jpg
16KB, 240x200px
What programming language/server/frameworks would you use for a web crawler if you were bad at functional programming (no python)?
>>
Java with some NIO-capable server
>>
>>55237966
what about node js? doesnt NPM have tons of libraries for web automation?
>>
>>55237750
Why the fuck would you grab a spider like that? A black widow at that
>>
angularjs bro
>>
>>55238041
That works too.
>>
>>55238068
why not? are you some sort of pussy?
>>
File: lupin_money.gif (1023KB, 500x361px) Image search: [Google] [Yandex] [Bing]
lupin_money.gif
1023KB, 500x361px
>make a service that crawls classifieds sites through TOR in Java
>tfw site owner cucks mad
>tfw can't get blacklisted, ever
>>
>>55238243
OP here. I think I will go with Java too. I am more used to it than JS. Can you recommend some technologies to set up the basic server? It doesnt have to have an UI, but use SQL (no Mongo) and run in parallel on multiple VMs (with same DB), so I think I cant just start everything up with XAMPP.
>>
Jsoup html parser for java.

Thank me later op.
>>
>>55238068
black widows are pretty docile, you can actually handle them without them biting.
>>
>>55238201
You don't need a lot of common sense to just know that spiders need to fucking die and extinct
Thread replies: 12
Thread images: 2
[Boards: 3 / a / aco / adv / an / asp / b / biz / c / cgl / ck / cm / co / d / diy / e / fa / fit / g / gd / gif / h / hc / his / hm / hr / i / ic / int / jp / k / lgbt / lit / m / mlp / mu / n / news / o / out / p / po / pol / qa / r / r9k / s / s4s / sci / soc / sp / t / tg / toy / trash / trv / tv / u / v / vg / vp / vr / w / wg / wsg / wsr / x / y / ] [Home]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.
If a post contains personal/copyrighted/illegal content you can contact me at [email protected] with that post and thread number and it will be removed as soon as possible.
If a post contains illegal content, please click on its [Report] button and follow the instructions.
This is a 4chan archive - all of the content originated from them. If you need information for a Poster - you need to contact them.
This website shows only archived content and is not affiliated with 4chan in any way.
If you like this website please support us by donating with Bitcoin at 1XVgDnu36zCj97gLdeSwHMdiJaBkqhtMK