I am in a unique situation I want to just be presented with

You are currently reading a thread in /g/ - Technology

Thread replies: 16
Thread images: 3
File: 1461971012187.jpg (2 MB, 2658x3582)
I am in a unique situation

I want to just be presented with a plaintext list of all pastebin paste ids, or at least just a large number of them so I can search through them to find a certain string in the id.

How do this
>>
File: 1464161774743.jpg (405 KB, 1000x1369)
Pls
>>
File: 1461292133810.jpg (1 MB, 2028x2004)
help
>>
>>55008487
On the right side of the website, inside a box, you'll find recent public pastes.
>>
>>55009549
I'm interested in the ids and I want a lot of them at a time, not just looking through one by one and hoping I find an id I want
>>
Look up the Gentoomans Library and find the section on data mining. There should be a book in there on web scraping or something similar that explains how to collect data like this. I don't know myself, but I briefly looked through some of the books and saw something along those lines. Good luck.
>>
>>55009575
218,340,105,584,896 possible urls.
>>
>>55009683
Yeah but how many of those actually represent a paste
To be totally honest all I'm doing is making a fake general on /vg/ and I want a certain other general to see it when they go to the catalog, so I want the name of their general in a pastebin url that can be in my OP

>>55009613
>my only option is searching through dusty old obscure tomes
pls no
>>
kys
>>
>>55009775
Generate a list of all possible paste ids.
Write a script to fork 10 instances of wget, check whether each request returns 200 OK, and then terminate the request (wget has a built-in option for this, --spider). If a request doesn't return 200 OK, remove that id from the list of possible ids. It will probably take a couple of days even with 10 threads. I mean, we can't hold your hand here.
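
A rough sketch of that check, swapping Python worker threads in for the forked wget processes. The base URL, the function names, and the assumption that a missing paste answers with something other than 200 OK are all guesses for illustration, not confirmed behaviour of the site.

```python
import urllib.error
import urllib.request
from concurrent.futures import ThreadPoolExecutor

# Assumed URL layout: BASE_URL + paste id. Not confirmed by the thread.
BASE_URL = "https://pastebin.com/"


def paste_exists(paste_id, timeout=10):
    """Send a HEAD request (no body download) and report whether it returns 200 OK."""
    request = urllib.request.Request(BASE_URL + paste_id, method="HEAD")
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status == 200
    except (urllib.error.URLError, OSError):
        # 404s, timeouts and connection errors all count as "not found" here.
        return False


def filter_existing(candidates, check=paste_exists, workers=10):
    """Run check over all candidate ids with a pool of workers; keep the hits."""
    candidates = list(candidates)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        results = list(pool.map(check, candidates))
    return [paste_id for paste_id, ok in zip(candidates, results) if ok]
```

The check function is a parameter so the filtering logic can be tried offline with a stand-in, e.g. filter_existing(["a", "b"], check=lambda paste_id: paste_id == "a").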
>>
>>55009899
yes I suppose this is the only way
nothing shall stand in the way of my autism
>>
>>55010099
You won't be successful.
>>
>>55010376
I'm only going to generate ids with the string I want in them so it will be faster
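
Something like this would enumerate only the ids that contain the wanted string. It is a sketch that assumes ids are 8 characters drawn from a-z, A-Z and 0-9; the function name and the default length are made up for illustration.

```python
import itertools
import string

# Assumed id alphabet: 26 lowercase + 26 uppercase + 10 digits = 62 characters.
ALPHABET = string.ascii_letters + string.digits


def ids_containing(target, length=8, alphabet=ALPHABET):
    """Yield each id of the given length that contains target, without repeats."""
    free = length - len(target)  # number of characters left to fill in
    if free < 0:
        return  # target is longer than an id, so nothing can match
    seen = set()  # grows with the output, so only practical when free is small
    for offset in range(free + 1):
        for filler in itertools.product(alphabet, repeat=free):
            candidate = "".join(filler[:offset]) + target + "".join(filler[offset:])
            if candidate not in seen:
                seen.add(candidate)
                yield candidate
```

With a 5-character string in an 8-character id this yields at most 4 * 62^3 = 953,312 candidates, which is a much smaller list to check than every possible id.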
>>
>>55010433
Don't forget to report back when you've done it

>pro tip you will never get it done
>>
>>55009683
How? Shouldn't it be 35^11?
>>
>>55011697
Actually no, 26 letters times two for capitals, plus 0-9, is 62. So 62^11 is about 52 quintillion.
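
For what it's worth, the two counts in this thread can be checked directly. The 218,340,105,584,896 figure quoted earlier is exactly 62^8, which suggests that poster assumed 8-character ids rather than 11. The variable names below are just for illustration.

```python
# 26 lowercase + 26 uppercase + 10 digits
ALPHABET_SIZE = 26 * 2 + 10

count_8 = ALPHABET_SIZE ** 8    # ids of 8 characters
count_11 = ALPHABET_SIZE ** 11  # ids of 11 characters

print(f"{count_8:,}")   # 218,340,105,584,896
print(f"{count_11:,}")  # 52,036,560,683,837,093,888
```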

This is a 4chan archive - all of the content originated from them. If you need IP information for a Poster - you need to contact them. This website shows only archived content.