Sci-Hub https://sci-hub.io/
Library Genesis https://sites.google.com/site/themetalibrary/library-genesis http://gen.lib.rus.ec/ http://libgen.io/
xD Reddit https://m.reddit.com/r/Scholar/comments/3bs1rm/meta_the_libgenscihub_thread_howtos_updates_and/
Torrents to 52mill+ science articles http://libgen.io/scimag/repository_torrent_notforall/
Example of journal dump http://libgen.io/scimag/journaltable.php?journalid=1457
pic related Biblioteca Vasconcelos
Need help? Questions? Comments? miscellaneous etc.
>>8001984
Thanks
>>8001998
no problem
b u m p
u u . .
m . m .
p . . . p
>>8001984
>Torrents to 52mill+ science articles
what can you tell me about those torrents? are there any people seeding them? how large are they all roughly? I'm thinking about downloading all of them on my uni's net, simply because I'm a hoarder
>>8002920
also I wonder if they'll start asking me questions when I download hundreds of gigabytes on their connection
>>8002920
>>8002930
>download torrent at random
>all the files seemingly random symbols and numbers for names
>download other torrent at random
>random heap of letters followed by something that looks like a date
nevermind
>>8002920
>>8002930
>>8002966
Each torrent is 100 ZIP files of 1000 articles each. Every ZIP file I've opened so far contains 1000 PDF files. Each ZIP is anywhere from 300MB to 900MB. sm_00000000-00099999.torrent includes the first 100K articles. The first ZIP in this torrent is labelled something like libgen.scimag00000000-00000999.zip. I get about 15MB/s maximum download speed on my seedbox.
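If the naming really follows that pattern, you can work out which torrent and ZIP should hold a given article number with plain integer arithmetic. Rough sketch only: the function name is made up, and the 8-digit zero-padded templates are assumed from the two file names quoted above, not checked against every torrent.

```python
def locate_article(article_id: int) -> tuple[str, str]:
    """Return the (torrent, zip) names expected to hold an article number.

    Assumes 100,000 articles per torrent and 1,000 per ZIP, with names
    following the sm_... and libgen.scimag... patterns quoted in the thread.
    """
    torrent_start = article_id // 100_000 * 100_000
    zip_start = article_id // 1_000 * 1_000
    torrent = f"sm_{torrent_start:08d}-{torrent_start + 99_999:08d}.torrent"
    zip_name = f"libgen.scimag{zip_start:08d}-{zip_start + 999:08d}.zip"
    return torrent, zip_name


if __name__ == "__main__":
    print(locate_article(123456))
```

For article 0 this gives sm_00000000-00099999.torrent and libgen.scimag00000000-00000999.zip, which matches the names above.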
Same person as above.
Here are the files in libgen.scimag33018000-33018999.zip in sm_03300000-03399999.torrent
http://pastebin.com/MEDsmXwT
The folders appear to correspond to the DOI registrant number of the journals. It would be possible to download only the journals that interest you if the directory structure of each ZIP could be read from just a few downloaded chunks, and then someone could post the listings online.
>>8004440
It appears uTorrent might be able to accomplish this, by prioritizing the end of files.
Success. I can use uTorrent to download just the last ~4% of every ZIP and then use any standard ZIP software to extract the entire directory structure.
Okay so using uTorrent I downloaded all the footers of the ZIP files of the 33000000 torrent. It took about 10 minutes and ate up 2GB of bandwidth. I'm sure this could be improved greatly.
I fed the download directory into 7z.exe and it spit out all 999999 file names. There was only one directory "10.1002". The DOI 10.1002 registrant code returns the Wiley Online Library http://onlinelibrary.wiley.com/
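The reason this works: a ZIP keeps its table of contents (the central directory) at the end of the file, so a reader only needs the tail to list names. Here is a rough Python equivalent of the 7z step. It assumes the torrent client left each partial file at its full size with the tail pieces intact, so the offsets stored in the directory still line up. Both function names are hypothetical.

```python
import zipfile


def names_from_partial_zip(path_or_file):
    """List member names using only the central directory at the end of the file.

    ZipFile parses the end-of-central-directory record and the central
    directory when it opens the archive; namelist() reads no member data,
    so missing or zeroed bytes earlier in the file do not matter here.
    """
    with zipfile.ZipFile(path_or_file) as zf:
        return zf.namelist()


def registrant_prefixes(names):
    """Collect the top-level folder of each member name, e.g. '10.1002'."""
    tops = {n.replace("\\", "/").split("/", 1)[0] for n in names}
    return sorted(t for t in tops if t)
```

Extracting the PDFs themselves would still fail on such a file; only the listing is recoverable.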
Here are some URL decoded files in libgen.scimag00000000-00000999.zip :
10.1002\(sici)(1997)5:1<1::aid-nt1>3.0.co;2-8.pdf
10.1002\(sici)1096-8628(19960102)61:1<65::aid-ajmg12>3.0.co;2-u.pdf
10.1002\(sici)1096-8628(19970627)70:4<371::aid-ajmg8>3.0.co;2-w.pdf
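For anyone who wants to do the decoding step themselves, a small sketch. It assumes the names inside the ZIPs are percent-encoded and laid out as registrant folder, separator, DOI suffix, then ".pdf". The encoded input in the usage line is reconstructed from one of the decoded names above, and decode_member is a made-up name.

```python
from urllib.parse import unquote


def decode_member(name: str) -> tuple[str, str]:
    """Return (registrant_prefix, doi_like_string) for an encoded member name."""
    # Undo percent-encoding and normalise Windows-style separators.
    decoded = unquote(name).replace("\\", "/")
    prefix, _, rest = decoded.partition("/")
    # Drop the file extension so what is left looks like a DOI suffix.
    suffix = rest[:-4] if rest.lower().endswith(".pdf") else rest
    return prefix, f"{prefix}/{suffix}"


if __name__ == "__main__":
    encoded = ("10.1002/%28sici%291096-8628%2819960102%2961%3A1"
               "%3C65%3A%3Aaid-ajmg12%3E3.0.co%3B2-u.pdf")
    print(decode_member(encoded))
```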
The goal is to list all journals associated with each ZIP file so people don't have to download all 50TB to get what they want.
I'll need to investigate DOIs further to figure this out.
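Until the journal mapping is sorted out, a first pass could index by registrant prefix instead. This is only a sketch: it tells you which ZIPs contain a given publisher prefix, not which journal, and the input format (ZIP name mapped to its list of member names) and the function name are my assumptions.

```python
from collections import defaultdict


def build_prefix_index(zip_listing):
    """Map each registrant prefix to the sorted ZIP names that contain it.

    zip_listing: dict of ZIP file name -> iterable of member names, where
    each member name starts with a registrant folder such as '10.1002'.
    """
    index = defaultdict(set)
    for zip_name, members in zip_listing.items():
        for member in members:
            prefix = member.replace("\\", "/").split("/", 1)[0]
            if prefix:
                index[prefix].add(zip_name)
    return {prefix: sorted(zips) for prefix, zips in sorted(index.items())}


if __name__ == "__main__":
    # Hypothetical listing, only to show the shape of the input and output.
    listing = {
        "a.zip": ["10.1002/x.pdf", "10.1016/y.pdf"],
        "b.zip": ["10.1002\\z.pdf"],
    }
    for prefix, zips in build_prefix_index(listing).items():
        print(prefix, zips)
```

With an index like this posted somewhere, people would only need to grab the ZIPs listed under the prefixes they care about.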
>>8004752
doing g-d's work
>>8006838
thx. some torrents contain tons of different journals (the first one) and others contain all one journal (sm_00900000-00999999.torrent, ChemInform)
I have about the first million files indexed. I plan on laying everything out in thread #4.
If you're replying to this please sage as I am trying to start thread #4
>>8007683
#3*