Sci-Hub / Library Genesis general #2

You are currently reading a thread in /sci/ - Science & Math

Thread replies: 16
Thread images: 5
File: 175027945_23278ebcb9_z.jpg (106 KB, 640x430)
Sci-Hub https://sci-hub.io/
Library Genesis https://sites.google.com/site/themetalibrary/library-genesis http://gen.lib.rus.ec/ http://libgen.io/
xD Reddit https://m.reddit.com/r/Scholar/comments/3bs1rm/meta_the_libgenscihub_thread_howtos_updates_and/
Torrents to 52mill+ science articles http://libgen.io/scimag/repository_torrent_notforall/
Example of journal dump http://libgen.io/scimag/journaltable.php?journalid=1457

pic related Biblioteca Vasconcelos

Need help? Questions? Comments? miscellaneous etc.
>>
>>8001984
Thanks
>>
File: WWW.YIFY-TORRENTS.COM.jpg (128 KB, 350x500)
>>8001998
no problem
>>
b u m p
u u . .
m . m .
p . . . p
>>
>>8001984
>Torrents to 52mill+ science articles
what can you tell me about those torrents? are there any people seeding them? how large are they all, roughly? I'm thinking about downloading all of them on my uni's net, simply because I'm a hoarder
>>
>>8002920
also I wonder if they'll start asking me questions when I download hundreds of gigabytes on their connection
>>
File: 1460577354726.gif (2 MB, 390x261)
>>8002920
>>8002930
>download torrent at random
>all the files seemingly random symbols and numbers for names
>download other torrent at random
>random heap of letters followed by something that looks like a date
nevermind
>>
File: asdf.png (311 KB, 490x379)
>>8002920
>>8002930
>>8002966
Each torrent contains 100 ZIP files of 1000 articles each. Every ZIP file I've opened so far held exactly 1000 PDF files. Each ZIP is anywhere from 300MB to 900MB. sm_00000000-00099999.torrent includes the first 100K articles. The first ZIP in this torrent is labelled something like libgen.scimag00000000-00000999.zip. I get about 15MB/s maximum download speed on my seedbox.
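
The numbering described above can be sketched in a few lines of Python. This is only an illustration: it assumes the naming patterns are exactly sm_START-END.torrent and libgen.scimagSTART-END.zip with 8-digit zero-padded numbers, as in the two names quoted in this post, and the function name archive_names is made up for the example.

```python
ARTICLES_PER_ZIP = 1000         # per the post: 1000 articles per ZIP
ARTICLES_PER_TORRENT = 100_000  # per the post: 100 ZIPs per torrent


def archive_names(article_number: int) -> tuple[str, str]:
    """Return (torrent_name, zip_name) for a 0-based article number.

    The file name patterns are assumptions based on the examples in the thread.
    """
    t_start = article_number // ARTICLES_PER_TORRENT * ARTICLES_PER_TORRENT
    z_start = article_number // ARTICLES_PER_ZIP * ARTICLES_PER_ZIP
    torrent = f"sm_{t_start:08d}-{t_start + ARTICLES_PER_TORRENT - 1:08d}.torrent"
    zip_name = f"libgen.scimag{z_start:08d}-{z_start + ARTICLES_PER_ZIP - 1:08d}.zip"
    return torrent, zip_name


print(archive_names(0))
print(archive_names(950_123))
```

If the real names deviate from this pattern for some ranges, the two format strings are the only thing that would need changing.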
>>
File: asdf.gif (156 KB, 500x355)
Same person as above.

Here are the files in libgen.scimag33018000-33018999.zip in sm_03300000-03399999.torrent

http://pastebin.com/MEDsmXwT
>>
The folders appear to correspond to the DOI registrant code of each journal's publisher. It would be possible to download only the journals that interest you if the directory structure of each ZIP could be determined from just a few of its chunks, and then someone could post the listings online.
>>
>>8004440
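It appears uTorrent might be able to accomplish this by prioritizing the pieces at the end of each file, which is where a ZIP keeps its central directory (the list of everything in the archive).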
>>
Success. I can use uTorrent to download roughly the last 4% of every ZIP and then use any standard ZIP software to extract the entire directory structure.
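
For anyone who wants to see why the tail alone is enough, here is a rough Python sketch that lists member names from only the final bytes of a ZIP by parsing the end-of-central-directory record and the central directory headers by hand. It is not what the ZIP software used in this thread does internally, just the same idea spelled out. The function name names_from_tail and the in-memory demo archive are made up for illustration, and the sketch assumes a plain single-disk ZIP with no ZIP64 extensions.

```python
import io
import struct
import zipfile

EOCD_SIG = b"PK\x05\x06"  # end-of-central-directory record signature
CDH_SIG = b"PK\x01\x02"   # central directory file header signature


def names_from_tail(tail: bytes, total_size: int) -> list[str]:
    """List member names given only the last len(tail) bytes of a ZIP.

    total_size is the size of the complete archive in bytes.
    Assumes no ZIP64 and a single-disk archive.
    """
    eocd_pos = tail.rfind(EOCD_SIG)
    if eocd_pos < 0:
        raise ValueError("end-of-central-directory record not found in tail")
    # EOCD layout: sig, 4 x uint16, 2 x uint32 (CD size, CD offset), uint16
    (_sig, _disk, _cd_disk, _n_disk, n_total,
     _cd_size, cd_offset, _comment_len) = struct.unpack_from("<4s4H2IH", tail, eocd_pos)
    tail_start = total_size - len(tail)
    if cd_offset < tail_start:
        raise ValueError("tail too short: central directory starts before it")
    pos = cd_offset - tail_start
    names = []
    for _ in range(n_total):
        if tail[pos:pos + 4] != CDH_SIG:
            raise ValueError("unexpected data in central directory")
        (flags,) = struct.unpack_from("<H", tail, pos + 8)
        name_len, extra_len, comment_len = struct.unpack_from("<3H", tail, pos + 28)
        raw = tail[pos + 46:pos + 46 + name_len]
        # Bit 11 of the flags marks UTF-8 names; otherwise CP437 is the default.
        names.append(raw.decode("utf-8" if flags & 0x800 else "cp437"))
        pos += 46 + name_len + extra_len + comment_len
    return names


# Demo on a small archive built in memory (stand-in for a real scimag ZIP).
buf = io.BytesIO()
with zipfile.ZipFile(buf, "w") as zf:
    zf.writestr("10.1002/a.pdf", b"x" * 5000)
    zf.writestr("10.1002/b.pdf", b"y" * 5000)
data = buf.getvalue()
print(names_from_tail(data[-300:], len(data)))
```

How many trailing bytes are needed depends on the number and length of the file names in the archive, so a fixed percentage is only a rule of thumb.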
>>
Okay so using uTorrent I downloaded all the footers of the ZIP files of the 33000000 torrent. It took about 10 minutes and ate up 2GB of bandwidth. I'm sure this could be improved greatly.

I fed the download directory into 7z.exe and it spit out all 999999 file names. There was only one directory, "10.1002". The DOI registrant code 10.1002 belongs to the Wiley Online Library http://onlinelibrary.wiley.com/

Here are some URL-decoded file names in libgen.scimag00000000-00000999.zip:
10.1002\(sici)(1997)5:1<1::aid-nt1>3.0.co;2-8.pdf
10.1002\(sici)1096-8628(19960102)61:1<65::aid-ajmg12>3.0.co;2-u.pdf
10.1002\(sici)1096-8628(19970627)70:4<371::aid-ajmg8>3.0.co;2-w.pdf
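
A small Python sketch of how listings like these could be grouped by registrant code. It assumes the top-level folder of every member name is the DOI prefix, as in the examples above, and that names may still be percent-encoded before decoding. The percent-encoded sample name is my own reconstruction for illustration, not copied from an actual ZIP, and the function names are hypothetical.

```python
from collections import Counter
from urllib.parse import unquote


def doi_prefix(member_name: str) -> str:
    """Return the top-level folder of a ZIP member name (assumed DOI prefix)."""
    decoded = unquote(member_name)  # undo %xx URL encoding, if any
    # Accept both backslash and forward slash as the folder separator.
    return decoded.replace("\\", "/").split("/", 1)[0]


def prefix_counts(member_names) -> Counter:
    """Count how many files fall under each DOI prefix."""
    return Counter(doi_prefix(name) for name in member_names)


names = [
    r"10.1002\(sici)(1997)5:1<1::aid-nt1>3.0.co;2-8.pdf",
    # Hypothetical percent-encoded form of a name like the ones listed above.
    "10.1002/%28sici%291096-8628%2819960102%2961%3A1%3C65%3A%3Aaid-ajmg12%3E3.0.co%3B2-u.pdf",
]
print(prefix_counts(names))
```

Running prefix_counts over the listing of each ZIP would give a per-ZIP table of registrant codes, which is one step short of the per-journal list.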

The goal is to list all journals associated with each ZIP file so people don't have to download all 50TB to get what they want.

I'll need to investigate DOIs further to figure this out.
>>
>>8004752
doing g-ds work
>>
>>8006838
thx. some torrents contain tons of different journals (e.g. the first one) and others contain just a single journal (sm_00900000-00999999.torrent, ChemInform)

I have about the first million files indexed. I plan on laying everything out in thread #4.

If you're replying to this please sage as I am trying to start thread #4
>>
>>8007683
#3*
This is a 4chan archive - all of the content originated from them. If you need IP information for a Poster - you need to contact them. This website shows only archived content.