@boyter
A rather lengthy blog post: boyter.org/posts/an-infor…
Processing 40 TB of code from ~10 million projects with a dedicated server and Go for $100

boyter
@boyter
2. lis
boyter.org/posts/an-infor… A kind soul is offering the code for download and you can get it here.

Lorenz Brun
@lorenzbrun
1. lis
I could host the 82GB tar.gz file on some spare bandwidth (2GBps unmetered) and would also be interested in doing some further tests with it. Could you give it to me?

boyter
@boyter
1. lis
I’ll DM you details when I wake up tomorrow morning. Cheers.

Rijnard van Tonder
@rvtond
1. lis
This is a really neat study, thanks for sharing! You might consider submitting a short data showcase paper on this to @msrconf. Either way I'd recommend putting up the ~80GB data set on @ZENODO_ORG (free archival and great for visibility/dissemination).

boyter
@boyter
1. lis
You really think so? I’m happy to do so, though. As for hosting, I have a few offers and the code should go up soon.

Elliott Spira
@ElliottSpira
1. lis
How many people commit their node_modules? How much storage space does that many "jquery" files equate to? So many questions! Interesting research @boyter - thanks for writing this up 🙂

boyter
@boyter
1. lis
Oh, those are both good questions! Should be easy to do too. I’ll see about adding them in.