[Rdap] I give up

Mike Smit Mike.Smit at dal.ca
Thu Jun 2 00:03:48 EDT 2016


GitHub was my first thought as well, but GitHub caps regular files at
under 100 MB, and files uploaded through its Large File Storage (LFS)
extension at 2 GB. I believe this is due to the challenge of versioning
large files rather than to storage limits, so one could try uploading 1000
1 GB files. I suspect this would attract GitHub's attention, and not in a
positive way. (git as a tool requires roughly as much free storage space
as the data itself occupies, so 2 TB of disk would be required: about
$60/month at market cloud rates.)
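
For anyone who wants to check the arithmetic, here is a rough Python
sketch. The sizes and the $60/month figure come from this thread; the
decimal units (1 TB = 1000 GB) and the function names are my own
assumptions, so treat it as a back-of-envelope check rather than a quote
from any provider.

```python
# Back-of-envelope check of the figures in this message.
# Assumption: decimal units, i.e. 1 TB = 1000 GB.

DATASET_GB = 1000        # 1 TB uncompressed, per the original request
CHUNK_GB = 1             # the "1000 1 GB files" idea
DISK_GB = 2000           # working copy plus repository data, about 2x the dataset
MONTHLY_COST_USD = 60    # monthly disk cost quoted in this message


def chunk_count(total_gb: int, chunk_gb: int) -> int:
    """Number of chunks needed to hold total_gb, rounding up."""
    return -(-total_gb // chunk_gb)


def implied_rate(monthly_cost_usd: float, disk_gb: float) -> float:
    """Storage price in USD per GB per month implied by a monthly bill."""
    return monthly_cost_usd / disk_gb


if __name__ == "__main__":
    print("chunks:", chunk_count(DATASET_GB, CHUNK_GB))
    print("USD per GB-month:", implied_rate(MONTHLY_COST_USD, DISK_GB))
```

In other words, the $60/month figure corresponds to roughly three cents
per GB per month.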

I would suggest that for data generated by software, the "right"
approach is to release the software, ideally open-source, with appropriate
documentation for regenerating the data. Unless the data generation time
is measured in weeks, months, or years, that would require fewer
resources over time than storing and serving large data files. CPU time is
cheap compared to data transfer prices.
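
To make that trade-off concrete, here is a minimal Python sketch
comparing the two options. Every number in the example call is a
hypothetical placeholder, not a measured value or a quoted price (only
the storage rate is loosely based on the disk figure above), and the
cost model itself is a simplifying assumption: hosting pays for storage
per month plus transfer per download, while regeneration pays for CPU
time once per person who wants the data.

```python
# Rough cost comparison: host the generated data vs. let each requester
# regenerate it from the released software.
# All rates and sizes passed in below are hypothetical placeholders.


def hosting_cost(size_gb: float, months: int, downloads: int,
                 storage_usd_per_gb_month: float,
                 transfer_usd_per_gb: float) -> float:
    """Storage for the whole period plus transfer for every download."""
    storage = size_gb * storage_usd_per_gb_month * months
    transfer = size_gb * transfer_usd_per_gb * downloads
    return storage + transfer


def regeneration_cost(cpu_hours: float, usd_per_cpu_hour: float,
                      requesters: int) -> float:
    """CPU cost if every requester reruns the generator once."""
    return cpu_hours * usd_per_cpu_hour * requesters


if __name__ == "__main__":
    # Hypothetical scenario: 200 GB compressed, kept for one year,
    # requested by 10 people.
    host = hosting_cost(size_gb=200, months=12, downloads=10,
                        storage_usd_per_gb_month=0.03,
                        transfer_usd_per_gb=0.09)
    regen = regeneration_cost(cpu_hours=100, usd_per_cpu_hour=0.05,
                              requesters=10)
    print(f"hosting: ${host:.2f}  regeneration: ${regen:.2f}")
```

Which option wins depends entirely on the inputs: a generator that runs
for months, or a dataset that few people request, shifts the balance
back toward hosting.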

Computer science is a pretty broad area - there are repositories, but they
are more focused than the discipline (e.g. machine learning repositories).

Has anyone built a data repository that distributes files using BitTorrent?
That's the kind of thing a computer scientist would get excited about.

Cheers,

Mike


----------------------------------------------------
Dr. Mike Smit
Assistant Professor
School of Information Management
Faculty of Management
Dalhousie University, Halifax, NS
Mike.Smit at dal.ca  //  902-494-1901
----------------------------------------------------




On Wednesday, 1 June 2016, Daureen Nesdill <daureen.nesdill at utah.edu> wrote:

> Hmmm. The student said he had checked GitHub and the 1 TB is an issue.
>
> I finally went to IEEE looking for information and learned about Tera
> Promise http://openscience.us/repo/
> and other open science possibilities.
> http://openscience.us/other/index.html
>
> Thanks for the reminder about export control restrictions.
>
> Daureen
>
> *From:* Rdap [rdap-bounces at asis.org] on behalf of Pouchard, Line C [
> pouchard at purdue.edu]
> *Sent:* Wednesday, June 01, 2016 5:31 PM
> *To:* Research Data, Access and Preservation
> *Subject:* Re: [Rdap] I give up
>
> Daureen:
>
> Has the student tried GitHub?  It has a publication workflow where the
> data is uploaded to Zenodo and acquires a DOI.
>
> It's not exactly a computer science repository but it might work for his
> purposes.
>
> As Zenodo is in the EU, there might be export control restrictions on
> cyber security data.
>
> Line
>
> Sent from my iPhone
> Line Pouchard, PhD
> Purdue University Libraries
>
> On Jun 1, 2016, at 7:14 PM, Daureen Nesdill <daureen.nesdill at utah.edu>
> wrote:
>
> Hi,
>
> I have a graduate student in computer science asking me to find him a
> data repository for his research data. The article has already been
> published and folks are asking to see the data. The data is in a text
> format and 1 terabyte uncompressed; compressed it is 100-200 GB. He
> developed software that used other software to generate the data. It is a
> test of security issues with Android devices. He tells me his colleagues
> are waiting to see where he puts his data because no one knows what to do.
> This project is NSF supported.
>
> I’ve looked through re3data and conducted a few Google searches with no
> luck. Does anyone have any information about data repositories for computer
> science?
>
> Thanks for any assistance,
>
> Daureen
>
> Daureen Nesdill, MS, MLIS
>
> Research Data Management Librarian
>
> The Faculty Center @ the J. W. Marriott Library
>
> University of Utah
>
> 801-585-5975
>
> daureen.nesdill at utah.edu
>
> ORCID http://orcid.org/0000-0003-0126-5038
>
> _______________________________________________
> Rdap mailing list
> Rdap at mail.asis.org
> http://mail.asis.org/mailman/listinfo/rdap
>
>