CCL: Cloud backup in computational chemistry



I highly recommend Backblaze / B2 storage and have used them for some international collaborations:
https://www.backblaze.com/b2/cloud-storage-pricing.html

For some of these services, you can arrange a drive to upload via mail / FedEx.

The "rclone" program is also useful for transferring / sync of files and works with multiple cloud services:
https://rclone.org

Hope that helps,
-Geoff

---
Prof. Geoffrey Hutchison
Department of Chemistry
University of Pittsburgh
tel: (412) 648-0492
twitter: ~~ ghutchis


On Feb 28, 2022, at 4:48 PM, Andrew DeYoung andrewdaviddeyoung#%#gmail.com <owner-chemistry ~~ ccl.net> wrote:

Hi, 

I have ~1.5 TB of analysis data on my home computer related to my PhD work, which I completed last year.  I would like to put this data in long-term storage in the cloud.  I will retain a backup on an external hard drive, which I will keep at home.  But I'm looking for an additional backup in another location (the other location being, ideally, the cloud).  These files are not themselves MD trajectories from my PhD, but rather are output and analyses related to the trajectories.  (The trajectories themselves are tens of TB and are being retained, for now, by my university.)

Does anyone have experience with archiving scientific data on cloud storage services such as AWS S3, AWS S3 Glacier, Microsoft Azure, or iDrive?  

I realize this is the Computational Chemistry List, not the Information Technology List, but I'd be grateful for any insight you may have from the perspective of a computational chemist.

I will be paying for this myself, so the service needs to be reasonably priced for a single user for ~5-10 TB or less.  It's not immediately clear to me whether all of these services are really available to individuals.

I don't need fancy features like automatic backup, delta copying, and so forth; I just need a place to safely store some files.  I use Linux for running computation, but I'm not an IT expert by any means; I am willing to learn, though, especially when good documentation is available!  And, clearly, transferring ~1.5 TB to the cloud is itself a nontrivial task, but I'm willing to see what the options are.

Here are some links to some seemingly popular services in my country (US), but perhaps there are others more particularly suited for scientific data:


Thank you for your time!

Best,
Andrew

Andrew DeYoung, PhD
Department of Chemistry
Carnegie Mellon University