
Precision Public Health: Data Repositories

One-stop shop for data and datasets for precision public health research

What is a data repository?

A data repository is a shared space where researchers store, share, and maintain the datasets associated with their research. Below are repositories where researchers can find datasets to use in their own research.

Storage Best Practices

Always store an uncorrected master version of the data file (the original data set):

  • Do not make any corrections to this file
  • Make corrections to a copy, using a scripting language where possible, so that all corrections are documented
  • Consider making your original data file read-only
  • Limit access to this file
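
The practices above can be sketched in Python. This is only an illustration under stated assumptions, not a prescribed workflow: it assumes the master file is a CSV with an identifier column (called `id` here), and the function names `make_read_only` and `apply_corrections` are hypothetical. The master file is marked read-only and is only ever read; every correction lives in the script and is written to a separate corrected copy.

```python
import csv
import stat
from pathlib import Path


def make_read_only(path: Path) -> None:
    """Remove write permission from the file for owner, group, and others."""
    mode = path.stat().st_mode
    path.chmod(mode & ~(stat.S_IWUSR | stat.S_IWGRP | stat.S_IWOTH))


def apply_corrections(master: Path, corrected: Path, corrections: dict,
                      id_column: str = "id") -> int:
    """Write a corrected copy of `master`; the master itself is only read.

    `corrections` maps (row identifier, column name) to the corrected value.
    The `id` column name is an assumption made for this sketch.
    Returns the number of corrections applied.
    """
    applied = 0
    with master.open(newline="") as src, corrected.open("w", newline="") as dst:
        reader = csv.DictReader(src)
        writer = csv.DictWriter(dst, fieldnames=reader.fieldnames)
        writer.writeheader()
        for row in reader:
            for column in reader.fieldnames:
                key = (row[id_column], column)
                if key in corrections:
                    row[column] = corrections[key]
                    applied += 1
            writer.writerow(row)
    return applied
```

For example, `apply_corrections(Path("master.csv"), Path("corrected.csv"), {("2", "age"): "5"})` would leave `master.csv` untouched and record the single change in `corrected.csv`. Because the corrections dictionary is kept in the script, it doubles as documentation of what was changed.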

Whenever possible, store your data on institutional resources.

Regular backups protect against accidental data loss, which may occur because of:

  • hardware failure
  • software or media faults
  • virus infection or malicious hacking
  • power failure
  • human error
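
As a rough illustration of a backup step, the following Python sketch copies a file to a backup directory and then compares SHA-256 checksums to confirm that the copy matches the source. The function names `sha256_of` and `back_up` are hypothetical, and the backup location is an assumption; in practice it should be on separate, preferably institutional, storage rather than the same disk.

```python
import hashlib
import shutil
from pathlib import Path


def sha256_of(path: Path) -> str:
    """Return the SHA-256 checksum of a file, read in 1 MiB chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(1024 * 1024), b""):
            digest.update(chunk)
    return digest.hexdigest()


def back_up(source: Path, backup_dir: Path) -> Path:
    """Copy `source` into `backup_dir` and verify the copy by checksum."""
    backup_dir.mkdir(parents=True, exist_ok=True)
    target = backup_dir / source.name
    shutil.copy2(source, target)  # copy2 also preserves file timestamps
    if sha256_of(source) != sha256_of(target):
        raise OSError(f"Backup of {source} failed checksum verification")
    return target
```

Running a step like this on a schedule, and keeping the recorded checksums, also makes it possible to detect later corruption of either copy.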

See additional best practices in data storage on the DataONE site.

HiPerGator use with PHI and FERPA data


HiPerGator, the University of Florida supercomputer, is a cluster that includes the latest generation of processors and offers nodes for memory-intensive computation. Its high-performance storage systems can be accessed through several interfaces, including Globus, UFApps for Research, and other tools.

By default, HiPerGator is approved for storing and processing open data. To work with Protected Health Information (PHI) or with data restricted under the Family Educational Rights and Privacy Act (FERPA), follow the policies and procedures described here for PHI and for FERPA data. The process requires submitting a request.

UF Repositories

Find a Repository


Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.