Data Management

A data management plan (DMP) will help you manage your data, meet funder requirements, and help others use your data if shared.

Undergraduate Datasets

For undergraduate research datasets see Student Dataset Instructions and the Undergraduate Research Datasets collection.

Using the DMPTool

DMPTool is an excellent place to start. This comprehensive, easy-to-use, open-source tool guides you through a developing a data management plan tailored to your needs that you can easily export. You'll create an ID and password, so your plan will stay private. Step by step instructions let you target your plan for a particular granting body. You can also browse an index page with sample plans, templates, and guidance.

Dive right in with the DMPTool Quickstart guide, or watch a 90 second demo video for an introduction.

Writing your plan

Alternatively, you can use the questions below and any specific data management requirements from your funding agency to write your data management plan. Additional resources for creating plans are also provided below.

  1. Project, experiment, and data description
    • What’s the purpose of the research?
    • What is the data? How and in what format will the data be collected? Is it numerical data, image data, text sequences, or modeling data?
    • How much data will be generated for this research?
    • How long will the data be collected and how often will it change?
    • Are you using data that someone else produced? If so, where is it from?
    • Who is responsible for managing the data? Who will ensure that the data management plan is carried out?
  2. Documentation, organization, and storage
    • What documentation will you be creating in order to make the data understandable by other researchers?
    • Are you using metadata that is standard to your field? How will the metadata be managed and stored?
    • What file formats will be used? Do these formats conform to an open standard and/or are they proprietary?
    • Are you using a file format that is standard to your field? If not, how will you document the alternative you are using?
    • What directory and file naming convention will be used?
    • What are your local storage and backup procedures? Will this data require secure storage?
    • What tools or software are required to read or view the data?
  3. Access, sharing, and re-use
    • Who has the right to manage this data? Is it the responsibility of the PI, student, lab, MIT, or funding agency?
    • What data will be shared, when, and how?
    • Does sharing the data raise privacy, ethical, or confidentiality concerns?  Do you have a plan to protect or anonymize data, if needed?
    • Who holds intellectual property rights for the data and other information created by the project? Will any copyrighted or licensed material be used? Do you have permission to use/disseminate this material?
    • Are there any patent- or technology-licensing-related restrictions on data sharing associated with this grant?
    • Will this research be published in a journal that requires the underlying data to accompany articles?
    • Will there be any embargoes on the data?
    • Will you permit re-use, redistribution, or the creation of new tools, services, data sets, or products (derivatives)? Will commercial use be allowed?
  4. Archiving
    • How will you be archiving the data? Will you be storing it in an archive or repository for long-term access? If not, how will you preserve access to the data?
    • Is a discipline-specific repository available? If not, you could consider depositing your data into the Academic Commons. Contact us if you’re interested in using the Academic Commons to store your data.
    • How will you prepare data for preservation or data sharing? Will the data need to be anonymized or converted to more stable file formats?
    • Are software or tools needed to use the data? Will these be archived?
    • How long should the data be retained? 3-5 years, 10 years, or forever?

Resources for creating plans