Skip to Main Content

Research Data Management: What is RDM?

This guide provides a comprehensive overview of Research Data Management (RDM) best practices, resources, and services available at the University of Waterloo.

What is RDM decorative banner

What is Research Data Management?

Research data management (RDM) covers practices associated with the collection, documentation, storage, sharing, and preservation of all research data. It involves the active organization and maintenance of data throughout the entire research process which includes the periods before, during, and after the active research phase. RDM eases the facilitation of the research process during active research, while also promoting greater data accessibility and supporting research excellence.

The FAIR Guiding Principles for Data Management and Data Stewardship

Research data management strives for data that meets the FAIR Guiding Principles, which means that data that is Findable, Accessible, Interoperable, and Reusable. In practice, this means asking a series of questions about your research data:

  • Can I find the data?
  • If I can find the data, is it accessible?
  • If it's accessible, can I open the data file?
  • If I can open the data file, can I understand the data well enough to reuse it? Does the data have a license that allows for reuse?

What Are Research Data?

A common misconception is that research data are limited to the sciences. In reality, they span all academic disciplines but vary widely depending on the project, domain and context. Research data can include spreadsheets and numbers, text, videos, images, historical artifacts, code, and more. See formal definition below.

Research data are also not a mere by-product or a steppingstone to publication. They often outlast the outputs they support. They serve as evidence to validate and verify findings and can be reused for future research, teaching, and other purposes. 

That’s why research data deserve to be treated as first-class research outputs and why RDM is essential. 


Types of Research Data 

Research data can be classified in different ways, for example based on their: 

  • Content: numerical, textual, audiovisual, multimedia, etc. 

  • Format: spreadsheets, databases, images, maps, audio files, (un)structured text, etc. 

  • Mode of data collection: experimental, observational, simulation, derived/compiled from other sources 

  • Digital (born-digital or digitized) or non-digital  (analogue) nature (e.g. paper surveys, notes…) 

  • Primary (generated by the researcher for a particular research purpose or project) or secondary nature (originally created by someone else for another purpose) 

  • Stage in the research data lifecycle: raw (directly collected or generated), cleaned (errors corrected, standardized, de-identified), or processed (transformed or analyzed) 

Data Related to Research

In addition to managing the research data itself, RDM also involves managing documentation that ensures data can be understood and reused effectively. 

Examples of what to document include: 

  • Methodology and protocols: Descriptions of research methods, procedures, and workflows 

  • Data structure and organization: Dataset structures, database schemas, data dictionaries, and codebooks 

  • User support materials: User guides, notebooks, and questionnaires or interview schedules 

  • Technical specifications: Equipment settings and instrument calibration details 

  • Contextual references: Links to related datasets, sources, software, or code 

UWaterloo researchers should include documentation in a README file

Definitions

Data that are used as primary sources to support technical or scientific enquiry, research, scholarship, or artistic activity, and that are used as evidence in the research process and/or are commonly accepted in the research community as necessary to validate research findings and results. All digital and non-digital content has the potential of becoming research data. 

Source: CODATA Research Data Management Terminology 

Data are facts, measurements, recordings, records, or observations collected by researchers and others, with a minimum of contextual interpretation. Data may be in any format or medium taking the form of text, numbers, symbols, images, films, video, sound recordings, pictorial reproductions, drawings, designs or other graphical representations, procedural manuals, forms, diagrams, workflows, equipment descriptions, data files, data processing algorithms, software, programming languages, code, or statistical records. 

Source: Tri-Agency Research Data Management Policy - Frequently Asked Questions 

Research materials serve as the object of an investigation, whether scientific, scholarly, literary or artistic, and are used to create research data. Research materials are transformed into data through method or practice.  

Source: Tri-Agency Research Data Management Policy - Frequently Asked Questions