Difference between revisions of "Background and History"

From ECRIN-MDR Wiki
Jump to navigation Jump to search
(Initial Planning, 2016-2017)
Line 2: Line 2:
 
===Initial Planning, 2016-2017===
 
===Initial Planning, 2016-2017===
 
Beginning in 2016, ECRIN's work within the H2020 [https://www.corbel-project.eu/home.html, '''CORBEL'''] project, in particular the leadership of a group looking at 'data sharing' issues within clinical research, highlighted the need to improve the FAIRness of clinical research data. It became clear that if researchers made more and more data objects available to others, as they were being encouraged to do, those objects would often be in a wide variety of places and available under a wide range of conditions. Even discovering where the various data objects associated with a study were located might become difficult and time-consuming, and therefore costly, and once found there would be the additional problem of understanding how to access them - because many such objects would only be available under controlled access. The concept of a 'metadata repository', that could bring all this discoverability, access and provenance (DAP) metadata together, evolved out of these concerns.<br/>
 
Beginning in 2016, ECRIN's work within the H2020 [https://www.corbel-project.eu/home.html, '''CORBEL'''] project, in particular the leadership of a group looking at 'data sharing' issues within clinical research, highlighted the need to improve the FAIRness of clinical research data. It became clear that if researchers made more and more data objects available to others, as they were being encouraged to do, those objects would often be in a wide variety of places and available under a wide range of conditions. Even discovering where the various data objects associated with a study were located might become difficult and time-consuming, and therefore costly, and once found there would be the additional problem of understanding how to access them - because many such objects would only be available under controlled access. The concept of a 'metadata repository', that could bring all this discoverability, access and provenance (DAP) metadata together, evolved out of these concerns.<br/>
The initial task was seen as the creation of a metadata schema that focused on the required discoverability, access and provenance data points. The first version of such a schema <ref>Canham, S., Ohmann, C. A metadata schema for data objects in clinical research. Trials 17, 557 (2016). https://doi.org/10.1186/s13063-016-1686-5</ref> was published in late 2016.
+
The initial task was seen as the creation of a metadata schema that focused on the required discoverability, access and provenance data points. The first version of such a schema <ref>Canham, S., Ohmann, C. A metadata schema for data objects in clinical research. Trials 17, 557 (2016). https://doi.org/10.1186/s13063-016-1686-5</ref> was published in late 2016. In fact that metadata schema (now at version 5) has evolved into a combination of two schemas, one for studies and the other for the associated data objects. The first is based on a subset of the data points within the ClinicalTrials.gov trial registry (by far the largest trial registry in the world) and the second is based on DataCite. Two separate schemas are necessary because the relationship between studies and data objects in clinical research is many-to-many. It is therefore necessary to store study details and data object details separately, with a separate 'link' table indicating which data objects are associated with which study.
 
 
===The XDC project and the pilot MDR, 2018-2020===
 
 
<br/><br/>
 
<br/><br/>
 +
===The XDC project and the pilot MDR, 2017-2020===
 +
The opportunity to actually build a demonstrator MDR came in 2017, when the H2020 project Extreme Data Cloud (XDC) was developed, with the MDR as one of the proposed use cases. This project focused on developing services for very large or very heterogeneous data sets.
  
  
 
+
<br/><br/>
 
===The European Open Science Cloud and the MDR, 2020 onwards ===
 
===The European Open Science Cloud and the MDR, 2020 onwards ===
  

Revision as of 14:42, 28 October 2020

Initial Planning, 2016-2017

Beginning in 2016, ECRIN's work within the H2020 CORBEL project, in particular the leadership of a group looking at 'data sharing' issues within clinical research, highlighted the need to improve the FAIRness of clinical research data. It became clear that if researchers made more and more data objects available to others, as they were being encouraged to do, those objects would often be in a wide variety of places and available under a wide range of conditions. Even discovering where the various data objects associated with a study were located might become difficult and time-consuming, and therefore costly, and once found there would be the additional problem of understanding how to access them - because many such objects would only be available under controlled access. The concept of a 'metadata repository', that could bring all this discoverability, access and provenance (DAP) metadata together, evolved out of these concerns.
The initial task was seen as the creation of a metadata schema that focused on the required discoverability, access and provenance data points. The first version of such a schema [1] was published in late 2016. In fact that metadata schema (now at version 5) has evolved into a combination of two schemas, one for studies and the other for the associated data objects. The first is based on a subset of the data points within the ClinicalTrials.gov trial registry (by far the largest trial registry in the world) and the second is based on DataCite. Two separate schemas are necessary because the relationship between studies and data objects in clinical research is many-to-many. It is therefore necessary to store study details and data object details separately, with a separate 'link' table indicating which data objects are associated with which study.

The XDC project and the pilot MDR, 2017-2020

The opportunity to actually build a demonstrator MDR came in 2017, when the H2020 project Extreme Data Cloud (XDC) was developed, with the MDR as one of the proposed use cases. This project focused on developing services for very large or very heterogeneous data sets.




The European Open Science Cloud and the MDR, 2020 onwards



The current progress of the project within EOSC life is tabulated in Progress (EOSC Life)

Notes

  1. Jump up Canham, S., Ohmann, C. A metadata schema for data objects in clinical research. Trials 17, 557 (2016). https://doi.org/10.1186/s13063-016-1686-5