MRC Human Genetics Unit
Medical Research Council Human Genetics Unit

Data management and sharing

How to access our data.

MRC HGU data

The MRC Human Genetics Unit aims to generate high quality data that should be a resource both within and outside our Unit. As such we expect our researchers to use best practice data management approaches and to share data  in a manner consistent with MRC guidelines. 

MRC HGU research aims to interpret the genome and its relationship to health and disease via a wide range of experimental and computational studies generating biological samples, and data in many formats. Each HGU project is expeected to have a dedicated data management plan appropriate to the type of data to be generated.

We have developed a general Unit data management plan which can be viewed here and is available as a template for our researchers to adapt to their specific needs: 

 

As outlined in this plan we guide our researchers to deposit  data and, where appropriate, samples and tools  in the most appropriate and accessible repository. The table below outlines our most frequently used repositories.

 

Data

 

    Asset type

                                                                       Means of sharing

                        Repository/ venue                     

Non-clinical genomic, transcriptomic, proteomic, other ‘omic' data

Via publication (including pre-publication and publication supplementary tables/ access via journal) and/or the most appropriate external repository for, at minimum, summary data. Data not fitting an established international repository will be deposited on University of Edinburgh DataShare Research Data Repository and given a Digital Object Identifier (DOI). Raw read data will be submitted to the Sequence Read Archive (SRA) or European Nucleotide Archive (ENA).

 

Gene Expression Omnibus (GEO) for sequencing, microarray, emRiboSeq, RNAseq,ChIP-seq, ATAC-seq, Hi-C, ProteomeExchange PRIDE database for M/S   data; MAVE DB database (Mutational effect/ mutational scanning data), MGA repository (genome annotation data e.g. ChIP-seq) University of Edinburgh DataShare.  ENA, SRA

 

Software

Labs routinely creating software use github to store, annotate and share versions

https://github.com/

Clinical study data

Personally identifiable genetic and phenotypic data generated by ourselves and (in most cases) consortia we participate in will be discoverable via publication (including pre-publication), and by listing in national/international directories (e.g. https://directory.biobankinguk.org/Profile/Biobank/GBR-1-65 ) and released on a Data Access Committee approval basis by MRC HGU or through the European Genome-phenome Archive (EGA), or the most appropriate external repository for, at minimum, summary data.

European Nucleotide Archive (ENA)

European Genome-phenome Archive (EGA) 

Dementias Platform UK (DPUK)

 

 

Reagents and models

 

       Asset type                                                                                    Means of sharing              Repository/ venue

Plasmids,  other constructs, libraries

Should have a material sheet explaining construction linked to a unique sample number, which reflects the published name of a plasmid, construct or library. Discovery via publication, dissemination (where allowed by any existing use agreements) via MRC PPU, Addgene or direct Data and Material Transfer Agreement (DMTA) from the University of Edinburgh

Addgene, MRC PPU, MRC HGU

Engineered cell models

How models are generated should be fully described in publication or supplementary materials. Discovery via publication, dissemination (where allowed by any existing use agreements) via license from or DMTA with the University of Edinburgh or deposit into an appropriate repository.

MRC HGU

Engineered zebrafish models

How models are generated should be fully described in publication or supplementary materials.

Discovery via publication, dissemination (where allowed by any existing use agreements) via license from or DMTA with the University of Edinburgh or deposit into an appropriate repository.

MRC HGU

Engineered mouse models

How models are generated should be fully described in publication or supplementary materials.

Discovery via publication, dissemination (where allowed by any existing use agreements) via license from or DMTA with the University of Edinburgh or deposit into an appropriate repository.

EMMA, MRC Harwell (sperm/embryos)

Antibodies

How antibodies were generated by HGU should be fully described in publication or via a supplementary data sheet. Discovery via publication, dissemination (where allowed by any existing use agreements) via MRC PPU, Addgene, licensing to a commercial distributor, or direct DMTA from the University of Edinburgh

Commercial entities, MRC PPU, MRC HGU

Unique biologic samples (DNA, RNA, protein)

Records of these should be kept and link to a sample reference. This should not include any information that would allow identification of an individual, in the case of clinical samples. Discovery via publication, dissemination (where allowed by appropriate consent in the case of clinical studies) via DMTA from MRC HGU

MRC HGU

Novel methodology

Descriptions of methodology should be fully described in local records/ SOPs/ lab books and included in publication (including pre-publication), online repositories

Journals, https://www.protocols.io/

 

If you are interested in accessing MRC Human Genetics Unit data, please contact the relevant group leader. If unsure who to contact please contact Helen Nickerson, Science and Strategy Manager.

 

MRC HGU Data management policy

MRC HGU Data sharing policy