Principles of metadata organization at the ENCODE data coordination center.

Database : the journal of biological databases and curation | 2016

The Encyclopedia of DNA Elements (ENCODE) Data Coordinating Center (DCC) is responsible for organizing, describing and providing access to the diverse data generated by the ENCODE project. The description of these data, known as metadata, includes the biological sample used as input, the protocols and assays performed on these samples, the data files generated from the results and the computational methods used to analyze the data. Here, we outline the principles and philosophy used to define the ENCODE metadata in order to create a metadata standard that can be applied to diverse assays and multiple genomic projects. In addition, we present how the data are validated and used by the ENCODE DCC in creating the ENCODE Portal (https://www.encodeproject.org/). Database URL: www.encodeproject.org.

PubMed ID: 26980513

Research resources used in this publication

None found

Antibodies used in this publication

None found

Associated grants

  • Agency: NHGRI NIH HHS, United States
    Id: U41 HG006992

Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.

This is a list of tools and resources that we have found mentioned in this publication.


MINSEQE (tool)

RRID:SCR_003221

The Minimum Information about a high-throughput nucleotide SEQuencing Experiment that is needed to enable the unambiguous interpretation and facilitate reproduction of the results of the experiment. By analogy to the MIAME guidelines for microarray experiments, adherence to the MINSEQE guidelines will improve integration of multiple experiments across different modalities, thereby maximising the value of high-throughput research. The five elements of experimental description considered essential when making data available supporting published high-throughput sequencing experiments are as follows:

  • The description of the biological system, samples, and the experimental variables being studied
  • The sequence read data for each assay
  • The "final" processed (or summary) data for the set of assays in the study
  • General information about the experiment and sample-data relationships
  • Essential experimental and data processing protocols


FigShare (tool)

RRID:SCR_004328

Repository that makes data, figures, theses, publications, posters, presentations, filesets, videos, datasets, and negative data citable, shareable, and discoverable through Digital Object Identifiers. Users can upload files in any format, which are rendered for viewing in the browser, so that figures, datasets, media, papers, posters, presentations, and filesets can be disseminated in a way that the current scholarly publishing model does not allow. Integrates with ORCID and Symplectic Elements, can import items from GitHub, and is a source tracked by Altmetric.com. Figshare gives users unlimited public space and 1 GB of private storage space for free. Data are digitally preserved by CLOCKSS. Supported by Digital Science, a division of Macmillan Publishers Limited, as a community-based, open science project that retains its autonomy.


Dryad Digital Repository (tool)

RRID:SCR_005910

International, curated digital repository that makes the data underlying scientific publications discoverable, freely reusable, and citable, particularly data for which no specialized repository exists. Provides the infrastructure for, and promotes the re-use of, data underlying the scholarly literature. Governed by a nonprofit membership organization. Membership is open to any stakeholder organization, including but not limited to journals, scientific societies, publishers, research institutions, libraries, and funding organizations. Most data are associated with peer-reviewed articles, although data associated with non-peer-reviewed publications from reputable academic sources, such as dissertations, are also accepted. The data are used to validate published findings, explore new analysis methodologies, repurpose data for research questions unanticipated by the original authors, and perform synthetic studies. The UC system is a member organization of Dryad, a general subject data repository.
