In less than nine months, the Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) killed over a million people, including >25,000 in New York City (NYC) alone. The COVID-19 pandemic caused by SARS-CoV-2 highlights clinical needs to detect infection, track strain evolution, and identify biomarkers of disease course. To address these challenges, we designed a fast (30-minute) colorimetric test (LAMP) for SARS-CoV-2 infection from naso/oropharyngeal swabs and a large-scale shotgun metatranscriptomics platform (total-RNA-seq) for host, viral, and microbial profiling. We applied these methods to clinical specimens gathered from 669 patients in New York City during the first two months of the outbreak, yielding a broad molecular portrait of the emerging COVID-19 disease. We find significant enrichment of a NYC-distinctive clade of the virus (20C), as well as host responses in interferon, ACE, hematological, and olfaction pathways. In addition, we use 50,821 patient records to find that renin-angiotensin-aldosterone system inhibitors have a protective effect for severe COVID-19 outcomes, unlike similar drugs. Finally, spatial transcriptomic data from COVID-19 patient autopsy tissues reveal distinct ACE2 expression loci, with macrophage and neutrophil infiltration in the lungs. These findings can inform public health and may help develop and drive SARS-CoV-2 diagnostic, prevention, and treatment strategies.
Pubmed ID: 33712587 RIS Download
Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.
Software for linkage and association modeling in pedigrees that uses a maximum likelihood model to extract information on genetic linkage and association from samples of unrelated individuals, sib pairs, trios and larger pedigrees (Li et al, 2005; Li et al, 2006). It provides estimates of genetic model parameters and powerful tests of association in settings where population stratification is not a concern.
View all literature mentionsSoftware that allows large scale neuron simulators to communicate during runtime. It allows exchange of data among parallel applications in a cluster environment, interconnects large-scale neuronal network simulators with each other or with other tools, participates in multi-simulations, and is continuously developed and extended. Three simulators currently have MUSIC interfaces: Moose, NEURON and NEST. Three applications execute in parallel while exchanging data via MUSIC. The software interface promotes interoperability by allowing models written for different simulators to be simulated together in a larger system. It enables re-usability of models or tools by providing a standard interface. As data are distributed over a number of processors, it is non-trivial to coordinate data transfer so that it reaches the correct destination at the correct time. Current and future simulators can make use of MUSIC - compliant general purpose tools and participate in multi-simulations, for example when: * Different parts of a complex nervous system model are optimally implemented in different simulators, and need to communicate with each other. * Post-processing of generated data is needed, where the amounts of data are too large for intermediate storage, and requires the simulator to pass the data directly to the post-processing module. A standard interface enables straight-forward independent third-party development and community sharing of interoperable software tools for parallel processing. * Library and utilities are written in C++, uses MPI. * It is possible to add a MUSIC interface to existing simulators. * Works independently, no assumptions are made about other applications to facilitate development of general purpose tools. * Performance Data transport with high bandwidth and low latency.
View all literature mentionsCollection of genome databases for vertebrates and other eukaryotic species with DNA and protein sequence search capabilities. Used to automatically annotate genome, integrate this annotation with other available biological data and make data publicly available via web. Ensembl tools include BLAST, BLAT, BioMart and the Variant Effect Predictor (VEP) for all supported species.
View all literature mentionsscikit-learn: machine learning in Python
View all literature mentionsNIH genetic sequence database that provides annotated collection of all publicly available DNA sequences for almost 280 000 formally described species (Jan 2014) .These sequences are obtained primarily through submissions from individual laboratories and batch submissions from large-scale sequencing projects, including whole-genome shotgun (WGS) and environmental sampling projects. Most submissions are made using web-based BankIt or standalone Sequin programs, and GenBank staff assigns accession numbers upon data receipt. It is part of International Nucleotide Sequence Database Collaboration and daily data exchange with European Nucleotide Archive (ENA) and DNA Data Bank of Japan (DDBJ) ensures worldwide coverage. GenBank is accessible through NCBI Entrez retrieval system, which integrates data from major DNA and protein sequence databases along with taxonomy, genome, mapping, protein structure and domain information, and biomedical journal literature via PubMed. BLAST provides sequence similarity searches of GenBank and other sequence databases. Complete bimonthly releases and daily updates of GenBank database are available by FTP.
View all literature mentionsSoftware package for interpreting gene expression data. Used for interpretation of a large-scale experiment by identifying pathways and processes.
View all literature mentionsCollection of curated, non-redundant genomic DNA, transcript RNA, and protein sequences produced by NCBI. Provides a reference for genome annotation, gene identification and characterization, mutation and polymorphism analysis, expression studies, and comparative analyses. Accessed through the Nucleotide and Protein databases.
View all literature mentionsSoftware performing alignment of high-throughput RNA-seq data. Aligns RNA-seq reads to reference genome using uncompressed suffix arrays.
View all literature mentionsSoftware package to comprehensively evaluate different aspects of RNA-seq experiments, such as sequence quality, GC bias, polymerase chain reaction bias, nucleotide composition bias, sequencing depth, strand specificity, coverage uniformity and read distribution over the genome structure. RSeQC takes both SAM and BAM files as input, which can be produced by most RNA-seq mapping tools as well as BED files, which are widely used for gene models.
View all literature mentionsSoftware repository for R packages related to analysis and comprehension of high throughput genomic data. Uses separate set of commands for installation of packages. Software project based on R programming language that provides tools for analysis and comprehension of high throughput genomic data.
View all literature mentionsA commercial organization which provides assay technologies to isolate DNA, RNA, and proteins from any biological sample. Assay technologies are then used to make specific target biomolecules, such as the DNA of a specific virus, visible for subsequent analysis.
View all literature mentionsA web-based software application that enables users to analyze, integrate, and understand data derived from gene expression, microRNA, and SNP microarrays, metabolomics, proteomics, and RNA-Seq experiments, and small-scale experiments that generate gene and chemical lists. Users can search for targeted information on genes, proteins, chemicals, and drugs, and build interactive models of experimental systems. IPA allows exploration of molecular, chemical, gene, protein and miRNA interactions, creation of custom molecular pathways, and the ability to view and modify metabolic, signaling, and toxicological canonical pathways. In addition to the networks and pathways that can be created, IPA can provide multiple layering of additional information, such as drugs, disease genes, expression data, cellular functions and processes, or a researchers own genes or chemicals of interest.
View all literature mentionsSoftware package for high-performance read alignment, quantification and mutation discovery.General purpose read aligner which can be used to map both genomic DNA-seq reads and RNA-seq reads. Subread aligner as fast, accurate and scalable read mapping by seed-and-vote.These programs were also implemented in Bioconductor R package Rsubread.
View all literature mentionsBioconductor software package for Empirical analysis of Digital Gene Expression data in R. Used for differential expression analysis of RNA-seq and digital gene expression data with biological replication.
View all literature mentionsA commercial antibody supplier which supplies primary and secondary antibodies, biochemicals, proteins, peptides, lysates, immunoassays and other kits.
View all literature mentionsQuality control software that perform checks on raw sequence data coming from high throughput sequencing pipelines. This software also provides a modular set of analyses which can give a quick impression of the quality of the data prior to further analysis.
View all literature mentionsOpen source software package for statistical programming language R to create plots based on grammar of graphics. Used for data visualization to break up graphs into semantic components such as scales and layers.
View all literature mentionsSoftware R package for multivariate analysis which takes into account different types of data structure. Data can be organized in groups of variable, groups of individuals, or into hierarchy of variables.
View all literature mentionsHuman and mouse genome annotation project which aims to identify all gene features in the human genome using computational analysis, manual annotation, and experimental validation.
View all literature mentionsData aggregate that compiles results from bioinformatics analyses across multiple samples into a single report. It is written in Python.
View all literature mentionsSoftware package for differential gene expression analysis based on the negative binomial distribution. Used for analyzing RNA-seq data for differential analysis of count data, using shrinkage estimation for dispersions and fold changes to improve stability and interpretability of estimates.
View all literature mentionsSoftware R package provides interface to the 'samtools', 'bcftools', and 'tabix' utilities for manipulating Sequence Alignment Map, FASTA, binary variant call and compressed indexed tab-delimited files.
View all literature mentions