Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

Integrated Transcriptomic and Proteomic Analysis of Primary Human Umbilical Vein Endothelial Cells.

Proteomics | 2019

Understanding the molecular profile of every human cell type is essential for understanding its role in normal physiology and disease. Technological advancements in DNA sequencing, mass spectrometry, and computational methods allow us to carry out multiomics analyses although such approaches are not routine yet. Human umbilical vein endothelial cells (HUVECs) are a widely used model system to study pathological and physiological processes associated with the cardiovascular system. In this study, next-generation sequencing and high-resolution mass spectrometry to profile the transcriptome and proteome of primary HUVECs is employed. Analysis of 145 million paired-end reads from next-generation sequencing confirmed expression of 12 186 protein-coding genes (FPKM ≥0.1), 439 novel long non-coding RNAs, and revealed 6089 novel isoforms that were not annotated in GENCODE. Proteomics analysis identifies 6477 proteins including confirmation of N-termini for 1091 proteins, isoforms for 149 proteins, and 1034 phosphosites. A database search to specifically identify other post-translational modifications provide evidence for a number of modification sites on 117 proteins which include ubiquitylation, lysine acetylation, and mono-, di- and tri-methylation events. Evidence for 11 "missing proteins," which are proteins for which there was insufficient or no protein level evidence, is provided. Peptides supporting missing protein and novel events are validated by comparison of MS/MS fragmentation patterns with synthetic peptides. Finally, 245 variant peptides derived from 207 expressed proteins in addition to alternate translational start sites for seven proteins and evidence for novel proteoforms for five proteins resulting from alternative splicing are identified. Overall, it is believed that the integrated approach employed in this study is widely applicable to study any primary cell type for deeper molecular characterization.

Pubmed ID: 30983154 RIS Download

Associated grants

  • Agency: NIH HHS, United States
    Id: P50NS038377
  • Agency: NCI NIH HHS, United States
    Id: R01 CA184165
  • Agency: NIGMS NIH HHS, United States
    Id: R01 GM124531
  • Agency: NINDS NIH HHS, United States
    Id: P50 NS038377
  • Agency: NIH HHS, United States
    Id: R01GM124531
  • Agency: NIH HHS, United States
    Id: S10OD021844
  • Agency: NCI NIH HHS, United States
    Id: U24 CA210985
  • Agency: NIH HHS, United States
    Id: S10 OD021844
  • Agency: DBT-Wellcome Trust India Alliance, India
    Id: IA/M/15/1/502023
  • Agency: NIH HHS, United States
    Id: R01CA184165
  • Agency: Wellcome Trust, United Kingdom
  • Agency: NIH HHS, United States
    Id: U24CA210985

Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.

This is a list of tools and resources that we have found mentioned in this publication.


CASAVA (tool)

RRID:SCR_001802

Software package that creates genomic builds, calls SNPs, detects indels, and counts reads from data generated from one or more sequencing runs. In addition, CASAVA automatically generates a range of statistics, such as mean depth and percentage chromosome coverage, to enable comparison with previous builds or other samples. CASAVA analyzes sequencing reads in three stages: * FASTQ file generation and demultiplexing * Alignment to a reference genome * Variant detection and counting

View all literature mentions

GATK (tool)

RRID:SCR_001876

A software package to analyze next-generation resequencing data. The toolkit offers a wide variety of tools, with a primary focus on variant discovery and genotyping as well as strong emphasis on data quality assurance. Its robust architecture, powerful processing engine and high-performance computing features make it capable of taking on projects of any size. This software library makes writing efficient analysis tools using next-generation sequencing data very easy, and second it's a suite of tools for working with human medical resequencing projects such as 1000 Genomes and The Cancer Genome Atlas. These tools include things like a depth of coverage analyzers, a quality score recalibrator, a SNP/indel caller and a local realigner. (entry from Genetic Analysis Software)

View all literature mentions

SAMTOOLS (tool)

RRID:SCR_002105

Original SAMTOOLS package has been split into three separate repositories including Samtools, BCFtools and HTSlib. Samtools for manipulating next generation sequencing data used for reading, writing, editing, indexing,viewing nucleotide alignments in SAM,BAM,CRAM format. BCFtools used for reading, writing BCF2,VCF, gVCF files and calling, filtering, summarising SNP and short indel sequence variants. HTSlib used for reading, writing high throughput sequencing data.

View all literature mentions

RSeQC (tool)

RRID:SCR_005275

Software package to comprehensively evaluate different aspects of RNA-seq experiments, such as sequence quality, GC bias, polymerase chain reaction bias, nucleotide composition bias, sequencing depth, strand specificity, coverage uniformity and read distribution over the genome structure. RSeQC takes both SAM and BAM files as input, which can be produced by most RNA-seq mapping tools as well as BED files, which are widely used for gene models.

View all literature mentions

Picard (tool)

RRID:SCR_006525

Java toolset for working with next generation sequencing data in the BAM format.

View all literature mentions

Program to Reduce Incontinence by Diet and Exercise (tool)

RRID:SCR_009018

Randomized controlled trial being conducted at two clinical centers in the United States to learn more about the effects of weight loss on urinary incontinence. About 330 overweight women aged 30 or older will participate and will be followed for 18 months. Efficacy of weight reduction as a treatment for urinary incontinence will be examined at 6 months following the intensive weight control program, and the sustained impact of the intervention will be examined at 18 months. To increase the maintenance of weight reduction and facilitate evaluation of the enduring impact of weight loss on urinary incontinence, they propose to study a motivation-based weight maintenance program. At the end of the intensive weight control program, women randomized to the weight loss program will be randomized to either a 12-month skill-based maintenance intervention or to a motivation-based maintenance intervention. The maintenance interventions maximize the potential for sustained weight loss and will allow them to determine if long-term weight reduction will produce continued improvement in urinary incontinence.

View all literature mentions

FastQC (tool)

RRID:SCR_014583

Quality control software that perform checks on raw sequence data coming from high throughput sequencing pipelines. This software also provides a modular set of analyses which can give a quick impression of the quality of the data prior to further analysis.

View all literature mentions

GENCODE (tool)

RRID:SCR_014966

Human and mouse genome annotation project which aims to identify all gene features in the human genome using computational analysis, manual annotation, and experimental validation.

View all literature mentions

bcl2fastq (tool)

RRID:SCR_015058

Conversion software that both demultiplexes data and converts BCL files generated by Illumina sequencing systems to standard FASTQ file formats for downstream analysis.

View all literature mentions

HISAT2 (tool)

RRID:SCR_015530

Graph-based alignment of next generation sequencing reads to a population of genomes.

View all literature mentions

StringTie (tool)

RRID:SCR_016323

Software application for assembling of RNA-Seq alignments into potential transcripts. It enables improved reconstruction of a transcriptome from RNA-seq reads. This transcript assembling and quantification program is implemented in C++ .

View all literature mentions

BioAnalyzer 2100 (tool)

RRID:SCR_019715

2100 Bioanalyzer system is an established automated electrophoresis tool for the sample quality control of biomolecules. The 2100 Bioanalyzer instrument, together with the 2100 Expert Software and Bioanalyzer assays, provide highly precise analytical evaluation of various samples types in many workflows, including next generation sequencing (NGS), gene expression, biopharmaceutical, and gene editing research. Digital data is provided in a timely manner and delivers objective assessment of sizing, quantitation, integrity and purity from DNA, RNA, and proteins. Minimal sample volumes are required for an accurate result, and the data may be exported in a many different formats for ease-of-use.

View all literature mentions

HUVEC-C (tool)

RRID:CVCL_2959

Cell line HUVEC-C is a Finite cell line with a species of origin Homo sapiens

View all literature mentions

HUVEC-C (tool)

RRID:CVCL_2959

Cell line HUVEC-C is a Finite cell line with a species of origin Homo sapiens

View all literature mentions