Integration of high-risk human papillomavirus (HPV) into the host genome is a crucial
event for the development of cervical cancer, however, the underlying mechanism of
HPV integration-driven carcinogenesis remains unknown. Here, we performed long-read
RNA sequencing on 12 high-grade squamous intraepithelial lesions (HSIL) and cervical
cancer patients, including 3 pairs of cervical cancer and corresponding para-cancerous
tissue samples to investigate the full-length landscape of cross-species genome integrations.
In addition to massive unannotated isoforms, transcriptional regulatory events, and
gene chimerism, more importantly, we found that HPV-human fusion events were prevalent
in HPV-associated cervical cancers. Combined with the genome data, we revealed the
existence of a universal transcription pattern in these fusion events, whereby structurally
similar fusion transcripts were generated by specific splicing in E6 and a canonical
splicing donor site in E1 linking to various human splicing acceptors. Highly expressed
HPV-human fusion transcripts, eg, HPV16 E6*I-E7-E1SD880-human gene, were the key driver of cervical carcinogenesis, which could trigger overexpression
of E6*I and E7, and destroy the transcription of tumor suppressor genes CMAHP, TP63 and P3H2. Finally, evidence from in vitro and in vivo experiments demonstrates that the novel read-through fusion gene mRNA, E1-CMAHP (E1C,
formed by the integration of HPV58 E1 with CMAHP), existed in the fusion transcript
can promote malignant transformation of cervical epithelial cells via regulating downstream oncogenes to participate in various biological processes. Taken
together, we reveal a previously unknown mechanism of HPV integration-driven carcinogenesis
and provide a novel target for the diagnosis and treatment of cervical cancer.
Abbreviations:
APA (Alternative polyadenylation positions sites), AS (Alternative splicing), ATSS (Alternative transcription start sites), DEGs (Differentially expressed genes), E1C (E1-CMAHP), Ect1_E1C (The stable Ect1/E6E7 cell line overexpressing E1-CMAHP), Ect1_NC (The stable Ect1/E6E7 cell line with control plasmid), EMT (Epithelial-to-mesenchymal transition), HR-HPV (High-risk human papillomavirus), HSIL (High-grade squamous intraepithelial lesions), PacBio Iso-Seq (Pacific Biosciences Isoform sequencing), RNA-seq (RNA sequencing), SCC (Squamous cell carcinoma), TCGA (The Cancer Genome Atlas)To read this article in full you will need to make a payment
Purchase one-time access:
Academic & Personal: 24 hour online accessCorporate R&D Professionals: 24 hour online accessOne-time access price info
- For academic or personal research use, select 'Academic and Personal'
- For corporate R&D use, select 'Corporate R&D Professionals'
Subscribe:
Subscribe to Translational ResearchAlready a print subscriber? Claim online access
Already an online subscriber? Sign in
Register: Create an account
Institutional Access: Sign in to ScienceDirect
References
- Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries.CA Cancer J Clin. 2021; 71: 209-249
- Cervicovaginal microbiota composition correlates with the acquisition of high-risk human papillomavirus types.Int J Cancer. 2018; 143: 621-634
- Cervical Cancer Screening With Human Papillomavirus DNA and Cytology in Japan.Int J Gynecol Cancer. 2017; 27: 523-529
- HPV-CCDC106 integration alters local chromosome architecture and hijacks an enhancer by three-dimensional genome structure remodeling in cervical cancer.J Genet Genomics. 2020; 47: 437-450
- HPV Integration in HNSCC Correlates with Survival Outcomes, Immune Response Signatures, and Candidate Drivers.Mol Cancer Res. 2018; 16: 90-102
- Understanding the HPV integration and its progression to cervical cancer.Infect Genet Evol. 2018; 61: 134-144
- Papillomavirus E6 oncoproteins.Virology. 2013; 445: 115-137
- The papillomavirus E7 proteins.Virology. 2013; 445: 138-168
- Genome-wide profiling of HPV integration in cervical cancer identifies clustered genomic hot spots and a potential microhomology-mediated integration mechanism.Nat Genet. 2015; 47: 158-163
- Pathogenesis of human papillomavirus-associated mucosal disease.J Pathol. 2015; 235: 527-538
- Genome-wide analysis of HPV integration in human cancers reveals recurrent, focal genomic instability.Genome Res. 2014; 24: 185-199
- Integrated genomic and molecular characterization of cervical cancer.Nature. 2017; 543: 378-384
- ViFi: accurate detection of viral integration and mRNA fusion reveals indiscriminate and unregulated transcription in proximal genomic regions in cervical cancer.Nucleic Acids Res. 2018; 46: 3309-3325
- Characterization of HPV integration, viral gene expression and E6E7 alternative transcripts by RNA-Seq: A descriptive study in invasive cervical cancer.Genomics. 2019; 111: 1853-1861
- Differences in Stability of Viral and Viral-Cellular Fusion Transcripts in HPV-Induced Cervical Cancers.Int J Mol Sci. 2019; 21112
- Aberrant splicing isoforms detected by full-length transcriptome sequencing as transcripts of potential neoantigens in non-small cell lung cancer.Genome Biology. 2021; 229
- Long-Read Sequencing - A Powerful Tool in Viral Transcriptome Research.Trends Microbiol. 2019; 27: 578-592
- Long-read sequencing to understand genome biology and cell function.Int J Biochem Cell Biol. 2020; 126105799
- Complex rearrangements and oncogene amplifications revealed by long-read DNA and RNA sequencing of a breast cancer cell line.Genome Res. 2018; 28: 1126-1135
- Hybrid sequencing-based personal full-length transcriptomic analysis implicates proteostatic stress in metastatic ovarian cancer.Oncogene. 2019; 38: 3047-3060
- Long Read Single-Molecule Real-Time Sequencing Elucidates Transcriptome-Wide Heterogeneity and.Complexity in Esophageal Squamous Cells. Front Genet. 2019; 10: 915
- Fast and accurate short read alignment with Burrows-Wheeler transform.Bioinformatics. 2009; 25: 1754-1760
- Sensitive detection of somatic point mutations in impure and heterogeneous cancer samples.Nat Biotechnol. 2013; 31: 213-219
- Control-FREEC: a tool for assessing copy number and allelic content using next-generation sequencing data.Bioinformatics. 2012; 28: 423-425
- Manta: rapid detection of structural variants and indels for germline and cancer sequencing applications.Bioinformatics. 2016; 32: 1220-1222
- HIVID: An efficient method to detect HBV integration using low coverage sequencing.Genomics. 2013; 102: 338-344
- GMAP and GSNAP for Genomic Sequence Alignment: Enhancements to Speed, Accuracy, and Functionality.Methods Mol Biol. 2016; 1418: 283-334
- GFF Utilities: GffRead and GffCompare.F1000Res. 2020; 9304
- SQANTI: extensive characterization of long-read transcript sequences for quality control in full-length transcriptome identification and quantification.Genome Res. 2018; 28: 396-411
- A survey of the sorghum transcriptome using single-molecule long reads.Nature Communications. 2016; 711706
- SUPPA2: fast, accurate, and uncertainty-aware differential splicing analysis across multiple conditions.Genome Biology. 2018; 1940
- The Fusion Gene Landscape in Taiwanese Patients with Non-Small Cell Lung Cancer.Cancers. 2021; 131343
- Metascape provides a biologist-oriented resource for the analysis of systems-level datasets.Nat Commun. 2019; 10: 1523
- Pancancer survival analysis of cancer hallmark genes.Sci Rep. 2021; 11: 6047
- BLAT - The BLAST-like alignment tool.Genome Research. 2002; 12: 656-664
- The EMBL-EBI search and sequence analysis tools APIs in 2019.Nucleic Acids Res. 2019; 47: W636-WW41
- UniProt: the universal protein knowledgebase in 2021.Nucleic Acids Research. 2021; 49: D480-D4D9
- SWISS-MODEL: homology modelling of protein structures and complexes.Nucleic Acids Res. 2018; 46: W296-W303
- Integrative genomics viewer.Nat Biotechnol. 2011; 29: 24-26
- The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants.Nucleic Acids Res. 2010; 38: 1767-1771
- HISAT: a fast spliced aligner with low memory requirements.Nat Methods. 2015; 12: 357-360
- Ultrafast and memory-efficient alignment of short DNA sequences to the human genome.Genome Biol. 2009; 10: R25
- RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome.BMC Bioinformatics. 2011; 12323
- Controlling the false discovery rate in behavior genetics research.Behav Brain Res. 2001; 125: 279-284
- Driver Fusions and Their Implications in the Development and Treatment of Human Cancers.Cell Rep. 2018; 23 (e3): 227-238
- Tumor Fusions: an integrative resource for cancer-associated transcript fusions.Nucleic Acids Res. 2018; 46: D1144-D11D9
- The DisGeNET cytoscape app: Exploring and visualizing disease genomics data.Comput Struct Biotechnol J. 2021; 19: 2960-2967
- Extrachromosomal DNA in HPV-Mediated Oropharyngeal Cancer Drives Diverse Oncogene Transcription.Clin Cancer Res. 2021; 27: 6772-6786
- Insertional oncogenesis by HPV70 revealed by multiple genomic analyses in a clinically HPV-negative cervical cancer.Genes Chromosomes Cancer. 2020; 59: 84-95
- The characteristics of HPV integration in cervical intraepithelial cells.J Cancer. 2019; 10: 2783-2787
- Genome-Wide Profiling of Human Papillomavirus DNA Integration into Human Genome and Its Influence on PD-L1 Expression in Chinese Uygur Cervical Cancer Women.J Immunol Res. 2020; 20206284960
- Human papilloma virus (HPV) integration signature in Cervical Cancer: identification of MACROD2 gene as HPV hot spot integration site.Br J Cancer. 2021; 124: 777-785
- Multiple ASF/SF2 Sites in the Human Papillomavirus Type 16 (HPV-16) E4-Coding Region Promote Splicing to the Most Commonly Used 3 '-Splice Site on the HPV-16 Genome.J Virol. 2010; 84: 8219-8230
- The Role of E6 Spliced Isoforms (E6*) in Human Papillomavirus-Induced Carcinogenesis.Viruses. 2018; 10: 45
- An internal class III PDZ binding motif in HPV16 E6* protein is required for Dlg degradation activity.Biochim Biophys Acta Gen Subj. 2021; 1865129850
- Structure of the E6/E6AP/p53 complex required for HPV-mediated degradation of p53.Nature. 2016; 529: 541-545
- The E7 oncoprotein is translated from spliced E6*I transcripts in high-risk human papillomavirus type 16-or type 18-positive cervical cancer cell lines via translation reinitiation.J Virol. 2006; 80: 4249-4263
- Human Papillomavirus Genome Integration and Head and Neck Cancer.J Dent Res. 2018; 97: 691-700
- Characterization of recombinant human prolyl 3-hydroxylase isoenzyme 2, an enzyme modifying the basement membrane collagen IV.J Biol Chem. 2008; 283: 19432-19439
- Identification and functional characterization of 2 new transcriptional variants of the human p63 gene.Nucleic Acids Res. 2009; 37: 6092-6104
- The prolyl 3-hydroxylases P3H2 and P3H3 are novel targets for epigenetic silencing in breast cancer.Br J Cancer. 2009; 100: 1687-1696
- Role of p63 in cancer development.Biochim Biophys Acta. 2011; 1816: 57-66
- Decreased Expression of Long Non-Coding RNA GMDS Divergent Transcript (GMDS-DT) is a Potential Biomarker for Poor Prognosis of Hepatocellular Carcinoma.Med Sci Monit. 2019; 25: 6221-6229
- LncRNA GMDS-AS1 inhibits lung adenocarcinoma development by regulating miR-96-5p/CYLD signaling.Cancer Med. 2020; 9: 1196-1208
- Delta Np63 alpha exerts antitumor functions in cervical squamous cell carcinoma.Oncogene. 2020; 39: 905-921
- CMAHP promotes metastasis by reducing ubiquitination of Snail and inducing angiogenesis via GM-CSF overexpression in gastric cancer.Oncogene. 2022; 41: 159-172
- Development and Validation of a Prognostic Index Based on Genes Participating in Autophagy in Patients With Lung Adenocarcinoma.Front Oncol. 2021; 11799759
- Identification of EMT signaling cross-talk and gene regulatory networks by single-cell RNA sequencing.Proc Natl Acad Sci U S A. 2021; 118e2102050118
- The E1 proteins.Virology. 2013; 445: 35-56
- Generation of papillomavirus-immortalized cell lines from normal human ectocervical, endocervical, and vaginal epithelium that maintain expression of tissue-specific differentiation proteins.Biol Reprod. 1997; 57: 847-855
- Sphingosine-1-phosphate receptor 3 signaling.Clin Chim Acta. 2021; 519: 32-39
- EGR2-mediated regulation of m(6)A reader IGF2BP proteins drive RCC tumorigenesis and metastasis via enhancing S1PR3 mRNA stabilization.Cell Death Dis. 2021; 12: 750
- Nucleolin Promotes Cisplatin Resistance in Cervical Cancer by the YB1-MDR1 Pathway.J Oncol. 2021; 20219992218
- AUF1 ligand circPCNX reduces cell proliferation by competing with p21 mRNA to increase p21 production.Nucleic Acids Res. 2021; 49: 1631-1646
- ZY0511, a novel, potent and selective LSD1 inhibitor, exhibits anticancer activity against solid tumors via the DDIT4/mTOR pathway.Cancer Lett. 2019; 454: 179-190
- Genomic Landscape of Human Papillomavirus-Associated Cancers.Clin Cancer Res. 2015; 21: 2009-2019
- Comprehensive genomic characterization of head and neck squamous cell carcinomas.Nature. 2015; 517: 576-582
- Characterization of the human ESC transcriptome by hybrid sequencing.Proc Natl Acad Sci U S A. 2013; 110: E4821-E4830
- Single-Molecule Real-Time (SMRT) Full-Length RNA-Sequencing Reveals Novel and Distinct mRNA Isoforms in Human Bone Marrow Cell Subpopulations.Genes (Basel). 2019; 10253
- Long-read transcriptome sequencing reveals abundant promoter diversity in distinct molecular subtypes of gastric cancer.Genome Biol. 2021; 22: 44
- A comprehensive assessment of RNA-seq accuracy, reproducibility and information content by the Sequencing Quality Control Consortium.Nat Biotechnol. 2014; 32: 903-914
- A comprehensive examination of Nanopore native RNA sequencing for characterization of complex transcriptomes.Nat Commun. 2019; 10: 3359
- Comprehensive Analysis of Alternative Splicing Across Tumors from 8,705 Patients.Cancer Cell. 2018; 34 (e6): 211-224
- Fusion transcripts: Unexploited vulnerabilities in cancer?.Wiley Interdiscip Rev RNA. 2020; 11: e1562
- C33-A cells transfected with E6*I or E6*II the short forms of HPV-16 E6, displayed opposite effects on cisplatin-induced apoptosis.Virus Res. 2018; 247: 94-101
- The large and small isoforms of human papillomavirus type 16 E6 bind to and differentially affect procaspase 8 stability and activity.J Virol. 2007; 81: 4116-4129
- The HPV-16 E7 oncoprotein is expressed mainly from the unspliced E6/E7 transcript in cervical carcinoma C33-A cells.Arch Virol. 2010; 155: 1959-1970
- Transcriptome analysis of cervical cancer exosomes and detection of HPVE6*I transcripts in exosomal RNA.Bmc Cancer. 2022; 22: 164
- Gene fusion neoantigens: Emerging targets for cancer immunotherapy.Cancer Lett. 2021; 506: 45-54
Article info
Publication history
Published online: October 09, 2022
Accepted:
September 26,
2022
Received in revised form:
September 18,
2022
Received:
August 6,
2022
Identification
Copyright
© 2022 Elsevier Inc. All rights reserved.