1. Introduction
The human major histocompatibility complex (MHC), also known as the human leukocyte antigen (HLA), covers 0.13% of the human genome and spans ~4 Mbp on the short arm of chromosome six at position 6p21 within a region that contains more than 250 annotated genes and pseudogenes [
1,
2]. The classical class I and class II regions within the MHC have extensive patterns of linkage disequilibrium (LD), and a high degree of single nucleotide polymorphisms (SNPs) at the HLA genes can differentiate worldwide populations [
1,
3,
4,
5]. HLA polymorphisms are a crucial determinant of the adaptive immune response to infectious agents, allograft success, or rejection and self/nonself immune recognition that can contribute to more autoimmune diseases than any other region of the genome [
1,
2,
6,
7,
8]. Apart from the adaptive immune response, MHC class I molecules have a role in brain development, synaptic plasticity, axonal regeneration, and immune-mediated neurodegeneration [
9,
10,
11,
12]. At least half of the molecules encoded by this highly polymorphic locus are involved in antigen processing and presentation, inflammation regulation, the complement system, and the innate and adaptive immune responses, highlighting the importance of the MHC in immune-mediated autoimmune and infectious diseases [
1,
2]. Polymorphisms expressed by the MHC genomic region influence many critical biological traits and individuals’ susceptibility to the development of chronic autoimmune diseases such as type I diabetes, rheumatoid arthritis, celiac disease, psoriasis, ankylosing spondylitis, multiple sclerosis, Graves’ disease, schizophrenia, bipolar disorder, inflammatory bowel disease, and dermatomyositis [
2,
6,
7,
8]. Furthermore, different viral infections and cancers are associated strongly with the suppression of MHC genomic expression activity, particularly in the region of the MHC class I and class II loci [
6,
13,
14,
15].
There are tens of thousands of genomic loci that express microRNA (miRNA) [
16] and lncRNA [
17,
18,
19,
20], but only about 50 have been investigated in any great detail with respect to their role in the regulation of the immune system and disease [
21,
22,
23,
24]. Although there are many miRNA and lncRNA loci within the MHC genomic region, they have been ignored largely in favor of studies on polymorphisms of the HLA class I and class II gene loci in health, disease, and transplantation cell/tissue/organ typing [
25,
26]. This review focuses on the structure and function of only one of these HLA lncRNA, the
HCP5 lncRNA, which is located between the
MICA and
MICB genes and ~105 kb centromeric of the
HLA-B gene.
In 1993, Vernet et al. [
27] discovered a novel coding sequence belonging to a new multicopy pseudogene family P5 that they mapped within the HLA class I region and named
P5-1 (alias for
HCP5). They found that it expressed a 2.5-kb transcript in human B-cells, phytohemagglutinin-activated lymphocytes, a natural killer-like cell line, normal spleen, hepatocellular carcinoma, neuroblastoma, and other non-lymphoid tissue, but not in T-cells.
HCP5 (P5-1) appeared to be a hybrid sequence created by nonhomologous recombination between two pseudogenes or nonmobile genetic elements that possibly produced a protein comprising 219 amino acids (aa’s) [
28]. A few years later, the
HCP5 (P5-1) gene was mapped precisely to a region between the
MICA and
MICB genes and downstream, at the centromeric end, of the two classical HLA class I genes
HLA-B and
HLA-C (
Figure 1) [
29]. In 1999, Kulski and Dawkins [
30] used the computer programs Censor and RepeatMasker and dot-plot DNA and RNA sequence analyses to demonstrate that the
HCP5 gene sequence and its transcripts were composed mainly of the 3′LTR and
pol sequences of an ancient
HERV16 insertion, which was a member of the HERVL or class III category of endogenous retroviruses (ERVs) in the human and mammalian genomes [
31,
32].
Because
HCP5 expressed an antisense transcript that was complementary to retrovirus
pol mRNA sequences and a 3′LTR, Kulski and Dawkins [
30] suggested that it might have a role in immunity to retrovirus infection. They considered that the lncRNA of
HCP5 might hybridize with retroviral sense mRNA sequences to suppress viral transcription, translation, and transport. Eight years later, a single-nucleotide polymorphism (rs2395029) in the
HCP5 gene was associated with
HLA-B*57:01 and correlated with a lower HIV-1 viral set point [
33], indicating that these two alleles within a particular haplotype may have a role in viral control [
34]. However, when Yoon et al. [
35] tested the antisense/antiviral hypothesis for
HCP5 by infecting TZM-bl cells in vitro with HIV-1 and plasmids expressing high levels of
HCP5 transcripts, they observed no restriction with infectivity throughout the viral life cycle. They concluded from their findings that the
HCP5 gene had no direct antiviral effect, and that the association of an
HCP5 variant with viral control most likely was due to an
HLA-B*57:01-related effect or other functional variants in the haplotype or both. In fact, it appears that the role of
HCP5 in immunity and human disease is far more complex than previously envisioned, and that its antiviral affects might occur by way of some secondary mechanisms such as the possible involvement of miRNA inhibition rather than by hybridization of the
HCP5 transcript with the complementary viral
pol transcripts.
During the last two decades,
HCP5 SNPs have been associated with many different diseases in genome-wide association studies (GWASs), gene expression studies, and cancer studies investigating tissue and cellular biomarkers of tumor progression and inhibition. To better understand the genetics, molecular biology, and functions of
HCP5, this paper reviewed the available data and literature on the genomic organization, structure, and function of the
HCP5 gene (HLA complex P5 (non-protein coding), HGNC:21659) in health and disease (MIM:604676), particularly its association with autoimmune diseases, cancer, and infections by way of its endogenous interactions with miRNA and various gene targets.
Table S1 lists the online databases and repositories that were searched and interrogated to find the available data on
HCP5.
Table S2 lists a summary of the downloaded functional associations in all of the datasets linked with Harmonizome, which is an integrated knowledge base connecting big data with a collection of information about genes and proteins from 114 datasets provided by 66 online resources (
Table S1).
4. HCP5 Methylomics
The study of differentially methylated sites, differential gene expression, and epigenetic mechanisms represent a complementary method to genetic association studies for the identification of molecular and biological pathways that contribute to good health during a normal life cycle and to clinical heterogeneity of autoimmune and chronic diseases and cancer [
120]. Recent studies have identified differentially methylated sites within, or neighboring, the
HCP5 gene sequence associated with epigenetic regulation of various disease phenotypes (obesity, SLE) and in response to fetal development, aging, HIV infection, and vaccination (
Table 4).
Hypomethylation of
HCP5 was associated with autoantibody production against dsDNA, Sjogren’s syndrome-related antigen A (SSA), Smith (Sm) antigen, and ribonucleoprotein (RNP) in SLE [
122,
126], with gene expression and humoral immune response to influenza [
124], with hypomethylated PSORS1C1-associated allopurinol-induced severe cutaneous adverse reactions in Han Chinese [
127], accelerated aging in chronic HIV infection [
123], endometrial receptivity [
125], sexual bias in the human placental sexome [
128,
129], age-related monocyte and T cell gene expression [
121], lung adenocarcinoma [
130], and with hypomethylated
POU5F1-associated ankylosing spondylitis [
131]. In contrast to the more common observation of hypomethylation, HCP5 hypermethylation was associated with obesity and BMI in an epigenome-wide association study of adiposity in Ghanaian African migrants using whole blood to measure DNA methylation [
64].
Age-related HCP5 DNA methylation was associated with gene expression in human monocytes and T cells [
121], and the expressed genes that linked to potentially functional age-related methylation sites were enriched with antigen processing and presentation MHC class I and class II genes that were implicated in ‘parainflammation’ and the development of age-related chronic inflammatory diseases and autoimmune diseases. The total effects of age on gene expression (which increased with age) were significant (FDR < 0.05) for seven MHC genes—
HLA-B, -E, -DPA1, -DPB1, TAP2, TAPBP, and
HCP5—with hypomethylation within and/or near to all of those genes. On the other hand, Gross et al. [
123] found an HCP5 CpG DNA methylation signature in blood cells of patients with chronic, well-controlled HIV infection that correlated with accelerated aging, and that it also was independently associated with HLA expression and corresponding HIV control. The level of methylation at HCP5 was correlated with a patient’s CD4
+/CD8
+ T cell ratio to provide further evidence that the observed changes were functional [
123]. Chronic HIV infection, even when viral loads were kept below the level of detection, is associated with early onset of diseases linked to aging, including cardiovascular disease, kidney disease, cancer, and premature death. Highly active antiretroviral therapy (HAART) controls the burden of HIV, without curing the infection, enabling HIV-infected patients to live for many decades, provided they continue their medications. The increased methylation changes in HIV-infected patients found beyond their chronological age suggested about a five-year increase in aging compared to healthy controls [
123].
Systemic lupus erythematosus (SLE) is a chronic inflammatory autoimmune disease of unknown etiology that can affect most organs and is characterized by the development of autoantibodies associated with specific clinical manifestations implicated in the pathogenesis of lupus nephritis and decreased survival [
122,
126]. The genetic risk factors suggested for SLE include alleles in
IRF5, STAT4, BLK, TNFAIP3, TNIP1,
FCGR2B, and other genes [
132]. Genome-wide DNA methylation analysis of SLE revealed persistent hypomethylation of interferon genes and compositional changes to CD4
+ T cell populations. For example, Chung et al. [
122] characterized the methylation status of 467,314 CpG sites in 326 women with SLE DNA methylation profiling, performed using the Infinium HumanMethylation450 BeadChip (Illumina), and they identified and replicated significant associations between anti-dsDNA autoantibody production and the methylation status of 16 CpG sites in 11 genes. Differential methylation for these CpG sites was also associated with anti-SSA, anti-Sm, and anti-RNP autoantibody production. Overall, associated CpG sites were hypomethylated in autoantibody-positive samples compared to autoantibody-negative cases. In the discovery/replication analysis, associations with hypomethylated CpG sites were within genes (
IFIT1, IFI44L, MX1, RSAD2, OAS1, and
EIF2AK2) that were either induced by type 1 interferon or that regulated type 1 interferon signaling (NLRC5). Except for hypomethylation at
HCP5 and the
PSMB8 gene in the class III region, differential methylation of CpG sites within the MHC was not strongly associated with autoantibody production. Thus, hypomethylation of CpG sites within
HCP5 and other genes from different pathways that could not be explained by DNA sequence variation were associated strongly with anti-dsDNA, anti-SSA, anti-Sm, and anti-RNP production in SLE.
In a system-wide association study between DNA methylation, gene expression, and humoral immune response to influenza [
124], a cohort of 158 individuals who were 50 to 70 years old showed that
HCP5 along with
HLA-B and
HLA-DQB2 had an important role in methylation expression, particularly when the humoral immune response to influenza was measured by a hemagglutination inhibition assay (HAI). Only two genes showed association in all three independent analyses:
ADARB2, an inhibitor of adenosine deaminase activity (RNA editing), and
SPEG, a kinase with a known function in myocyte development. The small number of genes, including
HCP5, that overlapped across two or more methods and at multiple time points were
HLA-B,
HLA-DQB2, the histone deacetylase
HDAC4, RWDD2B, PTPRN2 that (de)phosphorylates phosphoinositols in an insulin regulatory role,
DNAH2, HCP5, FAM24B, LOC399815, and the transcription factor genes
PAX7 and
PAX9. Many genes (~640 genes) that were identified in one of these analyses had direct protein–protein interacting genes identified in the other analyses, revealing that the impact of methylation on humoral immunity is complex and highly dependent upon the immune outcome. Zimmermann et al. [
124] reported that methylation levels of a CpG within the gene body of
HLA-B (hypermethylation) were strongly associated with HAI and had an opposite trend to that of
HCP5 (hypomethylation).
Human and animal studies have identified that the placenta expresses select transcripts in a sexually dimorphic manner [
129]. A microarray-based study identified sex-dependent differences in the placental transcriptomic profile in males and females (sexome) with isolated cells derived from human placental villi [
128]. The four cell types examined included cytotrophoblasts, synctiotrophoblasts, and arterial and venous endothelial cells. For sex-dependent differences, the males demonstrated enrichment of signaling pathways previously reported to mediate graft versus host disease and transcripts involved in immune function and inflammation such as
HLA-DQB1 (syncytiotrophoblast),
HLA-DQA1 (syncytiotrophoblast and cytotrophoblast),
HCP5 (cytotrophoblast),
NOS1 (cytotrophoblast),
FSTL3, PAPPA, SPARCL,
FCGR2C (trophoblast epithelium),
CD34 (cytotrophoblast),
HLA-F (cytotrophoblast), and
BCL2 (syncytiotrophoblast). Males demonstrated a greater in utero vulnerability, and the findings of Cvitic et al. [
128] suggested that these effects may partially be due to reduced maternal–fetal compatibility for males who were then required to up-regulate immune-associated transcripts in an attempt to combat an attack by the maternal immune system.
In a genome-wide methylome analysis of endometrial biopsies collected from 17 healthy fertile-aged women from prereceptive and receptive phases of a menstrual cycle, Kukushkina et al. [
125] found that extracellular matrix organization and immune response were the pathways most affected by methylation changes during the transition from prereceptive to receptive phase. The overall methylome remained relatively stable during the two time points of the menstrual cycle with small-scale changes affecting 5% of the studied CpG sites (22,272 out of 437,022 CpGs, FDR < 0.05). The study confirmed that the differential methylation of
KRTAP17-1, CASP8, RANBP3L, WT1, MPP7, PTPRN2, and
HCP5 between the early and midsecretory phases were similar to those observed in the previous studies [
133,
134]. The differential methylation of
PTPRN2 and
HCP5 in the endometrium is an interesting connection, given that they both were differentially methylated in a system-wide association study between DNA methylation, gene expression, and humoral immune response to influenza [
134]. The
PTPRN2 (NCBI gene: 5799) gene product (de)phosphorylates phosphoinositols, has an insulin regulatory role, and it may be an autoantigen in insulin-dependent diabetes mellitus, but its actual function as a methylation site in the endometrium or human monocytes and T cells is not known.
In an epigenome-wide association study using whole blood measures of adiposity in 547 Ghanaian African migrants, Meeks et al. [
64] found that obesity and body mass index (BMI) were related to
HCP5 hypermethylation and 18 differentially methylated positions (DMPs) for BMI, 23 for waist circumference, and three for obesity. Fourteen DMP overlapped between BMI and waist circumference. The two epigenome-wide loci that were significantly hypomethylated for both general adiposity and abdominal adiposity were
CPT1A (carnitine palmitoyltransferase 1A) and
BCAT1 (branched chain amino acid transaminase 1), whereas
NLRC5 ((NLR family CARD domain containing 5) and most other DMPs including those for six HLA genes—
HCP5, HLA-B, TAP1, TAP2, PSMB8, and
HLA-E—were hypermethylated. The hypermethylation of
NLRC5 was highly significant, and this gene is known to regulate the expression of MHC class I genes and to limit the activation of inflammatory pathways [
54,
55,
56]. Thus, the results of Meeks et al. [
64] suggested that obesity might suppress the adaptive immune response and induce inflammation that could also result in insulin resistance.
Coit et al. [
131] identified a total of 68 differentially methylated sites between ankylosing spondylitis (AS) patients and osteoarthritis controls.
HCP5 and
POU5F1 were both hypomethylated in
HLA-B*27-positive compared to
HLA-B*27-negative AS patients. They suggested that
HLA-B*27 might play a role in AS in part through epigenetic linkage disequilibrium-inducing epigenetic dysregulation. The
POU5F1 gene (alias
OCT4) is located at the telomeric end of the MHC beta-block, ~98 kb upstream of the
HLA-C gene, and its role in methylation is well described [
120].
HCP5 is known to be involved in lung cancer [
130], and Yuan et al. [
92] described at least six
HCP5 SNVs, including rs3130907, that were associated significantly with lung cancer susceptibility (
Table 3). Previously, Orvis et al. [
135] showed that inactivation of the
BRG1 gene, also known as
SMARCA4, which encodes the ATPase subunits of the SW1/SNF chromatin remodeling complex, contributed to non-small cell lung cancer aggressiveness by altering nucleosome positioning in a wide range of genes as well as by downregulating the expression of
HCP5 and all of the classical and nonclassical HLA class I genes.
Presumably, hypomethylation of
HCP5 leads to added interactions and connectivity with proteins and other RNA sequences, especially with miRNA regulators. Studies on hypermethylation of
HCP5 are still lacking, and such studies might provide a more contrasting view of the action of methylation on the function of
HCP5 in health and disease. However, the overall absence of hypermethylation data for
HCP5 might be related to the DNA methylation paradox, whereby methylation of the transcribed region and the region of transcription initiation have opposite effects on gene expression [
120]. Although methylation can affect gene expression in both directions depending on the genomic region, there are more negative correlations in the 5′ UTR, while positive correlations are more common in the gene body region. While this was the case for
HLA-B, the reverse was observed for
HCP5 in humoral immune response to influenza [
124].
5. HCP5, Gene Targets, and Transcription Factors in Interaction Networks
Many hundreds of different transcription factors (TFs) are believed to target the
HCP5 sequence and regulate its expression. Although only a few experiments have examined the relationship between particular transcription factors and the expression of
HCP5 and its neighboring genes in the region between
MICA and
MICB, a variety of datasets have predicted connections between
HCP5 and many known TFs.
Table S4 shows five Internet databases sourced from Harmonizome and GeneCards (
Table S1) that associated TFs with
HCP5. For example, the dataset of JASPAR Predicted Transcription Factor Targets predicted that 57 transcription factors were associated with regulating the expression of
HCP5, whereas the MotifMap Predicted Transcription Factor Targets dataset predicted only seven associations: alpha-CP1, E2A, ETS2, MAFA, NF-kB, NF-Y, and TEF-1. On the other hand, the TRANSFAC curated dataset only listed PTF1A as a TF, interacting with the
HCP5 gene in low- or high-throughput transcription factor functional studies. PTF1A is the pancreas-specific transcription factor 1a, with a role in mammalian pancreatic development and in determining whether cells allocated to the pancreatic buds continue towards pancreatic organogenesis or revert back to duodenal fates. Also,
HCP5 is only one of 233 target genes for the PTF1A transcription factor. In contrast, the TRANSFAC dataset predicted that 13 TFs regulated the expression of the
HCP5 gene: ATF2, ELF3, ETS1, HINFP, JDP2, LEF1, LTF, MYC, NFE2L2, RUNX1, SMAD4, SMARCA2, and SPI1. Most of these predicted transcription factors also targeted many other genes as part of interaction networks, cascades, or divergent pathways.
In a study of interactions between
HCP5 and transcription factors, Warner et al. [
136] showed that
HCP5, together with
NOD2 and
IL-8, was associated strongly with decreased viability of cells in a study of the inactivation of the
NF-kB1 gene by knockout. In their datafile, the other MHC genes that were strongly associated with decreased viability were
HLA-A, -DQA1, -DRB1, -DRB4, -DOA, and -
DOB but not
HLA-B, -E, -G, -F, -DRB5, -DMA, -DRA –DQA2, -DQB1, -DRB3, -DMB, -
DPB1, -DPA1, MICA, MICB, TNF, LTA, LTB, C4A, and
C4B in the HEK293 cell line. Thus, NF-kB1 (located on chr 4) regulates
HCP5 and the gene expression of some other MHC genes as well as a wide variety of biological functions, including inappropriate activation associated with inflammatory diseases, inappropriate immune cell development, and delayed cell growth. This study [
136] also demonstrated that although
HCP5 was often in LD with many genes in the MHC, it could be activated or suppressed independently from most of them.
Coit et al. [
131] identified a total of 68 differentially methylated sites in a study of ankylosing spondylitis (AS) patients and osteoarthritis controls;
HCP5 and
POU5F1 were both hypomethylated in
HLA-B*27-positive compared to
HLA-B*27-negative AS patients. This predicted cis relationship between the transcription factor
POU5F1 and
HCP5 is interesting, given that both genes are located in the beta block of the HLA class I region [
1,
2]. Also, Meeks et al. [
64] in an epigenome-wide association study of measures of adiposity among Ghanaians showed that
NLRC5 and
HCP5, HLA-B, TAP1, TAP2,
PSMB8, and
HLA-E were all significantly hypermethylated for both general adiposity and abdominal adiposity.
The interactive website Pathwaynet [
137] predicts both the presence of a functional association and the most likely interaction type among human genes or their protein products on a whole-genome scale. It is based on a large compendium of refined regulatory interactions within 77 tissues, with their curated pathways taken from primary experimental datasets such as 690 ChIP-Seq datasets, numerous mass spectrometry of metabolites, protein–protein interactions, disease samples, etc., in order to capture the interaction networks.
Figure 5 shows the top 15 genes that interacted with
HCP5, as predicted by Pathwaynet [
137], with a high relationship confidence of between 0.9247 (
BTN3A3 and
IRF9) and 0.9523 (
HLA-F). All of the genes were within the MHC region except for
UBE2L6 (chr11q12.1),
TRIM22 (chr11p15.4),
IRF1 (chr5q31.1),
CASP1 (chr11q22.3), and
IRF9 (chr14q12). Pathwaynet did not specify the type of functional relationships between
HCP5 and these 15 genes or gene products; therefore, the predicted gene interactions should be considered with considerable caution. However, it is evident that the non-HCP5 genes have roles in antigen processing and presentation, the proteasome, graft versus host disease, allograft rejection, autoimmune disease, response to type 1 interferon or interferon gamma, regulation of viral reproduction, IL-6- and IL-12-mediated and NOD-like signaling pathways, and signal transduction by the p53 class I mediator. This connection is supported to a large degree by the top 52 genes that were positively associated with
HCP5 gene expression in the Comparative Toxicogenomics Database (CTD) datasets (
Table 5). The
HCP5 interaction and regulation is probably by way of the methylome and the competitive endogenous RNA regulatory networks, although this premise needs to be investigated further in both in vivo and in vitro experiments and association studies.
6. HCP5 Gene Expression and Gene Interactions
In 1993, Vernet et al. [
27] originally reported that
HCP5 expressed a 2.5 kb transcript in human B cells, phytohemagglutinin-activated lymphocytes, a natural killer-like cell line, normal spleen, hepatocellular carcinoma, neuroblastoma, and other nonlymphoid tissue but not in T cells. Since then, numerous studies of genome-wide gene expression using dense Affymetrix expression arrays were published, but the findings rarely reported directly on the expression of
HCP5. To identify the expression profile of
HCP5 in various scenarios, the databanks needed to be investigated and interrogated separately. In this way, it was possible to find the particular pattern of
HCP5 expression especially in comparison to the other class I and class II genes.
HCP5 is widely expressed at low levels, but it is primarily expressed at higher levels in cells of the immune system such as spleen, blood, and thymus (
http://smd-www.stanford.edu/), consistent with potential roles in autoimmunity and cancer.
Harmonizome (
Table S1) was a good starting point to find data about
HCP5 expression under various experimental conditions [
138]. The gene expression results of
HCP5 in 53 tissues from 8555 samples (570 donors) were sourced from GTEx RNA-seq using the University of California, Santa Cruz (UCSC) online browser (
Table S1). Also, interrogation of the online NCBI Gene Expression Omnibus (GEO) with the keyword “HCP5” produced 1771 results to review. Eighty-nine results were related to up and down differential expression, 6 results to the keyword ‘immunity’, 375 results to ‘cancer’, 28 to ‘HIV’, 50 to ‘virus’, 36 to ‘interferon’, 42 to ‘host defense’, and 4 to ‘MHC’. These results were browsed with a visual profile of the effects of treatments and experiments on the gene expression of
HCP5 and/or other genes of investigative choice. Nine studies in GEO confirmed that IFN and IL28B upregulated
HCP5 RNA in some cell types, whereas IL10 downregulated
HCP5 RNA in peripheral blood mononuclear cells (PBMCs) (
Table 2). There were little or no significant data for other cytokine-positive or -negative effects on
HCP5.
In comparison,
Table S5 shows the ~37 drugs and chemicals that induce or suppress
HCP5 expression with effects on inferred diseases, and that were identified in the Comparative Toxicogenomics Database (CTD) [
139]. Based on the data in
Table S5,
HCP5 gene expression seems to be decreased by various immunosuppressants and neurotoxins. This includes the immunosuppressive aflatoxin B1 that increased methylation of the
HCP5 gene [
140]. In addition, the CTD database [
139] revealed that
HCP5 reacted with 714 different genes in various gene expression studies. The top 52 genes that
HCP5 interacted with most often included
PTGS2, TNF, IL1B, PTGS1, and
CASP3 (
Table 5). The summary information provided by NCBI RefSeq for each of these five genes was the following: PTGS1 and PTGS2 are prostaglandin-endoperoxide synthases or cyclooxygenases, key enzymes in the biosynthesis of prostaglandin that are regulated by specific stimulatory events involved in inflammation and mitogenesis. The tumor necrosis factor (
TNF) gene that is located downstream of
HCP5 in the MHC class III region encodes a multifunctional, proinflammatory cytokine that is a member of the tumor necrosis factor (TNF) superfamily. This cytokine is mainly secreted by macrophages. It is involved in the regulation of a wide spectrum of biological processes including cell proliferation, differentiation, apoptosis, lipid metabolism, and coagulation. It has been implicated in a variety of diseases including autoimmune diseases, insulin resistance, and cancer. Knockout studies in mice suggested that TNF also has a neuroprotective function. IL1B is a cytokine expressed on chromosome two and produced by activated macrophages as a proprotein, which is proteolytically processed to an active form by caspase 1 (CASP1/ICE). This cytokine is an important mediator of the inflammatory response and is involved in a variety of cellular activities including cell proliferation, differentiation, and apoptosis. IL1B induces PTGS2/COX2 in the central nervous system and contributes to inflammatory pain hypersensitivity. CASP3 or caspase 3 is a protease with a central role in the execution phase of cell apoptosis. It inactivates poly(ADP-ribose) polymerase, while it cleaves and activates sterol regulatory element binding proteins as well as caspases 6, 7, and 9. Also, it is the predominant caspase involved in the cleavage of amyloid-beta 4A precursor protein, which is associated with neuronal death in Alzheimer’s disease. Therefore, it is evident that
HCP5 RNA is strongly associated with the inflammatory innate immune response as well as adaptive immune responses as indicated by its coexpression with various class I genes in expression studies in the databases (e.g., GEO) and the published literature.
6.1. HCP5 Expression in HIV-Infected Cells
Given that
HCP5 has been associated strongly with viral suppression in HIV-infected cells in GWAS (
Table 3), it is surprising that so few papers have specifically addressed the correlation between
HCP5 gene expression levels and HIV levels or response to HIV infection [
35]. However, there are a few studies in the GEO database to suggest that
HCP5 expression in response to HIV is induced, suppressed, or unaffected in some cell types. For example,
HCP5 RNA was significantly lower in the three HIV-negative controls than in three samples of jejunal mucosal cells from HIV patients on highly active antiviral therapy [
141]. Also,
HCP5 transcription activity was high in three mononuclear cell samples, but it was low or absent in three T cell samples and three fibroblast samples with HIV DNA integration sites [
142]. Similarly,
HCP5 RNA was higher in the brains of 10 of 26 patients receiving antiretroviral therapy for HIV-associated neurocognitive disorder than in nine uninfected controls [
143]. However, in some studies, there was no significant effect of HIV on
HCP5 RNA levels. For example, there was no difference in the
HCP5 RNA levels of 23 infected and 12 noninfected peripheral blood mononuclear cell samples [
144], little difference between eight infected and eight uninfected macrophage samples [
145], and little or no difference between five uninfected and 15 HIV-infected CD4
+ samples and 15 CD8+ T cell samples [
146]. Alternatively,
HCP5 RNA was low or absent in three T cell samples infected with HIV-based vector or three samples treated with TNF-alpha, but it was relatively higher in the three untreated T cell samples [
147]. Unfortunately, in the one study on RUNX1 in the regulation of HIV, no data were provided about
HCP5 RNA levels. Thus, based on these limited analyses, the role of
HCP5 in HIV infection and AIDS remains unclear.
6.2. HCP5 Expression in Cancer
HCP5 has been found upregulated or downregulated in a number of different cancers. The interactions between
HCP5 and three transcription factors with potential antioncogenic functions are noteworthy.
HCP5 was confirmed as one of the
KAT8 (alias hMOF) downregulated genes by qPCR and ChIP in the hMOF siRNA knockdown HeLa cells and 20 of 28 clinically diagnosed ovarian cancer tissues [
148]. KAT8 (lysine acetyltransferase 8) encodes a member of the MYST histone acetylase protein family involved with the p53 pathway and chromatin organization as well as with the suppression of epithelial to mesenchymal transition and tumor progression [
149]. In contrast, Teng et al. [
150] demonstrated that the
HCP5 transcribed sequence interacted with an miRNA sequence and the runt-related transcriptional regulator RUNX1 in a feedback loop to regulate the malignant behavior of glioma cells of the brain. Another noteworthy interaction was between
HCP5 and SATB1 (special AT-rich sequence binding protein 1), which is a nuclear matrix-associated DNA binding protein that functions as a chromatin organizer. SATB1 is highly expressed in aggressive breast cancer cells and promotes growth and metastasis by reprogramming gene expression [
151]. It also enhanced
HCP5 epigenetically and suppressed the oncogenic long noncoding RNA urothelial carcinoma-associated 1 (
UCA1) in breast cancer cells. Recently, Zhao and Li [
57] showed that transcription factor SP1 induced upregulation of
HCP5, which in turn promoted the development of osteosarcoma, whereas inhibition of
HCP5 expression reversed cell invasion and epithelial–mesenchymal transition. In addition,
HCP5 is overexpressed in tumor tissues of patients with lung adenocarcinoma, and it is positively correlated with poor prognosis specifically in patients who are smokers with
EGFR and
KRAS mutations [
56].
HCP5 also was overexpressed in lymph node metastasis of small cell lung cancer [
130,
152], glioma tissue [
150], colorectal cancerous tissue [
153], and cancers of the colon [
154], thyroid [
155], cervix [
156], and breast [
151,
157].
Interrogation of the TCNG Cancer Network Galaxy Database (
Table S1) produced 206 networks for genes regulating or regulated by the
HCP5 gene as estimated from publicly available cancer gene expression data. There are ~1010 genes that were predicted to interact with
HCP5 either as a child node (regulated gene) or parent node (regulating gene). In about 590 interactions,
HCP5 was the parent or regulatory node, and in the remainder (420 nodes)
HCP5 was the child or regulated node. For example,
HLA-A, -B, -G, -H, and –J were ranked as child nodes downstream of the
HCP5 parent node in four experimental arrays on breast cancer and one experiment on colon cancer. That is,
HCP5 was predicted to regulate the HLA-class I genes in those experiments. On the other hand,
HCP5 was predicted to be the regulated or child node with
NLRC5, a member of the NOD-like receptor family that acts as a transcriptional activator of MHC class I genes [
61,
62,
63], that was the parent or regulatory node in some gene expression experiments such as between adenocarcinoma and squamous cell carcinoma in non-small-cell lung carcinoma, breast cancer cell line profiles, non-Hodgkins lymphoma cell lines, complex genetic sarcomas, meningiomas, prostate cancer, and uveal melanoma primary tumors. Since there were far too many data to review here, further information on the activation or suppression of
HCP5 gene expression in many different cancers can be obtained by interrogating the TCNG Cancer Network Galaxy Database with ‘HCP5’ as the search query and following the links including those to the expression arrays at GEO. A more detailed account of
HCP5 RNA interaction in micro RNA regulatory networks in cancer is provided in the following section.
7. HCP5 lncRNA Interactions with Regulatory miRNA in Cancer
In recent years, an increasing number of lncRNA, including
HCP5, were found to have potential functions in cancer [
158,
159,
160]. The oncogenic lncRNA appear to regulate the transcription and translation of neighboring and distant genes by cis and trans-regulatory functions in a series of biological steps involving dosage compensation, genomic imprinting, and cell cycle dysregulation leading to cancer and its progression [
158,
159,
160]. A particularly important mechanism to emerge from many of these studies is the role of lncRNAs to bind with regulatory miRNA that control antioncogenic or oncogenic pathways. This three-way binding interaction between lncRNA, miRNA, and regulatory protein coding genes, such as those coding for regulatory transcription factors, has become known as the competing endogenous RNA (ceRNA) mechanism/network [
161,
162]. In this regard, association studies (
Table 3), expression data analysis, and knockdown experiments (
Table 2,
Table 4,
Table 5 and
Table 6,
Tables S4 and S5) have shown that
HCP5 can promote or suppress cancers depending on the
HCP5 allelic form and the cancer type. Since 2016, at least ten different cancer types were found to occur and/or progress by way of the
HCP5–miRNA–gene regulator interactions or the ceRNA mechanism (
Table 6).
In an integrated analysis on the dosage effect of lncRNAs in lung adenocarcinoma, Wei et al. [
130] found that the protein coding genes
CTSS, FGL2, and
PDCD1LG2 (alias
PDL2) competed with
HCP5 (ENSG00000206337) and formed a single regulatory subnet with the miRNAs miR-106b-5p and miR-17-5b. This in part confirmed a previous study that found that
HCP5 was involved in the process of lung cancer by competing with
PDL2, an immune checkpoint gene, and
FGL2, a therapeutic target to suppress carcinogenesis [
92]. This also was consistent with their finding that at least six
HCP5 SNPs, including rs3130907, were associated significantly with lung cancer susceptibility along with the novel risk SNP rs114020893 in the
lncRNA NEXN-AS1 region at 1p31.1.
HCP5 expression was positively correlated with the oncogenesis of a pathological grade of glioma tissues, and knockdown of
HCP5 exerted tumor-suppressive effects in human glioma cells by allowing an increase in expression of the miRNA tumor suppressor miR-139 [
150]. The malignant behavior of glioma cells in the brain appears to be regulated by an
HCP5–miR-139–
RUNX1 feedback loop, whereby
RUNX1 increased the promoter activities and expression of
HCP5 that binds to the tumor suppressor miRNA-139 (miR-139) and, therefore, acts as an oncogene.
HCP5 absorbed the tumor suppressor miR-139 to downregulate its expression. Upregulated
RUNX1 also inhibited apoptosis in glioma cell-lines U87 and U251.
RUNX1 down-regulation by knockdown using miR-139 as an inhibitor had the opposite effect on apoptosis and exerted tumor-suppressive effects in human glioma cells. MiR-139 inhibited
RUNX1 expression by targeting the 3′-UTR, and
HCP5 knockdown suppressed
RUNX1 expression by allowing miR-139 overexpression [
150]. Thus, it was concluded that
HCP5 promotes cell proliferation, cell migration, and invasion (motility) and inhibits apoptosis, as do many other lncRNAs participating in glioma phenotypes [
165].
RUNX1 also is known to suppress HIV reactivation in T cells, resulting in a negative correlation between
RUNX1 expression and viral load. The pharmacologic inhibition of
RUNX1 by a small molecule inhibitor, Ro5-3335, synergized with the histone deacetylase (HDAC) inhibitor SAHA (Vorinostat), enhanced the activation of latent HIV-1 in cell lines and PBMCs from patients [
166]. However, the effect of
HCP5 on
RUNX1 expression in HIV-infected cells is not known, although this effect might occur by way of epigenetic or transcriptomic regulation.
Liang et al. [
155] found that lncRNA
HCP5 promoted follicular thyroid carcinoma (FTC) progression as a competing endogenous RNA (ceRNA) sponge for miR-22-3p, mi-186-5p, and miR-216a-5p and activated alpha-2,6-sialyltransferase 2 (
ST6GAL2). Functional experiments showed that
HCP5 promoted
ST6GAL2, which in turn mediated the proliferation, migration, invasiveness, and angiogenic ability of FTC cells. In comparison, Yu et al. [
156] showed that overexpressed
HCP5 promoted the development of cervical cancer by absorbing miRNA-15a to promote expression of the MET transcription regulator MACC1 (
MACC1). MiRNA-15a overexpression in vitro inhibited
MACC1 expression and suppressed the proliferation of cervical cancer cells. In contrast, miRNA-15a knockdown experiments or absorption by
HCP5 allowed increased
MACC1 expression and the proliferation of cervical cancer cells. Furthermore,
HCP5 and
MACC1 were overexpressed in cervical cancer tissues compared to paracancerous tissue, and the survival rate of patients with cervical cancer was negatively correlated to
HCP5 expression and positively correlated to miRNA-15a. Luciferase reporter gene assay also showed that miRNA-15a bound directly to either
HCP5 or
MACC1. Because HPV16 and HPV18 are associated strongly with cervical cancer progression, the question arises whether the
PMSP gene [
38] of
HCP5 is also expressed and what role it might have with the increased proliferation of cervical cancer cells.
Both
MACC1 and
HCP5 are expressed in gastric cancer, and their expression may be regulated by miRNAs. In a metabolic network analysis, Mo et al. [
163] found that
HCP5 was coexpressed with 34 metabolic-related protein-coding genes and five lncRNAs, and regulated by three miRNAs, miR-128, miR-101, and miR-103a, that were downregulated in gastric cancer. The
TOPORS-AS1 lncRNA and the NADH ubiquinone oxireductase subunit B6 (
NDUFB6) coding gene that were associated with
HCP5 in the network analysis were downregulated in gastric cancer samples [
163]. In some cancers, TGFbeta might induce
HCP5 transcription via the activity of SMAD3 [
56], and increased levels of
HCP5 RNA might either directly or indirectly affect
GSR,
ASCL1,
MET,
GRM8,
DACHI [
152],
ETN3A1,
ETN3A3, CCDC50, HERC6, TAP1, and
PSMB9 [
164] as well as many other genes (
Table 6).
Based on a data expression analysis, Olgun et al. [
157] found that
HCP5 was one of seven lncRNA that interacted with miRNA in breast cancer.
HCP5 bound to miR-155 at the hub of the ceRNA network of interactions in the basal subtype of breast cancer. This basal subtype in breast cancer was characterized by a positive correlation between immune cell infiltration and aggressiveness with a key role for interferon signaling and the induction of cell proliferation by the complement cascade. Olgun et al. [
157] detected
C2, C3, C3AR1, C4A, and
C7 complement genes in the basal ceRNA interactions, suggesting that the complement cascade pathway may be significant for progression of the basal subtype.
In an analysis of differentially expressed profiles of lncRNAs and mRNA in ceRNA networks during transformation of diffuse large B cell lymphoma (DLBCL), Tian et al. [
164] identified
HCP5 as a key regulator interacting with many miRNA and protein coding genes associated with transcription (
KLF2), cell adhesion and proliferation (
CD47), lipid metabolism (
BTN3A1), and the adaptive immune response (
TAP1, PSMB9). However, they concluded that the molecular function of
HCP5 remained unknown.
HCP5 has been associated with various other cancers including cutaneous melanoma [
167], HPV-infected head and neck squamous cell carcinoma [
168], squamous cell carcinoma cells [
169], HCV-induced liver cancer [
170], and upper tract urothelial carcinoma [
171] in which ceRNA cross-talk has yet to be tested. The ceRNA network and cross-talk mechanism also might have a role in autoimmune diseases such as idiopathic thrombocytopenia [
172], viral infections, and
HCP5-associated phenotypes (
Table 3,
Table 4 and
Table 6) that, as yet, have not been examined for interactions with miRNA and with the other lncRNA regulators and protein coding genes.
8. Perspective
Since the discovery of
HCP5 in 1993 [
27], a large amount of data has been gathered about its expression, function, and disease associations. Much of this data, however, is buried in large datasets that require considerable effort to locate, analyze, validate, and interpret [
89,
90,
137,
138,
139] (
Table S1). Nevertheless, sufficient amounts of published and unpublished data that were retrieved from online public databanks for this review revealed that
HCP5 had important roles in health and disease, particularly with respect to its role as a ceRNA regulator and biomarker in autoimmune diseases and cancer. GWAS indicated that the
HCP5 SNV rs2395029 was a potential marker for abacavir-induced hypersensitivity, a marker for
HLA-B*57:01 in populations of mainly Caucasian or Hispanic descent, as well as flucloxacillin drug liver injury, HIV control, and psoriasis and psoriatic arthritis (
Table 3). Other SNVs within the
HCP5 gene or within 2.5 kb of the 5′ or 3′ UTR region are useful association markers for various diseases including myositis, herpes infection, cancer, and risk of relapse after transplantation (
Table 3)
. Methylomic studies have associated
HCP5 strongly with HIV progression, SLE, ankylosing spondylitis (AS) and obesity. Various expression studies have shown that
HCP5 is a useful biomarker for interferon and IL28-related inflammatory response, monocyte response to influenza A infection, and various cancers (
Table 4,
Table 6, and
Table S5). Many of the
HCP5-associated diseases such as SLE, AS, psoriasis and psoriatic arthritis, myositis, obesity, and cancer are associated also with accelerated aging, morbidity, and mortality.
The
HCP5 gene within the MHC class I genomic region has evolved by exaptation from an ancient endogenous retroviral and gained a new regulatory function by sequestering the MHC promoter and enhancer region from a fragmented/deleted ancient HLA class I gene. It appears to have generated the
PMSP sequence that codes for a 61 aa peptide that binds to the HPV L2 capsid protein for viral assembly [
41]. The function of the PMSP peptide in different cell types other than its interaction with HPV in keratinocytes still needs to be determined and whether the
PMSP gene is regulated or continuously expressed along with the
HCP5 transcript. The
HCP5 promoter also contains a 22 nt
RUNX1 sequence (
Table S3) that might contribute to the interaction between the
HCP5 transcript and the Runx transcription factor in glioma and in the monocytes of HIV patients. Because the
HCP5 promoter has many of the canonical TF binding sites of the HLA class I promoter (
Table S3),
HCP5 is often expressed concomitantly with classical and nonclassical class I genes, unless they are differentially separated from each other by epigenomic or ceRNA regulatory processes. For example, in some instances,
HCP5 might be upregulated, whereas class I HLA genes are downregulated as exemplified in the study of humoral immune response to influenza [
124]. While it is notable that
HCP5 is often in LD with the HLA class I genes and that their expression is often coordinated in response to various stimulators or suppressors, functional interactions between the products of these genes suggest that
HCP5 has an associated role in antigen processing and presentation, the proteasome, graft versus host disease, allograft rejection, autoimmune disease, response to type 1 interferon or interferon gamma, and in the regulation of viral reproduction, especially in HIV restriction [
6,
14,
24,
50,
52,
107]. The mechanisms by which HLA class I genes and the lncRNA genes
HCP5 and
ZNRD1 might interact with HIV also are worth investigating further because these lncRNAs could be exploited therapeutically using small RNA inhibitors [
109,
110]. Similarly, the interaction of
HCP5, microRNA, and protein coding genes in cancer (
Table 6) suggests that
HCP5 could be targeted for knockdown or knockout in antitumor therapeutics.
It is evident from the results and reports presented in this review that
HCP5 contributes to regulating viral and autoimmune diseases and cancer, and it can be upregulated or downregulated depending on its response to various exogenous or endogenous stimulators or suppressors. One shared feature of the investigated cancerous and noncancerous diseases is hypomethylation of the
HCP5 gene and upregulation of its transcript, which highlights the enormous potential of this lncRNA as a diagnostic biomarker in these pathologies. The
HCP5 gene sequence is a differentially methylated site associated with the epigenetic regulation of some disease phenotypes, such as obesity [
64] and SLE [
132], and it may also act in response to fetal development [
128], aging [
121], HIV [
123] or influenza infection, and vaccination [
124]. Presumably, hypomethylation of
HCP5 leads to added interactions and connectivity with protein and other RNA sequences, especially with microRNA regulators. Although increased
HCP5 levels seem to be a common event upon viral infection and in response to interferon stimulation and some cancers, the consequences of
HCP5 upregulation are diverse. The mode of
HCP5 action, whether acting as a promoter or suppressor, probably depends on the occurring downstream events. From the mechanistic point of view,
HCP5 either acts as a differentially methylated site involved in epigenetic regulation, or it is involved in transcriptional regulation of protein coding genes by sequestering miRNA as reported for a variety of different cancers (
Table 6). Although transcriptional regulation by sequestration of transcription factors or regulatory miRNA seems to be the predominant mode of action by
HCP5 in cancerous diseases, it might also act as a specific or nonspecific ‘sponge’ for miRNAs in noncancerous diseases. The involvement of
HCP5 lncRNA in various human diseases and cancers underscores the importance of understanding its functions in the ceRNA networks as an important step towards future drug development. Perturbations in cellular regulatory functions due to interactions between
HCP5 and transcription factors probably contribute to some malignancies. For instance, recent studies of the interaction between
HCP5 and Runx family proteins suggests that they may play key roles in stem cell biology, particularly in regulating apoptosis and the G0/G1 transition by way of the ceRNA networks [
150]. Runx1 increases the promoter activities and expression of
HCP5, and it is also known to suppress HIV reactivation in T cells perhaps by way of
HCP5 mediation [
166]. In many of the recent studies, the function of the
HCP5 gene is described as a defense-response gene often acting in unison with the inflammatory, innate, and adaptive immune response systems. However,
HCP5 also has disease progression and oncogenic effects that suggest that it may act like a double-edged sword to defend and to attack depending on other endogenous and exogenous regulators in the pathway.
The absence of the
HCP5 gene from the MHC genomic region of chimpanzees and gorillas and some human haplotypes that carry the
HLA-B*48:01 allele appears not to be deleterious or life-threatening [
85]. Great apes and humans without the
HCP5 gene seem to live in relatively good health, which suggests that the deletion, if not harmful, might even confer some selective advantage [
173]. Nevertheless, it is evident from an exploration of the recent literature and accumulating public databases that
HCP5 has many regulatory or associated functions at the genetic, molecular, and physiological levels. It appears to interact with numerous other genes and/or their products by way of intermediaries such as transcription factors and miRNA,
LTR16B2 genomic repeats, and as an enhancer or superenhancer in the regulation of other genes and in chromatid structural changes. There is still much to learn about the ceRNA actions of
HCP5, and many of the future surprises about this hybrid MHC class I endogenous retroviral gene undoubtedly will arise from knockdown, knockout, and knockin gene expression studies. The human homologous
HLA-B*48:01 haplotype is a naturally occurring
HCP5-MICB deletion or knockout haplotype in the human population that warrants genetic and epidemiological analysis to help elucidate the importance of these genes for most humans who still have the intact functional versions. Although the large databanks, datasets, and available publications have provided important insights into the functions of
HCP5, much work still remains in order to elucidate the actual mechanisms and role of this intriguing MHC class I hybrid retroelement in immunity, health, and disease. More detailed studies are needed, particularly applying knockdown, knockout, and knockup studies, to find out more about its function as a regulator of various molecular and biological processes related to health and disease and the MHC.