Details for: THAP2

Gene ID: 83591

Gene Type:  Protein-coding  - A gene that serves as a template for producing a messenger RNA (mRNA) molecule, which is then translated into a functional protein.

Symbol: THAP2

Ensembl ID: ENSG00000173451

Description: THAP domain containing 2

Cell Significance Landscape

Associated with

Significant Cells

Cell Significance Index (CSI) scores for the chosen context(s)

  • group 3 innate lymphoid cell CL0001071
    CSI 6.86
    rCSI 5.15%
    PRS 94.5
  • endothelial cell of artery CL1000413
    CSI 4.51
    rCSI 6.61%
    PRS 93.55
  • skin fibroblast CL0002620
    CSI 3.77
    rCSI 3.25%
    PRS 91.38
  • IgG plasma cell CL0000985
    CSI 3.35
    rCSI 4.02%
    PRS 94.86
  • plasmablast CL0000980
    CSI 3.23
    rCSI 2.54%
    PRS 93.8
  • type B pancreatic cell CL0000169
    CSI 3.19
    rCSI 7.06%
    PRS 91.83
  • perivascular cell CL4033054
    CSI 3.16
    rCSI 4.32%
    PRS 94.62
  • basal cell of prostate epithelium CL0002341
    CSI 3.07
    rCSI 8.87%
    PRS 93.87
  • IgA plasma cell CL0000987
    CSI 3.01
    rCSI 3.08%
    PRS 93.07
  • CD4-positive helper T cell CL0000492
    CSI 2.95
    rCSI 2.24%
    PRS 97.45
  • caudal ganglionic eminence derived cortical interneuron CL4023064
    CSI 2.92
    rCSI 5.16%
    PRS 79.84
  • neural crest cell CL0011012
    CSI 2.91
    rCSI 2.3%
    PRS 85.86
  • interneuron CL0000099
    CSI 2.89
    rCSI 5.8%
    PRS 86.64
  • pancreatic A cell CL0000171
    CSI 2.87
    rCSI 3.01%
    PRS 93.48
  • mesodermal cell CL0000222
    CSI 2.79
    rCSI 3.35%
    PRS 90.92
  • elicited macrophage CL0000861
    CSI 2.66
    rCSI 2.44%
    PRS 95.51
  • early lymphoid progenitor CL0000936
    CSI 2.52
    rCSI 2.21%
    PRS 94.71
  • astrocyte of the cerebral cortex CL0002605
    CSI 2.51
    rCSI 5.63%
    PRS 80.68
  • ciliated epithelial cell CL0000067
    CSI 2.5
    rCSI 2.2%
    PRS 83.95
  • keratinocyte CL0000312
    CSI 2.5
    rCSI 2.09%
    PRS 91.73
  • pulmonary ionocyte CL0017000
    CSI 2.42
    rCSI 2.94%
    PRS 94.75
  • granulocyte CL0000094
    CSI 2.4
    rCSI 3.67%
    PRS 95.18
  • pvalb GABAergic cortical interneuron CL4023018
    CSI 2.12
    rCSI 2.63%
    PRS 78.19
  • neuroblast (sensu Vertebrata) CL0000031
    CSI 2.11
    rCSI 2.7%
    PRS 88.63
  • megakaryocyte-erythroid progenitor cell CL0000050
    CSI 2.02
    rCSI 1.82%
    PRS 90.79
  • neuroblast (sensu Nematoda and Protostomia) CL0000338
    CSI 1.9
    rCSI 2.19%
    PRS 85.1
  • cerebral cortex GABAergic interneuron CL0010011
    CSI 1.81
    rCSI 5.36%
    PRS 91.59
  • T-helper 1 cell CL0000545
    CSI 1.79
    rCSI 3.23%
    PRS 97.27
  • ependymal cell CL0000065
    CSI 1.73
    rCSI 3.51%
    PRS 75.7
  • vascular associated smooth muscle cell CL0000359
    CSI 1.65
    rCSI 5.36%
    PRS 90.29
  • glioblast CL0000030
    CSI 1.57
    rCSI 2.5%
    PRS 85.19
  • progenitor cell CL0011026
    CSI 1.54
    rCSI 3.27%
    PRS 86.02
  • lamp5 GABAergic cortical interneuron CL4023011
    CSI 1.46
    rCSI 2.46%
    PRS 80.38
  • smooth muscle cell of prostate CL1000487
    CSI 1.4
    rCSI 8.24%
    PRS 95.06
  • sncg GABAergic cortical interneuron CL4023015
    CSI 1.31
    rCSI 2.11%
    PRS 81.43
  • CD1c-positive myeloid dendritic cell CL0002399
    CSI 1.21
    rCSI 1.46%
    PRS 95.7
  • lung macrophage CL1001603
    CSI 1.09
    rCSI 2.43%
    PRS 95.59
  • chandelier pvalb GABAergic cortical interneuron CL4023036
    CSI 0.67
    rCSI 2.08%
    PRS 83.36
  • L6b glutamatergic cortical neuron CL4023038
    CSI 0.66
    rCSI 2.07%
    PRS 81.66
  • effector memory CD8-positive, alpha-beta T cell, terminally differentiated CL0001062
    CSI 0.5
    rCSI 2.52%
    PRS 97.55

Cell ID: Standard Cell Ontology term used for mapping and comparing cells across experiments. Ensures consistency in analyzing cellular functions across tissues.
Fold Change: Represents the ratio of the current Cell Significance Index to the Cell Significance Index Threshold, indicating how much the gene expression has changed compared to a baseline.
Cell Significance Index: Reflects how strongly a gene is expressed in this specific cell.

Cell ID: Standard Cell Ontology term used for mapping and comparing cells across experiments. Ensures consistency in analyzing cellular functions across tissues.
Fold Change: Represents the ratio of the current Cell Significance Index to the Cell Significance Index Threshold, indicating how much the gene expression has changed compared to a baseline.
Cell Significance Index: Reflects how strongly a gene is expressed in this cell type. Calculated using techniques like effect size estimation and bootstrapping for reliability.

Cell ID: Standard Cell Ontology term used for mapping and comparing cells across experiments. Ensures consistency in analyzing cellular functions across tissues.
Fold Change: Represents the ratio of the current Cell Significance Index to the Cell Significance Index Threshold, indicating how much the gene expression has changed compared to a baseline.
Cell Significance Index: Reflects how strongly a gene is expressed in this cell type. Calculated using techniques like effect size estimation and bootstrapping for reliability.
Network Configuration

Explore relationships of the current gene. Select an Interaction Source: 'ONTOLOGY' for shared pathways (GO/Reactome) or 'STRING' for protein-protein interactions. Further refine by selecting context genes and comparing Cell Significance Index (CSI) scores between baseline and target cell types and their specific contexts.

Comma-separated if multiple.
Comma-separated if multiple.

Legend:
  • Query Gene
  • Node Color (Target Cell CSI, relative to current network):
    • Very High
    • High
    • Medium
    • Low
    • Very Low
    • CSI N/A
  • Node Size: Proportional to Target Cell CSI magnitude
  • STRING PPI Edge
  • Shared Pathway Edge (ONTOLOGY)

Loading network (please wait)...

Other Information

This section provides additional information about the gene, including a description generated by an AI language model and details about associated proteins.

## Summary [THAP2](/details-gene/83591), or THAP domain containing 2, is a protein-coding gene located on chromosome 12q21.1. It encodes a nuclear protein characterized by a THAP domain, which is a specialized C2CH-type zinc finger motif. Functional annotations indicate that [THAP2](/details-gene/83591) is involved in [DNA binding](/details-ontology/GO:0003677) and [metal ion binding](/details-ontology/GO:0046872), consistent with a role as a transcriptional regulator. It is localized within the [nucleus](/details-ontology/GO:0005634) and specifically in the [nucleolus](/details-ontology/GO:0005730). Expression data from an **Overall** context reveals its highest significance in [group 3 innate lymphoid cell](/details-cell/CL0001071), with notable expression also observed across diverse lineages including arterial endothelial cells, fibroblasts, plasma cells, and pancreatic islet cells, suggesting a multifaceted regulatory function in various biological systems. ## Cellular Roles and Expression Landscape The expression profile of [THAP2](/details-gene/83591) indicates a broad but specific functional role across multiple cell types and systems. The gene's most significant expression is observed within the immune system, particularly in [group 3 innate lymphoid cell](/details-cell/CL0001071) (CSI: 6.86), a key population of cells involved in mucosal immunity. It also shows relevance in the adaptive immune system, with significant expression in antibody-producing cells such as [IgG plasma cell](/details-cell/CL0000985) and [plasmablast](/details-cell/CL0000980). Beyond its role in immunity, [THAP2](/details-gene/83591) is a significant gene in various structural and stromal cell types. It is highly expressed in [endothelial cell of artery](/details-cell/CL1000413) and [skin fibroblast](/details-cell/CL0002620), suggesting a potential function in maintaining vascular identity and connective tissue homeostasis. Furthermore, its expression in endocrine cells like [type B pancreatic cell](/details-cell/CL0000169) and [pancreatic A cell](/details-cell/CL0000171), as well as in epithelial progenitors like [basal cell of prostate epithelium](/details-cell/CL0002341), highlights its involvement in specialized secretory and regenerative functions. The diverse range of cell types where [THAP2](/details-gene/83591) is prominently expressed, including neuronal subtypes like [interneuron](/details-cell/CL0000099), underscores its importance as a regulatory factor in multiple distinct cellular contexts. ## Pathways and Molecular Function The molecular function of [THAP2](/details-gene/83591) is defined by its ability to bind DNA and metal ions. Its annotation for [DNA binding](/details-ontology/GO:0003677) and [metal ion binding](/details-ontology/GO:0046872) is consistent with the structure of its THAP domain, which functions as a DNA-binding motif. The localization of the protein to the [nucleus](/details-ontology/GO:0005634) and specifically to the [nucleolus](/details-ontology/GO:0005730) strongly supports its role as a regulator of gene expression. Its presence in the nucleolus may suggest involvement in processes such as ribosome biogenesis or the regulation of rDNA transcription, although this remains to be experimentally validated. The initial identification and characterization of the [THAP2](/details-gene/83591) cDNA were established through large-scale sequencing projects aimed at creating a comprehensive catalog of human genes ([Link](https://doi.org/10.1101/gr.gr1547r), [Link](https://doi.org/10.1038/ng1285), [Link](https://doi.org/10.1101/gr.2596504)). ## Research Directions Based on its distinct expression pattern and annotated molecular function, several avenues for future research on [THAP2](/details-gene/83591) can be proposed. **Proposed Hypotheses:** 1. Given its top significance in [group 3 innate lymphoid cell](/details-cell/CL0001071), [THAP2](/details-gene/83591) may function as a key transcription factor required for the development, lineage stability, or effector functions of ILC3s, potentially by regulating the expression of cytokines like IL-17 and IL-22. 2. The high expression of [THAP2](/details-gene/83591) in [endothelial cell of artery](/details-cell/CL1000413) suggests a role in maintaining arterial identity, regulating vascular tone, or mediating cellular responses to hemodynamic stress. 3. Its significant expression in both [type B pancreatic cell](/details-cell/CL0000169) and [pancreatic A cell](/details-cell/CL0000171) indicates a potential role in pancreatic islet homeostasis, possibly through the transcriptional regulation of hormone production or pathways involved in cell survival and glucose sensing. **Key Experimental Approach:** To test the hypothesis regarding the role of [THAP2](/details-gene/83591) in ILC3 function, a conditional knockout mouse model could be generated using a RORγt-Cre driver to specifically delete [THAP2](/details-gene/83591) in ILC3s. The gut lamina propria of these mice could be analyzed for ILC3 population size and phenotype by flow cytometry. Functional deficits could be assessed using an intestinal infection model (e.g., *Citrobacter rodentium*), measuring cytokine production and the ability to protect the epithelial barrier. To identify its downstream targets, RNA-sequencing and chromatin immunoprecipitation sequencing (ChIP-seq) could be performed on isolated ILC3s from both knockout and wild-type mice. **Therapeutic Potential:** As a putative transcription factor with significant expression across diverse and essential cell types, [THAP2](/details-gene/83591) presents a challenging therapeutic target. Direct inhibition or activation could lead to significant off-target effects in vascular, endocrine, and immune systems. However, should its dysregulation be implicated in a specific disease, such as ILC3-mediated inflammatory bowel disease, targeting its specific downstream effectors or protein-protein interactions might offer a more precise therapeutic strategy than directly modulating the protein itself.

Genular Protein ID: 829649895

Symbol: THAP2_HUMAN

Name: THAP domain-containing protein 2

UniProtKB Accession Codes:

Database IDs:

Citations:

PubMed ID: 11230166

Title: Towards a catalog of human genes and proteins: sequencing and analysis of 500 novel complete protein coding human cDNAs.

PubMed ID: 11230166

DOI: 10.1101/gr.gr1547r

PubMed ID: 14702039

Title: Complete sequencing and characterization of 21,243 full-length human cDNAs.

PubMed ID: 14702039

DOI: 10.1038/ng1285

PubMed ID: 15489334

Title: The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC).

PubMed ID: 15489334

DOI: 10.1101/gr.2596504

Sequence Information:

  • Length: 228
  • Mass: 26260
  • Checksum: 63D886376BC5CC4E
  • Sequence:
  • MPTNCAAAGC ATTYNKHINI SFHRFPLDPK RRKEWVRLVR RKNFVPGKHT FLCSKHFEAS 
    CFDLTGQTRR LKMDAVPTIF DFCTHIKSMK LKSRNLLKKN NSCSPAGPSN LKSNISSQQV 
    LLEHSYAFRN PMEAKKRIIK LEKEIASLRR KMKTCLQKER RATRRWIKAT CLVKNLEANS 
    VLPKGTSEHM LPTALSSLPL EDFKILEQDQ QDKTLLSLNL KQTKSTFI