Details for: MUC4

Gene ID: 4585

Gene Type: Protein-coding - A gene that serves as a template for producing a messenger RNA (mRNA) molecule, which is then translated into a functional protein.

Symbol: MUC4

Ensembl ID: ENSG00000145113

Description: mucin 4, cell surface associated

Selected Context(s):  Overall

Cell Significance Landscape

Cell Significance Index (CSI) scores for the chosen context(s)

  • multi-ciliated epithelial cell CL0005012
    CSI 15.7
    rCSI 15.67%
    PRS 80.33
  • nasal mucosa goblet cell CL0002480
    CSI 15.23
    rCSI 17.66%
    PRS 87.96
  • duct epithelial cell CL0000068
    CSI 13.64
    rCSI 19.96%
    PRS 90.03
  • tracheal goblet cell CL1000329
    CSI 12.8
    rCSI 27.94%
    PRS 90.2
  • squamous epithelial cell CL0000076
    CSI 9.8
    rCSI 23.26%
    PRS 84.64
  • secretory cell CL0000151
    CSI 7.41
    rCSI 7.74%
    PRS 85.08
  • epithelial cell of lung CL0000082
    CSI 6.7
    rCSI 5.56%
    PRS 87.05
  • enterocyte of epithelium of large intestine CL0002071
    CSI 5.63
    rCSI 29.59%
    PRS 89.18
  • conjunctival epithelial cell CL1000432
    CSI 5.38
    rCSI 8.22%
    PRS 85.78
  • lung secretory cell CL1000272
    CSI 5.34
    rCSI 13.21%
    PRS 86.5
  • goblet cell CL0000160
    CSI 4.95
    rCSI 4.68%
    PRS 84.03
  • deuterosomal cell CL4033044
    CSI 4.92
    rCSI 16.62%
    PRS 81.78
  • brush cell of tracheobronchial tree CL0002075
    CSI 4.34
    rCSI 12.86%
    PRS 92.7
  • intestine goblet cell CL0019031
    CSI 4.21
    rCSI 3.74%
    PRS 83.57
  • colon epithelial cell CL0011108
    CSI 4.16
    rCSI 4.36%
    PRS 83.53
  • lung ciliated cell CL1000271
    CSI 4.06
    rCSI 4.69%
    PRS 79.55
  • IgA plasma cell CL0000987
    CSI 4
    rCSI 4.09%
    PRS 90.19
  • airway submucosal gland duct basal cell CL4033024
    CSI 3.88
    rCSI 24.81%
    PRS 88.7
  • ciliated epithelial cell CL0000067
    CSI 3.83
    rCSI 3.37%
    PRS 76.43
  • transit amplifying cell of colon CL0009011
    CSI 3.58
    rCSI 4.2%
    PRS 86.92
  • colon goblet cell CL0009039
    CSI 3.21
    rCSI 7.63%
    PRS 89.15
  • BEST4+ enterocyte CL4030026
    CSI 3.19
    rCSI 3.96%
    PRS 86.17
  • extravillous trophoblast CL0008036
    CSI 3.03
    rCSI 3.75%
    PRS 84.34
  • transit amplifying cell CL0009010
    CSI 3.03
    rCSI 4.63%
    PRS 91.18
  • ciliated cell CL0000064
    CSI 2.95
    rCSI 4.78%
    PRS 80.68
  • small intestine goblet cell CL1000495
    CSI 2.82
    rCSI 6.19%
    PRS 89.62
  • progenitor cell CL0011026
    CSI 2.75
    rCSI 5.84%
    PRS 80.04
  • club cell CL0000158
    CSI 2.73
    rCSI 4%
    PRS 81.01
  • ionocyte CL0005006
    CSI 2.71
    rCSI 2.9%
    PRS 87.91
  • respiratory suprabasal cell CL4033048
    CSI 2.7
    rCSI 3.47%
    PRS 88.4
  • pulmonary ionocyte CL0017000
    CSI 2.57
    rCSI 3.13%
    PRS 90.89
  • dendritic cell CL0000451
    CSI 2.35
    rCSI 2.9%
    PRS 87.02
  • respiratory basal cell CL0002633
    CSI 2.14
    rCSI 2.22%
    PRS 88.7
  • basal cell of epithelium of trachea CL1000348
    CSI 1.71
    rCSI 12.03%
    PRS 88.54
  • respiratory goblet cell CL0002370
    CSI 1.7
    rCSI 18.51%
    PRS 90.79
  • paneth cell CL0000510
    CSI 1.68
    rCSI 2.48%
    PRS 92.92
  • ciliated columnar cell of tracheobronchial tree CL0002145
    CSI 1.54
    rCSI 3.51%
    PRS 79.27
  • glandular epithelial cell CL0000150
    CSI 1.16
    rCSI 3.05%
    PRS 94.07
  • lung goblet cell CL1000143
    CSI 0.84
    rCSI 9.41%
    PRS 91
  • tracheobronchial serous cell CL0019001
    CSI 0.83
    rCSI 3.59%
    PRS 90.14
  • epithelial cell of urethra CL1000296
    CSI 0.49
    rCSI 12.4%
    PRS 90.51
  • intestinal crypt stem cell of colon CL0009043
    CSI 0.43
    rCSI 3.23%
    PRS 92.58
  • paneth cell of colon CL0009009
    CSI 0.36
    rCSI 3.54%
    PRS 91.23
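
The entries above follow a regular four-line pattern, so they can be loaded into records for sorting or filtering. The following is a minimal sketch, assuming the list is available as plain text in the layout shown; `CellScore`, `ENTRY`, and `parse_scores` are hypothetical names, not part of any Genular API. It also rewrites the cell IDs into the colon-separated form used by the Cell Ontology (for example `CL:0005012`). rCSI and PRS are carried through as listed, since this section does not define them.

```python
import re
from typing import NamedTuple


class CellScore(NamedTuple):
    """One entry of the list above (hypothetical record type)."""
    name: str
    cell_id: str      # Cell Ontology ID in CURIE form, e.g. "CL:0005012"
    csi: float
    rcsi_pct: float   # rCSI as listed, in percent
    prs: float


# Matches one bullet: name, CL identifier, then the CSI / rCSI / PRS lines.
ENTRY = re.compile(
    r"•\s*(?P<name>.+?)\s+CL(?P<num>\d{7})\s+"
    r"CSI\s+(?P<csi>[\d.]+)\s+"
    r"rCSI\s+(?P<rcsi>[\d.]+)%\s+"
    r"PRS\s+(?P<prs>[\d.]+)"
)


def parse_scores(text: str) -> list[CellScore]:
    """Extract every entry found in `text`, in document order."""
    return [
        CellScore(
            name=m["name"],
            cell_id=f"CL:{m['num']}",
            csi=float(m["csi"]),
            rcsi_pct=float(m["rcsi"]),
            prs=float(m["prs"]),
        )
        for m in ENTRY.finditer(text)
    ]


# Two entries copied from the list above.
sample = """
  • multi-ciliated epithelial cell CL0005012
    CSI 15.7
    rCSI 15.67%
    PRS 80.33
  • IgA plasma cell CL0000987
    CSI 4
    rCSI 4.09%
    PRS 90.19
"""

scores = parse_scores(sample)
top = max(scores, key=lambda s: s.csi)
print(top.name, top.cell_id, top.csi)
```

Keeping the numeric fields as floats makes it straightforward to rank by CSI or to filter on PRS without re-parsing the text.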

Cell ID: Standard Cell Ontology term used for mapping and comparing cells across experiments. Ensures consistency in analyzing cellular functions across tissues.
Fold Change: Represents the ratio of the current Cell Significance Index to the Cell Significance Index Threshold, indicating how much the gene expression has changed compared to a baseline.
Cell Significance Index: Reflects how strongly a gene is expressed in this cell type. Calculated using techniques like effect size estimation and bootstrapping for reliability.
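
The Fold Change definition above is a simple ratio, which can be illustrated as follows. This is a sketch only: the page does not state the Cell Significance Index Threshold, so `assumed_threshold` is a placeholder value chosen for illustration, and `fold_change` is a hypothetical helper name.

```python
def fold_change(csi: float, threshold: float) -> float:
    """Ratio of a Cell Significance Index to the CSI threshold."""
    if threshold <= 0:
        raise ValueError("threshold must be positive")
    return csi / threshold


# ASSUMPTION: the real threshold is not given on this page.
assumed_threshold = 2.0

# CSI of the multi-ciliated epithelial cell, taken from the list above.
fc = fold_change(15.7, assumed_threshold)
print(f"fold change = {fc:.2f}")
```

A value above 1 means the CSI exceeds the threshold; with a different threshold the fold change scales inversely.
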
Network Configuration

Explore relationships of the current gene. Select an Interaction Source: 'ONTOLOGY' for shared pathways (GO/Reactome) or 'STRING' for protein-protein interactions. Further refine by selecting context genes and comparing Cell Significance Index (CSI) scores between baseline and target cell types and their specific contexts.

Other Information

This section provides additional information about the gene, including a description generated by an AI language model and details about associated proteins.

## Summary

[MUC4](/details-gene/4585) is a protein-coding gene located on chromosome 3q29 that encodes Mucin-4, a large, heavily glycosylated, cell-surface associated protein. As a member of the mucin family, it contributes to protective mucous barriers, lubrication of epithelial surfaces, and cell adhesion and signaling. Its functional annotations include `cell-matrix adhesion` ([GO:0007160](https://www.ebi.ac.uk/QuickGO/term/GO:0007160)), `maintenance of gastrointestinal epithelium` ([GO:0030277](https://www.ebi.ac.uk/QuickGO/term/GO:0030277)), and notably, `Erbb-2 class receptor binding` ([GO:0005176](https://www.ebi.ac.uk/QuickGO/term/GO:0005176)), which implicates it in cellular growth and proliferation pathways. **Overall**, the expression data indicate that [MUC4](/details-gene/4585) marks a range of secretory and barrier epithelial cells, with the highest significance in [multi-ciliated epithelial cells](/details-cell/CL0005012), [nasal mucosa goblet cells](/details-cell/CL0002480), and [duct epithelial cells](/details-cell/CL0000068), consistent with a role in mucosal tissues of the respiratory and digestive tracts.

## Cellular Roles and Expression Landscape

The expression profile of [MUC4](/details-gene/4585) places it mainly in mucosal and glandular epithelia. **Overall**, it exhibits its highest significance in cell types responsible for secretion and barrier function. The top-ranked cells include [multi-ciliated epithelial cell](/details-cell/CL0005012) (CSI: 15.70), [nasal mucosa goblet cell](/details-cell/CL0002480) (CSI: 15.23), and [tracheal goblet cell](/details-cell/CL1000329) (CSI: 12.80), suggesting a role in the mucociliary clearance mechanism of the respiratory tract.

This pattern extends to other secretory and ductal systems, with high significance in [duct epithelial cell](/details-cell/CL0000068) (CSI: 13.64), [secretory cell](/details-cell/CL0000151) (CSI: 7.41), and various gastrointestinal cells such as [enterocyte of epithelium of large intestine](/details-cell/CL0002071) (CSI: 5.63) and [colon epithelial cell](/details-cell/CL0011108) (CSI: 4.16). This widespread expression across diverse epithelial linings suggests a conserved function in maintaining epithelial integrity, providing lubrication, and protecting underlying tissues from chemical and microbial damage. The list is dominated by epithelial lineages: no neural or mesenchymal cell types appear, and the only hematopoietic entries are IgA plasma cell (CSI: 4.00) and dendritic cell (CSI: 2.35). This points to a specialized function at the interface between the body and the external environment.

## Pathways and Molecular Function

The molecular functions of [MUC4](/details-gene/4585) are closely tied to its nature as a large, transmembrane glycoprotein. Its extensive post-translational modification is reflected in Reactome pathways related to glycosylation, such as `O-linked glycosylation of mucins` ([R-HSA-913709](https://reactome.org/content/detail/R-HSA-913709)) and `Diseases of glycosylation` ([R-HSA-3781865](https://reactome.org/content/detail/R-HSA-3781865)). These modifications are essential for its role as a lubricant and structural component of the `extracellular matrix` ([GO:0031012](https://www.ebi.ac.uk/QuickGO/term/GO:0031012)).

Beyond its structural duties, [MUC4](/details-gene/4585) participates in cell signaling. Its annotated ability for `Erbb-2 class receptor binding` ([GO:0005176](https://www.ebi.ac.uk/QuickGO/term/GO:0005176)) positions it as a modulator of the ERBB2 (HER2) signaling pathway, a key regulator of cell proliferation and survival. This interaction is of particular interest in oncology, as aberrant [MUC4](/details-gene/4585) expression has been linked to cancer progression ([Link](https://doi.org/10.1016/s0079-6603(02)71043-x), [Link](https://doi.org/10.1136/jcp.2004.023572)). Its involvement in the `innate immune system` ([R-HSA-168249](https://reactome.org/content/detail/R-HSA-168249)) is consistent with its function as a physical barrier that prevents pathogen entry at mucosal surfaces. The gene's roles are further supported by its annotations for `cell-matrix adhesion` ([GO:0007160](https://www.ebi.ac.uk/QuickGO/term/GO:0007160)) and the `maintenance of gastrointestinal epithelium` ([GO:0030277](https://www.ebi.ac.uk/QuickGO/term/GO:0030277)), consistent with its expression in gut epithelia.

## Research Directions

The dual role of [MUC4](/details-gene/4585) as a protective barrier protein in healthy epithelia and a signaling modulator in cancer progression presents several avenues for future research. While the provided data focuses on a general context, the literature indicates that its expression is frequently dysregulated in malignancies such as pancreatic and lung cancer ([Link](https://doi.org/10.1136/jcp.2004.023572), [Link](https://doi.org/10.1016/j.prp.2006.04.002)). This suggests a shift from a homeostatic to a pathogenic role depending on the cellular context.

Based on its known functions, the following testable hypotheses can be proposed:

1. **Hypothesis on Cancer Progression:** Overexpression of [MUC4](/details-gene/4585) in pancreatic ductal adenocarcinoma cells promotes chemoresistance and metastatic potential by sterically hindering drug access and activating the ERBB2 survival pathway upon ligand binding.
2. **Hypothesis on Host-Pathogen Interaction:** In the respiratory tract, proteolytic cleavage of the extracellular domain of [MUC4](/details-gene/4585) by bacterial or viral proteases represents a key mechanism for pathogens to degrade the protective mucus layer and gain access to underlying epithelial cells.

An experiment to test the first hypothesis could use a CRISPR-Cas9-mediated knockout of the gene in a [MUC4](/details-gene/4585)-positive pancreatic cancer cell line (e.g., Capan-1). The resulting knockout and control cell lines would then be treated with standard-of-care chemotherapeutics (e.g., gemcitabine). Cell viability assays (MTT or CellTiter-Glo) would quantify differences in drug sensitivity, while Transwell invasion assays would assess changes in metastatic potential. Parallel western blot analysis for phosphorylated ERBB2 and its downstream effectors (AKT, ERK) would show whether the knockout alters this signaling pathway.

Given its cell surface localization and frequent overexpression in tumors relative to most healthy tissues, [MUC4](/details-gene/4585) is a candidate therapeutic target. The strategy would focus on **inhibition** or targeted cell killing. One option is to develop antibody-drug conjugates (ADCs) that selectively deliver cytotoxic agents to [MUC4](/details-gene/4585)-expressing cancer cells. Alternatively, monoclonal antibodies that block the interaction between [MUC4](/details-gene/4585) and ERBB2 could serve as a targeted therapy to suppress tumor growth, potentially in combination with existing ERBB2 inhibitors.

Genular Protein ID: 2500012748

Symbol: MUC4_HUMAN

Name: Mucin-4

UniProtKB Accession Codes:

Database IDs:

Citations:

  • PubMed ID: 10880978. Alternative splicing generates a family of putative secreted and membrane-associated MUC4 mucins. DOI: 10.1046/j.1432-1327.2000.01504.x
  • PubMed ID: 10920259. Human MUC4 mucin cDNA and its variants in pancreatic carcinoma. DOI: 10.1093/oxfordjournals.jbchem.a022746
  • PubMed ID: 12084055. Cloning, chromosomal localization and characterization of the murine mucin gene orthologous to human MUC4. DOI: 10.1046/j.1432-1033.2002.02988.x
  • PubMed ID: 10024507. Complete sequence of the human mucin MUC4: a putative cell membrane-associated mucin. DOI: 10.1042/bj3380325
  • PubMed ID: 12153560. Genomic organization of MUC4 mucin gene: towards the characterization of splice variants. DOI: 10.1046/j.1432-1033.2002.03032.x
  • PubMed ID: 1673336. Molecular cloning and chromosomal localization of a novel human tracheo-bronchial mucin cDNA containing tandemly repeated sequences of 48 base pairs. DOI: 10.1016/0006-291x(91)91580-6
  • PubMed ID: 16641997. The DNA sequence, annotation and analysis of human chromosome 3. DOI: 10.1038/nature04728
  • PubMed ID: 9620877. Human mucin gene MUC4: organization of its 5'-region and polymorphism of its central tandem repeat array. DOI: 10.1042/bj3320739
  • PubMed ID: 12102554. Muc4/sialomucin complex, the intramembrane ErbB2 ligand, in cancer and epithelia: to protect and to survive. DOI: 10.1016/s0079-6603(02)71043-x
  • PubMed ID: 16049287. MUC4 expression is a novel prognostic factor in patients with invasive ductal carcinoma of the pancreas. DOI: 10.1136/jcp.2004.023572
  • PubMed ID: 16814944. MUC4 expression and its relation to ErbB2 expression, apoptosis, proliferation, differentiation, and tumor stage in non-small cell lung cancer (NSCLC). DOI: 10.1016/j.prp.2006.04.002
  • PubMed ID: 16914178. MUC4 expression and localization in gastrointestinal tract and skin of human embryos. DOI: 10.1016/j.tice.2006.06.004
  • PubMed ID: 18780401. Identification of N-linked glycoproteins in human milk by hydrophilic interaction liquid chromatography and mass spectrometry. DOI: 10.1002/pmic.200701057

Sequence Information:

  • Length: 5412
  • Mass: 542307
  • Checksum: CA70610C2C0FC5B0
  • Sequence:
  • MKGARWRRVP WVSLSCLCLC LLPHVVPGTT EDTLITGSKT AAPVTSTGST TATLEGQSTA 
    ASSRTSNQDI SASSQNHQTK STETTSKAQT DTLTQMMTST LFSSPSVHNV METAPPDEMT 
    TSFPSSVTNT LMMTSKTITM TTSTDSTLGN TEETSTAGTE SSTPVTSAVS ITAGQEGQSR 
    TTSWRTSIQD TSASSQNHWT RSTQTTRESQ TSTLTHRTTS TPSFSPSVHN VTGTVSQKTS 
    PSGETATSSL CSVTNTSMMT SEKITVTTST GSTLGNPGET SSVPVTGSLM PVTSAALVTF 
    DPEGQSPATF SRTSTQDTTA FSKNHQTQSV ETTRVSQINT LNTLTPVTTS TVLSSPSGFN 
    PSGTVSQETF PSGETTTSSP SSVSNTFLVT SKVFRMPTSR DSTLGNTEET SLSVSGTISA 
    ITSKVSTIWW SDTLSTALSP SSLPPKISTA FHTQQSEGAE TTGRPHERSS FSPGVSQEIF 
    TLHETTTWPS SFSSKGHTTW SQTELPSTST GAATRLVTGN PSTGTAGTIP RVPSKVSAIG 
    EPGEPTTYSS HSTTLPKTTG AGAQTQWTQE TGTTGEALLS SPSYSVTQMI KTATSPSSSP 
    MLDRHTSQQI TTAPSTNHST IHSTSTSPQE SPAVSQRGHT QAPQTTQESQ TTRSVSPMTD 
    TKTVTTPGSS FTASGHSPSE IVPQDAPTIS AATTFAPAPT GDGHTTQAPT TALQAAPSSH 
    DATLGPSGGT SLSKTGALTL ANSVVSTPGG PEGQWTSASA STSPDTAAAM THTHQAESTE 
    ASGQTQTSEP ASSGSRTTSA GTATPSSSGA SGTTPSGSEG ISTSGETTRF SSNPSRDSHT 
    TQSTTELLSA SASHGAIPVS TGMASSIVPG TFHPTLSEAS TAGRPTGQSS PTSPSASPQE 
    TAAISRMAQT QRTRTSRGSD TISLASQATD TFSTVPPTPP SITSTGLTSP QTETHTLSPS 
    GSGKTFTTAL ISNATPLPVT YASSASTGHT TPLHVTDASS VSTGHATPLP VTSPSSVSTG 
    HTTPLPVTDT SSESTGHVTP LPVTSFSSAS TGDSTPLPVT DTSSASTGHV TPLPVTSLSS 
    ASTGDTTPLP VTDTSSASTG HATSLPVTDT SSVSTGHTTP LPVTDTSSAS TGHATSLPVT 
    DTSSVSTGHT TPLHVTDASS ASTGQATPLP VTSLSSVSTG DTTPLPVTSP SSASTGHATP 
    LLVTDTSSAS TGHATPLPVT DASSVSTDHA TSLPVTIPSA ASTGHTTPLP VTDTSSASTG 
    QATSLLVTDT SSVSTGDTTP LPVTSTSSAS TGHVTPLHVT SPSSASTGHA TPLPVTSLSS 
    ASTGDTMPLP VTSPSSASTG DTTPLPVTDA SSVSTGHTTP LHVTDASSAS TGQATPLPVT 
    SLSSVSTGDT TPLPVTSPSS ASTGHATPLL VTDTSSASTG HATPLPVTDA SSVSTDHATS 
    LPVTIPSAAS TGHTTPLPVT DTSSASTGQA TSLLVTDTSS VSTGDTTPLP VTSTSSASTG 
    HVTPLHVTSP SSASTGHATP LPVTSLSSAS TGDTMPLPVT SPSSASTGDT TPLPVTDASS 
    VSTGHTTPLP VTSPSSASTG HTTPLPVTDT SSASKGDTTP LPVTSPSSAS TGHTTPLPVT 
    DTSSASTGDT TPLPVTNASS LSTGHATPLH VTSPSSASTG HATPLPVTST SSASTGHATP 
    LPVTGLSSAT TDDTTRLPVT DVSSASTGQA TPLPVTSLSS VSTGDTTPLP VTSPSSASTG 
    HASPLLVTDA SSASTGQATP LPVTDTSSVS TAHATPLPVT GLSSASTDDT TRLPVTDVSS 
    ASTGQAIPLP VTSPSSASTG DTTPLPVTDA SSASTGDTTS LPVTIPSSAS SGHTTSLPVT 
    DASSVSTGHA TSLLVTDASS VSTGDTTPLP VTDTNSASTG DTTPLHVTDA SSVSTGHATS 
    LPVTSLSSAS TGDTTPLPVT SPSSASSGHT TPLPVTDASS VPTGHATSLP VTDASSVSTG 
    HATPLPVTDA SSVSTGHATP LPVTDTSSVS TGQATPLPVT SLSSASTGDT TPLPVTDTSS 
    ASTGQDTPLP VTSLSSVSTG DTTPLPVTNP SSASTGHATP LLVTDASSIS TGHATSLLVT 
    DASSVSTGHA TALHDTDASS LSTGDTTPLP VTSPSSTSTG DTTPLPVTET SSVSTGHATS 
    LPVTDTSSAS TGHATSLPVT DTSSASTGHA TPLPVTDTSS ASTGQATPLP VTSPSSASTG 
    HAIPLLVTDT SSASTGQATP LPVTSLSSAS TGDTTPLPVT DASSVSTGHA TSLPVTSLSS 
    VSTGDTTPLP VTSPSSASTG HATPLHVTDA SSASTGHATP LPVTSLSSAS TGDTTPLPVT 
    SPSSASTGHA TPLHVTDASS VSTGDTTPLP VTSSSSASSG HTTPLPVTDA SSASTGDTTP 
    LPVTDTSSAS TGHATHLPVT GLSSASTGDT TRLPVTNVSS ASTGHATPLP VTSTSSASTG 
    DTTPLPGTDT SSVSTGHTTP LLVTDASSVS TGDTTRLPVT SPSSASTGHT TPLPVTDTPS 
    ASTGDTTPLP VTNASSLSTR HATSLHVTSP SSASTGHATS LPVTDTSAAS TGHATPLPVT 
    STSSASTGDT TPLPVTDTYS ASTGQATPLP VTSLSSVSTG DTTPLPVTSP SSASTGHATP 
    LLVTDASSAS TGQATPLPVT SLSSVSTGDT TPLPVTSPSS ASTGHATSLP VTDTSSASTG 
    DTTSLPVTDT SSAYTGDTTS LPVTDTSSSS TGDTTPLLVT ETSSVSTGDT TPLPVTDTSS 
    ASTGHATPLP VTNTSSVSTG HATPLHVTSP SSASTGHTTP LPVTDASSVS TGHATSLPVT 
    DASSVFTGHA TSLPVTIPSS ASSGHTTPLP VTDASSVSTG HATSLPVTDA SSVSTGHATP 
    LPVTDASSVS TGHATPLPLT SLSSVSTGDT TPLPVTDTSS ASTGQATPLP VTSLSSVSTG 
    DTTPLPVTDT SSASTGHATS LPVTDTSSAS TGHATPLPDT DTSSASTGHA TLLPVTDTSS 
    ASIGHATSLP VTDTSSISTG HATPLHVTSP SSASTGHATP LPVTDTSSAS TGHANPLHVT 
    SPSSASTGHA TPLPVTDTSS ASTGHATPLP VTSLSSVSTG DTTPLPVTSP SSASTGHTTP 
    LPVTDTSSAS TGQATALPVT STSSASTGDT TPLPVTDTSS ASTGQATPLP VTSLSSVSTG 
    DTTPLPVTSP SSASTGHATP LLVTDASSAS TGQATPLPVT SLSSVSTGDT TPLPVTSPSS 
    ASTGHATSLP VTDTSSASTG DTTSLPVTDT SSAYTGDTTS LPVTDTSSSS TGDTTPLLVT 
    ETSSVSTGHA TPLLVTDASS ASTGHATPLH VTSPSSASTG DTTPVPVTDT SSVSTGHATP 
    LPVTGLSSAS TGDTTRLPVT DISSASTGQA TPLPVTNTSS VSTGDTMPLP VTSPSSASTG 
    HATPLPVTST SSASTGHATP VPVTSTSSAS TGHTTPLPVT DTSSASTGDT TPLPVTSPSS 
    ASTGHTTPLH VTIPSSASTG DTSTLPVTGA SSASTGHATP LPVTDTSSVS TGHATPLPVT 
    SLSSVSTGDT TPLPVTDASS ASTGQATPLP VTSLSSVSTG DTTPLLVTDA SSVSTGHATP 
    LPVTDTSSAS TGDTTRLPVT DTSSASTGQA TPLPVTSLSS VSTGDTTPLL VTDASSVSTG 
    HATPLPVTDT SSASTGDTTR LPVTDTSSAS TGQATPLPVT IPSSSSSGHT TPLPVTSTSS 
    VSTGHVTPLH VTSPSSASTG HVTPLPVTST SSASTGHATP LLVTDASSVS TGHATPLPVT 
    DASSASTGDT TPLPVTDTSS ASTGQATPLP VTSLSSVSTG DTTPLPVTDA SSASTGHATP 
    LPVTIPSSVS TGDTMPLPVT SPSSASTGHA TPLPVTGLSS ASTGDTTPLP VTDTSSASTR 
    HATPLPVTDT SSASTDDTTR LPVTDVSSAS TGHATPLPVT STSSASTGDT TPLPVTDTSS 
    VSTGHATSLP VTSRSSASTG HATPLPVTDT SSVSTGHATP LPVTSTSSVS TGHATPLPVT 
    SPSSASTGHA TPVPVTSTSS ASTGDTTPLP VTNASSLSTG HATPLHVTSP SSASRGDTST 
    LPVTDASSAS TGHATPLPLT SLSSVSTGDT TPLPVTDTSS ASTGQATPLP VTSLSSVSTG 
    DTTPLPVTIP SSASSGHTTS LPVTDASSVS TGHGTPLPVT STSSASTGDT TPLPVTDTSS 
    ASTGHATPLP VTDTSSASTG HATPLPVTSL SSVSTGHATP LAVSSATSAS TVSSDSPLKM 
    ETPGMTTPSL KTDGGRRTAT SPPPTTSQTI ISTIPSTAMH TRSTAAPIPI LPERGVSLFP 
    YGAGAGDLEF VRRTVDFTSP LFKPATGFPL GSSLRDSLYF TDNGQIIFPE SDYQIFSYPN 
    PLPTGFTGRD PVALVAPFWD DADFSTGRGT TFYQEYETFY GEHSLLVQQA ESWIRKMTNN 
    GGYKARWALK VTWVNAHAYP AQWTLGSNTY QAILSTDGSR SYALFLYQSG GMQWDVAQRS 
    GNPVLMGFSS GDGYFENSPL MSQPVWERYR PDRFLNSNSG LQGLQFYRLH REERPNYRLE 
    CLQWLKSQPR WPSWGWNQVS CPCSWQQGRR DLRFQPVSIG RWGLGSRQLC SFTSWRGGVC 
    CSYGPWGEFR EGWHVQRPWQ LAQELEPQSW CCRWNDKPYL CALYQQRRPH VGCATYRPPQ 
    PAWMFGDPHI TTLDGVSYTF NGLGDFLLVG AQDGNSSFLL QGRTAQTGSA QATNFIAFAA 
    QYRSSSLGPV TVQWLLEPHD AIRVLLDNQT VTFQPDHEDG GGQETFNATG VLLSRNGSEV 
    SASFDGWATV SVIALSNILH ASASLPPEYQ NRTEGLLGVW NNNPEDDFRM PNGSTIPPGS 
    PEEMLFHFGM TWQINGTGLL GKRNDQLPSN FTPVFYSQLQ KNSSWAEHLI SNCDGDSSCI 
    YDTLALRNAS IGLHTREVSK NYEQANATLN QYPPSINGGR VIEAYKGQTT LIQYTSNAED 
    ANFTLRDSCT DLELFENGTL LWTPKSLEPF TLEILARSAK IGLASALQPR TVVCHCNAES 
    QCLYNQTSRV GNSSLEVAGC KCDGGTFGRY CEGSEDACEE PCFPSVHCVP GKGCEACPPN 
    LTGDGRHCAA LGSSFLCQNQ SCPVNYCYNQ GHCYISQTLG CQPMCTCPPA FTDSRCFLAG 
    NNFSPTVNLE LPLRVIQLLL SEEENASMAE VNASVAYRLG TLDMRAFLRN SQVERIDSAA 
    PASGSPIQHW MVISEFQYRP RGPVIDFLNN QLLAAVVEAF LYHVPRRSEE PRNDVVFQPI 
    SGEDVRDVTA LNVSTLKAYF RCDGYKGYDL VYSPQSGFTC VSPCSRGYCD HGGQCQHLPS 
    GPRCSCVSFS IYTAWGEHCE HLSMKLDAFF GIFFGALGGL LLLGVGTFVV LRFWGCSGAR 
    FSYFLNSAEA LP
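
The sequence above is printed in space-separated blocks of ten residues, which must be joined before the listed Length can be checked or any composition computed. The sketch below is a minimal illustration under that assumption; `join_blocks` and `residue_fraction` are hypothetical helper names. It uses only the first line of the sequence as input, and reports the share of serine and threonine, the residues that carry O-linked glycans. It does not reproduce the Mass or Checksum fields.

```python
from collections import Counter


def join_blocks(grouped: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(grouped.split())


def residue_fraction(seq: str, residues: str = "ST") -> float:
    """Fraction of `seq` made up of the given one-letter residues."""
    if not seq:
        raise ValueError("sequence is empty")
    counts = Counter(seq)
    return sum(counts[r] for r in residues) / len(seq)


# First line of the MUC4_HUMAN sequence as printed above.
first_line = (
    "MKGARWRRVP WVSLSCLCLC LLPHVVPGTT EDTLITGSKT AAPVTSTGST TATLEGQSTA"
)

seq = join_blocks(first_line)
print(len(seq))                           # number of residues in this fragment
print(f"{residue_fraction(seq):.2f}")     # Ser + Thr share of the fragment
```

Applied to the full sequence text, `len(join_blocks(...))` can be compared against the Length field (5412 for this record) as a quick integrity check on a copied sequence.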

Genular Protein ID: 75687301

Symbol: A0A0G2JS65_HUMAN

Name: N/A

UniProtKB Accession Codes:

Database IDs:

Sequence Information:

  • Length: 7418
  • Mass: 734618
  • Checksum: E9A9AE68920B0159
  • Sequence:
  • MKGARWRRVP WVSLSCLCLC LLPHVVPGTT EDTLITGSKT PAPVTSTGST TATLEGQSTA 
    ASSRTSNQDI SASSQNHQTK STETTSKAQT DTLTQMMTST LFSSPSVHNV METVTQETAP 
    PDEMTTSFPS SVTNTLMMTS KTITMTTSTD STLGNTEETS TAGTESSTPV TSAVSITAGQ 
    EGQSRTTSWR TSIQDTSASS QNHWTRSTQT TRESQTSTLT HRTTSTPSFS PSVHNVTGTV 
    SQKTSPSGET ATSSLCSVTN TSMMTSEKIT VTTSTGSTLG NPGETSSVPV TGSLMPVTSA 
    ALVTVDPEGQ SPVTFSRTST QDTTAFSKNH QTQSVETTRV SQINTLNTLT PVTTSTVLSS 
    PSGFNPSGTV SQETFPSGET TISSPSSVSN TFLVTSKVFR MPISRDSTLG NTEETSLSVS 
    GTISAITSKV STIWWSDTLS TALSPSSLPP KISTAFHTQQ SEGAETTGRP HERSSFSPGV 
    SQEIFTLHET TTWPSSFSSK GHTTWSQTEL PSTSTGAATR LVTGNPSTGA AGTIPRVPSK 
    VSAIGEPGEP TTYSSHSTTL PKTTGAGAQT QWTQETGTTG EALLSSPSYS VTQMIKTATS 
    PSSSPMLDRH TSQQITTAPS TNHSTIHSTS TSPQESPAVS QRGHTQAPQT TQESQTTRSV 
    SPMTDTKTVT TPGSSFTASG HSPSEIVPQD APTISAATTF APAPTGDGHT TQAPTTALQA 
    APSSHDATLG PSGGTSLSKT GALTLANSVV STPGGPEGQW TSASASTSPD TAAAMTHTHQ 
    AESTEASGQT QTSEPASSGS RTTSAGTATP SSSGASGTTP SGSEGISTSG ETTRFSSNPS 
    RDSHTTQSTT ELLSASASHG AIPVSTGMAS SIVPGTFHPT LSEASTAGRP TGQSSPTSPS 
    ASPQETAAIS RMAQTQRTRT SRGSDTISLA SQATDTFSTV PPTPPSITSS GLTSPQTQTH 
    TLSPSGSGKT FTTALISNAT PLPVTYASSA STGHTTPLHV TDASSVSTGH ATPLPVTDTS 
    SESTGHATPL PVTSPSSVST GHTTPLPVTD TSSESTGHVT PLPVTSLSSA STGDSTPLPV 
    TDTSSASTGH VTPLPVTSLS SASTGDTTPL PVTDTSSAST GHATSLPVTD TSSVSTGHTT 
    PLPVTDTSSA STGHATSLPV TDTSSVSTAH ATPLPVTGLS SASTDDTTRL PVTDVSSAST 
    GQAIPLPVTS PSSASTGDTT PLPVTDASSA STGDTTSLPV TIPSSASSGH TTSLPVTDAS 
    SVSTGHATSL LVTDASSVST GDTTPLPVTD TNSASTGDTT PLHVTDASSV STGHATSLPV 
    TSLSSASTGD TTPLPVTSPS SASSGHTTPL PVTDASSVPT GHATSLPVTD ASSVSTGHAT 
    PLPVTDASSV STGHATPLPV TDTSSVSTGQ ATPLPVTSLS SASTGDTTPL PVTDTSSAST 
    GQDTPLPVTS LSSVSTGDTT PLPVTSPSSA STGHATPLLV TDASSVSTGH ATSLLVTDAS 
    SVSTGHATAL HVTDASSLST GDTTPLPVTS PSSASTGDTT PLPVTDTSSA STGHATSLPV 
    TDTSSASTGH ATPLPVTDTS SASTGQATPL PVTGPSSAST GHAIPLLVTD TSSASTGQAT 
    PLPVTSLSSA STGDTTPLPV TDASSVSTGH ATSLPVTSLS SVSTGDTTPL PVTSPSSASS 
    GHTTPLPVTD ASSVSTGDTT PLPVTSPSSA SSGHTTPLPV TSPSSASSGH TTPLPVTDAS 
    SASTGDTTPL PVTDTSSAST GHATHLPVTG LSSASTGDTT RLPVTNVSSA STGHATPLPV 
    TSTSSASTGD TTPLPGTDTS SVSTGHTTPL LVTDASSVST GDTTRLPVTS PSSASTGHTT 
    PLPVTDTPSA STGDTTPLPV TNASSLSTRH TTSLHVTSPS SASTGHATSL PVTDTSSVST 
    GHATPLHVTS PSSASTGDTT PLPVTDTYSA STGQATPLPV TDTSSASTGD TTPLPVTDTS 
    SASTGHATPL PVTNTSSVST GHATPLHVTS PSSASTGHTT PLPVTDASSV STGHATSLPV 
    TDASSVFTGH ATSLPVTIPS SASSGHTTPL PVTDASSVST GHATSLPVTD ASSVSTGHAT 
    PLPVTDASSV STGHATPLPV TDTSSVSTGH ATPLPLTSLS SVSTGDTTPL PVTDTSSAST 
    GQATPLPVTS LSSVSTGDTT PLPVTDTSSA STGHATSLPV TDTSSASTGH ATPLPDTDTS 
    SASTGHATPL PVTDTSSAST GHATLLPVTD TSSASIGHAT PLPVTDTSSI STGHATPLHV 
    TSPSSASTGH ATPLPVTDTS SASTGHANPL HVTSPSSAST GHATPLPVTD TSSASTGHAT 
    PLPVTSLSSV STGDTTPLPV TSPSSASTGH TTPLPVTDTS SASTGQATAL PVTSTSSAST 
    GDTTPLPVTD TSSASTGQAT PLPVTSLSSV STGDTTPLPV TSPSSASTGH ATPLLVTDAS 
    SASTGQATPL PVTDTSSAYT GDTTSLPVTD TSSSSTGDTT PLLVTETSSA STGHATPLHV 
    TSPSSASTGD TTPVPVTDTS SVSTGHATPL PVTGLSSAST GDTTRLPVTD ISSASTGQAT 
    PLPVTNTSSA STGHATPLPV TGLSSASTGD TTRLPVTDIS SASTGQATPL PVTNTSSVST 
    GDTMPLPVTS PSSASTGHAT PLPVTSTSSA STGHATPVPV TSTSSASTGH TTPLPVTDTS 
    SASTGDTTPL PVTSPSSAST GHTTPLHVTI PSSASTGDTS TLPVTGASSA STGHATPLPV 
    TDTSSVSTGH ATPLPVTSFS SVSTGDTTPL PVTDTSSVST GHATPLPVTS FSSVSTGDTT 
    PLPVTDASSA STGHATPLPV TDTSSVSTGH ATPLPLTSLS SVSTGDTTTL PVTDTSSVST 
    GHATPLPVTS FSSVSTGDTT PLPVTDASSA STGHATPLPV TDTSSVSTGH ATPLPVTSLS 
    SVSTGDTTPL PVTDASSAST GQATPLPVTS LSSVSTGDTT PLPVTIPSSA SSGHTTSLPV 
    TDTSSASTGQ ATPLPVTSLS SVSTGDTTPL LVTDASSVST GHATPLPVTD TSSASTGDTT 
    RLPVTDTSSA STGQATPLPV TSLSSVSTGD TTPLLVTNTS SVSRGHATSL PVTIPSSSSS 
    GHTTPLPVTS TSSVSTGHVT PLPVTSTSSA STGHATSLPV TDTSSVSTGH ATSLPVTDTS 
    SVSTGHATPL PVTDASSVST GHATPLPVTD ASSASTGDTT PLPVTNTSSA STGQATPLPV 
    TSLSSVSTGD TMPLPVTSPS SASTGHATPL PVTGLSSAST GDTTPLPVTD TSSASTGHVT 
    PLPVTSLSSA STGDSTPLPV TDTSSASTGH VTPLPVTSLS SASTGDTTPL PVTDTSSAST 
    GHATPLHVTD ASSASTGQAT LLPVTSLSSV STGDTTPLPV TSPSSASTGH ATPLLVTDTS 
    SASTGHATPL PVTDASSVST GHATSLPVTI RSSGSTGHTT PLPVTDTSSA STGQATSLLV 
    TDTSSVSTGD TTPLPVTSTS SASTGHVTPL HVTSPSSAST GHATPLPVTS LSSASTGDTM 
    PLPVTSPSSA STGDTTPLPV TDASSVSTGH TTPLPVTSPS SASTGHTTPL PVTDTTSASK 
    GDTTPLPVTS PSSASTGHTT PLPVTDTSSA STGDTTPLPV TSPSSASTGH ATPLPVTNAS 
    SLSTGHATPL HVTSPSSAST GHATPLPVTS TSSASTGHAT SLPVTSTSSA STGHATPLPV 
    TDNSSVSTGH ATPLPVTGLS SATTDDTTRL PVTDVSSAST GQATPLPVTS LSSVSTGDTT 
    PLPVTSPSSA STGHASPLLV TDASSASTGQ ATPLPVTDTS SVSTAHATPL PVTGLSSAST 
    DDTTRLPVTD VSSASTGQAI PLPVTSPSSA STGDTTPLPV TDASSASTGD TTSLPVTIPS 
    SASSGHTTSL PVTDASSVST GHATSLLVTD ASSVSTGDTT PLPVTDTNSA STGDTTPLHV 
    TDASSVSTGH ATSLPVTSLS SASTGDTTPL PVTSPSSASS GDTTPLPVTD TSSASTGHAT 
    HLPVTGLSSA STGDTTRLPV TNVSSASTGH ATPLPVTSTS SASTGDTTPL PGTDTSSVST 
    GHTTPLLVTD ASSVSTGDTT RLPVTSPSSA STGHTTPLHV TDASSVSTGH TTPLPVTDTS 
    SASTGQATSL LVTDTSSVST GDTTPLPVTS TSSASTGHVT PLHVTSPSSA STGHATPLPV 
    TSLSSASTGD TTPLPVTDTS SVSTGHTTPL PVTSPSSAST GHTTPLPVTD TSSASKGDTT 
    PLPVTSPSSA STGHTTPLPV TDTSSASTGD TTPLPVTNAS SLSTGHATPL HVTSPSSAST 
    GHATPLPVTS TSSASTGHAT PLPVTSTSSA STGHATPLPV TDNSSVSTGH ATPLPVTGLS 
    SATTDDTTRL PVTDVSSAST GQATPLPVTS LSSVSTGDTT PLPVTSPSSA STGHASPLLV 
    TDASSASTGQ ATPLPVTDTS SVSTAHATPL PVTGLSSAST DDTTRLPVTD VSSASTGQAI 
    PLPVTSPSSA STGDTTPLPV TDASSASTGD TTSLPVTIPS SASSGHTTSL PVTDASSVST 
    GHATSLLVTD ASSVSTGDTT PLPVTDTNSA STGDTTPLHV TDASSVSTGH ATSLPVTSLS 
    SASTGDTTPL PVTSPSSASS GHTTSLPVTD ASSVSTGHAT SLPVTIPSSA SSGHTTPLPV 
    TDASSVPTGH ATSLPVTDAS SVSTGHATPL PVTDASSVST GHATPLPVTD TSSVSTGQAT 
    PLPVTSLSSA SSTGDTTPLP VTDTSSASTG QDTPLPVTSL FSVSTGDTTP LPVTSPSSAS 
    TGHATHLLVT DASSVSTGHA TSLLVTDASS VSTGHATALH VTDASSLSTG DTTPLPVTSP 
    SSASTGDTTP LPVTDTSSVS TGHATSLPVT DTSSASTGHA TSLPVTDTSS ASTGQATPLP 
    VTSPSSASTG HAIPLLVTDT SSASTGQATP LPVTSLSSAS TGDTTPLPVT DASSVSTGHA 
    TSLPVTSLSS VSTGDTTPLP VTSPSSASTG HATPLHVTSP SSASTGHATP LPVTSLSSAS 
    TGDTTPLPVT SPSSASTGHA TPLHVTDASS VSTGDTTPLP VTSPSSASSG HTTPLPVTDA 
    SSASTGDTTP LPVTDTSSAS TGHATHLPVT GLSSASTGDT TRLPVTDVSS ASTGHATPLP 
    VTSTSSASTG DTTPLPGTDT SSVSTGHTTP LLVTDASSVS TGDTTRLPVT SPSSASTGHT 
    TPLPVTDTPS ASTGDTTPLP VTNASSLSTR HATSLHVTSP SSASTGHATP LPVTDTSAAS 
    TGHATPLPVT STSSASTGDT TPLPVTDTSS ASTGHATPLP VTNTSSVSTG HATPLHVTSP 
    SSASTGHTTP LPVTDASSVS TGHATSLPVT DASSVSTGHA TPLPVTDASS VSTGHATPLP 
    LTSLSSVSTG DTTPLPVTDT SSASTGQATP LPVTSLSSVS TGDTTPLPVT DTSSASTGHA 
    TSLPVTDTSS ASTGHATPLP VTDTSSISTG HATPLHVTSP SSASTGHATP LPVTDTSSAS 
    TGHATPLPVT SLSSVSTGDT TPLPVTSPSS ASTGHATPLL VTDASSASTG QATPLPVTSL 
    SSVSTGDTTP LPVTSPSSAS TGHATSLPVT DTSSASTGDT TSLPVTDTSS AYTGDTTSLP 
    VTDTSSSSTG DTTPLLVTET SSASTGDTTP VPVTDTSSVS TGHATPLPVT GLSSASTGDT 
    TRLPVTDISS ASTGQATPLP VTNTSSVSTG DTMPLPVTSP SSASTGHATP LPVTSTSSAS 
    TGHATPVPVT STSLASTGHT TPLPVTSPSS ASTGHTTPLP VTDTSSASTG DTTPLPVTNA 
    SSLSTGHTTP LHVTIPSSAS TGDTSTLPVT GASSASTGHA TPLPVTDTSS VSTGHATPLP 
    VTSFSSVSTG DTTPLPVTDA SSASTGHATP LPVTDTSSAS TGDTTPLPVT DASSASTGQA 
    TPLPVTSLSS VSTGDTTPLP VTIPSSASSG HTTSLPVSDT SSASTGQATP LPVTSLSSVS 
    TGDTTPLLVT DASSVSTGHA TPLPVTDTSS ASTGDTTRLP VTDTSSASTG QATPLPVTSL 
    SSVSTGDTTP LLVTNTSSVS TGHATSLPVT IPSSSSSGHT TPLPVTSTSS VSTGHVTPLH 
    VTSPSSSSTG QATPLPVTST SSVSTGHVTP LHVTSPSSAS TGHATPLPVT STSSASTGHA 
    TPLPVTDASS VSTGHATPLP VTDTSSASTG DTTPLPVTDT SSASTGQATP LPVTSLSSVS 
    TGHATPLAVS SATSASTVSS DSPLKMETSG MTTPSLKTDG GRRTATSPPP TTSQTIISTI 
    PSTAMHTRST AAPIPILPER GVSLFPYGAD AGDLEFVRRT VDFTSPLFKP ATGFPLGSSL 
    RDSLYFTDNG QIIFPESDYQ IFSYPNPLPT GFTGRDPVAL VAPFWDDADF STGRGTTFYQ 
    EYETFYGEHS LLVQQAESWI RKMTNNGGYK ARWALKVTWV NAHAYPAQWT LGSNTYQAIL 
    STDGSRSYAL FLYQSGGMQW DVAQRSGNPV LMGFSSGDGY FENSPLMSQP VWERYRPDRF 
    LNSNSGLQGL QFYRLHREER PNYRLECLQW LKSQPRWPSW GWNQVSCPCS WQQGRRDLRF 
    QPVSIGRWGL GSRQLCSFTS WRGGVCCSYG PWGEFREGWH VQRPWQLAQE LEPQSWCCRW 
    NDKPYLCALY QQRRPHVGCA TYRPPQPAWM FGDPHITTLD GVSYTFNGLG DFLLVGAQDG 
    NSSFLLQGRT AQTGSAQATN FIAFAAQYRS SSLGPVTVQW LLEPHDAIRV LLDNQTVTFQ 
    PDHEDGGGQE TFNATGVLLS RNGSEVSASF DGWATVSVIA LSNILHASAS LPPEYQNRTE 
    GLLGVWNNNP EDDFRMPNGS TIPPGSPEEM LFHFGMTWQI NGTGLLGKRN DQLPSNFTPV 
    FYSQLQKNSS WAEHLISNCD GDSSCIYDTL ALRNASIGLH TREVSKNYEQ ANATLNQYPP 
    SINGGRVIEA YKGQTTLIQY TSNAEDANFT LRDSCTDLEL FENGTLLWTP KSLEPFTLEI 
    LARSAKIGLA SALQPRTVVC HCNAESQCLY NQTSRVGNSS LEVAGCKCDG GTFGRYCEGS 
    EDACEEPCFP SVHCVPGKGC EACPPNLTGD GRHCAALGSS FLCQNQSCPV NYCYNQGHCY 
    ISQTLGCQPM CTCPPAFTDS RCFLAGNNFS PTVNLELPLR VIQLLLSEEE NASMAEVNAS 
    VAYRLGTLDM RAFLRNSQVE RIDSAAPASG SPIQHWMVIS EFQYRPRGPV IDFLNNQLLA 
    AVVEAFLYHV PRRSEEPRND VVFQPISGED VRDVTALNVS TLKAYFRCDG YKGYDLVYSP 
    QSGFTCVSPC SRGYCDHGGQ CQHLPSGPRC SCVSFSIYTA WGEHCEHLSM KLDAFFGIFF 
    GALGGLLLLG VGTFVVLRFW GCSGARFSYF LNSAEALP

Genular Protein ID: 2868438431

Symbol: A0T3F4_HUMAN

Name: N/A

UniProtKB Accession Codes:

Database IDs:

Sequence Information:

  • Length: 1437
  • Mass: 151953
  • Checksum: CE57979138FC3A4B
  • Sequence:
  • MKGARWRRVP WVSLSCLCLC LLPHVVPGTT EDTLITGSKT PAPVTSTGST TATLEGQSTA 
    ASSRTSNQDI SASSQNHQTK STETTSKAQT DTLTQMMTST LFSSPSVHNV METVTQETAP 
    PDEMTTSFPS SVTNTLMMTS KTITMTTSTD STLGNTEETS TAGTESSTPV TSAVSITAGQ 
    EGQSRTTSWR TSIQDTSASS QNHWTRSTQT TRESQTSTLT HRTTSTPSFS PSVHNVTGTV 
    SQKTSPSGET ATSSLCSVTN TSMMTSEKIT VTTSTGSTLG NPGETSSVPV TGSLMPVTSA 
    ALVTVDPEGQ SPATFSRTST QDTTAFSKNH QTQSVETTRV SQINTLNTLT PVTTSTVLSS 
    PSGFNPSGTV SQETFPSGET TISSPSSVSN TFLVTSKVFR MPISRDSTLG NTEETSLSVS 
    GTISAITSKV STIWWSDTLS TALSPSSLPP KISTAFHTQQ SEGAETTGRP HERSSFSPGV 
    SQEIFTLHET TTWPSSFSSK GHTTWSQTEL PSTSTGAATR LVTGNPSTGA AGTIPRVPSK 
    VSAIGEPGEP TTYSSHSTTL PKTTGAGAQT QWTQETGTTG EALLSSPSYS VTQMIKTATS 
    PSSSPMLDRH TSQQITTAPS TNHSTIHSTS TSPQESPAVS QRGHTQAPQT TQESQTTRSV 
    SPMTDTKTVT TPGSSFTASG HSPSEIVPQD APTISAATTF APAPTGDGHT TQAPTTALQA 
    APSSHDATLG PSGGTSLSKT GALTLANSVV STPGGPEGQW TSASASTSPD TAAAMTHTHQ 
    AESTEASGQT QTSEPASSGS RTTSAGTATP SSSGASGTTP SGSEGISTSG ETTRFSSNPS 
    RDSHTTQSTT ELLSASASHG AIPVSTGMAS SIVPGTFHPT LSEASTAGRP TGQSSPTSPS 
    ASPQETAAIS RMAQTQRTRT SRGSDTISLA SQATDTFSTV PPTPPSITSS GLTSPQTQTH 
    TLSPSGSVST GHATPLAVSS ATSASTVSSD SPLKMETSGM TTPSLKTDGG RRTATSPPPT 
    TSQTIISTIP STAMHTRSTA APIPILPERG VSLFPYGAGA GDLEFVRRTV DFTSPLFKPA 
    TGFPLGSSLR DSLYFTDNGQ IIFPESDYQI FSYPNPLPTG FTGRDPVALV APFWDDADFS 
    TGRGTTFYQE YETFYGEHSL LVQQAESWIR KITNNGGYKA RWALKVTWVN AHAYPAQWTL 
    GSNTYQAILS TDGSRSYALF LYQSGGMQWD VAQRSGNPVL MGFSSGDGYF ENSPLMSQPV 
    WERYRPDRFL NSNSGLQGLQ FYRLHREERP NYRLECLQWL KSQPRWPSWG WNQVSCPCSW 
    QQGRRDLRFQ PVSIGRWGLG SRQLCSFTSW RGGVCCSYGP WGEFREGWHV QRPWQLAQEL 
    EPQSWCCRWN DKPYLCALYQ QRRPHVGCAT YRPPQPAWMF GDPHITTLDG VSYTFNG