We use cookies to enhance the usability of our website. If you continue, we'll assume that you are happy to receive all cookies. More information. Don't show this again.
General description of the gene and the encoded protein(s) using information from HGNC and Ensembl, as well as predictions made by the Human Protein Atlas project.
Gene namei
Official gene symbol, which is typically a short form of the gene name, according to HGNC.
All transcripts of all genes have been analyzed regarding the location(s) of corresponding protein based on prediction methods for signal peptides and transmembrane regions.
Genes with at least one transcript predicted to encode a secreted protein, according to prediction methods or to UniProt location data, have been further annotated and classified with the aim to determine if the corresponding protein(s) are secreted or actually retained in intracellular locations or membrane-attached.
Remaining genes, with no transcript predicted to encode a secreted protein, will be assigned the prediction-based location(s).
The annotated location overrules the predicted location, so that a gene encoding a predicted secreted protein that has been annotated as intracellular will have intracellular as the final location.
Intracellular
Number of transcriptsi
Number of protein-coding transcripts from the gene as defined by Ensembl.
6
HUMAN PROTEIN ATLAS INFORMATIONi
Summary of RNA expression based on cell line data from the DepMap portal and cell line data generated within the Human Protein Atlas project.
Cell line expression clusteri
The RNA data was used to cluster genes according to their expression across cell lines. Clusters contain genes that have similar expression patterns, and each cluster has been manually annotated to describe common features in terms of function and specificity.
Non-specific - Unknown function (mainly)
Cell line specificityi
RNA specificity category based on RNA sequencing data from cancer cell lines in the Human Protein Atlas grouped according to type of cancer. Genes are classified into six different categories (enriched, group enriched, enhanced, low specificity and not detected) according to their RNA expression levels across the panel of cell lines.
Low cancer specificity
Tau specificity scorei
Tau specificity score is a numerical indicator of the specificity of the gene expression across cells or tissues. The value ranges from 0 and 1, where 0 indicates identical expression across all cells/tissue types, while 1 indicates expression in a single cell/tissue type.
0.20
Cell line distributioni
RNA distribution category based on RNA sequencing data from cancer cell lines in the Human Protein Atlas grouped according to type of cancer. Genes are classified into five different categories (detected in all, detected in many, detected in some, detected in single and not detected) according to their pattern of detected RNA expression across the panel of cell lines.
Detected in all
Protein evidencei
Evidence score for genes based on UniProt protein existence (UniProt evidence); neXtProt protein existence (neXtProt evidence);and a Human Protein Atlas antibody- or RNA based score (HPA evidence). The avaliable scores are evidence at protein level, evidence at transcript level, no evidence, or not avaliable.
RNA expression data as normalized transcript per million (nTPM) values of cancer cell lines.The analyzed cell lines are grouped according to cancer type. Detailed information about the groups is revealed by hovering over the corresponding bar in the chart. More information and cell line data can be found in the Cell line section.
Cell line categories
Alphabetical
Expression
RNA specificity:Low cancer specificity
EXPRESSION CLUSTERING & CORRELATIONi
The RNA data was used to cluster genes according to their expression across samples. The resulting clusters have been manually annotated to describe common features in terms of function and specificity. The annotation of the cluster is displayed together with a confidence score of the gene's assignment to the cluster. The confidence is calculated as the fraction of times the gene was assigned to this cluster in repeated calculations and is reported between 0 to 1, where 1 is the highest possible confidence. The clustering results are shown in a UMAP, where the cluster this gene was assigned to is highlighted as a colored area in which most of the cluster genes reside. A table shows the 15 most similar genes in terms of expression profile.
ZMIZ2 is part of cluster 67Non-specific - Unknown function with confidencei
Confidence is the fraction of times a gene was assigned to the cluster in repeated clustering, and therefore reflects how strongly associated it is to the cluster. A confidence of 1 indicates that the gene was assigned to this cluster in all repeated clusterings.
Correlation between the selected gene and neighboring gene. Correlation is calculated as Spearman correlation in PCA space based on the RNA-seq expression data.
Clusteri
ID of the expression cluster of the neighboring gene.