Protein profile

KP13_00188

Formamidopyrimidine-DNA glycosylase bifunctional protein

Genome: KpKP13

Gene: mutM AHE42102.1 Structure source: AlphaFold + ColabFold UniProt A0A0H3H049
Amino acids 269
Annotations 14
Features 32
PDB binders 7
Druggability 0.296

Overview

Basic information about this protein and its source genome.

Accession
KP13_00188
Gene
mutM AHE42102.1
Status
annotated
Amino acids
269
Structure source
AlphaFold + ColabFold

Target profile

Computed evidence for target prioritization.

Human off-target
hit
Human identity (%)
28.947
Human E-value
2.44e-06
Gut microbiome off-target
hit
Essential (DEG)
N
DEG identity (%)
0.0
Localization
Cytoplasmic
ColabFold pLDDT
94.86

Selected Druggability evidence

AlphaFold / UniProt model

Selected Druggability is the FPocket score chosen for ranking using the curated structure priority. The 3D viewer may show a different loaded structure, so its visible pockets can differ.

FPocket 0.296
Structure A0A0H3H049
Pocket Pocket 6
P2Rank 0.435
Structure A0A0H3H049
Pocket Pocket 1
ColabFold model
FPocket 0.263 · Pocket 1
P2Rank 0.343 · Pocket 1
Core conservation Conserved core gene
Roary core
CoreCruncher core
Gut microbiome 175 / 4744 genomes with a hit
Normalized 0.037

Sequence

Primary amino-acid sequence viewer.

Functional Annotations

Enzyme classification and Gene Ontology terms linked to this protein.

2 EC 12 GO

Enzyme Commission (EC)

2

Gene Ontology (GO)

12
  • GO:0008270 Binding to a zinc ion (Zn).
  • GO:0003684 Binding to damaged DNA.
  • GO:0003906 Catalysis of the cleavage of the C-O-P bond in the AP site created when DNA glycosylase removes a damaged base, involved in the DNA base excision repair pathway (BER).
  • GO:0003676 Binding to a nucleic acid.
  • GO:0006284 In base excision repair, an altered base is removed by a DNA glycosylase enzyme, followed by excision of the resulting sugar phosphate. The small gap left in the DNA helix is filled in by the sequential action of DNA polymerase and DNA ligase.
  • GO:0019104 Catalysis of the removal of damaged bases by cleaving the N-C1' glycosidic bond between the target damaged DNA base and the deoxyribose sugar. The reaction releases a free base and leaves an apurinic/apyrimidinic (AP) site.
  • GO:0006281 The process of restoring DNA after damage. Genomes are subject to damage by chemical and physical agents in the environment (e.g. UV and ionizing radiations, chemical mutagens, fungal and bacterial toxins, etc.) and by free radicals or alkylating agents endogenously generated in metabolism. DNA is also damaged because of errors during its replication. A variety of different DNA repair pathways have been reported that include direct reversal, base excision repair, nucleotide excision repair, photoreactivation, bypass, double-strand break repair pathway, and mismatch repair pathway.
  • GO:0003677 Any molecular function by which a gene product interacts selectively and non-covalently with DNA (deoxyribonucleic acid).
  • GO:0008534 Catalysis of the removal of oxidized purine bases by cleaving the N-C1' glycosidic bond between the oxidized purine and the deoxyribose sugar. The reaction involves the formation of a covalent enzyme-substrate intermediate. Release of the enzyme and free base by a beta-elimination or a beta, gamma-elimination mechanism results in the cleavage of the DNA backbone 3' of the apurinic (AP) site.
  • GO:0016799 Catalysis of the hydrolysis of any N-glycosyl bond.
  • GO:0034039 Catalysis of the removal of 8-oxo-7,8-dihydroguanine bases by cleaving the N-C1' glycosidic bond between the oxidized purine and the deoxyribose sugar.
  • GO:0140078 Catalysis of the cleavage of an AP site 3' of the baseless site by a beta-lyase mechanism, leaving an unsaturated aldehyde, termed a 3'-(4-hydroxy-5-phospho-2-pentenal) residue, and a 5'-phosphate.

Sequence Features

Domain/signature hits from InterPro and related databases.

32 records
Show feature table
Start End DB Term Name
217 269 SUPERFAMILY SSF57716 Glucocorticoid receptor-like (DNA-binding domain)
129 219 Pfam PF06831 Formamidopyrimidine-DNA glycosylase H2TH domain
129 219 InterPro IPR015886 DNA glycosylase/AP lyase, H2TH DNA-binding
1 114 Pfam PF01149 Formamidopyrimidine-DNA glycosylase N-terminal domain
1 114 InterPro IPR012319 Formamidopyrimidine-DNA glycosylase, catalytic domain
1 269 Hamap MF_00103 Formamidopyrimidine-DNA glycosylase [mutM].
1 269 InterPro IPR020629 Formamidopyrimidine-DNA glycosylase
131 269 Gene3D G3DSA:1.10.8.50 -
1 269 PANTHER PTHR22993 FORMAMIDOPYRIMIDINE-DNA GLYCOSYLASE
2 128 Gene3D G3DSA:3.20.190.10 -
2 128 InterPro IPR035937 MutM-like, N-terminal
235 269 ProSiteProfiles PS51066 Zinc finger FPG-type profile.
235 269 InterPro IPR000214 Zinc finger, DNA glycosylase/AP lyase-type
2 128 FunFam G3DSA:3.20.190.10:FF:000001 Formamidopyrimidine-DNA glycosylase
131 269 FunFam G3DSA:1.10.8.50:FF:000003 Formamidopyrimidine-DNA glycosylase
2 138 SUPERFAMILY SSF81624 N-terminal domain of MutM-like DNA repair proteins
2 138 InterPro IPR035937 MutM-like, N-terminal
244 268 ProSitePatterns PS01242 Zinc finger FPG-type signature.
244 268 InterPro IPR015887 DNA glycosylase/AP lyase, zinc finger domain, DNA-binding site
241 269 Pfam PF06827 Zinc finger found in FPG and IleRS
241 269 InterPro IPR010663 Zinc finger, FPG/IleRS-type
130 222 SUPERFAMILY SSF46946 S13-like H2TH domain
130 222 InterPro IPR010979 Ribosomal protein S13-like, H2TH
2 116 CDD cd08966 EcFpg-like_N
2 112 ProSiteProfiles PS51068 Formamidopyrimidine-DNA glycosylase catalytic domain profile.
2 112 InterPro IPR012319 Formamidopyrimidine-DNA glycosylase, catalytic domain
129 221 SMART SM01232 H2TH_2
129 221 InterPro IPR015886 DNA glycosylase/AP lyase, H2TH DNA-binding
1 268 NCBIfam TIGR00577 DNA-formamidopyrimidine glycosylase
1 268 InterPro IPR020629 Formamidopyrimidine-DNA glycosylase
2 115 SMART SM00898 Fapy_DNA_glyco_2
2 115 InterPro IPR012319 Formamidopyrimidine-DNA glycosylase, catalytic domain

3D Structure

Selected loaded structure. Experimental PDB entries may cover only a portion of the sequence; predicted models typically cover the full protein.

3D visualization script Full viewer

Loading 3D structure...

Legend High Medium Low

Structural evidence

0 + 2

Experimental PDB entries and predicted models. Click Switch to display a different structure in the viewer.

Entry Method Resolution Chain Coverage Links Status
AlphaFold AF_A0A0H3H049
AlphaFold full sequence Viewing
ColabFold KP13_00188
ColabFold full sequence Loaded
Pocket details FPocket · P2Rank — toggle visibility and zoom from here, or open full viewer

Pockets (FPOCKET)

Showing top-ranked FPocket candidates by druggability. Druggability is color-coded: high (0.7 or higher), medium (0.4 to 0.69), low (below 0.4).

FPOCKET Sticks Spheres Surfaces Druggability Labels Zoom Positions
6 0.296

Pockets (P2RANK)

Showing top-ranked P2Rank candidates by probability. Probability is color-coded per P2Rank calibration: high (≥ 0.5), medium (0.2 – 0.49), low (< 0.2).

P2RANK Sticks Spheres Surfaces Score Probability Labels Zoom Positions
1 4.95 0.223
2 1.27 0.013

Ligand evidence

Ligands grouped by evidence source. PDB ligands keep the source crystal visible, and loaded crystals can be opened directly in the structure viewer.

57 records

Structural evidence inferred from similar proteins. The source crystal indicates where the ligand was observed; the UniProt column identifies the homologous protein carrying that ligand.

Show only:
Ligand Source crystal UniProt (homolog) MW · LogP · TPSA Lipinski PAINS SMILES
2ON P42371 168.2 Da LogP -0.07 TPSA 74.4 ✓ Ro5 ✓ Clean c1[nH]c2c(n1)C(=O)NC(=N2)S
5JL P42371 200.2 Da LogP 0.97 TPSA 80.2 ✓ Ro5 ✓ Clean C12=C(NC(=S)N1)NC(=S)NC2=O
KB5 P42371 167.2 Da LogP 0.22 TPSA 80.5 ✓ Ro5 ✓ Clean c1nc(c2c(n1)[nH]c(n2)S)N
KBN P42371 220.2 Da LogP 1.66 TPSA 54.5 ✓ Ro5 ✓ Clean c1[nH]c2c(n1)c(nc(n2)C(F)(F)F)S
KBQ P42371 167.2 Da LogP 0.91 TPSA 64.4 ✓ Ro5 ✓ Clean c1c[nH]c2c1C(=O)NC(=S)N2
KD8 P42371 208.2 Da LogP 0.99 TPSA 74.4 ✓ Ro5 ✓ Clean Cc1c(nc2c(n1)C(=O)NC(=S)N2)C
PED P50465 200.1 Da LogP -0.77 TPSA 107.2 ✓ Ro5 ✓ Clean CC[C@@H]([C@@H](COP(=O)(O)O)O)O

PDB and ChEMBL records on this protein are shown in full. ChEMBL records from similar proteins are capped at the top 100 per protein (by pchembl) and ZINC at the top 50 (Tanimoto ≥ 0.5). ADME columns are descriptor-based screening flags, not experimental toxicity results.