Protein profile

KP13_00801

Uracil-DNA glycosylase

Genome: KpKP13

Gene: ung AHE43166.1 Structure source: AlphaFold + ColabFold UniProt A0A0H3GTU7
Amino acids 237
Annotations 7
Features 20
PDB binders 16
Druggability 0.357

Overview

Basic information about this protein and its source genome.

Accession
KP13_00801
Gene
ung AHE43166.1
Status
annotated
Amino acids
237
Structure source
AlphaFold + ColabFold

Target profile

Computed evidence for target prioritization.

Human off-target
hit
Human identity (%)
64.848
Human E-value
1.46e-72
Gut microbiome off-target
hit
Essential (DEG)
Y
DEG identity (%)
71.628
DEG E-value
3.34e-113
Localization
Cytoplasmic
ColabFold pLDDT
95.24

Selected Druggability evidence

AlphaFold / UniProt model

Selected Druggability is the FPocket score chosen for ranking using the curated structure priority. The 3D viewer may show a different loaded structure, so its visible pockets can differ.

FPocket 0.357
Structure A0A0H3GTU7
Pocket Pocket 5
P2Rank 0.429
Structure A0A0H3GTU7
Pocket Pocket 1
ColabFold model
FPocket 0.214 · Pocket 10
P2Rank 0.341 · Pocket 1
Core conservation Conserved core gene
Roary core
CoreCruncher core
Gut microbiome 2006 / 4744 genomes with a hit
Normalized 0.423

Sequence

Primary amino-acid sequence viewer.

Functional Annotations

Enzyme classification and Gene Ontology terms linked to this protein.

1 EC 6 GO

Enzyme Commission (EC)

1

Gene Ontology (GO)

6
  • GO:0006284 In base excision repair, an altered base is removed by a DNA glycosylase enzyme, followed by excision of the resulting sugar phosphate. The small gap left in the DNA helix is filled in by the sequential action of DNA polymerase and DNA ligase.
  • GO:0004844 Catalysis of the cleavage of the N-C1' glycosidic bond between the damaged DNA base and the deoxyribose sugar, releasing a free base and leaving an apyrimidinic (AP) site. Enzymes with this activity recognize and remove uracil bases in DNA that result from the deamination of cytosine or the misincorporation of dUTP opposite an adenine.
  • GO:0016799 Catalysis of the hydrolysis of any N-glycosyl bond.
  • GO:0006281 The process of restoring DNA after damage. Genomes are subject to damage by chemical and physical agents in the environment (e.g. UV and ionizing radiations, chemical mutagens, fungal and bacterial toxins, etc.) and by free radicals or alkylating agents endogenously generated in metabolism. DNA is also damaged because of errors during its replication. A variety of different DNA repair pathways have been reported that include direct reversal, base excision repair, nucleotide excision repair, photoreactivation, bypass, double-strand break repair pathway, and mismatch repair pathway.
  • GO:0005737 The contents of a cell excluding the plasma membrane and nucleus, but including other subcellular structures.
  • GO:0097510 A base-excision repair, AP site formation process occurring via excision of a deaminated base.

Sequence Features

Domain/signature hits from InterPro and related databases.

20 records
Show feature table
Start End DB Term Name
11 237 Gene3D G3DSA:3.40.470.10 -
11 237 InterPro IPR036895 Uracil-DNA glycosylase-like domain superfamily
57 218 SMART SM00987 UDG_2_a
27 228 CDD cd10027 UDG-F1-like
27 228 InterPro IPR002043 Uracil-DNA glycosylase family 1
57 218 SMART SM00986 UDG_2
57 218 InterPro IPR005122 Uracil-DNA glycosylase-like
12 228 SUPERFAMILY SSF52141 Uracil-DNA glycosylase-like
12 228 InterPro IPR036895 Uracil-DNA glycosylase-like domain superfamily
5 232 FunFam G3DSA:3.40.470.10:FF:000001 Uracil-DNA glycosylase
14 221 NCBIfam TIGR00628 uracil-DNA glycosylase
14 221 InterPro IPR002043 Uracil-DNA glycosylase family 1
12 229 Hamap MF_00148 Uracil-DNA glycosylase [ung].
12 229 InterPro IPR002043 Uracil-DNA glycosylase family 1
4 230 PANTHER PTHR11264 URACIL-DNA GLYCOSYLASE
4 230 InterPro IPR002043 Uracil-DNA glycosylase family 1
65 74 ProSitePatterns PS00130 Uracil-DNA glycosylase signature.
65 74 InterPro IPR018085 Uracil-DNA glycosylase, active site
63 217 Pfam PF03167 Uracil DNA glycosylase superfamily
63 217 InterPro IPR005122 Uracil-DNA glycosylase-like

3D Structure

Selected loaded structure. Experimental PDB entries may cover only a portion of the sequence; predicted models typically cover the full protein.

3D visualization script Full viewer

Loading 3D structure...

Legend High Medium Low

Structural evidence

0 + 2

Experimental PDB entries and predicted models. Click Switch to display a different structure in the viewer.

Entry Method Resolution Chain Coverage Links Status
AlphaFold AF_A0A0H3GTU7
AlphaFold full sequence Viewing
ColabFold KP13_00801
ColabFold full sequence Loaded
Pocket details FPocket · P2Rank — toggle visibility and zoom from here, or open full viewer

Pockets (FPOCKET)

Showing top-ranked FPocket candidates by druggability. Druggability is color-coded: high (0.7 or higher), medium (0.4 to 0.69), low (below 0.4).

FPOCKET Sticks Spheres Surfaces Druggability Labels Zoom Positions
5 0.357
3 0.252

Pockets (P2RANK)

Showing top-ranked P2Rank candidates by probability. Probability is color-coded per P2Rank calibration: high (≥ 0.5), medium (0.2 – 0.49), low (< 0.2).

P2RANK Sticks Spheres Surfaces Score Probability Labels Zoom Positions
1 5.45 0.259

Ligand evidence

Ligands grouped by evidence source. PDB ligands keep the source crystal visible, and loaded crystals can be opened directly in the structure viewer.

67 records

Structural evidence inferred from similar proteins. The source crystal indicates where the ligand was observed; the UniProt column identifies the homologous protein carrying that ligand.

Show only:
Ligand Source crystal UniProt (homolog) MW · LogP · TPSA Lipinski PAINS SMILES
302 P13051 346.3 Da LogP 0.16 TPSA 146.2 ✓ Ro5 ✓ Clean c1cc(ccc1\C=N\OCCO/N=C/C2=CC(=O)NC(=O)N2)C(=O)O
3FI P13051 346.3 Da LogP 0.29 TPSA 136.6 ✓ Ro5 ✓ Clean c1cc(cc(c1)C(=O)O)\C=N\OCCCNCC2=CC(=O)NC(=O)N2
3FL P13051 346.4 Da LogP 0.42 TPSA 127.1 ✓ Ro5 ✓ Clean c1cc(cc(c1)C(=O)O)CNCCCCNCC2=CC(=O)NC(=O)N2
5NU P9WFQ9 157.1 Da LogP -1.03 TPSA 108.9 ✓ Ro5 ✓ Clean C1=C(C(=O)NC(=O)N1)[N+](=O)[O-]
5UC P9WFQ9 146.5 Da LogP -0.28 TPSA 65.7 ✓ Ro5 ✓ Clean C1=C(C(=O)NC(=O)N1)Cl
6UA P9WFQ9 127.1 Da LogP -1.02 TPSA 84.5 ✓ Ro5 ✓ Clean C1C(=NC(=O)NC1=O)N
DUR P10186 228.2 Da LogP -1.82 TPSA 104.6 ✓ Ro5 ✓ Clean C1[C@@H]([C@H](O[C@H]1N2C=CC(=O)NC2=O)CO)O
FCF P13051 346.3 Da LogP 0.16 TPSA 146.2 ✓ Ro5 ✓ Clean c1cc(cc(c1)C(=O)O)\C=N\OCCO/N=C/C2=CC(=O)NC(=O)…
FCK P13051 346.3 Da LogP 0.29 TPSA 136.6 ✓ Ro5 ✓ Clean c1cc(cc(c1)C(=O)O)CNCCCON=CC2=CC(=O)NC(=O)N2
FLC P9WFQ9 189.1 Da LogP -5.25 TPSA 140.6 ✓ Ro5 ✓ Clean C(C(=O)[O-])C(CC(=O)[O-])(C(=O)[O-])O
QU4 P13051 422.3 Da LogP 2.45 TPSA 169.4 ✓ Ro5 ✓ Clean c1cc(c(cc1C(=C2C=CC(=O)C(=C2)C(=O)O)c3ccc(c(c3)…
TUL P9WFQ9 128.2 Da LogP 0.43 TPSA 48.6 ✓ Ro5 ✓ Clean C1=CNC(=S)NC1=O
URA P12295 112.1 Da LogP -0.94 TPSA 65.7 ✓ Ro5 ✓ Clean C1=CNC(=O)NC1=O
URB D0VWU0 191.0 Da LogP -0.17 TPSA 65.7 ✓ Ro5 ✓ Clean C1=C(C(=O)NC(=O)N1)Br
URF P9WFQ9 130.1 Da LogP -0.80 TPSA 65.7 ✓ Ro5 ✓ Clean C1=C(C(=O)NC(=O)N1)F
WBU P9WFQ9 127.1 Da LogP -1.35 TPSA 91.7 ✓ Ro5 ✓ Clean C1=C(C(=O)NC(=O)N1)N

PDB and ChEMBL records on this protein are shown in full. ChEMBL records from similar proteins are capped at the top 100 per protein (by pchembl) and ZINC at the top 50 (Tanimoto ≥ 0.5). ADME columns are descriptor-based screening flags, not experimental toxicity results.