Protein profile

KP13_32074

protein mucB

Genome: KpKP13

Gene: mucB AHE47434.1 Structure source: AlphaFold + ColabFold UniProt A0A0V9HVV8
Amino acids 412
Annotations 6
Features 18
PDB binders 29
Druggability 0.696

Overview

Basic information about this protein and its source genome.

Accession
KP13_32074
Gene
mucB AHE47434.1
Status
annotated
Amino acids
412
Structure source
AlphaFold + ColabFold
GO
GO:0003684 Binding to damaged DNA. GO:0006281 The process of restoring DNA after damage. Genomes are subject to damage by chemical and physical agents in the environment (e.g. UV and ionizing radiations, chemical mutagens, fungal and bacterial toxins, etc.) and by free radicals or alkylating agents endogenously generated in metabolism. DNA is also damaged because of errors during its replication. A variety of different DNA repair pathways have been reported that include direct reversal, base excision repair, nucleotide excision repair, photoreactivation, bypass, double-strand break repair pathway, and mismatch repair pathway. GO:0005829 The part of the cytoplasm that does not contain organelles but which does contain other particulate matter, such as protein complexes. GO:0003887 Catalysis of the reaction: deoxynucleoside triphosphate + DNA(n) = diphosphate + DNA(n+1); DNA-template-directed extension of the 3'-end of a DNA strand by one nucleotide at a time. GO:0042276 The conversion of DNA-damage induced single-stranded gaps into large molecular weight DNA after replication by using a specialized DNA polymerase or replication complex to insert a defined nucleotide across the lesion. This process does not remove the replication-blocking lesions and causes an increase in the endogenous mutation level. For example, in E. coli, a low fidelity DNA polymerase, pol V, copies lesions that block replication fork progress. This produces mutations specifically targeted to DNA template damage sites, but it can also produce mutations at undamaged sites. GO:0009432 An error-prone process for repairing damaged microbial DNA.

Target profile

Computed evidence for target prioritization.

Human off-target
hit
Human identity (%)
24.294
Human E-value
2.37e-06
Gut microbiome off-target
hit
Essential (DEG)
N
DEG identity (%)
0.0
Localization
Cytoplasmic
ColabFold pLDDT
90.18

Selected Druggability evidence

AlphaFold / UniProt model

Selected Druggability is the FPocket score chosen for ranking using the curated structure priority. The 3D viewer may show a different loaded structure, so its visible pockets can differ.

FPocket 0.696
Structure A0A0V9HVV8
Pocket Pocket 1
P2Rank 0.455
Structure A0A0V9HVV8
Pocket Pocket 1
ColabFold model
FPocket 0.871 · Pocket 7
P2Rank 0.757 · Pocket 1
Core conservation Accessory gene
Roary accessory
CoreCruncher accessory
Gut microbiome 68 / 4744 genomes with a hit
Normalized 0.014

Sequence

Primary amino-acid sequence viewer.

Functional Annotations

Enzyme classification and Gene Ontology terms linked to this protein.

6 GO

Gene Ontology (GO)

6
  • GO:0003684 Binding to damaged DNA.
  • GO:0006281 The process of restoring DNA after damage. Genomes are subject to damage by chemical and physical agents in the environment (e.g. UV and ionizing radiations, chemical mutagens, fungal and bacterial toxins, etc.) and by free radicals or alkylating agents endogenously generated in metabolism. DNA is also damaged because of errors during its replication. A variety of different DNA repair pathways have been reported that include direct reversal, base excision repair, nucleotide excision repair, photoreactivation, bypass, double-strand break repair pathway, and mismatch repair pathway.
  • GO:0005829 The part of the cytoplasm that does not contain organelles but which does contain other particulate matter, such as protein complexes.
  • GO:0003887 Catalysis of the reaction: deoxynucleoside triphosphate + DNA(n) = diphosphate + DNA(n+1); DNA-template-directed extension of the 3'-end of a DNA strand by one nucleotide at a time.
  • GO:0042276 The conversion of DNA-damage induced single-stranded gaps into large molecular weight DNA after replication by using a specialized DNA polymerase or replication complex to insert a defined nucleotide across the lesion. This process does not remove the replication-blocking lesions and causes an increase in the endogenous mutation level. For example, in E. coli, a low fidelity DNA polymerase, pol V, copies lesions that block replication fork progress. This produces mutations specifically targeted to DNA template damage sites, but it can also produce mutations at undamaged sites.
  • GO:0009432 An error-prone process for repairing damaged microbial DNA.

Sequence Features

Domain/signature hits from InterPro and related databases.

18 records
Show feature table
Start End DB Term Name
162 223 Gene3D G3DSA:1.10.150.20 -
2 227 SUPERFAMILY SSF56672 DNA/RNA polymerases
2 227 InterPro IPR043502 DNA/RNA polymerase superfamily
2 64 Gene3D G3DSA:3.40.1170.60 -
235 348 Pfam PF11799 impB/mucB/samB family C-terminal domain
235 348 InterPro IPR017961 DNA polymerase, Y-family, little finger domain
2 339 CDD cd01700 PolY_Pol_V_umuC
2 344 PANTHER PTHR11076 DNA REPAIR POLYMERASE UMUC / TRANSFERASE FAMILY MEMBER
1 178 ProSiteProfiles PS50173 UmuC domain profile.
1 178 InterPro IPR001126 UmuC domain
233 346 Gene3D G3DSA:3.30.1490.100 -
233 346 InterPro IPR036775 DNA polymerase, Y-family, little finger domain superfamily
360 408 Pfam PF13438 Domain of unknown function (DUF4113)
360 408 InterPro IPR025188 Domain of unknown function DUF4113
2 140 Pfam PF00817 impB/mucB/samB family
2 140 InterPro IPR001126 UmuC domain
65 145 Gene3D G3DSA:3.30.70.270 -
65 145 InterPro IPR043128 Reverse transcriptase/Diguanylate cyclase domain

3D Structure

Selected loaded structure. Experimental PDB entries may cover only a portion of the sequence; predicted models typically cover the full protein.

3D visualization script Full viewer

Loading 3D structure...

Legend High Medium Low

Structural evidence

0 + 2

Experimental PDB entries and predicted models. Click Switch to display a different structure in the viewer.

Entry Method Resolution Chain Coverage Links Status
AlphaFold AF_A0A0V9HVV8
AlphaFold full sequence Viewing
ColabFold KP13_32074
ColabFold full sequence Loaded
Pocket details FPocket · P2Rank — toggle visibility and zoom from here, or open full viewer

Pockets (FPOCKET)

Showing top-ranked FPocket candidates by druggability. Druggability is color-coded: high (0.7 or higher), medium (0.4 to 0.69), low (below 0.4).

FPOCKET Sticks Spheres Surfaces Druggability Labels Zoom Positions
1 0.696
12 0.611

Pockets (P2RANK)

Showing top-ranked P2Rank candidates by probability. Probability is color-coded per P2Rank calibration: high (≥ 0.5), medium (0.2 – 0.49), low (< 0.2).

P2RANK Sticks Spheres Surfaces Score Probability Labels Zoom Positions
1 4.64 0.202
2 2.03 0.044
3 1.96 0.041
4 1.15 0.009
5 1.14 0.009

Ligand evidence

Ligands grouped by evidence source. PDB ligands keep the source crystal visible, and loaded crystals can be opened directly in the structure viewer.

79 records

Structural evidence inferred from similar proteins. The source crystal indicates where the ligand was observed; the UniProt column identifies the homologous protein carrying that ligand.

Show only:
Ligand Source crystal UniProt (homolog) MW · LogP · TPSA Lipinski PAINS SMILES
0G4 Q97W02 486.2 Da LogP -0.53 TPSA 232.8 2 viol. ✓ Clean C1[C@H](O[C@H](S1)COP(=O)(O)OP(=O)(NP(=O)(O)O)O…
0G8 Q97W02 389.2 Da LogP -0.36 TPSA 183.4 ✓ Ro5 ✓ Clean C1[C@H](O[C@H](S1)COP(=O)(O)OP(=O)(O)O)N2C=CC(=…
0KX A0QR77 466.2 Da LogP -1.61 TPSA 253.0 2 viol. ✓ Clean C1[C@@H]([C@H](O[C@H]1N2C=CC(=NC2=O)N)COP(=O)(N…
0OH Q97W02 501.2 Da LogP 0.06 TPSA 249.7 3 viol. ✓ Clean c1nc(c2c(n1)n(cn2)[C@H]3C[C@@H]([C@]4([C@@H]3C4…
0OJ Q97W02 501.2 Da LogP -0.15 TPSA 249.7 3 viol. ✓ Clean c1nc(c2c(n1)n(cn2)[C@]34C[C@H]3[C@@H]([C@H](C4)…
1FZ Q97W02 481.2 Da LogP -1.59 TPSA 246.9 1 viol. ✓ Clean CC1=CN(C(=O)NC1=O)[C@H]2C[C@@H]([C@H](O2)COP(=O…
2LF Q97W02 345.2 Da LogP -1.49 TPSA 185.8 ✓ Ro5 ✓ Clean C1[C@@H]([C@H]2[C@H](c3nc4c(n3[C@@H]1O2)N=C(NC4…
2TM A0QR77 481.2 Da LogP -2.10 TPSA 261.2 2 viol. ✓ Clean C1=CN(C(=O)N=C1N)[C@H]2[C@@H]([C@@H]([C@H](O2)C…
3TT Q97W02 468.2 Da LogP -0.67 TPSA 232.8 2 viol. ✓ Clean C1[C@H](O[C@H](S1)COP(=O)(O)OP(=O)(NP(=O)(O)O)O…
A38 Q97W02 347.2 Da LogP -1.54 TPSA 185.8 ✓ Ro5 ✓ Clean c1nc(c2c(n1)N(C(=O)N2)[C@H]3C[C@@H]([C@H](O3)CO…
ADI Q97W02 395.2 Da LogP 0.31 TPSA 192.1 ✓ Ro5 ✓ Clean c1nc(c2c(n1)n(cn2)[C@H]3CC[C@H](O3)CO[P@@](=O)(…
AF Q97W02 181.2 Da LogP 2.84 TPSA 26.0 ✓ Ro5 ✓ Clean c1ccc-2c(c1)Cc3c2ccc(c3)N
BAP Q97W02 304.3 Da LogP 2.90 TPSA 60.7 ✓ Ro5 ✓ Clean c1cc2ccc3cc4c(c5c3c2c(c1)cc5)C[C@H]([C@H]([C@@H…
DCP Q97W02 467.2 Da LogP -1.18 TPSA 250.2 2 viol. ✓ Clean C1[C@@H]([C@H](O[C@H]1N2C=CC(=NC2=O)N)CO[P@@](=…
DCT Q97W02 451.2 Da LogP -0.15 TPSA 230.0 1 viol. ✓ Clean C1C[C@@H](O[C@@H]1CO[P@](=O)(O)O[P@](=O)(O)OP(=…
DDS Q97W02 475.2 Da LogP 0.43 TPSA 238.7 1 viol. ✓ Clean c1nc(c2c(n1)n(cn2)[C@H]3CC[C@H](O3)CO[P@@](=O)(…
DDY Q97W02 371.2 Da LogP -0.27 TPSA 183.4 ✓ Ro5 ✓ Clean C1C[C@@H](O[C@@H]1CO[P@@](=O)(O)OP(=O)(O)O)N2C=…
DG3 Q97W02 491.2 Da LogP -0.28 TPSA 258.6 2 viol. ✓ Clean c1nc2c(n1[C@H]3CC[C@H](O3)CO[P@@](=O)(O)O[P@](=…
DGT Q97W02 507.2 Da LogP -1.31 TPSA 278.9 3 viol. ✓ Clean c1nc2c(n1[C@H]3C[C@@H]([C@H](O3)CO[P@@](=O)(O)O…
DOC Q97W02 291.2 Da LogP -0.39 TPSA 136.9 ✓ Ro5 ✓ Clean C1C[C@@H](O[C@@H]1COP(=O)(O)O)N2C=CC(=NC2=O)N
DT Q97W02 322.2 Da LogP -1.40 TPSA 151.1 ✓ Ro5 ✓ Clean CC1=CN(C(=O)NC1=O)[C@H]2C[C@@H]([C@H](O2)COP(=O…
DTP Q97W02 491.2 Da LogP -0.60 TPSA 258.9 2 viol. ✓ Clean c1nc(c2c(n1)n(cn2)[C@H]3C[C@@H]([C@H](O3)CO[P@]…
DZ4 Q97W02 490.2 Da LogP -1.03 TPSA 261.7 2 viol. ✓ Clean c1nc(c2c(n1)n(cn2)[C@H]3C[C@@H]([C@H](O3)CO[P@@…
FTD Q97W02 409.2 Da LogP -0.25 TPSA 180.9 ✓ Ro5 ✓ Clean C1[C@H](O[C@H](S1)COP(=O)(O)OP(=O)(O)O)N2CC(=C(…
LTP Q97W02 387.2 Da LogP -1.30 TPSA 203.7 ✓ Ro5 ✓ Clean C1[C@H]([C@@H](O[C@@H]1N2C=CC(=NC2=O)N)COP(=O)(…
POP Q97W02 176.0 Da LogP -2.08 TPSA 129.9 ✓ Ro5 ✓ Clean O[P@@](=O)([O-])O[P@@](=O)(O)[O-]
TTP Q97W02 482.2 Da LogP -1.16 TPSA 244.1 2 viol. ✓ Clean CC1=CN(C(=O)NC1=O)[C@H]2C[C@@H]([C@H](O2)CO[P@]…
XG4 Q97W02 506.2 Da LogP -1.73 TPSA 281.7 3 viol. ✓ Clean c1nc2c(n1[C@H]3C[C@@H]([C@H](O3)CO[P@@](=O)(N[P…
YYY Q97W02 387.2 Da LogP -1.30 TPSA 203.7 ✓ Ro5 ✓ Clean C1[C@@H]([C@H](O[C@H]1N2C=CC(=NC2=O)N)CO[P@@](=…

PDB and ChEMBL records on this protein are shown in full. ChEMBL records from similar proteins are capped at the top 100 per protein (by pchembl) and ZINC at the top 50 (Tanimoto ≥ 0.5). ADME columns are descriptor-based screening flags, not experimental toxicity results.