Protein profile
KP13_01257
UDP-N-acetylmuramate:L-alanyl-gamma-D-glutamyl-meso-diaminopimelate ligase
Genome: KpKP13
Overview
Basic information about this protein and its source genome.
- Accession: KP13_01257
- Gene: AHE46680.1 mpl
- Status: annotated
- Amino acids: 457
- Structure source: AlphaFold + ColabFold
Target profile
Computed evidence for target prioritization.
- Human off-target: No hit
- Human identity (%): 0.0
- Gut microbiome off-target: hit
- Essential (DEG): Y
- DEG identity (%): 68.222
- DEG E-value: 0.0
- Localization: Cytoplasmic
- ColabFold pLDDT: 94.7
Selected Druggability evidence
AlphaFold / UniProt model
Selected Druggability is the FPocket score chosen for ranking using the curated structure priority. The 3D viewer may show a different loaded structure, so its visible pockets can differ.
Sequence
Primary amino-acid sequence viewer.
Functional Annotations
Enzyme classification and Gene Ontology terms linked to this protein.
Enzyme Commission (EC): 1 entry
Gene Ontology (GO): 9 terms
- GO:0009058 A cellular process consisting of the biochemical pathways by which a living organism synthesizes chemical substances. This typically represents the energy-requiring part of metabolism in which simpler substances are transformed into more complex ones.
- GO:0016881 Catalysis of the ligation of an acid to an amino acid via a carbon-nitrogen bond, with the concomitant hydrolysis of the diphosphate bond in ATP or a similar triphosphate.
- GO:0071555 A process that results in the assembly, arrangement of constituent parts, or disassembly of the cell wall, the rigid or semi-rigid envelope lying outside the cell membrane of plant, fungal and most prokaryotic cells, maintaining their shape and protecting them from osmotic lysis.
- GO:0005524 Binding to ATP, adenosine 5'-triphosphate, a universally important coenzyme and enzyme regulator.
- GO:0009252 The chemical reactions and pathways resulting in the formation of peptidoglycans, any of a class of glycoconjugates found in bacterial cell walls and consisting of long glycan strands of alternating residues of beta-(1,4) linked N-acetylglucosamine and N-acetylmuramic acid, cross-linked by short peptides.
- GO:0106418 Catalysis of the reaction: ATP + UDP-N-acetyl-alpha-D-muramate + L-alanyl-gamma-D-glutamyl-meso-2,6-diaminoheptanedioate = ADP + phosphate + UDP-N-acetylmuramoyl-L-alanyl-gamma-D-glutamyl-meso-2,6-diaminoheptanedioate.
- GO:0051301 The process resulting in division and partitioning of components of a cell to form more cells; may or may not be accompanied by the physical separation of a cell into distinct, individually membrane-bounded daughter cells.
- GO:0009254 The continual breakdown and regeneration of peptidoglycan required to maintain the bacterial cell wall. Peptidoglycans consist of long glycan strands of alternating residues of beta-(1,4) linked N-acetylglucosamine and N-acetylmuramic acid, cross-linked by short peptides.
- GO:0008360 Any process that modulates the surface configuration of a cell.
Sequence Features
Domain/signature hits from InterPro and related databases.
| Start | End | DB | Term | Name |
|---|---|---|---|---|
| 17 | 24 | Phobius | SIGNAL_PEPTIDE_C_REGION | C-terminal region of a signal peptide. |
| 91 | 313 | Gene3D | G3DSA:3.40.1190.10 | - |
| 91 | 313 | InterPro | IPR036565 | Mur-like, catalytic domain superfamily |
| 5 | 16 | Phobius | SIGNAL_PEPTIDE_H_REGION | Hydrophobic region of a signal peptide. |
| 1 | 90 | SUPERFAMILY | SSF51984 | MurCD N-terminal domain |
| 1 | 450 | Hamap | MF_02020 | UDP-N-acetylmuramate--L-alanyl-gamma-D-glutamyl-meso-2,6-diaminoheptandioate ligase [mpl]. |
| 1 | 450 | InterPro | IPR005757 | Murein peptide ligase |
| 314 | 451 | FunFam | G3DSA:3.90.190.20:FF:000002 | UDP-N-acetylmuramate--L-alanyl-gamma-D-glutamyl-meso-2,6-diaminoheptandioate ligase |
| 314 | 451 | Gene3D | G3DSA:3.90.190.20 | - |
| 314 | 451 | InterPro | IPR036615 | Mur ligase, C-terminal domain superfamily |
| 91 | 313 | FunFam | G3DSA:3.40.1190.10:FF:000003 | UDP-N-acetylmuramate--L-alanyl-gamma-D-glutamyl-meso-2,6-diaminoheptandioate ligase |
| 2 | 101 | Pfam | PF01225 | Mur ligase family, catalytic domain |
| 2 | 101 | InterPro | IPR000713 | Mur ligase, N-terminal catalytic domain |
| 25 | 457 | Phobius | NON_CYTOPLASMIC_DOMAIN | Region of a membrane-bound protein predicted to be outside the membrane, in the extracellular region. |
| 312 | 448 | SUPERFAMILY | SSF53244 | MurD-like peptide ligases, peptide-binding domain |
| 312 | 448 | InterPro | IPR036615 | Mur ligase, C-terminal domain superfamily |
| 1 | 24 | Phobius | SIGNAL_PEPTIDE | Signal peptide region |
| 95 | 308 | SUPERFAMILY | SSF53623 | MurD-like peptide ligases, catalytic domain |
| 95 | 308 | InterPro | IPR036565 | Mur-like, catalytic domain superfamily |
| 1 | 24 | SignalP_EUK | SignalP-noTM | SignalP-noTM |
| 312 | 360 | Pfam | PF02875 | Mur ligase family, glutamate ligase domain |
| 312 | 360 | InterPro | IPR004101 | Mur ligase, C-terminal |
| 1 | 90 | Gene3D | G3DSA:3.40.50.720 | - |
| 108 | 291 | Pfam | PF08245 | Mur ligase middle domain |
| 108 | 291 | InterPro | IPR013221 | Mur ligase, central |
| 1 | 4 | Phobius | SIGNAL_PEPTIDE_N_REGION | N-terminal region of a signal peptide. |
| 2 | 453 | PANTHER | PTHR43445 | UDP-N-ACETYLMURAMATE--L-ALANINE LIGASE-RELATED |
| 2 | 449 | NCBIfam | TIGR01081 | UDP-N-acetylmuramate:L-alanyl-gamma-D-glutamyl-meso-diaminopimelate ligase |
| 2 | 449 | InterPro | IPR005757 | Murein peptide ligase |
3D Structure
Selected loaded structure. Experimental PDB entries may cover only a portion of the sequence; predicted models typically cover the full protein.
Structural evidence
Experimental PDB entries and predicted models (0 + 2). Click Switch to display a different structure in the viewer.
| Entry | Method | Resolution | Chain | Coverage | Links | Status |
|---|---|---|---|---|---|---|
| AlphaFold AF_A0A0H3GI91 | AlphaFold | — | — | full sequence | — | Viewing |
| ColabFold KP13_01257 | ColabFold | — | — | full sequence | — | Loaded |
Pocket details (FPocket · P2Rank)
Pockets (FPOCKET)
Showing top-ranked FPocket candidates by druggability. Druggability is color-coded: high (0.7 or higher), medium (0.4 to 0.69), low (below 0.4).
| FPOCKET | Druggability |
|---|---|
| 1 | 0.893 |
| 20 | 0.525 |
| 11 | 0.338 |
Pockets (P2RANK)
Showing top-ranked P2Rank candidates by probability. Probability is color-coded per P2Rank calibration: high (≥ 0.5), medium (0.2 – 0.49), low (< 0.2).
| P2RANK | Score | Probability |
|---|---|---|
| 1 | 40.46 | 0.964 |
| 2 | 3.14 | 0.107 |
| 3 | 2.73 | 0.082 |
| 4 | 2.43 | 0.066 |
Pockets (FPOCKET)
Showing top-ranked FPocket candidates by druggability. Druggability is color-coded: high (0.7 or higher), medium (0.4 to 0.69), low (below 0.4).
| FPOCKET | Druggability |
|---|---|
| 3 | 0.762 |
| 1 | 0.371 |
Pockets (P2RANK)
Showing top-ranked P2Rank candidates by probability. Probability is color-coded per P2Rank calibration: high (≥ 0.5), medium (0.2 – 0.49), low (< 0.2).
| P2RANK | Score | Probability |
|---|---|---|
| 1 | 36.73 | 0.956 |
| 2 | 5.85 | 0.286 |
| 3 | 2.54 | 0.072 |
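The color bands described for the pocket tables can be sketched as two small classifiers. This is only an illustration of the stated cut-offs, not the site's own code: the function names are hypothetical, and values that fall between the printed bands (for example 0.695) are assumed to belong to the lower band.

```python
def fpocket_band(druggability: float) -> str:
    """Band an FPocket druggability score: high >= 0.7, medium >= 0.4, else low."""
    if druggability >= 0.7:
        return "high"
    if druggability >= 0.4:
        return "medium"
    return "low"


def p2rank_band(probability: float) -> str:
    """Band a P2Rank probability: high >= 0.5, medium >= 0.2, else low."""
    if probability >= 0.5:
        return "high"
    if probability >= 0.2:
        return "medium"
    return "low"


# Pocket id -> score, copied from the first pair of pocket tables above.
fpocket_scores = {1: 0.893, 20: 0.525, 11: 0.338}
p2rank_probs = {1: 0.964, 2: 0.107, 3: 0.082, 4: 0.066}

fpocket_bands = {pid: fpocket_band(s) for pid, s in fpocket_scores.items()}
p2rank_bands = {pid: p2rank_band(p) for pid, p in p2rank_probs.items()}

print(fpocket_bands)
print(p2rank_bands)
```

Under these assumptions, only pocket 1 is in the high band for both tools in the first pair of tables.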
Ligand evidence
Ligands grouped by evidence source. PDB ligands keep the source crystal visible, and loaded crystals can be opened directly in the structure viewer.
Highest-confidence structural evidence: ligands co-crystallized with this exact protein. If the source PDB is loaded in TPW, use Open crystal to inspect it in the structure viewer.
No PDB structure with a co-crystallized ligand found for this exact protein.
Structural evidence inferred from similar proteins. The source crystal indicates where the ligand was observed; the UniProt column identifies the homologous protein carrying that ligand.
| Ligand | Source crystal | UniProt (homolog) | MW · LogP · TPSA | Lipinski | PAINS | SMILES |
|---|---|---|---|---|---|---|
| ANP | | B7GV74 | 506.2 Da LogP -2.06 TPSA 281.9 | 3 viol. | ✓ Clean | c1nc(c2c(n1)n(cn2)[C@H]3[C@@H]([C@@H]([C@H](O3)… |
| EPU | | P45066 | 677.4 Da LogP -4.03 TPSA 332.2 | 3 viol. | ✓ Clean | CC(=O)N[C@@H]1[C@H]([C@@H]([C@H](O[C@@H]1O[P@@]… |
| UD1 | | P65473 | 607.4 Da LogP -4.65 TPSA 305.9 | 3 viol. | ✓ Clean | CC(=O)N[C@@H]1[C@H]([C@@H]([C@H](O[C@@H]1O[P@@]… |
| UMA | | P45066 | 750.5 Da LogP -4.65 TPSA 361.3 | 3 viol. | ✓ Clean | C[C@@H](C(=O)O)NC(=O)[C@@H](C)O[C@@H]1[C@H]([C@… |
| UXP | | Q9HW02 | 354.4 Da LogP 2.05 TPSA 118.6 | ✓ Ro5 | ✓ Clean | c1c([nH]nc1Nc2c3c[nH]nc3nc(n2)N4CCCC[C@@H]4CO)C… |
| UYD | | Q9HW02 | 406.5 Da LogP 3.27 TPSA 116.6 | ✓ Ro5 | ✓ Clean | CC(C)(C)c1cc(nn1C)Nc2c3cn[nH]c3nc(n2)N[C@@H](CO… |
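The Lipinski column counts rule-of-five violations. A minimal sketch of that count follows, using the classic limits (molecular weight at most 500 Da, LogP at most 5, at most 5 hydrogen-bond donors, at most 10 hydrogen-bond acceptors). The page does not say which descriptor implementation it uses, and donor and acceptor counts are not shown in the table, so the counts passed in the examples are hypothetical placeholders rather than values for these ligands.

```python
def ro5_violations(mw: float, logp: float, h_donors: int, h_acceptors: int) -> int:
    """Count Lipinski rule-of-five violations for one compound."""
    checks = [
        mw > 500,          # molecular weight above 500 Da
        logp > 5,          # calculated LogP above 5
        h_donors > 5,      # more than 5 hydrogen-bond donors
        h_acceptors > 10,  # more than 10 hydrogen-bond acceptors
    ]
    return sum(checks)


# MW and LogP are taken from the ANP and UXP rows above.
# The donor/acceptor counts are HYPOTHETICAL placeholders, not page data.
nucleotide_like = ro5_violations(506.2, -2.06, h_donors=8, h_acceptors=18)
drug_like = ro5_violations(354.4, 2.05, h_donors=4, h_acceptors=8)

print(nucleotide_like, drug_like)
```

TPSA appears in the same column group but is not part of the rule of five, so it is not used here.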
Experimental bioactivity from ChEMBL measured directly on this protein. Score = pchembl (−log Ki/IC₅₀; higher = more potent).
No ChEMBL bioactivity data found for this exact protein.
Bioactivity inferred from similar proteins in ChEMBL. Score = pchembl (−log Ki/IC₅₀; higher = more potent).
No ChEMBL hits found through similar proteins.
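For readers unfamiliar with the score, pChEMBL is the negative base-10 logarithm of a potency value (such as Ki or IC₅₀) expressed in molar units. The sketch below converts nanomolar values to that scale; the function name and the example concentrations are illustrative assumptions, since no ChEMBL measurements are reported for this protein.

```python
import math


def pchembl_from_nm(value_nm: float) -> float:
    """Convert a potency in nanomolar (Ki, IC50, ...) to -log10 of its molar value."""
    if value_nm <= 0:
        raise ValueError("concentration must be positive")
    return -math.log10(value_nm * 1e-9)  # nM -> M, then negative log10


# Hypothetical potencies, not measurements for this protein: 10 uM, 100 nM, 1 nM.
examples_nm = [10_000.0, 100.0, 1.0]
scores = [round(pchembl_from_nm(v), 2) for v in examples_nm]

print(scores)  # [5.0, 7.0, 9.0]
```

Each tenfold gain in potency raises the score by one unit, which is why higher values indicate more potent compounds.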
Proposed virtual-screening candidates from ZINC. Score = Tanimoto similarity to a known binder (0–1; higher = more similar).
| Ligand | Tanimoto | MW · LogP · TPSA | Lipinski | PAINS | SMILES |
|---|---|---|---|---|---|
| ZINC16546165 | 0.810 | 427.2 Da LogP -1.75 TPSA 232.6 | 2 viol. | ✓ Clean | Nc1ncnc2c1ncn2[C@H]1O[C@H](CO[P@](=O)(O)OP(=O)(… |
| ZINC13518964 | 0.741 | 347.2 Da LogP -1.86 TPSA 186.1 | ✓ Ro5 | ✓ Clean | Nc1ncnc2c1ncn2[C@@H]1O[C@@H](COP(=O)(O)O)[C@H](… |
| ZINC1532515 | 0.741 | 347.2 Da LogP -1.86 TPSA 186.1 | ✓ Ro5 | ✓ Clean | Nc1ncnc2c1ncn2[C@H]1O[C@@H](COP(=O)(O)O)[C@H](O… |
| ZINC1571045 | 0.741 | 347.2 Da LogP -1.86 TPSA 186.1 | ✓ Ro5 | ✓ Clean | Nc1ncnc2c1ncn2[C@@H]1O[C@@H](COP(=O)(O)O)[C@@H]… |
| ZINC1842158 | 0.741 | 347.2 Da LogP -1.86 TPSA 186.1 | ✓ Ro5 | ✓ Clean | Nc1ncnc2c1ncn2[C@H]1O[C@@H](COP(=O)(O)O)[C@H](O… |
| ZINC2046931 | 0.741 | 347.2 Da LogP -1.86 TPSA 186.1 | ✓ Ro5 | ✓ Clean | Nc1ncnc2c1ncn2[C@@H]1O[C@@H](COP(=O)(O)O)[C@H](… |
| ZINC2126310 | 0.741 | 347.2 Da LogP -1.86 TPSA 186.1 | ✓ Ro5 | ✓ Clean | Nc1ncnc2c1ncn2[C@@H]1O[C@H](COP(=O)(O)O)[C@@H](… |
| ZINC3201891 | 0.741 | 347.2 Da LogP -1.86 TPSA 186.1 | ✓ Ro5 | ✓ Clean | Nc1ncnc2c1ncn2[C@@H]1O[C@@H](COP(=O)(O)O)[C@@H]… |
| ZINC3201893 | 0.741 | 347.2 Da LogP -1.86 TPSA 186.1 | ✓ Ro5 | ✓ Clean | Nc1ncnc2c1ncn2[C@H]1O[C@@H](COP(=O)(O)O)[C@@H](… |
| ZINC3830180 | 0.741 | 347.2 Da LogP -1.86 TPSA 186.1 | ✓ Ro5 | ✓ Clean | Nc1ncnc2c1ncn2[C@H]1O[C@@H](COP(=O)(O)O)[C@@H](… |
| ZINC3860156 | 0.741 | 347.2 Da LogP -1.86 TPSA 186.1 | ✓ Ro5 | ✓ Clean | Nc1ncnc2c1ncn2[C@@H]1O[C@H](COP(=O)(O)O)[C@@H](… |
| ZINC3977897 | 0.741 | 347.2 Da LogP -1.86 TPSA 186.1 | ✓ Ro5 | ✓ Clean | Nc1ncnc2c1ncn2[C@H]1O[C@H](COP(=O)(O)O)[C@@H](O… |
| ZINC4806442 | 0.741 | 347.2 Da LogP -1.86 TPSA 186.1 | ✓ Ro5 | ✓ Clean | Nc1ncnc2c1ncn2[C@@H]1O[C@H](COP(=O)(O)O)[C@H](O… |
| ZINC8613167 | 0.741 | 347.2 Da LogP -1.86 TPSA 186.1 | ✓ Ro5 | ✓ Clean | Nc1ncnc2c1ncn2[C@@H]1O[C@H](COP(=O)(O)O)[C@H](O… |
| ZINC4096224 | 0.729 | 346.2 Da LogP -1.90 TPSA 191.9 | ✓ Ro5 | ✓ Clean | Nc1ncnc2c1ncn2[C@@H]1O[C@H](CO[P@](N)(=O)O)[C@@… |
| ZINC105372833 | 0.712 | 345.3 Da LogP -1.93 TPSA 197.6 | ✓ Ro5 | ✓ Clean | Nc1ncnc2c1ncn2[C@@H]1O[C@H](COP(N)(N)=O)[C@H](O… |
| ZINC105372837 | 0.712 | 345.3 Da LogP -1.93 TPSA 197.6 | ✓ Ro5 | ✓ Clean | Nc1ncnc2c1ncn2[C@@H]1O[C@H](COP(N)(N)=O)[C@H](O… |
| ZINC17107643 | 0.712 | 345.3 Da LogP -1.93 TPSA 197.6 | ✓ Ro5 | ✓ Clean | Nc1ncnc2c1ncn2[C@@H]1O[C@H](COP(N)(N)=O)[C@@H](… |
| ZINC204538551 | 0.712 | 345.3 Da LogP -1.93 TPSA 197.6 | ✓ Ro5 | ✓ Clean | Nc1ncnc2c1ncn2[C@@H]1O[C@H](COP(N)(N)=O)[C@@H](… |
| ZINC105469665 | 0.694 | 425.2 Da LogP -1.64 TPSA 223.4 | 2 viol. | ✓ Clean | Nc1ncnc2c1ncn2[C@@H]1O[C@H](CO[P@@](=O)(O)CP(=O… |
| ZINC13527614 | 0.694 | 425.2 Da LogP -1.64 TPSA 223.4 | 2 viol. | ✓ Clean | Nc1ncnc2c1ncn2[C@@H]1O[C@H](CO[P@](=O)(O)CP(=O)… |
| ZINC219330894 | 0.694 | 425.2 Da LogP -1.64 TPSA 223.4 | 2 viol. | ✓ Clean | Nc1ncnc2c1ncn2[C@@H]1O[C@H](CO[P@](=O)(O)CP(=O)… |
| ZINC3873852 | 0.694 | 425.2 Da LogP -1.64 TPSA 223.4 | 2 viol. | ✓ Clean | Nc1ncnc2c1ncn2[C@H]1O[C@@H](CO[P@](=O)(O)CP(=O)… |
| ZINC3873853 | 0.694 | 425.2 Da LogP -1.64 TPSA 223.4 | 2 viol. | ✓ Clean | Nc1ncnc2c1ncn2[C@@H]1O[C@@H](CO[P@](=O)(O)CP(=O… |
| ZINC3873854 | 0.694 | 425.2 Da LogP -1.64 TPSA 223.4 | 2 viol. | ✓ Clean | Nc1ncnc2c1ncn2[C@H]1O[C@@H](CO[P@](=O)(O)CP(=O)… |
| ZINC3873855 | 0.694 | 425.2 Da LogP -1.64 TPSA 223.4 | 2 viol. | ✓ Clean | Nc1ncnc2c1ncn2[C@@H]1O[C@@H](CO[P@](=O)(O)CP(=O… |
| ZINC5615251 | 0.677 | 375.3 Da LogP -0.55 TPSA 164.1 | 1 viol. | ✓ Clean | COP(=O)(OC)OC[C@H]1O[C@@H](n2cnc3c(N)ncnc32)[C@… |
| ZINC5615253 | 0.677 | 375.3 Da LogP -0.55 TPSA 164.1 | 1 viol. | ✓ Clean | COP(=O)(OC)OC[C@@H]1O[C@@H](n2cnc3c(N)ncnc32)[C… |
| ZINC5615258 | 0.677 | 375.3 Da LogP -0.55 TPSA 164.1 | 1 viol. | ✓ Clean | COP(=O)(OC)OC[C@H]1O[C@@H](n2cnc3c(N)ncnc32)[C@… |
| ZINC5615263 | 0.677 | 375.3 Da LogP -0.55 TPSA 164.1 | 1 viol. | ✓ Clean | COP(=O)(OC)OC[C@@H]1O[C@@H](n2cnc3c(N)ncnc32)[C… |
| ZINC1582675 | 0.667 | 403.3 Da LogP 0.23 TPSA 164.1 | 1 viol. | ✓ Clean | CCOP(=O)(OCC)OC[C@H]1O[C@@H](n2cnc3c(N)ncnc32)[… |
| ZINC5486730 | 0.667 | 403.3 Da LogP 0.23 TPSA 164.1 | 1 viol. | ✓ Clean | CCOP(=O)(OCC)OC[C@H]1O[C@@H](n2cnc3c(N)ncnc32)[… |
| ZINC5486734 | 0.667 | 403.3 Da LogP 0.23 TPSA 164.1 | 1 viol. | ✓ Clean | CCOP(=O)(OCC)OC[C@@H]1O[C@@H](n2cnc3c(N)ncnc32)… |
| ZINC5486740 | 0.667 | 403.3 Da LogP 0.23 TPSA 164.1 | 1 viol. | ✓ Clean | CCOP(=O)(OCC)OC[C@@H]1O[C@@H](n2cnc3c(N)ncnc32)… |
| ZINC59207431 | 0.642 | 416.3 Da LogP -1.52 TPSA 178.3 | 1 viol. | ✓ Clean | Nc1ncnc2c1ncn2[C@@H]1O[C@H](CO[P@](=O)(O)N2CCOC… |
| ZINC12959005 | 0.632 | 484.1 Da LogP -2.50 TPSA 264.4 | 2 viol. | ✓ Clean | O=c1ccn([C@@H]2O[C@H](CO[P@](=O)(O)O[P@](=O)(O)… |
| ZINC12959016 | 0.632 | 484.1 Da LogP -2.50 TPSA 264.4 | 2 viol. | ✓ Clean | O=c1ccn([C@@H]2O[C@@H](CO[P@](=O)(O)O[P@](=O)(O… |
| ZINC13548378 | 0.632 | 484.1 Da LogP -2.50 TPSA 264.4 | 2 viol. | ✓ Clean | O=c1ccn([C@@H]2O[C@H](CO[P@@](=O)(O)O[P@@](=O)(… |
| ZINC25726233 | 0.632 | 484.1 Da LogP -2.50 TPSA 264.4 | 2 viol. | ✓ Clean | O=c1ccn([C@H]2O[C@@H](CO[P@](=O)(O)O[P@](=O)(O)… |
| ZINC3861755 | 0.632 | 484.1 Da LogP -2.50 TPSA 264.4 | 2 viol. | ✓ Clean | O=c1ccn([C@@H]2O[C@H](CO[P@@](=O)(O)O[P@@](=O)(… |
| ZINC3875255 | 0.632 | 484.1 Da LogP -2.50 TPSA 264.4 | 2 viol. | ✓ Clean | O=c1ccn([C@H]2O[C@@H](CO[P@@](=O)(O)O[P@@](=O)(… |
| ZINC3875256 | 0.632 | 484.1 Da LogP -2.50 TPSA 264.4 | 2 viol. | ✓ Clean | O=c1ccn([C@@H]2O[C@@H](CO[P@@](=O)(O)O[P@@](=O)… |
| ZINC3875257 | 0.632 | 484.1 Da LogP -2.50 TPSA 264.4 | 2 viol. | ✓ Clean | O=c1ccn([C@H]2O[C@@H](CO[P@@](=O)(O)O[P@@](=O)(… |
| ZINC3875258 | 0.632 | 484.1 Da LogP -2.50 TPSA 264.4 | 2 viol. | ✓ Clean | O=c1ccn([C@@H]2O[C@@H](CO[P@@](=O)(O)O[P@@](=O)… |
| ZINC88466482 | 0.632 | 484.1 Da LogP -2.50 TPSA 264.4 | 2 viol. | ✓ Clean | O=c1ccn([C@@H]2O[C@H](CO[P@@](=O)(O)O[P@@](=O)(… |
| ZINC3871401 | 0.631 | 427.2 Da LogP -1.75 TPSA 232.6 | 2 viol. | ✓ Clean | Nc1ncnc2c1ncn2[C@H]1O[C@@H](COP(=O)(O)O)[C@@H](… |
| ZINC3871402 | 0.631 | 427.2 Da LogP -1.75 TPSA 232.6 | 2 viol. | ✓ Clean | Nc1ncnc2c1ncn2[C@@H]1O[C@@H](COP(=O)(O)O)[C@@H]… |
| ZINC3871403 | 0.631 | 427.2 Da LogP -1.75 TPSA 232.6 | 2 viol. | ✓ Clean | Nc1ncnc2c1ncn2[C@H]1O[C@@H](COP(=O)(O)O)[C@@H](… |
| ZINC3871404 | 0.631 | 427.2 Da LogP -1.75 TPSA 232.6 | 2 viol. | ✓ Clean | Nc1ncnc2c1ncn2[C@@H]1O[C@@H](COP(=O)(O)O)[C@@H]… |
| ZINC4096223 | 0.631 | 427.2 Da LogP -1.75 TPSA 232.6 | 2 viol. | ✓ Clean | Nc1ncnc2c1ncn2[C@@H]1O[C@H](COP(=O)(O)O)[C@@H](… |
PDB and ChEMBL records on this protein are shown in full. ChEMBL records from similar proteins are capped at the top 100 per protein (by pchembl) and ZINC at the top 50 (Tanimoto ≥ 0.5). ADME columns are descriptor-based screening flags, not experimental toxicity results.
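The Tanimoto score and the ZINC cap described above can be sketched as follows. Fingerprints are modelled here as plain sets of "on" bit indices, because the page does not state which fingerprint type it uses; the function names, the toy bit sets, and the candidate names are all hypothetical.

```python
from typing import Dict, List, Set, Tuple


def tanimoto(a: Set[int], b: Set[int]) -> float:
    """Tanimoto similarity: shared on-bits divided by on-bits in either set."""
    if not a and not b:
        return 0.0
    shared = len(a & b)
    return shared / (len(a) + len(b) - shared)


def shortlist(
    query: Set[int],
    library: Dict[str, Set[int]],
    min_score: float = 0.5,
    limit: int = 50,
) -> List[Tuple[str, float]]:
    """Keep candidates scoring at least min_score, best first, capped at limit."""
    scored = [(name, tanimoto(query, bits)) for name, bits in library.items()]
    kept = [item for item in scored if item[1] >= min_score]
    kept.sort(key=lambda item: item[1], reverse=True)
    return kept[:limit]


# Toy fingerprints (hypothetical), standing in for a known binder and candidates.
query_bits = {1, 2, 3, 4, 5, 6}
toy_library = {
    "cand_a": {1, 2, 3, 4, 5, 7},  # 5 shared bits out of 7 in the union
    "cand_b": {1, 2, 3},           # 3 shared bits out of 6 in the union
    "cand_c": {8, 9},              # no shared bits
}

hits = shortlist(query_bits, toy_library)
print(hits)
```

With the default arguments this mirrors the stated ZINC filter (Tanimoto ≥ 0.5, top 50); `cand_c` is dropped because it shares no bits with the query.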