Hierarchical Database Screenings for HIV
Download
Report
Transcript Hierarchical Database Screenings for HIV
Hierarchical Database Screenings for HIV-1
Reverse Transcriptase Using a Pharmacophore
Model, Rigid Docking, Solvation Docking, and
MM-PB/SA
Junmei Wang, Xinshan Kang, Irwin D.Kuntz, and Peter A. Kollman
Encysive Pharmaceuticals Inc.
University of California, San Francisco
Presentation by Susan Tang
CS 379A
Background
•
There are two approaches to identifying drug leads
–
De novo design
• Aimed to design novel compounds that have electrostatic and hydrophobic
properties complementary to target
• Requires 3D structures of drug targets
–
Database screening
• Applies filters to identify potential drug leads from databases
• Can be divided into query-based and scoring-function-based methods
• Only scoring-function-based methods requires 3D structures of drug targets
1) Query-based screenings
- Search queries such as MW, #H-bond donors/acceptors, and pharmacophore
models are applied to database
- Computationally efficient since 3D structures are not used
- Wrong query fields may produce too high/too low # of hits
2) Scoring-function based approaches
- Apply target functions (typically free-energy calculations of inhibitor binding
to target) to obtain hits
- The most rigorous and accurate methods of free energy calculation are FEP
(Free energy perturbation) and TI (Thermodynamic integration) but they
are too computationally intensive and thus not appropriate for DB screening
- There are several alternative methods as well (such as MM-PB/SA)
Purpose
•
Purpose: To develop a method for the identification of HIV-1 RT drug leads
using hierarchical database screening
•
Sequential Methods Used
1) Pharmacophore model
2) Multiple-conformation rigid docking
3) Solvation docking
4) MM-PB/SA (Molecular Mechanics-Poisson-Boltzman/surface area)
•
Significance of HIV-1 Reverse Transcriptase
– Important target in AIDS-related drug design
– Biological role is to transcribe viral RNA into dsDNA, which is necessary for
viral replication
– Recently, many crystal structures of NNRTI’s (non-nucleoside reverse
transcriptase inhibitors) with HIV-1 RT have been solved
– Since 3-D structures are available, HIV-1 RT poses as a good target for
drug lead development/screening
By showing that their methodology is accurate for HIV-1 RT, the authors
hope to demonstrate that the method can be widely applied to other
systems where target 3D structures are available.
Method Outline and Evaluation
Database = Refined ACD (Available Chemical Directory) DB of 150,000 compounds
Evaluation Criteria for Database Screening Performance
1) Hit rate =
known inhibitors that passed filter(s)
total number of known inhibitors in database
2) Enrichment factor = (Hit rate) x total number of compounds in database
total number of hits that passed filter(s)
Computational Methods
Filter 1: Pharmacophore Model
What is a pharmacophore model?
Defined as the three-dimensional arrangement of atoms - or groups of atoms –
responsible for the biological activity of a drug molecule.
19 crystal structures of HIV-1 RT
in complex with NNRTI’s
tri-feature pharmacophore model
Computational Methods
Filter 1: Pharmacophore Model
head
wing
wing
tail
•
•
•
19 HIV-1RT/NNRTI crystal structures were superimposed on PDB structure
1uwb (HIV-1 RT/TBO)
Spheres indicate where inhibitor atoms reside
Overall shape of bound inhibitors is like a butterfly (allosteric binding site of
enzyme)
Computational Methods
Filter 1: Pharmacophore Model
•
•
•
•
•
Tri-featured pharmacophore model designed from the “butterfly” shape
X1 : represents a 5 or 6 membered aromatic ring
X2 : represents a 5 to 7 membered ring
X3 : represents nitrogen, oxygen, or sulfur
Distinct distance patterns were also identified
Computational Results
Filter 1: Pharmacophore Model
-
Average RMSD of the 19 superimposed NNRTI’s = 0.86 angs.
40,000 compounds / 150,000 passed this filter
Hit rate = 95 %
Enrichment factor = 3.56
Computational Methods
Filter 2: Multiple-Conformation Rigid Docking
•
•
•
•
•
•
Spheres, where inhibitor atoms could potentially be, were highlighted on
HIV-1 RT/TBO reference structure
Cluster analysis selected one cluster consisting of 30-40 spheres around the
binding site and chose this as a center for docking
Conformational searches for the hits having passed Filter 1
Average Number of searched conformations for each molecule = 30
Rigid Docking was performed for all conformations
Crucial docking parameters:
1) Maximum orientations = 1000
2) Minimum matching nodes = 4
3) Maximum matching nodes = 15
4) No intramolecular score
5) Dielectric constant = 4.0
Computational Results
Filter 2: Multiple Conformation Rigid Docking
-
Average RMSD of the 19 superimposed NNRTI’s = 0.86 angs.
16,000 compounds / 40,000 had atleast 1 conformation that passed this filter
Hit rate = 76 %
Enrichment factor = 1.89
Computational Methods
Filter 3: Solvation Docking
•
•
•
•
•
•
•
Solvation docking parameters in the binding free energy formula could easily
vary from system to system
To derive solvation docking model specific for HIV-1 RT, a training set of 12
known HIV-1 RT/NNR-TI crystal structures were used
Each molecule in training set had an RMSD < 3.0 angstroms between the
docked and crystal structure
Parameters (alpha, beta, gamma) in formula I were optimized to reproduce
experimental binding free energies
Formula I:
Solvation docking was performed for molecules having passed filter II using a
solvation docking program
Program outputs the following terms:
1)VDW energy (hydrophobic interaction)
2)Screened electrostatic energy
3)Polar and non-polar accessible surface areas
Using derived solvation docking model, binding free energies were calculated
Computational Results
Filter 3: Solvation Docking
-
The solvation docking model with the following coefficients was produced
( Alpha = 0.1736, beta = 0.1709, gamma = 0.0049 )
-
Solvation docking model achieved average unsigned and rms errors of 1.03
and 1.16 kcal/mol between deltaG(calc) and deltaG(expt) for the training set
Computational Results
Filter 3: Solvation Docking
-
3360 compounds / 16,000 passed this filter with a threshold of –8.8 kcal/mol
Hit rate = 79 %
Enrichment factor = 3.74
Computational Methods
Filter 4: MM-PB/SA
•
•
First 3 filters: only ligand flexibility was taken into account
Current filter: application of MD simulations to sample conformational space
of BOTH inhibitor and receptor
•
•
For each molecule, MD simulations were done at 300 K with 2.0 fs time step
MD simulations carried out using this formula:
•
The inhibitor, water molecules, and receptor residues that are within 20
angs. Of inhibitor mass center were allowed to move during the simulations
Equilibration for 50 ps 20 snapshots were collected
For each snapshot: MM-PB/SA analysis was performed to calculate binding
free energy
•
•
Computational Results
Filter 4: MM-PB/SA
-
-
Because this is the most time/resource demanding step, MM-PB/SA was only
done on the 22 molecules in the control set & 30 top hits that passed Filter 3
16 / 22 control hits from Filter 3 yielded MM-PB/SA scorese < - 6.8 kcal/mol
10 / 30 top hits tested yielded MM-PB/SA scores < - 6.8 kcal/mol
Best hit had a binding free energy of – 17.7 kcal/mol (likely to be a real HIV1 RT inhibitor)
Summary
Results
•
Overall, 16/37 known NNRTs survived all filters
•
Overall hit rate = 41 %
•
Hit rate (first 3 filters) = 56 %
•
Enrichment rate (first 3 filters) = 25
Translates to: the probability of finding a real inhibitor randomly
from the hits of the first 3 filters is 25 fold higher than from the
whole database
Conclusion
The hierarchical multiple-filter database searching strategy attained both
high efficiency and high reliability, making it a viable option for drug lead
discovery.
Future Development
•
Making the time/resource limiting step, MM-PB/SA, more efficient
1) Run MD simulations using implicit (rather than explicit) water models
such as GB/SA and PB/SA
2) Development of new algorithm to calculate entropy accurately and
efficiently