Dispensing Processes Profoundly Impact Biological, Computational

Dispensing Processes Profoundly Impact
Biological, Computational and Statistical Analyses
Sean Ekins1, Joe Olechno2, Antony J. Williams3
1 Collaborations in Chemistry, Fuquay Varina, NC.
2 Labcyte Inc, Sunnyvale, CA.
3 Royal Society of Chemistry, Wake Forest, NC.
Disclaimer: SE and AJW have no affiliation with Labcyte and have
not been engaged as consultants
Where do scientists get chemistry/biology data?
• Databases
• Patents
• Papers
• Your own lab
• Collaborators
• Some or all of the above?
• What is common to all? – quality issues
“If I have seen further than others, it is by standing upon the shoulders of giants.” Isaac Newton
Data can be found, but drug structure quality is important
• More groups are doing in silico repositioning
• Target-based or ligand-based
• Network and systems biology
• These approaches integrate or use sets of FDA drugs; if the structures are incorrect, the predictions will be too
• Need a definitive set of FDA-approved drugs with correct structures
• Also need linkage between in vitro data and clinical data
Structure Quality Issues
Database released, and within days hundreds of errors were found in structures
Science Translational Medicine 2011
NPC Browser http://tripod.nih.gov/npc/
DDT 17: 685-701 (2012)
DDT, 16: 747-750 (2011)
DDT editorial Dec 2011
This editorial led to the current work http://goo.gl/dIqhU
It's not just structure quality we need to worry about
Finding structures of Pharma molecules is hard
NCATS and MRC made molecule identifiers from pharmas available with no structures
Southan et al., DDT, 18: 58-70 (2013)
How do you move a liquid?
Images courtesy of Bing, Tecan
Plastic leaching
McDonald et al., Science 2008, 322, 917.
Belaiche et al., Clin Chem 2009, 55, 1883-1884.
Moving Liquids with sound: Acoustic Droplet Ejection (ADE)
Acoustic energy expels droplets without physical contact
• Extremely precise
• Extremely accurate
• Rapid
• Auto-calibrating
• Completely touchless
• No cross-contamination
• No leachates
• No binding
Images courtesy of Labcyte Inc. http://goo.gl/K0Fjz
Using literature data from different dispensing methods to generate computational models
Few molecule structures and corresponding datasets are public
Using data from 2 AstraZeneca patents: tyrosine kinase EphB4 pharmacophores (Accelrys Discovery Studio) were developed using data for 14 compounds
IC50 determined using different dispensing methods
Analyzed correlation with simple descriptors (SAS JMP)
Calculated LogP correlation with log IC50 data for acoustic dispensing (r2 = 0.34, p < 0.05, N = 14)
Barlaam, B. C.; Ducray, R., WO 2009/010794 A1, 2009
Barlaam, B. C.; Ducray, R.; Kettle, J. G., US 7,718,653 B2, 2010
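As a rough consistency check on the reported LogP correlation (r2 = 0.34, N = 14, p < 0.05), the usual t-test for a Pearson correlation can be sketched as below. This is only an illustration and not the SAS JMP analysis used for the slide; the function name `t_statistic` is hypothetical, and the critical value is taken from a standard two-sided Student's t table for 12 degrees of freedom.

```python
import math


def t_statistic(r_squared: float, n: int) -> float:
    """t statistic for testing a Pearson correlation against zero (df = n - 2)."""
    r = math.sqrt(r_squared)
    return r * math.sqrt((n - 2) / (1.0 - r_squared))


# Two-sided 5% critical value of Student's t for df = 12 (standard t-table value).
T_CRIT_DF12 = 2.179

# Values reported on the slide: r2 = 0.34 with N = 14 compounds.
t_value = t_statistic(0.34, 14)

print(f"t = {t_value:.2f}, critical value = {T_CRIT_DF12}")
if t_value > T_CRIT_DF12:
    print("consistent with p < 0.05")
else:
    print("not significant at p < 0.05")
```

The same function can be applied to any other r2 and N pair quoted in the slides, provided the matching critical value for N - 2 degrees of freedom is looked up.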
14 compounds with structures and IC50 data.

Compound #   IC50 Acoustic (µM)   IC50 Tips (µM)   Ratio IC50Tip/IC50ADE
5            0.002                0.553            276.5
4            0.003                0.146            48.7
7            0.003                0.778            259.3
W7b          0.004                0.152            42.5
8            0.004                0.445            111.3
W5           0.006                0.087            13.7
6            0.007                0.973            139.0
W3           0.012                0.049            4.2
W1           0.014                0.112            8.2
9            0.052                0.170            3.3
10           0.064                0.817            12.8
W12          0.158                0.250            1.6
W11          0.207                14.400           69.6
11           0.486                3.030            6.2
Barlaam, B. C.; Ducray, R., WO 2009/010794 A1, 2009
Barlaam, B. C.; Ducray, R.; Kettle, J. G., US 7,718,653 B2, 2010
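The Ratio IC50Tip/IC50ADE column is simply the tip-based IC50 divided by the acoustic IC50 for each compound. A minimal sketch of that calculation follows, using the IC50 values as transcribed from the table. Because the displayed IC50 values are rounded to three decimals, some recomputed ratios will differ somewhat from the ratios shown on the slide; the variable names are illustrative only.

```python
# (compound, IC50 acoustic in µM, IC50 tips in µM), transcribed from the table.
DATA = [
    ("5", 0.002, 0.553), ("4", 0.003, 0.146), ("7", 0.003, 0.778),
    ("W7b", 0.004, 0.152), ("8", 0.004, 0.445), ("W5", 0.006, 0.087),
    ("6", 0.007, 0.973), ("W3", 0.012, 0.049), ("W1", 0.014, 0.112),
    ("9", 0.052, 0.170), ("10", 0.064, 0.817), ("W12", 0.158, 0.250),
    ("W11", 0.207, 14.400), ("11", 0.486, 3.030),
]

# Ratio = IC50 (tips) / IC50 (acoustic); values above 1 mean the compound
# appeared more potent when dispensed acoustically.
ratios = {name: tip / ade for name, ade, tip in DATA}

for name, ratio in ratios.items():
    print(f"{name:>4}  ratio = {ratio:6.1f}")

more_potent_acoustic = sum(1 for r in ratios.values() if r > 1)
print(f"{more_potent_acoustic} of {len(ratios)} compounds more potent with acoustic dispensing")
```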
A graph of the log IC50 values for tip-based serial dilution and dispensing versus acoustic dispensing with direct dilution shows a poor correlation between techniques (R2 = 0.246).
[Figure: log IC50-tips (y-axis) vs. log IC50-acoustic (x-axis), both axes from -3 to 1.5]
The acoustic technique always gave a more potent IC50 value.
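The correlation between the two techniques can be recomputed from the 14 pairs of IC50 values in the table. The sketch below takes log10 of each value and computes the squared Pearson correlation by hand. It assumes the rounded IC50 values shown in the slides, so the result should be close to, but not necessarily identical to, the reported R2 of 0.246; `r_squared` is a hypothetical helper name.

```python
import math

# IC50 values (µM) in table order, transcribed from the slides.
ACOUSTIC = [0.002, 0.003, 0.003, 0.004, 0.004, 0.006, 0.007,
            0.012, 0.014, 0.052, 0.064, 0.158, 0.207, 0.486]
TIPS = [0.553, 0.146, 0.778, 0.152, 0.445, 0.087, 0.973,
        0.049, 0.112, 0.170, 0.817, 0.250, 14.400, 3.030]


def r_squared(xs, ys):
    """Squared Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return (sxy * sxy) / (sxx * syy)


log_acoustic = [math.log10(v) for v in ACOUSTIC]
log_tips = [math.log10(v) for v in TIPS]

r2 = r_squared(log_acoustic, log_tips)
print(f"R2 (log IC50 tips vs. log IC50 acoustic) = {r2:.3f}")
```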
Experimental Process
14 structures with data → generate pharmacophore models for EphB4 receptor (acoustic model, tip-based model) → test models against new data → test models against X-ray crystal structure pharmacophores → results
Initial data set of 14: WO2009/010794, US 7,718,653
Independent data set of 12: WO2008/132505
Independent crystallography data: Bioorg Med Chem Lett 18:2776; 18:5717; 20:6242; 21:2207
Tyrosine kinase EphB4 Pharmacophores
Generated with Discovery Studio (Accelrys)
Cyan = hydrophobic
Green = hydrogen bond acceptor
Purple = hydrogen bond donor
Each model shows most potent molecule mapping
[Figure: two pharmacophore models, labeled Acoustic and Tip based]

Process                     Hydrophobic features (HPF)   Hydrogen bond acceptor (HBA)   Hydrogen bond donor (HBD)   Observed vs. predicted IC50 r
Acoustic mediated process   2                            1                              1                           0.92
Tip-based process           0                            2                              1                           0.80

Ekins et al., PLOS ONE, in press
Test set evaluation of pharmacophores
• An additional 12 compounds from AstraZeneca (Barlaam, B. C.; Ducray, R., WO 2008/132505 A1, 2008)
• 10 of these compounds had data for tip-based dispensing and 2 for acoustic dispensing
• Calculated LogP and logD showed low but statistically significant correlations with tip-based dispensing (r2 = 0.39, p < 0.05 and r2 = 0.24, p < 0.05; N = 36)
• Used as a test set for pharmacophores
• The two compounds analyzed with acoustic liquid handling were predicted in the top 3 using the ‘acoustic’ pharmacophore
• The ‘tip-based’ pharmacophore failed to rank the retrieved compounds correctly
Automated receptor-ligand pharmacophore generation method
Pharmacophores for the tyrosine kinase EphB4 generated from crystal structures in the Protein Data Bank (PDB) using Discovery Studio version 3.5.5
Cyan = hydrophobic
Green = hydrogen bond acceptor
Purple = hydrogen bond donor
Grey = excluded volumes
Each model shows most potent molecule mapping
Bioorg Med Chem Lett 2010, 20, 6242-6245.
Bioorg Med Chem Lett 2008, 18, 5717-5721.
Bioorg Med Chem Lett 2008, 18, 2776-2780.
Bioorg Med Chem Lett 2011, 21, 2207-2211.
Summary
• In the absence of structural data, pharmacophores and other computational and statistical models are used to guide medicinal chemistry in early drug discovery.
• Our findings suggest acoustic dispensing methods could improve HTS results and avoid the development of misleading computational models and statistical relationships.
• Automated pharmacophores are closer to the pharmacophore generated with acoustic data: all have hydrophobic features, which are missing from the tip-based pharmacophore model.
• The importance of hydrophobicity is seen in the logP correlation and the crystal structure interactions.
• Public databases should annotate the dispensing method as meta-data alongside biological data points, to create larger datasets for comparing different computational methods.
[Figure: four scatter plots comparing tip-based (serial dilution) and acoustic results. NOTE DIFFERENT ORIENTATION of the axes between panels.]
Panel: Serial dilution IC50 (μM) vs. acoustic IC50 (μM), linear axes from 0 to 50
Panel: Acoustic vs. Tip-based Transfers, acoustic % inhibition vs. aqueous % inhibition, axes from -40 to 100
Panel: Serial dilution IC50 (μM) vs. acoustic IC50 (μM), log axes from 10^-3 to 10^4
Panel: Data in this presentation, log IC50 tips vs. log IC50 acoustic
Adapted from Spicer et al., Presentation at Drug Discovery Technology, Boston, MA, August 2005
Adapted from Wingfield. Presentation at ELRIG2012, Manchester, UK
Adapted from Wingfield et al., Amer. Drug Disco. 2007, 3(3):24
No Previous Analysis of molecule properties
Strengths and Weaknesses
• Small dataset size, focused on one compound series
• No previous publication describing how data quality can be impacted by dispensing and how this in turn affects computational models and downstream decision making.
• No comparison of pharmacophores generated from acoustic dispensing and tip-based dispensing.
• No previous comparison of pharmacophores generated from in vitro data with pharmacophores automatically generated from X-ray crystal conformations of inhibitors.
• Severely limited by number of structures in public domain with data in both systems
• Reluctance of many to accept that this could be an issue
• Ekins et al., PLOS ONE, in press
The stuff of nightmares?
• How much of the data in databases is generated by tip-based serial dilution methods?
• How much is erroneous?
• Do we have to start again?
• How does it affect all subsequent science (data mining, etc.)?
• Does it impact Pharma's productivity?
Simple Rules for licensing “open” data
Could data ‘open accessibility’ equal ‘Disruption’?
As we see a future of increased database integration, the licensing of the data may be a hurdle that hampers progress and usability.
1: NIH and other international scientific funding bodies should mandate …open accessibility for all data generated by publicly funded research immediately
Williams, Wilbanks and Ekins. PLoS Comput Biol 8(9): e1002706, 2012
Ekins, Waller, Bradley, Clark and Williams. DDT, 18:265-71, 2013
You can find me @...
CDD Booth 205
PAPER ID: 13433
PAPER TITLE: “Dispensing processes profoundly impact biological assays and computational and statistical analyses”
April 8th 8.35am Room 349
PAPER ID: 14750
PAPER TITLE: “Enhancing High Throughput Screening For Mycobacterium tuberculosis Drug Discovery Using Bayesian Models”
April 9th 1.30pm Room 353
PAPER ID: 21524
PAPER TITLE: “Navigating between patents, papers, abstracts and databases using public sources and tools”
April 9th 3.50pm Room 350
PAPER ID: 13358
PAPER TITLE: “TB Mobile: Appifying Data on Anti-tuberculosis Molecule Targets”
April 10th 8.30am Room 357
PAPER ID: 13382
PAPER TITLE: “Challenges and recommendations for obtaining chemical structures of industry-provided repurposing candidates”
April 10th 10.20am Room 350
PAPER ID: 13438
PAPER TITLE: “Dual-event machine learning models to accelerate drug discovery”
April 10th 3.05pm Room 350