Cartoon modeling of proteins

Download Report

Transcript Cartoon modeling of proteins

Cartoon modeling of proteins
Fred Howell and Dan Mossop
ANC
Informatics
Overview


Why / how to model intracellular processes?
Examples: MCell, Stochsim, Virtual Cell

Cartoon models
Where's the data on structure / interactions?

A new 3D protein interaction simulator





post synaptic density self-assembly
vesicle formation
vesicle transport
Futures & speculations
Why / how to model intracellular processes?





Ordered soup of ~1,000,000 different types of macromolecules
Complex and specific network of interactions
Ion channels and complexes the tip of the iceberg (croutons?)
Much work on gene networks / intracellular pathways
Mostly ignores spatial effects (well mixed pool / kinetics)

Hypothesis of mechanisms typically involve cartoon descriptions /
precise shapes / jigsaw-like interactions of proteins

Computer models typically don't
Intracellular pathway modeling

Single mixed pool:



A number of connected compartments


Virtual cell
Individual molecules / brownian motion


Rate equations / kinetics (as differential equations)
Stochastic simulators (Stochsim)
MCell
... but none of them take into account the actual shapes of proteins!
Single protein modeling

The great protein folding problem - what shapes can the sequence
form?

Uses molecular dynamics (motion of each atom in the molecule) to try
and predict low energy folding conformations of primary sequence


hard, not there yet
Intermediate protein modeling - recognise characteristic subsequences
of amino acids, guess substructures like alpha helices, beta sheets

promising, not there yet

Timescales of femto- and pico- seconds

... data available from crystallography on some proteins (PDB)
... predicting binding sites is very hard

Cartoon models

Typically used to hypothesise mechanisms
Getting data on protein shapes

PDB: coordinates of each atom in protein

One possibility: cluster analysis to reduce to a number of subunits
Getting data on protein interactions

This is harder

Ideally would like binding sites, bond angles, bond strengths

Typically get "A does / does not interact with B (probably)"

... but the situation is set to improve as more data becomes
available in databases
So, how to build models?

Cheat - use a mixture of real and hypothesised model proteins
A new protein interaction simulator







proteins modeled as simplified 3D structures including a number of
subunits / binding sites / conformational states
water not modeled explicitly
proteins moved by brownian motion
bonding / state transition probabilities set as parameters
collision detection
in version 1 protein complexes modeled as rigid structures
membranes modeled as a restriction to 2D diffusion of membrane
bound proteins (still free to rotate)
Example models

(1) Formation of the post synaptic density - a model of recruitment of
AMPA receptors to the vicinity of activated NMDA receptors

(2) Self assembly of clathrin coated vesicles

(3) Transport of vesicles using kinesin
The common theme

Throw together an unordered collection of proteins, with specific
binding sites, interactions and probabilities

Evolve the system through time

See if complex shapes and processes emerge
Example 1 - post synaptic density

NMDAr
glue
AMPAr
CAM KII
Example 2 - Vesicle formation

Clathrin:-
Example 4 - Kinesin

Input - a motor protein model, stable states / transitions / binding
cause it to walk up microtubules carrying its payload
Details of simulator (and approaches tried)




Fluid dynamics?
DPD?
MD?
Monte-carlo?
Simulator design:



XML model description (protein shapes, initial state, binding sites
and probabilities)
Java simulation engine for state updates
Java3D visualisation
Futures: modeling technology

Add spring constants to bonds (rather than completely rigid)

More sophisticated models of membranes (rather than a 2D
restriction on diffusion)

Efficient cytoskeleton models?

Explicit water? Small ions?

Auto generation from databases of protein shapes and interactions?
Futures: applications






DNA replication machinery (helicase / polymerase)
Snares / vesicle docking / budding (a model of Golgi apparatus?)
Full molecular model of a dendritic spine receiving an burst of
transmitter
Ribosome operation
Entire process of cell division (dna replication + microtubule
formation + motor protein separation + control sequences)
Self assembly of viruses from their coat proteins
A model of parallel processing?

How does this ordered soup of proteins maintain a such a large
number of tightly synchronised feedback control systems?

Could it be a useful model of computation in its own right? The well
mixed case is:



we have a memory of 1,000,000 different variables (one per protein)
we have specific probabilties of transitions between these
we have mechanisms for synthesising and destroying proteins
Adding 3D structure we also get:

some combinations of these variables form substructures with specific
properties
interactions depend on where the proteins are
Conclusions

We can build 3D models of protein systems to test and visualise
hypothesis about how structures can form

We still don't have a good way to model all the intracellular
complexity

Perhaps we should focus on molecular models of viruses and
bacteria before attempting eukaryotic cells?

Thanks to Dan Mossop for doing all the work