Active Guidance for a Finless Rocket using Neurevolution
Download
Report
Transcript Active Guidance for a Finless Rocket using Neurevolution
Active Guidance for a Finless
Rocket using Neurevolution
Gomez, F.J. and
Miikkulainen, R.
Content
Background
Enforced SubPopulations
Rocket Model
Experimental Results
Discussion
Background
Rockets used for scientific
research in upper atmosphere
Finless rockets can reach higher
than rockets with fins
Finless rockets are unstable, and
require active guidance
Background
Background
Highly complex environment:
–
drag is complex function of time, speed and altitude
Background
Neural networks can be used to model these
complexities without it knowing all the details
Use Enforced SupPopulations neurevolution to
learn an active guidance control system for the
RSX-2 rocket
Enforced SubPopulations
Extension of Symbiotic, Adaptive Neurevolution
(SANE)
–
–
Evolve individual neurons instead of entire network
Unlike SANE, use individual populations for each
neuron
Enforced SubPopulations:
Advantages and disadvantages
No loss of performance due to recombination
across different specialized neurons
Neurons are evaluated working together
Specialization causes diversity decline
–
–
–
Result: evolution gets stuck in local optimum
Remedy: burst mutation
When performance stagnates, add noise to the
populations according to some distribution
Enforced SubPopulations:
Algorithm (1/2)
Initialization:
–
Create a random population for each hidden unit
Evaluation:
–
–
–
Select a random neuron from each population
Determine fitness of resulting trial model
Repeat to accumulate fitness of individual neurons
over several trial models
Enforced SubPopulations:
Algorithm (2/2)
Recombination:
–
–
Neurons in top quartile average fitness are
recombined using 1-point crossover and mutation
Replace lowest half of neurons with recombined
neurons
Repeat evaluation and recombination until
performance of entire model is sufficient
Rocket Model
RSX-2 rocket is modeled in simulation package
A neural network determines the thrust of each
of the 4 thrusters between 90% and 100% of
maximum throttle
Fitness of a neural network is determined by
the altitude reached before burnout or failure
–
–
Failure occurs when a or b exceed 5 degrees
Networks with continuous 100% thrust get fitness 0
Rocket Model
Rocket Model
10 hidden units, 200 individuals per
subpopulation
Problem is too hard; neurons get stuck in a
local maximum at an early stage
Solution: incremental evolution
–
–
Perform ESP for a rocket with small fins
Use the output as a starting point for a finless rocket
Experimental Results
Experimental Results
Experimental Results
Controller generated without formal knowledge
of the system or correct behaviour
Future work
–
–
–
Remove a or b as input of the network
Change output of network to binary to simplify
rocket hardware
Incorporate noise and wind
Discussion