Exercise
You saw earlier that depending on the parameters, you could expect annual, biennial, or more complex dynamics from two interacting strains–or from a single strain with seasonal forcing. The differences in these dynamics and the uncertainties of our parameter estimates make it essential to fit models rather than rely on correlations and other (often incorrect) rules of thumb.
One of the most powerful methods for analyzing time series is a sequential Monte Carlo (a.k.a. particle filtering) method, multiple iterated filtering for partial observed Markov Processes (MIF for POMP). It’s powerful because it incorporates both process noise and sampling error, thus reducing bias and overconfidence in the estimated parameters. Links to tutorials on pomp are in the Additional Resources, and the developers (Ed Ionides and Aaron King) teach an entire SISMID course on the method, which this brief exercise doesn’t replace! Pomp and its ilk are among the best methods for fitting stochastic nonlinear models, although they can be a little difficult to use in practice–they’re computationally intense. Despite these methods’ power, complex models, including multistrain models, may not converge in the time scale we care about.
In our limited time, we’ll briefly practice fitting our model using a simple error function and simple optimization routines for minimizing the error. (This function can be extended to a likelihood if we have time.) The main motivation for these choices is to give you a sense of (1) where optimizers can get stuck, and which parameters and dynamical regimes are most challenging to fit, and (2) the dangers of model misspecification, especially assumptions related to noise. In other words, this tutorial is meant to teach you through vivid example what not to do. The methods we’ll use’ are available in the scipy.optimize package. Once you understand their performance, it might be fun to try to improve on the fits to the same data set using pomp or other methods on your own.
For this exercise, we’ll initially fit by minimizing squared differences using a Nelder-Mead simplex search. Nelder-Mead is a “derivative-free” method that picks n+1 parameter combinations to form the vertices of a simplex, evaluates the error (or likelihood) at each point, and updates the worst-scoring point. The description below, directly from Bolker 2007), describes the simplex’s subsequent movement:
- attempt to move in the best direction by reflecting the worst point in the simplex through the opposite face;
- if the fit at the new point is better than the best other point in the simplex, double the length of the jump in that direction;
- if this jump was bad – the height at the new point is worse than the second-worst point in the simplex –then try a point that’s only half as far out as the initial try;
- if this second try, closer to the original, is also bad, then contract the simplex around the current best point.
Most fitting methods in use are more complex, efficient, and flexible than the Nelder-Mead simplex. See it in action.
(1) Examine the error function in
fitting.py
. What are its assumptions? Note that most of the variables that are modeled are effectively latent, or unobserved. (A property of hidden Markov models.)
(2) Referencing your bifurcation diagrams, pick parameters in an area associated with stable dynamics: moving a little to the left or the right of each parameter doesn’t appear to change dynamics too much. (Of course, because we’ve considered only a few “slices” by varying one parameter at a time, we can’t be really sure how stable each area is yet.) Update the parameters in
fitting.py
with these values, and generate sample observations. Now assume we don’t know the true value of one of the parameters (ideally one you have examined in a bifurcation diagram). Guess that its true value is 10-20% higher or lower than its actual value, and try to fit.
(3) Now try again, starting from a slightly different location.
(4) If your fits have been pretty good so far, start from a very different point, ideally one corresponding to different dynamics.
(5) Assume you have incorrectly estimated the duration of infection, but you do not know you’ve made a mistake. Change its value in
error_function
, but do not fit it. Repeat a successful optimization from step 2. How are your results affected?
(6) Try fitting two parameters simultaneously.
(7) Add observation error to your data and attempt a previously successful fit. What happens when the error is not normal (“colored”)?
(8) Add process noise to your model and attempt a previously successful fit.
This exercise should have efficiently demonstrated the critical importance of using inference procedures that accommodate process and observation error, as well as the challenges fitting complex dynamics.