Similar Markov Chains

SIMILAR MARKOV CHAINS

Phil Pollett

Department of Mathematics
The University of Queensland

There is an accompanying paper:

Pollett, P.K. (2001) Similar Markov chains. Journal of Applied Probability 38A (to appear).

From that paper:

"I was introduced to David Vere-Jones' work by another of my mentors, Charles Pearce, whose lively undergraduate lectures on the Vere-Jones theory did much to convince me that I should follow a probability path. David's work has been a constant source of inspiration to me and I continue to benefit from our fruitful and enjoyable collaborations. I am grateful to him personally for his advice, encouragement and friendship over many years and I therefore take great pleasure in dedicating this short note to him."

From those lectures:

MAIN REFERENCES

Convergence of Markov transition probabilities and their spectral properties

1. Vere-Jones, D. Geometric ergodicity in denumerable Markov chains. Quart. J. Math. Oxford Ser. 2 13 (1962) 7-28.

2. Vere-Jones, D. On the spectra of some linear operators associated with queueing systems. Z. Wahrscheinlichkeitstheorie und Verw. Gebiete 2 (1963) 12-21.

3. Vere-Jones, D. Ergodic properties of nonnegative matrices. I. Pacific J. Math. 22 (1967) 361-386.

4. Vere-Jones, D. Ergodic properties of nonnegative matrices. II. Pacific J. Math. 26 (1968) 601-620.

Classification of transient Markov chains and quasi-stationary distributions

5. Seneta, E.; Vere-Jones, D. On quasi-stationary distributions in discrete-time Markov chains with a denumerable infinity of states. J. Appl. Probability 3 (1966) 403-434.

6. Vere-Jones, D. Some limit theorems for evanescent processes. Austral. J. Statist. 11 (1969) 67-78.

Related work

7. Vere-Jones, D.; Kendall, David G. A commutativity problem in the theory of Markov chains. Teor. Veroyatnost. i Primenen. 4 (1959) 97-100.

8. Vere-Jones, D. A rate of convergence problem in the theory of queues. Teor. Verojatnost. i Primenen. 9 (1964) 104-112.

9. Vere-Jones, D. Note on a theorem of Kingman and a theorem of Chung. Ann. Math. Statist. 37 (1966) 1844-1846.

10. Heathcote, C. R.; Seneta, E.; Vere-Jones, D. A refinement of two theorems in the theory of branching processes. Teor. Verojatnost. i Primenen. 12 (1967) 341-346.

11. Rubin, H.; Vere-Jones, D. Domains of attraction for the subcritical Galton-Watson branching process. J. Appl. Probability 5 (1968) 216-219.

12. Seneta, E.; Vere-Jones, D. On the asymptotic behaviour of subcritical branching processes with continuous state space. Z. Wahrscheinlichkeitstheorie und Verw. Gebiete 10 (1968) 212-225.

13. Fahady, K. S.; Quine, M. P.; Vere-Jones, D. Heavy traffic approximations for the Galton-Watson process. Advances in Appl. Probability 3 (1971) 282-300.

14. Pollett, P. K.; Vere-Jones, D. A note on evanescent processes. Austral. J. Statist. 34 (1992), no. 3, 531-536.

Important early work on quasi-stationary distributions

Yaglom, A.M. Certain limit theorems of the theory of branching processes (Russian) Doklady Akad. Nauk SSSR (N.S.) 56 (1947) 795-798.

Bartlett, M.S. Deterministic and stochastic models for recurrent epidemics. Proceedings of the Third Berkeley Symposium on Mathematical Statistics and Probability, 1954-1955, Vol. IV, pp. 81-109. University of California Press, Berkeley and Los Angeles, 1956.

Bartlett, M.S. Stochastic population models in ecology and epidemiology. Methuen's Monographs on Applied Probability and Statistics Methuen & Co., Ltd., London; John Wiley & Sons, Inc., New York, 1960.

Darroch, J. N.; Seneta, E. On quasi-stationary distributions in absorbing discrete-time finite Markov chains. J. Appl. Probability 2 (1965) 88-100.

Darroch, J. N.; Seneta, E. On quasi-stationary distributions in absorbing continuous-time finite Markov chains. J. Appl. Probability 4 (1967) 192-196.

Important early work on quasi-stationary distributions

Mandl, Petr Sur le comportement asymptotique des probabilités dans les ensembles des états d'une chaîne de Markov homogène (Russian) Casopis Pest. Mat. 84 (1959) 140-149.

Mandl, Petr On the asymptotic behaviour of probabilities within groups of states of a homogeneous Markov process (Czech) Casopis Pest. Mat. 85 (1960) 448-456.

Ewens, W.J. The diffusion equation and a pseudo-distrib-ution in genetics. J. Roy. Statist. Soc., Ser B 25 (1963) 405-412.

Kingman, J.F.C. The exponential decay of Markov transition probabilities. Proc. London Math. Soc. 13 (1963) 337-358.

Ewens, W.J. The pseudo-transient distribution and its uses in genetics. J. Appl. Probab. 1 (1964) 141-156.

Seneta, E. Quasi-stationary distributions and time-rever-sion in genetics. (With discussion) J. Roy. Statist. Soc. Ser. B 28 (1966) 253-277.

Seneta, E. Quasi-stationary behaviour in the random walk with continuous time. Austral. J. Statist. 8 (1966) 92-98.

DISCRETE-TIME CHAINS

Setting: , a time-homogeneous Markov chain taking values in a countable set S with transition probabilities

displaymath663

Let C be any irreducible and (for simplicity) aperiodic class.

DVJ1: For , tex2html_wrap_inline669 as . The limit does not depend on i and it satisfies . Moreover, tex2html_wrap_inline679 and indeed, for , tex2html_wrap_inline683 , where .

(If C is recurrent, tex2html_wrap_inline689 implies . When C is transient, we can have , or, , which is called geometric ergodicity.)

DVJ2: For any real r>0, the series tex2html_wrap_inline701 , , converge or diverge together; in particular, they have the same radius of convergence R, and . And, all or none of the sequences tex2html_wrap_inline709 tend to zero.

Figure 1. Simulation of a metapopulation model.

(Download the following image (as postscript)):

Figure 2. State transition diagram for
epidemic model.

(Download the following image (as postscript)):

Figure 3. Simulation of the epidemic model
[Simulation | java code]

(Download the following image (as postscript)):

Figure 4. Auto-catalytic reaction simulation
[Simulation | java code]

(Download the following image (as postscript)):

Figure 5. Quasistationarity of an explosive population process
[Simulation | java code]

TRANSIENT CHAINS

The key to unlocking this "quasi-stationarity" is to examine the behaviour of the transition probabilities at the radius of convergence R.

Suppose that C is transient class which is geometrically ergodic ( , R>1). Although tex2html_wrap_inline719 , it might be true that tex2html_wrap_inline721 , where . How does this help?

For ,

multline198

provided that we can justify taking limit under summation.

DVJ3: C is said to be R-transient or R-recurrent according as tex2html_wrap_inline733 converges or diverges. If C is R-recurrent, then it is said to be R-positive or R-null according to whether the limit of tex2html_wrap_inline743 is positive or zero.

DVJ4: If C is R-recurrent, then, for , the inequalities

displaymath751

have unique positive solutions tex2html_wrap_inline753 and tex2html_wrap_inline755 and indeed they are eigenvectors:

displaymath757

C is then R-positive recurrent if and only if , in which case

displaymath765

and, if , then

displaymath769

AND FINALLY

S-DVJ: If C is R-positive recurrent and the left-eigenvector satisfies , then the limiting conditional (or quasi-stationary) distribution exists: as ,

displaymath779

... AND MUCH MORE

Other kinds of QSD, more general and more precise statements, continuous-time chains, general state spaces, numerical methods and in particular truncation methods, MCMC, countless applications of QSDs: chemical kinetics, population biology, ecology, epidemiology, reliability, telecommunications. A full bibliography is maintained at my web site:

http://www.maths.uq.edu.au/~pkp/research.html

(Download the following image (as postscript)):

Figure 6. Surface plot of the quasistationary distribution
for the epidemic model

SIMILAR MARKOV CHAINS

New setting: tex2html_wrap_inline783 , a time-homogeneous Markov chain in continuous time taking values in a countable set S, with transition function tex2html_wrap_inline787 , where

Assuming that tex2html_wrap_inline791 (standard), the transitions rates are defined by tex2html_wrap_inline793 . Set and assume (stable).

Definition: Two such chains X and are said to be similar if their transition functions, P and , satisfy tex2html_wrap_inline807 , , t>0, for some collection of positive constants , .

Immediate consequences of the definition: Since both chains are standard, and the transition rates must satisfy , in particular, . They share the same irreducible classes and the same classification of states.

Birth-death chains: Lenin et al. proved that for birth-death chains the "similarity constants" must factorize as . (Note that tex2html_wrap_inline825 , since .)

Is this true more generally?

Definition: Let C be a subset of S. Two chains are said to be strongly similar over C if tex2html_wrap_inline835 , , t>0, for some collection of positive constants , .

Proposition: If C is recurrent, then .

(Proof: tex2html_wrap_inline849 .)

EXTENSION OF DJV THEORY

Kingman: If C is irreducible, then, for each , tex2html_wrap_inline855 ( ), tex2html_wrap_inline859 , for some , et cetera.

Definition: C is said to be -transient or -recurrent according as tex2html_wrap_inline869 converges or diverges. If C is -recurrent, then it is said to be -positive or -null according to whether the limit of tex2html_wrap_inline879 is positive or zero.

Theorem: If C is -recurrent, then, for , the inequalities

displaymath887

have unique positive solutions tex2html_wrap_inline753 and tex2html_wrap_inline755 and indeed they are eigenvectors:

displaymath893

C is then -positive recurrent if and only if , in which case

displaymath901

Suppose that P and are similar. They share the same and the same "-classification".

Theorem: If C is a -positive recurrent class, then P and are strongly similar over C. We may take tex2html_wrap_inline921 , where tex2html_wrap_inline753 and tex2html_wrap_inline925 are the essentially unique -invariant measures (left eigenvectors) on C for P and for , respectively.

Proof: Let in tex2html_wrap_inline937 . We get, in an obvious notation,

displaymath939

and, since , we have .

Again: Are similar chains always strongly similar?

In the -null recurrent case, it may still be possible to deduce the desired factorization, for, although tex2html_wrap_inline947 , it may be possible to find a such that tex2html_wrap_inline951 tends to a strictly positive limit. (Similar chains will have the same .)

Lemma: Assume that C is -null recurrent and suppose that there is a , which does not depend on i and j, such that tex2html_wrap_inline951 tends to a strictly positive limit for all . Then, there is a positive constant d such that , , where tex2html_wrap_inline753 and tex2html_wrap_inline755 are, respectively, the essentially unique -invariant measure and vector (left- and right-eigenvectors) on C for P.

Remark: Even in the -transient case it might still be possible to find a such that tex2html_wrap_inline951 tends to a positive limit, and for the conclusions to the lemma to hold good. (Note that, by the usual irreducibility arguments, will be the same for all i and j in any given class.)