Complex Probability and Markov Stochastic Process

 

 

Bijan Bidabad[1], Behrouz Bidabad

 

 

 

Abstract:

This note discusses the existence of "complex probability" in the real world sensible problems. By defining a measure more general than conventional definition of probability, the transition probability matrix of discrete Markov chain is broken to the periods shorter than a complete step of transition. In this regard, the complex probability is implied.

 

 

1- Introduction

      

Somtimes analytic numbers coincide with the mathematical modeling of real world and make the real analysis of problems complex. All the measures in our everyday problems belong to R, and mostly to R+. Probability of occurrence of an event always belongs to the range [0,1]. In this paper, it is discussed that to solve a special class of Markov chain which should have solution in real world, we are confronted with "analytic probabilities"!. Though the name probability applies to the values between zero and one, we define a special analogue measure of probability as complex probability where the conventional probability is a subclass of this newly defined measure.

     Now define the well known discrete time Markov chain  a Markov stochastic process whose state space is  for which . Refer to the value of Yn as the outcome of the nth trial. We say Yn being in state i if Yn = i. The probability of Yn+1 being in state j, given that Yn is in state i (called a one–step transition probability) is denoted by  , i.e.,

                                                             (1)

Therefore, the Markov or transition probability matrix of the process is defined by

 

                                

The n-step transition probability matrix whichdenotes the probability that the process goes from state i to state j in n transitions. Formally,

                                               (3)

 

According to Chapman – Kolmogorov relation for discrete Markov matrices (Karlin and Taylor (1975)), it can be proved that

 

Pn which is P to the power n is a Markov matrix if P is Markov.

        Now, suppose that we intend to derive the t-step transition probability matrix P(t) where t≥0 from the above (3) and (4) definition of n-step transition probability matrix P. That is, to find the transition probability matrix for incomplete steps. On the other hand, we are interested to find the transition matrix P(t) when t is between two sequential integers. This case is not just a tatonnement example. To clarify the application of this phenomenon, consider the following example.

Example 1. Usually in population census of societies with N distinct regions, migration information is collected in an NxN migration matrix for a period of ten years. Donote this matrix by M. Any element of M, m­ij is the population who leaved region i and went to region j through the last ten years. By deviding each mij to sum of the ith row of M, a value of Pij is computed as an estimate of probability of transition from ith to jth regions. Thus the stochastic matrix P gives the probabilities of going from region i to region j in ten years (which is one–step transition probability matrix). The question is: how we can compute the transition probability matrix for one year or one tenth step and so on.

        If we knew the generic function of probabilities in very small period of time we would be able to solve problems similar to example 1. But the generic function (5) is not obtainable. If it were, we would apply the continous time Markov procedure using the generic NxN matrix A as

Where P(h) denotes transition probability matrix at time h. Then the transition probability matrix at any time  might be computed as follows. (Karlin and Taylor (1975)).

 

P(t) = e At                                                                                                            (6)

 

        Therefore a special procedure should be adopted to find the transition probability matrix P(t) at any time t from discrete Markov chain information. As it will be show later the adopted procedure coincides with transition probability matrix with complex elements.

 

2- Breaking the time in discrete Markov chain

 

        Consider again matrix P defined in (2). Also, assume P is of full rank.

 

Assumption 1: P is of full rank.

This assumption assures that all eigenvalues of P are nonzero and P is diagnalizable, Searle (1982), Dhrymes (1978). This assumption is not very restrictive, since, actually, most of Markov matrices have dominant diagonals. That is probability of transition from state i to itself is more than the sum of probabilities from state i to all other states. The matrices having dominant diagonals are non-singular, Takayama (1974). Threfore, P can be decomposed as follows (Searle (1982), Klein (1973)).

 

                                                                                                   (7)

Where X is an NxN matrix of eigenvectors 

 

                                                                                                    (8)

 

andthe NxN diagonal matrix of corresponding eigenvalues,

 

                                                                                                      (9)

 

Using (7), (8) and (9) to break n-step transition probability matrix P to any smaller period of time t0, we do as follows. If  for all iЄ{1,…,K}are fractions of n–step period and  for any n belonging to natural numbers then,

                                                                                   (10)

        On the other hand, transition probability matrix of n-step can be broken to fractions of n, if sum of them is equal to n. Therefore, any  fraction of one-step transition probability matrix can be written as,

 

                                                                                                  (11)

where,

                                                                                  (12)

Before discussing on the nature of eigenvalues of P let us define the generalized Markov matrix.

 

Definition 1. Matrix Q is a generalized Markov matrix if the following conditions are fulfilled:

 

Remark 1. According to definition 1, matrix Q can be written as:

 

                                                                                                               (13)

Where U and V are NxN matrices of real and imaginary parts of Q with  

 

Remark 2. Matrix U has all Properties of P defined by (2), thus, PÌ Q.

 

Treorem 1. If P is a Markov matrix then Pt also satisfies Markovian properties.

Proof: According to Chapman–Kolmogorov relation for continous Markov chain (Karlin and Taylor (1975)), we have

 

                                                               (14)

 

That is, if P(t) and P(s), transition probability matrices at times t and s are Markovs, then the product of them P(t+s) is also Markov. Let t=1, then P(1) is a one-step transition probability matrix which is equivanlent to (2). Hence, our discrete Markov matrix P is equivalent to its contionous analogue P(1). So

 

                                                                                                           (15)

 

If we show that

 

                                                                                                             (16)

 

Then according to (14)

 

                                                                                                            (17)

We can conclude that if P is Markov then Pt, Ps and Pt+s are also Markovs for  and the theorem is proved.

Rewrite P(t) in (6) as (18).

 

                                                                                             (18)

Where are the eigenvalues of A defined by (5), and

 

                                                               (19)

 

And X  is the corresponding eigenmatrix of A. Take the natural logarithm of (18),

 

                                                                                                 (20)

Where,

                                                                                (21)

So,

                                                                                            (22)

where

                                                                                            (23)

 

Write (22) for t=1 and multiply both side by t,

 

                                                                                                  (24)

 

By comparison of (22) and (24) conclude that

 

                                                                                        (25)

or,

                                                                                                     (26)

 

Given (15), equation (26) is the same as (16) Q.E.D.

 

Result 1. Matrix Pt fulfils definition 1. Thus, . This comes from the following remarks.

 

Remark 3. Sum of each row of Pt is equal to one. Since Pt satisfies Markovian properties (theorem 1).

 

Remark 4. Sum of imaginary parts of each row is equal to zero. This immediately comes from remark 3.

 

Remark 5. If qij denotes the ijth element of Pt for  thenfor all i and j belonging to S. This remark can be concluded form theorem 1.

 

Remark 6. If  equals to the complex matrix defined by (13), then Since,

 

 

Remark 7. Given Q as in remark 6, then ujkÎ[0,1]. This also comes immediately from theorem 1.

 

3. Discussion on broken times

        The broken time discrete Markov chain is not always a complex probability matrix defined by definition 1. Matrix Pt has different properties with respect to t and eigenvalues.  may be real (positive or negative) or complex depending on the characteristic polynomial of P.

        Since P is a non–negative matrix, Forbenius theorem (Takayama (1974), Nikaido (1970)) assures that P has a positive dominant eigenvalue

 

    (Frobenius root)                                                                                                                (27)

and

                                                                               (28)

 

Furthermore, if P is also a Markov matrix then its Frobenius root is equal to one, (Bellman (1970), Takayama (1974)). Therefore,

 

                                                                                                               (29)

                                                                                               (30)

 

With the above information, consider the following discussions.

 

                                                                                  

 

        In this case all  for  and no imaginary part occures in matrix Pt.  are all positive for i belonging to S if we can decompose the matrix P to two positive semi-definite and positive definite matrices B and C of the same size (Mardia, Kent, Bibby (1982)) as

 

 

 belongs to sets of real and imaginary numbers based on the value of t. In this case Pt belongs to the class of generalized stochastic matrix Q of definition 1. For , it is sufficient that P be positive definite.

 

 

Pt in this case for  and  belongs to the class of generalized Morkov matrices of definition 1.

 

(Natural numbers)

 

In all cases of a, b, and c we never coincide with complex probabilities. Since Pt can be drived by simply multiplying P, t times.

 

(integer numbers)

 

In this case, Pt is a real matrix but does not always satisfy condition 2 of definition 1.

 

Pt is a complex matrix but does always satisfy conditions 2 and 3 of definition 1.

 

4. Complex probability justification

        Interpretation of the "Complex probability" as defined by definition 1 is not very simple and needs more elaborations. The interesting problem is that, it exists in operational works of statistics as the example 1 discussed. Many similar examples like the cited may be gathered.

        With this definition of probability, the moments of a real random variable are complex. Although the t–step distribution  of initial distribution  with respect to Pt may be complex, they have the same total as  That is, if

                                                                                              (32)

Then,

                                                                       (33)

 

And we have the following remark accordingly,

 

Remark 8. Sum of t-step distribution is equal to sum of initial distribution. That is,

 

                                                                                                  (34)

 

This can be derived based on (32) and (33) as

 

              (35)

 

And, sum of t–step distribution is

 

                                          (36)

 

The two parentheses in (36) are one and zero respectively based on conditions 4 and 5 of definition 1. Thus, (36) and (34) are the same.

        The above remark 8 states that though there exist imaginary transition probabilities to move from state j to k, the total sum of “imaginary transitions” is equal to zero. On the other hand, after tth step transition, the total distribution has no imaginary part.

 

5. Summary

 

        By summarizing the discrete and continous times Markov stochastic processes a class of real world problems was introduced which can not be solved by each of the procedures. The solutions of these problems coincide with “Complex probabilities” of transitions which are inherent in mathematical formulation of the model. Complex probability is defined and some of its properties with respect to the cited class are examined. Justification of the idea of complex probability needs more work and is left for further research.

 

6. Acknowledgements

 

        The authors are indebted to Dr. A. Monajemi who read the manuscript and gave valuable remarks and Miss. S. Pourasghari for typing the manuscript.

 

7. Refrences

 

v      R. Bellman (1970), Introduction to matrix analysis, McGraw–Hill.

v      P.J. Dhrymes (1978), Mathematics for econometrics. Springer-Verlag.

v      W. Feller (1970, 1971), An Introduction to probability theory and its applications, vols. 1,2, Wiley, New York.

v      P.G. Hoel, S.C. Port, C. J. Stone (1972), Introduction to stochastic processes. Houghton Mifflin, New York.

v      S. Karlin, H.M.Taylor (1975), A first course in stochastic processes. Academic Press.

v      E. Klein (1973), Mathematical methods in theoretical economics, topological and vector space foundations of equilibrium analysis, Academic Press.

v      K.V. Mardia, J.T. Kent, J.M. Bibby (1982), Multivariate analysis, Academic Press.

v      H. Nikaido (1970), Introduction to sets and mapping in modern economics. North Holland Publishing Co.

v      S.S. Searle (1982), Matrix algebra useful for statistics. Wiley.

v      A.Takayama (1974) Mathematical economics. Dyrden Press , Illinois.

v      E. Wentzel, L. Ovcharov (1986) Applied problems in probability theory. Mir, Moscow.

 



[1]No.18, 12th St., Mahestan Ave., Shahrak Gharb, Tehran 14658-33659, Iran  bijan_bidabad@msn.com