Loading [MathJax]/jax/output/HTML-CSS/jax.js

Thursday, February 16, 2012

Proof that Probability Calculations in Quantum Mechanics Obey the Axioms of Probability Theory

    
Last time I talked about the axioms of probability theory. They can be used to derive all other concepts and theorems of probability theory. Or one can use them to verify whether a given theory is a correct probabilistic theory.

Now, let me show that the probability calculations in QM obey axioms of probability. Again, I'm repeating what is written in chapters 1.5 (Probability Theory), 2.4 (Probability Distributions - Verification of Probability Axioms) and 9.6 (Joint and Conditional Probabilities) of Ballentine's book.

Dynamical variables are represented by hermitian operators. They have real eigenvalues, and that values are possible outcomes of a measurement of that dynamical variable (or the "observable". This fancy word is used to distinguish the classical quantities from its corresponding quantum versions).

Say {rn} is the set of all eigenvalues (they may be degenerate). Δ is a range on the eigenvalue space, it is a subset of {rn}. In quantum mechanics we talk about the probabilities of that the outcome of a measurement of an observable will be in the range Δ. Ballentine's notation is this: p(RΔ|ρ). It means, "the probability that the outcome of R measurement is in the range Δ, if the system is in the state ρ".

The calculation of this probability is done using projection operators. First one has to find the subspace of all Hilbert space H, where all the eigenstates corresponding to the eigenvalues in the range Δ lives. Ballentine's notation of this projection operator is MR(Δ).

For the simplest case, pick a single non-degenerate eigenvalue, rn, as the range, and a pure state as the system's state, |ψ. Corresponding eigenvector is |rn and the projection operator onto it is |rnrn|=MR(rn).

Quantum mechanics' postulates on probability (for pure states) is that, the probability is equal to the absolute square of the component of the state vector that belong to the eigensubspace related to Δ, which is the square of the norm of the projection of the state to the subspace. For the most general case: p(RΔ|ρ)=Tr{ρMR(Δ)}
In our simplest case, which is shown in most introductory texts, p(R=rn|ψ) =|rn|ψ|2 =ψ|rnrn|ψ =ψ|MR(rn)|ψ =ψ|MR(rn)MR(rn)|ψ (from the definition of projection operators M2=M). =(ψ|MR(rn))(MR(rn)|ψ) =ψrn|ψrn =|ψrn2. Where |ψrn is the projection onto the eigensubspace which is cn|rn. (Longest way of showing the relation.) Hence the norm is |cn|2 where {cn} are the coefficients when |ψ is expanded on the eigenstates of R.

Axiom 1
0p(A|B)1
Here our p(A|B) is p(RΔ|ρ). A is the event that the outcome is in the range Δ. B is the event that our system is prepared in the state ρ.

The operator that represents a quantum state must satisfy these conditions: Tr{ρ}=1 and u|ρ|u0
If we expect that the outcome will be anyone of the eigenvalues, then the related projection operator covers the whole Hilbert space, hence it is the identity operator. MR(Δall)=rn{rn}|rnrn|=I. Hence the probability that we will get an eigenvalue of R is p(RΔall|ρ) =Tr{ρMR(Δall)} =Tr{ρI}=1 (according to (1)) which is the maximum probability.

For any other range, the projection does not cover the whole subspace. MR(Δ)=rnΔ|rnrn| I. Therefore Tr{ρMR(Δ)} =Tr{ρrnΔ|rnrn|} =rnΔTr{ρ|rnrn|} =rnΔrn|ρ|rn. According to (2) all terms in this summation is non-negative, the sum either increases or remains the same as we add more terms (as we enlarge the subspace). Covering the whole Hilbert space gives 1, hence covering any subspaces will give a number between 0 and 1, hence axiom (1) holds.

Axiom 2
p(A|A)=1
Here, A is both the state of the system, and the state after the measurement. We are measuring an observable of which eigenvalue 1 corresponds to the state. In pure state notation, the state is |ψ, and MR(Δ)=|ψψ|. And ψ|MR(rn)|ψ =ψ|ψψ|ψ=1. This is not surprising because, it is a bit tautological. What we are saying is this: "If the system is in the state |ψ then the probability of finding it in the state |ψ is 1.

In mixed state notation, the projection onto the state has this relation: ρ=MR(Δ)ρMR(Δ). The probability becomes Tr{ρMR(Δ)} =Tr{ρMR(Δ)MR(Δ)} =Tr{MR(Δ)ρMR(Δ)} =Tr{ρ}=1. Hence axiom (2) holds.

Axiom 3
For the axiom 3 Ballentine follows a different way. Instead of using our axiom 3, which was: p(A|B)=1p(A|B), he uses an alternative axiom "addition of probabilities for exclusive events" which is the definition of mutually exclusive events: p(AB|C)=p(A|C)+p(B|C) One can either start with (3) and derive (3b), or vice versa. Hence proving (3b) is equivalent to proving (3).

Pick two ranges, Δ1 and Δ2 with no intersection. Therefore RΔ1 and RΔ2 are mutually exclusive events, they can't happen at the same time. Because the ranges are disjoint, the vectors belonging to one range are perpendicular to the vectors belonging to the other range. Hence successive application of their projection operators will give zero: MR(Δ1)MR(Δ2)=0. And the projection on the union of the ranges is the sum of the projections: MR(Δ1Δ2) =MR(Δ1)+MR(Δ2)p(RΔ1RΔ2|ρ)=Tr{ρMR(Δ1Δ2)}=Tr{ρMR(Δ1)}+Tr{ρMR(Δ2)}=p(RΔ1|ρ)+p(RΔ2|ρ) Hence axiom (3) holds.

Axiom 4
After this point, I think, we have no doubt that axiom (4) will also hold for QM. But, anyways, let us do this. p(A&B|C)=p(A|C)p(B|A&C)

Here C is the quantum state, ρ or |ψ. A is the event of getting an eigenvalue in the range Δa when R is measured, and similarly B=SΔb of the S measurement. A&B means that simultaneously R has a value in Δa and S has a value in Δb.

First let me show this for the pure state case with the Kolmogorov's version of the axiom, which was: p(A&B)=p(B|A)p(A).

p(A)=p(R=r)=ψ|MR(r)|ψ =ψ|(|rr|)|ψ =|r|ψ|2.

p(B|A) can be thought of the result of two successive measurements. First the event A is happened. Then what is the probability of B to happen. QM tells us that the if the eigenvalue r is measured, than the state becomes (the wavefunction collapses) to |ψ|r. Or a more general expression is |ψMR(r)|ψψ|MR(r)|ψ=MR(r)|ψp(R=r)|ψ Hence p(B|A) now A is the new state |ψ and to do the probability calculation one has to sandwich MS with this new state. p(B|A)=p(S=s||ψ)=ψ|MS|ψ=ψ|MR(r)p(R=r)MS(s)MR(r)|ψp(R=r)=ψ|MR(r)MS(s)MR(r)|ψp(R=r)=ψ|(MSMR+[MR,MS])MR|ψp(R=r)=ψ|MSMR|ψ+ψ|[MR,MS]|ψp(R=r)=ψ|MSMR|ψp(R=r),if[MR,MS]=0

When the operators R and S commute, they have simultaneous eigenvectors, hence MR and MS also commute. p(R=r) was p(A). Hence p(B|A)p(A) =ψ|MSMR|ψ which has to be p(A&B), and fortunately it is.

To find p(A&B)=p(R=r&S=s) one need the projection operator on the eigensubspace, ϵrs where R has the eigenvalue r, and S has the eigenvalue sϵrs is the intersection of ϵr and ϵs. One can project a vector on ϵrs by successively applying the MR and MS projection operators. The order is not important when [R,S]=0. Because then, they have simultaneous eigenvectors. That eigenvectors can be used to create an orthonormal basis. Projection operators constructed from that basis will commute too. And joint probability is defined only the operators commute.

Therefore, the component of |ψ on the ϵrs is |ψrs=MRMS|ψ. Its norm square is the probability. |ψrs2 =ψrs|ψrs =ψ|MSMRMRMS|ψ. Using their commutation it becomes p(R=r&S=s) =ψ|MSMR|ψ. And this quantitiy is the numerator of the conditional probability result. Hence axiom (4) is also satisfied by quantum mechanics' postulates of probability calculations.

Using a mixture state and Cox' version, the calculations becomes:

p(A|C)=p(RΔa|ρ) =Tr{ρMR(Δa)}.

p(A&B|C) =p(RΔa&SΔb|ρ) =Tr{ρMR(Δa)MS(Δb)}.

 p(B|A&C) =p(SΔb|RΔa&ρ). Here, RΔa&ρ can be thought as a new state ρρ which we got after the R measurement. p=p(SΔb|ρ). QM postulate that ρ =MR(Δa)ρMR(Δa)Tr{MR(Δa)ρMR(Δa)}. Therefore, p=Tr{ρMs(Δb)}. p(A|C)p(B|A&C)=p(RΔa|ρ)p(SΔb|RΔa&ρ)=Tr{ρMR(Δa)}Tr{ρMs(Δb)}=Tr{ρMR(Δa)}Tr{MR(Δa)ρMR(Δa)Tr{MR(Δa)ρMR(Δa)}Ms(Δb)}=Tr{ρMR(Δa)}Tr{MR(Δa)ρMR(Δa)Ms(Δb)}Tr{MR(Δa)ρMR(Δa)}=Tr{MR(Δa)ρMR(Δa)Ms(Δb)}=Tr{ρMR(Δa)Ms(Δb)}=p(RΔa&SΔb|ρ)=p(A&B|C)

for which axiom (4) holds too. So, as long as we are dealing with joint probability distributions of commuting observables, QM is a correct probability theory. Just in case you may had any doubts about it...

No comments:

Post a Comment