Topics on Operator Inequalities T. Ando Division of Applied Mathematics Research Institute of Applied Electricity Hokkaido University, Sapporo, Japan Research supported by Kakenhi 234004 CHAPTER I Geometric and Harmonic Means Throughout the lecture G,H,K denote Hilbert spaces. L(H) is the space of (bounded) linear operators on H, while L+ (H) is the cone of positive (i.e. non-negative semi-definite) operators. In this chapter we shall be concerned with simple binary operations in L+ (H), called geometric and harmonic means. Theorem!I.1. Suppose that H = G ⊕ K and a self-adjoint operator T on H is written in the form A C∗ T = where A and B act on G and K respectively and C acts from G to K. Then in order C B that T is positive it is necessary and sufficient that A and B are positive and there is a contraction W (i.e. kW k ≤ 1) from G to K such that C = B 1/2 W A1/2 . Proof. Suppose that A ≥ 0, B ≥ 0 and C = B 1/2 W A1/2 with a contraction W . The condition kW k ≤ 1 implies W W ∗ ≤ 1, hence B 1/2 W W ∗ B 1/2 ≤ B. Then T admits a factorization T = S ∗ S where ! A1/2 W ∗ B 1/2 S= 1/2 , 0 B − B 1/2 W W ∗ B 1/2 which implies that T is positive. Suppose conversely that T is positive. This means that (Ax, x) + 2Re (Cx, y) + (By, y) ≥ 0 for x ∈ G, y ∈ K. A and B are obviously positive and the above inequality is easily seen to be equivalent to the following (Ax, x)(By, y) ≥ |(Cx, y)| 2 for x ∈ G, y ∈ K. Then for each y ∈ K the vector C ∗ y belongs to the range of A1/2 and 2 2 |(Cx, y)| −1/2 ∗ 2 C y = sup ≤ B 1/2 y , A (Ax, x) x where A−1/2 is defined to be the (unbounded) inverse of A1/2 restricted to the orthocomplement of the kernel of A1/2 . Now there is a contraction U from K to G such that A−1/2 C ∗ = U B 1/2 . Finally W = U ∗ meets the requirement. Corollary I.1.1. If T is a positive operator on H and G is a closed subspace of H, then there is a linear operator S such that T = S ∗ S and S(G) ⊆ G. In fact, the operator S in the proof of Theorem I.1 meets the requirement. ! ! A C∗ A C∗ Corollary I.1.2. If is positive, then there is the minimum of all X for which C B C X are positive. ! ∗ −1/2 ∗ A C∗ Proof. As in the proof of Theorem I.1 the positivity of implies A−1/2 C ∗ A C ≤ X. C X ∗ −1/2 ∗ This means that A−1/2 C ∗ A C is the minimum in the assertion. 3 4 I. GEOMETRIC AND HARMONIC MEANS Remark. If A has bounded inverse, the minimum in Corollary I.1.1 is just CA−1 C ∗ . Corollary I.1.3. If H ⊇ G and if S is a linear operator from G to H such that (Sx, y) = (x, Sy) for x, y ∈ G then there is a self-adjoint operator T on H such that T |G = S and kT k = kSk. Further, among all T satisfying these conditions there are the minimum Tµ and the maximum TM . ! A Proof. It can be assumed that kSk = 1. Let S = where A acts on G and C does from G to C K = H G. By assumption A is self-adjoint and 2 2 2 kAxk + kCxk ≤ kxk for x ∈ G. ! ! A C∗ 1 + A C∗ ∗ 2 Therefore C C ≤ 1 − A . T must be of the form T = for which and C X C 1+X ! 1 − A −C ∗ are positive. The inequalities C ∗ C ≤ 1 − A2 and 2−1 (1 − A) ≤ 1 imply that −C 1−X C ∗ 2−1 C ≤ 2−1 1 − A2 = 2−1!(1 − A)(1 + A) ≤ 1 + A. By Remark after Corollary I.1.2 this implies ! C∗ 1 + A C∗ . Therefore there is the minimum Bµ of all X for which C 1+1 C 1+X −1/2 ∗ ∗ −1/2 ∗ are positive. In fact, B is defined by B = ( 1 + A) C ( 1 + A) C − 1 . In order to conclude µ ! µ ! A C∗ 1 − A −C ∗ that Tµ = is the minimum in the assertion, it remains to show that is positive. C Bµ −C 1 − Bµ ! ! 1 − A −C ∗ 1 + A C∗ Since is positive just as , (1 − A)−1/2 C ∗ is a well defined linear operator. By −C 21 C 21 Corollary I.1.2 it reduces to prove the inequality 2 2 2 (1 − A)−1/2 C ∗ y ≤ 2 kyk − (1 + A)−1/2 C ∗ y . the positivity of 1+A But 2 2 (1 + A)−1/2 C ∗ y + (1 − A)−1/2 C ∗ y = =2 1 − A2 −1 C ∗ y, C ∗ y = 2 1 − A2 −1/2 2 C ∗ y . 1 − A2 C ∗ Since C ∗ C ≤ 1 − A2 implies ≥ 0, C 1 ! 2 1 − A2 −1/2 2 2 C ∗ y ≤ 2 kyk , ! ∗ A C∗ as expected. Analogously TM = with BM = 1 − (1 − A)−1/2 C ∗ (1 − A)−1/2 C ∗ is shown to C BM be the maximum. This completes the proof. Theorem I.2. Let A and B be positive operators on H. Then there is the maximum of all self-adjoint ! A X operators X on H for which are positive. X B I. GEOMETRIC AND HARMONIC MEANS 5 −1/2 −1/2 A X X B ! Proof. Consider first the case that A has bounded inverse. Let D = A BA . Then ! 1 A−1/2 XA−1/2 is positive if and only if is positive. We claim that D1/2 is the maximum A−1/2 XA−1/2 D ! ! 1 Y 1 D1/2 of all Y for which are positive. The positivity of is obvious by Corollary I.1.2. Y D D1/2 D ! 1 Y Suppose that is positive. By Theorem I.1 there is a contraction W on H such that Y = D1/2 W . Y D Introduce a new scalar product on H by hx, yi := (D1/2 x, y). Since Y is self-adjoint hW x, yi = hx, W yi, for every x, y ∈ H which means that W is self-adjoint with respect to this new scalar product. Since kW k ≤ 1, for any real number λ with |λ| > 1 the operator λ − W has bounded inverse. The operator (λ − W )−1 is bounded with respect to the new scalar product, hence |hW x, xi| ≤ hx, xi for all x ∈ H, which implies −D1/2 ≤ Y ≤ D1/2 . Thus D1/2 is the maximum as claimed. As!a consequence, 1/2 1/2 A X A1/2 D1/2 A1/2 = A1/2 A−1/2 BA−1/2 A is the maximum of all X for which are positive. X B If A does not admit bounded inverse, consider A = A + with > 0. Let C be the maximum of all X ! A X for which are positive. Since obviously C ≥ C0 ≥ 0 whenever > 0 > 0, the operator lim+ C →0 X B ! A X is the maximum of all X for which are positive. This completes the proof. X B We shall call the maximum in Theorem I.2 the geometric mean of two positive operators A and B, and denote it by A#B. Corollary I.2.1. Geometric means have the following properties. (i) (ii) (iii) (iv) (v) A#B = B#A. (αA)#(αB) = α(A#B) for α ≥ 0. (A1 + A2 )#(B1 + B2 ) ≥ (A1 #B1 ) + (A2 #B2 ). C(A#B)C ∗ ≤ (CAC ∗ )#(CBC ∗ ) for any linear operator C. A#A = A, 1#A = A1/2 and 0#A = 0. 1/2 1/2 (vi) A#B = A1/2 A−1/2 BA−1/2 A if A has bounded inverse. (vii) A−1 #B −1 = (A#B)−1 if both A and B have bounded inverses. Proof. (i) to (iv) are immediate from definition. (v) and (vi) are proved as in the proof of Theorem I.2. (vii) follows from (vi). As a consequence if A commutes with B, then A#B = (AB)1/2 . This justify the terminology “geometric mean”. Corollary I.2.2. If 0 ≤ p ≤ 1, then A ≥ B ≥ 0 implies Ap ≥ B p . Proof. The assertion is true if p = 1 or 0. Therefore it remains to show that the set ∆ of p for which the assertion is true is convex. Take p1 , p2 ∈ ∆, and let p = 12 (p1 + p2 ). Then since Ap = Ap1 #Ap2 and B p = B p1 #B p2 by Corollary I.2.1, Ap ≥ B p follows also from the same Corollary. 6 I. GEOMETRIC AND HARMONIC MEANS Corollary I.2.3. Let A1 and A2 (resp. ! B1 and B2 ) be positive operators on G (reps. K) and ! let C ∗ ∗ Ai C A1 #A2 C be a linear operator from G to K. If are positive i = 1, 2, then so is . C Bi C B1 #B2 Proof. We may assume the bounded invertibility of A1 and A2 . Then by assumption and Corollary ∗ I.1.2 CA−1 i = 1, 2. Therefore by Corollary I.2.1 i C ≤ Bi −1 ∗ ∗ C(A1 #A2 )−1 C ∗ = C A−1 C∗ < CA−1 # CA−1 ≤ B1 #B2 , 1 #A2 1 C 2 C ! A1 #A2 C∗ which implies the positivity of by Corollary I.1.2. C B1 #B2 Corollary I.2.4. Arithmetic mean is greater than geometric mean, that is positive A and B. 1 2 (A + B) ≥ A#B for Proof. We may assume the bounded invertibility of A. Then by Corollary I.2.1 1/2 1 1 −1/2 1 1/2 −1/2 −1/2 1/2 1/2 −1/2 A#B = A A BA A ≤A 1+ A BA A1/2 = (A + B), 2 2 2 1 because X ≤ 1 + X 2 for any positive X. 2 −1 1 −1 −1 The harmonic mean of two positive operators A and B must be defined by A +B when 2 both A and B have bounded inverses. We shall denote harmonic mean by A : B. If A and B do not have bounded inverses, the harmonic mean A : B is defined as the limit of (A + ) : (B + ) as → 0+ . Theorem I.3. Let A and B be positive operators on a Hilbert space H. Then harmonic mean A : B is the maximum of all X for which ! ! 2A 0 X X ≥ . 0 2B X X Proof. We may assume that both A and B have bounded inverse. Then the above inequality is equivalent to the condition that for every x ∈ H (Xx, x) ≤ 2 inf {(Ay, y) + (B(x − y), x − y)} y hence to the condition ( 2 |(Bx, y)| (Xx, x) ≤ 2 (Bx, x) − sup ((A + B)y, y) y ) . Since by definition A : B = 2A(A + B)−1 B = 2 B − B(A + B)−1 B , we have ( ) 2 |(Bx, y)| ((A : B)x, x) = 2 (Bx, x) − sup . y ((A + B)y, y) ! ! 2A 0 X X This shows that A : B is the maximum of all X for which ≥ . 0 2B X X Corollary I.3.1. Harmonic means have the following properties. (i) (ii) (iii) (iv) A : B = B : A. (αA) : (αB) = α(A : B) for α ≥ 0. (A1 + A2 ) : (B1 + B2 ) ≥ (A1 : B1 ) + (A2 : B2 ). C(A : B)C ∗ ≤ (CAC ∗ ) : (CBC ∗ ) for any linear operator C. I. GEOMETRIC AND HARMONIC MEANS 7 (v) A : A = A, 1 : A = 2A(1 + A)−1 and 0 : A = 0. (vi) A : B = 2A(A + B)−1 B if A has bounded inverse. −1 (vii) A−1 : B −1 = 12 (A + B) if both A and B have bounded inverses. Corollary I.3.2. Geometric mean is greater than harmonic mean, that is A#B ≥ A : B for positive A and B. Proof. We may assume the bounded invertibility of A and B. Then by Corollary I.2.3 A−1 #B −1 ≤ 1 −1 A + B −1 . By taking inverse, this yields, by Corollaries I.2.1 and I.3.1, A#B ≥ A : B. 2 Theorem I.4. Let A1 and A2 (resp. ! B1 and B2 ) be positive operators of G (resp. K) and let C ! be a 1 ∗ ∗ Ai C (A1 + A2 ) C linear operator from G to K. If are positive i = 1, 2, then so are 2 and C Bi C B 1 : B2 ! A1 : A2 C∗ . 1 C 2 (B1 + B2 ) Proof. We may assume the bound invertibility of A1 and A2 . Then by assumption and Corollary I.1.2 ≤ Bi i = 1, 2. Therefore by Corollary I.3.1 −1 ∗ 1 −1 ∗ ∗ C (A1 + A2 ) C ∗ = C A−1 C ≤ CA−1 : CA−1 ≤ B1 : B2 , 1 : A2 1 C 2 C 2 ! 1 ∗ (A + A ) C 1 2 2 which implies the positivity of by Corollary I.1.2. The positivity of C B 1 : B2 ! A1 : A2 C∗ is proved analogously. 1 C 2 (B1 + B2 ) ∗ CA−1 i C Corollary I.4.1. The following identity holds for positive A and B; 1 (A + B) #(A : B) = A#B. 2 ! ! A A#B B A#B Proof. By definition and are positive, so that A#B B A#B A is positive by Theorem I.4, which implies 1 (A + B) #(A : B) ≥ A#B. 2 1 2 (A + B) A#B A#B A:B ! When A and B have bounded inverse, the above inequality, with A−1 and B −1 instead of A and B respectively, leads, by taking inverse, to the inequality 1 (A : B)# (A + B) ≤ A#B. 2 These two inequalities yield the assertion. CHAPTER II Operator-Monotone Functions Given a finite or infinite open interval (α, β) on the real line, let us denote by S(α, β; H) the totality of all self-adjoint operators on H whose spectrum are included in the interval (α, β). If there is no confusion, we shall write simply S(α, β). A real-valued continuous function f on (α, β) is said to be operator-monotone on (α, β) if A, B ∈ S(α, β; H) and A ≤ B implies f (A) ≤ f (B), where f (A) and f (B) are defined by familiar functional calculi for self-adjoint operators. An operator-monotone function is non-decreasing in the usual sense, but the converse is not true. This chapter is devoted to intrinsic characterization of operator-monotonousness. Examples II.1. 1. f (λ) := a + bλ with b ≥ 0 is operator-monotone on (−∞, ∞). 2. f (λ) := λp with 0 ≤ p ≤ 1 is operator-monotone on (0, ∞) by Corollary I.2.2. 1 3. For µ ∈ / (α, β) the function f (λ) := µ−λ is operator-monotone on (α, β). In particular f (λ) = − λ1 is operator-monotone on (0, ∞). In fact, if µ < α, then A, B ∈ S(α, β) A ≤ B implies 0 < −(µ − A) ≤ −(µ − B) so that −(µ − A)−1 ≥ −(µ − B)−1 hence f (A) ≤ f (B). If µ > β, then 0 < µ − B < µ − A hence (µ − B)−1 ≥ (µ − A)−1 . It is easy to see that a function f is operator-monotone on a finite interval (α, β) if and only if g(λ) := f − α)λ + α + β) is operator monotone on (−1, 1). On this basis, we shall treat only the interval (−1, 1). 1 2 ((β Lemma II.1. If a continuously differentiable function f on (−1, 1) is operator-monotone, then for any N choice λi ∈ (−1, 1) i = 1, . . . , N the matrix f [1] (λi , λj ) i,j=1 is positive, where f [1] (λ, µ) is defined by f [1] (λ, µ) = f (λ)−f (µ) λ−µ for λ 6= µ and f [1] (λ, λ) = f 0 (λ). λ1 0 .. , considered as a self-adjoint operator on the N Proof. Observe the matrix A := . 0 λN dimensional Hilbert space CN . Take any complex numbers ξ1 , . . . , ξN and consider the positive matrix N B := ξi ξ¯j i,j=1 . Since A ∈ S −1, 1; CN , for sufficiently small > 0, A + B belongs to S −1, 1; CN . We claim that for any continuously differentiable function g on (−1, 1) lim+ −1 {g(A + B) − g(A)} = →0 N X i,j=1 8 g [1] (λi , λj )Pi BPj II. OPERATOR-MONOTONE FUNCTIONS 9 where Pi is the orthoprojection to the subspace spanned by the vector ei := (δij )N j=1 . In fact, this is true if n g is polynomial, because for g(λ) = λ −1 {g(A + B) − g(A)} = n X Ak−1 BAn−k + O() k=1 = N X n X λk−1 λn−k Pi BPj + O() i j i,j=1 k=1 = N X g [1] (λi , λj )Pi BPj + O(). i,j=1 0 Use approximation of g by polynomials on a suitable subinterval when g is merely continuously differentiable. Now since f is operator-monotone and A + B ≥ A for all > 0, we have N X f [1] (λi , λj )Pi BPj = lim+ −1 {f (A + B) − f (A)} ≥ 0. →0 i,j=1 Let x = PN i,j=1 ei , then the positivity of 0≤ N X PN i,j=1 f [1] (λi , λj )Pi BPj implies f [1] (λi , λj )(Pi BPj x, x) = i,j=1 N X f [1] (λi , λj )ξi ξ¯j . i,j=1 N Since (ξi ) are arbitrary, this means that the matrix f [1] (λi , λj ) i,j=1 is positive. Lemma II.2. If a continuously differentiable function f on (−1, 1) is operator-monotone, then there exists a finite positive measure m on the closed interval [−1, 1] such that Z 1 λ f (λ) = f (0) + dm(t) for λ ∈ (−1, 1). −1 1 − λt Proof. Consider the linear subspace H of C(−1, 1), spanned by the functions fλ (t) := f [1] (λ, t) where λ runs over (−1, 1). Introduce a scalar product in H by * + X X X αi fλi , βj fµj = f [1] (λi , µj )αi βj . i j i,j ˆ be the associated Hilbert space. Let us show that This is positive semi-definite by Lemma II.1. Let H λn → λ implies kfλn − fλ k → 0. In fact, 2 kfλn − fλ k = f [1] (λn , λn ) − 2f [1] (λn , λ) + f [1] (λ, λ) = f 0 (λn ) − 2 f (λn ) − f (λ) + f 0 (λ). λn − λ Now by continuous differentiability of f f 0 (λn ) − 2 f (λn ) − f (λ) + f 0 (λ) → f 0 (λ) − 2f 0 (λ) + f 0 (λ) = 0. λn − λ ˆ spanned by all fλ (λ 6= 0). Then D is dense in H, ˆ as shown above. Define Let D be the linear subspace of H, ˆ by a linear map T from D to H T fλ := λ−1 {fλ − f0 } (λ 6= 0). T is symmetric hence well-defined, because hT fλ , fµ i = µf (λ) − λf (λ) f (0) − = hfλ , T fµ i . λµ(λ − µ) λµ 10 II. OPERATOR-MONOTONE FUNCTIONS We claim that T admits a (not necessarily bounded) self-adjoint extension Tˆ. This follows from a theorem of von Neumann, (see [12] p.1231), because the anti-linear involution J, defined by ! X X J αi fλi = αi fλi , i i makes D invariant and commutes with T . Since by definition (µ − λ)fµ = (1 − λTˆ)(µfµ − λfλ ), the kernel if 1 − λTˆ is orthogonal to the linear subspace spanned by fµ (µ 6= λ, and µ 6= 0). This subspace ˆ as shown above, so that 1 − λTˆ is injective, hence (1 − λTˆ)−1 is a (unbounded) self-adjoint is dense in H operator. Therefore f0 = (1 − λTˆ)fλ implies fλ = (1 − λTˆ)−1 f0 . R∞ Now let Tˆ = −∞ tdE(t) be the spectral representation of the self-adjoint operator Tˆ, and dm(t) = hdE(t)f0 , f0 i. Then for λ 6= 0 λ−1 {f (λ) − f (0)} = f [1] (λ, 0) D E = hfλ , f0 i = (1 − λTˆ)−1 f0 , f0 Z ∞ 1 = dm(t). 1 − λt −∞ To complete the proof, it remains to prove that the measure m is concentrated on the closed interval [−1, 1]. Take α > 1, and let us show that [α, ∞) is an m-zero set Z α−1 Z ∞ Z α−1 Z ∞ 1 1 dm(t)dλ ≤ dm(t)dλ 2 (1 − λt) (1 − λt)2 0 α 0 −∞ Z α−1 Z α−1 = hfλ , fλ i dλ = f 0 (λ)dλ = f α−1 − f (0) < ∞. 0 By Fubini’s theorem we have Z α−1 Z 0 ∞ α 0 1 dm(t)dλ = (1 − λt)2 Z ∞ α ∞ Z ≥ α α−1 Z 0 1 t Z 1 dλdm(t) (1 − λt)2 1 s−2 dsdm(t). 0 But this last expression is finite inly if [α, ∞) is an m-zero set. Analogously (−∞, −α] is an m-zero set. R1 Remark. In Lemma II.2 −1 dm(t) = hf0 , f0 i = f 0 (0). Lemma II.3. If a continuously differentiable function f on (−1, 1) is operator-monotone, then for any choice −1 < λ1 < µ1 < λ2 < µ2 < 1 ! f [1] (λ1 , µ1 ) f [1] (λ1 , µ2 ) det ≥ 0. f [1] (λ2 , µ1 ) f [1] (λ2 , µ2 ) Proof. We shall use the notations in the proof of the previous Lemma. First remark that Z 1 1 [1] f (λ, µ) = hfλ , fµ i = dm(t). (1 − λt)(1 − µt) −1 If the measure m is concentrated on a single point, say t0 , then the determinant in question is equal to a scalar multiple of ! (1 − λ1 t0 )−1 (1 − µ1 t0 )−1 (1 − λ1 t0 )−1 (1 − µ2 t0 )−1 det , (1 − λ2 t0 )−1 (1 − µ1 t0 )−1 (1 − λ2 t0 )−1 (1 − µ2 t0 )−1 which is just equal to 0. II. OPERATOR-MONOTONE FUNCTIONS 11 Suppose that m is not concentrated on a single point and that the determinant has negative value. Since the corresponding determinant with λ1 = µ1 and λ2 = µ2 has non-negative by Lemma II.1, by using continuity argument there are µ01 and µ02 such that λ1 ≤ µ01 ≤ µ1 and λ2 ≤ µ02 ≤ µ2 , and ! f [1] (λ1 , µ01 ) f [1] (λ1 , µ02 ) det = 0. f [1] (λ2 , µ01 ) f [1] (λ2 , µ02 ) This implies that there are α1 and α2 such that |α1 | + |α2 | 6= 0 and fλi , α1 fµ01 + α2 fµ02 = 0 (i = 1, 2). Choose β1 and β2 so that β1 + β2 = α1 + α2 and β1 λ2 + β2 λ1 = α1 µ02 + α2 µ01 . Then it follows that 0 = β1 fλ1 + β2 fλ2 , α1 fµ01 + α2 fµ02 Z 1 {α1 + α2 − (α1 µ02 + α2 µ01 )t}2 = dm(t). 0 0 −1 (1 − λ1 t)(1 − λ2 t)(1 − µ1 t)(1 − µ2 t) Since the denominator is strictly positive on [−1, 1] and since the measure m is not concentrated on any single point, the above relation implies that α1 + α2 = 0 and α1 µ02 + α2 µ01 = 0, which contradicts |α1 | + |α2 | 6= 0. This contradiction completes the proof. Now let us return to general operator-monotone functions. Take an infinitely many times differentiable R1 function φ on (−∞, ∞) such that φ is non-negative, vanishes outside of (−1, 1) and −1 φ(t)dt = 1. Suppose that f is operator-monotone on (−1, 1). For any 0 < < 1, define a function f() on (−1+, 1−) by Z 1 f() (t) := φ(s/)f (t − s)ds. − The function f() is operator-monotone on (−1 + , 1 − ), because each f (t − ) is so. Obviously f() is infinitely many times differentiable, and as converges to 0, f() (t) converges to f (t) uniformly on any closed subinterval of (−1, 1). Lemma II.4. If a continuous function f on (−1, 1) is operator-monotone, then on any closed subinterval (α, β) of (−1, 1) f satisfies Lipschitz condition, that is, supα<s<t<β f [1] (t, s) < ∞. Proof. Let us use the notations in the preceding comment. Let λ1 = 12 (α − 1) and µ2 = 12 (β + 1), and apply Lemma II.3 to the operator-monotone function f() with λ2 = t and µ1 = s. By taking limit, we can conclude that ! f [1] (λ1 , s) f [1] (λ1 , µ2 ) det ≥ 0, f [1] (t, s) f [1] (t, µ2 ) or equivalently f [1] (λ1 , µ2 )f [1] (t, s) ≤ f [1] (λ1 , s)f [1] (t, µ2 ). The right side of the above inequality is bounded when s < t run over [α, β]. Since f is non-decreasing, both f [1] (λ1 , µ2 ) and f [1] (t, s) are non-negative. Therefore if f [1] (λ1 , µ2 ) 6= 0, then f [1] (t, s) are bounded when s < t run over [α, β]. Finally f [1] (λ1 , µ2 ) = 0 implies f [1] (t, s) = 0 for all s < t in [α, β]. This completes the proof. Theorem II.1. In order that a continuous function f on (−1, 1) is operator-monotone, it is necessary and sufficient that there is a finite positive measure m on [−1, 1] such that Z 1 λ f (λ) = f (0) + dm(t) for λ ∈ (−1, 1). 1 − λt −1 12 II. OPERATOR-MONOTONE FUNCTIONS Proof. Suppose that f admits a representation of the above form. For each t ∈ [−1, 1] the function λ ht (λ) := 1−λt is operator-monotone on (−1, 1). In fact, if t = 0, h0 (λ) = λ is operator-monotone. If t 6= 0, t−2 t−1 −λ is operator-monotone as stated in one of the R1 average −1 ht (λ)dm(t) is operator-monotone, and so is f . ht (λ) = −t−1 + Examples II.1. As a consequence, the weighted Suppose conversely that f is operator-monotone. Then with the notations in front of Lemma II.4, for each 0 < < 1 f() ((1 − )λ) is operator-monotone on (−1, 1). Then since f() ((1 − )λ) is continuously differentiable, by Lemma II.2 there is a measure m on [−1, 1] such that Z 1 λ f() ((1 − )λ) = f() (0) + dm (t). −1 1 − λt R1 We claim that −1 dm (t) are bounded when runs over (0, 1/2). As remarked after Lemma II.2, we have Z 1 0 dm (t) = f() (0). −1 Since Lemma II.4 f satisfies Lipshitz condition on any closed subinterval of (−1, 1), it has derivative f 0 (t) for almost all λ and ess. sup |f 0 (t)| = r < ∞. Further it is easy to see that −<t< 0 f() (0) = (1 − ) Z f 0 (−s)φ − s ds ≤ (1 − )r . R1 These considerations show the boundedness of −1 dm (t) when runs over (0, 1/2). Now by the Helly theorem (see [11] p. 381) there is a sequence n → 0 and a measure m on [−1, 1] such that Z 1 Z 1 lim g(t)dmn (t) = g(t)dm(t) n→∞ −1 −1 for all continuous functions g on [−1, 1]. Therefore for λ ∈ (−1, 1) Z 1 Z 1 λ λ lim dmn (t) = dm(t). n→∞ −1 1 − λt 1 − λt −1 Finally the assertion follows from lim f(n ) ((1 − n )λ) = f (λ). n→∞ The integral representation in Theorem II.1 shows that an operator-monotone function f on (−1, 1) is necessarily infinitely many times differentiable. Further more it admits analytic continuation to the upper and lower open half planes by Z 1 ζ fˆ(ζ) = f (0) + dm(t) for ζ with Im (ζ) 6= 0. 1 − ζt −1 The function fˆ maps the upper half plane to itself. Indeed Z 1 Im (ζ) ˆ Im f (ζ) = 2 dm(t). −1 |1 − ζt| This observation is completed in the following theorem. Theorem II.2. Let f be a real-valued continuous function on a finite or infinite interval (α, β). In order ˆ that f is operator-monotone it is necessary andsufficient that it admits an analytic continuation f to the upper and lower half planes such that Im fˆ(ζ) > 0 for Im (ζ) > 0. II. OPERATOR-MONOTONE FUNCTIONS 13 Proof. Suppose that f admits an analytic continuation fˆ of the type mentioned above. Then by a well-known theorem of Nevanlina (see [1] p. 7) there are a real number a, a non-negative number b and a finite positive measure m ˆ on Ω := (−∞, ∞) \ (α, β) such that Z 1 + ζt fˆ(ζ) = a + bζ + dm(t). ˆ Ω t−ζ The function g(λ) := a + bλ with b ≥ 0 is obviously operator-monotone on (α, β). For each t ∈ / (α, β) the function ht (λ) := 1+λt is operator monotone, because t−λ ht (λ) = −t + 1 + t2 t−λ 1 and the function t−λ is operator-monotone on (α, β). Therefore the weighted average f (λ) is also operatormonotone on (α, β). This completes the proof. Corollary II.2.1. If f is operator-monotone on (−∞, ∞), then f (λ) = a + bλ with some real a and non-negative b. This follows immediately from the integral representation in the proof of Theorem II.2, because the measure m ˆ must vanish. CHAPTER III Operator-Convex Functions The notion next to monotoneousness seems convexity. In this respect, a real-valued continuous function f on a finite or infinite interval (α, β) is said to be operator-convex if 1 1 f (A + B) ≤ {f (A) + f (B)} forA, B ∈ S(α, β). 2 2 The function f is said to be operator-concave if −f is operator-convex. An operator-convex function is convex in the usual sense, but the converse is not true. This chapter is devoted to intrinsic characterization of operator-convexity. Examples III.1. 1. f (λ) := a + bλ is operator-convex on (−∞, ∞). 2. f (λ) := λ2 is operator-convex on (−∞, ∞). In fact, for self-adjoint A and B 2 1 2 1 1 A + B2 − (A + B) = (A − B)2 ≥ 0. 2 2 4 1 3. For µ < α the function f (λ) := λ−µ is operator-convex on (α, ∞). In particular, f (λ) := convex on (0, ∞). In fact, for A, B ∈ S(α, β), A − µ and B − µ belong to S(0, ∞) and 1 λ is operator- 1 (A − µ)−1 + (B − µ)−1 = {(A − µ) : (B − µ)}−1 . 2 By Corollary I.2.4 and I.3.2. −1 1 1 −1 {(A − µ) : (B − µ)} ≥ (A − µ) + (B − µ) 2 2 −1 1 = (A + B) − µ . 2 Lemma III.1. Let f be a twice continuously differentiable function on (−1, 1). If f is operator-convex, then for each µ ∈ (−1, 1) the function g(λ) := f [1] (µ, λ) is operator-monotone. Conversely if f [1] (0, λ) is operator-monotone, then f is operator-convex. Proof. Let f be operator-convex. Obviously g is continuously differentiable on (−1, 1). Inspection of Lemmas II.1 and II.2 will show that for the operator-monotoneousness of g it suffices to prove that N for any choice λi ∈ (−1, 1) i = 1, 2, . . . , N the matrix g [1] (λi , λj ) is positive. Observe the matrix i,j=1 λ1 0 A := λN with λN +1 = µ, considered as a self-adjoint operator on the (N + 1)-dimensional 0 λN +1 ξ¯1 .. 0 . . Hilbert space CN +1 . Take any complex numbers ξ1 , . . . , ξN and consider the matrix B := ξ¯N ξ , . . . , ξN 0 1 N +1 N +1 Since A ∈ S −1, 1; C , for sufficiently small > 0 A + B belongs to S −1, 1; C . Since f is twice 14 III. OPERATOR-CONVEX FUNCTIONS 15 continuously differentiable, just as in the proof of Lemma II.1, it can be shown that the matrix-valued function f (A + B) is twice differentiable and N +1 X d2 f (A + B) = f [2] (λi , λj , λk )Pi BPj BPk d2 =0 i,j,k=1 N +1 where Pi is the orthoprojection to the subspace spanned by the vector ei := (δij )j=1 and f [2] (s, t, u) = d2 f (A + B) [1] [1] hs (t, u) with hs (t) = f (s, t). Now the operator-convexity of f implies ≥ 0, as in the d2 =0 PN case of scalar convex functions. Let x = i=1 ei . Then 0≤ N +1 X f [2] (λi , λj , λk )(Pi BPj BPk x, x) = i,j,k=1 = N X f [2] (λi , λN +1 , λk )ξk ξ¯i = i,k=1 N X g [1] (λi , λk )ξk ξ¯i . i,k=1 N Since (ξi ) are arbitrary, this means that the matrix g [1] (λi , λj ) i,j=1 is positive. Suppose conversely that f [1] (0, λ) is operator-monotone. Then by Theorem II.1 there exists a finite positive measure m on [−1, 1] such that Z 1 λ f [1] (0, λ) = f 0 (0) + dm(t), 1 − λt −1 hence Z 1 f (λ) = a + bλ + −1 λ2 dm(t), 1 − λt 2 0 λ where a = f (0) and b = f (0). For each t ∈ [−1, 1] the function ht (λ) := 1−λt is operator-convex. In fact, if 2 t = 0, ht (λ) = λ , and if t 6= 0, t−2 ht (λ) = −t−2 − t−1 λ + . 1 − λt These functions are operator-convex, as shown in Example III.1. Since the function a+bλ is operator-convex, too, the weighted average f is operator-convex. This completes the proof. Theorem III.1. If a continuous function f on (−1, 1) is operator-monotone, then both the function Rλ f1 (λ) = 0 f (t)dt and f2 (λ) := λf (λ) are operator-convex. Proof. Since f is continuously differentiable by Theorem II.2, both f1 and f2 are twice continuously differentiable. Now since Z 1 [1] [1] f1 (0, λ) = f (sλ)ds and f2 (0, λ) = f (λ), 0 the assertion follows from Lemma III.1. Theorem III.2. In order that a continuous function f on (−1, 1) is operator-convex, it is necessary and sufficient that there are real numbers a and b, and a finite positive measure m on [−1, 1] such that Z 1 λ2 f (λ) = a + bλ + dm(t) for λ ∈ (−1, 1). −1 1 − λt Proof. Sufficiency was shown already in the proof of Lemma III.1. Suppose that f is operator-convex. By using the notations in the proof of Lemma II.4, f() ((1−)λ) is operator-convex for each 0 < < 1. Theref() ((1 − )λ) − f() ((1 − )µ) fore by Lemma III.1 for any µ ∈ (−1, 1) the function is operator-monotone on λ−µ (−1, 1) so that by taking limit as → 0 f [1] (µ, λ) is operator-monotone on (µ + δ, 1) as well as (−1, µ − δ) 16 III. OPERATOR-CONVEX FUNCTIONS for any 0 < δ. By Theorem II.2 this implies that f is twice continuously differentiable on (−1, 1). Now the argument of the proof of Lemma III.1 can be applied. Now let us consider functions on the half-line (0, ∞). Theorem III.3. An operator-monotone function f on (0, ∞) is operator-concave. Proof. It is seen from the proof of Theorem II.2 that f admits a representation Z 0 1 + λt f (λ) = a + bλ + dm(t) −∞ t − λ where b ≥ 0 and m is a positive measure. It suffices to prove that for each t < 0 the function ht (λ) := 1+λt t−λ is operator-concave. If t = 0, h0 (λ) = −1/λ is operator-concave on (0, ∞), as mentioned in Example III.1. If t < 0, 1 + t2 ht (λ) = −t − λ−t is also operator-concave on (0, ∞), as shown in Example III.1. Since the function a + bλ is obviously operator-concave, the weighted average f is operator-concave, too. Corollary III.3.1. If a function f is operator-monotone and f (λ) > 0 on (0, ∞), then g(λ) := f (λ)−1 is operator-convex. Proof. Take A, B ∈ S(0, ∞). Since f is operator-concave and the function 1/λ is operator-convex on (0, ∞), 1 1 f (A + B) ≥ {f (A) + f (B)} 2 2 and g 1 (A + B) 2 −1 1 ≤ {f (A) + f (B)} 2 1 ≤ {g(A) + g(B)}. 2 Thus g is operator-convex. Corollary III.3.2. A function on (0, ∞) is operator-monotone and operator-convex at the same time, only if it is of the form a + bλ. Theorem III.4. A continuous function f on (0, ∞) with f (0) := lim→0+ f () = 0 is operator-convex if and only if f (λ)/λ is operator-monotone. Proof. Suppose that f is operator-convex. By Theorem III.2 it is infinitely many times differentiable. () Then by Lemma III.1 for each > 0 the function f (λ)−f is operator-monotone hence, as the limit, the λ− function f (λ)/λ is operator-monotone. Suppose conversely that f (λ)/λ is operator-monotone. Then as in the proof of Theorem III.3 there are a and b > 0 and a measure m such that Z 0 1 + tλ f (λ)/λ = a + bλ + dm(t), −∞ t − λ hence f (λ) = aλ + bλ2 + Z 0 −∞ λ(1 + λt) dm(t). t−λ III. OPERATOR-CONVEX FUNCTIONS 17 Since bλ2 is operator-convex, it suffices to prove that for each t < 0 the function ht (λ) := operator-convex on (0, ∞). For t = 0 this is obvious. If t 6= 0, ht (λ) = {−(1 + t2 ) − tλ} + λ(1+λt) t−λ is (1 + t2 ) |t| λ−t is operator-convex, as was shown in Example III.1. Let us investigate for what exponent −∞ < s < ∞ the function f (λ) := λs on (0, ∞) is operatormonotone, operator-convex or operator-concave. Examples III.2. 1. f (λ) = λs is operator-monotone (or operator-concave) if and only if 0 ≤ s ≤ 1. This follows from Corollary I.2.2 and Theorem III.3 and from the fact that for s > 1 or < 0 the function f is not concave. 2. f (λ) = λs is operator-convex if and only if 1 ≤ s ≤ 2 or −1 ≤ s ≤ 0. In fact, by Theorem III.4 for s > 0 f (λ) = λs is operator-convex if and only if λs−1 is operator-monotone. Therefore for s > 0 f is operator-convex if and only if 1 ≤ s ≤ 2. By Corollary III.3.1 f (λ) = λs is operator-convex for s −1 −1 ≤ s ≤ 0. For s < −1 the function λλ−1 does not admit any analytic continuation fˆ to the upper half plane such that Im (fˆ(ζ)) > 0 for Im (ζ) > 0, hence f (λ) = λs is not operator-convex by Lemma III.1. Lemma III.2. Let f (λ) > 0 on (0, ∞) and g(λ) := f (λ−1 )−1 . If f is operator-monotone, so is g. If f is operator-convex and f (0) = 0, the function g is operator-convex. Proof. Suppose that f is operator-monotone. Then by Theorem II.2 it admits an analytic continuation fˆ to the complement of the closed negative real semi-axis such that Im fˆ(ζ) > 0 or < 0 according as −1 Im (ζ) > 0 or Im (ζ) < 0. Then gˆ(ζ) := fˆ ζ −1 is an analytic continuation of g to the complement of the closed negative real semi-axis such that Im (g(ζ)) > 0 or < 0 according as Im (ζ) > 0 or < 0. Therefore again by Theorem II.2 g is operator-monotone. Suppose next that f is operator-convex and f (0) = 0. Then by Theorem III.4 the function h(λ) := f (λ)/λ is operator-monotone. By applying the first part of this lemma to h, we can conclude that the −1 function h λ−1 = g(λ)/λ is operator-monotone, hence by Theorem III.4 g is operator-convex. Theorem III.5. Let f be a continuous positive function on (0, ∞) and A, B positive operators. If f is operator-monotone then f (A : B) ≤ f (A) : f (B). If f is operator-convex and f (0) = 0 then f (A : B) ≥ f (A) : f (B). Proof. Suppose that f is operator-monotone. Then by Lemma III.2 g(λ) := f λ−1 monotone, hence operator-concave by Theorem III.2. Therefore 1 −1 1 −1 g A +B ≥ g A−1 + g A−1 2 2 −1 is operator- hence f (A : B) ≤ f (A) : f (B). The other assertion can be proved quite analogously by using Lemma III.2. CHAPTER IV Positive Maps A (non-linear) transformation which maps L+ (H), the set of positive operators on H, to L+ (K) will be called positive. In this chapter we shall study some special classes of positive maps. Let us start with positive linear maps. A positive linear map Φ from L(H) to L(K) preserves orderrelation, that is, A ≤ B implies Φ(A) ≤ Φ(B), and preserves adjoint operation, that is Φ(A∗ ) = Φ(A)∗ . It is said to be normalized if it transforms 1H to 1K . If Φ is normalized, it maps S(α, β; H) to S(α, β; K). Lemma IV.1. A normalized positive linear map φ has the following properties. (i) Φ A2 ≥ Φ(A)2 for A ∈ S(−∞, ∞; H). (ii) Φ A−1 ≥ Φ(A)−1 for A ∈ S(0, ∞; H). (i) By Remark after Corollary I.1.1 it suffices to prove the positivity of R∞ Consider the spectral representation A = −∞ tdE(t). Since Z ∞ Z ∞ A2 = t2 dE(t) and 1 = dE(t), Proof. −∞ Φ A2 Φ(A) ! Φ(A) . Φ(1) −∞ we have, with tensor product notation, ! Z ! ∞ Φ A2 Φ(A) t2 t = ⊗ dE(t). Φ(A) Φ(1) t 1 −∞ ! t2 t Since 2 × 2 matrices are positive, for all −∞ < t < ∞ the right hand of the above expression t 1 is positive. ! Φ(A) Φ(1) . Since A is positive by assumption, it is written (ii) It suffices to prove the positivity of Φ(1) Φ A−1 R∞ in the form ! A = 0+ tdE(t). Now the positivity in question follows from the positivity of matrices t 1 1 t −1 for 0 < t < ∞, as in the proof of (i). Theorem IV.1. Let Φ be a normalized positive linear map. If f is an operator-convex function on (α, β), then f [Φ(A)] ≤ Φ[f (A)] for A ∈ S(α, β; H). Proof. It suffices to consider the case (α, β) = (−1, 1). By Theorem III.2 f admits a representation Z 1 λ2 f (λ) = a + bλ + dm(t) −1 1 − λt with b ≥ 0 and a positive measure m. Since for A ∈ S(−1, 1; H) Z 1 Φ[f (A)] = a + bΦ(A) + Φ A2 (1 − tA)−1 dm(t) −1 18 IV. POSITIVE MAPS and Z 1 f [Φ(A)] = a + bΦ(A) + −1 19 Φ(A)2 {1 − tΦ(A)}−1 dm(t), it suffices to show Φ A2 (1 − tA)−1 ≥ Φ(A)2 {1 − tΦ(A)}−1 for − 1 ≤ t ≤ 1. For t = 0, this follows from Lemma IV.1. For t 6= 0, again by Lemma IV.1 Φ A2 (1 − tA)−1 = −t2 − t−1 Φ(A) + t2 φ (1 − tA)−1 ≥ −t−2 − t−1 Φ(A) + t−2 {1 − tΦ(A)}−1 = Φ(A)2 {1 − tΦ(A)}−1 . This completes the proof. Corollary IV.1.1. Let Φ be a normalized positive linear map. Then for a positive operator A Φ (Ap ) ≥ Φ(A)p (1 ≤ p ≤ 2) and φ (Ap ) ≤ Φ(A)p (0 ≤ p ≤ 1). Proof. As shown in Examples III.1, λp is operator-convex on (0, ∞) for 1 ≤ p ≤ 2 while −λp is operator-convex for 0 ≤ p ≤ 1. 1/p Corollary IV.1.2. If Φ is a normalized positive map and if A is a positive operator, then Φ (Ap ) 1/q Φ (Aq ) whenever 1 ≤ p ≤ q or 12 q ≤ p ≤ 1 ≤ q. ≤ p/q Proof. By Corollary IV.1.1 Φ (Aq ) ≥ Φ (Ap ) whenever p ≤ q. If p ≥ 1in addition, this implies 1/p q 1/q p 1/p Φ (A ) ≥ Φ (A ) by Corollary I.2.2. If q ≥ 1 ≥ p ≥ 12 q, Φ (Ap ) ≤ Φ(Aq )1/q , because λ1/q is operator-monotone. Corollary IV.1.3. If Φ is a positive linear map, for any positive operators A and B Φ(A : B) ≤ Φ(A) : Φ(B) and Φ(A#B) ≤ Φ(A)#Φ(B). Proof. We may assume A ∈ S(0, ∞; H). Let G be the closure of the range of Φ(A). Then there is uniquely a normalized positive linear map Ψ from L(H) to L(G) such that Φ(A)1/2 Ψ(C)Φ(A)1/2 = Φ A1/2 CA1/2 for C ∈ L(H). Since the functions f (λ) := for any positive C 2λ 1+λ and g(λ) := λ1/2 are operator-concave on (0, ∞), by Theorem IV.1 we have Ψ(1 : C) = Ψ(f (C)) ≤ f (Ψ(C)) = 1 : Ψ(C) and Ψ(1#C) = Ψ(g(C)) ≤ g(Ψ(C)) = 1#Ψ(C). With C = A−1/2 BA−1/2 this leads to the following Φ(A : B) = Φ(A)1/2 Ψ(1 : C)Φ(A)1/2 = Φ(A)1/2 {1 : Ψ(C)}Φ(A)1/2 = Φ(A) : Φ(B) and analogously Φ(A#B) ≤ Φ(A)#Φ(B). 20 IV. POSITIVE MAPS The converse of Theorem IV.1 is also true. Theorem IV.2. Let α ≤ 0 ≤ β and f a continuous function on (α, β) with f (0) = 0. If Φ(f (A)) ≥ f (Φ(A)) for every normalized positive linear map Φ and A ∈ S(α, β) then f is operator-convex. Proof. Let K be the subspace of H ⊕ H, consisting of all vectors x ⊕ x. Define the linear ! ! map Φ from A11 A12 1 A11 + A22 A11 + A22 L(H ⊕ H) to L(K), that assigns to the operator , considered on 4 A11 + A22 A11 + A22 A21 A22 K. Then Φ is positive and normalized. Take A, B ∈ S(α, β; H). Then by definition we have " !# ! ! A 0 f (A) 0 1 f (A) + f (B) f (A) + f (B) Φ f =Φ = , 4 f (A) + f (B) f (A) + f (B) 0 B 0 f (B) while " A f Φ 0 0 B !# " 1 1 ! ! !# −1 0 0 1 1 1 √ 1 0 12 (A + B) 2 −1 1 ! ! ! f (0) 0 1 1 1 1 −1 1 √ =√ 0 f 12 (A + B) 2 1 1 2 −1 1 ! 1 f 12 (A + B) f 12 (A + B) . = 2 f 12 (A + B) f 12 (A + B) " A 0 0 B 1 =f √ 2 Now by assumption Φ f !# " A 0 ≥f Φ 0 B !# which implies f 1 1 (A + B) ≤ {f (A) + f (B)}. 2 2 This completes the proof. A (non-linear) map Φ from a convex subset of L(H) to L(K) is said to be a convex map (resp. a concave map) if Φ 12 (A + B) ≤ 12 {Φ(A) + Φ(B)} (resp. Φ 12 (A + B) ≥ 12 {Φ(A) + Φ(B)}). We shall be concerned with convexity or concavity of maps Φs,t (A) = As ⊗ At defined on S(0, ∞; H) where −∞ < s, t < ∞. Lemma IV.2. If Φ and Ψ are concave maps with range in S(0, ∞; K) then the maps Θ(A) := Φ(A)#Ψ(A) and Ξ(A) := Φ(A) : Ψ(A) are concave. Proof. By Corollary I.2.1 and concavity of Φ and Ψ 1 1 1 Θ (A + B) = Φ (A + B) #Ψ (A + B) 2 2 2 1 1 1 1 ≥ Φ(A) + Φ(B) # Ψ(A) + Ψ(B) 2 2 2 2 1 ≥ {Φ(A)#Ψ(A) + Φ(B)#Ψ(B) 2 1 = {Θ(A) + Θ(B)}, 2 which proves the concavity of Θ. The concavity of Ξ is proved analogously by using Corollary I.3.1. Theorem IV.3. The map Φp,q (A) := AP ⊗ Aq is concave if 0 ≤ p, q and p + q ≤ 1. IV. POSITIVE MAPS 21 Proof. Consider the set Ω of (p, q) in R2+ for which Φp,q are concave. We claim that Ω is a convex set. In fact, let (pi , qi ) ∈ Ω and p = 12 (p1 + p2 ), q = 12 (q1 + q2 ). Then since Φp,q (A) = Ap ⊗ Aq = (Ap1 #Ap2 ) ⊗ (Aq1 #Aq2 ) = (Ap1 ⊗ Aq1 ) # (Ap2 ⊗ Aq2 ) = Φp1 ,q1 (A)#Φp2 ,q2 (A), by Lemma IV.2 the map Φp,q is concave. Obviously (0, 0), (1, 0) and (0, 1) belong to Ω, hence so does (p, q) for which 0 ≤ p, q and p + q ≤ 1. Corollary IV.3.1. For positive Ai and Bi (i = 1, 2) (A1 : B1 )p ⊗ (A2 : B2 )q ≤ (Ap1 ⊗ Aq2 ) : (B1p ⊗ B2q ) whenever 0 ≤ p, q and p + q ≤ 1. Proof. The concavity of Φp,q is seen to be equivalent to the inequality With A = A1 0 (A : B)p ⊗ (A : B)q ≤ (Ap ⊗ Aq ) : (B p ⊗ B q ) . ! ! 0 B1 0 and B = this inequality implies the inequality in the assertion. A2 0 B2 Theorem IV.4. If f and g are positive, operator-monotone functions on (0, ∞), then the map Φ(A) := f (A)−1 ⊗ g(A)−1 is convex. −1 −1 Proof. Let h(λ) := f λ−1 and k(λ) := g λ−1 . Then the convexity of Φ is seen to be equivalent to that for A, B ∈ S(0, ∞) 1 {h(A) ⊗ k(A) + h(B) ⊗ k(B)}. 2 By Lemma III.2 both h and k are operator-monotone, so that by Theorem III.5 h(A : B) ⊗ k(A : B) ≤ {h(A : B) ⊗ k(A : B)} ≤ {h(A) : h(B)} ⊗ {k(A) : k(B)} and further by Corollaries I.3.2 and I.2.4 {h(A) : h(B)} ⊗ {k(A) : k(B)} ≤ {h(A)#h(B)} ⊗ {k(A)#k(B)} = {h(A) ⊗ k(A)}#{h(B) ⊗ k(B)} ≤ 1 {h(A) ⊗ k(A) + h(B) ⊗ k(B)}. 2 Corollary IV.4.1. The map Φ−p,−q (A) = A−p ⊗ A−q is convex if 0 ≤ p, q ≤ 1. This follows from Theorem IV.3 and Corollary I.2.2. Theorem IV.5. The map Φ−p,q (A) = A−p ⊗ Aq is convex if 0 ≤ p ≤ q − 1 < 1. Proof. Let us consider first the case q = p + 1 < 2. The well-known integral representation of λp for 0 < p < 1 (see [11], [12] p. ) Z ∞ p −1 λ = π sin(pπ) tp−1 λ(λ + t)−1 dt 0 22 IV. POSITIVE MAPS is applied to get (1 ⊗ A) Z ∞ −1 = π −1 sin(pπ) t1−p A−1 ⊗ A2 A−1 ⊗ A + t dt. Φ−p,p+1 (A) = A−1 ⊗ A p 0 Therefore it suffices to prove that for each t > 0 the map −1 Φt (A) := A−1 ⊗ A2 A−1 ⊗ A + t is convex. To this end, remark, first of all n Φt (A) = 1 ⊗ A − t (A ⊗ 1)−1 + 1 ⊗ t−1 A −1 o−1 , the first term of which is a convex map. It remains to show that the map n −1 o−1 Θt (A) := (A ⊗ 1)−1 + 1 ⊗ t−1 A is convex. But this follows from Lemma IV.2, because 2Θt (A) = Ψ(A) : Ξ(A) where Ψ(A) := A ⊗ 1 and Ξ(A) := 1 ⊗ t−1 A are obviously concave. Next let us consider the case p < q − 1. Let r = p(q − 1). Since 0 < r < 1, the function λr is operator-concave by Theorem III.4, hence for positive A and B r 1 1 1 1 A+ B ≥ Ar + B r , 2 2 2 2 which implies 1 1 A+ B 2 2 −r ≥ 1 r 1 r A + B 2 2 −1 Further the operator-monotoneousness of λq−1 yields −p −(q−1) 1 1 1 r 1 r A+ B ≥ A + B . 2 2 2 2 It follows from the first part of the proof, as Corollary IV.3.1 does from Theorem IV.3, that for positive A, B, C and D −(q−1) q o 1 1 1 1 1 n −(q−1) C+ D ⊗ A+ B ≤ C ⊗ Aq + D−(q−1) ⊗ B q . 2 2 2 2 2 Now use these inequalities with C = Ar and D = B r to get −p q −(q−1) q 1 1 1 1 1 r 1 r 1 1 A+ B ⊗ A+ B ≤ A + B ⊗ A+ B 2 2 2 2 2 2 2 2 1 −p ≤ A ⊗ Aq + B −p ⊗ B q . 2 This shows the convexity of Φ−p,q . It remains to consider the case p = 1 and q = 2. The convexity of Φ−1,2 means that (A + B)−1 ⊗ (A + B)2 ≤ A−1 ⊗ A2 + B −1 ⊗ B 2 . With C = A−1/2 BA−1/2 this inequality is equivalent to the inequality (1 + C)−1 ⊗ (1 + C)A(1 + C) ≤ 1 ⊗ A + C −1 ⊗ CAC, and further to the inequality C ⊗ (CA + AC) ≤ C 2 ⊗ A + 1 ⊗ CAC. IV. POSITIVE MAPS Considering the spectral representation A = A is an orthoprojection, i.e. A2 = A. Then R∞ 0 23 λdE(λ), it suffices to prove the above inequality for the case C 2 ⊗ A + 1 ⊗ CAC − C ⊗ (CA + AC) = (C ⊗ A − 1 ⊗ AC)∗ (C ⊗ A − 1 ⊗ AC) ≥ 0. This completes the proof. ! A 0 instead of A, it is seen that the maps A 7→ As , A 7→ At and R+ 3 0 α α → αs+t are concave, hence as mentioned in Example III.2 we have 0 ≤ s, t and s + t ≤ 1. If Φs,t is convex with s ≤ t, just as above, the maps A 7→ As and A 7→ At are convex, so that 1 ≤ s ≤ t ≤ 2 or −1 ≤ s ≤ t ≤ 0 or −1 ≤ s ≤ 0 ≤ 1 ≤ t ≤ 2. Replacing A by scalar, we see that the map {α, β} → αs β t is convex on the positive cone of R2 . Therefore, for arbitrarily fixed α, β > 0, the function φ(λ) := (α + λ)s (β − λ)t is a convex function of λ in a neighborhood of 0. By differentiation this implies If Φs,t is concave, using s(s − 1)αs−2 β t − 2stαs−1 β t−1 + t(t − 1)αs β t−2 ≥ 0. If −1 ≤ s ≤ 0 ≤ 1 ≤ t ≤ 2 or 1 ≤ s ≤ t ≤ 2, by arbitrariness of α and β the above inequality is equivalent to s2 t2 ≤ st(s − 1)(t − 1). If −1 ≤ s ≤ 0 < 1 ≤ t ≤ 2, this is possible only if s + t ≥ 1. But the case 1 ≤ s ≤ t ≤ 2 is not consistent with the inequality. Thus we can conclude that Theorem IV.3 exhausts all the case Φs,t is concave while Theorem IV.4 and IV.5 exhaust all the case Φs,t (s ≤ t) is convex. Note Chapter I. Corollary I.1.3 is due to Krein [16]. Geometric mean was introduced by Pusz and Woronowicz [20], who proved Theorem I.2. Corollary I.2.3 can be considered as a non-commutative version of the result of −1 Lieb & Ruskai [18]. Anderson & Duffin [2] defined A−1 + B −1 as parallel sum of two positive matrices. Hilbert space operator case was treated in Anderson & Trapp [3], who proved Theorem I.3. Chapter II and III. The content of these chapters is the famous theory of L¨owner [19] and Kraus [15]. The full account of the theory can be found in Donoghue [10] and Davis [9]. Theorem II.1 and II.2 are due to Bendat & Sherman [6]. The Hilbert space method in Chapter II is due to Koranyi [14]. Chapter IV. Theorem IV.1 is due to Davis [8] and Choi [7], while Theorem IV.2 is pointed out in Davis [9]. The inequality Φ(A : B) ≤ Φ(A) : Φ(B) was proved for a special case by Anderson & Trapp [4]. Theorem IV.3 and Corollary IV.4.1 were proved by Lieb [17] by a different method. Epstein [13] gave a simpler proof. Uhlmann [21] also used geometric means to prove Theorem IV.3. The idea in the proof of Theorem IV.5 will be developed in a forthcoming paper [5]. 24 Bibliography [1] [2] [3] [4] N. I. Akhiezer and I.M. Glazman. Theory of linear operators in Hilbert space. Pitman Pub., Boston :, 1981. W. N. Anderson, Jr. and R. J. Duffin. Series and parallel addition of matrices. J. Math. Anal. Appl., 26:576–594, 1969. W. N. Anderson, Jr. and G. E. Trapp. Shorted operators. II. SIAM J. Appl. Math., 28:60–71, 1975. William N. Anderson, Jr. and George E. Trapp. A class of monotone operator functions related to electrical network theory. Linear Algebra and Appl., 15(1):53–67, 1976. [5] T. Ando. Concavity of certain maps on positive definite matrices and applications to Hadamard products. Linear Algebra Appl., 26:203–241, 1979. [6] Julius Bendat and Seymour Sherman. Monotone and convex operator functions. Trans. Amer. Math. Soc., 79:58–71, 1955. [7] Man Duen Choi. A Schwarz inequality for positive linear maps on C ∗ -algebras. Illinois J. Math., 18:565–574, 1974. [8] Chandler Davis. A Schwarz inequality for convex operator functions. Proc. Amer. Math. Soc., 8:42–44, 1957. [9] Chandler Davis. Notions generalizing convexity for functions defined on spaces of matrices. In Proc. Sympos. Pure Math., Vol. VII, pages 187–201. Amer. Math. Soc., Providence, R.I., 1963. [10] William F. Donoghue, Jr. Monotone matrix functions and analytic continuation. Springer-Verlag, New York, 1974. Die Grundlehren der mathematischen Wissenschaften, Band 207. [11] Nelson Dunford and Jacob T. Schwartz. Linear operators. Part I. Wiley Classics Library. John Wiley & Sons Inc., New York, 1988. General theory, With the assistance of William G. Bade and Robert G. Bartle, Reprint of the 1958 original, A Wiley-Interscience Publication. [12] Nelson Dunford and Jacob T. Schwartz. Linear operators. Part II. Wiley Classics Library. John Wiley & Sons Inc., New [13] [14] [15] [16] York, 1988. Spectral theory. Selfadjoint operators in Hilbert space, With the assistance of William G. Bade and Robert G. Bartle, Reprint of the 1963 original, A Wiley-Interscience Publication. H. Epstein. Remarks on two theorems of E. Lieb. Comm. Math. Phys., 31:317–325, 1973. A. Kor´ anyi. On a theorem of L¨ owner and its connections with resolvents of selfadjoint transformations. Acta Sci. Math. Szeged, 17:63–70, 1956. ¨ Fritz Kraus. Uber konvexe Matrixfunktionen. Math. Z., 41(1):18–42, 1936. M. Krein. The theory of self-adjoint extensions of semi-bounded Hermitian transformations and its applications. I. Rec. Math. [Mat. Sbornik] N.S., 20(62):431–495, 1947. [17] Elliott H. Lieb. Convex trace functions and the Wigner-Yanase-Dyson conjecture. Advances in Math., 11:267–288, 1973. [18] Elliott H. Lieb and Mary Beth Ruskai. Some operator inequalities of the Schwarz type. Advances in Math., 12:269–273, 1974. ¨ [19] Karl L¨ owner. Uber monotone Matrixfunktionen. Math. Z., 38(1):177–216, 1934. [20] W. Pusz and S. L. Woronowicz. Functional calculus for sesquilinear forms and the purification map. Rep. Mathematical Phys., 8(2):159–170, 1975. [21] A. Uhlmann. Relative entropy and the Wigner-Yanase-Dyson-Lieb concavity in an interpolation theory. Comm. Math. Phys., 54(1):21–32, 1977. 25 Index concave map, 20 convex map, 20 geometric mean, 5 harmonic mean, 6 normalized map, 18 operator-concave function, 14 operator-convex function, 14 operator-monotone function, 8 positive operator, 3 27 Symbols A : B, 6 A#B, 5 G,H,K, 3 L(H), 3 L+ (H), 3 S(α, β), 8 S(α, β; H), 8 Φs,t (A), 20 29
© Copyright 2024