Skip to main content

Extension of Jensen’s inequality to affine combinations

Abstract

The article provides the generalization of Jensen’s inequality for convex functions on the line segments. The main and preliminary inequalities are expressed in discrete form using affine combinations that can be reduced to convex combinations. The resulting quasi-arithmetic means are used to extend the two well-known inequalities.

MSC:26A51, 26D15.

1 Introduction

First of all, we recapitulate the basic notations of affinity and convexity relating to the binomial combinations. Let X be a real vector space.

A set AX is affine if it contains all binomial affine combinations in A, that is, the combinations αa+βb of points a,bA and coefficients α,βR of the sum α+β=1. The affine hull of a set SX as the smallest affine set that contains S is denoted affS. A function f:AR is affine if the equality

f(αa+βb)=αf(a)+βf(b)
(1.1)

holds for all binomial affine combinations αa+βb in A.

A set CX is convex if it contains all binomial convex combinations in C, that is, the combinations αa+βb of points a,bC and non-negative coefficients α,βR of the sum α+β=1. The convex hull of a set SX as the smallest convex set that contains S is denoted convS. A function f:CR is convex if the inequality

f(αa+βb)αf(a)+βf(b)
(1.2)

holds for all binomial convex combinations αa+βb in C.

Using mathematical induction, it can be proved that everything said above is true for all n-membered affine or convex combinations. In this way, Jensen (see [1]) has extended the inequality in equation (1.2).

We will consider affine and convex combinations of the real numbers, the case where X=R. Take real numbers a and b such that ab. Every number xR can be uniquely presented as the binomial affine combination

x= b x b a a+ x a b a b,
(1.3)

which is convex if, and only if, the number x belongs to the interval conv{a,b}. Given the function f:RR, let f { a , b } line :RR be the function of the chord line passing through the points A(a,f(a)) and B(b,f(b)) of the graph of f. Applying the affinity of f { a , b } line to the combination in equation (1.3), we get

f { a , b } line (x)= b x b a f(a)+ x a b a f(b).
(1.4)

If the function f is convex, then we have the inequality

f(x) f { a , b } line (x)if xconv{a,b},
(1.5)

and the reverse inequality

f(x) f { a , b } line (x)if xconv{a,b}{a,b}.
(1.6)

A wide area of convex analysis including convex functions and their inequalities is covered in [2]. The practical applications of convex analysis are presented in [3].

A brief scientific-historic background on Jensen’s inequality follows at the end of this introduction. Because of its attractiveness, Jensen’s and related inequalities were studied during the whole last century. So, there are Steffensen’s, Brunk’s, and Olkin’s inequality. In this century the research goes on, and we got Jensen-Mercer’s and Mercer-Steffensen’s inequality. For information as regards these inequalities, one may refer to papers [47] and [8].

2 Generalization and reversal of Jensen-Mercer’s inequality

In order to generalize Jensen’s inequality, we firstly consider Jensen-Mercer’s inequality. A generalization of this inequality is given in Corollary 2.2, and the reverse inequality is specified in Corollary 2.4. The presentation of the section is based on the affine combinations in equation (2.1), and the simple inequality in equation (2.2).

Let α,β,γR be coefficients such that α+βγ=1. Let a,b,cR be points. We consider the affine combination αa+βbγc. Including the affine combination c=λa+μb assuming that λ+μ=1, we get the binomial form

αa+βbγc=(αγλ)a+(βγμ)b.
(2.1)

Lemma 2.1 Let α,β,γ[0,1] be coefficients such that α+βγ=1. Let a,b,cR be points such that cconv{a,b}.

Then the affine combination αa+βbγc is in conv{a,b}, and every convex function f:conv{a,b}R satisfies the inequality

f(αa+βbγc)αf(a)+βf(b)γf(c).
(2.2)

Proof The condition c=λa+μbconv{a,b} involves λ,μ[0,1]. Then the binomial combination of the right-hand side in (2.1) is convex since its coefficients αγλαγ=1β0 and also βγμ0. So, the combination αa+βbγc belongs to conv{a,b}.

Since the inequality in equation (2.2) is trivially true for a=b, we suppose that ab. Applying the inequality in equation (1.5), and the affinity of f { a , b } line , we get

f(αa+βbγc) f { a , b } line (αa+βbγc)
(2.3)
=αf(a)+βf(b)γ f { a , b } line (c)
(2.4)
αf(a)+βf(b)γf(c)
(2.5)

because f { a , b } line (c)f(c). □

The inequality in equation (2.2) can also be proved applying the convexity of the function f to the right-hand side of the representation in equation (2.1), wherein we use the inequality f(c)λf(a)+μf(b).

Corollary 2.2 Let α,β,γ[0,1] and γ i [0,1] be coefficients such that α+βγ= i = 1 n γ i =1. Let a,b, c i R be points such that all c i conv{a,b}.

Then the affine combination αa+βbγ i = 1 n γ i c i is in conv{a,b}, and every convex function f:conv{a,b}R satisfies the inequality

f ( α a + β b γ i = 1 n γ i c i ) αf(a)+βf(b)γ i = 1 n γ i f( c i ).
(2.6)

As demonstrated above, to prove the inequality in equation (2.6) it is not necessary to use the Jensen inequality. Lemma 2.1 and Corollary 2.2 are also valid for γ[1,1] because then the observed affine combinations with γ0 become convex, and the associated inequalities follow from Jensen’s inequality. The combinations including γ[1,1] were observed in [[9], Corollary 11 and Theorem 12] additionally using a monotone function g.

If α=β=γ=1, then the inequality in equation (2.6) is reduced to Mercer’s variant of Jensen’s inequality obtained in [6]. Another generalization of Mercer’s result was achieved in [10] using the majorization assumptions. The importance and impact of the majorization theory to inequalities can be found in [11].

Lemma 2.3 Let α,β,γ[1,) be coefficients such that α+βγ=1. Let a,b,cR be points such that cconv{a,b}{a,b}.

Then the affine combination αa+βbγc is not in conv{a,b}{a,b}, and every convex function f:conv{a,b,c}R satisfies the inequality

f(αa+βbγc)αf(a)+βf(b)γf(c).
(2.7)

Proof The condition c=λa+μbconv{a,b}{a,b} entails λ0 or λ1, and the first coefficient of the binomial form of (2.1) satisfies αγλα1 if λ0, or αγλαγ=1β0 if λ1. So, the combination αa+βbγ does not belong to conv{a,b}{a,b}. Applying the inequality in (1.6), we get the series of inequalities as in equations (2.3), (2.4), and (2.5), but with the reverse inequality signs. □

It is not necessary to require γ[1,) in Lemma 2.3, it follows from the conditions α,β[1,) and α+βγ=1.

Corollary 2.4 Let α,β,γ[1,) and γ i [0,1] be coefficients such that α+βγ= i = 1 n γ i =1. Let a,b, c i R be points such that any c i conv{a,b}{a,b} and the convex combination i = 1 n γ i c i conv{a,b}{a,b}.

Then the affine combination αa+βbγ i = 1 n γ i c i is not in conv{a,b}{a,b}, and every convex function f:conv{a,b, c i }R satisfies the inequality

f ( α a + β b γ i = 1 n γ i c i ) αf(a)+βf(b)γ i = 1 n γ i f( c i ).
(2.8)

3 Main results

The aim of the paper is to generalize the inequality in equation (2.2). The first step is already done in Corollary 2.2 in the expression of the inequality in equation (2.6), presenting the point c as the convex combination. We are going to proceed in the same way with the points a and b. The main theorem follows.

Theorem 3.1 Let α i , β j , γ k 0 be coefficients with the sums α= i = 1 n α i , β= j = 1 m β j , and γ= k = 1 l γ k satisfying 0<α,β1 and α+βγ=1. Let a i , b j , c k R be points such that all c k conv{a,b} where

a= 1 α i = 1 n α i a i ,b= 1 β j = 1 m β j b j .
(3.1)

Then the affine combination

i = 1 n α i a i + j = 1 m β j b j k = 1 l γ k c k
(3.2)

belongs to conv{a,b}, and every convex function f:conv{ a i , b j }R satisfies the inequality

f ( i = 1 n α i a i + j = 1 m β j b j k = 1 l γ k c k ) i = 1 n α i f ( a i ) + j = 1 m β j f ( b j ) k = 1 l γ k f ( c k ) .
(3.3)

If f is concave, then the reverse inequality is valid in equation (3.3). If f is affine, then the equality is valid in equation (3.3).

Proof Note that γα,β. Prove the theorem for the convex function f.

If γ=0, the combination in equation (3.2) takes the convex form αa+βb belonging to conv{a,b}, and the inequality in equation (3.3) is reduced to Jensen’s inequality.

If γ>0, then, including points a, b, and

c= 1 γ k = 1 l γ k c k
(3.4)

in equation (3.2), we get the combination αa+βbγc which belongs to conv{a,b} by Lemma 2.1. The inequality in equation (3.3) is trivially true for a=b. So, we assume that ab and use the function f { a , b } line . Applying the affinity of f { a , b } line to the convex combination in equation (3.4), and respecting the inequalities f { a , b } line ( c k )f( c k ), we have

f { a , b } line (c)= 1 γ k = 1 l γ k f { a , b } line ( c k ) 1 γ k = 1 l γ k f( c k ).
(3.5)

Using the inequality in equation (2.4), applying Jensen’s inequality to f(a) and f(b), and finally using the inequality in equation (3.5) respecting the minus, we get

f ( i = 1 n α i a i + j = 1 m β j b j k = 1 l γ k c k ) α f ( a ) + β f ( b ) γ f { a , b } line ( c ) i = 1 n α i f ( a i ) + j = 1 m β j f ( b j ) k = 1 l γ k f ( c k ) ,

completing the proof. □

Corollary 3.2 Let all the assumptions of Theorem  3.1 be fully satisfied. Let r = 1 s λ r x r , x r conv{a,b}, and λa+μb be convex combinations having the common center with the affine combination in equation (3.2), that is,

r = 1 s λ r x r =λa+μb= i = 1 n α i a i + j = 1 m β j b j k = 1 l γ k c k .
(3.6)

Then every convex function f:conv{ a i , b j }R satisfies the double inequality

r = 1 s λ r f( x r )λf(a)+μf(b) i = 1 n α i f( a i )+ j = 1 m β j f( b j ) k = 1 l γ k f( c k ).
(3.7)

If f is concave, then the reverse inequalities are valid in equation (3.7). If f is affine, then the equalities are valid in equation (3.7).

Proof Relying on the function f { a , b } line and the Jensen inequality, we derive the series of inequalities

r = 1 s λ r f ( x r ) r = 1 s λ r f { a , b } line ( x r ) = f { a , b } line ( r = 1 s λ r x r ) = f { a , b } line ( λ a + μ b ) = λ f ( a ) + μ f ( b ) = f { a , b } line ( α a + β b k = 1 l γ k c k ) = α f ( a ) + β f ( b ) k = 1 l γ k f { a , b } line ( c k ) i = 1 n α i f ( a i ) + j = 1 m β j f ( b j ) k = 1 l γ k f ( c k )

containing the double inequality in equation (3.7). □

Corollary 3.3 Let all the assumptions of Theorem  3.1 be fully satisfied. Let r = 1 s λ r x r be a convex combination with points x r conv{a,b} such that

r = 1 s λ r x r + k = 1 l γ k c k = i = 1 n α i a i + j = 1 m β j b j .
(3.8)

Then every convex function f:conv{ a i , b j }R satisfies the inequality

r = 1 s λ r f( x r )+ k = 1 l γ k f( c k ) i = 1 n α i f( a i )+ j = 1 m β j f( b j ).
(3.9)

We finish the section with the basic integral variant of the main theorem. In the corollary that follows we use segments with different endpoints.

Corollary 3.4 Let α,β,γ[0,1] be coefficients such that α+βγ=1. Let [ a 1 , a 2 ],[ b 1 , b 2 ],[ c 1 , c 2 ]R be segments such that

[ c 1 , c 2 ] [ a 1 + a 2 2 , b 1 + b 2 2 ] .

Then the integral sum

x ¯ = α a 2 a 1 a 1 a 2 xdx+ β b 2 b 1 b 1 b 2 xdx γ c 2 c 1 c 1 c 2 xdx
(3.10)

belongs to [( a 1 + a 2 )/2,( b 1 + b 2 )/2], and every convex function f:[ a 1 , b 2 ]R satisfies the inequality

f( x ¯ ) α a 2 a 1 a 1 a 2 f(x)dx+ β b 2 b 1 b 1 b 2 f(x)dx γ c 2 c 1 c 1 c 2 f(x)dx.
(3.11)

If f is concave, then the reverse inequality is valid in equation (3.11). If f is affine, then the equality is valid in equation (3.11).

Proof The sum of (3.10) is equal to the affine combination

x ¯ =α a 1 + a 2 2 +β b 1 + b 1 2 γ c 1 + c 2 2
(3.12)

belonging to [( a 1 + a 2 )/2,( b 1 + b 2 )/2] by Lemma 2.1. Given the positive integer n, we can construct the affine combination

x ¯ n = i = 1 n α n i a n i + j = 1 n β n j b n j k = 1 n γ n k c n k ,
(3.13)

which approaches the integral sum in equation (3.10) as n approaches infinity. Let us show how it looks in the example of the first member of the above combination.

We make the partition [ a 1 , a 2 ]= i = 1 n [ ( a 1 ) n i , ( a 2 ) n i ] where all subsegments have the same length ( a 2 a 1 )/n, and the adjacent subsegments have a common endpoint. If we take points a n i [ ( a 1 ) n i , ( a 2 ) n i ], then we have

lim n i = 1 n 1 n a n i = lim n i = 1 n ( a 2 ) n i ( a 1 ) n i a 2 a 1 a n i = 1 a 2 a 1 a 1 a 2 xdx
(3.14)

suggesting to take α n i =α/n.

Applying Theorem 3.1 to the affine combination in equation (3.13) and the convex function f, we get

f( x ¯ n ) i = 1 n α n i f( a n i )+ j = 1 n β n j f( b n j ) k = 1 n γ n k f( c n k ),
(3.15)

and letting n to infinity, we obtain the integral inequality in equation (3.11). □

4 Application to quasi-arithmetic means

Let IR be an interval. Applying the Jensen inequality to the means, we use strictly monotone continuous functions φ,ψ:IR such that the function ψ φ 1 :φ(I)R is convex. In this case, we say that ψ is convex with respect to φ, or ψ is φ-convex. The same notation is used for the concavity. If the function ψ is φ-convex and φ-concave, we can say that it is φ-affine.

Suppose that

x ¯ = i = 1 n α i a i + j = 1 m β j b j k = 1 l γ k c k
(4.1)

is an affine combination as in Theorem 3.1, and φ:IR is a strictly monotone continuous function where I=conv{ a i , b j }. The discrete φ-quasi-arithmetic mean of the combination x ¯ can be defined as the point

M φ ( x ¯ )= φ 1 ( i = 1 n α i φ ( a i ) + j = 1 m β j φ ( b j ) k = 1 l γ k φ ( c k ) )
(4.2)

belonging to conv{a,b}, because the affine combination enclosed in parentheses is located in φ(conv{a,b}). The quasi-arithmetic means defined in equation (4.2) are invariant with respect to the affinity, that is, the equality

M φ ( x ¯ )= M u φ + v ( x ¯ )
(4.3)

holds for all pairs of real numbers u0 and v.

The order of the pair of quasi-arithmetic means M φ and M ψ depends on convexity of the function ψ φ 1 and monotonicity of the function ψ, as follows.

Corollary 4.1 Let x ¯ be an affine combination as in equation (4.1) satisfying all the assumptions of Theorem  3.1. Let φ,ψ:IR be strictly monotone continuous functions where I=conv{ a i , b j }.

If ψ is either φ-convex and increasing or φ-concave and decreasing, then we have the inequality

M φ ( x ¯ ) M ψ ( x ¯ ).
(4.4)

If ψ is either φ-convex and decreasing or φ-concave and increasing, then we have the reverse inequality in equation (4.4).

If ψ is φ-affine, then the equality is valid in equation (4.4).

Proof We prove the case that the function ψ is φ-convex and increasing. Put the set J=φ(I)=conv{φ( a i ),φ( b j )}. Applying the inequality in equation (3.3) of Theorem 3.1 to the affine combination

x ¯ φ = i = 1 n α i φ( a i )+ j = 1 m β j φ( b j ) k = 1 l γ k φ( c k ),
(4.5)

which is in the set φ(conv{a,b}), and the convex function f=ψ φ 1 :JR, we get

ψ φ 1 ( x ¯ φ ) x ¯ ψ .

Assigning the increasing function ψ 1 to the above inequality, we attain

M φ ( x ¯ )= φ 1 ( x ¯ φ ) ψ 1 ( x ¯ ψ )= M ψ ( x ¯ ),

which finishes the proof. □

The inequality in equation (4.4) may further be applied to the power means. The monotonicity of these power means is also valid. We will only derive the formula of the harmonic-geometric-arithmetic mean inequality.

Corollary 4.2 If x ¯ is an affine combination as in equation (4.1) satisfying all the assumptions of Theorem  3.1 with the addition that all a i , b j >0, then we have the harmonic-geometric-arithmetic mean inequality

( i = 1 n α i a i + j = 1 m β j b j k = 1 l γ k c k ) 1 i = 1 n a i α i j = 1 m b j β j k = 1 l c k γ k i = 1 n α i a i + j = 1 m β j b j k = 1 l γ k c k .
(4.6)

Proof Let φ 1 (x)= x 1 , φ 0 (x)=lnx, and φ 1 (x)=x. Applying the inequality in equation (4.4), first to the pair φ 1 and φ 0 , and then to the pair φ 0 and φ 1 , we get M 1 M 0 M 1 representing the inequality in equation (4.6). □

The general approach to the theory of means and their inequalities was presented in [12]. Different forms of quasi-arithmetic means were considered in [13].

5 Application to other inequalities

The essential inequalities in equations (2.2) and (2.7) can be used to extend the Bernoulli and the Young inequality. Let us start with a simplified case of the mean inequality in equation (4.6).

Corollary 5.1 If numbers α,β,γ[0,1] satisfy α+βγ=1, then the double inequality

( α a + β b γ c ) 1 a α b β c γ αa+βbγc
(5.1)

holds for all numbers a,b,c>0 such that cconv{a,b}.

If α,β,γ[1,) satisfy α+βγ=1, then the reverse double inequality is valid in equation (5.1) for all a,b,c>0 such that cconv{a,b}{a,b}.

Proof The inequality in equation (5.1) follows from the inequality in equation (4.6) with α i =α, β j =β, γ k =γ and a i =a, b j =b, c k =c. So, it is actually a consequence of the inequality in equation (2.2).

The reverse inequality of equation (5.1) can be obtained from the inequality in equation (2.7) using the functions φ 1 , φ 0 , and φ 1 . □

The previous corollary with a suitable choice of the coefficients and the points will be applied in the following two examples.

Example 5.2 If numbers p,q,r[0,1] satisfy p+qr=1, then the double inequality

( p 1 + x + q 1 + y r 1 + z ) 1 ( 1 + x ) p ( 1 + y ) q ( 1 + z ) r 1+px+qyrz
(5.2)

holds for all numbers x,y,z>1 such that zconv{x,y}.

If p,q,r[1,) satisfy p+qr=1, then the reverse double inequality is valid in equation (5.2) for all x,y,z>1 such that zconv{x,y}{x,y}.

Proof Corollary 5.1 should be applied to the coefficients α=p, β=q, γ=r and the points a=1+x, b=1+y, c=1+z. □

The inequality

1 + t 1 + t p t ( 1 + t ) p 1+pt
(5.3)

applying to p[0,1] and t>1 follows from the inequality in equation (5.2) putting first y=z=0 and taking t=x(1,0], then putting x=z=0 and taking t=y[0,). In the same way, from the reverse inequality of equation (5.2) we can obtain the reverse inequality of equation (5.3) respecting p[1,) and t>1. The right-hand sides of these inequalities are the well-known Bernoulli’s inequalities.

Example 5.3 If numbers p,q[1,) and r[0,1] satisfy 1/p+1/qr=1, then the double inequality

( x p p + y q q r z 1 ) 1 x y z r x p p + y q q rz
(5.4)

holds for all numbers x,y,z>0 such that zconv{ x p , y q }.

If p,q(0,1] and r[1,) satisfy 1/p+1/qr=1, then the reverse double inequality is valid in equation (5.4) for all x,y,z>0 such that zconv{ x p , y q }{ x p , y q }.

Proof Corollary 5.1 should be applied to the coefficients α=1/p, β=1/q, γ=r and the points a= x p , b= y q , c=z. □

Taking r=0 in equation (5.4), we have the inequality

( x p p + y q q ) 1 xy x p p + y q q ,
(5.5)

which applies to numbers p,q(1,) with the sum 1/p+1/q=1, as well as to numbers x,y>0. The right-hand side of the inequality in equation (5.5) represents the discrete form of Young’s inequality.

References

  1. Jensen JLWV: Om konvekse Funktioner og Uligheder mellem Middelværdier. Nyt Tidsskr. Math. B 1905, 16: 49-68.

    Google Scholar 

  2. Pečarić JE, Proschan F, Tong YL: Convex Functions, Partial Orderings, and Statistical Applications. Academic Press, New York; 1992.

    MATH  Google Scholar 

  3. Rockafellar RT: Convex Analysis. Princeton University Press, Princeton; 1972.

    MATH  Google Scholar 

  4. Pečarić JE: A simple proof of the Jensen-Steffensen inequality. Am. Math. Mon. 1984, 91: 195-196. 10.2307/2322359

    Article  MathSciNet  MATH  Google Scholar 

  5. Bjelica M: Refinement and converse of Brunk-Olkin inequality. J. Math. Anal. Appl. 1998, 272: 462-467.

    Article  MathSciNet  MATH  Google Scholar 

  6. Mercer AMcD: A variant of Jensen’s inequality. J. Inequal. Pure Appl. Math. 2003. Article ID 73, 4: Article ID 73

    Google Scholar 

  7. Pečarić JE, Perušić A, Smoljak K: Mercer and Wu-Srivastava generalisations of Steffensen’s inequality. Appl. Math. Comput. 2013, 219: 10548-10558. 10.1016/j.amc.2013.04.028

    Article  MathSciNet  MATH  Google Scholar 

  8. Pavić Z: Generalizations of Jensen-Mercer’s inequality. J. Pure Appl. Math. Adv. Appl. 2014, 11: 19-36.

    Google Scholar 

  9. Pavić Z: The applications of functional variants of Jensen’s inequality. J. Funct. Spaces Appl. 2013. Article ID 194830, 2013: Article ID 194830

    Google Scholar 

  10. Niezgoda M: A generalization of Mercer’s result on convex functions. Nonlinear Anal., Theory Methods Appl. 2009, 71: 2771-2779. 10.1016/j.na.2009.01.120

    Article  MathSciNet  MATH  Google Scholar 

  11. Marshall AW, Olkin I: Inequalities: Theory of Majorization and Its Applications. Academic Press, New York; 1979.

    MATH  Google Scholar 

  12. Bullen PS, Mitrinović DS, Vasić PM: Means and Their Inequalities. Reidel, Dordrecht; 1988.

    Book  MATH  Google Scholar 

  13. Mićić J, Pavić Z, Pečarić J: The inequalities for quasiarithmetic means. Abstr. Appl. Anal. 2012. Article ID 203145, 2012: Article ID 203145

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zlatko Pavić.

Additional information

Competing interests

The author declares that they have no competing interests.

Rights and permissions

Open Access  This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made.

The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

To view a copy of this licence, visit https://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Pavić, Z. Extension of Jensen’s inequality to affine combinations. J Inequal Appl 2014, 298 (2014). https://doi.org/10.1186/1029-242X-2014-298

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/1029-242X-2014-298

Keywords