Cauchy – Schwarz Inequality


For all real vectors (?) x and y, the Cauchy – Schwarz inequality states:

|xy|xy

This inequality is equivalent to the triangle inequality for a norm in a space with a scalar product...

Before we dive into proving the Cauchy – Schwarz inequality, let’s clear up an important distinction between absolute value and norm — two concepts that often look similar but are different in meaning.


What’s the Difference Between |x| and x?

Example: |5|=5, |5|=5.


Example: For x=(3,4),

x=32+42=5

Key Difference


Cauchy – Schwarz Inequality: The Statement

The Cauchy – Schwarz inequality says:

|xy|xy

Where:

So, basically, looking at the dot product formula with cos(θ), the Cauchy – Schwarz inequality tells us that the dot product is always bounded by the lengths of the vectors.
cos(θ) can never exceed 1, so the dot product can never exceed the product of the lengths.


What Does It Mean?

Cauchy – Schwarz inequality tells us:

The dot product between two vectors is always smaller than or equal to the product of their lengths.

It’s like comparing the "shadow" of one vector on another to their total lengths. The dot product measures the alignment between vectors:


Proof of the Cauchy – Schwarz Inequality

Let’s prove it step by step using a clever trick with quadratic equations:

Step 1: Start with a General Vector Expression

For any real vectors x and y, define:

f(λ)=xλy2
Remark

f(λ) is a function of a real parameter λ that returns the norm of the difference between x and λy.
In other words, it’s the squared distance between the vectors.

Xyx-yvector "difference"

Since norms are always non-negative, we know:

f(λ)0for all real λ

Step 2: Expand the Norm

Using the definition of the dot product and norm:

So, basically:

x2=(x12+x22++xn2)2=x12+x22++xn2=xx

Similarly, we can say:

f(λ)=(xλy)(xλy)

Expand the dot product:

f(λ)=xx2λ(xy)+λ2(yy)

Step 3: Rewrite in a Quadratic Form

Let’s set it up as a quadratic expression in terms of λ:

f(λ)=x22λ(xy)+λ2y2

Step 4: Analyze the Quadratic Inequality

Since f(λ)0 for all λ, the quadratic equation:

λ2y22λ(xy)+x20

must have no real roots or a non-positive discriminant. The discriminant (Δ) for a quadratic aλ2+bλ+c is:

Δ=b24ac

Here:


Step 5: Set the Discriminant Condition

Since f(λ)0, the discriminant must be less than or equal to zero:

Δ=(2(xy))24(y2)(x2)0

Step 6: Simplify the Discriminant

4(xy)24x2y20

Divide both sides by 4 and a little rearrangement gives:

(xy)2x2y2

Step 7: Take the Square Root

Finally, taking the square root of both sides:

|xy|xy

And there you have it — Cauchy – Schwarz proven!


...

Links to notes, where the Cauchy – Schwarz inequality is used: