Introduction

Suppose \( (X, \Vert .\Vert ) \) is a finite dimensional normed linear space over the real field. Let \( S_{X} \) and \( B_{X} \) denote the unit sphere and the unit ball of \( (X, \Vert .\Vert ) \) respectively i.e. \( S_{X}=\{x\in X:\Vert x\Vert =1\}\) and \( B_{X}=\{x\in X:\Vert x\Vert \le 1\}\).

In a normed linear space there are several notions of orthogonality, all of which are generalizations of orthogonality in an inner product space. Different notions of orthogonality in a normed linear space have been studied by many mathematicians over the time, some of them are Birkhoff [4], James [79], Kapoor and Prasad [10], Alonso [2, 3], Saidi [18, 19], Alber [1], Dragomir and Kikianty [5] . One of the most natural and important notion of orthogonality in a normed linear space is due to Birkhoff and James.

An element \( x \) is said to be orthogonal to \( y \) in X in the sense of Birkhoff–James, written as, \( x \bot _{B}y \), iff

$$\begin{aligned} \Vert x \Vert \le \Vert x + \lambda y \Vert ~~ \quad \text{ for } \text{ all } \text{ scalars } ~\lambda . \end{aligned}$$

The notion of orthogonality in the sense of Birkhoff–James plays a central role in the study of geometry of Banach spaces. The connection of Birkhoff–James orthogonality with various geometric properties of the norm, like strict convexity, uniform convexity and smoothness, has been studied in great detail in [79, 14, 16, 22, 23] and many other papers. Recently in [20], using the notion of Birkhoff–James orthogonality in normed linear spaces, we have characterized finite dimensional inner product spaces in terms of operator norm attainment. Till date this is a very active area of research with many open problems and interesting results.

If X is an inner product space and \( x,y \ne \theta \), then \( x \bot _{B}y \) implies \( \Vert x \Vert < \Vert x + \lambda y \Vert \) for all scalars \( \lambda \ne 0.\) Motivated by this fact we have studied the notion of strong orthogonality in [15]. An element \( x \) is said to be strongly orthogonal to another element \( y \) in the sense of Birkhoff–James, written as \( x \perp _{SB}y, \) iff

$$\begin{aligned} \Vert x \Vert < \Vert x + \lambda y \Vert ~~ \quad \text{ for } \text{ all } \text{ scalars } ~\lambda \ne 0 . \end{aligned}$$

Although we have given an explicit definition of strong orthogonality between two vectors \(x\) and \(y\), an equivalent form of the definition was already there in [17] under the name ‘strict orthogonality’, we were unware of this fact when we introduced the definition in [15].

We now mention the relevant definitions and terminologies to be used throughout the paper.

Strongly Orthogonal Set Relative to an Element: A finite set of elements \( S = \{x_1,x_2, \ldots ,x_n \} \) is said to be a strongly orthogonal set relative to an element \( x_{i_0}\) contained in S in the sense of Birkhoff–James iff

$$\begin{aligned} \Vert x_{i_0} \Vert < \left\| x_{i_0} + \sum _{j=1,j\ne i_0}^{n} \lambda _j x_j\right\| \end{aligned}$$

whenever not all \( \lambda _j\)’s are 0.

Strongly Orthogonal Set: A finite set of elements \( \{x_1,x_2, \ldots , x_n \} \) is said to be a strongly orthogonal set in the sense of Birkhoff–James iff for each \( i \in \{ 1,2, \ldots , n \} \)

$$\begin{aligned} \Vert x_{i} \Vert < \left\| x_i + \sum _{j=1,j\ne i}^{n} \lambda _j x_j \right\| \end{aligned}$$

whenever not all \( \lambda _j\)’s are 0.

We relate the notion of strongly orthonormal Hamel basis in the sense of Birkhoff–James with the notions of best approximation and best coapproximation in a finite dimensional real normed linear space. In a normed linear space X, for an element \( x_0 \in X \) and a linear subspace G, an element \( g_{0} \in G \) is said to be the best approximation to \( x_{0} \) out of \( G \) iff \( \Vert x_{0}-g_{0}\Vert \le \Vert x_{0}-g\Vert ~ \text{ for } \text{ all }~ g \in G\) and an element \(w_0 \in G\) is said to be the best coapproximation to \( x_{0} \) out of \( G \) iff \( \Vert w_0-g\Vert \le \Vert x_{0}-g\Vert ~ \text{ for } \text{ all }~ g \in G. \) The concept of best approximation (best coapproximation) in a finite dimensional normed linear space has been generalized from the fact that in Euclidean space the length of the hypotenuse of a right angled triangle is always greater than the length of the perpendicular (base). The notions of best approximation and best coapproximation have been studied by Singer [21], Franchetti and Furi [6], Papini [12], Papini and Singer [13] and Narang [11] and many others.

As a consequence of this natural connection, we also prove that the existence of best coapproximation to an element of the normed linear space out of a given subspace and its coincidence with the best approximation to that element out of that subspace carries a very special geometrical significance and characterises a real inner product space of dimension \( \ge \)3. We prove that the normed linear space \((R^n,\Vert ~\Vert _p)\) is an inner product space i.e., \( p = 2 \) iff given any element on the unit sphere of the space there exists a strongly orthonormal Hamel basis in the sense of Birkhoff–James containing that element. Motivated by this we conjecture that a finite dimensional (\(\ge \)3) real smooth normed linear space is an inner product space iff given any element on the unit sphere of the space there exists a strongly orthonormal Hamel basis in the sense of Birkhoff–James containing that element.

Strongly Orthonormal Hamel Basis and Best Approximation & Best Coapproximation

We begin with a simple remark which hints at a connection between strongly orthonormal Hamel basis relative to an element in the sense of Birkhoff–James and the notion of best coapproximation in normed linear spaces.

Remark 2.1

Suppose X is a strictly convex normed linear space and \( \{ x_1,x_2, \ldots , x_n\}\) is a strongly orthonormal set in the sense of Birkhoff–James. Let \(x_{n+1} \in S_X - span~\{x_1,x_2, \ldots , x_n\}.\) If \(w \in span~\{ x_1,x_2, \ldots , x_n\} \) is the best coapproximation to \(x_{n+1}\) out of \( span ~\{ x_1,x_2, \ldots , x_n\} \) then \( \{ x_1,x_2, \ldots , x_n, \frac{x_{n+1} - w}{\Vert x_{n+1} - w\Vert }\}\) is strongly orthonormal relative to \( x_i,\) in the sense of Birkhoff–James for each \( i = 1,2, \ldots , n.\)

We are interested in exploring whether the converse to the above remark holds true. In case of a finite dimensional strictly convex normed linear space, we have the following result in this direction.

Theorem 2.2

Suppose X is a strictly convex normed linear space and \( S= \{ x_1,x_2, \ldots , x_n\} \) is a strongly orthonormal set in the sense of Birkhoff–James. Let \(x_{n+1} \in S_X - span~\{x_1,x_2, \ldots , x_n\}.\) Let \(w \in span~\{ x_1,x_2, \ldots , x_n\} \) be such that \( \{ x_1,x_2, \ldots , x_n, \frac{x_{n+1} - w}{\Vert x_{n+1} - w\Vert }\}\) is strongly orthonormal relative to \( x_i,\) in the sense of Birkhoff–James for each \( i \in \{ 1,2, \ldots , n\}. \) Then there exists a constant \( k \in (0,1) \) such that \( \Vert x_{n+1} - y \Vert \ge k \Vert w - y \Vert \) for all \( y \in span~\{x_1,x_2, \ldots , x_n\}.\)

Proof

Let us define a new norm \( \Vert \Vert _{S} \) on \( span~ S \) as follows:

\(\Vert \alpha _1 x_1 +\alpha _2 x_2+ \ldots + \alpha _n x_n\Vert _{S}=max\{|\alpha _{i}| : 1 \le i \le n\}, \) for any scalars \( \alpha _1,\alpha _2,\ldots ,\alpha _n. \)

(It is easy to see that \( \Vert \Vert _{S} \) is indeed a norm on \( span~ S \) by comparing it with the standard norm \( \Vert \Vert _{\infty } \) on \( R^{n} \)).

Since any two norms on a finite dimensional normed linear space are equivalent, there exists a constant \( k > 0 \) such that \( \Vert u\Vert _{S} \ge k \Vert u\Vert ~\forall ~ u \in span~S. \) Moreover, since \( \{x_1,x_2,\ldots ,x_n\} \) is a strongly orthonormal set in the sense of Birkhoff–James, it follows that \( \Vert u\Vert _S \le \Vert u\Vert ~\forall ~u \in span~S. \) Let \( w = \mu _{1}x_{1} + \mu _{2}x_{2}+ \ldots +\mu _{n}x_{n}, \) where \( \mu _1,\mu _2,\ldots ,\mu _n \) are scalars.

Then for any \( y = \lambda _1x_1 + \lambda _2x_2\ldots + \lambda _nx_n, \) where \( \lambda _1,\lambda _2,\ldots ,\lambda _n \) are scalars, we have

$$\begin{aligned} \Vert x_{n+1} - y \Vert&= \Vert x_{n+1} - \lambda _1x_1 - \ldots - \lambda _nx_n \Vert \\&= \left\| ~~\Vert x_{n+1} - w \Vert \frac{x_{n+1} - w}{\Vert x_{n+1} - w \Vert }-(\lambda _1 - \mu _1)x_1 - \ldots - (\lambda _n - \mu _n) x_n~~\right\| \\&\ge max\{|\lambda _i - \mu _i| : 1 \le i \le n\} ~~ \text{ for } \text{ each } i \in \{1,2,\ldots , n\} \\&= \Vert w - y\Vert _{S} \\&\ge k\Vert w-y.\Vert \end{aligned}$$

Since \( S= \{ x_1,x_2, \ldots , x_n\} \) is a strongly orthonormal set in the sense of Birkhoff–James and \( span~S \) is strictly convex, it follows that \( k \in (0,1). \)

This completes the proof of the theorem. \(\square \)

If we could have proved \(k = 1,\) then \(w\) is the best coapproximation to \( x_{n+1} \) out of \( span\{x_1,x_2,\ldots ,x_n\},\) but in general \(w\) may not be the best coapproximation. We next give an example to illustrate the situation.

Example 2.3

Consider the normed linear space \( (R^3,\Vert .\Vert _{3})\) and a strongly orthonormal set in the sense of Birkhoff–James \( S = \{2^{-\frac{1}{3}}(1,1,0), {4}^{-\frac{1}{3}}(1,-1,-2^{\frac{1}{3}}) \}. \) Let \( (1,-1, 2^{\frac{1}{3}}) \in S_X ~- ~span~S .\) Then \( \theta \in ~ span ~\{2^{-\frac{1}{3}}(1,1,0), {4}^{-\frac{1}{3}}(1,-1,-2^{\frac{1}{3}}), (1,-1, 2^{\frac{1}{3}})\} \) is such that \( \{ 2^{-\frac{1}{3}}(1,1,0), {4}^{-\frac{1}{3}}(1,-1, 2^{\frac{1}{3}}), {4}^{-\frac{1}{3}}(1,-1,-2^{\frac{1}{3}}) \} \) is a strongly orthonormal set in the sense of Birkhoff–James but \( \theta \) is not the best coapproximation to \( (1,-1, 2^{\frac{1}{3}})\) out of \( span~ \{2^{-\frac{1}{3}}(1,1,0), {4}^{-\frac{1}{3}}(1,-1,-2^{\frac{1}{3}}) \}. \)

In the next theorem we give a necessary and sufficient condition for a smooth and strictly convex normed linear space of dimension s\(~\ge ~\)3 to be an inner product space.

Theorem 2.4

Let \( X \) be a smooth and strictly convex normed linear space with \( dim X \ge 3. \) Then \( X \) is an inner product space iff the best coapproximation to any element of \( X \) out of any one dimensional subspace of \( X \) is equal to the best approximation to that element out of that subspace.

Proof

The necessary part of the theorem follows easily. For the sufficient part we show that \( x \bot _{B} y \Rightarrow y \bot _{B} x ~ \forall x,y \in S_{X}\). We assume that \( x,y \in S_{X} \) and \( x \bot _{B} y. \)

Since \( X \) is strictly convex, it follows that \( \{x,y\} \) is strongly orthonormal relative to \( x \) in the sense of Birkhoff–James. Let \( w = \lambda _0 x \) be the best coapproximation and best approximation to \( y \) out of \( span\{x\}. \) From the definition of best coapproximation and best approximation it follows that \( \{x,\frac{y-w}{\Vert y-w\Vert }\} \) is a strongly orthonormal set in the sense of Birkhoff–James. Since X is smooth and \( span~\{x,y\} = span~\{x,\frac{y-w}{\Vert y-w\Vert } \},\) so we get \( y = \lambda \frac{y-w}{\Vert y-w\Vert } \) for some \( \lambda \in S_{K}.\) As \( \{x,\frac{y-w}{\Vert y-w\Vert }\} \) is a strongly orthonormal set in the sense of Birkhoff–James so \( \frac{y-w}{\Vert y-w\Vert } \bot _{B} x \) and hence \( y \bot _{B} x.\) Thus it follows from [8, Th.1] that X is an inner product space .

This completes the proof of the theorem. \(\square \)

We next characterize the normed linear spaces \((R^n,\Vert ~\Vert _p)\) in terms of existence of strongly orthonormal Hamel basis in the sense of Birkhoff–James at each point of the unit sphere of the space. First we need the following lemma to proceed in the desired direction.

Lemma 2.5

In \((R^2,\Vert ~\Vert _p), p \ne 2,\) there exists an element \((x,y)\) on the unit sphere such that there does not exist a strongly orthonormal Hamel basis in the sense of Birkhoff–James containing that element. In other words there exists an element \((x,y)\) on the unit sphere such that \((x,y) \bot _B (u,v) \) and \((u,v) \bot _B (x,y) \) implies \( u = v = 0.\)

Proof

Consider the element

$$\begin{aligned} x_0 = \frac{1}{(1 + k^p)^{1/p}},\quad y_0 = \frac{k}{(1 + k^p)^{1/p}} \end{aligned}$$

on the unit sphere with \( k > 1.\) If \((x_0,y_0)\) is strongly orthogonal to \((x_1,y_1)\) in the sense of Birkhoff–James then we have

$$\begin{aligned} x_1 = \pm \frac{-k^{p-1}}{ (1 + k^{p(p-1)})^{1/p}},\quad y_1 = \pm \frac{1}{ (1 + k^{p(p-1)})^{1/p}}. \end{aligned}$$

It is easy to verify that \((x_1,y_1)\) is not strongly orthonormal to \((x_0,y_0)\) in the sense of Birkhoff–James and thus there does not exist a strongly orthonormal Hamel basis in the sense of Birkhoff–James containing \((x_0,y_0)\).

This completes the proof of the lemma. \(\square \)

We now prove the following theorem to prove the speciality of \((R^n,\Vert ~\Vert _2)\) among \((R^n,\Vert ~\Vert _p)\) spaces, in terms of the existence of strongly orthonormal Hamel basis in the sense of Birkhoff–James at each point of the unit sphere of the space.

Theorem 2.6

The normed linear space \((R^n,\Vert ~\Vert _p)\) is an inner product space i.e., \( p = 2 \) iff given any element on the unit sphere of the space there exists a strongly orthonormal Hamel basis in the sense of Birkhoff–James containing that element.

Proof

The necessary part of the theorem is obvious. For \( p = 1, \infty , \) the theorem follows easily as not every element on the unit sphere is an extreme point of the unit sphere and for a strongly orthonormal Hamel basis in the sense of Birkhoff–James, each element of the basis must be an extreme point of the unit sphere.

For the sufficient part we show that if \( p \ne 2,\) then there exists an element on the unit sphere such that there does not exist a strongly orthonormal Hamel basis in the sense of Birkhoff–James containing that element.

By the preceding lemma, there exists an element \((x,y)\) on the unit sphere of \((R^2,\Vert ~\Vert _p)\) such that \((x,y) \bot _B (u,v) \) and \((u,v) \bot _B (x,y) \) implies \( u = v = 0.\) Consider the element \( w_1 = (x,y, 0,0 \ldots ,0) \) on the unit sphere of \((R^n,\Vert ~\Vert _p).\) Let \( w_1 \bot _B w_2 = (u_1,u_2,u_3, \ldots , u_n).\)

Clearly \( w_1 \bot _B (0,0,u_3, \ldots , u_n).\) As \((R^n,\Vert ~\Vert _p)\) is smooth and Birkhoff–James orthogonality is right additive in a smooth space, so we get \( w_1 \bot _B (u_1,u_2,0, \ldots , 0).\) This shows that \((x,y) \bot _B (u_1,u_2) \) in \((R^2,\Vert ~\Vert _p).\)

Again if \( (u_1,u_2,u_3, \ldots , u_n) \bot _B (x,y, 0,0 \ldots ,0)\) then \( (u_1,u_2) \bot _B (x,y)\) and so by the lemma \( u_1 = u_2 = 0.\)

Thus \((x,y, 0,0 \ldots ,0) \bot _B (u_1,u_2,u_3, \ldots , u_n)\) and \( (u_1,u_2,u_3, \ldots , u_n) \bot _B (x,y, 0,0 \ldots ,\) \(0)\) in \((R^n,\Vert ~\Vert _p)\) implies that \( u_1 = u_2 = 0.\)

This proves that it is impossible to find \((n-1)\) linearly independent elements \(\{ w_2,w_3, \ldots , w_{n} \} \) such that \(\{ w_1,w_2,w_3, \ldots , w_{n} \} \) is a strongly orthonormal Hamel basis in the sense of Birkhoff–James.

This completes the proof of the theorem. \(\square \)

In the next theorem we show that it is possible to characterize finite dimensional real inner product spaces among finite dimensional real smooth normed linear spaces in terms of the existence of a strongly orthonormal Hamel basis in the sense of Birkhoff–James at each point of the unit sphere of the normed linear space and the notion of best coapproximation in normed linear spaces.

Theorem 2.7

A finite dimensional real smooth space \( X \) of dimension \( \ge 3\) is an inner product space iff

  1. (i)

    given any element \(e_i\) on the unit sphere of the space there exists a strongly orthonormal Hamel basis \( \{ e_1,e_2, \ldots ,e_i,\dots , e_n\} \) in the sense of Birkhoff–James.

  2. (ii)

    \( \theta \) is the best coapproximation to \(e_i\) out of \( span~ \,\{e_1,e_2, \ldots , e_{i-1}, e_{i+1}, \ldots ,e_n\}\) for each \( i \in \{1,2,\ldots , n\}.\)

Proof

The necessary part of the theorem is obvious. For the converse part, let \( x_1, y \in S_X \) and \( x_1 \bot _B y.\) Then there exists a strongly orthonormal Hamel basis \( \{ x_1,x_2, \ldots , x_n\} \) in the sense of Birkhoff–James containing \(x_1.\) Let \( y = \alpha _1 x_1 + \alpha _2 x_2 + \ldots + \alpha _n x_n, \) for some scalars \( \alpha _i\)’s. We claim that \( \alpha _1 = 0.\)

As X is smooth so from \( x_1 \bot _B y \) and \( x_1 \bot _B (\alpha _2 x_2 + \ldots + \alpha _n x_n)\) we get \( x_1 \bot _B \alpha _1 x_1 \) which is not possible unless \( \alpha _1 = 0\) and so \( y = \alpha _2 x_2 + \alpha _3 x_3 + \ldots + \alpha _n x_n.\)

Now by the hypothesis \( \theta \) is the best coapproximation to \(x_1\) out of \( span~ \{x_2,x_3, \ldots , x_n\}\) and so

$$\begin{aligned} \left\| \frac{\alpha _2}{\lambda } x_2 + \frac{\alpha _3}{\lambda } x_3 + \ldots + \frac{\alpha _n}{\lambda } x_n + x_1 \right\| > \left\| \frac{\alpha _2}{\lambda } x_2 + \frac{\alpha _3}{\lambda } x_3 + \ldots + \frac{\alpha _n}{\lambda } x_n \right\| ~~ \forall ~ \lambda \ne 0. \end{aligned}$$

This shows that \( \Vert y + \lambda x_1 \Vert > \Vert y \Vert ~~ \forall \lambda \ne 0\) and so \( y \bot _B x_1.\)

Thus the notion of Birkhoff–James orthogonality is symmetric and so from [8, Th.1] it follows that the space X is an inner product space.

This completes the proof of the theorem. \(\square \)

Remark 2.8

The question that arises naturally is that whether (i) implies (ii) i.e., whether the existence of strongly orthonormal Hamel basis at each point \(e_i\) of the unit sphere in the sense of Birkhoff–James implies that \( \theta \) is the best coapproximation to \(e_i\) out of the subspace spanned by the remaining elements of that basis. The following two examples show that the result is locally not true i.e. if \( \{ e_1,e_2, \ldots , e_n\} \) is a strongly orthonormal Hamel basis at \( \{e_i\} \) in the sense of Birkhoff–James in a finite dimensional real normed linear space (strictly convex smooth space) then \( \theta \) may not be the best coapproximation to \(e_i\) out of the subspace spanned by the remaining elements of the basis.

Example 2.9

Consider the Hamel basis

$$\begin{aligned} S = \{ (1,1,1,1), (1,1,-1,-1), (1,-1,-1,1),(1,-1,1,-1) \} \end{aligned}$$

in the normed linear space \( (R^4,\Vert .\Vert _{\infty }).\) Then S is a strongly orthonormal Hamel basis in the sense of Birkhoff–James but \( \theta \) is not the best coapproximation to \( (1,-1,-1,1)\) out of \( span~ \{ (1,1,1,1), (1,1,-1,-1), (1,-1,1,-1) \}. \)

Example 2.10

Consider the Hamel basis

$$\begin{aligned} S = \left\{ 2^{-\frac{1}{3}}(1,1,0), {4}^{-\frac{1}{3}}(1,-1, 2^{\frac{1}{3}}), {4}^{-\frac{1}{3}}(1,-1,-2^{\frac{1}{3}}) \right\} \end{aligned}$$

in the normed linear space \( (R^3,\Vert .\Vert _{3}).\) Then S is a strongly orthonormal Hamel basis in the sense of Birkhoff–James but \( \theta \) is not the best coapproximation to \( {4}^{-\frac{1}{3}}(1,-1, 2^{\frac{1}{3}})\) out of \( span~ \{2^{-\frac{1}{3}}(1,1,0), {4}^{-\frac{1}{3}}(1,-1,-2^{\frac{1}{3}}) \}. \)

We conclude with the following conjecture:

Conjecture 2.11

A finite dimensional (\(\ge \)3) real smooth normed linear space is an inner product space iff given any element on the unit sphere of the space there exists a strongly orthonormal Hamel basis in the sense of Birkhoff–James containing that element.

Brief Conclusion

We have established the connection between strongly orthonormal Hamel basis in the sense of Birkhoff–James and the existence of best coapproximation in normed linear spaces in Remark 2.1 and Theorem 2.2. Theorem 2.4 gives a characterization of inner product spaces in terms of equality of best approximation and best coapproximation on one dimensional subspaces of the space. In Theorem 2.5 we prove the speciality of \((R^n,\Vert ~\Vert _2)\) among \((R^n,\Vert ~\Vert _p)\) spaces, in terms of the existence of strongly orthonormal Hamel basis in the sense of Birkhoff–James at each point of the unit sphere of the space. We conjecture that the existence of strongly orthonormal Hamel basis in the sense of Birkhoff–James at each point of the unit sphere of the space characterizes finite dimensional real inner product spaces among all finite dimensional real smooth normed linear spaces.