Generalizations of the functional equation of the mean sun

HARALD FRIPERTINGER*
LUDWIG REICH

^*Supported by the Fonds zur Förderung der wissenschaftlichen Forschung P14342-MAT.

Abstract

Two generalizations U(χ + μ)y(ν) = U(χ)y(μ + ν) (χ, μ, ν ∈ A) of the functional equation of the mean sun are studied, where (A, +) is an Abelian group, K is a field, n is a positive integer, and both y: A --> Kⁿ and U: A --> GL(n, K) (or U: A --> M_n(K) in the second case) are unknown functions, which will be determined by the equation.

1 Introduction

Local solar time is measured by a sundial. When the center of the sun is on an observer’s meridian, the observer’s local solar time is zero hours (noon). Because the earth moves with varying speed in its orbit at different times of the year and because the plane of the earth’s equator is inclined to its orbital plane, the length of the solar day is different depending on the time of year. It is more convenient to define time in terms of the average of local solar time. Such time, called mean solar time, may be thought of as being measured relative to an imaginary sun (the mean sun) that lies in the earth’s equatorial plane and about which the earth orbits with constant speed. Every mean solar day is of the same length.¹

In [1, 4] it is shown that the mean sun satisfies the functional equation

$$M(\chi + t, \phi)^T y(s) = M(\chi, \phi)^T y(s + t), \qquad \forall s, t, \chi \in \mathbb{R},\ -\pi/2 < \phi < \pi/2, \tag{1}$$

where y(s) is a vector of length 1, which is the direction from the center of the earth to the sun at the time s (one day corresponds to 2π) expressed in a geocentric coordinate system. As a basis of this system we can choose two orthogonal vectors in the equatorial plane and one vector along the axis of the earth. M(χ, φ) is the matrix

$$M(\chi, \phi) = \begin{pmatrix} -\sin\chi & -\sin\phi\cos\chi & \cos\phi\cos\chi \\ \cos\chi & -\sin\phi\sin\chi & \cos\phi\sin\chi \\ 0 & \cos\phi & \sin\phi \end{pmatrix}.$$

Then M(χ, φ)^T y(s) is the direction from the earth to the sun expressed in a local coordinate system on the surface of the earth at the point of longitude χ and latitude φ.
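Equation (1) can be checked numerically for a concrete candidate. The following Python sketch is only an illustration: the function names and the particular y (a unit vector of fixed declination `decl` that rotates uniformly about the axis of the earth) are assumptions made for this example and are not taken from [1, 4].

```python
import math

def M(chi, phi):
    """Matrix M(chi, phi) for longitude chi and latitude phi."""
    sc, cc = math.sin(chi), math.cos(chi)
    sp, cp = math.sin(phi), math.cos(phi)
    return [[-sc, -sp * cc, cp * cc],
            [cc, -sp * sc, cp * sc],
            [0.0, cp, sp]]

def transpose_times(m, v):
    """Return m^T v for a 3x3 matrix m and a 3-vector v."""
    return [sum(m[i][j] * v[i] for i in range(3)) for j in range(3)]

def y(s, decl=0.3):
    """Hypothetical candidate: unit vector rotating uniformly about the third axis."""
    return [math.cos(decl) * math.cos(s),
            -math.cos(decl) * math.sin(s),
            math.sin(decl)]

def residual(chi, phi, s, t):
    """Largest componentwise difference between the two sides of (1)."""
    lhs = transpose_times(M(chi + t, phi), y(s))
    rhs = transpose_times(M(chi, phi), y(s + t))
    return max(abs(a - b) for a, b in zip(lhs, rhs))

# Worst residual over a small grid of sample arguments (assumed test points).
worst = max(residual(chi, 0.8, s, t)
            for chi in (0.0, 1.1, 2.5)
            for s in (0.0, 0.7)
            for t in (0.4, 3.0))
```

For this candidate the residual stays at the level of floating-point rounding, which is what one expects because shifting χ by t rotates the columns of M about the third axis by the angle t.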

In the present paper we investigate generalizations of equation (1) for fixed φ. To be more precise, first we will solve the functional equation

$$U(\chi + \mu) y(\nu) = U(\chi) y(\mu + \nu), \qquad \forall \chi, \mu, \nu \in A, \tag{2}$$

where (A, +) is an Abelian group, K is a field, n is a positive integer, and both y: A --> Kⁿ and U: A --> GL(n, K) are unknown functions, which will be determined by (2). In some situations we will additionally have to assume that A = K. Later on we will study the more general situation when we replace GL(n, K) by M_n(K), the set of all n × n matrices over K. The following types of questions can be asked in connection with (2):

Determine all solutions (U, y) of (2).
For given U determine all y, such that (U, y) is a solution of (2).
For given y determine all U, such that (U, y) is a solution of (2).
Find relations between U and y for a solution (U, y) of (2).

We will mainly deal with problems of the second and third kind.

In Theorem 6 we describe in an appropriate system of coordinates the structure of the space S_U of all solutions of (2) for a given U: A --> GL(n, K). We also state in this theorem how such a mapping U necessarily looks if a nontrivial solution y (i.e. y ≠ 0) exists. A similar description of U-invariant subspaces of S_U is given in Theorem 8. We emphasize that by our result (and similarly by the following theorems) the problem of solving (2) can be reduced, at least to some extent, to the problem of finding all exponential functions U₁₁: A --> GL(k, K) (cf. the representation of U in Theorem 6), i.e. nonsingular matrices U₁₁(χ) satisfying the equation

$$U_{11}(\chi + \mu) = U_{11}(\chi) U_{11}(\mu), \qquad \forall \chi, \mu \in A.$$

Here we assume that these functions are known and we refer the reader to [3].
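To make the notion of an exponential function concrete, here is a small Python sketch. The group A = Z, the field K = Q, the two sample functions `U11` and `V`, and the finite test window are all assumptions chosen for illustration; they do not come from [3].

```python
def mat_mul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def U11(chi):
    """Unipotent matrix; exponential on A = Z because its entry chi is additive."""
    return [[1, chi], [0, 1]]

def V(chi):
    """Same shape, but the entry chi**2 is not additive, so V is not exponential."""
    return [[1, chi * chi], [0, 1]]

def is_exponential(f, window=range(-5, 6)):
    """Check f(chi + mu) == f(chi) f(mu) on a finite window of A = Z."""
    return all(f(c + m) == mat_mul(f(c), f(m)) for c in window for m in window)

u11_ok = is_exponential(U11)   # True
v_ok = is_exponential(V)       # False, since V(1 + 1) != V(1) V(1)
```

A check on a finite window can only refute the exponential property; for `U11` the identity holds for all integers because the upper right entries simply add.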

In Theorem 9 we construct, for a given subspace S⁰ of Kⁿ, the set of all mappings U and correspondingly the space S of all functions y, such that (U, y) satisfies (2) and S⁰ is exactly the set of all initial values y(0) for y ∈ S. It is clear that, together with Theorem 6, this yields an implicit description of the set of all solutions (U, y) of (2) by varying the subspace S⁰ of Kⁿ. However, the space S and the mapping U obtained in this way from S⁰ may have the property that S is a proper subset of S_U. Therefore we also deal with the problem of characterizing the situation when S = S_U.

From a mathematical point of view it also seems interesting to study the functional equation (2) for mappings U: A --> M_n(K). This situation is more complicated both with respect to the technical details and the construction (description) of the solutions U or y or (U, y). In Theorem 20 we start from a given mapping U: A --> M_n(K) and describe completely in appropriate coordinates the set of all functions y: A --> Kⁿ, such that (U, y) is a solution of (2). Again this theorem provides necessary conditions on U for the existence of nontrivial solutions y of (2). We also show in Theorem 21 how to construct all mappings U: A --> M_n(K) and corresponding spaces S of functions y: A --> Kⁿ, such that {y(0) | y ∈ S} is a given subspace S⁰ of Kⁿ and (U, y) is a solution of (2), hence giving an implicit description of the general solution of (2) by varying S⁰. However, we were not able to contribute to the problem when S = S_U.

The main difficulties in this last part seem to arise from the fact that there can occur solutions y (to a given U) with y(0) = 0 but y ≠ 0 (cf. Lemma 18).

2 Regular matrices U(χ)

In this part we always assume that U is a mapping from the Abelian group A to GL(n, K).

Lemma 1. Let B, C be matrices in GL(n, K). Then (U, y) is a solution of (2) if and only if (V, By) is a solution of (2), where V(χ) = CU(χ)B^-1.

Proof. The pair (U, y) is a solution of (2) if and only if U(χ + μ)y(ν) = U(χ)y(μ + ν) for all χ, μ, ν ∈ A. Since B and C are regular matrices, this is equivalent to CU(χ + μ)B^-1By(ν) = CU(χ)B^-1By(μ + ν) for all χ, μ, ν ∈ A.

For C = U(0)^-1 we get CU(0) = I_n, the identity matrix. Hence, without loss of generality we will always assume that U(0) = I_n.

Lemma 2. If (U, y) is a solution of (2), then

$$y(\mu) = U(\mu) y(0), \qquad \forall \mu \in A, \tag{3}$$

$$[U(\chi + \mu) - U(\chi) U(\mu)]\, y(0) = 0, \qquad \forall \chi, \mu \in A. \tag{4}$$

Proof. Since U(0) = I_n, we get (3) from (2) for χ = ν = 0. And we get (4) from (2) and (3) for ν = 0.

It is also possible to reverse the statement of Lemma 2.

Lemma 3. Assume U(0) = In and let y be given by (3). If (U, y) satisfies (4), then (U, y) is a solution of (2).

Proof. By (3) and (4) we get U(χ + μ)y(ν) = U(χ + μ)U(ν)y(0) = U(χ + μ + ν)y(0) = U(χ)U(μ + ν)y(0) = U(χ)y(μ + ν) for all χ, μ, ν ∈ A.

For any mapping U: A --> GL(n, K) let

$$S_U := \{\, y \mid (U, y) \text{ is a solution of (2)} \,\} \quad\text{and}\quad S_U^0 := \{\, y(0) \mid y \in S_U \,\}.$$

Some basic properties of these two sets are collected in the following

Lemma 4. Both S_U and S_U⁰ are K-linear spaces and Φ: S_U⁰ --> S_U, given by Φ(y⁰) := U(·)y⁰, is a vector space isomorphism.

Proof. It is clear that S_U and S_U⁰ are linear spaces. Assume that y⁰ ∈ S_U⁰, then there is some y ∈ S_U, such that y(0) = y⁰. Since (U, y) satisfies (3), the function Φ is well defined. It is surjective, since for any y ∈ S_U we have Φ(y(0)) = U(·)y(0) = y(·) according to (3). The mapping Φ is also injective, since from Φ(y₁⁰) = Φ(y₂⁰) we derive U(χ)y₁⁰ = U(χ)y₂⁰, which implies for χ = 0 (and U(0) = I_n) that y₁⁰ = y₂⁰. Finally we have to prove that Φ is a linear mapping. Let y₁⁰, y₂⁰ ∈ S_U⁰ and let α₁, α₂ ∈ K, then α₁y₁⁰ + α₂y₂⁰ ∈ S_U⁰ and Φ(α₁y₁⁰ + α₂y₂⁰) = U(·)(α₁y₁⁰ + α₂y₂⁰) = α₁U(·)y₁⁰ + α₂U(·)y₂⁰ = α₁Φ(y₁⁰) + α₂Φ(y₂⁰).

In conclusion, both S_U and S_U⁰ are m-dimensional linear spaces for some 0 ≤ m ≤ n.

There are some more interesting properties of S_U and S_U⁰.

Lemma 5. Let U: A --> GL(n, K). Then:

1. S_U is U(χ₀)-invariant for all χ₀ ∈ A (i.e. if y ∈ S_U, then also U(χ₀)y ∈ S_U).
2. S_U is invariant under translations (i.e. if y ∈ S_U, then also y(· + χ₀) ∈ S_U for all χ₀ ∈ A).
3. S_U⁰ is U(χ₀)-invariant for all χ₀ ∈ A.
4. S_U⁰ = {y(χ) | χ ∈ A, y ∈ S_U}.

Proof.

1. Assume that z(χ) := U(χ₀)y(χ). Then (U, z) satisfies (2), since U(χ + μ)z(ν) = U(χ + μ)U(χ₀)y(ν) = U(χ + μ)U(χ₀)U(ν)y(0) = U(χ + μ)U(χ₀ + ν)y(0) = U(χ + μ)y(χ₀ + ν) = U(χ)y(μ + χ₀ + ν) = U(χ)U(0)y(χ₀ + μ + ν) = U(χ)U(χ₀)y(μ + ν) = U(χ)z(μ + ν) by (3), (4), (3), (2), the special form of U(0) and (2).
2. Let z(χ) := y(χ + χ₀), then (U, z) satisfies (2), since U(χ + μ)z(ν) = U(χ + μ)y(ν + χ₀) = U(χ)y(μ + ν + χ₀) = U(χ)z(μ + ν) by (2).
3. If y⁰ ∈ S_U⁰, there exists some y ∈ S_U, such that y⁰ = y(0). From the first item of this lemma we know that U(χ₀)y ∈ S_U, hence U(χ₀)y(0) = U(χ₀)y⁰ ∈ S_U⁰.
4. According to the definition of S_U⁰ we know that S_U⁰ ⊆ {y(χ) | χ ∈ A, y ∈ S_U}. Let y ∈ S_U and assume that χ₀ ∈ A \ {0}, then it follows from the second item of this lemma that z(·) := y(· + χ₀) ∈ S_U and y(χ₀) = z(0) ∈ S_U⁰.

If {b₁, ..., b_m} denotes a basis of S_U⁰, then there exists a matrix B ∈ GL(n, K), such that Bb_i = e_i, the i-th unit vector in Kⁿ. Applying this matrix B as a coordinate transformation on Kⁿ as in Lemma 1 we get that S⁰ of the transformed mapping UB^-1 is <e₁, ..., e_m>, the m-dimensional linear space generated by the first m unit vectors in Kⁿ. Thus without loss of generality we may assume that S_U⁰ = <e₁, ..., e_m>.

Theorem 6. Let U: A --> GL(n, K) be a mapping, such that S_U⁰ = <e₁, ..., e_m> and U(0) = I_n. Then U(χ) can be partitioned as a block matrix of the form

$$U(\chi) = \begin{pmatrix} U_{11}(\chi) & U_{12}(\chi) \\ (0)_{n-m,m} & U_{22}(\chi) \end{pmatrix},$$

where U₁₁(χ) ∈ GL(m, K), U₂₂(χ) ∈ GL(n − m, K) and U₁₂(χ) ∈ M_{m,n−m}(K). These matrices satisfy the boundary conditions U₁₁(0) = I_m, U₂₂(0) = I_{n−m} and U₁₂(0) = (0)_{m,n−m}, the zero matrix. Moreover, U₁₁ is an exponential function, i.e. U₁₁(χ + μ) = U₁₁(χ)U₁₁(μ) for all χ, μ ∈ A.

Each y ∈ S_U can be expressed as

$$y(\chi) = \begin{pmatrix} U_{11}(\chi)\bar{y}(0) \\ 0 \end{pmatrix}, \qquad \forall \chi \in A,$$

and ȳ(0) ∈ K^m.

Proof. Let y ∈ S_U, then y(χ) ∈ S_U⁰ = <e₁, ..., e_m>, so

$$y(\chi) = \begin{pmatrix} \bar{y}(\chi) \\ 0 \end{pmatrix}$$

and ȳ(χ) ∈ K^m. We partition U(χ) as a block matrix

$$U(\chi) = \begin{pmatrix} U_{11}(\chi) & U_{12}(\chi) \\ U_{21}(\chi) & U_{22}(\chi) \end{pmatrix}, \tag{5}$$

such that U₁₁(χ) is an m × m-matrix. From (3) we deduce that

$$\begin{aligned} \bar{y}(\chi) &= U_{11}(\chi)\bar{y}(0) + U_{12}(\chi)0 = U_{11}(\chi)\bar{y}(0), \\ 0 &= U_{21}(\chi)\bar{y}(0) + U_{22}(\chi)0 = U_{21}(\chi)\bar{y}(0). \end{aligned}$$

Since ȳ(0) is an arbitrary element of K^m, it is clear that U₂₁(χ) = (0)_{n−m,m} for all χ ∈ A. Because U(χ) is regular, both U₁₁(χ) and U₂₂(χ) are regular matrices as well. From U(0) = I_n the boundary conditions follow. Inserting into (4) the form of U and y just described we get

$$\begin{pmatrix} U_{11}(\chi+\mu) & U_{12}(\chi+\mu) \\ (0)_{n-m,m} & U_{22}(\chi+\mu) \end{pmatrix} \begin{pmatrix} \bar{y}(0) \\ 0 \end{pmatrix} = \begin{pmatrix} U_{11}(\chi) & U_{12}(\chi) \\ (0)_{n-m,m} & U_{22}(\chi) \end{pmatrix} \begin{pmatrix} U_{11}(\mu) & U_{12}(\mu) \\ (0)_{n-m,m} & U_{22}(\mu) \end{pmatrix} \begin{pmatrix} \bar{y}(0) \\ 0 \end{pmatrix},$$

which means that U₁₁(χ + μ)ȳ(0) = U₁₁(χ)U₁₁(μ)ȳ(0) for all ȳ(0) ∈ K^m and χ, μ ∈ A, so that U₁₁ is an exponential function.

We are also interested in subspaces of S_U. First we present a generalization of Lemma 5.

Lemma 7. Let S be a subspace of S_U. Then the following statements are equivalent:

1. S is a U(χ₀)-invariant space for all χ₀ ∈ A.
2. S is invariant under translations.
3. S⁰ is U(χ₀)-invariant for all χ₀ ∈ A, where S⁰ := {y(0) | y ∈ S}.

Proof. In order to prove that 1 implies 2, we set z(χ) := y(χ + χ₀) for arbitrary χ₀ ∈ A. Since S is U(χ₀)-invariant, U(χ₀)y ∈ S. It is enough to prove that z = U(χ₀)y, since then z ∈ S. For χ ∈ A we get z(χ) = y(χ₀ + χ) = U(χ₀ + χ)y(0) = U(χ₀)U(χ)y(0) = U(χ₀)y(χ), so z = U(χ₀)y by (3), (4) and (3).

To each y⁰ ∈ S⁰ there exists y ∈ S, such that y⁰ = y(0). Under the assumption 2, the function z(χ) := y(χ + χ₀) belongs to S for any χ₀ ∈ A. So z(0) ∈ S⁰ and z(0) = y(χ₀) = U(χ₀)y(0) = U(χ₀)y⁰ by (3). Thus we proved that 2 implies 3.

In order to close the cycle of implications take y ∈ S. Then y(0) ∈ S⁰. For arbitrary χ₀ ∈ A also U(χ₀)y(0) belongs to S⁰. Hence, there exists z ∈ S such that z(0) = U(χ₀)y(0). Taking into account that S is a subspace of S_U we can write z as z(χ) = U(χ)z(0) = U(χ)U(χ₀)y(0) = U(χ + χ₀)y(0) = U(χ₀)U(χ)y(0) = U(χ₀)y(χ) by (3), (4), (4) and (3). Thus U(χ₀)y ∈ S.

A generalization of Theorem 6 is

Theorem 8. Let S be a k-dimensional U-invariant subspace of S_U. Then there exist coordinates in Kⁿ, such that S⁰ = <e₁, ..., e_k>, and U(χ) is a block matrix of the form

$$U(\chi) = \begin{pmatrix} U_{11}(\chi) & U_{12}(\chi) \\ (0)_{n-k,k} & U_{22}(\chi) \end{pmatrix}, \tag{6}$$

where U₁₁(χ) ∈ GL(k, K), U₂₂(χ) ∈ GL(n − k, K) and U₁₂(χ) ∈ M_{k,n−k}(K), such that U₁₁(0) = I_k, U₂₂(0) = I_{n−k}, U₁₂(0) = (0)_{k,n−k}. Moreover, U₁₁ is an exponential function, and y ∈ S if and only if

$$y(\chi) = U(\chi) y(0) = \begin{pmatrix} U_{11}(\chi)\theta \\ 0 \end{pmatrix}$$

for θ ∈ K^k.

So far we described solutions (U, y) of (2) when the mapping U was given. Now we will assume that a linear subspace S⁰ of Kⁿ is given and we describe all solutions (U, y) of (2), such that S_U⁰ = S⁰. Let S⁰ be a k-dimensional U-invariant subspace of Kⁿ, then without loss of generality S⁰ = <e₁, ..., e_k>.

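Theorem 9. Let S⁰ = <e₁, ..., e_k> be a subspace of Kⁿ, and let U₁₁(χ) ∈ GL(k, K), U₂₂(χ) ∈ GL(n − k, K) and U₁₂(χ) ∈ M_{k,n−k}(K), such that U₁₁(0) = I_k, U₂₂(0) = I_{n−k}, U₁₂(0) = (0)_{k,n−k}. Moreover U₁₁ is assumed to be an exponential function. Then

$$S := \left\{\, y \;\middle|\; y(\chi) = \begin{pmatrix} U_{11}(\chi)\theta \\ 0 \end{pmatrix},\ \theta \in K^k \,\right\}$$

is a U-invariant subspace of S_U, where U is given by (6).

Proof. For y ∈ S and χ, μ, ν ∈ A we have

$$U(\chi+\mu) y(\nu) = \begin{pmatrix} U_{11}(\chi+\mu) & U_{12}(\chi+\mu) \\ (0)_{n-k,k} & U_{22}(\chi+\mu) \end{pmatrix} \begin{pmatrix} U_{11}(\nu)\theta \\ 0 \end{pmatrix} = \begin{pmatrix} U_{11}(\chi+\mu)U_{11}(\nu)\theta \\ 0 \end{pmatrix} = \begin{pmatrix} U_{11}(\chi+\mu+\nu)\theta \\ 0 \end{pmatrix} = \begin{pmatrix} U_{11}(\chi)U_{11}(\mu+\nu)\theta \\ 0 \end{pmatrix} = U(\chi) y(\mu+\nu).$$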
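
The construction of Theorem 9 can be tried out on a small finite example. The sketch below is an illustration under assumptions that are not part of the paper: A = Z/4Z, K = GF(5), n = 2, k = 1, the exponential function U₁₁(χ) = 2^χ mod 5, and arbitrarily chosen tables for U₁₂ and U₂₂.

```python
P = 5   # K = GF(5)
N = 4   # A = Z/4Z

U12_VALUES = [0, 3, 1, 4]   # arbitrary, except U12(0) = 0
U22_VALUES = [1, 2, 2, 3]   # arbitrary nonzero values, except U22(0) = 1

def U(chi):
    """Block matrix (6) with U11(chi) = 2^chi; 2 has order 4 modulo 5."""
    chi %= N
    return [[pow(2, chi, P), U12_VALUES[chi]], [0, U22_VALUES[chi]]]

def apply(m, v):
    """Matrix-vector product over GF(P)."""
    return [sum(m[i][j] * v[j] for j in range(2)) % P for i in range(2)]

def make_y(theta):
    """Element of S: y(chi) = (U11(chi) * theta, 0)."""
    return lambda chi: [pow(2, chi % N, P) * theta % P, 0]

def solves_eq2(y):
    """Check U(chi + mu) y(nu) == U(chi) y(mu + nu) for all chi, mu, nu in A."""
    return all(apply(U(c + m), y(n)) == apply(U(c), y(m + n))
               for c in range(N) for m in range(N) for n in range(N))

all_in_S_U = all(solves_eq2(make_y(theta)) for theta in range(P))
```

Only the exponential property of U₁₁ enters the check; the tables for U₁₂ and U₂₂ can be changed freely (keeping U₂₂ nonzero) without affecting the result.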

When does S = S_U hold?

Lemma 10. The two spaces S and S_U coincide if and only if for all φ ∈ K^{n−k} \ {0} there exists (χ₀, μ₀) ∈ A², such that

$$\begin{pmatrix} [U_{11}(\chi_0)U_{12}(\mu_0) + U_{12}(\chi_0)U_{22}(\mu_0) - U_{12}(\chi_0+\mu_0)]\varphi \\ [U_{22}(\chi_0)U_{22}(\mu_0) - U_{22}(\chi_0+\mu_0)]\varphi \end{pmatrix} \neq 0. \tag{7}$$

Proof. From Lemma 2 and Lemma 3 we know that S is a subspace of S_U different from S_U if and only if there exists y⁰ ∉ S⁰, such that [U(χ + μ) − U(χ)U(μ)]y⁰ = 0 for all χ, μ ∈ A. In other words, S = S_U if and only if for each y⁰ ∉ S⁰ there exists (χ₀, μ₀) ∈ A², such that [U(χ₀ + μ₀) − U(χ₀)U(μ₀)]y⁰ ≠ 0. When writing y⁰ in the form (θ, φ) for θ ∈ K^k and φ ∈ K^{n−k}, then y⁰ ∈ Kⁿ \ S⁰ if and only if φ ≠ 0. Together with (6) and the fact that U₁₁ is an exponential function we get

$$[U(\chi)U(\mu) - U(\chi+\mu)]\, y^0 = \begin{pmatrix} [U_{11}(\chi)U_{12}(\mu) + U_{12}(\chi)U_{22}(\mu) - U_{12}(\chi+\mu)]\varphi \\ [U_{22}(\chi)U_{22}(\mu) - U_{22}(\chi+\mu)]\varphi \end{pmatrix},$$

which finishes the proof.

Now we are going to present several examples for the situation S = S_U, i.e. by Lemma 10 examples where condition (7) is satisfied. Here we always assume that A = K. First we will deal with the second line of condition (7). Secondly, if this condition is not satisfied by all φ, then let V denote the set

$$\{\, \varphi \in K^{n-k} \mid [U_{22}(\chi)U_{22}(\mu) - U_{22}(\chi+\mu)]\varphi = 0,\ \forall \chi, \mu \in A \,\}.$$

Thus V is an r-dimensional subspace of K^{n−k} for 0 < r ≤ n − k. In order to satisfy the requirements of Lemma 10 in this situation as well, the first line in (7) must be satisfied for all φ ∈ V \ {0}.

Now we describe some examples how to construct U₂₂: K --> GL(s, K) for s ≤ n − k, such that

$$\forall \varphi \in K^s \setminus \{0\}\ \exists \chi_0, \mu_0 \in K : [U_{22}(\chi_0)U_{22}(\mu_0) - U_{22}(\chi_0+\mu_0)]\varphi \neq 0. \tag{8}$$

Case char K ≠ 2: Set U₂₂(χ) = cI_s for all χ ∈ K \ {0} with c ∈ K \ {0, 1}. Then c² ≠ c. For χ₀ = μ₀ = 1 we get [U₂₂(1)U₂₂(1) − U₂₂(1 + 1)]φ = [c²I_s − cI_s]φ = (c² − c)φ ≠ 0 for all φ ≠ 0.

Case char K = 2 and |K| > 2: There exists c ∈ K \ {0}, such that c² ≠ 1. Let U₂₂(χ) = cI_s for all χ ∈ K \ {0}, then for χ₀ = μ₀ = 1 we get [U₂₂(1)U₂₂(1) − U₂₂(1 + 1)]φ = [U₂₂(1)² − U₂₂(0)]φ = [c²I_s − I_s]φ = (c² − 1)φ ≠ 0 for all φ ≠ 0.

Case |K| = 2: If s = 1 each mapping U₂₂: K --> GL(1, K) = {1} is a homomorphism, so (8) cannot be satisfied. If s > 1 there exist matrices M ∈ GL(s, K) of order 2^s − 1. As a permutation of the vectors in K^s the cycle decomposition of M consists of one fixed point, the 0-vector, and a cycle of length 2^s − 1. (Actually, cf. [2] 3.5 Theorem, there are φ(2^s − 1)/s irreducible polynomials of degree s over K = GF(2), where φ here denotes Euler's function, such that the companion matrix of these polynomials is of order 2^s − 1.) If U₂₂(1) = M, then for χ₀ = μ₀ = 1 we get [U₂₂(1)U₂₂(1) − U₂₂(1 + 1)]φ = [M² − I_s]φ ≠ 0 for all φ ≠ 0, since 2 < 2^s − 1.
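
The case |K| = 2 can be made explicit for s = 2. The following sketch assumes the companion matrix of x² + x + 1 as the choice of M (one possible choice, picked here for illustration) and checks both its order and condition (8) for χ₀ = μ₀ = 1.

```python
from itertools import product

M = [[0, 1], [1, 1]]    # companion matrix of x^2 + x + 1 over GF(2) (assumed choice)
I2 = [[1, 0], [0, 1]]

def mat_mul(a, b):
    """Product of 2x2 matrices over GF(2)."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) % 2 for j in range(2)]
            for i in range(2)]

def apply(a, v):
    """Matrix-vector product over GF(2)."""
    return tuple(sum(a[i][j] * v[j] for j in range(2)) % 2 for i in range(2))

M2 = mat_mul(M, M)
M3 = mat_mul(M2, M)
has_order_3 = (M != I2) and (M2 != I2) and (M3 == I2)   # 3 = 2^2 - 1

# With U22(0) = I2 and U22(1) = M we get U22(1)U22(1) - U22(1 + 1) = M^2 - I2.
D = [[(M2[i][j] - I2[i][j]) % 2 for j in range(2)] for i in range(2)]
nonzero_vectors = [v for v in product(range(2), repeat=2) if any(v)]
condition_8_holds = all(any(apply(D, v)) for v in nonzero_vectors)
```

Here M² − I₂ equals M itself, because M² + M + I₂ = 0 over GF(2), so it is regular and maps no nonzero vector to 0.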

Now we describe examples how to construct U₁₂: K --> M_{k,n−k}(K), such that

$$\forall \varphi \in K^{n-k} \setminus \{0\}\ \exists \chi_0, \mu_0 \in K : [U_{11}(\chi_0)U_{12}(\mu_0) + U_{12}(\chi_0)U_{22}(\mu_0) - U_{12}(\chi_0+\mu_0)]\varphi \neq 0. \tag{9}$$

Again we assume that A = K. Furthermore we assume that both U₁₁ and U₂₂ are exponential functions. Hence r = n − k. From the preceding considerations we already know that U₁₁(χ) ∈ GL(k, K), U₂₂(χ) ∈ GL(r, K), U₁₁(0) = I_k, U₂₂(0) = I_r and U₁₂(0) = (0)_{k,r}. Again we describe several different cases:

Case char K ≠ 2:

If k ≥ r, then assume that U₁₂(1) = (0)_{k,r} and

$$U_{12}(2) = \begin{pmatrix} -I_r \\ (0)_{k-r,r} \end{pmatrix},$$

where the upper part is −I_r and the lower part is a 0-matrix of the dimension (k − r) × r. Then for χ₀ = μ₀ = 1 we get that U₁₁(1)U₁₂(1) + U₁₂(1)U₂₂(1) − U₁₂(1 + 1) = −U₁₂(2) and it is obvious that −U₁₂(2)φ ≠ 0 for all φ ∈ K^r \ {0}.

For k < r one possible way to proceed is indicated in

Lemma 11. If there are enough elements in K, to be more precise, if |K| ≥ 2⌈r/k⌉ + 1, then it is always possible to find χ₀ and μ₀ satisfying (9).

Proof. There exist uniquely determined integers q, s, such that r = kq + s and 0 ≤ s < k. If q > 0 assume that χ₁ ∈ K \ {0}, then −χ₁ ∈ K \ {0, χ₁}. Let U₁₂(±χ₁) be given by

$$U_{12}(\chi_1) = \begin{pmatrix} I_k & (0)_{k,r-k} \end{pmatrix} \cdot U_{22}(\chi_1) \quad\text{and}\quad U_{12}(-\chi_1) = U_{11}(-\chi_1) \cdot \begin{pmatrix} I_k & (0)_{k,r-k} \end{pmatrix}.$$

If q > 1 and |K| is big enough, then there exists χ₂ ∈ K \ {0, ±χ₁} and we assume that

$$U_{12}(\chi_2) = \begin{pmatrix} (0)_{k,k} & I_k & (0)_{k,r-2k} \end{pmatrix} \cdot U_{22}(\chi_2) \quad\text{and}\quad U_{12}(-\chi_2) = U_{11}(-\chi_2) \cdot \begin{pmatrix} (0)_{k,k} & I_k & (0)_{k,r-2k} \end{pmatrix}.$$

Going on like this we can find elements χ₁, ..., χ_q ∈ K and matrices U₁₂(±χ_i). If s > 0 and |K| is big enough, then there exists χ_{q+1} ∈ K \ {0, ±χ₁, ..., ±χ_q} and we assume that

$$U_{12}(\chi_{q+1}) = \begin{pmatrix} (0)_{s,qk} & I_s \\ (0)_{k-s,qk} & (0)_{k-s,s} \end{pmatrix} \cdot U_{22}(\chi_{q+1}) \quad\text{and}\quad U_{12}(-\chi_{q+1}) = U_{11}(-\chi_{q+1}) \cdot \begin{pmatrix} (0)_{s,qk} & I_s \\ (0)_{k-s,qk} & (0)_{k-s,s} \end{pmatrix}.$$

When φ ∈ K^r \ {0}, then there exists 1 ≤ i ≤ r, such that φ_i ≠ 0. Hence, there exists j ∈ {1, 2, ..., q + 1}, such that (j − 1)k < i ≤ jk. For χ₀ = χ_j and μ₀ = −χ_j we have U₁₁(χ_j)U₁₂(−χ_j) + U₁₂(χ_j)U₂₂(−χ_j) − U₁₂(0) = 2U₁₂(χ_j)U₂₂(−χ_j). According to the choice of i and j it is clear that (9) is satisfied.

This is a very general result, but it is not the best result which is possible.

Example 12. Let K be the prime field of characteristic 3, k = 1 and r = 2. In this case |K| < 2⌈r/k⌉ + 1, but it is also possible to find U₁₂, such that (9) is satisfied. For instance U given by

$$U(1) = \begin{pmatrix} 1 & 1 & 0 \\ 0 & 1 & 1 \\ 0 & 0 & 1 \end{pmatrix}, \qquad U(2) = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 2 \\ 0 & 0 & 1 \end{pmatrix}$$

satisfies (9).
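
Example 12 is small enough to be verified by exhaustive search. The sketch below assumes U(0) = I₃ and uses Lemmas 2 and 3 to compute S_U⁰ as the set of all y⁰ with [U(χ + μ) − U(χ)U(μ)]y⁰ = 0 for all χ, μ; the helper names are introduced only for this illustration.

```python
from itertools import product

P = 3   # K = A = GF(3)
U = {
    0: [[1, 0, 0], [0, 1, 0], [0, 0, 1]],
    1: [[1, 1, 0], [0, 1, 1], [0, 0, 1]],
    2: [[1, 0, 0], [0, 1, 2], [0, 0, 1]],
}

def mat_mul(a, b):
    """Product of 3x3 matrices over GF(P)."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) % P for j in range(3)]
            for i in range(3)]

def apply(a, v):
    """Matrix-vector product over GF(P)."""
    return tuple(sum(a[i][j] * v[j] for j in range(3)) % P for i in range(3))

def defect(c, m):
    """The matrix U(c + m) - U(c) U(m) from condition (4)."""
    prod = mat_mul(U[c], U[m])
    total = U[(c + m) % P]
    return [[(total[i][j] - prod[i][j]) % P for j in range(3)] for i in range(3)]

# All initial values y0 that satisfy (4) for every pair (c, m).
initial_values = [v for v in product(range(P), repeat=3)
                  if all(not any(apply(defect(c, m), v))
                         for c in range(P) for m in range(P))]
# -> [(0, 0, 0), (1, 0, 0), (2, 0, 0)]
```

The search returns exactly the multiples of e₁, so S_U⁰ = <e₁> and hence S = S_U for this U, in agreement with Lemma 10.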

Case char K = 2 and |K| > 2:

Assume first k ≥ r. There exists χ ∈ K \ {0, 1} and then χ + 1 ∉ {0, 1}. Let furthermore U₁₂(1) = U₁₂(χ) = (0)_{k,r} and

$$U_{12}(\chi + 1) = \begin{pmatrix} I_r \\ (0)_{k-r,r} \end{pmatrix},$$

then for χ₀ = 1 and μ₀ = χ we get U₁₁(1)U₁₂(χ) + U₁₂(1)U₂₂(χ) − U₁₂(χ + 1) = U₁₂(χ + 1) and (9) is satisfied. For k < r the following lemma holds:

Lemma 13. If there are enough elements in K, to be more precise, if |K| ≥ 2⌈r/k⌉ + 2, then it is always possible to find χ₀ and μ₀ satisfying (9).

Proof. There exist uniquely determined integers q, s, such that r = kq + s and 0 ≤ s < k. Assume U₁₂(1) = (0)_{k,r}. For 1 ≤ i ≤ q there exists χ_i ∈ K \ {0, 1, χ_j, χ_j + 1 | j < i}, such that χ_i + 1 ∉ {0, 1, χ_j, χ_j + 1 | j < i}. Let U₁₂(χ_i) and U₁₂(χ_i + 1) be given by

$$U_{12}(\chi_i) = (0)_{k,r} \quad\text{and}\quad U_{12}(\chi_i + 1) = \begin{pmatrix} (0)_{k,(i-1)k} & I_k & (0)_{k,r-ik} \end{pmatrix}.$$

If s > 0 and |K| is big enough, then there exists χ_{q+1} ∈ K, such that χ_{q+1}, χ_{q+1} + 1 ∈ K \ {0, 1, χ_j, χ_j + 1 | j ≤ q} and we assume that

$$U_{12}(\chi_{q+1}) = (0)_{k,r} \quad\text{and}\quad U_{12}(\chi_{q+1} + 1) = \begin{pmatrix} (0)_{s,qk} & I_s \\ (0)_{k-s,qk} & (0)_{k-s,s} \end{pmatrix}.$$

Given φ ∈ K^r \ {0} there exists 1 ≤ i ≤ r, such that φ_i ≠ 0. Hence, there exists j ∈ {1, 2, ..., q + 1}, such that (j − 1)k < i ≤ jk. For χ₀ = χ_j and μ₀ = 1 we have U₁₁(χ_j)U₁₂(1) + U₁₂(χ_j)U₂₂(1) − U₁₂(χ_j + 1) = −U₁₂(χ_j + 1). According to the choice of i and j it is clear that (9) is satisfied.

Case |K| = 2:

In the situation k ≥ r we can only give partial results. If χ₀ = 0 or μ₀ = 0, then it is impossible to satisfy (9). Since U₁₁ and U₂₂ are exponential functions, the orders of U₁₁(1) and U₂₂(1) are divisors of 2. If both U₁₁(1) = I_k and U₂₂(1) = I_r, then it is also impossible to satisfy (9). If r = 1, then U₂₂(1) is the identity matrix I₁. From the previous statements it is clear that necessarily k > 1 and U₁₁(1) must be a matrix of order 2. If U is defined by

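$$U_{11}(1) = \begin{pmatrix} 1 & 0 & \cdots & 0 & 1 \\ 0 & 1 & \ddots & & 0 \\ \vdots & \ddots & \ddots & \ddots & \vdots \\ \vdots & & \ddots & 1 & 0 \\ 0 & \cdots & \cdots & 0 & 1 \end{pmatrix} \quad\text{and}\quad U_{12}(1) = \begin{pmatrix} 0 \\ \vdots \\ 0 \\ 1 \end{pmatrix},$$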

then (9) is satisfied. For r = 2 assume that U₁₁(1) is given as above and

( 1 0 ) ( ) 1 1 0. 0. U22(1) = 0 1 and U12(1) = .. .. , 0 0 0 1

then again (9) is satisfied. For k = r = 3 and for any choice of U₁₁(1), U₂₂(1) ∈ GL(3, K) of order dividing 2 the computer did not find a matrix U₁₂(1) in M₃(K) such that (9) is satisfied. Other cases were not studied so far.

If k < r it is not possible to satisfy (9), since there is only one possible choice χ₀ = μ₀ = 1, which determines exactly one matrix U₁₁(1)U₁₂(1) + U₁₂(1)U₂₂(1). This k × r matrix describes a homomorphism from K^r to K^k, which has a kernel of dimension ≥ r − k > 0.

3 The general situation

In this part we generalize the functional equation (2) by assuming that U(χ) is not necessarily a regular matrix, i.e. U: A --> M_n(K). Lemma 1 also holds in this situation. When we define S_U and S_U⁰ as it was done earlier, then S_U and S_U⁰ are K-linear spaces (cf. Lemma 4). Again S_U⁰ is an m-dimensional subspace of Kⁿ for 0 ≤ m ≤ n, and S_U is invariant under translations, and S_U⁰ = {y(χ) | χ ∈ A, y ∈ S_U} (cf. Lemma 5). Without loss of generality we can assume (as in the earlier case) that there exists a basis of Kⁿ, such that S_U⁰ = <e₁, ..., e_m>.

Since U(0) need not be a regular matrix, we do not get the results of Lemma 2, and in general there is no isomorphism between S_U and S_U⁰.

For χ = ν = 0 or χ = 0 we derive from (2)

Lemma 14. Let (U, y) be a solution of (2), then

$$U(0) y(\mu) = U(\mu) y(0), \qquad \forall \mu \in A, \tag{10}$$

$$U(\chi) y(\mu) = U(0) y(\chi + \mu), \qquad \forall \chi, \mu \in A. \tag{11}$$

If U(χ) is partitioned as in (5) and y(χ) is written as

$$y(\chi) = \begin{pmatrix} \bar{y}(\chi) \\ 0 \end{pmatrix}$$

for ȳ(χ) ∈ K^m, then from (10) we get

$$\begin{pmatrix} U_{11}(0) & U_{12}(0) \\ U_{21}(0) & U_{22}(0) \end{pmatrix} \begin{pmatrix} \bar{y}(\mu) \\ 0 \end{pmatrix} = \begin{pmatrix} U_{11}(\mu) & U_{12}(\mu) \\ U_{21}(\mu) & U_{22}(\mu) \end{pmatrix} \begin{pmatrix} \bar{y}(0) \\ 0 \end{pmatrix},$$

which leads to the system of equations

$$\begin{aligned} U_{11}(0)\bar{y}(\mu) &= U_{11}(\mu)\bar{y}(0) \qquad \forall \mu \in A, \\ U_{21}(0)\bar{y}(\mu) &= U_{21}(\mu)\bar{y}(0) \qquad \forall \mu \in A. \end{aligned} \tag{12}$$

Lemma 15. Let (U, y) be a solution of (2). Then there exists a system of coordinates of Kⁿ, such that

$$U(\chi) = \begin{pmatrix} U_{11}(\chi) & U_{12}(\chi) \\ (0)_{n-m,m} & U_{22}(\chi) \end{pmatrix}, \qquad \forall \chi \in A, \tag{13}$$

where the m × m-matrix U₁₁(0) is the block matrix of the form

$$U_{11}(0) = \begin{pmatrix} I_k & (0)_{k,m-k} \\ (0)_{m-k,k} & (0)_{m-k,m-k} \end{pmatrix} \tag{14}$$

for some k ≤ m.

Proof. According to Lemma 1 choose matrices C ∈ GL(n, K) and B' ∈ GL(m, K), such that

$$C\, U(0) \begin{pmatrix} B' & (0)_{m,n-m} \\ (0)_{n-m,m} & I_{n-m} \end{pmatrix} = \begin{pmatrix} V_{11}(0) & V_{12}(0) \\ (0)_{n-m,m} & V_{22}(0) \end{pmatrix}$$

and

$$V_{11}(0) = \begin{pmatrix} I_k & (0)_{k,m-k} \\ (0)_{m-k,k} & (0)_{m-k,m-k} \end{pmatrix}.$$

Without loss of generality assume that U = V. From the second line of (12) we deduce that 0 = U₂₁(0)ȳ(μ) = U₂₁(μ)ȳ(0) for all μ ∈ A. Since ȳ(0) can arbitrarily be chosen in K^m, it is clear that U₂₁(μ) = (0)_{n−m,m} for all μ ∈ A.

Since S_U⁰ = <e₁, ..., e_m>, there exist y₁, ..., y_m ∈ S_U, such that y_j(0) = e_j, the j-th unit vector in Kⁿ, for 1 ≤ j ≤ m. Let S_U' := <y₁, ..., y_m>, then S_U' is an m-dimensional subspace of S_U. In order to prove this, it is only necessary to show that y₁, ..., y_m are linearly independent. Let α₁, ..., α_m ∈ K, such that Σ_{i=1}^m α_i y_i = 0, then also Σ_{i=1}^m α_i y_i(0) = 0, which implies Σ_{i=1}^m α_i e_i = 0, so that α₁ = ... = α_m = 0.

For y ∈ S_U' there exist uniquely defined α₁, ..., α_m ∈ K such that y = Σ_{i=1}^m α_i y_i. These α_i can be read from y(0), since y(0) = Σ_{i=1}^m α_i e_i.

Define the m × m-matrix Y(χ) corresponding to the chosen y₁, ..., y_m by

$$Y(\chi) = (\bar{y}_1(\chi), \ldots, \bar{y}_m(\chi)), \tag{15}$$

i.e. the j-th column of Y(χ) is the vector ȳ_j(χ) ∈ K^m. Then for y ∈ S_U' we have

$$\bar{y}(\chi) = \sum_{i=1}^m \alpha_i \bar{y}_i(\chi) = Y(\chi) \begin{pmatrix} \alpha_1 \\ \vdots \\ \alpha_m \end{pmatrix} = Y(\chi)\bar{y}(0). \tag{16}$$

Replacing y by y_j in the first line of (12) we get for all μ ∈ A

$$U_{11}(0)\bar{y}_j(\mu) = U_{11}(\mu)\bar{y}_j(0) = U_{11}(\mu) e_j, \qquad j = 1, \ldots, m.$$

These equations are collected to the matrix equation

$$U_{11}(0) Y(\mu) = U_{11}(\mu), \qquad \forall \mu \in A. \tag{17}$$

The special form of U from Lemma 15 inserted into (11) yields for y = y_j the equation

$$U_{11}(\chi)\bar{y}_j(\mu) = U_{11}(0)\bar{y}_j(\chi + \mu), \qquad \forall \chi, \mu \in A.$$

Again these equations can be collected for j = 1, ..., m and we derive

$$U_{11}(\chi) Y(\mu) = U_{11}(0) Y(\chi + \mu), \qquad \forall \chi, \mu \in A. \tag{18}$$

Equations (17) and (18) together yield

$$U_{11}(0)[Y(\chi + \mu) - Y(\chi) Y(\mu)] = (0)_{m,m}, \qquad \forall \chi, \mu \in A. \tag{19}$$

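According to the special form of U₁₁(0) described in Lemma 15 we partition Y(χ) as a block matrix

$$Y(\chi) = \begin{pmatrix} Y_{11}(\chi) & Y_{12}(\chi) \\ Y_{21}(\chi) & Y_{22}(\chi) \end{pmatrix},$$

such that Y₁₁(χ) is a k × k-matrix. We note that the “auxiliary” matrix function Y: A --> M_m(K), which will help us to describe the space S_U of solutions y (for given U), is in general not uniquely determined. However, from (17), from the decomposition of U₁₁(0) in Lemma 15 and the corresponding decomposition of Y(χ) we see that Y₁₁(χ) and Y₁₂(χ) are uniquely determined by U₁₁(χ), namely

$$U_{11}(\mu) = U_{11}(0) Y(\mu) = \begin{pmatrix} I_k & 0 \\ 0 & 0 \end{pmatrix} \begin{pmatrix} Y_{11}(\mu) & Y_{12}(\mu) \\ Y_{21}(\mu) & Y_{22}(\mu) \end{pmatrix} = \begin{pmatrix} Y_{11}(\mu) & Y_{12}(\mu) \\ (0)_{m-k,k} & (0)_{m-k,m-k} \end{pmatrix}. \tag{20}$$

Then (19) can be rewritten as

$$\begin{pmatrix} I_k & 0 \\ 0 & 0 \end{pmatrix} \left[ \begin{pmatrix} Y_{11}(\chi+\mu) & Y_{12}(\chi+\mu) \\ Y_{21}(\chi+\mu) & Y_{22}(\chi+\mu) \end{pmatrix} - \begin{pmatrix} Y_{11}(\chi) & Y_{12}(\chi) \\ Y_{21}(\chi) & Y_{22}(\chi) \end{pmatrix} \begin{pmatrix} Y_{11}(\mu) & Y_{12}(\mu) \\ Y_{21}(\mu) & Y_{22}(\mu) \end{pmatrix} \right] = (0)_{m,m}$$

and we end up with the system of equations

$$\begin{aligned} Y_{11}(\chi+\mu) &= Y_{11}(\chi)Y_{11}(\mu) + Y_{12}(\chi)Y_{21}(\mu) \\ Y_{12}(\chi+\mu) &= Y_{11}(\chi)Y_{12}(\mu) + Y_{12}(\chi)Y_{22}(\mu) \end{aligned} \qquad \forall \chi, \mu \in A. \tag{21}$$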

From Y(0) = I_m we deduce that Y₁₁(0) = I_k, Y₂₁(0) = (0)_{m−k,k}, Y₁₂(0) = (0)_{k,m−k} and Y₂₂(0) = I_{m−k}. If μ is replaced by μ₁ + μ₂, then, taking into account that + is an associative composition, we get from the first line of (21) that

$$Y_{11}(\chi + (\mu_1 + \mu_2)) = Y_{11}(\chi)Y_{11}(\mu_1+\mu_2) + Y_{12}(\chi)Y_{21}(\mu_1+\mu_2) = Y_{11}(\chi)[Y_{11}(\mu_1)Y_{11}(\mu_2) + Y_{12}(\mu_1)Y_{21}(\mu_2)] + Y_{12}(\chi)Y_{21}(\mu_1+\mu_2)$$

is equal to

$$Y_{11}((\chi + \mu_1) + \mu_2) = Y_{11}(\chi+\mu_1)Y_{11}(\mu_2) + Y_{12}(\chi+\mu_1)Y_{21}(\mu_2) = [Y_{11}(\chi)Y_{11}(\mu_1) + Y_{12}(\chi)Y_{21}(\mu_1)]Y_{11}(\mu_2) + [Y_{11}(\chi)Y_{12}(\mu_1) + Y_{12}(\chi)Y_{22}(\mu_1)]Y_{21}(\mu_2),$$

which yields

$$Y_{12}(\chi)[Y_{21}(\mu_1+\mu_2) - Y_{21}(\mu_1)Y_{11}(\mu_2) - Y_{22}(\mu_1)Y_{21}(\mu_2)] = 0, \qquad \forall \chi, \mu_1, \mu_2 \in A. \tag{22}$$

In the same way we can derive from the second line of (21) that

$$Y_{12}(\chi)[Y_{22}(\mu_1+\mu_2) - Y_{21}(\mu_1)Y_{12}(\mu_2) - Y_{22}(\mu_1)Y_{22}(\mu_2)] = 0, \qquad \forall \chi, \mu_1, \mu_2 \in A. \tag{23}$$

Each Y₁₂(χ) determines a homomorphism from K^{m−k} to K^k. Let W := ∩_{χ∈A} ker Y₁₂(χ). Then W is an r-dimensional subspace of K^{m−k} for 0 ≤ r ≤ m − k with basis {d̂₁, ..., d̂_r}. Moreover, there exists an (m − k − r)-dimensional subspace V of K^{m−k}, such that K^{m−k} = V ⊕ W. Let {ĉ₁, ..., ĉ_{m−k−r}} be a basis of V. We embed K^{m−k} in a natural way into K^m by placing k zeros in front of each vector, i.e.

$$c_i := \begin{pmatrix} 0 \\ \hat{c}_i \end{pmatrix} \in K^m \quad\text{and}\quad d_i := \begin{pmatrix} 0 \\ \hat{d}_i \end{pmatrix} \in K^m.$$

Then it is possible to find a matrix B'' ∈ GL(m − k, K), such that the coordinate transformation on K^m induced by

$$B' := \begin{pmatrix} I_k & (0)_{k,m-k} \\ (0)_{m-k,k} & B'' \end{pmatrix} \in GL(m, K)$$

satisfies

$$\begin{aligned} B' e_i &= e_i, & 1 \le i \le k, \\ B' c_i &= e_{k+i}, & 1 \le i \le m-k-r, \\ B' d_i &= e_{m-r+i}, & 1 \le i \le r. \end{aligned}$$

Let B be the corresponding coordinate transformation on Kⁿ

$$B := \begin{pmatrix} B' & (0)_{m,n-m} \\ (0)_{n-m,m} & I_{n-m} \end{pmatrix} \in GL(n, K).$$

If U is decomposed as in (13) and (14), then also UB has this property.

Without loss of generality we assume that the basis of Kⁿ was chosen in such a way that Lemma 15 is satisfied and that {e_{k+1}, ..., e_{m−r}} and {e_{m−r+1}, ..., e_m} are a basis of V or W respectively. Then it is useful and important to partition Y(χ) further as a 3 × 3 block matrix of the form

$$Y(\chi) = \begin{pmatrix} Z_{11}(\chi) & Z_{12}(\chi) & Z_{13}(\chi) \\ Z_{21}(\chi) & Z_{22}(\chi) & Z_{23}(\chi) \\ Z_{31}(\chi) & Z_{32}(\chi) & Z_{33}(\chi) \end{pmatrix},$$

such that Z₁₁(χ) = Y₁₁(χ) ∈ M_k(K), Z₂₂(χ) ∈ M_{m−k−r}(K) and Z₃₃(χ) ∈ M_r(K). Hence

$$Y_{12}(\chi) = \begin{pmatrix} Z_{12}(\chi) & Z_{13}(\chi) \end{pmatrix}, \qquad Y_{21}(\chi) = \begin{pmatrix} Z_{21}(\chi) \\ Z_{31}(\chi) \end{pmatrix}, \qquad Y_{22}(\chi) = \begin{pmatrix} Z_{22}(\chi) & Z_{23}(\chi) \\ Z_{32}(\chi) & Z_{33}(\chi) \end{pmatrix}.$$

Let x = (x̃, x̂) denote a vector in K^{m−k}, written as a column with upper part x̃ ∈ K^{m−k−r} and lower part x̂ ∈ K^r. Then x belongs to W if and only if x̃ = 0. Moreover Y₁₂(χ)|_W = 0 for all χ ∈ A, which means Z₁₃(χ) = (0)_{k,r} for all χ ∈ A. From the definition of W it is clear that Z₁₂(χ)x̃ = 0 for all χ ∈ A is equivalent to x̃ = 0.

The first line of (21) reads now as

$$Z_{11}(\chi+\mu) = Z_{11}(\chi)Z_{11}(\mu) + Z_{12}(\chi)Z_{21}(\mu), \qquad \forall \chi, \mu \in A.$$

From the second line of (21) we derive

$$Z_{12}(\chi+\mu) = Z_{11}(\chi)Z_{12}(\mu) + Z_{12}(\chi)Z_{22}(\mu), \qquad \forall \chi, \mu \in A$$

and

$$(0)_{k,r} = Z_{13}(\chi+\mu) = Z_{11}(\chi)(0)_{k,r} + Z_{12}(\chi)Z_{23}(\mu), \qquad \forall \chi, \mu \in A.$$

Hence, each column of Z₂₃(μ) is 0 ∈ K^{m−k−r}, so that Z₂₃(μ) = (0)_{m−k−r,r} for all μ ∈ A.

From (22) we deduce

$$Z_{12}(\chi)[Z_{21}(\mu_1+\mu_2) - Z_{21}(\mu_1)Z_{11}(\mu_2) - Z_{22}(\mu_1)Z_{21}(\mu_2)] = (0)_{k,k}, \qquad \forall \chi, \mu_1, \mu_2 \in A.$$

Let M denote the matrix between the two brackets [ and ], then each column of M is 0 ∈ K^{m−k−r} and consequently M = (0)_{m−k−r,k}. Hence, we proved that

$$Z_{21}(\mu_1+\mu_2) - Z_{21}(\mu_1)Z_{11}(\mu_2) - Z_{22}(\mu_1)Z_{21}(\mu_2) = (0)_{m-k-r,k}, \qquad \forall \mu_1, \mu_2 \in A.$$

In the same way we deduce from (23) that

$$Z_{12}(\chi)[Z_{22}(\mu_1+\mu_2) - Z_{21}(\mu_1)Z_{12}(\mu_2) - Z_{22}(\mu_1)Z_{22}(\mu_2)] = (0)_{k,m-k-r}, \qquad \forall \chi, \mu_1, \mu_2 \in A$$

and correspondingly

$$Z_{22}(\mu_1+\mu_2) - Z_{21}(\mu_1)Z_{12}(\mu_2) - Z_{22}(\mu_1)Z_{22}(\mu_2) = (0)_{m-k-r,m-k-r}, \qquad \forall \mu_1, \mu_2 \in A.$$

This finishes the proof of

Theorem 16. There exists a coordinate system of K^m, such that Y(χ) is a solution of (19) if and only if Y(χ) can be written as

$$Y(\chi) = \begin{pmatrix} Z_{11}(\chi) & Z_{12}(\chi) & (0)_{k,r} \\ Z_{21}(\chi) & Z_{22}(\chi) & (0)_{m-k-r,r} \\ Z_{31}(\chi) & Z_{32}(\chi) & Z_{33}(\chi) \end{pmatrix},$$

where

$$\begin{pmatrix} Z_{11}(\chi) & Z_{12}(\chi) \\ Z_{21}(\chi) & Z_{22}(\chi) \end{pmatrix}$$

is an exponential function, Z₁₁(χ) ∈ M_k(K), Z₂₂(χ) ∈ M_{m−k−r}(K), Z₃₃(χ) ∈ M_r(K), satisfying the conditions Z₁₁(0) = I_k, Z₂₂(0) = I_{m−k−r}, Z₃₃(0) = I_r, Z₁₂(0) = (0)_{k,m−k−r}, Z₂₁(0) = (0)_{m−k−r,k}, Z₃₁(0) = (0)_{r,k} and Z₃₂(0) = (0)_{r,m−k−r}. For χ ≠ 0 the matrices Z₃₁(χ), Z₃₂(χ), Z₃₃(χ) can be arbitrarily chosen.

Next we describe the structure of S_U in more detail.

Lemma 17. For each y ∈ S_U with y(0) ≠ 0 there exists a subspace S_U' of S_U, such that y ∈ S_U'.

Proof. Since y(0) ≠ 0 also ȳ(0) ≠ 0. Hence there exist z₂⁰, ..., z_m⁰ ∈ Kⁿ, such that {y(0), z₂⁰, ..., z_m⁰} is a basis of S_U⁰ = <e₁, ..., e_m>. Consequently, there exist z₂, ..., z_m ∈ S_U, such that z_j(0) = z_j⁰ for j = 2, ..., m. Then y, z₂, ..., z_m are linearly independent, which implies that <y, z₂, ..., z_m> is an m-dimensional subspace of S_U. Hence, there exist y₁, ..., y_m ∈ <y, z₂, ..., z_m>, such that y_j(0) = e_j for j = 1, ..., m and y ∈ <y, z₂, ..., z_m> = <y₁, ..., y_m> =: S_U'.

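Let N(S_U) denote the set {z ∈ S_U | z(0) = 0}. Then N(S_U) is a subspace of S_U. The appearance of this subspace N(S_U) of S_U, which is in general not {0}, is one of the main differences to the case of mappings U: A --> GL(n, K). We will see that N(S_U) is closely related to the space W. This is described in

Lemma 18. A function z: A --> Kⁿ, whose first m components form the vector z̄(χ) ∈ K^m and whose remaining components are 0, belongs to N(S_U) if and only if

$$\bar{z}(0) = 0 \quad\text{and}\quad \bar{z}(\chi) = \begin{pmatrix} 0 \\ \hat{z}(\chi) \end{pmatrix} \quad\text{for } \hat{z}(\chi) \in W. \tag{24}$$

Proof. We write z̄(χ) as a column with upper part z̃(χ) ∈ K^k and lower part ẑ(χ) ∈ K^{m−k}. The function z belongs to N(S_U) if and only if z̄(0) = 0 and U₁₁(χ + μ)z̄(ν) = U₁₁(χ)z̄(μ + ν) for all χ, μ, ν ∈ A. Especially for χ = ν = 0 and because of the particular form of U₁₁ given in (20) we get

$$\begin{pmatrix} Y_{11}(\mu) & Y_{12}(\mu) \\ (0)_{m-k,k} & (0)_{m-k,m-k} \end{pmatrix} \begin{pmatrix} 0 \\ 0 \end{pmatrix} = \begin{pmatrix} Y_{11}(0) & Y_{12}(0) \\ (0)_{m-k,k} & (0)_{m-k,m-k} \end{pmatrix} \begin{pmatrix} \tilde{z}(\mu) \\ \hat{z}(\mu) \end{pmatrix} = \begin{pmatrix} I_k & (0)_{k,m-k} \\ (0)_{m-k,k} & (0)_{m-k,m-k} \end{pmatrix} \begin{pmatrix} \tilde{z}(\mu) \\ \hat{z}(\mu) \end{pmatrix}, \qquad \forall \mu \in A,$$

so that z̃(μ) = 0 for all μ ∈ A. For ν = 0 we derive

$$\begin{pmatrix} Y_{11}(\chi+\mu) & Y_{12}(\chi+\mu) \\ (0)_{m-k,k} & (0)_{m-k,m-k} \end{pmatrix} \begin{pmatrix} 0 \\ 0 \end{pmatrix} = \begin{pmatrix} Y_{11}(\chi) & Y_{12}(\chi) \\ (0)_{m-k,k} & (0)_{m-k,m-k} \end{pmatrix} \begin{pmatrix} 0 \\ \hat{z}(\mu) \end{pmatrix},$$

so that Y₁₂(χ)ẑ(μ) = 0 for all χ, μ ∈ A. This however implies that ẑ(μ) ∈ W for all μ ∈ A.

Assuming conversely that z̄(0) = 0, z̃(χ) = 0 and ẑ(χ) ∈ W for all χ ∈ A, then it is obvious that z ∈ N(S_U).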

In conclusion we get the following result:

Lemma 19. Let S_U' be an m-dimensional subspace of S_U (constructed as above), then S_U = S_U' ⊕ N(S_U).

Proof. It is clear that S_U' ∩ N(S_U) = {0} and that S_U' ⊕ N(S_U) is a subspace of S_U. Assume that x is an element of S_U, then there exists y ∈ S_U', such that y(0) = x(0). Then z(χ) := x(χ) − y(χ) belongs to N(S_U). Hence x(χ) = z(χ) + y(χ), which finishes the proof.

We notice that in the decomposition S_U = S_U' ⊕ N(S_U) the space S_U' is in general not uniquely determined, whereas N(S_U) is unique by definition. However, S_U' can be any m-dimensional subspace of S_U, such that the space of initial values y(0) (for y ∈ S_U') is already S_U⁰.

As an immediate consequence we get

$$\dim S_U = \dim S_U' + \dim N(S_U) = m + \dim N(S_U).$$

The following Theorem 20 yields together with Theorem 16 the structure of the space S_U of all solutions of (2) for a given function U: A --> M_n(K). These theorems also contain necessary conditions on U in order to admit a nontrivial solution y.

Theorem 20. Let U: A --> M_n(K) be given and assume that dim S_U⁰ = m. Then there exist coordinates in Kⁿ and solutions (U, y_j) of (2) for j = 1, ..., m, such that y_j(0) = e_j. Moreover, U(χ) can be written as in (13) and U₁₁(χ) satisfies (20). Y₁₁(χ) and Y₁₂(χ) are the blocks in the first row of the matrix Y(χ) given by (15). This matrix is also a solution of (19) and each element y of S_U can be expressed as

$$y(\chi) = \begin{pmatrix} \bar{y}(\chi) + \bar{z}(\chi) \\ 0 \end{pmatrix}$$

for ȳ given by (16) with arbitrary ȳ(0) ∈ K^m and z̄(χ) given by (24).

We finish by Theorem 21, which provides a construction of all solutions (U, y) of (2) by starting from an arbitrary subspace of initial values y(0) of Kⁿ. This choice then leads via the block matrix Y () satisfying (19) to a matrix valued function U and a space S of solutions corresponding to U. In this general situation we do not discuss the problem when S = S_U.

Theorem 21. If Y(χ) satisfies (19), U₁₁(χ) is given by (20), ȳ is given by (16) for arbitrary ȳ(0) ∈ K^m and z̄(χ) is given by (24), then (U, y) is a solution of (2) for

$$y(\chi) = \begin{pmatrix} \bar{y}(\chi) + \bar{z}(\chi) \\ 0 \end{pmatrix}.$$

Proof. From the special form of U(χ) and y(χ) it is clear that (2) is satisfied if and only if U₁₁(χ + μ)[ȳ(ν) + z̄(ν)] = U₁₁(χ)[ȳ(μ + ν) + z̄(μ + ν)] for all χ, μ, ν ∈ A. This is equivalent to U₁₁(χ + μ)[Y(ν)ȳ(0) + z̄(ν)] = U₁₁(χ)[Y(μ + ν)ȳ(0) + z̄(μ + ν)]. Due to the definition of z̄(χ) this is equivalent to U₁₁(0)[Y(χ + μ)Y(ν) − Y(χ)Y(μ + ν)]ȳ(0) = 0 for all χ, μ, ν ∈ A. Since ȳ(0) is an arbitrary element of K^m we derive

$$U_{11}(0)[Y(\chi+\mu)Y(\nu) - Y(\chi)Y(\mu+\nu)] = (0)_{m,m}, \qquad \forall \chi, \mu, \nu \in A.$$

We can rewrite U₁₁(0)[Y(χ + μ)Y(ν) − Y(χ)Y(μ + ν)] as U₁₁(0)[Y(χ + μ)Y(ν) − Y(χ + μ + ν) + Y(χ + μ + ν) − Y(χ)Y(μ + ν)] = U₁₁(0)[Y(χ + μ)Y(ν) − Y(χ + μ + ν)] + U₁₁(0)[Y(χ + μ + ν) − Y(χ)Y(μ + ν)], which is equal to (0)_{m,m} since (19) holds.
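
A minimal instance of this construction shows how solutions with y(0) = 0 but y ≠ 0 arise. The sketch assumes A = Z/3Z, K = GF(7) and m = n = 2, k = 1, r = 1 (so that the middle block of Y is absent and W = K); the concrete tables `Z31`, `Z33` and `Z_HAT` are arbitrary choices made for illustration.

```python
P = 7   # K = GF(7)
N = 3   # A = Z/3Z

Z31 = [0, 5, 1]     # arbitrary, except Z31(0) = 0
Z33 = [1, 0, 4]     # arbitrary, except Z33(0) = 1
Z_HAT = [0, 6, 2]   # arbitrary values in W = K, except z_hat(0) = 0

def Z11(chi):
    """Exponential function on Z/3Z; 2 has order 3 modulo 7."""
    return pow(2, chi % N, P)

def Y(chi):
    chi %= N
    return [[Z11(chi), 0], [Z31[chi], Z33[chi]]]

def U(chi):
    """U(chi) = U(0) Y(chi) with the singular matrix U(0) = diag(1, 0)."""
    return [[Z11(chi), 0], [0, 0]]

def apply(m, v):
    """Matrix-vector product over GF(P)."""
    return [sum(m[i][j] * v[j] for j in range(2)) % P for i in range(2)]

def make_y(y0):
    """y(chi) = Y(chi) y0 + (0, z_hat(chi))."""
    def y(chi):
        top, bottom = apply(Y(chi), y0)
        return [top, (bottom + Z_HAT[chi % N]) % P]
    return y

def solves_eq2(y):
    return all(apply(U(c + m), y(n)) == apply(U(c), y(m + n))
               for c in range(N) for m in range(N) for n in range(N))

all_solve = all(solves_eq2(make_y([a, b])) for a in range(P) for b in range(P))
z = make_y([0, 0])                       # initial value 0
z_values = [z(chi) for chi in range(N)]  # -> [[0, 0], [0, 6], [0, 2]]
```

The function `z` starts at 0 and is nevertheless a nonzero solution, which cannot happen when U takes values in GL(n, K).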

In order to determine all solutions (U, y) of (2) we start with an arbitrary m-dimensional subspace S⁰ of Kⁿ for some 0 ≤ m ≤ n. Let {b₁, ..., b_m} be a basis of S⁰, then there exists a matrix B ∈ GL(n, K), such that Bb_i = e_i for 1 ≤ i ≤ m. Hence BS⁰ = <e₁, ..., e_m>. For each solution Y(χ) of (19) described in Theorem 16 let U₁₁(χ) be given by (20) and U(χ) be given by (13) with arbitrary matrices U₁₂(χ) and U₂₂(χ). Then each element y of

$$T := \left\{\, y(\chi) = \begin{pmatrix} \bar{y}(\chi) + \bar{z}(\chi) \\ 0 \end{pmatrix} \;\middle|\; \bar{y}(0) \in K^m,\ \bar{y}(\chi) = Y(\chi)\bar{y}(0),\ \bar{z}(\chi) \text{ given by (24)} \,\right\}$$

is together with U a solution of (2). Due to this construction T⁰, the space of initial values y(0) for y ∈ T, is equal to <e₁, ..., e_m>. According to Lemma 1 each pair (UB, B^-1y) for y ∈ T is a solution of (2) and {B^-1y(0) | y ∈ T} = S⁰. Hence, by varying S⁰ over all subspaces of Kⁿ we determine all solutions (U, y) of (2).

References

[1] H. Fripertinger and J. Schwaiger. Some applications of functional equations in astronomy. Grazer Mathematische Berichte, 344 (2001), 1-6.

[2] R. Lidl and H. Niederreiter. Finite Fields, volume 20 of Encyclopedia of Mathematics and its Applications. Addison-Wesley Publishing Company, London, Amsterdam, Don Mills - Ontario, Sydney, Tokyo, 1983. ISBN 0-201-13519-1.

[3] M.A. McKiernan. The matrix equation a(xoy) = a(x)+a(x)a(y)+a(y). Aequationes Mathematicae, 15 (1977), 213-223.

[4] J. Schwaiger. Some applications of functional equations in astronomy. Aequationes Mathematicae, 60 (2000), p. 185. In Report of the meeting, The Thirty-seventh International Symposium on Functional Equations, May 16-23, 1999, Huntington, WV.

HARALD FRIPERTINGER
LUDWIG REICH
Institut für Mathematik
Karl-Franzens-Universität Graz
Heinrichstr. 36/4
A-8010 Graz
Austria
harald.fripertinger@kfunigraz.ac.at
ludwig.reich@kfunigraz.ac.at