ๅฎŸ็›ด่‹ฆ้—˜ ใ‚ธใƒƒใƒใƒงใ‚ฏใ‚ฏใƒˆใ‚ฆใ€ใ‚ฑใ‚คใ‚ธใƒณใ‚ดใ‚ฏใ‚จใƒ„

Lie Algebra And Its Classification

Every simple lie, its root will be found, when winter comes.

ๆ•ฐๅญฆ

In this article, we will study Lie algebra and classify some of them. The central technique is given in the post of root systems, and the aim of this article is to find a root system of each Lie algebra and prove the correspondence of them.

Lie algebra

Axiomatically, A Lie algebra can be defined as a special algebra on a F\mathbb{F}-vector space g\mathfrak{g}, with a bilinear operation [โ‹…,โ‹…]:gโ†’g[\cdot,\cdot]:\mathfrak{g}\rightarrow\mathfrak{g} called Lie bracket, which is demanded to have the following properties.

  1. For all x,yโˆˆgx,y\in\mathfrak{g}, [x,y]=โˆ’[y,x][x,y]=-[y,x].
  2. For all x,y,zโˆˆgx,y,z\in\mathfrak{g}, we have Jacobi identity,
[x,[y,z]]+[y,[z,x]]+[z,[x,y]]=0.[x,[y,z]]+[y,[z,x]]+[z,[x,y]]=0.

In this article, we simply take F=C\mathbb{F}=\mathbb{C}.

As it is an algebra, we can define the subalgebra and then the ideal of it. Basically, a subset h\mathfrak{h} of it is a subalgebra if it is closed under all the operations on g\mathfrak{g}. And we say it is an ideal, if it is a subalgebra and for all xโˆˆhx\in\mathfrak{h} and yโˆˆgy\in\mathfrak{g}, we have [x,y]โˆˆh[x,y]\in\mathfrak{h} (as what is defined in the case of a ring), denoted as hโŠดg\mathfrak{h}\trianglelefteq\mathfrak{g}.

The necessity of ideal is given by the calculation of quotient space. Given a subalgebra hโŠ‚g\mathfrak{h}\subset\mathfrak{g}, we can define x+hโˆˆg/hx+\mathfrak{h}\in\mathfrak{g}/\mathfrak{h}. Then we can give the definition of commutator in quotient space by

[x+h,y+h]=[x,y]+h.[x+\mathfrak{h},y+\mathfrak{h}]=[x,y]+\mathfrak{h}.

Now, for arbitrary hโˆˆhh\in\mathfrak{h}, we shall see (x+h)+h=x+h(x+h)+\mathfrak{h}= x+\mathfrak{h} since the equivalence relation of quotient space is defined as xโˆผyโ‡”xโˆ’yโˆˆhx\sim y\Leftrightarrow x-y\in\mathfrak{h}. Then we have

[x,y]+h=[x+h,y+h]=[x+h+h,y+h]=[x+h,y]+h=[x,y]+[h,y]+h.\begin{aligned} {} [x,y]+\mathfrak{h}=&[x+\mathfrak{h},y+\mathfrak{h}]\\ =&[x+h+\mathfrak{h},y+\mathfrak{h}]\\ =&[x+h,y]+\mathfrak{h}=[x,y]+[h,y]+\mathfrak{h}. \end{aligned}

We want this is well-defined, then we need [h,y]โˆˆh[h,y]\in\mathfrak{h}. Considering the arbitrariness of hโˆˆhh\in\mathfrak{h} and yโˆˆgy\in\mathfrak{g}, this is exactly the definition of an ideal.

Once we see the idea of quotient space, we can think of direct sum of Lie algebras.

Given two Lie algebras a\mathfrak a and b\mathfrak b, we can define the direct sum of them as the direct sum of vector space aโŠ•b={(a,b);aโˆˆa,bโˆˆb}\mathfrak{a}\oplus\mathfrak b=\{(a,b);a\in\mathfrak a, b\in\mathfrak b\} with new Lie bracket

[(a1,b1),(a2,b2)]aโŠ•b=([a1,a2]a,[b1,b2]b).[(a_1,b_1),(a_2,b_2)]_{\mathfrak a \oplus\mathfrak b}=([a_1,a_2]_\mathfrak a,[b_1,b_2]_\mathfrak b).

And we can define the decomposition of direct sum for two ideals a,b\mathfrak{a,b} of g\mathfrak{g}. If g\mathfrak{g} is the direct sum of a,b\mathfrak{a,b} in the sense of vector space (i.e, aโˆฉb=0\mathfrak a\cap\mathfrak b=0), then we say aโŠ•b\mathfrak{a}\oplus\mathfrak{b} is the decomposition of g\mathfrak{g}.

Actually, we can define the homomorphism on Lie algebra which is a linear map that preserves Lie bracket, and it is an isomorphism if it is a bijection. Then we shall see g\mathfrak{g} is isomorphic to aโŠ•b\mathfrak{a}\oplus\mathfrak{b}.

And we shall see now, bโ‰…g/a\mathfrak{b}\cong \mathfrak{g}/\mathfrak{a} !!!!!!

Simple, Simple and Simple

Now, letโ€™s consider the converse problem: given a\mathfrak a is an ideal of g\mathfrak g, is aโŠ•g/aโ‰…g\mathfrak{a}\oplus\mathfrak{g/a}\cong \mathfrak{g}?

At the first glance, it seems to be obvious, but actually, we can construct a counter example. Letโ€™s think a=C\mathfrak a=\mathbb C and b=C2\mathfrak{b}=\mathbb C^2 are two trivial Lie algebras with

โˆ€a,bโˆˆC,[a,b]=0,ย andย โˆ€a,bโˆˆC2,[a,b]=0.\forall a,b\in\mathbb C,[a,b]=0,\text{ and }\forall a,b\in\mathbb C^2,[a,b]=0.

Now, we shall see C3\mathbb C^3 can be a Lie algebra, and it can be a direct sum of C\mathbb C and C2\mathbb C^2.

However, we can give different Lie brackets between C\mathbb C and C2\mathbb C^2. We suppose a=C\mathfrak a=\mathbb C is an ideal of C3\mathbb C^3 and b=C2\mathfrak b=\mathbb{C}^2 is a subalgebra of it. Now, choose a base {e1}\{e_1\} for a\mathfrak a and {e2,e3}\{e_2,e_3\} for b\mathfrak b. Then we only have to find how [e1,e2][e_1,e_2] and [e1,e3][e_1,e_3] computes since the Lie bracket is a bilinear operation.

Since a\mathfrak a is an ideal, then [e1,e2][e_1,e_2] and [e1,e3][e_1,e_3] are all resides in a\mathfrak a. Then, we can assume [e1,e2]=ฮปe1[e_1,e_2]=\lambda e_1 and [e1,e3]=ฮผe1[e_1,e_3]=\mu e_1. For each pair of (ฮป,ฮผ)(\lambda,\mu), it defines a unique Lie algebra C(ฮป,ฮผ)3\mathbb C^3_{(\lambda,\mu)}. For (a1,b1,c1),(a2,b2,c2)โˆˆC(ฮป,ฮผ)3(a_1,b_1,c_1),(a_2,b_2,c_2)\in \mathbb C^3_{(\lambda,\mu)}, we have

[(a1,b1,c1),(a2,b2,c2)]=[a1e1+b1e2+c1e3,a2e1+b2e2+c2e3]=[a1e1,a2e1]+[a1e1,b2e2]+[a1e1,c2e3]+[b1e2,a2e1]+[b1e2,b2e2]+[b1e2,c2e3]+[c1e3,a2e1]+[c1e3,b2e2]+[c1e3,c2e3]=(ฮปa1b2+ฮผa1c2โˆ’ฮปa2b1โˆ’ฮผa2c1)e1=[ฮป(a1b2โˆ’a2b1)+ฮผ(a1c2โˆ’a2c1)]e1=(ฮป(a1b2โˆ’a2b1)+ฮผ(a1c2โˆ’a2c1),0,0).\begin{aligned} {}[(a_1,b_1,c_1),(a_2,b_2,c_2)]=&[a_1e_1+b_1e_2+c_1e_3,a_2e_1+b_2e_2+c_2e_3]\\ =&[a_1e_1,a_2e_1]+[a_1e_1,b_2e_2]+[a_1e_1,c_2e_3]\\ +&[b_1e_2,a_2e_1]+[b_1e_2,b_2e_2]+[b_1e_2,c_2e_3]\\ +&[c_1e_3,a_2e_1]+[c_1e_3,b_2e_2]+[c_1e_3,c_2e_3]\\ =&(\lambda a_1b_2+\mu a_1c_2-\lambda a_2b_1-\mu a_2c_1)e_1\\ =&[\lambda(a_1b_2-a_2b_1)+\mu(a_1c_2-a_2c_1)]e_1\\ =&(\lambda(a_1b_2-a_2b_1)+\mu(a_1c_2-a_2c_1),0,0). \end{aligned}

If ฮป,ฮผ=ฬธ0\lambda,\mu\not=0, then we shall see g/aโ‰…b\mathfrak{g/a}\cong\mathfrak b, but aโŠ•b=ฬธg\mathfrak a \oplus\mathfrak{b}\not=\mathfrak g, which means aโŠ•g/aโ‰…ฬธg\mathfrak{a}\oplus\mathfrak{g/a}\not\cong \mathfrak{g}, and this is a โ€œwrong decompositionโ€!

Now, we can think of a kind of โ€œgoodโ€ Lie algebras, which has the property that for any ideal of it, we have a decomposition of direct sum. Why? Because our aim is to classify Lie algebras, thus we need them to be structured and simple enough.

Letโ€™s think the simplest situation, where a Lie algebra doesnโ€™t have any nontrivial ideal. Moreover, itโ€™s easy to see that [g,g]:={[a,b];a,bโˆˆg}[\mathfrak{g},\mathfrak{g}]:=\{[a,b];a,b\in\mathfrak{g}\} is an ideal of g\mathfrak g, then [g,g][\mathfrak g,\mathfrak g] equals either 00 or g\mathfrak g. The former is too trivial and we are not interested in it, we only want the latter. This leads to the definition of simple Lie algebra:

A Lie algebra g\mathfrak{g} is called simple if it has no non-trivial ideal and there exists a,bโˆˆga,b\in\mathfrak g such that [a,b]=ฬธ0[a,b]\not=0.

If we can classify simple ones, the direct sum of them is naturally classified. Such objects, are called semisimple Lie algebras.

Formally, a Lie algebra g\mathfrak g is called semisimple, if and only if

gโ‰…s1โŠ•โ‹ฏโŠ•sn,\mathfrak {g\cong s_1\oplus\cdots\oplus s_n},

where all si\mathfrak s_i are simple Lie algebra. (Semi.A)

Now, consider any ideal a\mathfrak a of g=โจiโˆˆIsi\mathfrak{g}=\bigoplus_{i\in I}\mathfrak{s}_i. For each iโˆˆIi\in I, [a,si]โŠ‚aโˆฉsi[\mathfrak{a},\mathfrak{s}_i]\subset\mathfrak{a}\cap\mathfrak{s}_i, which is an ideal of si\mathfrak{s}_i. By the simplicity of si\mathfrak s_i, we shall see [a,si][\mathfrak a,\mathfrak s_i] is either si\mathfrak{s}_i or 00. Assume it is 00 for all si\mathfrak{s}_i, then for each iโˆˆIi\in I, ฯ€si(a)โŠ‚Z(si)=0\pi_{\mathfrak{s}_i}(\mathfrak{a})\subset Z(\mathfrak{s}_i)=0, where ฯ€si(a)\pi_{\mathfrak{s}_i}(\mathfrak{a}) is the project on si\mathfrak{s}_i from a\mathfrak{a} and Z(si)Z(\mathfrak s_i) is the center of si\mathfrak s_is. Hence a=0\mathfrak{a}=0 in such case. Suppose a\mathfrak{a} is not 00, then we have a=โจiโˆˆJโŠ‚Isi\mathfrak{a}=\bigoplus_{i\in J\subset I}\mathfrak{s}_i, we choose b=โจiโˆˆฬธJsi\mathfrak{b}=\bigoplus_{i\not\in J}\mathfrak{s}_i, then we know g=aโŠ•b\mathfrak{g}=\mathfrak{a}\oplus\mathfrak{b}. Since bโ‰…g/a\mathfrak{b}\cong\mathfrak{g/a}, then

gโ‰…aโŠ•g/a.\mathfrak{g}\cong\mathfrak{a}\oplus\mathfrak{g/a}.

Here we know, semisimple Lie algebras are the ideal objects we want.

Characterization of Semisimple Lie Algebras

Consider [g,g][\mathfrak{g},\mathfrak{g}]. Itโ€™s clear that it is an ideal of g\mathfrak{g}, and it gives a decomposition of g\mathfrak{g}. If this process keeps, we may find it down to 00. To formalize this idea, we define g(0)=g\mathfrak{g}^{(0)}=\mathfrak{g}, and for each nโˆˆNn\in\mathbb{N}, we write g(n+1)=[g(n),g(n)]\mathfrak{g}^{(n+1)}=[\mathfrak{g}^{(n)},\mathfrak{g}^{(n)}]. Then we have something called a derived sequence of g\mathfrak{g}:

g=g(0)โŠตโ‹ฏโŠตg(n)โŠตโ‹ฏโ€‰.\mathfrak{g}=\mathfrak{g}^{(0)}\trianglerighteq \cdots\trianglerighteq \mathfrak{g}^{(n)}\trianglerighteq\cdots.

We say g\mathfrak{g} is solvable if there is some nn such that g(n)=0\mathfrak{g}^{(n)}=0. Itโ€™s clear that the direct sum of two solvable Lie algebra is still solvable. Then we can define the thing called radical of a Lie algebra g\mathfrak{g}, which is the biggest solvable ideal, denoted as radโก(g)\operatorname{rad}(\mathfrak{g}).

Since r=radโก(g)\mathfrak{r}=\operatorname{rad}(\mathfrak{g}) is an ideal of g\mathfrak{g}, we have r=โจiโˆˆIsi\mathfrak{r}=\bigoplus_{i\in I}\mathfrak{s}_i where si\mathfrak{s}_i are simple algebras. However for every iโˆˆIi\in I, [si,si]=0[\mathfrak{s}_i,\mathfrak{s}_i]=0. Hence, we know that radโก(g)=0\operatorname{rad}(\mathfrak{g})=0.

Conversely, we suppose radโก(g)=0\operatorname{rad}(\mathfrak{g})=0. We want to prove that g\mathfrak{g} is semisimple, which means it can be decomposed into direct sum of simple Lie algebras. This will give a characterization of semisimplicity.

However, the direct proof is hard. Itโ€™s because we donโ€™t have enough tools to decompose a Lie algebra. Given an ideal iโŠดg\mathfrak{i}\trianglelefteq \mathfrak{g}, how can we find a standard complement of it? The traditional way (like what we do in linear algebra) to do this, is finding an orthogonal complement

iโŠฅ:={xโˆˆg;B(x,y)=0,yโˆˆi},\mathfrak{i}^{\perp}:=\{x\in\mathfrak{g};B(x,y)=0,y\in\mathfrak{i}\},

where BB is a bilinear form (Kill.A). Moreover, we need iโŠฅ\mathfrak{i}^\perp to be an ideal of g\mathfrak{g}, therefore for xโˆˆiโŠฅ,yโˆˆi,zโˆˆgx\in\mathfrak{i}^\perp,y\in\mathfrak{i},z\in\mathfrak{g}, [z,x]โˆˆiโŠฅ[z,x]\in\mathfrak{i}^\perp, then B([z,x],y)=0B([z,x],y)=0. Since B(x,[z,y])B(x,[z,y]) is always 00, hence ideally, we need BB to have the property (Kill.B) that

B([z,x],y)=โˆ’B(x,[z,y]),ย forย allย x,y,zโˆˆg.B([z,x],y)=-B(x,[z,y]),\text{ for all } x,y,z\in\mathfrak{g}.

Therefore, we need to find such a bilinear form B:gร—gโ†’FB:\mathfrak{g}\times\mathfrak{g}\rightarrow\mathbb{F} at first.

Since g\mathfrak{g} is too far from numbers, we first need to give a representation ฯ:gโ†’Endโก(g)\rho:\mathfrak{g}\rightarrow\operatorname{End}(\mathfrak{g}). Naturally, we can use adjoint representation here,

adโก:gโ†’Endโก(g),adโก(x)=[x,โ‹…].\operatorname{ad}:\mathfrak{g}\rightarrow\operatorname{End}(\mathfrak{g}),\operatorname{ad}(x)=[x,\cdot].

Itโ€™s quite clear that such map is a good representation of Lie algebras, which is given by the Jacobiโ€™s identity that

[x,[y,z]]+[y,[z,x]]+[z,[x,y]]=0โ‡’[x,[y,z]]+[y,[z,x]]=โˆ’[z,[x,y]]โ‡’[x,[y,z]]โˆ’[y,[x,z]]=[[x,y],z]โ‡’(adโก(x)โˆ˜adโก(y))(z)โˆ’(adโก(y)โˆ˜adโก(x))(z)=adโก([x,y])(z)hereย weย define[A,B]=ABโˆ’BAโ‡’[adโก(x),adโก(y)](z)=adโก([x,y])(z)โ‡’[adโก(x),adโก(y)]=adโก([x,y]).\begin{aligned} &[x,[y,z]]+[y,[z,x]]+[z,[x,y]]=0\\ \Rightarrow\quad&[x,[y,z]]+[y,[z,x]]=-[z,[x,y]]\\ \Rightarrow\quad&[x,[y,z]]-[y,[x,z]]=[[x,y],z]\\ \Rightarrow\quad&(\operatorname{ad}(x)\circ\operatorname{ad}(y))(z)-(\operatorname{ad}(y)\circ\operatorname{ad}(x))(z)=\operatorname{ad}([x,y])(z)\\ &\text{here we define}[A,B]=AB-BA\\ \Rightarrow\quad&[\operatorname{ad}(x),\operatorname{ad}(y)](z)=\operatorname{ad}([x,y])(z)\\ \Rightarrow\quad&[\operatorname{ad}(x),\operatorname{ad}(y)]=\operatorname{ad}([x,y]). \end{aligned}

Now, the problem becomes finding a bilinear form C:Endโก(g)ร—Endโก(g)โ†’CC:\operatorname{End}(\mathfrak{g})\times\operatorname{End}(\mathfrak{g})\rightarrow\mathbb{C}.

By some representation theory, we have Endโก(g)โ‰…VโŠ—Vโˆ—\operatorname{End}(\mathfrak{g})\cong V\otimes V^*, and the problem is equivalent to find a linear map

D:VeiโŠ—Vejโˆ—โŠ—VekโŠ—Velโˆ—โ†’C.D:\underset{e_i}V\otimes\underset{e^j}V^*\otimes\underset{e_k}V\otimes\underset{e^l}V^*\rightarrow\mathbb{C}.

For A,BโˆˆEndโก(g)A,B\in\operatorname{End}(\mathfrak{g}), we now can write A=โˆ‘i,jAijeiโŠ—ejA=\sum_{i,j}A_{ij}e_i\otimes e^j and B=โˆ‘k,lBklekโŠ—elB=\sum_{k,l}B_{kl}e_k\otimes e^l. We have two ways to compute such linear map by contraction. If we let eje^j acts on eie_i and ele^l acts on eke_k, then

D1(A,B)=โˆ‘i,j,k,lAijBklฮดijฮดkl=โˆ‘i,kAiiBjj=trโก(A)trโก(B).D_1(A,B)=\sum_{i,j,k,l}A_{ij}B_{kl}\delta_i^j\delta_k^l=\sum_{i,k}A_{ii}B_{jj}=\operatorname{tr}(A)\operatorname{tr}(B).

If we let ele^l acts on eie_i and eje^j acts on eke_k, then

D2(A,B)=โˆ‘i,j,k,lAijBklฮดilฮดkj=โˆ‘i,jAijBij=trโก(AB).D_2(A,B)=\sum_{i,j,k,l}A_{ij}B_{kl}\delta_i^l\delta_k^j=\sum_{i,j}A_{ij}B_{ij}=\operatorname{tr}(AB).

Now, we have two candidates for the bilinear form. For x,yโˆˆgx,y\in\mathfrak{g},

B1(x,y)=trโก(adโก(x))trโก(adโก(y)),B_1(x,y)=\operatorname{tr}(\operatorname{ad}(x))\operatorname{tr}(\operatorname{ad}(y)), B2(x,y)=trโก(adโก(x)โˆ˜adโก(y)).B_2(x,y)=\operatorname{tr}(\operatorname{ad}(x)\circ\operatorname{ad}(y)).

However, in semisimple Lie algebra g\mathfrak{g}, [g,g]=โจi[si,si]=โจisi=g[\mathfrak{g},\mathfrak{g}]=\bigoplus_i[\mathfrak{s}_i,\mathfrak{s}_i]=\bigoplus_i\mathfrak{s}_i=\mathfrak{g}. Then for each xโˆˆgx\in\mathfrak{g}, it can be written as x=โˆ‘i[yi,zi]x=\sum_i[y_i,z_i]. Then

trโก(adโก(x))=โˆ‘itrโก(adโก([xi,yi]))=โˆ‘itrโก(adโก(xi)adโก(yi)โˆ’adโก(yi)adโก(xi))=โˆ‘itrโก(adโก(xi)adโก(yi))โˆ’trโก(adโก(xi)adโก(yi))=0.\begin{aligned} \operatorname{tr}(\operatorname{ad}(x))&=\sum_i\operatorname{tr}(\operatorname{ad}([x_i,y_i]))\\ &=\sum_i\operatorname{tr}(\operatorname{ad}(x_i)\operatorname{ad}(y_i)-\operatorname{ad}(y_i)\operatorname{ad}(x_i))\\ &=\sum_i\operatorname{tr}(\operatorname{ad}(x_i)\operatorname{ad}(y_i))-\operatorname{tr}(\operatorname{ad}(x_i)\operatorname{ad}(y_i))=0. \end{aligned}

Hence, B1B_1 is disused here because B1(x,y)โ‰ก0B_1(x,y)\equiv0.

For B2B_2, we let z={xโˆˆg;B2(x,y)=0,โˆ€yโˆˆg}\mathfrak{z}=\{x\in\mathfrak{g};B_2(x,y)=0,\forall y\in\mathfrak{g}\}. Itโ€™s easy to verify that zโŠดg\mathfrak{z}\trianglelefteq\mathfrak{g}. A machinery can be given here called Cartanโ€™s criterion that a Lie algebra g\mathfrak{g} over a field k\mathbb{k} characteristic zero is solvable if and only if [g,g]โŠ‚{xโˆˆg;B2(x,y)=0,โˆ€yโˆˆg}[\mathfrak{g},\mathfrak{g}]\subset\{x\in\mathfrak{g}; B_2(x,y)=0,\forall y\in\mathfrak{g}\} (See [Etingof, Theorem 16.18], and we wonโ€™t prove it since itโ€™s too technical). Then, we see z\mathfrak{z} is solvable. Since radโก(g)=0\operatorname{rad}(\mathfrak{g})=0, then z=0\mathfrak{z}=0.

Now, we know B2B_2 is non-degenerate on semisimple Lie algebras, and it is exactly the bilinear form we want to construct. Historically, this is originally given by Wilhelm Killing, thus we call it Killing form, denoted as ฮบ\kappa.

Back to the waist line, we now use the Killing form to decompose a given semisimple Lie algebra g\mathfrak{g}. For any ideal iโŠดa\mathfrak{i}\trianglelefteq\mathfrak{a}, let

iโŠฅ={xโˆˆg;ฮบ(x,y)=0,โˆ€yโˆˆi}.\mathfrak{i}^\perp=\{x\in\mathfrak{g};\kappa(x,y)=0,\forall y\in\mathfrak{i}\}.

For arbitrary zโˆˆg,iโˆˆiz\in\mathfrak{g},i\in\mathfrak{i} and aโˆˆiโŠฅa\in\mathfrak{i}^\perp, ฮบ([z,a],i)=โˆ’ฮบ(a,[z,i])=0\kappa([z,a],i)=-\kappa(a,[z,i])=0 (This is by (Kill.B)). Then iโŠฅ\mathfrak{i}^\perp is an ideal. And again by Cartanโ€™s criterion, we have iโˆฉiโŠฅ=0\mathfrak{i}\cap\mathfrak{i}^\perp=0. Since ฮบ\kappa is non-degenerate, we have g=iโŠ•iโŠฅ\mathfrak{g}=\mathfrak{i}\oplus\mathfrak{i}^\perp.

Then we do induction on the dimension of g\mathfrak{g}. Suppose when dimโกg<n\dim\mathfrak{g}<n, it can be decomposed into direct sum of simple Lie algebras. When dimโกg=n\dim\mathfrak{g}=n, take m\mathfrak{m} to be the minimal non-zero ideal of g\mathfrak{g}, then g=mโŠ•mโŠฅ\mathfrak{g}=\mathfrak{m}\oplus\mathfrak{m}^\perp. Assume tโŠดm\mathfrak{t}\trianglelefteq\mathfrak{m} is a non-trivial ideal of m\mathfrak{m}, then

[g,t]=[m,t]+[mโŠฅ,t]=t+0=t.[\mathfrak{g},\mathfrak{t}]=[\mathfrak{m,t}]+[\mathfrak{m}^\perp,\mathfrak{t}]=\mathfrak{t}+0=\mathfrak{t}.

Hence nโŠดg\mathfrak{n}\trianglelefteq\mathfrak{g}, which contradicts to the minimality of m\mathfrak{m}. And [m,m][\mathfrak{m},\mathfrak{m}] by radโก(g)=0\operatorname{rad}(\mathfrak{g})=0, then m\mathfrak{m} is a simple Lie algebra. And mโŠฅ\mathfrak{m}^\perp is semisimple, because radโก(g)โŠดradโก(g)=0\operatorname{rad}(\mathfrak{g})\trianglelefteq\operatorname{rad}(\mathfrak{g})=0. Then by assumption, mโŠฅ\mathfrak{m}^\perp can be written as direct sum of simple Lie algebras since dimโกmโŠฅ<n\dim\mathfrak{m}^\perp<n. Therefore, g\mathfrak{g} can be decomposed as direct sum of simple Lie algebras.

Here, we proved the characterization of semisimplicity. Formally, the following 3 are equivalent to each other,

  1. g=โจisi\mathfrak{g}=\bigoplus_{i}\mathfrak{s}_i, where si\mathfrak{s}_i are simple; (Semi.A)
  2. radโก(g)=0\operatorname{rad}(\mathfrak{g})=0; (Semi.B)
  3. the Killing form ฮบ\kappa is non-degenerate on g\mathfrak{g}. (Semi.C)

The equivalence of (Semi.C) is given by sandwiching it between (Semi.A) and (Semi.B). In some textbooks, (Semi.B) is used as the standard definition of semisimplicity, but here we use (Semi.A) because of the clarity of it.

Jordan Decomposition of Simple Lie Algebras

Now, we have found the right object to be classified. That is the complex simple Lie algebras.

How to classify something linear? Maybe we need to recall the Jordan canonical form, which is a good way to see through things.

Given a square matrix AA, it can be written as

A=[A1โ‹ฑAn]A= \begin{bmatrix} A_{1} &&\\ &\ddots&\\ &&A_{n}\\ \end{bmatrix}

under some basis, where

Ai=[ฮปi1ฮปi1โ‹ฑโ‹ฑฮปi1ฮปi]=[ฮปiโ‹ฑฮปi]+[01โ‹ฑโ‹ฑ010]A_i= \begin{bmatrix} \lambda_i &1&\\ &\lambda_i&1\\ &&\ddots&\ddots\\ &&&\lambda_i&1\\ &&&&\lambda_i \end{bmatrix}= \begin{bmatrix} \lambda_i\\ &\ddots\\ &&\lambda_i \end{bmatrix} +\begin{bmatrix} 0&1\\ &\ddots&\ddots\\ &&0&1\\ &&&0 \end{bmatrix}

and ฮปi\lambda_i is the eigenvalue of AiA_i.

Then, the linear operator A\mathscr{A} corresponding to AA can be decomposed as

A=Ad+An,\mathscr{A}=\mathscr{A}_d+\mathscr{A}_n,

where Ad\mathscr{A}_d is diagonalizable and An\mathscr{A}_n is nilpotent (means some mโˆˆNm\in\mathbb{N} be such that Anm=0\mathscr{A}_n^m=0). Moreover, we always have AdAn=AnAd\mathscr{A}_d\mathscr{A}_n=\mathscr{A}_n\mathscr{A}_d.

Here, we can do the same thing on Lie algebras. Given a xโˆˆgx\in\mathfrak{g}, adโกx\operatorname{ad}x is a complex matrix, then it can be decomposed into (adโกx)=(adโกx)d+(adโกx)n(\operatorname{ad}x)=(\operatorname{ad}x)_d+(\operatorname{ad}x)_n. Then, we hope we can find x=xd+xnx=x_d+x_n where xd,xnโˆˆgx_d,x_n\in\mathfrak{g} and (adโกx)d=adโกxd,(adโกx)n=adโกxn(\operatorname{ad}x)_d=\operatorname{ad}x_d,(\operatorname{ad}x)_n=\operatorname{ad}x_n.

To do such thing, we need some knowledge of derivation. Watch, start from the Jacobiโ€™s identity,

[x,[y,z]]+[y,[z,x]]+[z,[x,y]]=0[x,[y,z]]=[[x,y],z]+[y,[x,z]](adโกx)[y,z]=[(adโกx)y,z]+[y,(adโกx)z].\begin{aligned} [x,[y,z]]+[y,[z,x]]+[z,[x,y]]&=0\\ [x,[y,z]]&=[[x,y],z]+[y,[x,z]]\\ (\operatorname{ad}x)[y,z]&=[(\operatorname{ad}x)y,z]+[y,(\operatorname{ad}x)z].\\ \end{aligned}

If something has such identity, we call it a derivation. Formally, an operator ฮด\delta is called a derivation if and only if

ฮด[y,z]=[ฮดy,z]+[y,ฮดz].\delta[y,z]=[\delta y,z]+[y,\delta z].

All derivations on g\mathfrak{g}, is written as derโกg\operatorname{der}\mathfrak{g}. Apparently, adโกgโŠ‚derโกg\operatorname{ad}\mathfrak{g}\subset\operatorname{der}\mathfrak{g}.

For some ฮดโˆˆderโกg\delta\in\operatorname{der}\mathfrak{g}, it can be decomposed into ฮด=ฮดd+ฮดn\delta=\delta_d+\delta_n. Here, we can define generalized eigenspace of eigenvalue ฮป\lambda of ฮด\delta as (here II is the identity)

gฮป:={xโˆˆg;ฮดdx=ฮปx}.\mathfrak{g}_\lambda:=\{x\in\mathfrak{g};\delta_dx=\lambda x\}.

This is equivalent to say that

gฮป={xโˆˆg;(ฮดโˆ’ฮปI)N=0,ย forย someย bigย enoughย N}.\mathfrak{g}_\lambda=\{x\in\mathfrak{g};(\delta-\lambda I)^N=0,\text{ for some big enough }N\}.

Itโ€™s clear that there is a decomposition g=โจฮปgฮป\mathfrak{g}=\bigoplus_\lambda\mathfrak{g}_\lambda (in the sense of vector space).

Now for xโˆˆgฮป,yโˆˆgฮผx\in\mathfrak{g}_\lambda,y\in\mathfrak{g}_\mu, there must be some p,qโˆˆNp,q\in\mathbb{N} such that (ฮดโˆ’ฮปI)px=0,(ฮดโˆ’ฮผI)qx=0(\delta-\lambda I)^px=0,(\delta-\mu I)^qx=0. Now, compute

(ฮดโˆ’(ฮป+ฮผ)I)[x,y]=ฮด[x,y]โˆ’(ฮป+ฮผ)[x,y]=[ฮดx,y]+[x,ฮดy]โˆ’[ฮปx,y]โˆ’[x,ฮผy]=[(ฮดโˆ’ฮปI)x,y]+[x,(ฮดโˆ’ฮผI)y].\begin{aligned} (\delta-(\lambda+\mu)I)[x,y]&=\delta[x,y]-(\lambda+\mu)[x,y]\\ &=[\delta x,y]+[x,\delta y]-[\lambda x,y]-[x,\mu y]\\ &=[(\delta-\lambda I)x,y]+[x,(\delta-\mu I)y]. \end{aligned}

Continue such procedure, by the parody of binomial theorem, we have

(ฮดโˆ’(ฮป+ฮผ)I)n[x,y]=โˆ‘k=0n(nk)[(ฮดโˆ’ฮป)kx,(ฮดโˆ’ฮผ)nโˆ’ky].(\delta-(\lambda+\mu)I)^n[x,y]=\sum_{k=0}^n\binom{n}{k}[(\delta-\lambda)^kx,(\delta-\mu)^{n-k}y].

We take n=p+qn=p+q and check every term in this formula, then (ฮดโˆ’(ฮป+ฮผ)I)n[x,y]=0(\delta-(\lambda+\mu)I)^n[x,y]=0. Hence, we have [x,y]โˆˆgฮป+ฮผ[x,y]\in\mathfrak{g}_{\lambda+\mu}. (Deco.A)

Now, for arbitrary ฮดโˆˆderโกg\delta\in\operatorname{der}\mathfrak{g},

ฮดd[x,y]=(ฮป+ฮผ)[x,y]=[ฮปx,y]+[x,ฮดy]=[ฮดdx,y]+[x,ฮดdy].\delta_d[x,y]=(\lambda+\mu)[x,y]=[\lambda x,y]+[x,\delta y]=[\delta_d x,y]+[x,\delta_d y].

Hence, we know that ฮดd\delta_d is still a derivation, and naturally follows that, ฮดn\delta_n is still a derivation.

Now, we use a functional analytical method to prove the fact that adโกg=derโกg\operatorname{ad} \mathfrak{g}=\operatorname{der}\mathfrak{g} for g\mathfrak{g} is semisimple. (Deco.B)

For arbitrary ฮดโˆˆderโกg,yโˆˆg\delta\in\operatorname{der}\mathfrak{g},y\in\mathfrak{g}, we define a linear functional that f(y)=trโก(ฮดโˆ˜adโกy)f(y)=\operatorname{tr}(\delta\circ\operatorname{ad} y). Since ฮบ\kappa is non-degenerate on g\mathfrak{g}, then there exists unique zโˆˆgz\in\mathfrak{g} such that f(y)=ฮบ(z,y)f(y)=\kappa(z,y). Let ฮดโ€ฒ=ฮดโˆ’adโกz\delta'=\delta-\operatorname{ad}z, then ฮดโ€ฒ\delta' is a derivation and

trโก(ฮดโ€ฒโˆ˜adโกy)=trโก((ฮดโˆ’adโกz)โˆ˜adโกy)=trโก(ฮดโˆ˜adโกy)โˆ’trโก(adโกzโˆ˜adโกy)=f(y)โˆ’ฮบ(z,y)=0.\begin{aligned} \operatorname{tr}(\delta'\circ\operatorname{ad}y)&=\operatorname{tr}((\delta-\operatorname{ad}z)\circ\operatorname{ad}y)\\ &=\operatorname{tr}(\delta\circ\operatorname{ad}y)-\operatorname{tr}(\operatorname{ad}z\circ\operatorname{ad}y)\\ &=f(y)-\kappa(z,y)=0. \end{aligned}

For any x,yโˆˆgx,y\in\mathfrak{g},

Notice that

ฮดโ€ฒ[x,y]=[ฮดโ€ฒx,y]+[x,ฮดโ€ฒy]โ‡’(ฮดโˆ˜adโกx)y=adโก(ฮดโ€ฒx)(y)+adโกx(ฮดโ€ฒy)โ‡’adโก(ฮดโ€ฒx)(y)=(ฮดโ€ฒโˆ˜adโกx)yโˆ’(adโกxโˆ˜ฮดโ€ฒ)yโ‡’adโก(ฮดโ€ฒx)=ฮดโ€ฒโˆ˜adโกxโˆ’adโกxโˆ˜ฮดโ€ฒ,\begin{aligned} &\delta'[x,y]=[\delta' x,y]+[x,\delta' y]\\ \Rightarrow\quad&(\delta\circ\operatorname{ad}x)y=\operatorname{ad}(\delta' x)(y)+\operatorname{ad}x(\delta' y)\\ \Rightarrow\quad&\operatorname{ad}(\delta' x)(y)=(\delta'\circ\operatorname{ad}x)y-(\operatorname{ad}x\circ\delta')y\\ \Rightarrow\quad&\operatorname{ad}(\delta' x)=\delta'\circ\operatorname{ad}x-\operatorname{ad}x\circ\delta', \end{aligned}

therefore,

ฮบ(ฮดโ€ฒx,y)=trโก(adโก(ฮดโ€ฒx)โˆ˜adโกy)=trโก(ฮดโ€ฒโˆ˜adโกxโˆ˜adโกy)โˆ’trโก(adโกxโˆ˜ฮดโ€ฒโˆ˜adโกy)=0.\begin{aligned} \kappa(\delta'x,y)&=\operatorname{tr}(\operatorname{ad}(\delta'x)\circ\operatorname{ad}y)\\ &=\operatorname{tr}(\delta'\circ\operatorname{ad}x\circ\operatorname{ad}y)-\operatorname{tr}(\operatorname{ad}x\circ\delta'\circ\operatorname{ad}y)\\ &=0. \end{aligned}

Hence, ฮดโ€ฒ=0\delta'=0, so we now know adโกz=ฮด\operatorname{ad}z=\delta. This is equivalent to say that adโกg=derโกg\operatorname{ad}\mathfrak{g}=\operatorname{der}\mathfrak{g}. Since (adโกx)dโˆˆderโกg(\operatorname{ad}x)_d\in\operatorname{der}\mathfrak{g}, then there is some xdโˆˆgx_d\in\mathfrak{g} such that (adโกx)d=adโก(xd)(\operatorname{ad}x)_d=\operatorname{ad}(x_d). Now, we can do the Jordan decomposition on the simple Lie algebra (actually by the process, we see it is the same on semisimple ones) that

x=xd+xn.x=x_d+x_n.

Cartan Subalgebra

We say an element xโˆˆgx\in\mathfrak{g} is diagonalizable if xd=xx_d=x, and we say it is nilpotent if xn=xx_n=x. The toral subalgebra is defined as a subalgebra tโŠ‚g\mathfrak{t}\subset\mathfrak{g} that every xโˆˆtx\in\mathfrak{t} is diagonalizable. And we can define a thing called Cartan subalgebra, which is named after ร‰lie Cartan, that h\mathfrak{h} is a maximal toral subalgebra.

A thing interesting can be given here is that h\mathfrak{h} can be diagonalized simultaneously. First, we prove that h\mathfrak{h} is abelian, i.e. for any x,yโˆˆhx,y\in\mathfrak{h}, adโกxโˆ˜adโกy=adโกyโˆ˜adโกx\operatorname{ad}x\circ\operatorname{ad}y=\operatorname{ad}y\circ\operatorname{ad}x. To prove this, is equivalent to prove that [x,y]=0[x,y]=0.

We constrain xx on h\mathfrak{h}, then adโกxโˆฃh\operatorname{ad}x|_\mathfrak{h} is still diagonalizable. Suppose {ฮปi}iโˆˆI\{\lambda_i\}_{i\in I} are eigenvalues of it and {hi}iโˆˆI\{h_i\}_{i\in I} are corresponding eigenvectors. Then [x,hi]=ฮปihi[x,h_i]=\lambda_i h_i. Now,

(adโกhi)2x=(adโกhi)[hi,x]=(adโกhi)(โˆ’[hi,x])=(adโกhi)(ฮปihi)=โˆ’ฮปi[hi,hi]=0.\begin{aligned} (\operatorname{ad} h_i)^2x&=(\operatorname{ad}h_i)[h_i,x]\\ &=(\operatorname{ad}h_i)(-[h_i,x])\\ &=(\operatorname{ad}h_i)(\lambda_ih_i)\\ &=-\lambda_i[h_i,h_i]=0. \end{aligned}

Hence, we see xx resides in the generalized eigenspace of hih_i where the eigenvalue is 00. However, we know that hih_i is diagonalized. Hence xx resides in the eigenspace of hih_i where eigenvalue is 00. Therefore, [hi,x]=0โ‹…x=0[h_i,x]=0\cdot x=0. Thus we know [x,hi]=0[x,h_i]=0 for all hiโˆˆhh_i\in\mathfrak{h}. So [x,y]=0[x,y]=0.

Now, since adโกh\operatorname{ad}\mathfrak{h} is abelian, while everything inside can be diagonalized, then we see adโกh\operatorname{ad}\mathfrak{h} can be diagonalized by simple linear algebra.

Now, for each xโˆˆhx\in\mathfrak{h}, it gives a decomposition of g=โจฮปigฮปi\mathfrak{g}=\bigoplus_{\lambda_i}\mathfrak{g}_{\lambda_i} in the sense of vector space. Such decomposition is the same for all xโˆˆhx\in\mathfrak{h}, hence we have a simultaneous eigenspace decomposition that

g=โจฮฑโˆˆhโˆ—gฮฑ\mathfrak{g}=\bigoplus_{\alpha\in\mathfrak{h}^*}\mathfrak{g}_\alpha

where

gฮฑ={xโˆˆg;[h,x]=ฮฑ(h)x,โˆ€hโˆˆh}.\mathfrak{g}_\alpha=\{x\in\mathfrak{g};[h,x]=\alpha(h)x,\forall h\in\mathfrak{h}\}.

For xโˆˆg,hโˆˆhx\in\mathfrak{g},h\in\mathfrak{h}, by the maximality of Cartan subalgebra h\mathfrak{h}, we know [h,x]=0โ‡”xโˆˆh[h,x]=0\Leftrightarrow x\in\mathfrak{h}. Then we can write ฮฆ={ฮฑโˆˆhโˆ—โˆ–0;gฮฑ=ฬธ0}\Phi=\{\alpha\in \mathfrak{h}^*\setminus 0;\mathfrak{g}_\alpha\not=0\}, and

g=hโŠ•โจฮฑโˆˆฮฆgฮฑ.\mathfrak{g}=\mathfrak{h}\oplus\bigoplus_{\alpha\in\Phi}\mathfrak{g}_\alpha.

ฮฆ\Phi

In this chapter, we will dive into ฮฆ\Phi to study its structure, and we will see it is a root system which has been studied throughoutly in the previous post.

ฮฆ\Phi, Canto I, the real Hilbert space

We first prove that ฮบโˆฃhร—h\kappa|_{\mathfrak{h}\times\mathfrak{h}} is still non-degenerate. For arbitrary hโˆˆh,xโˆˆgฮฑh\in\mathfrak{h},x\in\mathfrak{g}_\alpha, by (Deco.A),

adโก(h)โˆ˜adโก(x):gฮฒโ†’gฮฑ+ฮฒ.\operatorname{ad}(h)\circ\operatorname{ad}(x):\mathfrak{g}_\beta\rightarrow\mathfrak{g}_{\alpha+\beta}.

Hence, trโก(adโก(h)โˆ˜adโก(x))=0\operatorname{tr}(\operatorname{ad}(h)\circ\operatorname{ad}(x))=0. Then we see if ฮบ(h,h)=0\kappa(h,\mathfrak{h})=0, then ฮบ(h,g)=0\kappa(h,\mathfrak{g})=0. Then by non-degenerateness of ฮบ\kappa on g\mathfrak{g}, we have h=0h=0.

Now, we can see ฮบ\kappa gives a linear isomorphism that

t:hโ†’hโˆ—,t(h)=ฮบ(h,โ‹…).t:\mathfrak{h}\rightarrow\mathfrak{h}^*,t(h)=\kappa(h,\cdot).

Then for any ฮฑโˆˆฮฆ\alpha\in\Phi, there is a tโˆ’1(ฮฑ)โˆˆht^{-1}(\alpha)\in\mathfrak{h}. Now, we define

(ฮฑ,ฮฒ):=ฮบ(tโˆ’1(ฮฑ),tโˆ’1(ฮฒ)).(\alpha,\beta):=\kappa(t^{-1}(\alpha),t^{-1}(\beta)).

It is bilinear and non-degenerate.

Now we take E=spanโกR(ฮฆ)E=\operatorname{span}_\mathbb{R}(\Phi), then

(ฮฑ,ฮฑ)=ฮบ(tโˆ’1(ฮฑ),tโˆ’1(ฮฑ))=trโก(adโก(tโˆ’1(ฮฑ))2)=โˆ‘ฮฒโˆˆฮฆdimโก(gฮฒ)ฮฒ(tโˆ’1(ฮฑ))2=โˆ‘ฮฒโˆˆฮฆฮฒ(tโˆ’1(ฮฑ))2=โˆ‘ฮฒโˆˆฮฆฮบ(tโˆ’1(ฮฒ),tโˆ’1(ฮฑ))2=โˆ‘ฮฒโˆˆฮฆ(ฮฒ,ฮฑ)2,\begin{aligned} (\alpha,\alpha)&=\kappa(t^{-1}(\alpha),t^{-1}(\alpha))\\ &=\operatorname{tr}(\operatorname{ad}(t^{-1}(\alpha))^2)\\ &=\sum_{\beta\in\Phi}\dim(\mathfrak{g}_\beta)\beta(t^{-1}(\alpha))^2\\ &=\sum_{\beta\in\Phi}\beta(t^{-1}(\alpha))^2\\ &=\sum_{\beta\in\Phi}\kappa(t^{-1}(\beta),t^{-1}(\alpha))^2\\ &=\sum_{\beta\in\Phi}(\beta,\alpha)^2, \end{aligned}

which is a sum of square of real numbers. Hence, we see (โ‹…,โ‹…)(\cdot,\cdot) is an inner product on EE.

ฮฆ\Phi, Canto II, sl(2,C)\mathfrak{sl}(2,\mathbb{C})-triple and some prerequisites

For each ฮฑโˆˆฮฆ\alpha\in\Phi, assume โˆ’ฮฑโˆˆฬธฮฆ-\alpha\not\in\Phi, then consider non-zero xโˆˆgฮฑx\in\mathfrak{g}_\alpha, when

  1. yโˆˆgฮฒy\in\mathfrak{g}_\beta where ฮฒโˆˆฮฆ\beta\in\Phi, then ฮฑ+ฮฒ=ฬธ0\alpha+\beta\not=0. We then see adโกxโˆ˜adโกy:gฮณโ†’gฮณ+ฮฑ+ฮฒ\operatorname{ad}x\circ\operatorname{ad}y:\mathfrak{g}_{\gamma}\rightarrow\mathfrak{g}_{\gamma+\alpha+\beta}. Since they are different components, trโก(adโกxโˆ˜adโกy)=0\operatorname{tr}(\operatorname{ad}x\circ\operatorname{ad}y)=0. Hence ฮบ(x,y)=0\kappa(x,y)=0.
  2. yโˆˆhy\in \mathfrak{h}, then adโกxโˆ˜adโกy:gฮณโ†’gฮณ+ฮฑ\operatorname{ad}x\circ\operatorname{ad}y:\mathfrak{g}_{\gamma}\rightarrow\mathfrak{g}_{\gamma+\alpha}. Still different components, so ฮบ(x,y)=0\kappa(x,y)=0.

Since g=hโŠ•โจฮฒโˆˆฮฆgฮฒ\mathfrak{g}=\mathfrak{h}\oplus\bigoplus_{\beta\in\Phi}\mathfrak{g}_\beta, we have ฮบ(x,g)=0\kappa(x,\mathfrak{g})=0. However, we know x=ฬธ0x\not=0, which is a contradiction. Hence, we know โˆ’ฮฑโˆˆฮฆ-\alpha\in\Phi.

Similarly, we have for xโˆˆgฮฑx\in\mathfrak{g}_\alpha, ฮบ(x,gโˆ’ฮฑ)=ฬธ0\kappa(x,\mathfrak{g}_{-\alpha})\not=0. We then can choose eโˆˆgฮฑ,fโ€ฒโˆˆgโˆ’ฮฑe\in\mathfrak{g}_\alpha,f'\in\mathfrak{g}_{-\alpha} such that ฮบ(e,fโ€ฒ)=ฬธ0\kappa(e,f')\not=0 and take

f=2(ฮฑ,ฮฑ)ฮบ(e,fโ€ฒ)fโ€ฒ.f=\frac{2}{(\alpha,\alpha)\kappa(e,f')}f'.

Then for any hโˆˆhh\in\mathfrak{h},

ฮบ([e,f],h)=ฮบ(e,[f,h])=ฮบ(e,โˆ’[h,f])=ฮบ(e,ฮฑ(h)f)=ฮฑ(h)ฮบ(e,f)=ฮบ(tโˆ’1(ฮฑ),h)ฮบ(e,f)=ฮบ(ฮบ(e,f)tโˆ’1(ฮฑ),h).\begin{aligned} \kappa([e,f],h)&=\kappa(e,[f,h])\\ &=\kappa(e,-[h,f])\\ &=\kappa(e,\alpha(h)f)\\ &=\alpha(h)\kappa(e,f)\\ &=\kappa(t^{-1}(\alpha),h)\kappa(e,f)\\ &=\kappa(\kappa(e,f)t^{-1}(\alpha),h). \end{aligned}

By the non-degenerateness of ฮบ\kappa on h\mathfrak{h}, we have [e,f]=ฮบ(e,f)tโˆ’1ฮฑ[e,f]=\kappa(e,f)t^{-1}\alpha.

Now,

[e,f]=ฮบ(e,2(ฮฑ,ฮฑ)ฮบ(eฮฑ,fโ€ฒ)f)tโˆ’1(ฮฑ)=2(ฮฑ,ฮฑ)tโˆ’1ฮฑ=:h[e,f]=\kappa\left(e,\frac{2}{(\alpha,\alpha)\kappa(e_\alpha,f')}f\right)t^{-1}(\alpha)=\frac{2}{(\alpha,\alpha)}t^{-1}\alpha=:h

While eโˆˆgฮฑe\in \mathfrak{g}_\alpha, then [h,e]=ฮฑ(h)e=2ฮฑ(tโˆ’1ฮฑ)e(ฮฑ,ฮฑ)=2(ฮฑ,ฮฑ)e(ฮฑ,ฮฑ)=2e[h,e]=\alpha(h)e=\frac{2\alpha(t^{-1}\alpha)e}{(\alpha,\alpha)}=\frac{2(\alpha,\alpha)e}{(\alpha,\alpha)}=2e. Similarly, we have [h,f]=โˆ’2f[h,f]=-2f. Therefore, we have {e,f,h}\{e,f,h\} satisfies

[h,e]=2e,[h,f]=โˆ’2f,[e,f]=h.[h,e]=2e,[h,f]=-2f,[e,f]=h.

This give a basis for a special Lie algebra, which is called sl(2,C)\mathfrak{sl}(2,\mathbb{C}).

Here, we introduce some results of representations of sl(2,C)\mathfrak{sl}(2,\mathbb{C}). Since these are totally irrelevant algebraic results, we wonโ€™t prove them here, but only give as machineries. For more information, refer to [Erdmannโ€“Wildon, Chapter 8].

Every finite dimensional module of sl(2,C)\mathfrak{sl}(2,\mathbb{C}) is totally reducible. And every irreducible module of sl(2,C)\mathfrak{sl}(2,\mathbb{C}) is determined by a non-negative integer dd, written by VdV_d, with dimension d+1d+1, and there is a basis {v0,v1,โ‹ฏโ€‰,vd}\{v_0,v_1,\cdots,v_d\} such that

hvk=(dโˆ’2k)vk,evk=(dโˆ’k+1)vkโˆ’1,fvk=(k+1)vk+1,h v_k=(d-2k)v_k,\quad e v_k=(d-k+1)v_{k-1},\quad fv_k=(k+1)v_{k+1},

where vโˆ’1=vd+1:=0v_{-1}=v_{d+1}:=0.

Therefore, we know that

  1. Every eigenvalue of hh is integer, and is symmetric about zero; (Repr.A)
  2. In any finite dimensional sl(2,C)\mathfrak{sl}(2,\mathbb{C})-module, if vv is an eigenvector of hh with eigenvalue greater than 00, then fv=ฬธ0fv\not=0. (Repr.B)

ฮฆ\Phi, Canto III, verification of root system

Recall, the knowledge of root systems. In this chapter, we will give the fact that ฮฆ\Phi is actually a root system.

We suppose Eโ‰…RrE\cong \mathbb{R}^r, and (โ‹…,โ‹…)(\cdot,\cdot) is a positive definite inner product on it. A finite subset RโŠ‚Eโˆ–{0}R\subset E\setminus\{0\} is called an abstract root system, if

  1. (Root.A) EE is spanned by RR;
  2. (Root.B) โˆ€ฮฑ,ฮฒโˆˆR\forall\alpha,\beta\in R, nฮฑฮฒ=2(ฮฑ,ฮฒ)(ฮฑ,ฮฑ)n_{\alpha\beta}=2\frac{(\alpha,\beta)}{(\alpha,\alpha)} is an integer;
  3. (Root.C) If ฮฑ,ฮฒโˆˆR\alpha,\beta\in R, then sฮฒ(ฮฑ)=ฮฑโˆ’nฮฒฮฑฮฒโˆˆRs_\beta(\alpha)=\alpha-n_{\beta\alpha}\beta\in R.

(Root.A) is vacuous.

To verify (Root.B), define the ฮฑ\alpha-string through ฮฒ\beta as

Mฮฒฮฑ=โจkโˆˆZgฮฒ+kฮฑ.M_\beta^\alpha=\bigoplus_{k\in\mathbb{Z}}\mathfrak{g}_{\beta+k\alpha}.

By (Deco.A), itโ€™s easy to verify that this is a module of sl(2,C)\mathfrak{sl}(2,\mathbb{C}). Then the eigenvalue of hh is

(ฮฒ+kฮฑ)h=ฮฒ(h)+2k,(\beta+k\alpha)h=\beta(h)+2k,

(here ฮฑ(h)=2\alpha(h)=2 because [h,e]=2e[h,e]=2e and eโˆˆgฮฑe\in\mathfrak{g}_\alpha).

Since we know the eigenvalue of hh is always integer, then ฮฒ(h)\beta(h) is integer.

Now,

ฮฒ(h)=ฮฒ(2tโˆ’1ฮฑ(ฮฑ,ฮฑ))=2ฮบ(tโˆ’1ฮฒ,tโˆ’1ฮฑ)(ฮฑ,ฮฑ)=2(ฮฑ,ฮฒ)(ฮฑ,ฮฑ)=nฮฑฮฒ,\beta(h)=\beta(\frac{2t^{-1}\alpha}{(\alpha,\alpha)})=\frac{2\kappa(t^{-1}\beta,t^{-1}\alpha)}{(\alpha,\alpha)}=\frac{2(\alpha,\beta)}{(\alpha,\alpha)}=n_{\alpha\beta},

which gives the verification of (Root.B).

To verify (Root.C), we need to compute ฮฒโˆ’nฮฑฮฒฮฑ\beta-n_{\alpha\beta}\alpha. The eigenvalue of hh is ฮฒ(h)+2k\beta(h)+2k, while in V(d)V(d), the eigenvalue of hh is dโˆ’2kd-2k. Then we see the set K={k;gฮฒ+kฮฑ}=ฬธ0K=\{k;\mathfrak{g}_{\beta+k\alpha}\}\not=0 is continuous, i.e. K={โˆ’p,โˆ’p+1,โ‹ฏโ€‰,q}K=\{-p,-p+1,\cdots,q\}. By (Repr.A), we see ฮฒ(h)+2q=โˆ’(ฮฒ(h)โˆ’2p)\beta(h)+2q=-(\beta(h)-2p), so nฮฑฮฒ=pโˆ’qn_{\alpha\beta}=p-q. Since โˆ’qโ‰คqโˆ’pโ‰คq-q\le q-p\le q, then sฮฑ(ฮฒ)=ฮฒโˆ’(pโˆ’q)ฮฑโˆˆฮฆs_\alpha(\beta)=\beta-(p-q)\alpha\in\Phi.

ฮฆ\Phi, Canto IV, reducedness and irreduciblity

In the post of root system, what we have proved is the reduced and irreducible root systems. Hence we now have to prove that ฮฆ\Phi is reduced and irreducible.

We now see g\mathfrak{g} as an sl(2,C)={e,f,h}\mathfrak{sl}(2,\mathbb{C})=\{e,f,h\} deduced ฮฑโˆˆฮฆ\alpha\in\Phi. Assume that dimโกgฮฑโ‰ฅ2\dim\mathfrak{g}_\alpha\ge2. ฮบ(f,โ‹…):gฮฑโ†’C\kappa(f,\cdot):\mathfrak{g}_\alpha\rightarrow\mathbb{C} then is a linear functional. Since dimโกgฮฑโ‰ฅ2\dim\mathfrak{g}_\alpha\ge 2, kerโกฮบ(f,โ‹…)โˆฉgฮฑ=ฬธ0\ker \kappa(f,\cdot)\cap\mathfrak{g}_\alpha\not=0, take 0=ฬธxโˆˆgฮฑ0\not= x\in\mathfrak{g}_\alpha such that ฮบ(f,x)=0\kappa(f,x)=0. Use the same procedure we used in Canto II, we can replace ee by xx and get [f,x]=0[f,x]=0. However, xx is the eigenvector of hh, ฮฑ(h)=2\alpha(h)=2 and f(x)=[f,x]=0f(x)=[f,x]=0, which contradicts to (Repr.B).

Therefore, we see dimโกgฮฑ=1\dim\mathfrak{g}_\alpha=1.

And assume g2ฮฑ=ฬธ0\mathfrak{g}_{2\alpha}\not=0, take 0=ฬธyโˆˆg2ฮฑ0\not=y\in\mathfrak{g}_{2\alpha}, then h(y)=2ฮฑ(h)y=4yh(y)=2\alpha(h)y=4y. We now decompose g\mathfrak{g} into direct sum irreducible sl(2,C)\mathfrak{sl}(2,\mathbb{C})-modules, then yy is in some DdD_d, and there must be an eigenvalue 44, then there must be some eigenvalue 22, which contributes a component of gฮฑ\mathfrak{g}_\alpha. Meanwhile, we know that spanโก{e,f,h}\operatorname{span}\{e,f,h\} is a submodule of g\mathfrak{g} with eigenvalues {โˆ’2,0,2}\{-2,0,2\}. However, eโˆˆgฮฑe\in\mathfrak{g}_\alpha, therefore dimโกgฮฑโ‰ฅ2\dim\mathfrak{g}_\alpha\ge 2. But this is a contradiction. Hence 2ฮฑโˆˆฬธฮฆ2\alpha\not\in\Phi. Similar for kฮฑโˆˆฬธฮฆk\alpha\not\in\Phi. Hence ฮฆ\Phi is a reduced root system.

Assume that ฮฆ\Phi is not irreducible. Then there is a decomposition that

ฮฆ=ฮฆ1โŠ”ฮฆ2,ย whereย ฮฑโˆˆฮฆ1,ฮฒโˆˆฮฆ2,(ฮฑ,ฮฒ)=0.\Phi=\Phi_1\sqcup\Phi_2,\text{ where }\alpha\in\Phi_1,\beta\in\Phi_2,(\alpha,\beta)=0.

Assume ฮฑ+ฮฒโˆˆฮฆ\alpha+\beta\in\Phi where ฮฑโˆˆฮฆ1,ฮฒโˆˆฮฆ2\alpha\in\Phi_1,\beta\in\Phi_2, W.L.O.G, we have ฮฑ+ฮฒโˆˆฮฆ1\alpha+\beta\in\Phi_1. Then (ฮฑ+ฮฒ,ฮฒ)=0(\alpha+\beta,\beta)=0. However, (ฮฑ,ฮฒ)+(ฮฒ,ฮฒ)>0(\alpha,\beta)+(\beta,\beta)>0, this is a contradiction. Hence [gฮฑ,gฮฒ]โŠ‚gฮฑ+ฮฒ=0[\mathfrak{g}_\alpha,\mathfrak{g}_\beta]\subset\mathfrak{g}_{\alpha+\beta}=0. Similarly, ฮฑโˆ’ฮฒโˆˆฬธฮฆ\alpha-\beta\not\in\Phi.

We define hฮฑ=2tโˆ’1ฮฑ(ฮฑ,ฮฑ)h_\alpha=\frac{2t^{-1}\alpha}{(\alpha,\alpha)}, and similarly for eฮฑ,fฮฑe_\alpha,f_\alpha. Let h1=spanโก{hฮฑ;ฮฑโˆˆฮฆ1}\mathfrak{h}_1=\operatorname{span}\{h_\alpha;\alpha\in\Phi_1\}, h2=spanโก{hฮฑ;ฮฑโˆˆฮฆ2}\mathfrak{h}_2=\operatorname{span}\{h_\alpha;\alpha\in\Phi_2\}, then h=h1โŠ•h2\mathfrak{h}=\mathfrak{h}_1\oplus\mathfrak{h}_2. Define i=h1โŠ•โจฮฑโˆˆฮฆ1gฮฑ=ฬธ0\mathfrak{i}=\mathfrak{h}_1\oplus\bigoplus_{\alpha\in\Phi_1}\mathfrak{g}_\alpha\not=0. Since g=hโŠ•โจฮฑโˆˆฮฆgฮฑ\mathfrak{g}=\mathfrak{h}\oplus\bigoplus_{\alpha\in\Phi}\mathfrak{g}_\alpha, we only need to check every component of g\mathfrak{g},

  1. [h1,i]โŠ‚i[\mathfrak{h}_1,\mathfrak{i}]\subset\mathfrak{i} is obvious.

  2. Take hฮฒโˆˆh2h_\beta\in\mathfrak{h}_2 and ฮฑโˆˆฮฆ1\alpha\in\Phi_1, [hฮฒ,eฮฑ]=ฮฑ(hฮฒ)eฮฑ=nฮฑฮฒeฮฑ=0[h_\beta,e_\alpha]=\alpha(h_\beta)e_\alpha=n_{\alpha\beta}e_\alpha=0. Hence [h2,gฮฑ]=0[\mathfrak{h}_2,\mathfrak{g}_\alpha]=0. Meanwhile, [h2,h1]=0[\mathfrak{h}_2,\mathfrak{h}_1]=0 by h\mathfrak{h} is abelian. Therefore, [h2,i]=0โŠ‚i[\mathfrak{h}_2,\mathfrak{i}]=0\subset\mathfrak{i}.

  3. Take ฮฒโˆˆฮฆ2,ฮฑโˆˆฮฆ1\beta\in\Phi_2,\alpha\in\Phi_1, hฮฒ\mathfrak{h}_\beta, [gฮฒ,hฮฑ]=โˆ’ฮฒ(hฮฑ)gฮฒ=โˆ’nฮฒฮฑgฮฒ=0[\mathfrak{g}_\beta,h_\alpha]=-\beta(h_\alpha)\mathfrak{g}_\beta=-n_{\beta\alpha}\mathfrak{g}_\beta=0. [gฮฒ,gฮฑ]=0[\mathfrak{g}_\beta,\mathfrak{g}_\alpha]=0 is already proved. Hence [gฮฒ,i]=0โŠ‚i[\mathfrak{g}_\beta,\mathfrak{i}]=0\subset\mathfrak{i}.

  4. [gฮฑ,i]โŠ‚i[\mathfrak{g}_\alpha,\mathfrak{i}]\subset\mathfrak{i} is obvious.

Hence, [g,i]=i[\mathfrak{g},\mathfrak{i}]=\mathfrak{i}. Therefore i\mathfrak{i} is an ideal of g\mathfrak{g}. However, this is a contradiction since g\mathfrak{g} is simple. Hence ฮฆ\Phi is irreducible.

Consequence

Now, we have given a way to classify the complex simple Lie algebras.

But, we still have two things to prove, which are the equivalence between different Cartan subalgebra of a given Lie algebra (i.e. they give the same root system), and the recovery of simple Lie algebra from a given root system. Formally speaking, they are the following two theorems.

(Cons.A) Let g\mathfrak{g} be a complex simple Lie algebra and h1\mathfrak{h}_1 and h2\mathfrak{h}_2 are two Cartan subalgebra of g\mathfrak{g}, then there is some ฯ†โˆˆInnโก(g)\varphi\in\operatorname{Inn}(\mathfrak{g}) such that ฯ†(h1)=h2\varphi(\mathfrak h_1)=\mathfrak{h_2}.

(Cons.B, Serreโ€™s Theorem) Let C=(cij)C = (c_{ij}) be the Cartan matrix of a root system, of rank โ„“\ell. Let LL be the complex Lie algebra generated by elements ei,fi,hie_i, f_i, h_i for 1โ‰คiโ‰คโ„“1 \leq i \leq \ell, subject to the following relations:

  1. [hi,hj]=0[h_i, h_j] = 0 for all i,ji, j;

  2. [hi,ej]=cji,ej[h_i, e_j] = c_{ji}, e_j and [hi,fj]=โˆ’cji,fj[h_i, f_j] = -c_{ji}, f_j for all i,ji, j;

  3. [ei,fi]=hi[e_i, f_i] = h_i for each ii, and [ei,fj]=0[e_i, f_j] = 0 if iโ‰ ji \neq j;

  4. (ad;ei)1โˆ’cji(ej)=0(\mathrm{ad}; e_i)^{1-c_{ji}}(e_j) = 0 and (ad;fi)1โˆ’cji(fj)=0(\mathrm{ad}; f_i)^{1-c_{ji}}(f_j) = 0 if iโ‰ ji \neq j.

Then LL is finite-dimensional and semisimple, with Cartan subalgebra H=span{h1,โ€ฆ,hโ„“}H = \mathrm{span}\{h_1, \dots, h_\ell\}, and its root system has Cartan matrix CC.

However, the proof of them wonโ€™t be given in the post. Because I donโ€™t really care about the techniques used in these proof, and these two theorems are so intuitive that I donโ€™t really think them should be proved again.

Hence, this is the end of this article.