1. Introduction
The family of semiconducting transition-metal dichalcogenides (TMDs)
(
Mo, W;
S, Se) appears as one of the most promising platforms for future technological applications [
1,
2,
3,
4]. These materials are indeed characterized by the presence of many degrees of freedom (charge, spin, valley, layer, lattice, …), strongly entangling each other [
5,
6,
7,
8,
9,
10,
11], opening the possibility of tuning the electronic/optical/magnetic/transport properties in a controlled, flexible, and reversible way by external magnetic or electric fields. When isolated at the single-layer level, these compounds present a direct bandgap at the high-symmetry points K, K′ of the Brillouin zone, the
valleys, as shown by photoluminescence probes [
5,
7,
12,
13,
14,
15]. As in graphene, the honeycomb lattice structure is reflected in peculiar optical selection rules, which induce selectively interband optical transitions in a given valley upon circularly polarized light. This scenario prompts the concept of “valleytronics”, i.e., the possibility of manipulating the quantum degrees of freedom selectively in a single valley [
13,
14]. Such optical sensitivity in TMDs has been widely explored in single-layer compounds [
2,
4,
8,
16,
17,
18,
19,
20,
21,
22,
23,
24,
25,
26,
27,
28,
29,
30]. A common tool is the observation of an optical dichroism, i.e., a different optical response upon left-hand or right-hand circularly polarized photons. One striking difference of these compounds with respect to graphene is the presence of a strong spin–orbit coupling, which provides a sizable spin-splitting of the valence band. Within this context, circularly polarized light is selectively coupled not only with a given valley, but also with a given spin, yielding spin-polarized charges in the conduction band along with opposite-spin charges in the valence band [
4,
8,
16,
17,
18,
19,
20,
21,
22,
23,
26,
27,
29,
31,
32,
33,
34,
35,
36].
The entanglement between light polarization and spin population can be conveniently investigated by means of the observation of a finite Kerr or Faraday rotation [
37,
38,
39]. These effects signalize the presence of an intrinsic magnetic field in the sample, and in single-layer TMDs they can be naturally triggered as a result of circularly polarized pumping [
40], which gives rise, as discussed above, to valley-selective and spin-selective particle–hole excitations [
4,
19,
20,
21,
22,
23,
29,
35,
36,
41,
42].
The aim of the present paper is to provide a compact and microscopical investigation of the onset of Kerr/Faraday rotation in a wide energy spectrum of single-layer TMDs. A key point is the identification of the orbital character of the particle–hole excitations at different energies allowed by the optical selection rules, and how it is reflected in the sign and strength of the optical Kerr/Faraday rotation. In order to address this issue in the clearest and controlled way, we introduce a proper generalization of a expansion in a three-band framework. The optical response is computed at the non-interacting level within a fully quantum Kubo approach, where Kerr/Faraday effects are related to the appearing of off-diagonal components of the optical tensor. Different optical features are identified as associated with different particle–hole excitations, and their time evolution in out-of-equilibrium dynamics is discussed. While the exact energies of such optical features should be considered as unrenormalized by exciton binding effects (not considered here), the present work provides an analytical insight into the microscopic onset of the different optical features whose strengths can be conveniently modeled in terms of a unique parameter.
2. Single-Particle Hamiltonian
Single-layer TMDs contain a plane of
M-atoms in a triangular lattice, sandwiched between two layers of chalcogen atoms,
X. The resulting lattice, from a top view, appears as a bipartite hexagonal structure. Many theoretical approaches have been proposed to capture the relevant physics of these materials. As a general rule, effective low-energy models (like
expansions) retain only the relevant conduction and valence bands, mapping the complex band structure onto an effective two-band gapped Dirac model [
7,
43,
44,
45]. On the other hand, tight-binding (TB) models have emphasized the role of the
d-orbitals of the metal atoms, in particular the
,
, and
ones, which provide the main orbital content of both the valence and conduction bands, as well as of a third higher-energy conduction band [
46,
47,
48,
49,
50,
51,
52,
53]. From a microscopical point of view, however, the simple triangular lattice of the
M atoms cannot account for a gapped semiconducting band-structure, and the hybridization with
X atoms has been shown to play a crucial role. The choice between a simplified, semi-analytical approach and the multiband complexity of a fully microscopical TB model is a delicate issue that depends on the physics on which to be focused.
An interesting balance between these two approaches has been provided by Liu et al. in Ref. [
54], where they introduced a compact three-band tight-binding model within the reduced Hilbert space:
where the role of the hybridization of the
d-orbitals of
M atoms with
p-orbitals of
X is modeled, using symmetry arguments, by means of effective complex hopping parameters that break the triangular symmetry, enforcing the physics of a bipartite hexagonal lattice. The resulting one-particle Hamiltonian can be thus written in the full Brillouin zone thus as:
where interatomic hoppings up to third-nearest-neighbor level are included in the TB part,
, providing an excellent agreement with ab initio calculations for the conduction and valence bands close to the K, K′ valleys. The spin–orbit coupling (SOC) is safely approximated with its dominant contribution due to the local spin-diagonal term [
54,
55], which in this basis reads:
where
.
Such a three-band tight-binding model provides an accurate description of the energy dispersion and of the orbital character of the main relevant bands for optical probes with the advantage of a reduced Hilbert space. In the following, we focus on the optical response that is governed by the particle–hole excitations close to the K, K′ valley points. The above model thus also represents a suitable platform for an analytical
expansion that generalizes the previous
approaches [
7,
43,
44,
45] up to the relevant three-orbital space. To this aim, we thus expand
up to the quadratic order in the relative momentum
(
) close to the K, K′. It is also convenient to express the resulting Hamiltonian
,
in the chiral basis:
where
and
. On such a basis, the spin–orbit term appears purely diagonal,
and we can write:
where
and
a is the in-plane
M-
M distance. The full expression of the band parameters in Equations (
6) and (
7) in terms of the original tight-binding parameters is provided in
Appendix A. The total spin-full Hamiltonians at the K and K′
(where
K, K′) contain all the relevant entanglements between spin, valleys, and chirality.
Equation (
8) defines the energy levels at the K point for each spin family. We have explicitly:
At the K point, the energy level
corresponds to the conduction band edge, which results here in spin-degeneracy since we neglect the weak spin–orbit coupling of the
X chalcogen atoms. The valence band at K is associated with the R-chiral state with spin-split energies
, for up and down spin, respectively. The
levels correspond to higher energy states, characterized by a L-chiral symmetry [
47]. A similar energy spectrum is found at the K′ point, but with chiral content exchanged between the valence and the high-energy levels.
The optical selection rules are encoded in the multiband matrix structure of the current operators, which can be straightforwardly computed as derivatives of the single-particle Hamiltonian,
, where
and
K, K′. At the high-symmetry points (
) we obtain:
In similar way, one can derive the chiral current operators
:
In order to evaluate the optical response, Equation (
8) can be numerically diagonalized for finite momentum
to obtain eigenvalues (the band dispersion) and eigenvectors of the resulting eigenstates. This task, however, can be further simplified within a
framework where the band dispersion, at the quadratic order we are interested in, is simply provided by the diagonal terms of Equations (
6) and (
7)
corrected by the second-order corrections resulting from the off-diagonal elements. We thus obtain:
where
and where
The numerical expression of the
band parameters in Equations (23)–(25) is also provided in
Appendix A.
The comparison between the full TB dispersion in the Brillouin zone, from Ref. [
54], and our analytical three-band model expanded around the K, K′ points is displayed in
Figure 1, showing an excellent agreement.
Note that, within the same expansion, the leading order to the current operators is not affected and Equations (16)–(19) are still valid also in the context. As mentioned, the different matrix structure of Equations (16)–(19) enforces the different optical selection rules at the K and K′ points. It is useful to recall that right-circularly polarized (RCP) light couples with the L-chiral current , whereas LCP couples with , according to the relation . Equations (16)–(19) dictate for instance how, under external pumping conditions, absorption of a left-circularly polarized (LCP) photon can induce particle–hole optical excitations at the K point only between the valence band with character and the conduction band with character, or (in the case of electron-doped samples) between the conduction band with character and the high-level conduction band with character. On the other hand, the same absorption of a LCP photon can effectively create particle–hole optical excitations at the K′ point between the valence band with character and the high-level conduction band with character. The selection rules for right-circularly polarized light, coupled with the chiral current , are graphically obtained by reversing the arrow of each particle–hole excitation.
Similar selection rules govern also the virtual processes involved in the optical linear response, whose analytical computation will be addressed in the next section.
3. Optical Response
Equipped with the theoretical tools outlined in
Section 2, we can now compute in the useful chiral basis all the elements of the optical tensor of single-layer transition-metal dichalcogenides through the evaluation of the frequency-dependent current–current response function. In the Matsubara space we have:
where
,
K, K′,
and where
is the chemical potential. Here,
are the internal fermionic frequencies,
is the external bosonic frequency, and
T is the temperature.
The generic elements of the optical conductivity tensor are thus obtained as:
where
Given the three-band structure of our model, the total optical response can be divided (leaving aside the Drude-like intraband terms at low frequencies, which are not relevant in the present analysis) in three interband contributions:
The term
describes optical transitions between the (spin-split) valence band and the lowest energy conduction bands. Due to the spin-splitting of the valence band, this term accounts for the A and B exciton resonances. The term
conveys information about optical transitions between the valence band (with
or
character) and the high-energy conduction band with opposite
character, which has been discussed in detail in Refs. [
26,
27]. Finally, the term
describes optical transitions between the lowest-energy and high-energy conductions bands. In the undoped semiconducting regime, due to the Pauli blocking, this term is usually irrelevant, but it plays a role in photo-induced doping [
26,
27].
All the terms present a similar functional structure, where the relevant role is played by the band population. For sake of simplicity, we focus thus for the moment on the first term,
. Since the system is block-diagonal in the spin-index, one can also formally compute separately the optical response
for each spin and each valley,
. We have for instance:
where
is the occupation factor for a given momentum and band, where, under equilibrium conditions,
. At
, in the semiconducting state
, one obtains
,
, and, for
:
where
is the universal two-dimensional conductivity,
,
and the excitation edges for the A and B excitons, respectively,
and
Using the tight-binding parameters of Ref. [
54], we obtain for MoS
2 eV,
eV,
,
eV, and
eV. For symmetry, we have also
.
In a similar way, one obtains:
and, for
, and
:
The real parts of
and the imaginary parts of
can be thus easily obtained using the Kramers–Kronig relations. Furthermore, using the symmetry relations encoded in the different matrix structures at different valleys, one can recognize that at equilibrium:
(
being here the reversed
spin), so that, under such equilibrium conditions, the contributions of different valleys
sum up for the diagonal terms of the optical tensor, whereas they
cancel out for the off-diagonal ones, in accordance with the time-reversal symmetry. The net result for the whole optical tensor is summarized in
Figure 2a. Similar expressions can be derived for the
and
terms.
4. Non-Equilibrium Optical Response
Equations (
31) and (
38) provide a suitable framework to model the optical response under non-equilibrium conditions, by specifying the proper occupation factors in the presence of photo-induced particle–hole excitations. More specifically, within such a semi-classical approach, we can simulate the effects of an LCP laser pumping tuned at the
frequency by assuming an effective photo-induced charge transfer,
, from the valence to the conduction band. Due to the selected circular polarization, such particle–hole excitations occur only at the K point and, due to the selected frequency in resonance with the A exciton, only for the spin ↑. Since only one valley, with a given spin, will be populated in both the valence and conduction band, the photo-induced charge density can be further parametrized in terms of a characteristic momentum,
, for which
,
(
), obeying the relation:
Typical values of can range up to . For a representative case, , we obtain for MoS2 , and, using Å, Å−1.
Due to the selection rules, only the states at the K point with the proper chirality, corresponding in this case to spin-up, are affected by the pumping. For
, we have thus a
reverse Pauli blocking. The contribution of these states to the optical conductivity reads thus:
where
. In
Figure 2b, we show the effect of the left-circularly polarized pumping on the diagonal and off-diagonal parts of the optical conductivity,
. Due to the reverse Pauli blocking, the real part of the diagonal term, Re
, shows a depletion of spectral intensity close to the A-edge energy. In real samples, in the presence of many-body exciton binding, this depletion appears as a reduction of the A-exciton intensity, as has been experimentally observed many times in reflectivity probes. More striking is the result on the off-diagonal component of the optical tensor,
, where the contributions from spin-up and spin-down transitions and from K and K′ close to the A-resonance do not cancel out anymore, giving rise to a
finite off-diagonal term,
,
On the experimental ground, the onset of a finite off-diagonal component,
, is observed as a optical (Faraday or Kerr) rotation of the transmitted/reflected polarization of the probe, which commonly signalizes the presence of a finite effective magnetic field [
37,
38,
39]. In more detail, we estimate an off-diagonal spectral intensity at the A-exciton energy range:
where the spectral off-diagonal intensity is meant to be integrated in the frequency range
close to the A-resonance. We stress here that, although we employ here a non-interacting model to obtain an analytical insight, the physical origin of such magneto-optical rotation depends uniquely on the selective valley-population enforced by the circularly polarized pumping, yielding a non-equivalent optical response that does not cancel in the K and K′ valleys. This is a quite general mechanism that will not be affected by the formation of localized states when bound excitons form. Within this context, we expect that the step-function spectral shape of Equation (
46), also shown in
Figure 2b, will evolve smoothly in a
-like Lorentzian peak, preserving an integrated intensity,
, which is dictated by the amount of the spin-polarized photo-induced charges in the conduction and valence bands, and hence still scaling with
. The onset of a Faraday/Kerr rotation at the A-exciton energy is consistent with previous experimental and theoretical investigations [
2,
19,
20,
21,
22,
23,
39]. It is worth underlining here that the valley-selective/spin-selective population induced by the circularly polarized pumping is expected within our modeling to give rise to a finite pump-driven Kerr/Faraday rotation also at two further energy scales, which we identify with the so-called C-exciton and with another characteristic energy, which we denote as the D-peak.
We notice a
remarkably strong band-nesting between these two bands close to the K/K′ points. Such a feature has usually been disregarded in the context of TMDs, where the analyses have focused on the nesting properties between the valence and the lowest-energy conduction band [
56,
57,
58]. We relate these transitions with the broad shoulder, commonly denoted as the C-exciton.
Currently, the origin of the remarkable shoulder in the optical conductivity denoted as the C-exciton has not been fully assessed. A mainstream consensus associates this spectral feature with the enhanced optical activity between the valence band and the lowest-energy conduction band along the
-K path, where these two bands are thought to have a parallel energy dispersion (
band nesting) [
56,
57,
58]. Generalizing this idea within the three-band context, we notice a
remarkably strong band-nesting at K/K′ points between the valence and the lowest-energy conduction, governed by the nesting factor
. Prompted by a careful analysis of the first-principle and tight-binding dispersions, we suggest thus a slightly different perspective, where the C-exciton shoulder stems from band-nesting close to the K (K′) point between the valence band with
(
) character and the high-energy conduction band with
(
) character. In our modeling, neglecting the exciton binding energy, we can expect thus
. Such a change of perspective has a deep impact on the predictions about the effects of pumping with circularly polarized light on the magneto-optical properties. In the original scenario, the
points responsible for the band-nesting are located close to the
point along the path
-K. These states do not have a significant chiral character, and as a consequence they have a small spin-splitting and they are weakly entangled with circularly polarized light. On the contrary, in the present context where band-nesting states lie close to the K, K′ points, we can predict a strong chiral character, a different response for spin-up and spin-down charges, a strong entanglement with the circularly polarized light, and a remarkable onset of a off-diagonal component of the optical tensor. Such a picture is consistent with the experimental findings observed in Refs. [
26,
27].
Our three-band model is naturally suited to describe this scenario, where the band-nesting optical transitions responsible for the C-exciton shoulder are accounted for by the
term (see
Figure 3a).
At the same time, the photo-induced charging of the conduction band triggers finite optical transitions between the conduction band itself with
and the high-energy conduction band with
(
) character. The valley-population of these states is also very sensitive to circularly polarized light, and it is expected thus to drive a finite optical rotation at typical energy, neglecting the exciton binding energy,
(
Figure 3a). These latter optical transitions are taken into account by the term
.
The effect of photo-induced pump charging with circularly polarized light in the whole frequency domain can be thus evaluated by considering all the possible contributions,
. The formal structure of each term is very similar to the term
, which we have explicitly evaluated above. Taking into account the slight differences for each term, we obtain thus:
where
and where
and
. The real part,
, is thus obtained by Kramers–Kronig relations. In Equation (
48), we have assumed that
, which is a valid assumption for the W-based transition-metal dichalcogenides WS
2, WSe
2, and WTe
2 (see
Table A1). However, since these materials are very close to the perfect band-nesting limit (
) for these states at the K, K′ point, the sign of
is not a priori determined. For the Mo
family, for instance
(see
Table A1), the analytical expression
should rather read:
The plot of
for MoS
2, with a pump-driven photo-induced charge density
(
), is shown in
Figure 3b, showing how a valley-selective population due to a circularly polarized pumping gives rise to a finite off-diagonal component (and hence to a finite Faraday/Kerr effect), not only at the A-exciton energy,
, but also at the C-exciton energy,
, and at another energy range,
, corresponding to
, net of the exciton binding energy. We predict thus an opposite sign of the off-diagonal component of the optical tensor (and hence an opposite Faraday/Kerr rotation) at the energies
,
compared with the one predicted at the A-exciton energy scale. The absolute intensity of these additional features in the off-diagonal component of the optical tensor is found:
The expression of Equations (52) and (53) is formally identical at Equation (
47) for
, upon changing the proper variables, with the noticeable difference of a factor 2. This is due to the fact that the strength of
at
profits from the presence of the pump-driven charge in
both the conduction and valence bands. On the other hand, the onset of a finite off-diagonal component,
, at the energies
,
is related in an independent way only to the pump-driven charge in the conduction band and in the valence band, respectively. This complex multi-peak structure of the pump-induced Faraday/Kerr effect opens an interesting perspective, not only for characterizing and proving the onset of these effects, but also for harvesting them for multi-frequency operative devices.
5. Time-Dependence
In the previous section, we have shown, using an appropriate three-band model, how a valley-selective population driven by a circularly polarized pump can give rise to an off-diagonal component of the optical tensor, and hence to a finite Faraday/Kerr optical rotation at three characteristic energies, related to the A-exciton, the C-exciton, and to another energy scale governed by the optical transitions between the lowest conduction band and high-energy conduction band, roughly determined by the energy difference between the A and C-exciton.
Most notable, in this description, is the absence of any Faraday/Kerr signature at the B-exciton energy. This is essentially due to the strong light-polarization/orbital/valley/spin entanglement, so that a circularly polarized pumping tuned at the A-exciton resonance induces a valley/spin selective population. More, in particular, left-circularly polarized light tuned at the A-exciton resonance, in the absence of scattering, triggers particle–hole transitions uniquely at the K valley and uniquely for spin-up electrons, making thus an optical unbalance only in the spin-up sector. This scenario gives rise to a finite Faraday/Kerr signature only close to the energies , , and . Such a snapshot is valid, however, only on a short time scale, before impurity and many-body scattering can cause spin-flip and/or intervalley processes.
In order to gain an insight into how many-body scattering can affect the magneto-optical Faraday/Kerr properties induced by circularly polarized pumping, we consider the charge density in each relevant band that is involved in the time-dynamics. We denote thus
as the charge density in the lowest-energy conduction band with
-orbital and
-spin character at the
valley,
as the charge (hole) density in the valence band at the K valley with spin-up, and
as the charge (hole) density in the valence band at the K′ valley with spin-down. Neglecting the frequency-resolved details of each optical feature, we can estimate the “Faraday/Kerr” intensity of each spectral feature as:
Here, following the analysis for the other features, we have properly estimated the intensity of a spectral feature at the energy corresponding to the B-exciton as resulting by the and , and hence governed by the time-dynamics of and . In all the cases, we have taken into account that, due to the orbital/spin/valley entanglement, similar processes at K′ cancel the contributions at the K valley. Assuming an initial pumping with left-circularly polarized photons resonant at the A-exciton energy, we model at the respective charge density as , . These conditions reproduce the static results discussed in the previous section.
Recombination processes, related to annihilation of particle–hole excitations, are known to occur on a very long time scale. On the other hand, the off-diagonal term,
, of the optical tensor is expected to vanish in a much shorter time scale when scattering processes redistribute the charge in both the conduction and valence bands, giving identical populations in the K, K′ valley. Neglecting the very weak intravalley spin-flip processes, two main scattering channels have been identified in this scenario [
16,
34,
35,
36,
41,
59,
60]: (
i) an interband spin-conserving scattering, mediated by electron–phonon coupling and/or impurities, where the charge-density of the conduction band with given spin is scattered for a valley,
, to the opposite valley,
. This process is hampered in the valence band due to the spin-splitting [
34,
36,
41]; and (
) spin-flip intervalley exchange, where a particle–hole couple in a given valley with given spin is scattered into the opposite valley with reverse spin [
36,
60,
61]. According to this picture, we model in a compact way the time dynamics of the pump-driven charges with a set of coupled equations:
where
is the profile of the pump pulse,
is related to the absorption coefficient, and
,
are the scattering rates of the two processes discussed above. The factors
and
take into account that the intervalley exchange scattering requires the presence of both particle–hole changes in the conduction and valence bands. We take in the representative values
fs [
35,
59] and
ps [
20,
61]. The time dynamics of the different charge densities,
, is shown in
Figure 4a, and the corresponding time-dependence of the Kerr intensity of the different spectral features in
Figure 4b, whereas
Figure 4c depicts some representative time-snapshots of
. Notice that, in order to focus on the time dynamics, we plot here only the dependence of
on the time-dependent charge densities,
, neglecting the current operator matrix elements and other time-independent factors, so that the relative ratio of the intensities here is not meant to be representative of the experimental ratio. The overall behavior of
and
that we obtain is in very good agreement with the results by Ref. [
36], performed with ab initio techniques.
We can identify three main regimes. (
i) A short time scale,
(gray areas, left panel of
Figure 4c), where the charge populations are mainly determined by the driving pump, with a significant population of only
and
. This is reflected in a sharp onset of the Faraday/Kerr intensities
,
, and
. (
) Soon after this scenario, in the short time range
(yellow areas, middle panel of
Figure 4c), the impurity/electron–phonon scattering has a main effect of a depletion of
due a redistribution of the spin-up conduction electrons towards
. As a consequence, a sharp decrease of
and
is predicted, whereas the finite valley population,
, gives rise to a finite (delayed) intensity of
. (
) In the time regime
(light blue areas, right panel of
Figure 4c), the key many-body process is the exchange scattering (assisted by the impurity/electron–phonon one), leading towards a slower equal spin population of each conduction and valence band. The total spectral intensities in this regime scale as
. The final steady state (right panel of
Figure 4c) is, however, reached only for
. In this regime, the contributions of off-diagonal elements of the optical tensor at K and K′ cancel out, and any Faraday/Kerr effect vanishes. Note that the transient Faraday/Kerr intensity at the B-exciton energy range is a by-product of the finite valley-K′ population of
. In a similar way as this valley-transient population is expect to give rise to a Faraday/Kerr effect at the energy
associated with optical transitions between
and
, we can expect the appearance of further Faraday/Kerr features at the energies associated with optical transitions between
and
, and between
and
. We denote these transitions as
and
, whose Faraday/Kerr spectral intensity scales, assuming pumping tuned at the A-exciton resonance, are
and
. The band-parameters determining the detailed spectral properties of these features are also listed in
Table A1.