Skip to main content
Erschienen in: Social Choice and Welfare 4/2017

Open Access 01.03.2017 | Original Paper

On redistributive taxation under the threat of high-skill emigration

verfasst von: Alan Krause

Erschienen in: Social Choice and Welfare | Ausgabe 4/2017

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The increasing international mobility of high-skill individuals is often seen as posing a threat to domestic social welfare, by limiting the ability of governments to tax these individuals and redistribute to the poor. In this paper, we examine a simple dynamic nonlinear income tax model without commitment. In this setting, it is shown that the threat of emigration by high-skill individuals facilitates redistribution and increases social welfare in the short-run, and has no effect on social welfare over the long-run.
Hinweise
I thank an associate editor and two anonymous referees for comments and suggestions which led to a much improved paper. Any errors remain my responsibility.

1 Introduction

Optimal tax analyses typically assume that individuals cannot emigrate to avoid domestic taxation. Such an assumption is, however, increasingly viewed as being unrealistic, especially as it relates to high-skill individuals. When high-skill individuals are immobile, redistributive taxation must take into consideration that these individuals may change their labour supply along the intensive margin. That is, high-skill individuals may work less, thus reducing the amount of income available for redistribution. When high-skill individuals are internationally mobile, they have the additional option of changing their labour supply along the extensive margin, i.e., they may emigrate. An often-employed method to capture the threat of emigration is to introduce type-dependent participation constraints into the optimal tax problem; see, e.g., Osmundsen (1999), Krause (2009a), Simula and Trannoy (2010, 2012), and Lehmann et al. (2014). Naturally, these additional constraints reduce the level of social welfare attainable.
In this paper, we introduce the threat of emigration by high-skill individuals into a two-period nonlinear income tax model without commitment. As in the related literature, this threat is captured by introducing a participation constraint for high-skill individuals. However, we show that the introduction of the high-skill type’s participation constraint facilitates redistribution and increases social welfare in the short-run, and has no effect on social welfare over the long-run. One may wonder how it is possible that the introduction of a binding constraint into an optimisation problem does not reduce the maximised value of the objective function. The participation constraint appears in period 2 and it does reduce the maximised value of the objective function (social welfare) in that period, but the specific manner in which social welfare is reduced—by limiting redistribution—enables a higher level of social welfare to be obtained in period 1. Thus social welfare summed over both periods remains unchanged. The intuition can be summarised as follows. In period 1 the government does not know each individual’s skill type, and therefore implements standard (incentive-compatible) nonlinear income taxation which induces individuals to reveal their type. However, high-skill individuals know that if they reveal their type in period 1, they lose their information advantage and will be subjected to first-best taxation in period 2. This means that high-skill individuals must be offered a very attractive tax treatment in period 1 to reveal their type, to compensate them for having to face first-best taxation in period 2. Accordingly, the ability of the government to redistribute in period 1 is severely limited. However, the threat of emigration reduces the extent of redistribution possible in period 2, meaning that high-skill individuals require less compensation in period 1 to reveal their type. This enables the government to implement more redistribution in period 1, which correspondingly increases first-period social welfare. The threat of emigration limits redistribution and reduces social welfare in period 2, but optimal taxation balances the short-run benefits against the long-run costs. Thus the threat of emigration has no effect on the level of social welfare summed over both periods.
In terms of previous results, the paper most closely related to ours is that by Leite-Monteiro (1997). He also finds that increased international mobility may enhance redistribution, but his model is entirely different to ours. In Leite-Monterio’s model, the government can always implement first-best personalised lump-sum taxes, and changes in the country’s skill composition after migration takes place is what makes enhanced redistribution a possibility. By contrast, in our model the participation constraint ensures that no one migrates, so the skill composition remains unchanged. Our paper is also related to the literature on dynamic Mirrlees (1971) nonlinear income taxation without commitment, e.g., Roberts (1984), Apps and Rees (2006), Brett and Weymark (2008a), Krause (2009b), Berliant and Ledyard (2014), and Guo and Krause (2011, 2013, 2014, 2015a, b). In dynamic Mirrlees models, the question arises as to whether the government can or cannot commit to not using skill-type information revealed by individuals in earlier periods when it implements taxation in latter periods. A common theme in the literature is the highlighting of how different and counter-intuitive optimal policy can be when the commitment assumption is relaxed. The present paper provides another example of a result that, at first glance, appears quite counter-intuitive.
The remainder of the paper is organised as follows. Section 2 outlines the model and the structure of optimal taxation, while Sect. 3 presents and discusses our result. Section 4 discusses other possible solutions to the optimal tax problem, and what these would imply for our result. Section 5 concludes, and the proof of our result is contained in an appendix.

2 A simple model

We consider a two-period model with a unit measure of individuals and with the following timing. In period 1 the government knows that there are \(\phi \in (0,1)\) high-skill individuals and \((1-\phi )\) low-skill individuals in the economy, but it does not know any individual’s skill type. The government therefore implements standard second-best (incentive-compatible) nonlinear income taxation, under which each individual is willing to reveal their type. Then, in period 2, the government knows each individual’s skill type, and is tempted to use this information to implement first-best redistributive taxation. However, high-skill individuals have the option of emigrating, so redistribution is limited by the high-skill type’s participation constraint. We assume that emigration is only possible in period 2. The implications of this assumption are discussed in Sect. 4, though it is consistent with the observation that individuals tend to be immobile in the short-run (period 1), but mobile in the long-run (period 2). For simplicity we assume that individuals do not save or borrow, so the only link between the periods is the revelation and use of skill-type information.
Specifically, the government in period 2 solves the following problem. Choose tax treatments \(\langle c_{L}^{2},y_{L}^{2}\rangle \) and \(\langle c_{H}^{2},y_{H}^{2}\rangle \) for the low-skill and high-skill individuals, respectively, to maximise:
$$\begin{aligned} (1-\phi )\left\{ u(c_{L}^{2})-\frac{y_{L}^{2}}{w_{L}}\right\} +\phi \left\{ u(c_{H}^{2})-\frac{y_{H}^{2}}{w_{H}}\right\} \end{aligned}$$
(2.1)
subject to:
$$\begin{aligned}&(1-\phi )[ y_{L}^{2}-c_{L}^{2}] +\phi [ y_{H}^{2}-c_{H}^{2} ] \ge 0 \end{aligned}$$
(2.2)
$$\begin{aligned}&u(c_{H}^{2})-\frac{y_{H}^{2}}{w_{H}}\ge V_{H} \end{aligned}$$
(2.3)
where \(c_{i}^{t}\) is type i’s consumption (or post-tax income) in period t, \(y_{i}^{t}=w_{i}l_{i}^{t}\) is type i’s pre-tax income in period t, with \(w_{i}\) denoting type i’s wage rate and \(l_{i}^{t}\) denoting type i’s labour supply in period t. It is assumed that \(w_{H}>w_{L}>0\) and that wages remain constant over time. Equation (2.1) is a utilitarian social welfare function, where \(u(\cdot )\) is increasing and strictly concave and the individuals’ utility function is quasi-linear in labour.1 Equation (2.2) is the government’s budget constraint,2 and Eq. (2.3) is the high-skill type’s participation constraint. High-skill individuals can obtain a utility level of \(V_{H}\) by emigrating, which is their reservation utility. We assume that \(V_{H}\) is sufficiently high such that Eq. (2.3) is binding, i.e., the threat of emigration is effective. The solution to programme (2.1)–(2.3) yields the value function \(W^{2}(\phi ,w_{L},w_{H},V_{H})\), which is the level of social welfare attainable in period 2.
In period 1 the government does not know each individual’s skill type, and therefore implements second-best (incentive-compatible) nonlinear income taxation. It chooses tax treatments \(\langle c_{L}^{1},y_{L}^{1}\rangle \) and \(\langle c_{H}^{1},y_{H}^{1}\rangle \) for the low-skill and high-skill individuals, respectively, to maximise:
$$\begin{aligned} (1-\phi )\left\{ u(c_{L}^{1})-\frac{y_{L}^{1}}{w_{L}}\right\} +\phi \left\{ u(c_{H}^{1})-\frac{y_{H}^{1}}{w_{H}}\right\} \end{aligned}$$
(2.4)
subject to:
$$\begin{aligned}&(1-\phi )[ y_{L}^{1}-c_{L}^{1}] +\phi [ y_{H}^{1}-c_{H}^{1} ] \ge 0 \end{aligned}$$
(2.5)
$$\begin{aligned}&u(c_{H}^{1})-\frac{y_{H}^{1}}{w_{H}}+\delta V_{H}\ge u(c_{L}^{1})-\frac{ y_{L}^{1}}{w_{H}}+\delta \max [ \widehat{U}_{H}^{2},V_{H}] \end{aligned}$$
(2.6)
where \(\widehat{U}_{H}^{2}=u(c_{L}^{2})-\frac{y_{L}^{2}}{w_{H}}\) is the utility that a high-skill individual would obtain in period 2 from the low-skill type’s tax treatment. That is, \(c_{L}^{2}=c_{L}^{2}(\phi ,w_{L},w_{H},V_{H})\) and \(y_{L}^{2}=y_{L}^{2}(\phi ,w_{L},w_{H},V_{H})\) come from the solution to programme (2.1)–(2.3). We use \(\delta \in (0,1)\) to denote the discount factor.
Equation (2.4) is the first-period utilitarian social welfare function. The government in period 1 might care about the level of social welfare in period 2, but its choice of \(\langle c_{L}^{1},y_{L}^{1}\rangle \) and \( \langle c_{H}^{1},y_{H}^{1}\rangle \) cannot affect the level of social welfare attainable in period 2, which is \(W^{2}(\phi ,w_{L},w_{H},V_{H})\). Therefore, the government in period 1 acts as if maximising only first-period social welfare. Equation (2.5) is the government’s first-period budget constraint, while Eq. (2.6) is the high-skill type’s incentive-compatibility constraint.
In order for a high-skill individual to be willing to choose \(\langle c_{H}^{1},y_{H}^{1}\rangle \) in period 1 and thus reveal their type, the utility they obtain from this choice in period 1 plus the utility they obtain in period 2 (which will be their reservation utility, \(V_{H}\)), must be greater than or equal to the utility they could obtain by mimicking. A high-skill individual has two mimicking strategies. First, a high-skill individual could choose the low-skill type’s tax treatment in period 1—effectively announcing to the government that they are low-skill—and then not emigrate in period 2. In this case, because in period 1 they announced that they are low-skill, the government in period 2 will treat them as low-skill and give them the low-skill type’s tax treatment. The mimicking high-skill individual would therefore obtain utility level \( \widehat{U}_{H}^{2}\) in period 2. Second, a high-skill individual could choose the low-skill type’s tax treatment in period 1 and then emigrate in period 2, to avoid being treated as low-skill in period 2. In this case, they obtain utility level \(V_{H}\) in period 2. Which of these mimicking strategies is best depends upon \(\widehat{U}_{H}^{2}\) and \(V_{H}\). The first is better when \(\widehat{U}_{H}^{2}>V_{H}\), and the second is better when \( \widehat{U}_{H}^{2}<V_{H}\). Note that if \(\widehat{U}_{H}^{2}\le V_{H}\), the incentive-compatibility constraint (2.6) reduces to \(u(c_{H}^{1})-\frac{ y_{H}^{1}}{w_{H}}\ge u(c_{L}^{1})-\frac{y_{L}^{1}}{w_{H}}\), which is the same as the condition required for incentive compatibility in static models. However, as we explain below (in Sect. 3.1), \(\widehat{U}_{H}^{2}>V_{H}\) will hold unless \(V_{H}\) is so high as to undo a key feature of dynamic nonlinear taxation without commitment.
We make the standard assumption that any individual who is indifferent between truthfully revealing their type and mimicking will tell the truth. Therefore, no mimicking occurs in the solution to the optimal tax problem. We also omit the low-skill type’s incentive-compatibility constraint, because we make the common assumption that the redistributive goals of the government create an incentive for high-skill individuals to mimic low-skill individuals, but not vice versa.3 As discussed earlier, emigration is only possible in period 2, so a participation constraint does not appear in period 1. The solution to programme (2.4)–(2.6) yields the value function \(W^{1}(\phi ,w_{L},w_{H}, \widehat{U}_{H}^{2},V_{H},\delta )\), which represents the level of social welfare attainable in the first period.

3 The threat of emigration and social welfare

It is shown in the appendix that:
Proposition
Suppose \(\widehat{U}_{H}^{2}>V_{H}\). In our dynamic nonlinear income tax model without commitment, the threat of high-skill emigration increases social welfare in period 1 (\(\partial W^{1}(\cdot )/\partial V_{H}>0\)), decreases social welfare in period 2 (\(\partial W^{2}(\cdot )/\partial V_{H}<0\)), and has no effect on social welfare summed over the two periods (\(\partial W^{1}(\cdot )/\partial V_{H}+\delta \partial W^{2}(\cdot )/\partial V_{H}=0\)).
The proposition implies that the threat of high-skill emigration facilitates redistribution and increases social welfare in the short-run, and has no effect on the level of social welfare attainable over the long-run. The intuition follows from a key feature of dynamic nonlinear income taxation without commitment. In the absence of a participation constraint, high-skill individuals know that if they reveal their type in period 1, they will be subjected to first-best redistributive taxation in period 2. Therefore, high-skill individuals must be offered an attractive tax treatment in period 1 if they are to be willing to reveal their type, to compensate them for the unattractive tax treatment they will face in period 2. This restricts the government’s ability to redistribute and raise social welfare in period 1. Now consider the effects of the participation constraint. The participation constraint means that high-skill individuals obtain more utility in period 2, because they will emigrate unless they receive at least their reservation utility. Therefore, high-skill individuals now have less incentive to conceal their type in period 1. Accordingly, the government can now offer high-skill individuals a less attractive tax treatment in period 1 and still obtain skill-type information. Taken together, this implies that social welfare is lower in period 2 (due to the participation constraint), but higher in period 1 (as the incentive-compatibility constraint is relaxed). Moreover, social welfare summed over the two periods remains unchanged, because optimal taxation balances the short-run benefits vis-a-vis relaxation of the incentive-compatibility constraint against the long-run costs of the participation constraint.

3.1 A numerical example

As our proposition assumes that \(\widehat{U}_{H}^{2}>V_{H}\), in this subsection we provide a numerical example of our result. We also explain why our assumption, and hence our result, will be valid unless \( V_{H} \) is so high as to undo a key feature of dynamic nonlinear taxation without commitment.
In the numerical example we assume that the utility function takes the form ln(\(c_{i}^{t})-l_{i}^{t}\). This is based on Chetty (2006), who concludes that a reasonable estimate of the coefficient of relative risk aversion is one (log utility). The OECD (2014) reports that approximately one-third of adults have attained tertiary level education. We assume that these individuals are high-skill and the remainder are low-skill, i.e., we set \( \phi =1/3\). Fang (2006) and Goldin and Katz (2007) estimate that the college wage premium is approximately 60%. We therefore normalise the low-skill type’s wage rate to unity (\(w_{L}=1\)), and set the high-skill type’s wage rate at \(w_{H}=1.6\). Following common practice, we assume an annual discount rate of 4%. However, as most individuals work for around 40 years of their lives, we take each period to be 20 years in length. An annual discount rate of 4% then implies a 20-year discount factor of \(\delta =0.456\). Table 1 reports the parameter values used in the numerical example, as well as the results for the cases of \(V_{H}=-1.0\) and \(V_{H}=-0.9\).
Table 1
Numerical example \(u(c_{i}^{t})-l_{i}^{t}=\hbox {ln}(c_{i}^{t})- l_{i}^{t}\)
Parameter values:
\(\varphi \)
0.333
\(w_{L}\)
1.000
\(\delta \)
0.456
\(w_{H}\)
1.600
  
\(V_{H}=-1.0\)
\(V_{H}=-0.9\)
Low-skill utility: period 1
\(-\)1.082
\(-\)1.046
High-skill utility: period 1
\(-\)0.432
\(-\)0.478
Mimicker utility: period 1
\(-\)0.710
\(-\)0.687
Low-skill utility: period 2
\(-\)0.624
\(-\)0.704
High-skill utility: period 2
\(-\)1.0
\(-\)0.9
Mimicker utility \({\hat{U}^{2}}_{H}\) : period 2
\(-\)0.390
\(-\)0.444
Social welfare: period 1
\(-\)0.866
\(-\)0.856
Social welfare: period 2
\(-\)0.749
\(-\)0.769
Social welfare: total
\(-\)1.208
\(-\)1.208
It can be seen that as the value of the participation constraint is increased from \(V_{H}=-1.0\) to \(V_{H}=-0.9\), the discounted sum of social welfare over the two periods remains unchanged (at \(-1.208\)). Social welfare increases in period 1 and decreases in period 2. These findings are an example of our proposition, since the assumption \(\widehat{U}_{H}^{2}>V_{H}\) is satisfied for the parameters chosen.
For the case of \(V_{H}=-1.0\), it can be seen that the utility a high-skill individual obtains in period 1 (\(-0.432\) by choosing \(\langle c_{H}^{1},y_{H}^{1}\rangle \)) is greater than what they could obtain by mimicking (\(-0.710\) by choosing \(\langle c_{L}^{1},y_{L}^{1}\rangle \)). Likewise, for \(V_{H}=-0.9\), the utility a high-skill individual obtains in period 1 (\(-0.478\) by choosing \(\langle c_{H}^{1},y_{H}^{1}\rangle \)) is greater than what they could obtain by mimicking (\(-0.687\) by choosing \( \langle c_{L}^{1},y_{L}^{1}\rangle \)). As discussed earlier, a key feature of dynamic nonlinear taxation without commitment is that high-skill individuals know that they will be subjected to first-best taxation in period 2 if they reveal their type in period 1. Accordingly, high-skill individuals must be offered an attractive tax treatment in period 1 to reveal their type, as compensation for the unattractive tax treatment they will receive in period 2. Therefore, compared to static settings, \(\langle c_{H}^{1},y_{H}^{1}\rangle \) will be favourable and \(\langle c_{L}^{1},y_{L}^{1}\rangle \) will be unfavourable. That is, \(u(c_{H}^{1})- \frac{y_{H}^{1}}{w_{H}}>u(c_{L}^{1})-\frac{y_{L}^{1}}{w_{H}}\) is a general feature of dynamic nonlinear taxation without commitment, while \(u(c_{H}^{1})-\frac{y_{H}^{1}}{w_{H}}=u(c_{L}^{1})-\frac{y_{L}^{1}}{w_{H}}\) would hold in static settings. It is only if \(V_{H}\) was so high as to undo this general feature of dynamic nonlinear taxation without commitment that our assumption would no longer be valid.

3.2 A general condition

In this subsection, we provide a general condition under which \( \widehat{U}_{H}^{2}>V_{H}\), assuming log utility. It is shown in the appendix that:
$$\begin{aligned} \widehat{U}_{H}^{2}>V_{H} \quad \textit{if and only if}\quad (1-\phi )\left[ \ln (w_{L})-\frac{w_{L}}{w_{H}}\right] +\phi [ \ln (w_{H})-1] >V_{H}\qquad \end{aligned}$$
(3.1)
For any particular value of the participation constraint, Eq. (3.1) can be used to check if \(\widehat{U}_{H}^{2}>V_{H}\). For example, one can check that (3.1) is satisfied for the parameters used in our numerical example.

4 Discussion and some caveats

Our proposition is driven by an important feature of dynamic nonlinear income taxation without commitment, namely, that high-skill individuals must be offered an attractive tax treatment in period 1 to reveal their type. This restricts redistribution in period 1 and reduces social welfare. In fact, it is possible that the welfare cost is sufficiently severe that it would be better if the government did not induce skill-type revelation. That is, it may be optimal for the government to implement pooling taxation. This involves the government offering a single tax treatment to all individuals, and therefore skill-type information is not revealed.
Our analysis implicitly assumes that pooling taxation is not optimal. If pooling taxation is optimal, our result no longer holds. However, the question arises as to how likely it is that pooling becomes optimal. Pooling taxation is quite severe in that it imposes the same tax treatment on everyone. Guo and Krause (2015b) show, for a range of empirically-plausible parameter values, that pooling taxation is not optimal in either two-period or infinite-horizon settings. Indeed, they show that even the autarkic equilibrium yields a higher level of social welfare than does pooling taxation. Therefore, while it is possible that pooling taxation is optimal, it is less likely than regular separating taxation (under which our result holds).
Perhaps a more serious caveat follows from our assumption that the participation constraint appears only in period 2. If one assumes that high-skill individuals are immobile in the short-run and mobile in the long-run, then our assumption seems reasonable. However, if a participation constraint for high-skill individuals was introduced into period 1, and it rendered the incentive-compatibility constraint slack, then our result would no longer hold. For this to occur, the value of the first-period participation constraint would have to be quite high, since incentive-compatibility requires that high-skill individuals be offered a generous first-period tax treatment (as discussed earlier). Nevertheless, it is worth noting that our result would not hold in that case. It is also worth noting that for sufficiently high values of the participation constraint, it might be better to simply let high-skill individuals emigrate. But we do not consider that possibility here.
Finally, nonlinear income taxation a la Mirrlees (1971) can be viewed as an application of principal-agent theory, so one may wonder if an analogue of our result would apply in other applications of principal-agent theory in dynamic settings. For example, repeated contracting vis-a-vis insurance, regulation, or employment agreements.4 An important difference is that in nonlinear income taxation, the principal (the government) seeks to maximise the aggregate welfare of the agents (the individuals), rather than having its own separate objective. That said, in all dynamic applications without commitment, the agent-type with the information advantage must be compensated to give-up that advantage. So it seems possible that an analogue of our result may apply in these other applications as well.

5 Conclusion

In this paper we have shown, using a simple dynamic nonlinear income tax model without commitment, that the threat of high-skill emigration can increase social welfare in the short-run and have no effect on social welfare over the long-run. While this result may be viewed as being primarily of theoretical interest, it also provides a new example of how restrictions on policy instruments that are clearly welfare-reducing when the government can commit, may in fact be beneficial when the government cannot commit.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://​creativecommons.​org/​licenses/​by/​4.​0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
Anhänge

Appendix

Proof of the Proposition
The Lagrangian corresponding to programme (2.1)–(2.3) is:
$$\begin{aligned} \mathcal {L}^{2}= & {} (1-\phi )\left\{ u(c_{L}^{2})-\frac{y_{L}^{2}}{w_{L}} \right\} +\phi \left\{ u(c_{H}^{2})-\frac{y_{H}^{2}}{w_{H}}\right\} \nonumber \\&+\,\lambda ^{2}\big \{ (1-\phi )[ y_{L}^{2}-c_{L}^{2}] +\phi [ y_{H}^{2}-c_{H}^{2}] \big \} +\alpha _{H}^{2}\left\{ u(c_{H}^{2})-\frac{y_{H}^{2}}{w_{H}}-V_{H}\right\} \nonumber \\ \end{aligned}$$
(A.1)
where \(\lambda ^{2}>0\) and \(\alpha _{H}^{2}>0\) are Lagrange multipliers. The first-order conditions are:
$$\begin{aligned}&(1-\phi )u^{\prime }(c_{L}^{2})-\lambda ^{2}(1-\phi )=0 \end{aligned}$$
(A.2)
$$\begin{aligned}&-(1-\phi )\frac{1}{w_{L}}+\lambda ^{2}(1-\phi )=0 \end{aligned}$$
(A.3)
$$\begin{aligned}&\phi u^{\prime }(c_{H}^{2})-\lambda ^{2}\phi +\alpha _{H}^{2}u^{\prime }(c_{H}^{2})=0 \end{aligned}$$
(A.4)
$$\begin{aligned}&-\,\phi \frac{1}{w_{H}}+\lambda ^{2}\phi -\alpha _{H}^{2}\frac{1}{w_{H}}=0 \end{aligned}$$
(A.5)
$$\begin{aligned}&(1-\phi )[ y_{L}^{2}-c_{L}^{2}] +\phi [ y_{H}^{2}-c_{H}^{2}] =0 \end{aligned}$$
(A.6)
$$\begin{aligned}&u(c_{H}^{2})-\frac{y_{H}^{2}}{w_{H}}-V_{H}=0 \end{aligned}$$
(A.7)
By the Envelope Theorem:
$$\begin{aligned} \frac{\partial W^{2}(\cdot )}{\partial V_{H}}=\frac{\partial \mathcal {L}^{2}(\cdot )}{\partial V_{H}}=-\alpha _{H}^{2}=-\phi \left[ \frac{w_{H}}{ w_{L}}-1\right] \end{aligned}$$
(A.8)
where the last equality follows from (A.3) and (A.5). This shows that \(\partial W^{2}(\cdot )/\partial V_{H}<0\).
Under the assumption that \(\widehat{U}_{H}^{2}>V_{H}\), the Lagrangian corresponding to programme (2.4)–(2.6) is:
$$\begin{aligned} \mathcal {L}^{1}= & {} (1-\phi )\left\{ u(c_{L}^{1})-\frac{y_{L}^{1}}{w_{L}} \right\} +\phi \left\{ u(c_{H}^{1})-\frac{y_{H}^{1}}{w_{H}}\right\} \nonumber \\&+\,\lambda ^{1}\big \{ (1-\phi )[ y_{L}^{1}-c_{L}^{1}] +\phi [ y_{H}^{1}-c_{H}^{1}] \big \}\nonumber \\&+\,\theta _{H}^{1}\left\{ u(c_{H}^{1})-\frac{y_{H}^{1}}{w_{H}}+\delta V_{H}-u(c_{L}^{1})+\frac{y_{L}^{1}}{w_{H}}-\delta u(c_{L}^{2}(\cdot ))+\frac{ \delta y_{L}^{2}(\cdot )}{w_{H}}\right\} \qquad \qquad \end{aligned}$$
(A.9)
where \(\lambda ^{1}>0\) and \(\theta _{H}^{1}>0\) are Lagrange multipliers, and use is made of the fact that \(\widehat{U}_{H}^{2}=u(c_{L}^{2}(\cdot ))-\frac{ y_{L}^{2}(\cdot )}{w_{H}}\). The first-order conditions on \(y_{L}^{1}\) and \( y_{H}^{1}\) are, respectively:
$$\begin{aligned}&-(1-\phi )\frac{1}{w_{L}}+\lambda ^{1}(1-\phi )+\theta _{H}^{1}\frac{1}{w_{H} }=0 \end{aligned}$$
(A.10)
$$\begin{aligned}&-\phi \frac{1}{w_{H}}+\lambda ^{1}\phi -\theta _{H}^{1}\frac{1}{w_{H}}=0 \end{aligned}$$
(A.11)
By the Envelope Theorem:
$$\begin{aligned} \frac{\partial W^{1}(\cdot )}{\partial V_{H}}= & {} \frac{\partial \mathcal {L}^{1}(\cdot )}{\partial V_{H}}=\delta \theta _{H}^{1}\left[ 1-u^{\prime }(c_{L}^{2})\frac{\partial c_{L}^{2}(\cdot )}{\partial V_{H}}+\frac{1}{w_{H}} \frac{\partial y_{L}^{2}(\cdot )}{\partial V_{H}}\right] \nonumber \\= & {} \delta \theta _{H}^{1}\left[ 1-\frac{1}{w_{L}}\frac{\partial c_{L}^{2}(\cdot )}{\partial V_{H}}+\frac{1}{w_{H}}\frac{\partial y_{L}^{2}(\cdot )}{\partial V_{H}} \right] \end{aligned}$$
(A.12)
where the last equality makes use of (A.2) and (A.3).
From (A.2) and (A.3) we also obtain:
$$\begin{aligned} u^{\prime }(c_{L}^{2})-\frac{1}{w_{L}}=0\Longrightarrow \frac{\partial c_{L}^{2}(\cdot )}{\partial V_{H}}=0 \end{aligned}$$
(A.13)
Likewise, from (A.3)–(A.5) we obtain:
$$\begin{aligned} u^{\prime }(c_{H}^{2})-\frac{1}{w_{H}}=0\Longrightarrow \frac{\partial c_{H}^{2}(\cdot )}{\partial V_{H}}=0 \end{aligned}$$
(A.14)
From (A.7) and using (A.14) we obtain:
$$\begin{aligned} y_{H}^{2}-w_{H}[ u(c_{H}^{2})-V_{H}] =0 \Longrightarrow \frac{\partial y_{H}^{2}(\cdot )}{\partial V_{H}}=-w_{H} \end{aligned}$$
(A.15)
From (A.6) and using (A.13)–(A.15) we obtain:
$$\begin{aligned} y_{L}^{2}-c_{L}^{2}-\frac{\phi c_{H}^{2}}{1-\phi }+\frac{\phi y_{H}^{2}}{ 1-\phi }=0\Longrightarrow \frac{\partial y_{L}^{2}(\cdot )}{\partial V_{H}}=\frac{\phi w_{H}}{1-\phi } \end{aligned}$$
(A.16)
Equation (A.12) can now be simplified to:
$$\begin{aligned} \frac{\partial W^{1}(\cdot )}{\partial V_{H}}=\delta \phi \left[ \frac{w_{H} }{w_{L}}-1\right] \end{aligned}$$
(A.17)
where use has been made of (A.10) and (A.11). This shows that \(\partial W^{1}(\cdot )/\partial V_{H}>0\).
Finally, Eqs. (A.8) and (A.17) imply that \(\partial W^{1}(\cdot )/\partial V_{H}+\delta \partial W^{2}(\cdot )/\partial V_{H}=0\). \(\square \)
Derivation of Equation ( 3.1 )
Assuming log utility, Eqs. (A.2)–(A.7) can be solved to yield:
$$\begin{aligned} c_{L}^{2}=w_{L}\quad \text {and} \quad y_{L}^{2}=w_{L}-\frac{\phi w_{H}}{ (1-\phi )}[ \ln (w_{H})-1-V_{H}] \end{aligned}$$
(A.18)
Recall that \(\widehat{U}_{H}^{2}=u(c_{L}^{2})-\frac{y_{L}^{2}}{w_{H}}\). Using (A.18) and after some algebraic manipulation, it can be shown that:
$$\begin{aligned} \widehat{U}_{H}^{2}>V_{H} \Longleftrightarrow (1-\phi )\left[ \ln (w_{L})-\frac{w_{L}}{w_{H}}\right] +\phi [ \ln (w_{H})-1] >V_{H} \end{aligned}$$
(A.19)
which is Eq. (3.1). \(\square \)
Fußnoten
1
The literature on the comparative statics of optimal nonlinear income taxes (e.g., Weymark 1987; Brett and Weymark 2008b, 2011; Simula 2010) has shown that results are generally obtainable only when the utility function is quasi-linear. As we make use of comparative statics methods, we also assume that the utility function is quasi-linear.
 
2
We assume that the government cannot save or borrow, and that its revenue requirement is zero. Thus taxation is implemented only for redistributive purposes.
 
3
In fact, high-skill individuals generally have a stronger incentive to mimic when the government cannot commit than when it can commit. This makes it theoretically possible that ‘pooling’ taxation may be optimal; see, e.g., Brett and Weymark (2008a), Krause (2009b), and Guo and Krause (2015a, b). In this paper, however, we focus only on ‘separating’ taxation, under which high-skill and low-skill individuals always receive different tax treatments. The effects of pooling taxation are discussed further in Sect. 4.
 
4
Laffont and Martimort (2002) provide a textbook treatment of principal-agent theory and its applications.
 
Literatur
Zurück zum Zitat Apps P, Rees R (2006) Repeated optimal nonlinear income taxation. Mimeo, University of Munich Apps P, Rees R (2006) Repeated optimal nonlinear income taxation. Mimeo, University of Munich
Zurück zum Zitat Berliant M, Ledyard J (2014) Optimal dynamic nonlinear income taxes with no commitment. J Public Econ Theory 16:196–221CrossRef Berliant M, Ledyard J (2014) Optimal dynamic nonlinear income taxes with no commitment. J Public Econ Theory 16:196–221CrossRef
Zurück zum Zitat Brett C, Weymark J (2008a) Optimal nonlinear taxation of income and savings without commitment. Vanderbilt University Working Paper, No. 08-W05 Brett C, Weymark J (2008a) Optimal nonlinear taxation of income and savings without commitment. Vanderbilt University Working Paper, No. 08-W05
Zurück zum Zitat Brett C, Weymark J (2008b) The impact of changing skill levels on optimal nonlinear income taxes. J Public Econ 92:1765–1771CrossRef Brett C, Weymark J (2008b) The impact of changing skill levels on optimal nonlinear income taxes. J Public Econ 92:1765–1771CrossRef
Zurück zum Zitat Brett C, Weymark J (2011) How optimal nonlinear income taxes change when the distribution of the population changes. J Public Econ 95:1239–1247CrossRef Brett C, Weymark J (2011) How optimal nonlinear income taxes change when the distribution of the population changes. J Public Econ 95:1239–1247CrossRef
Zurück zum Zitat Chetty R (2006) A new method of estimating risk aversion. Am Econ Rev 96:1821–1834CrossRef Chetty R (2006) A new method of estimating risk aversion. Am Econ Rev 96:1821–1834CrossRef
Zurück zum Zitat Fang H (2006) Disentangling the college wage premium: estimating a model with endogenous education choices. Int Econ Rev 47:1151–1185CrossRef Fang H (2006) Disentangling the college wage premium: estimating a model with endogenous education choices. Int Econ Rev 47:1151–1185CrossRef
Zurück zum Zitat Goldin C, Katz L (2007) The race between education and technology: the evolution of U.S. educational wage differentials, 1890–2005. NBER Working Paper, No. 12984 Goldin C, Katz L (2007) The race between education and technology: the evolution of U.S. educational wage differentials, 1890–2005. NBER Working Paper, No. 12984
Zurück zum Zitat Guo J-T, Krause A (2011) Optimal nonlinear income taxation with habit formation. J Public Econ Theory 13:463–480CrossRef Guo J-T, Krause A (2011) Optimal nonlinear income taxation with habit formation. J Public Econ Theory 13:463–480CrossRef
Zurück zum Zitat Guo J-T, Krause A (2013) Optimal nonlinear taxation of income and education expenditures. Oxf Econ Pap 65:74–95CrossRef Guo J-T, Krause A (2013) Optimal nonlinear taxation of income and education expenditures. Oxf Econ Pap 65:74–95CrossRef
Zurück zum Zitat Guo J-T, Krause A (2014) Optimal dynamic nonlinear income taxation under loose commitment. Macroecon Dyn 18:1403–1427CrossRef Guo J-T, Krause A (2014) Optimal dynamic nonlinear income taxation under loose commitment. Macroecon Dyn 18:1403–1427CrossRef
Zurück zum Zitat Guo J-T, Krause A (2015a) Dynamic nonlinear income taxation with quasi-hyperbolic discounting and no commitment. J Econ Behav Organ 109:101–119CrossRef Guo J-T, Krause A (2015a) Dynamic nonlinear income taxation with quasi-hyperbolic discounting and no commitment. J Econ Behav Organ 109:101–119CrossRef
Zurück zum Zitat Guo J-T, Krause A (2015b) Dynamic income taxation without commitment: comparing alternative tax systems. Econ Model 47:319–326CrossRef Guo J-T, Krause A (2015b) Dynamic income taxation without commitment: comparing alternative tax systems. Econ Model 47:319–326CrossRef
Zurück zum Zitat Krause A (2009a) Education and taxation policies in the presence of countervailing incentives. Economica 76:387–399CrossRef Krause A (2009a) Education and taxation policies in the presence of countervailing incentives. Economica 76:387–399CrossRef
Zurück zum Zitat Krause A (2009b) Optimal nonlinear income taxation with learning-by-doing. J Public Econ 93:1098–1110CrossRef Krause A (2009b) Optimal nonlinear income taxation with learning-by-doing. J Public Econ 93:1098–1110CrossRef
Zurück zum Zitat Laffont J, Martimort D (2002) The theory of incentives: the principal-agent model. Princeton University Press, Princeton Laffont J, Martimort D (2002) The theory of incentives: the principal-agent model. Princeton University Press, Princeton
Zurück zum Zitat Lehmann E, Simula L, Trannoy A (2014) Tax me if you can! Optimal nonlinear income tax between competing governments. Q J Econ 129:1995–2030CrossRef Lehmann E, Simula L, Trannoy A (2014) Tax me if you can! Optimal nonlinear income tax between competing governments. Q J Econ 129:1995–2030CrossRef
Zurück zum Zitat Leite-Monteiro M (1997) Redistributive policy with labour mobility across countries. J Public Econ 65:229–244CrossRef Leite-Monteiro M (1997) Redistributive policy with labour mobility across countries. J Public Econ 65:229–244CrossRef
Zurück zum Zitat Mirrlees J (1971) An exploration in the theory of optimum income taxation. Rev Econ Stud 38:175–208CrossRef Mirrlees J (1971) An exploration in the theory of optimum income taxation. Rev Econ Stud 38:175–208CrossRef
Zurück zum Zitat OECD (2014) Highlights from education at a Glance. OECD Publications, Paris OECD (2014) Highlights from education at a Glance. OECD Publications, Paris
Zurück zum Zitat Osmundsen P (1999) Taxing internationally mobile individuals—a case of countervailing incentives. Int Tax Public Finance 6:149–164CrossRef Osmundsen P (1999) Taxing internationally mobile individuals—a case of countervailing incentives. Int Tax Public Finance 6:149–164CrossRef
Zurück zum Zitat Roberts K (1984) The theoretical limits to redistribution. Rev Econ Stud 51:177–195CrossRef Roberts K (1984) The theoretical limits to redistribution. Rev Econ Stud 51:177–195CrossRef
Zurück zum Zitat Simula L (2010) Optimal nonlinear income tax and nonlinear pricing: optimality conditions and comparative static properties. Soc Choice Welf 35:199–220CrossRef Simula L (2010) Optimal nonlinear income tax and nonlinear pricing: optimality conditions and comparative static properties. Soc Choice Welf 35:199–220CrossRef
Zurück zum Zitat Simula L, Trannoy A (2010) Optimal income tax under the threat of migration by top-income earners. J Public Econ 94:163–173CrossRef Simula L, Trannoy A (2010) Optimal income tax under the threat of migration by top-income earners. J Public Econ 94:163–173CrossRef
Zurück zum Zitat Simula L, Trannoy A (2012) Shall we keep the highly skilled at home? The optimal income tax perspective. Soc Choice Welf 39:751–782CrossRef Simula L, Trannoy A (2012) Shall we keep the highly skilled at home? The optimal income tax perspective. Soc Choice Welf 39:751–782CrossRef
Zurück zum Zitat Weymark J (1987) Comparative static properties of optimal nonlinear income taxes. Econometrica 55:1165–1185CrossRef Weymark J (1987) Comparative static properties of optimal nonlinear income taxes. Econometrica 55:1165–1185CrossRef
Metadaten
Titel
On redistributive taxation under the threat of high-skill emigration
verfasst von
Alan Krause
Publikationsdatum
01.03.2017
Verlag
Springer Berlin Heidelberg
Erschienen in
Social Choice and Welfare / Ausgabe 4/2017
Print ISSN: 0176-1714
Elektronische ISSN: 1432-217X
DOI
https://doi.org/10.1007/s00355-017-1038-8

Weitere Artikel der Ausgabe 4/2017

Social Choice and Welfare 4/2017 Zur Ausgabe