The Cost of Dissolving the WTO: The Role of Global Value Chains

Mostafa Beshkar

Ahmad Lashkaripour (Indiana University, CESifo, CEPR)

Working paper · April 2020

Abstract. As trade agreements face renewed pressure, we show that the rise of global value chains has multiplied the value of trade agreements to unprecedented levels. We cast our argument using a non-parametric neoclassical trade model that accommodates global input-output networks and nests a wide class of quantitative trade models as a special case. To guide our analysis, we derive analytic formulas for optimal non-cooperative trade taxes in this general framework. These formulas predict the extent of trade restriction if global trade agreements were to dissolve. Mapping these formulas to data, we quantify the value of trade agreements for various countries. We find that the disintegration of existing trade agreements will erase 30% of the overall gains from trade, which amounts to a $2.8 trillion loss in global GDP. Around 46% of this value is driven by the agreements' facilitation of global value chains.

Machine-readable full text

This section was extracted with OpenDataLoader PDF from the hosted PDF so the full text is accessible in HTML and Markdown.

Abstract

As trade agreements face renewed pressure, we show that the rise of global value chains has multiplied the value of trade agreements to unprecedented levels. We cast our argument using a non-parametric neoclassical trade model that accommodates global input-output networks and nests a wide class of quantitative trade models as a special case. To guide our analysis, we derive analytic formulas for optimal non-cooperative trade taxes in this general framework. These formulas predict the extent of trade restriction if global trade agreements were to dissolve. Mapping these formulas to data, we quantify the value of trade agreements for various countries. We find that the disintegration of existing trade agreements will erase 30% of the overall gains from trade, which amounts to a $2.8 trillion loss in global GDP. Around 46% of this value is driven by the agreements' facilitation of global value chains.

1 Introduction

The WTO and other free trade agreements have recently come under unprecedented pressure, which has put their future existence in jeopardy. At the same time, international trade has undergone a major transformation. Production processes have become globalized and countries have become submerged in complex

global value chains.1

It is, however, unclear if the globalization of value chains has increased or decreased the value of trade agreements that prevent unilaterally optimal but inefficient trade restrictions. On one hand, trade restrictions have become more costly to the global economy as they disrupt supply chains. One the other hand, global value chains have presumably diminished the appeal of unilateral trade restrictions. So, it is conceivable that sovereign governments would maintain the free flow of goods in and out of their country without a need for existing agreements.

The academic literature, meanwhile, provides little guidance on these pressing issues. The mainstream trade literature has directed most of its attention to measuring the gains from trade relative to autarky. These measures do not speak to the value of trade agreements, since a collapse of existing agreements will not necessarily lead to autarky (Costinot and Rodríguez-Clare (2014)). At the same time, the few studies that focus on the pure gains from global trade agreements do not account for the role of global value chains.2

This paper aims to fill this gap. We present a general theory that describes how the globalization of value chains has altered governments' incentives vis-à-vis protective, unilateral trade restrictions.3 We map our theory to data, to determine the counterfactual level of trade restrictions and the efficiency loss that will occur without existing agreements. We find that the globalization of value chains has barely lowered governments' incentives to adopt protectionist trade restrictions; but it has multiplied the efficiency cost of such restrictions. Accordingly, the disintegration of existing trade agreements will erase 30% of the overall gains from trade, which amounts to a $2.8 trillion loss in global GDP.

Our analysis builds on Bagwell and Staiger's (1999) insight that trade agreements solely "remedy the inefficient terms-of-trade-driven restrictions in trade that arise when trade policies are set unilaterally." Based on this insight, the counterfactual level

  • 1See Irwin (2020) for a historical account of how public attitude towards free trade agreements has shifted over time, and a synthesis of common criticisms facing these agreements.
  • 2Ossa (2014) provides the most sophisticated analysis of the gains from global trade agreements to date. His analysis presents a notable advance over prior analyses of the "gains from global trade agreements" but it abstracts from global value chains. Relatedly, Caliendo and Parro (2014) conduct an ex-post analysis of tariff cuts under NAFTA while accounting for input-output linkages. But, unlike Ossa (2014), their ex-post methodology cannot predict to the ex-ante welfare loss from dissolving global trade agreements.
  • 3We focus on the macro (input-output) view of global value chains as defined in Johnson (2018).

of trade restriction (without trade agreements) can be determined using the unilaterally optimal trade tax schedule in each country. Such a tax schedule describes the best-response of each country to the vector of trade taxes chosen by its trading partners. Solving the best-response functions simultaneously for all countries determines the terms-of-trade-driven trade restrictions that will occur in the noncooperative (post-agreement) Nash equilibrium.

Unfortunately, the existing literature does not provide a formal characterization of unilaterally optimal trade taxes in the presence of global value chains. Existing characterizations of optimal trade taxes are either derived within partial equilibrium frameworks or abstract from global input-output networks.

To bridge this gap, we first present an analytical characterization of optimal trade policy in a general equilibrium non-parametric neoclassical trade model that accommodates an arbitrary number of industries, an arbitrary demand structure, an arbitrary global input-output network, and diseconomies of scale due to industry-specific factors of production. Our general model nests an important class of quantitative trade models, including the widely-used multi-industry gravity model with input-output linkages in Caliendo and Parro (2014).

Our characterization delivers simple sufficient statistics optimal tax formulas that depend on only industry-level trade elasticities, the share of industry-specific capital in production, and observable expenditures, revenue, and input-output shares. These formulas highlight three stark patterns:

  • (a) Absent diseconomies of scale, the optimal non-cooperative policy consists of (i) uniform or zero import taxes, and (ii) non-zero export taxes that are decreasing in the industry-level trade elasticity and level of upstreamness.4
  • (b) But if export taxes are inapplicable (e.g., due to institutional constraints), im-

4The redundancy of import taxes drives from the targeting principle and the Lerner symmetry. Absent diseconomies of scale, Home can only manipulate its import market power on an economywide basis. Doing so asks for a uniform tariff that (by the Lerner symmetry) can be perfectly mimicked with a uniform shift in all export taxes. By the targeting principle, import taxes are not the first-best instrument for manipulating export market power either. So, altogether, they can be discarded. Export taxes, meanwhile, are the first-best instrument for manipulating industry-level export market power. Optimal export taxes are decreasing in the industry-level trade elasticity, which governs the degree of export market power. They are relatively lower on upstream industries that supply intermediate inputs, because an export tax on intermediate inputs is partially passed back to domestic consumers. Finally, the absence of diseconomies of scale is akin to the complete tariff passthrough estimated by Fajgelbaum et al. (2020) and Amiti et al. (2019).

port taxes can be employed as a second-best instrument to manipulate export market power through the global input-output network.

  • (c) Optimal second-best import taxes are higher on imported intermediates that are used intensively in highly-differentiated, export-oriented industries.5 They are uniform across all final and intermediate input goods that are never used in exporting industries.

The above tax structure optimizes a non-cooperative government's ability to correct two unilateral terms-of-trade-related inefficiencies: (i) unexploited import market power, which concerns the unexploited ability of the Home country to charge a mark-down on the price of imported goods; and (ii) unexploited export market power, which concerns the unexploited ability of the Home country to charge a mark-up on the price of exported goods. Correcting these two inefficiencies is unilaterally optimal, but is detrimental to global efficiency and has a negative externality on the rest of the world.

Our optimal policy formulas indicate that the globalization of value chains has two opposing effects on terms-of-trade-driven motives for taxation: On one hand, if export taxes are applicable, global value chains prompt governments to set a lower export tax on upstream industries, because a fraction of this tax is passed back to domestic consumers. On the other hand, if governments cannot use export taxes, global value chains prompt them to set a higher import tariff on upstream industries, because these tariffs are partially passed on to Foreign consumers.6 In that case, import tariffs can serve as a second-best export policy measure. The empirical significance of these two effects depends on the global input-output structure and the size of the tax-imposing country compared to the rest of the world.

In Section 6 we map our analytic formulas to data to measure both the consequences of non-cooperative trade policies and the value of the agreements that remedy them. To this end, we employ data on applied tariffs, expenditure, pro-

  • 5To elaborate, import taxes on intermediates are partially passed on to foreign consumers through the global input-output network. The more-intensively an imported intermediate is used in the production of differentiated export goods, the higher the incidence of its tax on Foreign consumers. The trade-off, though, is that taxing intermediate inputs to mimic export taxes, distorts domestic production in a way that is avoidable with first-best export taxes.
  • 6Bown, Conconi, Erbahar, and Trimarchi (2020) confirm these cascading effects empirically, showing that tariffs in upstream sectors have large effects on prices, employment, sales, and investment in downstream sectors.

duction, and input-output shares across 16 industries and 42 major economies. Our data includes all 27 members of the European Union, Australia, Brazil, China, Indonesia, India, Japan, Korea, Mexico, Norway, Russia, Switzerland, Taiwan, Turkey, the United States, and an aggregate of the rest of the world.

With the aid of our theory, the effects of transitioning from the status quo to the non-cooperative Nash equilibrium can be determined by merely solving a system of equations that depend on observables and industry-level trade elasticities. This exercise, by design, determines the value of global trade agreements. Our quantitative analysis delivers four basic results:

  • i. The globalization of value chains has only modestly reduced the terms-oftrade-driven incentives to tax trade. Correspondingly, due to the ripple effects of policy, the gains from non-cooperative trade taxation and the externality inflicted by these taxes on the rest of the world are doubled by the presence of global value chains.
  • ii. Import taxes are significantly less effective at manipulating the terms-oftrade compared to first-best export taxes. They can only replicate 65% of the unilateral gains attainable under the first-best export tax schedule. The ineffectiveness of second-best import taxes is driven by an innate trade-off between preserving allocative efficiency in the local economy and exploiting export market power through the production network. The former discourages taxing differentiated intermediate inputs, while the latter requires it.
  • iii. Existing trade agreements contribute $2.8 trillion to global GDP by remedying the inefficient terms-of-trade-driven taxes on foreign trade. To put this figure in perspective, it amounts to 30% of the total gains from trade relative to autarky and equals the GDP of France.
  • iv. Around 46% of the above value drives from the agreements' facilitation of global supply chains. This finding suggests that as the tides are turning against global trade agreements, their value has also multiplied.

Related Literature. To the best of our knowledge, this paper is the first to analytically characterize the unilaterally optimal trade policy in an important class of general equilibrium quantitative trade models with global input-output networks.

Several studies have adopted similar models to study how input-output linkages modify the gains from exogenous tariff reductions (e.g., Costinot and RodríguezClare (2014); Caliendo and Parro (2014); Caliendo, Feenstra, Romalis, and Taylor (2015); and Baqaee and Farhi (2019)). But we are not aware of any prior characterization of optimal trade policy in these canonical models.

Our characterization of optimal trade policy is related to Blanchard, Bown, and Johnson (2016) who characterize the optimal final-good tariff in a partial equilibrium trade model under political economy considerations.7 Relative to the aforementioned study, our theory accommodates a wide range of general equilibrium linkages, analyzes both import and export taxation, and considers taxes on both final and intermediate input varieties. As a result, we are able to identify key differences in the governments' incentives to tax final and intermediate goods.

Relatedly, our result on the uniformity (and therefore redundancy) of optimal tariffs generalizes similar results in Costinot, Donaldson, Vogel, and Werning (2015) and Opp (2010) to a non-parametric neoclassical trade model that accommodates global input-output networks.

We also contribute to a recent literature that quantifies the gains from multilateral trade agreements. Ossa (2014, 2016) highlight the importance of market imperfections when quantifying the gains from trade agreements. Bagwell, Staiger, and Yurukoglu (2018) highlight the importance of inter-connected bilateral negotiations. Carballo, Handley, and Limão (2018) analyze the role of trade agreements in mitigating policy uncertainty. We contribute to this literature by measuring the contribution of global value chains to the value of trade agreements.

Another strand of literature has used the structural gravity approach to measure the value of free trade agreements (e.g., Baier and Bergstrand (2007); Egger, Larch, Staub, and Winkelmann (2011); Anderson and Yotov (2016)). This literature takes a different approach from ours and the quantitative studies listed above. It (a) models the economy as one industry, and (b) uses the estimated gravity coefficients to infer the counterfactual level of trade restriction in the absence of trade agreements. While this approach has many merits, it is not optimized to account

7Relatedly, Grant (2019) studies the role of Special Economic Zones using an enriched partial equilibrium trade policy model where buyers are possibly heterogenous and the government applies import tariffs on many intermediate and final good varieties. An older generation of papers study intermediate input tariffs in a three-good, partial equilibrium economy, where the only intermediate good is produced in just one country (e.g., Suzuki (1978); Das (1983)).

for global value chains—mostly, due to the single-industry nature of the structural gravity model (see Limão (2016) for a review of this approach).

2 The Economic Environment

We consider a global economy that consists of i = 1,..., N countries (with C denoting the set of countries) and k = 1,..., K industries (with K denoting the set of industries). Each country i is populated with Li workers that are perfectly mobile across industries but immobile across countries. Li,k denotes the number of workers employed in industry k. In addition to labor, each industry in country i is endowed with K ̄i,k units of industry-specific capital that is inelastically supplied.

Industry k ∈ K in country i ∈ C produces a differentiated variety that is ultimately sold in some market j ∈ C. We use ij, k (origin i–destination j–industry k) to index these product varieties. We use the superscript C to designate if a good is used as a final consumption good and superscript I to designate if its used as an intermediate input. Since we impose no restrictions on the size or the number of industries, we can interpret index k as denoting narrowly-defined product categories rather than broadly-defined industries.

  • 2.1 Preferences The representative consumer in country i chooses the vector of consumption quan-

tities, qiC ≡ {qCji,k}, to maximize a non-parametric utility function, Ui(qiC), subject to their budget constraint. The superscript C, as noted earlier, differentiates between final consumption varieties and intermediate input varieties, with the latter denoted by I. The optimal consumption choice yields the following indirect utility function,

Vi(Yi, p ̃C

i ) ≡ max

qiC

s.t. ∑

j∈C

k∈K

Ui(qC

###### i )

ji,kqC

p ̃C

ji,k = Yi. (1)

The same problem also yields a non-parametric Marshallian demand function,

qC

i = Di(Yi, p ̃C

i ),

which summarizes the demand-side of the economy as a function of net consumption income Yi and the vector of consumer prices p ̃iC ≡ {p ̃Cji,k} in country i. The tilde notation on the price variables is used to differentiate the after-tax final price from pre-tax producer price. To keep track of optimal demand choices, we define the price and income elasticities of demand as follows.

###### D1. [Marshallian Demand Elasticities]

  • (i) [own price elasticity] εji,k ≡ ∂ ln qCji,k/∂ ln p ̃ji,k;
  • (ii) [cross-price elasticity] εȷjii,,gk ≡ ∂ ln qCji,k/∂ ln p ̃ȷi,g for ȷi, g ji, k;
  • (iii) [income elasticity] ηji,k ≡ ∂ ln qCji,k/∂ lnYi.

Throughout this paper, we restrict our attention to well-behaved demand functions that are continuous and locally non-satiated. We also assume that demand for each traded variety exhibits an elastic region where | εji,k |> 1. As in monopoly problems, this condition will be necessary for obtaining a bounded solution for optimal trade taxes.

###### 2.2 Technology

We assume that firms are competitive and operate with a non-parametric production function that employs (i) labor, (ii) intermediate inputs, and (iii) industryspecific capital. To elaborate, let Qi,k = ∑j τij,kqij,k denote country i's total output in industry k, where τij,k ≥ 1 accounts for the iceberg melt cost associated with transporting variety ij, k—as is standard in the literature, we normalize τii,k = 1. The industry-level output is produced using a general constant returns to scale production function,

Qi,k = Fi,k(Li,k, K ̄i,k, qI

i,k),

that combines labor, Li,k, industry-specific capital, K ̄i,k, and intermediate inputs, qiI,k.8 We assume that the share of labor plus industry-specific capital is constant in total production and equal to α ̄i,k. However, we take no parametric stance on how labor and capital are combined or how different intermediate inputs are combined in the production function.

8The above production structure implicitly assumes a non-finite elasticity of transformation between varieties sold in different markets. Relaxing this assumption will lead to diseconomies of scale at the good rather than industry level (see Powell and Gruen (1968)).

Facing the above production structure, cost-minimizing firms charge a competitive "producer" price that is a function of (i) the wage rate in economy i, wi, (ii) the vector of intermediate input prices employed by producers in country i, p ̃iI ≡ {p ̃Iji,g}, and (iii) total industry-level output, Qi,k. Namely,

pij,k = τij,kCi,k(wi, p ̃I

i ; Qi,k), (2)

where Ci,k(.) is homogeneous of degree 1 with respect to wi and p ̃iI. A familiar special case of this general structure is the Ricardian case in which Ci,k (.) = ai,kwi, where ai,k is a constant unit labor cost that is invariant to industry-level output.

Cost minimization by suppliers in industry k of country i yields an industrylevel demand function for intermediate inputs, D ̃i,k(Yi,k, p ̃iI), which depends on gross expenditure on intermediate inputs, Yi,k ≡ ∑n(1 − α ̄i,k)pin,kqin,k, and the entire vector of intermediate input prices in country i. Country i's overall demand

for intermediate inputs, qiI ≡ {qIji,k}, is determined as the sum of demands across all industries:

###### D ̃i,k(Yi,k, p ̃I

##### i = ∑

qI

###### i ).

k

For notational convenience, we assume that the after-tax price of product, ji, k, is the same whether it is used as an intermediate input (indexed I) or a consumption good (indexed C):

p ̃I

ji,k = p ̃C

ji,k = p ̃ji,k.

This assumption is innocuous since we can always extend the set of goods so that ji, k indexes only the final good varieties, while ji, k′ indexes only the intermediate input varieties supplied by the same industry.9 With the same rationale, we also assume that for all k, D ̃i,k(.)/∂ ln Yi,k = ∂ ln Di(.)/∂ lnYi and ∂ ln D ̃i,k(.)/∂ ln p ̃iI = ∂ ln Di(.)/∂ ln p ̃iC.

We use input-output shares to keep track of global value chains. To define these shares, let qIli,g(k) denote the optimal amount of input li, g used by industry k in country i—by construction, qIli,g = ∑k qIli,g(k). Considering this choice of notation, we define the global input-output shares as follows.

D2. [Input-Output Shares] The share of intermediate input goods from "country l×industry g" that are used in the production of output goods in "country j×industry

9Following this argument, our model still accommodates cases where the government imposes differential tax rates on final good varieties (k) versus intermediate input varieties (k′).

I li,gqIli,g(k)

k" is defined as αil,,kg ≡ p ̃

∑j pij,kqij,k .

To be clear, we do not impose that input-output shares be constant. Instead, α's can be variable, and will presumably change in response to trade taxation, e.g., taxing input li, g will lower αil,,kg. Our input-output setup is also flexible enough to allow for the expenditure share on the intermediate input varieties of a given good to diverge from the expenditure share on its final good varieties. A well-known special case of our general input-output structure is Caliendo and Parro (2014), which we formally discuss in Section 4.1.

We use the supply elasticity, as defined below, to keep track of diseconomies of scale at the industry level. D3. [Industry-Level Supply Elasticity] γi,k ≡ ∂ lnCi,k(.)/∂ ln Qi,k

The above definition is motivated by the observation that Ci,k(..., Qi,k) characterizes the industry-level supply curve for country i. To gain intuition about how this elasticity relates to diseconomies of scale, consider the case where the share of the industry-specific capital in production is constant and equal to βi,k. In that case, it is straightforward to show that γi,k = βi,k/(1 − βi,k). Beyond this special, γi,k still reflects the importance of the inelastically-supplied capital in production. As we will see shortly, γi,k also governs the degree of national-level import market power in each industry.

###### 2.3 Trade Policy Instruments

The government in country i has access to a full set of industry-level export taxcum-subsidy instruments (denoted by x) and import tax-cum-subsidy instruments (denoted by t). Together, these policy instruments create a wedge between the after-tax price, p ̃ji,k, and the producer price, pji,k, of each good ji, k as follows:

p ̃ji,k = 1 + tji,k 1 + xji,k pji,k. (3)

In the above equation, tji,k denotes the import tax applied by country i on good ji, k, while xji,k denotes the export tax applied by country j on the same good. The combination of these tax instruments raises the following tax revenue for the gov-

ernment in country i:

Ri = ∑

##### ∑

j∈C

k∈K

tji,k 1 + xji,k pji,kqin,k + xij,kpij,kqij,k . (4)

Tax revenues are rebated to the consumers in a lump-sum fashion.10 Throughout this paper, we assume that domestic policies are unavailable, which amounts to tii,k = xii,k = 0 for all i and k. As noted earlier, our product space is general enough that k can index only the final good version or the intermediate input version of each industry's output. So, in principle, trade taxes can arbitrarily discriminate between final consumption and intermediate input varieties of the same good.

###### 2.4 General Equilibrium

We assume throughout this paper that equilibrium is unique—noting that uniqueness can be formally established using the procedure in Alvarez and Lucas (2007). Below, we formally define the general equilibrium in our setup.

Definition. For any given vector of taxes, t, and, x, equilibrium is a vector of wages, w, (pre-tax) producer and (after-tax) final prices, pi, and p ̃i, final consumption and input demand choices, qiC and qiI, total surplus paid to industry-specific capital, Π, net consumer expenditure, Y, and gross industry-level output, Y, such that (i) the producer price for each good is characterized by Equation 2; (ii) the consumer price for each good is given by Equation 3; (iii) consumption choices, qiC = Di (Yi, p ̃i), are a solution to 1; (iv) demand for intermediate inputs, qiI = ∑k D ̃i(Yi,k, p ̃iI), is chosen to minimize cost; (iv) factor markets clear

pij,kqij,k −∑

##### ∑

wiLi +Πi = ∑

##### ∑

pI

ji,kqI

ji,k ∀i,

j

j

k

k

and (v) net consumption income equals factor income plus tax revenue

Yi = wiLi + Πi + Ri ∀i,

where the tax revenue, Ri, is given by Equation 4.

To streamline the presentation of our theory, we henceforth express aggregate

10Since labor is inelastically supplied in our framework, lump-sum rebates are observationally equivalent to a wage subsidy or a uniform consumption subsidy.

welfare in country i as a function of taxes t, and, x, and wages, w,

Wi (ti, xi; t−i, x−i, w) ≡ Vi (Yi(ti, xi; t−i, x−i, w), p ̃i(ti, xi; t−i, x−i, w)) .

w is an equilibrium outcome and, thus, an implicit function of trade taxes, So, we use A to denote the set of all wage×policy combinations, A = (t, x; w), that are feasible. The reason we express Wi as a function of (t, x; w) rather than just trade taxes is that an across-the-board shift in trade taxes combined with an equalproportional adjustment to nominal wage rates preserves welfare. Expressing equilibrium outcomes in terms of the triplet (t, x; w) enables us to track this kind of tax neutrality.

In equilibrium, the importance of each good for taxation purposes is determined (among other things) by its share in gross expenditure and output. So before concluding this section, it is useful to define the aforementioned shares for good ji, k (origin j–destination i–industry k). D4. [Gross Expenditure and Output Shares] [within-industry expenditure share] λji,k ≡ ∑ȷ∈p ̃Cji,pk ̃qȷiji,k,kqȷi,k [overall expenditure share] λ ́ ji,k ≡ ∑ȷ∈C∑p ̃jig∈,kKqjip ̃,kȷi,gqȷi,g. [within-industry output share] rji,k ≡ ∑ι∈pCji,pkqjιji,k,kqjι,k [overall output share] r ́ji,k ≡ ∑ι∈C∑pjig∈,kKqjip,kjι,gqjι,g

Gross expenditure and output shares, as defined above, are directly observable in standard trade datasets. As we well see shortly, optimal trade taxes can be fully characterized in terms of these observable shares plus input-output shares and the reduced-form demand and supply elasticities defined earlier.

3 Unilaterally Optimal Trade Taxes

We begin our analysis by characterizing the unilaterally optimal tax schedule in each country given applied taxes in the rest of the world. Since our analysis precludes political economy motives, the aforementioned taxes are purely terms-of trade-driven. So, following Bagwell and Staiger (1999), they govern the efficiency loss that free trade agreements are designed to remedy.11

11Our analysis fits into the terms-of-trade framework, wherein trade agreements "remedy the inef-

For expositional purposes, we cast our theory first using a two-country setup where h indexes the Home country and f indexes Foreign that represents an aggregate of the rest of the world, i.e., C = {h, f}. Later, in Section 4.2 we show how our baseline results extend to a multi-country setup with arbitrarily many countries. The Home government's optimal non-cooperative policy, t∗h ≡ {t∗fh,k}, and x∗h ≡ {xhf,k}, solves the following problem taking the vector of Foreign taxes, tf ≡ {thf,k}, and xf ≡ {xfh,k}, as given:

max

(th,xh;tf,xf,w)∈A

Wh th, xh; tf, xf, w . (5)

The above problem is complicated by a myriad of general equilibrium interrelations. We can nonetheless simplify the problem by appealing to several intermediate results, the first of which is the Lerner symmetry.12

###### Lemma 1. [The Lerner Symmetry] For any a ∈ R+, combinations A =

1 + th,1 + xh; tf, xf, wh, wf and A′ = a(1 + th), (1 + xh)/a; tf, xf, awh, wf represent identical equilibria, i.e., Wi(A) = Wi(A′) for all i .

The above lemma simplifies our analysis as follows: Optimal import and export taxes both feature a uniform term that accounts for the ability of trade taxes to increase Home's wage relative to Foreign (wh/wf). Following the above lemma, we need not to formally characterize this term as it is redundant. There is another way to cast this redundancy: When both export and import taxes are applicable, Lemma 1 indicates that we can normalize wages in both economies (i.e., set wh = wf = 1) and still identify one of the multiple optimal tax combinations. This result, however, follows only if a full set of industry-level export and import tax instruments are applicable. If the policy space is restricted in any way, the uniform tax component that accounts for the general equilibrium changes in wh/wf should be formally characterized—see Section 3.1 for further details.

In our general equilibrium setup, trade taxes have a non-trivial passthrough

ficient terms-of-trade-driven restrictions in trade that arise when trade policies are set unilaterally, (Bagwell and Staiger (1999))" Based on this viewpoint, the value of free trade agreements can be measured as the reduction in trade that would otherwise occur if countries were imposing terms-of-trade-driven restrictions in trade. See also Maggi and Rodriguez-Clare (1998); ? for a domestic-commitment theory of trade agreements.

12See Costinot and Werning (2019) for a more comprehensive treatment of the Lerner symmetry in the presence of multinational firms, global imbalances, and imperfect competition.

onto the vector of final prices, p ̃i, in each country. A tax on one good can alter the entire vector of prices through its effect on country-level wages, input prices, and industry-wide output. To handle these complex interrelations, we can cast our original optimal policy problem as one where the government directly chooses final prices, p ̃ fh, p ̃hf, and p ̃hh, to maximize Home's welfare Wh (p ̃; w) ≡ Vh (Yh(p ̃; w), p ̃). Stated formally, the optimal policy problem can be reformulated as

Wh (p ̃; w) ,

max

(p ̃;w)∈A ̃

where A ̃ denotes the set of feasible wage-price combinations that is defined analogously to A. Implicit in the above formulation is the observation that (i) Wh(.) does not explicitly dependent on p ̃ f f, and (ii) the market equilibrium is efficient, so it is optimal to set p ̃hh = phh. The above problem, as a result, corresponds to one where the government chooses the consumer-to-producer price wedges that pin down the trade taxes: 1 + th = p ̃ fh/(1 + xf)pfh and 1 + xh = p ̃hf/(1 + tf)phf.

Finally, we can appeal to supply- and demand-side envelop conditions to handle general equilibrium behavioral responses. On the demand side, we can appeal to Roy's identity, whereby the direct welfare effect of a change on consumer price, p ̃Cfh,k, is reduced to ∂Vh(.)/∂p ̃Cfh,k = −qCfh,k. Similarly, we can appeal to Shepard's lemma to account for input-output-driven price linkages. Specifically, a change in input price p ̃Ili,g, holding the wage and all other input prices fixed, has the following effect on the output price of a cost-minimizing supplier:

∂ ln pij,k ∂ ln p ̃Ili,g

∂ lnCi,k(.) ∂ ln p ̃Ili,g

= αil,,kg,

=

where αlj,,kg denotes the share of input li, g in output ij, k as defined by D3. The change in p ̃Ili,g has an additional general equilibrium effect, which operates through changes in w, Y, and Y. As noted earlier, though, the handling of these general equilibrium effects can be simplified with the application of Lemma 1.

Trade taxes also affect, Πi, which is the surplus paid to industry-specific capital in country i. To track this effect, we can appeal to Hotelling's lemma, whereby the effect of a change in output prices on surplus (holding all input prices constant) can be stated as: ∂Πi(.)/∂pij,k = qij,k. Likewise, the effect of a change in the price of intermediate input ji, k (holding the price of all other inputs fixed) can be expressed

as: ∂Πi(.)/∂pIji,k = qIji,k.

By combining the aforementioned envelope conditions and appealing to Lemma 1, we can produce the following theorem that characterizes the optimal policy as a function of gross expenditure shares, λ, gross revenue shares, r, global input-output shares, α, reduced-form demand elasticities, ε, and reduced-form supply elasticities, γ. The former three statistics are directly observable, while the latter two can be locally estimated.

  • Theorem 1. Home's unilaterally optimal trade taxes are unique up-to a uniform tax shifter, t, and given by ̄

γf,krfh,k 1 − γf,krf f,kεf f,k

###### (1 + t ̄)

1 + t∗

fh,k = 1 +

εhf,k 1 + εhf,k + ξhf,k − ∑g αhf,,kgλr ́ ́fh,g

###### (1 + t ̄)−1

###### 1 + x∗

hf,k =

hf,k

where ξhf,k k = Ξ−1 − IK Ω accounts for input-output adjusted cross-demand effects between industries, with Ξ ≡

λ ́ hf,gεhfhf,,kg λ ́ hf,kεhf,k

and Ω ≡ 1 − ∑g αhf,,kgλr ́ ́fh,g

.

hf,k k

k,g

Foreign's unilaterally optimal taxes are, analogously, described by a similar formula that swaps the country indices. As noted earlier, per the Lerner symmetry, the unilaterally optimal policy is unique only up to a uniform tariff, t ̄. For a highenough choice of t ̄, the optimal policy will consist of import taxes paired with export subsidies. Aside from the uniform tax shifter, Home's unilaterally optimal import tax, t∗fh,k, equals the optimal industry-level mark-down on pfh,k (i.e., the inverse of the export supply elasticity). Home's unilaterally optimal export tax, xhf∗ ,k, is equal the optimal monopoly mark-up on p ̃hf,k that internalizes cross-demand effects and tax propagation through the input-output network. In the special case with zero cross-substitutability between industries (i.e., Ξ = IK ⇐⇒ ξhf,k = 0) and no input-output networks (i.e., αhf,,kg = 0), the optimal export tax reduces to the familiar single-product optimal monopoly markup, 1 + xhf∗ ,k = εhf,k/ 1 + εhf,k .

Importantly, the optimal tax schedule specified by Theorem 1 internalizes For-

eign's applied taxes, tf, and xf. For instance, an increase in Foreign's import taxes will shrink Home's exports to Foreign. This effect will reflect itself as a reduction

in εhf,k (and therefore xhf∗ ,k) if the demand for variety hf, k is sub-convex. We can,

thus, express t∗fh,k and xhf∗ ,k as an implicit function of Foreign's applied taxes. We will invoke this property in Section 5, where we solve for Nash taxes in the noncooperative equilibrium.

Theorem 1 has an attractive feature: It characterizes the optimal policy in terms of estimable or observable sufficient statistics. In the words of Piketty and Saez (2013), such sufficient statistic formulas have two broad merits. First, they allow us "to understand the key economic mechanisms behind the formulas." Second, they "are also often robust to changing the primitives of the model." In the present context, the formula characterized by Theorem 1 can be empirically evaluated with readily-available trade statistics. Given these qualities and as shown later in Section 6, Theorem 1 streamlines the quantitative analysis of trade policy to a great degree.

There is a simple intuition for why optimal import taxes (unlike export taxes) do not explicitly depend on global input-output shares. From the perspective of the Home country, trade taxes can correct two terms-of-trade-related distortions:

  • i. Unexploited import market power, which concerns the unexploited ability of the Home economy to charge a mark-down on Foreign producer prices, pfh,k.
  • ii. Unexploited export market power, which concerns the unexploited ability of the Home economy to charge a mark-up on Foreign consumer prices, p ̃hf,k.

By the targeting principle, industry k's optimal import tax, t∗fh,k, is targeted exclusively at lowering pfh,k (Margin 1). For this reason, t∗fh,k does not explicitly depend on input-output shares. To make this point, consider the following thought experiment: Fix the price of all of Home's export goods, i.e., {p ̃hf,k}, as these prices can be directly manipulated with export taxes. In that case, tfh,k cannot affect pfh,k through its effect on the price of good fh, k's inputs—these prices are pegged to {p ̃hf,k}. Instead, tfh,k can only lower pfh,k by either shrinking Foreign's output in industry k or by lowering wf. Both of these effects operate independently of the input-output network.

Based on the same rationale, optimal export taxes depend explicitly on global input-output shares. An export tax on intermediate input, hf, k, will raise the price of any Foreign-produced good employing that input, including goods that are sold back to the Home country. That is, An export tax on intermediate inputs is partially passed back to Home consumers through re-importation. To mitigate these

adverse feedback effects, the optimal export tax is lower on more upstream industries where re-importation of export taxes is more of an issue. This claim is implicit in the formula under Theorem 1, since in a vertical economy

 

r ́fh,g λ ́ hf,k ∈

{0} if fh, k is a final good (0,1) otherwise

αhf,,kg

##### ∑

.

g

That export taxes are lower on upstream industries, however, should not be confused with smaller gains from taxation. On the contrary, the unilateral gains from taxing upstream exports are actually larger. We formally document and discuss this point in Section 6.

Theorem 1 indicates that import taxes are a necessary instrument only if there are diseconomies of scale arising from industry-specific capital. Otherwise, if γf,k ≈ 0, Home can discard import taxes by choice of t ̄ = 0, and attain the first-best non-cooperative outcome with only export taxes. The following corollary synthesizes this claim and others discussed earlier.

Suppose we calibrate

  • Corollary 1. Controlling for λ and r, the globalization of value chains lowers the unilaterally optimal export tax on more-upstream industries. Moreover, absent diseconomies of scale, import taxes remain a redundant terms-of-trade-improving policy instrument even after countries become submerged in global value chains.

The globalization of value chains, in the above corollary, corresponds to a counterfactual thought experiment where λ and r are fixed, but α is elevated from zero to its factual level. This experiment accords with the quantitative exercise in Section 6, where the model is calibrated to data on λ and r with and without accounting for global value chains. From the lens of that exercise, Corollary 1 indicates that overlooking global value chains will lead to an over-estimation of non-cooperative export taxes in upstream industries.

Finally, the absence of diseconomies of scale is a sufficient but not necessary condition for the redundancy of import taxes. Import taxes can be redundant under much weaker conditions. For instance, suppose Foreign employs industryspecific capital but Home accounts for a small share of global demand for the Foreign industry. Then, rfh,k ≈ 0 and import taxes are once again redundant. The aforementioned situation provides an accurate description of any individual coun-

try's position relative to the rest of the world. It also aligns with the complete tariff passthrough documented by Amiti et al. (2019) and Fajgelbaum et al. (2020).

###### 3.1 Second-Best Non-Cooperative Import Taxes

We now consider a scenario that has received considerable attention in the prior literature. In this scenario, governments cannot apply export taxes due to institutional constraints like those embedded in the United States' constitution. But they have the discretion to apply import taxes. We show that, in such circumstances, it is unilaterally optimal for governments to use import taxes as a second-best instrument to manipulate export market power. To make this point succinctly, we abstract from diseconomies of scale and cross-industry demand effects; but these channels are formally accounted for in Appendix C.

In a second-best scenario where the government cannot apply export taxes, Home's unilaterally optimal import taxes solve the following problem (taking tf and xf as given):

max

Wh th,0; tf, xf, w .

(th,0;tf,xf,w)∈A

Since export taxes are restricted to zero, i.e., xh = 0, the Lerner symmetry no longer implies a multiplicity of optimal tax schedules. Instead, the above problem identifies a unique vector of optimal import taxes. Since the Home economy possesses no import market power by assumption (i.e., γf,krfh,k ≈ 0), second-best import taxes pursue one objective: to indirectly manipulate Home's export market power.

Export market power manipulation can be carried through two distinct chan-

nels: First, import taxes can raise Home's wage relative to Foreign, wh/wf (i.e., they can charge a markup on the wage rate embedded in Home's exports). Sec-

ond, import taxes can be used to charge a markup on p ̃hf,k through the inputoutput network. The former channel was previously-redundant due to the Lerner symmetry-driven multiplicity of optimal taxes. The latter channel was also irrelevant due to the targeting principle.

To account for the now-relevant general equilibrium wage effects, we use Lij (t, x; w) to denote country j's demand for country i's labor. The elasticity of Lij with respect to wi depends on the value-added content of sales, which determines the overall contribution of country i's labor to gross revenue in each industry. We can measure the value-added content of country i's output in industry k, namely,

δi,k, using the input-output matrix. Specifically, let

αji ≡ αij,,kg

k,g

denote the K × K input-output matrix that describes the share of country j's inputs used in country i's outputs. We can apply the Implicit Function Theorem to calculate δi ≡ [δi,k]k as follows:

δi = (IK − αii)−1 α ̄i,

where α ̄i ≡ [α ̄i,k]k is a K × 1 vector denoting industry-level labor shares in country i . Using the above notation, we can calculate the elasticity of labor demand in

terms of gross reduced-form demand elasticities, εij,k, and value-added shares, δi,k.

D4. [Elasticity of Labor Demand] ε ̄ij ≡ ∂∂lnlnLwiji = ∑k ωij,kεij,k, where ωij,k ≡ ∑δgj,δkjλ ́,kijr ́,jik,k is a weight that reflects the value-added contribution of good ij, k to country i's exports.

In principle, ε ̄ij measures country i's economy-wide export market power net of input-output linkages. As noted by the following Theorem, Home's optimal import tax in each industry is determined by ε ̄hf and an industry-specific term that accounts for the ability of import taxes to manipulate industry-specific export market power through the input-output network.

  • Theorem 2. In a second-best scenario where export taxes are inapplicable, unilaterally optimal import taxes feature of a uniform component and an industry-specific competent that captures the extent to which import taxes are passed on to Foreign consumers. Namely,

ε ̄hf 1 + ε ̄hf

###### 1 + t∗

fh,k =

τk εfh,k

1 +

,

where τ ≡ [τk] is given by

 

 1k=g +

−1

εhhfh,,gg εfh,k

λhf,g rfh,g

εhf,g ε ̄hf

α ̃hf,,gk

α ̃hf,,gk

α ̃ ff,,kg − [1 +

##### ∑

,

]

τ =

g∈K

k

k,g

= IK − αfh −1 αfh. Correspondingly, τk = 0 for import goods that are not used as intermediate inputs in export goods and τk 0 otherwise.

= IK − αf f −1 αfhαhf, and α ̃hf,,gk

with α ̃ ff,,kg

k,g

k,g

The uniform term, ε ̄hf/(1 + ε ̄hf), in the above formula, corresponds to an optimal export mark-up on the wage rate, wh. To elaborate, a uniform import tax is equivalent to a uniform export tax, which is itself analogous to a uniform markup

applied to the wage rate, wh, in Foreign markets. The optimal level of this markup is proportional to the elasticity of Foreign demand for Home's labor, ε ̄hf.13

The industry-specific component, 1 + τk/εfh,k, accounts for the fact that an import tax on intermediate inputs is partially passed on to Foreign consumers. So, such a tax can be used to imperfectly mimic (the unavailable first-best) export taxes. To see this point from the lens of Theorem 2, note that the optimal import tax on good fh, k depends primarily on α ̃hf,,gk, which measures the degree to which the tax is passed on to Foreign consumers. By construction,

α ̃hf,,gk = αfh + α2fh + α3fh + ...

which implies that if (i) industry k is a strictly downstream industry in a vertical economy, or (ii) the inputs produced by industry k are never used in exporting industries, then α ̃hf,,gk = α ̃ ff,,kg = 0 and, as a result, τk = 0. Plugging this value back into Theorem 2, the optimal import tax on such industries is uniform and equal to t∗fh,k = ε ̄hf/(1 + ε ̄hf). The intuition is that an import tax on the aforementioned industries is never passed on to Foreign consumers beyond general equilibrium wage effects. So, there is no rationale for taxing such imports beyond what is justified by the flat wage markup.

If industry k supplies intermediate inputs to differentiated and export-oriented

industries, then α ̃hf,,gk > 0, which implies that τk/εfh,k > 0.14 Plugging this value into Theorem 2 indicates that industry k imports should be taxed above the uni-

form rate. The reason is that an import tax on intermediate inputs is partly passed on to Foreign consumers, so tfh,k can be used to extract an additional markup,

13To gain further perspective, consider a basic CES gravity model without input-output linkages

(as presented in Section 4.1). In this widely-used framework, α ̃if,,gk = 0 and δi,k = 1, which implies a uniform optimal import tax across all industries:

r ́hf,k r ́hf

t ̄∗

fh,k = 1/∑

εkλf f,k ,

k

with εk denoting the trade elasticity in industry k. The above formula itself strictly generalizes the optimal single-industry tariff formula in Gros (1987) to many asymmetric industries.

14This result follows from α ̃hf,,gk > α ̃ ff,,kg ≈ 0.

τk/εfh,k, from Foreign consumers besides the flat wage markup, ε ̄hf/(1 + ε ̄hf). In other words, a tax on imported intermediates can be used to (imperfectly) mimic the first-best export taxes.15 The trade-off is that taxing intermediate inputs distorts production decisions in a way that is avoidable with first-best export taxes.

In summary, these arguments indicate that the globalization of value chains creates an additional incentive for import taxation in second-best scenarios. More specifically, if value chains become globalized and export taxes are inapplicable, it is unilaterally optimal to increase the import tariff on more-upstream industries. The following corollary synthesizes these arguments.

  • Corollary 2. Suppose export taxes are inapplicable: Controlling for λ and r, the globalization of value chains increases the unilaterally optimal tariff on upstream industries that supply intermediate inputs to export-oriented sectors. The globalization of value chains, however, has no effect on the unilaterally optimal tariff in industries that supply final goods or intermediate goods that are never used in export-oriented sectors.

The above corollary raises a basic question: How much are governments willing to raise the tariff on intermediate inputs to mimic export taxes? Answering this question is ultimately a quantitative matter, which is explored in Section 6. But we can gain valuable insights by approaching the question theoretically. If imported goods are re-exported without being processed, then an import tax on these goods is identical to an export tax. So, the globalization of value chains prompts a sharp increase in optimal import taxes.

Beyond this extreme case, import taxes on intermediate inputs are an extremely inefficient substitute for export taxes: First, they directly distort production-related choices in the Home economy. Second, they may be partially passed on to domestic consumers. Accordingly, the effectiveness of second-best trade taxes is hampered by an innate trade-off between preserving allocative efficiency in the local economy and exploiting export market power through the input-output network. The

15Aside from α ̃hf,,gk, the effectiveness of tfh,k at mimicking first-best export taxes is governed by two additional factors: (a) the degree to which intermediate input variety fh, k is substitutable with other inputs (i.e., εfh,k in the formula specified by Theorem 2); and (b) the intensity at which input fh, k is used in highly-differentiated export-oriented industries (i.e., α ̃hf,,gkλhf,gεhf,g in the formula specified by Theorem 2). Specifically, t∗fh,k is higher if intermediate input fh, k is low-demand elastic and is used more-intensively in highly-differentiated, export-oriented industries. The following corollary summarizes these arguments.

former discourages taxing differentiated intermediate inputs, while the latter requires it. As a result, for all practical purposes, the globalization of value chains should have a modest effect on non-cooperative import taxes.

4 Canonical Special Cases and Extensions

In this section, we first highlight two widely-used quantitative trade models that are nested by our theory. We then demonstrate how our baseline results readily extend to a multi-country setup.

###### 4.1 Canonical Special Cases

To highlight the practicality of our optimal tax formula, we outline two canonical special cases: First, a basic multi-industry gravity model without input-output linkages à la Costinot et al. 2011. Second, a multi-industry gravity model with flexible input-output linkages à la Caliendo and Parro (2014).

  • (i) Basic Multi-Industry Gravity Model (Costinot et al. 2011). This model corresponds to a special case of our framework where labor is the only factor of production and preferences have a Cobb-Douglas-CES parameterization: Ui = ∏k Qie,ik,k,

1/ρk

where Qi,k = ∑j=h,f χji,kqρjik,k

. The Cobb-Douglas-CES demand structure implies that εhf,k = −1 − εkλf f,k, where εk ≡ ρk/ (1 − ρk). The Cobb-Douglass assumption also eliminates cross-price elasticity effects, which amounts to ξhf,k = 0. The absence of input-output networks implies that αhf,,gk = 0 for a k and g. Plugging these values into the optimal tax formula specified by Theorem 1 yields the following:

fh,k = 1 + t ̄ 1 + x∗

1 + t∗

1 εkλf f,k

hf,k = 1 +

(1 + t)−1. (6)

Stated verbally, the optimal trade tax consists of a uniform tariff, t ̄, and an industryspecific export tax that varies primarily with the industry-level trade elasticity, εk. If the economy is modeled as a single industry, the above formula reduces to xhf∗ =

1/ελf f, which by the Lerner symmetry is equivalent to Gros' (1987) optimal tariff formula, t∗fh = 1/ελf f.

  • (ii) Multi-Industry Gravity Model with I-O Linkages (Caliendo and Parro (2014)) This model features the same CES-Cobb-Douglas utility function described above. Production, though, combines intermediate inputs and labor using a CobbDouglas-CES aggregator, which implies that

pij,k = wiα ̄i,k ∏

P ̃iα,gi,gk,

g

where αi,kg denotes the constant share of industry k's inputs in industry g's output, with α ̄i,k ≡ 1 − ∑g αi,gk. P ̃i,g denotes the (after-tax) price index of industry g's composite intermediate input, which is by assumption identical to the price in-

1/(1−εg)

dex of industry g's composite consumption good: P ̃i,g = ∑j p ̃1ji−,gεg

. The Cobb-Douglas-CES demand for the consumption variety of good ji, k is qCji,k =

p ̃ji,k/P ̃i,k −εk ei,kYi. The demand for the intermediate input variety of good ji, k is given by qIji,k = p ̃ji,k/P ̃i,k −εk ∑g αi,kgYi,g.

We can apply Theorem 1 along with the following steps to determine the optimal trade tax in this setup: Since there are no industry-specific factors of production, we can set γf,k = 0 for all k. The Cobb-Douglas assumption meanwhile ensures that ξhf,k = 0. The within-industry CES demand system implies that εhf,k = −1 − εkλf f,k. The assumption that the demand aggregator for final and intermediate goods are identical, entails that αhf,,kg = αf,kgλhf,k. Plugging these values into Theorem 1 yields the following formula for optimal trade taxes in the Caliendo and Parro (2014) model:

###### fh,k = 1 + t ̄ 1 + x∗

1 + t∗

1 + εkλf f,k εkλf f,k − ∑g αf,kgrfh,g

hf,k =

(1 + t)−1 . (7)

In-line with our earlier discussion, the optimal export tax implied by the above formula is strictly lower than that implied by the baseline gravity model (Equation 6). However, the gains from optimal trade taxation are higher relative to the baseline model. This point is quantitatively established in Section 6 where we calibrate the

above formulas to actual data.

###### 4.2 Extension to a Multi-Country Setup

We now extend our optimal tax formulas to a multilateral setup with N > 2 countries. The optimal policy problem in this setup is a basic extension of 5: Country

  • i chooses (N − 1) K import tax instruments, ti = {tji,k}, and (N − 1) K export tax instruments, xi = {xij,k}, given the vector of applied 'taxes in the rest of the world, t−i, and x−i . 16 We can follow the same steps outlined earlier to derive sufficient statistics formulas for optimal trade taxes in this setup (see Appendix D):

γj,krji,k 1 − ∑ι i γj,krjι,kεjι,k

###### (1 + t ̄)

1 + t∗

ji,k = 1 +

εij,k

(1 + t ̄)−1. (8)

###### 1 + x∗

ij,k =

1 + εij,k + ξij,k − ∑g ∑ȷ i ΛΛ ̃ ȷi,g

α ̃ij,ȷk,g

ij,k

In the above formula, ξij,k accounts for cross-industry demand effects and is defined analogously to the cross-demand term specified under Theorem 1;17 Λȷi,g ≡ pji,gqji,g/ ∑ȷ i ∑k pȷi,kqȷi,k and Λ ̃ ij,g ≡ p ̃ij,gqij,g/ ∑ȷ i ∑k p ̃iȷ,kqiȷ,k denote import and export shares; and α ̃ij,ȷk,g = αijαjȷ if ȷ j, with α ̃ijj,k,g = αij.

The optimal tax formula specified above differs from the baseline formula (presented under Theorem 1) in one basic aspect. The optimal export tax accounts for the fact that a tax on any individual export variety may be passed back to domestic consumers through multiple partners. The term α ̃ij,ȷk,g, in the denominator, accounts for the extent to which xij,k is passed back to domestic consumers through country ȷ's export of industry g goods. The higher the α ̃ij,ȷk,g's from various ȷ's, the lower the optimal export tax. The same formula also indicates that the burden of an export tax on one partner may be borne primarily by a third country that is not directly involved in the export transaction.

This subtle difference notwithstanding, the claims presented under Corollaries

  • 16An added complication here is that a tax on good ji, k can alter the entire vector of nationallevel wages {w1,...wN}. This complication can be handled by noting that tji,k's effect on Wi has a first-order component that operates through changes in wj/wi and a second-order component that operates through changes in wȷ/wi (where j j)—see Appendix D.
  • 17Stated formally, ξij,k is given by ξij,k

  1 − ∑g ∑ȷ ΛΛ ̃ ȷi,g

  λ ́ij,gε

###### −1

ij,k ij,g

α ̃ij,ȷk,g .

− IK

=

λ ́ ij,kεij,k

k

ij,k

k,g

1 and 2 readily extend to the multi-country case. As before, the optimal import taxes do not explicitly depend on the global input-output shares, which is a manifestation of the targeting principle. Absent diseconomies of scale, import taxes are also uniform and, therefore, redundant under the first-best scenario.18 We also demonstrated how the characterization under Theorem 1 readily extends to a multi-country setup.

5 The Non-Cooperative Trade Equilibrium

As noted earlier, the value of free trade agreements (FTAs) can be measured as the prevented welfare loss that would otherwise occur if countries adopted noncooperative trade taxes. In this section, we discuss how our earlier results can help characterize these losses. We do so by solving the non-cooperative tax equilibrium that will ensue after the dissolution of trade agreements.

Theorem 1 described Home's optimal non-cooperative tax schedule given the vector of taxes in the rest of the world. Section 4.2 extended this characterization to a setup where Country i sets optimal non-cooperative taxes against arbitrarily many trading partners. These taxes are characterized by Equation 8, and are an implicit function of applied taxes in the rest of the world. To formalize this dependence, let t∗ji,k = Tji,k(t−i, x) and xij∗,k = Xij,k(t, x−i) denote Country i's optimal non-cooperative trade taxes as a function of applied taxes in the rest of the world (i.e., t−i and x−i). If trade agreements disintegrate and all countries set their trade taxes non-cooperatively, the counterfactual vector of trade taxes in the noncooperative Nash equilibrium can be determined using the following system:19

x1∗j,k = X1j,k(t∗, x∗−1) t∗j1,k = Tj1,k(t∗−1, x∗)



. (9)

. x∗Nj,k = X1N,k(t∗, x∗−N) t∗jN,k = TjN,k(t∗−N, x∗)



  • 18The uniformity of optimal tariffs across industries is independent of the first-order approxi-

mation discussed in Footnote 10—see Appendix D for details.

  • 19This approach is analogous to Grossman and Helpman's (1995)approach to characterizing

Nash tariff in a partial equilibrium model with lobbying.

Let t and x denote the factual vector of trade taxes, which are the outcome of existing agreements. The value of trade agreements for country i can, thus, be measured as the welfare change when moving from applied trade taxes to Nash trade taxes:

Value of FTAsi = Wi(t, x, w)/Wi(t∗, x∗, w∗).

To take stock, we first derived sufficient statistics formulas for unilaterally optimal trade taxes. These formulas determine the inefficient terms-of-trade-driven restrictions on trade that will occur if trade agreements were to dissolve. With knowledge of these counterfactual trade restrictions, we can then measure the deadweight loss associated with unilateralism and the value of trade agreements that remedy these losses. Before mapping out theory to data, two additional remarks about this approach are in order.

The Role of Political Economy Considerations. The Nash taxes implied by System 9, do not necessarily coincide with the tax levels that will prevail without agreements. Instead, the counterfactually applied taxes will reflect the sum of politically-driven and terms-of-trade-driven restriction on trade (Grossman and Helpman (1995)). As argued by Bagwell and Staiger (1999), however, the politically optimal trade taxes are Pareto efficient. So, the only inefficiency trade agreements correct are terms-of-trade-driven restrictions. These are the exact restrictions identified by System 9, which also motivates the expression for "Value of FTAsi."

Sequential Trade Policy Game. In System 9, the best response of each country is derived taking taxes in the rest of the world as given. This approach is perhaps appropriate if non-cooperative taxes are adopted simultaneously. Alternatively, consider a Stackelberg game where one country is sufficiently large compared to the rest of the world and has a first-mover advantage. In that case, the first-mover may strategically choose non-uniform tariffs to influence export taxes in the rest of the world. This situation is conceptually interesting, but not representative. As we will see in the next section, most countries are effectively small open economies vis à vis the rest of the world.

6 Quantitative Analysis

In this section, we map our theoretical model to global trade and production data. Our analysis pursues two distinct objectives. First, we want to measure the consequences of unilateralism. Second, we wish to quantify the value of existing trade agreements and determine what fraction of it is driven by global value chains. In this process, we also highlight how our analytic tax formulas streamline the computational analysis of trade policy.

###### 6.1 Mapping Non-Cooperative Tax Formulas to Data

Below, we discuss how the non-cooperative optimal tax formulas specified under Theorem 1 can be mapped to data. With the aid of these formulas, the gains from trade agreements or the consequences of unilateral policies can be computed using a system of equations that depend on observables and trade elasticities. We demonstrate this point first with a baseline model that overlooks global inputoutput networks. We then move on to our main model that formally accounts for global input-output networks. Comparing the predictions of these two models will helps us isolate the role of value chains.

Baseline Model without Input-Output Networks. We first consider the basic multi-industry gravity model without input-output networks as outlined in Section 4.1. To present our methodology succinctly, we focus our presentation here on the two-country model. The multi-country implementation is presented in Appendix E. Recall that the baseline gravity model features a Cobb-Douglas utility aggregator across industries paired with a CES demand aggregator within industries. This preference structure implies that V Yi, P ̃i = Yi/P ̃i, where the aggregate price index in economy i ∈ C is given by

P ̃i = ∏

k∈K

χji,kp ̃−ji,εkk

j∈C

−ei,k/εk

. (10)

In the above formulation, χji,k accounts for policy invariant taste shifters and p ̃ij,k = (1 + tij,k)(1 + xij,k)τij,kai,kwi. Consider a counterfactual policy change, whereby the vector of trade taxes changes from its applied rate {tij,k}, and {xji,k}

to a counterfactual rate, {tij′ ,k}, and {x′ji,k}. In response to this policy change, let z′ denote the counterfactual value of any variable z and let zˆ = z′/z denote the corresponding change in that variable. The CES demand structure implies that

Pˆ ̃i,k = ∑

j

w ˆj( 1 + tji,k)( 1 + xji,k)

−εk

λji,k

−1/εk

where 1 + tji,k ≡ (1 + t′ji,k)/(1 + tji,k) and 1 + xij,k = (1 + xij′ ,k)/(1 + xji,k) denote the change in import and export taxes. The change in trade shares is then given by

−εk

( 1 + tji,k)( 1 + xji,k)wˆj

λˆ ji,k =

−εk . (11)

∑n λni,k ( 1 + tni,k)( 1 + xni,k)wˆn

The new vector of wages and income levels in the counterfactual equilibrium are determined by the labor market clearing condition:

wi′Li = ∑

##### ∑

j

k

λij′ ,k (1 + xij′ ,k)(1 + tij′ ,k)

ej,kYj′

and the condition that total income equals wage income plus tax revenue:

Yi′ = wi′Li +∑

##### ∑

j

k

t′ji,k 1 + t′ji,k

xij′ ,k 1 + xij′ ,k

λ′ji,kei,kYi′ +

λij′ ,kej,kYj′ .

Both of the above equations can be written in terms of changes, by noting that Yi′ = YˆiYi, wi′ = wˆiwi, and λij′ ,k = λˆ ij,kλij,k. Likewise, the post-change tax rates in the above equations can be expresses as t′ji,k = ( 1 + tji,k)(1 + tji,k) − 1 and xij′ ,k = ( 1 + xij,k)(1 + xij,k) − 1.

With the above background in mind, consider our counterfactual policy experiment of interest: One in which all countries simultaneously apply their optimal non-cooperative trade taxes. In the two-country case, this amounts to Home applying t′fh,k = t∗fh,k = 0 and xhf′ ,k = x∗fh,k = 1/εkλ′f f,k, and Foreign applying t′hf,k = t∗hf,k = 0 and x′fh,k = x∗fh,k = 1/εkλ′hh,k (see Equation 6). The multi-country case (N > 2) implies analogous tax formulas that are outlined in Appendix E. Inserting these counterfactual tax changes into the equation presented earlier, we can compute λˆ ij,k, wˆi, and Yˆi, as well as the welfare consequences that result from this

policy change. The following proposition outlines this procedure.

  • Proposition 1. Suppose the observed data is generated by the baseline gravity model without input-output networks, as outlined in Section 3. The consequences of moving from status-quo to the non-cooperative trade equilibrium can be fully characterized by solving the following system of equations

###### 1 + x∗ji,k = 1 + ε 1

kλˆii,kλii,k; 1 + t∗ji,k = 1 λˆ ji,k = (1+t

−εk Pˆ ̃iε,kk

∗ ji,k)(1+x∗ji,k)

(1+tji,k)(1+xji,k) wˆj



−εk

(1+t∗ji,k)(1+x∗ji,k) (1+tji,k)(1+xji,k) wˆj

Pˆ ̃i−,kεk = ∑j

,

λji,k

λ ˆ ij,kλij,k

(1+xij∗,k)(1+tij∗,k)ej,kYˆjYj Y ˆiYi = wˆiwiLi + ∑k ∑j

w ˆiwiLi = ∑k ∑j

t∗ji,k 1+t∗ji,k

λˆ ji,kλji,kei,kYˆiYi + x



λˆ ij,kλij,kej,kYˆjYj

ij,k

1+xij∗,k

The above system solves N(K + 2) independent unknowns, namely, {x∗ji,k}, {wˆi}, and {Yˆi}, as a function of industry-level trade elasticities, εk, and the following set of observables: (i) applied trade taxes; (ii) within- and across-industry expenditure shares λji,k, and ei,k; (iii) wage income, wiLi; and (iv) total income, Yi = wiLi + Ri.

Solving the system specified by Proposition 1, determines the welfare consequences of dissolving the existing trade agreements as

−1

Y ˆi ∏k Pˆ ̃i,k

Value of FTAsi = Wˆ −1

.

###### i =

The consequences of unilateralism can also be measured using a similar approach. To that end, the system specified by Proposition 1 should be modified so that taxes in one country are assigned their unilaterally optimal rate, whereas taxes in the rest of the world are assigned their applied rate.

Importantly, the aforementioned procedure is implementable without appealing to any global optimization routine or without knowledge of structural parameters like τij,k, ai,k, or χij,k. To put our approach in perspective, compare it to the standard approach that involves a constrained global optimization subject to equilibrium constraints (Costinot and Rodríguez-Clare (2014); Ossa (2014)). The standard approach can be implemented using a nested fixed-point procedure, which

is impractical with many industries (i.e., a high K). Or, alternatively, it can be implemented using the MPEC procedure in Su and Judd (2012), which requires specialized commercial solvers like SNOPT or KNITRO to attain credible results. The procedure outlined by Proposition 1 bypasses these challenges by reducing the global optimization problem to a system of equations that is straightforward to solve.

Main Model with Input-Output Networks. Now, consider the multi-industry gravity model with input-output networks as presented in Section 4.1. In this extended model, preferences are still characterized by Equation 10; but the price of good ij, k depends on the entire vector of industry-level price indexes in country i:

p ̃ij,k = (1+tij,k)(1+xij,k)τij,kai,kwiα ̄i,k ∏

P ̃iα,gi,gk.

g

The change in expenditure shares in response to a change in trade taxes can be, accordingly, specified as

−εk

( 1 + tij,k)( 1 + xij,k)wˆiα ̄i,k ∏g Pˆ ̃iα,gi,gk

λˆ ij,k =

−εk ,

∑n λnj,k ( 1 + tnj,k)( 1 + xnj,k)wˆnα ̄n,k ∏g Pˆ ̃nα,ng,gk

where the change in price indexes are given by

###### 

  .

−εk

Pˆ ̃i−,kεk = ∑

Pˆ ̃nα,ng,gk

 1+tni,k)( 1+xni,k)wˆnα ̄n,k ∏

λji,k

g

j

In the counterfactual equilibrium that arises after the tax change, gross industrylevel revenues can be calculated as the sum of sales net of taxes:

##### Yi′,k = ∑

j

λij′ ,k (1 + xij′ ,k)(1 + tij′ ,k)

E′

j,k ,

where Ei′,k ≡ ei,kYi′ + ∑g αi,gkYi′,g denotes gross expenditure on industry k in country i. Net consumption income is equal to wage income plus tax revenues:

Yi′ = wi′Li +∑

##### ∑

j

k

t′ji,k 1 + t′ji,k

xij′ ,k 1 + xij′ ,k

λ′ji,kE′

λij′ ,kE′

j,k .

i,k +

Wages income in the above expression is itself equal to the sum of labor compensation across all industries: wi′Li = ∑k α ̄i,kYi′,k. As before we can write the above equilibrium conditions in terms of changes by noting that λ′ji,k = λˆ ji,kλji,k, wi′ = wˆiwi, Yi′,k = Yˆi,kYi,k, and Yi′ = YˆiYi.

To compute the gains from existing trade agreements, we need to apply the above procedure to a counterfactual tax change where all trading parties simultaneously apply their optimal non-cooperative taxes. In the two-country case, this amounts to Home applying 1 + xhf′ ,k = 1 + xhf∗ ,k = (1 + εkλ′f f,k)/(εkλ′f f,k − ∑g αf,kgr′fh,g) and t′fh,k = t∗fh,k = 0, and Foreign applying 1 + x′fh,k = 1 + x∗fh,k = (1 + εkλ′hh,k)/(εkλ′hh,k − ∑g αh,kgrhf′ ,g) and t′hf,k = t∗hf,k = 0 (see Equation 7). The multi-country case (N > 2) implies analogous tax formulas that are outlined in Appendix E. The following proposition summarizes these arguments.

  • Proposition 2. Suppose the observed data is generated by a multi-industry gravity model with global input-output networks, as outlined in Section 4.1. The consequences of moving from status-quo to the non-cooperative trade equilibrium can be fully determined by solving the following system of equations

1 + x∗ji,k = ε 1+εkλˆii,kλii,k

kλˆii,kλii,k−∑g αj,kgrˆij,grij,g; 1 + t∗ji,k = 1 λˆ ji,k = (1+t

−εk Pˆ ̃iε,kk

∗ ji,k)(1+x∗ji,k)

(1+tji,k)(1+xji,k)wˆαj ̄i,k ∏g Pˆ ̃iα,gi,gk

−εk

(1+t∗ji,k)(1+x∗ji,k) (1+tji,k)(1+xji,k)wˆαj ̄i,k ∏g Pˆ ̃iα,gi,gk

Pˆ ̃i−,kεk = ∑j

λji,k



r ˆji,krji,k = (1λˆ+ji,xkλ∗ji,kEˆi,kEi,k

λ ˆ jn,kλjn,kEˆn,gEn,g (1+x∗jn,k)(1+t∗jn,k) E ˆi,kEi,k = ei,kYˆiYi + ∑g ∑j αi,gk

ji,k)(1+t∗ji,k)/ ∑n

λˆ ij,gλij,gYˆj,gYj,g (1+xij∗,g)(1+tij∗,g) wiwiLi = ∑k ∑j α ̄i,k

λˆ ij,kλij,kYˆj,kYj,k (1+xij∗,k)(1+tij∗,k) YˆiYi = wˆiwiLi + ∑k ∑j

t∗ji,k 1+t∗ji,k

λˆ ji,kλji,kEˆi,kEi,k + x

ij,k



1+xij∗,k

λˆ ij,kλij,kEˆj,kEj,k

,

The above system solves N(3K +2) independent unknowns, namely, {x∗ji,k}, {Pˆ ̃i,k}, {wˆi}, {Yˆi}, and {Yˆi,k}, as a function of industry-level trade elasticities, εk, and the following set of observables: (i) gross expenditure and revenue shares, λji,k, and rij,k; (ii) industrylevel consumption shares, ei,k; (iii) wage income, wiLi; (iv) final good expenditure, Yi = wiLi + Ri; (v) gross industry-level revenue and expenditure levels, Yi,k and Ei,k; and (vi) input-output shares, αi,kg.

As with the baseline model, the system specified by Proposition 2 also determines the gains from trade agreements, which are given by Wˆ −1

i = Y ˆi/ ∏k Pˆ ̃i,k

−1

. Finally, an amended version of Proposition 2 where taxes in one country are assigned their unilaterally optimal rate, whereas taxes in the rest of the world are assigned their applied rate, can determine the consequences of unilateralism.

###### 6.2 Data Description

Our main data source is the 2014 edition of the World Input-Output Database (WIOD, Timmer et al. 2012). The WIOD database covers 56 industries and 42 countries that account for more than 85% of world GDP, plus an aggregate of the rest of the world. The countries in the sample include all 27 members of the European Union (EU) and 15 other major economies: Australia, Brazil, Canada, China, India, Indonesia, Japan, Mexico, Norway, Russia, South Korea, Switzerland, Taiwan, Turkey, and the United States—see Table 4 in the appendix for a full list of countries. Following Costinot and Rodríguez-Clare (2014), we aggregate the original 56 industries in the WIOD into 16 industrial categories, which are listed in Table 1. Finally, we make the WIOD database consistent with our theoretical model by purging it from trade imbalances. In this process, we closely follow the methodology in Costinot and Rodríguez-Clare (2014) who apply Dekle et al.'s (2007) hat-algebra methodology to purge the 2008 edition of the WIOD.20

We take data on applied tariffs, tji,k, from the United Nations Statistical Division, Trade Analysis and Information System (UNCTAD-TRAINS). The 2014 version of the UNCTAD-TRAINS reports the simple tariff line average of the effectively applied tariff (AHS) across 31 two-digit (in ISIC rev.3) sectors, 185 importers, and

20A similar approach is also applied by Ossa (2014) to eliminate trade imbalances from the GTAP database.

243 export partners. When tariff data are missing in a given year, we use tariff data for the nearest available year, giving priority to earlier years. We aggregate the UNCTAD-TRAINS data into individual WIOD industries, following the methodology in Kucheryavyy et al. (2016). Since individual EU member countries are not represented in the UNCTAD-TRAINS data during the 2000-2014 period, our analysis treats the 27 EU members as one taxing authority.

To map our model to the WIOD and UNCTAD-TRAINS datasets, we treat good ij, k as a good pertaining to WIOD industry k that is supplied by country i to market

  • j. Under this interpretation, our data reports the following information:
  • i. The gross expenditure on good ij, k: p ̃ij,kqij,k
  • ii. The applied import tax on good ij, k: tij,k
  • iii. The share of industry k inputs used in industry g's output in country i: αi,kg.

Using the above data points, we can construct the within-industry gross expenditure shares as

p ̃ij,kqij,k ∑n p ̃in,kqin,k

.

λij,k =

Similarly and assuming xij,k = 0, the within-industry gross output shares can be constructed as

p ̃ij,kqij,k/(1 + tij,k) ∑n p ̃in,kqin,k/(1 + tij,k)

.

rij,k =

The total gross output of industry k in country i can be calculated as the sum of gross sales net of taxes:

Yi,k = ∑

n

p ̃in,kqin,k/(1 + tij,k) ,

With information on gross output, country i's net spending on final goods can be calculated as the gross minus intermediate input expenditure:

##### Yi = ∑

##### ∑

j

k

p ̃ji,kqji,k −∑

##### ∑

g

k

αi,kgYi,g .

The industry-level consumption shares are, accordingly, given by

##### ei,k = ∑

j

p ̃ji,kqji,k −∑

g

αi,kgYi,g /Yi.

Finally, wage income can be calculated as the sum of labor compensation across all industries: wiLi = ∑ α ̄i,kYi,k, where α ̄i,k = 1 − ∑g αi,gk.

Estimating the Industry-Level Trade Elasticities We estimate the industry-level trade elasticities using the triple-difference estimator developed by Caliendo and Parro (2014). To present this procedure, note that the multi-industry gravity model (with or without input-output networks) predicts the following formulation for trade shares:

λij,k = Φi,kΩj,kτij−,kεk(1 + tij,k)−εk,

where Φi,k ≡ ai−,kεkwi−,kα ̄i,kεk ∏g P ̃i−,gαi,gkεk and Ωj,k ≡ ∑n τnj−,εkk(1 + tnj,k)−εkΦn,k can be viewed as exporter×industry and importer×industry fixed effects. Suppose the

iceberg trade cost, ln τij,k = ln dij,k + εij,k, is composed of two components: (i) a systematic and symmetric component, dij,k = dji,k, that accounts for the effect of distance, common language, and common border, and (ii) a random disturbance term, εji,k, that represents deviation from symmetry. Using this decomposition, we can produce the following estimating equation for any triplet (j,i, n):

(1 + tji,k)(1 + tin,k)(1 + tnj,k) (1 + tij,k)(1 + tni,k)(1 + tjn,k)

λji,kλin,kλnj,k λij,kλni,kλjn,k

+ ε ̃jin,k,

= −εk ln

ln

where ε ̃jin,k ≡ εk(εij,k − εji,k + εin,k − εni,k + εnj,k − εjn,k). Using the above estimating equation, we can attain unbiased and consistent estimates for εk under the identifying assumption that Corr(tji,k, εji,k) = 0. We estimate the above equation separately for each industry, using data on {λji,k} from the 2014 version of the WIOD and data on {tji,k} from the UNCTAD-TRAINS database. The estimation results are reported in Table 1 and broadly align with those produced by Caliendo and Parro (2014), who use data on a smaller sample of countries from 1993.

Table 1: List of industries and estimated trade elasticities.

|Number<br><br>|Description<br><br>|εk|std. err.<br><br>|Obsv.| |---|---|---|---|---|

|1<br>2<br>|Crop and animal production, hunting Forestry and logging Fishing and aquaculture Mining and Quarrying<br><br>|0.93|0.19|12,341| |---|---|---|---|---| |3|Food, Beverages and Tobacco<br><br>|0.53|0.13<br><br>|12,300| |4<br><br>|Textiles, Wearing Apparel and Leather|2.71<br><br>|0.51|12,341| |5<br><br>|Wood and Products of Wood and Cork<br><br>|5.64|0.87<br><br>|12,183| |6<br><br>|Paper and Paper Products Printing and Reproduction of Recorded Media|4.65|1.49<br><br>|12,300| |7|Coke, Refined Petroleum and Nuclear Fuel<br><br>|13.38|1.94<br><br>|9,538| |8|Chemicals and Chemical Products Basic Pharmaceutical Products<br><br>|2.36|0.91<br><br>|12,300| |9<br><br>10<br><br>11<br><br><br>|Rubber and Plastics Other Non-Metallic Mineral Basic Metals Fabricated Metal Products|1.51|0.89|12,341<br><br>| |12<br><br>|Computer, Electronic and Optical Products Electrical Equipment<br><br>|4.07|1.02|12,341| |13<br><br>|Machinery and Equipment n.e.c<br><br>|5.65|1.34|12,341| |14<br><br>|Motor Vehicles, Trailers and Semi-Trailers Other Transport Equipment|2.70<br><br>|0.45|12,341<br><br>| |15<br><br>|Furniture; other Manufacturing<br><br>|2.04<br><br>|0.59<br><br>|12,341| |16|All Service-Related Industries (WIOD Industry No. 23-56)<br><br>|3.80<br><br>|0.84<br><br>|12,341|

Note: This table estimates the industry-level trade elasticities using the Caliendo and Parro (2014) methodology. The original WIOD industry classification features 56 industries, which we aggregate into 16 industrial categories.

###### 6.3 The Consequences of Unilateralism

Before analyzing the value of trade agreements, we discuss the consequences of unilateralism. For each country, the computed gains from a unilateral adoption of optimal trade taxes are reported in Table 2.21 The same table also reports the

21Each iteration of our analysis treats one of the countries listed in Table 2 as the Home coun-

externality imposed by theses taxes on the rest of the world—all in terms of a percentage change in real GDP. The first two columns in Table 2 report the gains implied by the baseline gravity model that overlooks global value chains. The second two columns report the gains implied by our main model that accounts for global value chains. In both cases, we separately report the consequences of first-best unilateral export taxes and second-best unilateral import taxes.

The numbers reported in Table 2 indicate that for a non-cooperative country first-best export taxes deliver welfare gains that are (on average) 52% larger than second-best import taxes. On the flip side, non-cooperative export taxes also inflict a greater negative externality on the rest of the world. This observation has basic implications for the weight of terms-of-trade versus political economy considerations in the governments' objective function. It suggests that, absent political motives, governments should be applying export taxes more vigorously as they deliver sizable terms-of-trade gains and are not prohibited by the WTO. The fact that export taxes are underexploited implies that export-oriented industries have a greater-than-previously-estimated political weight in the governments objective function. To put this finding in perspective, note that inferring the government's objective function from only import tax measures implies a greater weight on terms-oftrade considerations (see Goldberg and Maggi (1999)).22

Second-best non-cooperative import taxes are only modestly more effective at mimicking export taxes in the presence of global value chains. As argued earlier, this outcome is an artifact of an innate tension between two welfare considerations: (a) to maintain allocative efficiency in the domestic economy, non-cooperative governments should not tax highly-differentiated intermediate inputs, but (ii) to exploit export market power with second-best import taxes, governments should levy taxes on highly-differentiated intermediate imports. This tension renders import taxes as an ineffective second-best substitute for export taxes.

Global value chains multiply the gains from non-cooperative taxation and the externality of such policies on the rest of the world by more than 100%. Intuitively,

try and aggregates the remaining countries into one Foreign economy. Alternatively, we can use the multilateral formulas presented in Section 4.2 to avoid aggregating other countries into one Foreign economy. Doing so delivers welfare gains that are practically indistinguishable from our benchmark results.

22To elaborate, Goldberg and Maggi (1999) study the cross-industry variation in import taxes and find that "the weight of welfare in the government's objective function is many times larger than the weight of contributions."

in the presence of global value chains, the excess burden per dollar of tax revenue falls relatively more on the rest of the world. Consider, for instance, an export tax on rare earth minerals that are used as an input in smartphone production abroad. A fraction of this export tax is borne on transactions involving foreign producers and foreign consumers of smartphones. So, through the global input-output network, the tax-imposing country can effectively raise revenue on transactions occurring outside its jurisdiction. In other words, global value chains present governments with extra-territorial taxing power.

Table 2: The Welfare Consequences of Unilateralism (% change in real GDP)

Baseline Model Model with IO Networks

Export taxation Import taxation Export taxation Import taxation Country %∆Wi %∆W−i %∆Wi %∆W−i %∆Wi %∆W−i %∆Wi %∆W−i AUS 1.7% -0.1% 1.1% -0.0% 3.3% -0.1% 2.2% -0.1%

  • EU 0.6% -0.3% 0.3% -0.2% 1.3% -0.8% 1.0% -0.6% BRA 1.2% -0.1% 0.8% -0.0% 2.1% -0.1% 1.4% -0.1%
  • CAN 1.9% -0.1% 0.9% -0.0% 3.6% -0.2% 2.3% -0.1%
  • CHE 1.4% -0.0% 0.7% -0.0% 3.0% -0.1% 2.2% -0.0% CHN 0.5% -0.3% 0.4% -0.2% 1.6% -0.5% 1.4% -0.4% IDN 1.9% -0.0% 1.3% -0.0% 3.7% -0.1% 2.3% -0.0% IND 0.7% -0.0% 0.3% -0.0% 1.6% -0.1% 0.9% -0.1% JPN 0.7% -0.1% 0.4% -0.0% 1.4% -0.2% 1.1% -0.1% KOR 1.2% -0.1% 0.8% -0.0% 3.0% -0.1% 2.4% -0.1% MEX 1.7% -0.0% 1.1% -0.0% 3.0% -0.1% 0.9% -0.0% NOR 3.2% -0.0% 2.4% -0.0% 5.2% -0.1% 3.3% -0.0% RUS 1.6% -0.1% 0.7% -0.0% 3.2% -0.2% 1.9% -0.1%
  • TUR 1.6% -0.0% 1.0% -0.0% 3.4% -0.1% 1.8% -0.0% TWN 1.6% -0.0% 1.0% -0.0% 3.8% -0.1% 3.1% -0.0%
  • USA 0.5% -0.2% 0.2% -0.1% 0.9% -0.5% 0.6% -0.4%
  • Average 1.39% -0.09% 0.83% -0.06% 2.75% -0.19% 1.80% -0.14%

Note: This paper reports the change in a country's welfare from applying its optimal trade taxes. The baseline model is the perfectly competitive multi-industry gravity model 'a la Costinot et al. (2011). The model with IO networks is the perfectly competitive multi-industry gravity model with input-output networks 'a la Caliendo and Parro (2014).The gains from unilateral export taxation are solved using an amended version of Propositions 1 and 2, where one country acts noncooperatively and the rest of the world retains their applied tax rates. The gains from (second-best) import taxes concern a scenario where export taxes are inapplicable.

###### 6.3.1 Have Global value chains Reduced Protectionist Motives?

Global value chains influence the consequences of unilateralism (a) by changing the non-cooperative tax level in each country, and (b) by multiplying welfare consequences conditional on the chosen tax level. The latter channel is relatively well understood and echos the results from the standard gains from trade literature. The former channel is new and merits further elaboration. In fact, as noted in the introduction, some experts argue that existing trade agreements are obsolete because countries that are immersed in global value chains have little (terms-oftrade-driven) incentive to raise trade restrictions.

Figure 1 illustrates the optimal non-cooperative trade tax schedule for the European Union (E.U.). These are the non-cooperative E.U. taxes given the vector of applied taxes by trading partners. As implied by Theorem 1, the optimal export tax is lower once we account for input-output networks. The intuition is that export taxes cascade through the input-output network, and a fraction of the tax is ultimately passed back to domestic consumers. The optimal export tax is set lower to mitigate these adverse cascading effects. This reduction is especially meaningful in more upstream industries like Metal, Mining, and Services.

Second-best non-cooperative import taxes are uniform if we do not account for input-output networks. Otherwise, they vary across industries in a way that partially resembles the optimal first-best export taxes. In particular, the optimal import tax is higher on low trade elasticity industries. But, unlike first-best export taxes, it is also relatively higher on upstream industries. The optimal import tax is, for example, higher on Mining and Service industries that are both relatively more upstream and have a relatively low trade elasticity. The Petroleum industry, in comparison, is relatively upstream but exhibits the highest trade elasticity of all industries. As a result, the second-best import tax on the Petroleum industry is lower than others.23

23The fact that second-best import taxes vary modestly across industries reflects a tension between misallocation-worsening and terms-of-trade-improving effects. To explain this tension again, consider the extreme case where all imported intermediates are re-exported without processing in the domestic economy. In that case, the second-best import tax schedule converges to the first-best export tax schedule. Because, in this extreme case, neither tax instrument distorts input choices in the domestic economy. This is far from what happens in reality, though—as is clearly manifested in Figure 1. In practice, industries like Chemicals and Mining that should be targeted with secondbest import taxes are both high-value-added and more upstream. So, disrupting them with taxes will be quite detrimental to allocative efficiency.

Figure 1: The European Union's Unilaterally Optimal Trade Taxes

Note: This graph plots the optimal trade tax for the E.U. economy against the rest of the world. Industries are ordered by the trade elasticity estimates reported in Table 1. The left panel reports the first-best export taxes, normalizing the redundant import taxes to zero. The hollow squares correspond to the optimal export taxes predicted by the multi-industry competitive gravity model without input-output networks. Second best import taxes correspond to a scenario where export taxes are prohibited.

Importantly, these patterns can reconcile the terms-of-trade thesis with evidence that governments assign a low political weight to protecting upstream industries because consumers of intermediate inputs are more organized (see Shapiro (2019)). To see this, note that applied tariffs -in the absence of external restrictions- convey information about the sum of political economy and terms-oftrade considerations (Grossman and Helpman (1992)). Correspondingly, the political weight of industry-level protection can be inferred as the wedge between the socially optimal tariff and the applied tariff rate in each industry. A high political weight, under this approach, indicates more lobbying for protective tariffs in that given industry.

Suppose we infer political weights from optimal tariffs that do not internalize global value chains. In that case, we will find no systematic relationship between the level of upstreamness and political weights, which defies the conventional wisdom.24 However, if we apply our theory that implies a higher optimal tariff on upstream industries, we will infer a low political weight for protection in these

24A notable example undertaking this approach is Ossa (2014). He infers political weights

industries, which aligns with the aforementioned evidence in Shapiro (2019).

Above all, Figure 1 indicates that the globalization of value chains has not significantly reduced the governments' incentive to raise trade restrictions. This outcome is driven by all economies, including the E.U., being effectively small economies in relation to the rest of the world. In that case, E.U. taxes on intermediate inputs have a small (albeit non-trivial) effect on prices in the rest of the world. Consequently, the optimal non-cooperative taxes implied with and without global value chains are rather similar and resemble the optimal tax of a small open economy supplying differentiated products.

###### 6.4 The Value of Trade Agreements

As a final step, we quantify the value of existing trade agreements for the global economy. Recall that the value of these agreements is driven by their ability to remedy non-political trade restrictions that ensue in the non-cooperative Nash equilibrium. The extent of these counterfactual restrictions can be measured using the system of equations specified under Propositions 1 and 2.25

Table 3 reports the value of existing trade agreements for each country, and contrasts it to the value implied by the baseline model that overlooks global value chains. These numbers are reported under two different scenarios: First, a scenario where countries can freely raise all trade tax instruments, in which case agreements remedy the first-best non-cooperative export restrictions. Second, a scenario where countries cannot raise export taxes due to institutional barriers like those in the U.S. constitution. In this latter case, agreements remedy the second-best noncooperative import restrictions.

To put the estimated gains from trade agreements in perspective, the last two columns of Table 3 report the overall gains from trade relative to autarky. These gains are calculated using the sufficient statistics formulas presented by Costinot and Rodríguez-Clare (2014).

using optimal tariffs implied by the multi-industry Krugman (1980) model. Optimal tariffs in the aforementioned model are determined primarily by the industry-level trade elasticities (see Lashkaripour and Lugovskyy (2019)) . Accordingly, Ossa (2014) estimates political weights that are strongly increasing in an industry's trade elasticity but are less dependent on an industry's level of upstreamness.

25See Appendix E for how the system specified by Propositions 1 and 2 extends to an environment with arbitrarily many countries.

Table 3: The Gains from Trade Agreements (% change in real GDP) Gains from Remedying Export Restrictions

Overall Gains From Trade Country Baseline IO Networks Baseline IO Networks Baseline IO Networks AUS 0.4% 0.8% 0.5% 0.9% 5.9% 7.7%

Gains from Remedying Import Restrictions

  • EU 1.0% 2.1% 1.2% 1.2% 4.1% 5.8% BRA 0.2% 0.5% 0.4% 0.4% 3.3% 4.4%
  • CAN 2.7% 4.6% 2.1% 2.8% 11.5% 14.5%
  • CHE 2.9% 5.4% 2.7% 3.6% 10.2% 14.1% CHN 0.8% 2.7% 1.0% 1.3% 3.3% 5.2% IDN 0.4% 1.3% 0.9% 0.9% 6.4% 8.8% IND 3.3% 4.5% 2.5% 1.9% 8.8% 11.8% JPN 3.2% 5.0% 2.3% 1.9% 10.0% 12.9% KOR 5.5% 9.4% 3.4% 3.9% 18.0% 23.9% MEX 1.9% 3.1% 1.8% 2.3% 19.9% 25.0% NOR -0.5% 0.3% -0.3% 0.9% 8.3% 11.5% RUS 1.2% 2.5% 1.5% 2.1% 7.6% 10.1%
  • TUR 2.2% 3.6% 2.3% 2.3% 10.1% 13.4% TWN 6.1% 11.3% 4.8% 5.7% 22.3% 29.2%
  • USA 1.1% 1.9% 1.2% 1.1% 3.8% 4.8%
  • Average 2.00% 3.69% 1.75% 2.07% 9.60% 12.70%

Note: This paper reports the change in a country's welfare from applying its optimal trade taxes. The baseline model is the perfectly competitive multi-industry gravity model 'a la Costinot et al. (2011). The model with IO networks is the perfectly competitive multi-industry gravity model with input-output networks 'a la Caliendo and Parro (2014). The value of trade agreements at remedying export taxation are solved using the systems specified by Propositions 1 and 2. The gains from remedying import taxation concern a scenario where non-cooperative export taxes are inapplicable.

The value of trade agreements, as far as remedying unilateral export restriction is concerned, is around 3.7% of the real GDP for the average country. This amounts to 30% of the overall gains from trade. In other words, if existing trade agreements disintegrate, the average country will lose close to a third of the underlying gains from trade. To put a dollar value on these effects, trade agreements add $2.8 trillion to global GDP, which is the equivalent of France's GDP.26

The above numbers can be viewed as an upper bound on the value of trade agreements. As note earlier, governments may be obliged to erect only im-

26This figure is calculated using data on real GDP from the Penn World Table.

port restriction if existing agreements dissolve. In that case, by remedying noncooperative import restrictions, existing trade agreements add 2.1% to the average country's real GDP (see fourth column in Table 3). In that case, trade agreements contribute a total of $1.6 trillion to global GDP, which can be viewed as a lower bound on the value of existing trade agreements.

Around 46% of the estimated value of trade agreements is driven by their promotion of global value chains. This number can be inferred by comparing the value implied by the baseline model that overlooks global value chains and the main model that accommodates them. In other words, existing estimates that overlook global value chains may understate the value of trade agreements by around 46%. More importantly, these numbers suggest that the rise of global value chains has rendered trade agreements more valuable than ever. This is a crucial observation in a time when the tides have turned against free trade agreements like the WTO.

7 Concluding Remarks

Our understanding of global trade has improved drastically over the past two decades; thanks, in part, to the emergence of tractable quantitative trade models that admit a rich geography of trade costs and global value chains. Surprisingly, these canonical models have had little impact on how we think about trade policy and trade agreements. To address this gap, we presented a full analytical characterization of optimal non-cooperative trade policy in the aforementioned class of quantitative trade models. We demonstrated that the globalization of value chains have barely reduced the terms-of-trade-driven incentives to restrict trade; but it has increased the cost of such restrictions quite remarkably. Applying our theory to data, we measured that a disintegration of existing trade agreements will erase 30% of the total gains from trade.

Our theory presents researchers with a new benchmark to study the political economy of trade taxation in the age of global value chains. The current practice in the literature is to infer political economy motives by contrasting the applied trade taxes with their socially optimal level. Under this approach, our general theory of optimal trade policy will enable researchers to credibly estimate the political weight of protecting upstream versus downstream industries. Likewise, our theory can shed fresh light on the relative importance of terms-of-trade versus polit-

ical economy considerations in the conduct of trade policy. Previous estimates that focus only on import taxation indicate that political considerations are dwarfed by terms-of-trade considerations. Our theory suggests that accounting for export taxes may revise this prediction.

On a broader level, our findings highlight an important dilemma facing policymakers in the current political climate. On one hand, critics argue that trade agreements infringe on national sovereignty, and limit governments' ability to tackle domestic distortions, inequality, and environmental issues. On the other hand, we find that dissolving existing trade agreements will result in extensive losses from disrupting global value chains. Future research can perhaps resolve this dilemma, at the core of which lies a basic question: Can existing agreements be reformed to maintain international cooperation in trade policies but with less infringement on national sovereignty?

References

Alvarez, F. and R. E. Lucas (2007). General equilibrium analysis of the eaton–kortum model of international trade. Journal of monetary Economics 54(6), 1726–1768.

Amiti, M., S. J. Redding, and D. E. Weinstein (2019). The impact of the 2018 tariffs on prices and welfare. Journal of Economic Perspectives 33(4), 187–210.

Anderson, J. E. and Y. V. Yotov (2016). Terms of trade and global efficiency effects of free trade agreements, 1990–2002. Journal of International Economics 99, 279–298.

Bagwell, K. and R. W. Staiger (1999). An economic theory of gatt. American Economic Review 89(1), 215–248.

Bagwell, K., R. W. Staiger, and A. Yurukoglu (2018). Quantitative analysis of multi-party tariff negotiations. Technical report, National Bureau of Economic Research.

Baier, S. L. and J. H. Bergstrand (2007). Do free trade agreements actually increase members' international trade? Journal of international Economics 71(1), 72–95.

Baqaee, D. and E. Farhi (2019). Networks, barriers, and trade. Technical report, National Bureau of Economic Research.

Blanchard, E. J., C. P. Bown, and R. C. Johnson (2016). Global supply chains and trade policy. Technical report, National Bureau of Economic Research.

Bown, C., P. Conconi, A. Erbahar, and L. Trimarchi (2020). Trade protection along supply chains.

Caliendo, L., R. C. Feenstra, J. Romalis, and A. M. Taylor (2015). Tariff reductions, entry, and welfare: Theory and evidence for the last two decades. Technical report, National Bureau of Economic Research.

Caliendo, L. and F. Parro (2014). Estimates of the trade and welfare effects of nafta. The Review of Economic Studies, rdu035.

Carballo, J., K. Handley, and N. Limão (2018). Economic and policy uncertainty: Export dynamics and the value of agreements. Technical report, National Bureau of Economic Research.

Costinot, A., D. Donaldson, and I. Komunjer (2011). What goods do countries trade? a quantitative exploration of ricardo's ideas. The Review of economic studies, rdr033.

Costinot, A., D. Donaldson, J. Vogel, and I. Werning (2015). Comparative advantage and optimal trade policy. The Quarterly Journal of Economics 130(2), 659–702.

Costinot, A. and A. Rodríguez-Clare (2014). Trade theory with numbers: Quantifying the consequences of globalization. In Handbook of international economics, Volume 4, pp. 197–

  1. Elsevier. Costinot, A. and I. Werning (2019). Lerner symmetry: A modern treatment. American Economic Review: Insights 1(1), 13–26. Das, S. P. (1983). Optimum tariffs on final and intermediate goods. International Economic Review, 493–508. Dekle, R., J. Eaton, and S. Kortum (2007). Unbalanced trade. American Economic Review 97(2), 351–355. Egger, P., M. Larch, K. E. Staub, and R. Winkelmann (2011). The trade effects of endogenous preferential trade agreements. American Economic Journal: Economic Policy 3(3), 113–43. Fajgelbaum, P. D., P. K. Goldberg, P. J. Kennedy, and A. K. Khandelwal (2020). The return to protectionism. The Quarterly Journal of Economics 135(1), 1–55. Goldberg, P. K. and G. Maggi (1999). Protection for sale: An empirical investigation. American Economic Review 89(5), 1135–1155. Grant, M. (2019). Why special economic zones? using trade policy to discriminate across importers.

Gros, D. (1987). A note on the optimal tariff, retaliation and the welfare loss from tariff wars in a framework with intra-industry trade. Journal of international Economics 23(3), 357–367.

Grossman, G. M. and E. Helpman (1992). Protection for sale. Technical report, National Bureau of Economic Research.

Grossman, G. M. and E. Helpman (1995). Trade wars and trade talks. Journal of political

Economy 103(4), 675–708. Irwin, D. A. (2020). Free trade under fire. Princeton University Press. Johnson, R. C. (2018). Measuring global value chains. Annual Review of Economics 10, 207–

  1. Krugman, P. (1980). Scale economies, product differentiation, and the pattern of trade. The American Economic Review 70(5), 950–959.

Kucheryavyy, K., G. Lyn, and A. Rodríguez-Clare (2016). Grounded by gravity: A wellbehaved trade model with industry-level economies of scale. Technical report, National Bureau of Economic Research.

Lashkaripour, A. and V. Lugovskyy (2019). Profits, scale economies, and the gains from trade and industrial policy.

Limão, N. (2016). Preferential trade agreements. In Handbook of commercial policy, Volume 1, pp. 279–367. Elsevier.

Maggi, G. and A. Rodriguez-Clare (1998). The value of trade agreements in the presence of political pressures. Journal of Political Economy 106(3), 574–601.

Opp, M. M. (2010). Tariff wars in the ricardian model with a continuum of goods. Journal of International Economics 80(2), 212–225.

Ossa, R. (2014). Trade wars and trade talks with data. The American Economic Review 104(12), 4104–4146.

Ossa, R. (2016). Quantitative models of commercial policy. Handbook of Commercial Policy 1, 207–259.

Piketty, T. and E. Saez (2013). Optimal labor income taxation. In Handbook of public economics, Volume 5, pp. 391–474. Elsevier.

Powell, A. A. and F. Gruen (1968). The constant elasticity of transformation production frontier and linear supply system. International Economic Review 9(3), 315–328.

Shapiro, J. S. (2019). The environmental bias of trade policy. UC Berkeley and NBER (Ed.), 1–76.

Su, C.-L. and K. L. Judd (2012). Constrained optimization approaches to estimation of structural models. Econometrica 80(5), 2213–2230.

Suzuki, K. (1978). The welfare effects of an export tax levied on an intermediate good. The Quarterly Journal of Economics, 55–69.

Timmer, M., A. A. Erumban, R. Gouma, B. Los, U. Temurshoev, G. J. de Vries, I.-a. Arto, V. A. A. Genty, F. Neuwahl, J. Francois, et al. (2012). The world input-output database (wiod): contents, sources and methods. Technical report, Institue for International and Development Economics.

Online Appendix

A Proof of Lemma 1

The proof of Lemma 1 follows from two intermediate claims, labeled C1and C2. To present these claims, note that A = (1 + t,1 + x, wh, wf) and A′ = (a(1 + t), (1 + x)/a, awh, wf) . Accordingly, all equilibrium variables corresponding to the wage-tax combination A′ will be denoted by the prime notation. Claim C1 is based on the demand function Dji,k (p ̃i,Yi) being homogeneous of degree zero. Specifically,

 

###### p ̃′

h = ap ̃h, Yh′ = aYh p ̃′

###### =⇒ q′

ji,k = qji,k ∀ji, k. (C1)

f = p ̃ f, Yf′ = aYf

where recall that p ̃i ≡ {p ̃ji,k}j,k denotes the vector of final prices in country i. Claim C2 can be stated as follows

 

###### p ̃′

h = ap ̃h, Yh′ = aYh p ̃′

q′

ji,k = qji,k ∀ji, k =⇒

. (C2)

f = p ̃ f, Yf′ = aYf

The claim that p ̃′

h = ap ̃h and p ̃′

f = p ̃ f follow trivially from the price equation:

p ̃ji,k = (1+xji,k)(1+tji,k)Cj,k(wj,p ̃j;∑

τjn,kqjn,k),

n

given that Ci,k is homogeneous of degree 1 w.r.t. wj and p ̃ j. That Yf′ = w′f Lf = wf Lf = Yf, holds by construction. The claim that Yh′ = aYh, meanwhile, can be shown along the following steps:

h +∑

Y′

h = w′

hL′

k

= awhLh +∑

k

awhLh +∑

k

1 −

= awiLi +a∑

k

xk′ 1 + xk′

t′k 1 + t′k

p ̃′

fh,kq′

p ̃′

hf,kq′

fh,k +

hf,k

1 1 + xk′

a 1 + t′k

p ̃′fh,kqfh,k + a −

p ̃hf,kqhf,k

1 −

1 (1 + xk)/a

a a(1 + tk)

ap ̃fh,kqfh,k + a −

p ̃hf,kqhf,k

tk 1 + tk

xk 1 + xk

p ̃fh,kqfh,k +

p ̃hf,kqhf,k = a (whLh + Rh) ,

where the second line follows from the balanced trade condition, i.e., ∑k p ̃fh1,+kqtkfh,k − p ̃hf,kqhf,k = 0. Together, Claims C1 and C2 establish Lemma 1.

B Proof of Theorem 1

Note that by definition 1 + tk ≡ p ̃fh,k/pfh,k and 1 + xk ≡ p ̃hf,k/phf,k. Hence, the optimal policy problem can be reformulated as a problem of directly choosing consumer prices,

p ̃fh,k, and, p ̃hf,k, rather than taxes, tk, and, xk. In that case, the optimal policy problem can alternatively be expressed as

Wh (p ̃; w) ,

max

(p ̃;w)∈A ̃

where A ̃ denotes the set of feasible wage-price combinations that is defined analogous to D2. Note that in the presence of input-output linkages, equilibrium is conventionally defined in terms three sets of equilibrium outcomes: (a) wages, w, (b) net income Y, and (c) gross revenue, Y. However, combining equilibrium conditions (iv) and (v) in D2, we can uniquely solve Y ≡ Y(p ̃; w) and Y ≡ Y(p ̃; w) in terms of prices and wages. Considering this we can uniquely solve all equilibrium variables, including welfare, in terms of prices and wages.

To handle the complex nature of the above problem, we use two technical tricks. First,

we assume that the government can choose p ̃hh in addition to p ̃ fh and p ̃hf. In that case, Home's tax revenues include revenue's from domestic taxation, which we denote by

RhD ≡ ∑

(p ̃hh,k − phh,k)qhh,k.

k

Including the aforementioned instrument is innocuous since (i) markets are perfectly competitive, and (ii) governments have not political economy motives for intra-national redistribution. As a result, there is no rationale for domestic taxation (i.e., p ̃∗hh,k = phh,k) and the problem with extended instruments becomes isomorphic to our main problem of interest. The second trick that greatly simplifies our analysis is the observation that dWh(p ̃, w)/dp ̃f f,k = 0. In light of the above result, we are simply solving a problem where the government can directly choose all price variables that matter for Home's welfare. So, we should not track how a change in, say price variable p ̃fh,k, affects any other consumer price variables.

Before proceeding to the first step of the proof, let us alert the readers of a slight change in notation, which is meant to streamline the presentation: We, hereafter, use p ̃hf,k ≡ p ̃hf,k/(1 + thf,k) to denote the consumer price of good hf, k net of Foreign's im-

port taxes, and pfh,k ≡ (1+ xfh,k)pfh,k to denote the producer price of good fh, k inclusive of Foreign's export taxes.

Step 1: Characterizing the F.O.C. w.r.t. Import Taxes. The F.O.C. with respect to sector k's tariff can be expressed as

dWh (p ̃; w) dln p ̃fh,k

∂Vh(.) ∂Yh

=

∂Yh ∂ ln p ̃fh,k

dwh dln p ̃fh,k

∂Yh ∂wh

+

∂Vh(.) ∂p ̃fh,k

+

It should be noted upfront that the difference between the present setup and the pure Ricardian case, is that the (conditional) tariff pass-through σjhfh,g,k = ∂ ln p ̃jh,g/∂ ln (1 + tk) can be non-zero (even if g k) due to (i) the upward sloping supply curve in industry g, plus (ii) the cross-substitutability between industry k and industry g goods. Plugging Yh = whLh + Πh + RhX + RhM + RhD, into the F.O.C. yields the following:

dWh (p ̃; w) dln p ̃fh,k

∂Vh ∂Yh

=

∂Πh ∂ ln p ̃fh,k

∂RhX ∂ ln p ̃fh,k

+

∂RhM ∂ ln p ̃fh,k

+

∂Vh/∂p ̃fh,k ∂Vh/∂Yh

+

∂Vh/∂wh ∂Vh/∂Yh

dwh dln p ̃fh,k

+

###### = 0,

∂RhD ∂ ln p ̃fh,k

+

where ∂Vh/∂wh = ∂Yh/∂wh + ∑g ∂Vh/∂p ̃jh,g ∂p ̃jh,g/∂wh . The above F.O.C. is characterized by six different elements that can be characterized as follows. First, The effect of

tariffs on producer surplus, ∂Πh/∂ ln p ̃fh,k, can be expressed as

∂Πh (p ̃; w) ∂ ln p ̃fh,k

##### = ∑

##### ∑

g

i=h,f

##### = ∑

##### ∑

g

i=h,f

∂Πh ∂ ln phi,g

phi,gqhi,g

∂ ln phi,g ∂ ln p ̃Ifh,k

∂ ln phi,g ∂ ln p ̃Ifh,k

+

∂ ln qhh,g ∂ ln p ̃fh,k

∂phi,g ∂ ln qhh,g

∂Πh ∂ ln p ̃Ifh,k

+

+

∂ ln qhh,g ∂ ln p ̃fh,k − pfh,kqI

∂phi,g ∂ ln qhh,g

fh,k,

where the second line follows from Hotelling's lemma that ∂Πhi,g/∂phi,g = qhi,g and ∂Πh/∂p ̃Ifh,k = qIfh,k. Second, noting that dWh/dp ̃f f,g = 0, the effect of import taxes on

import tax revenues, RhM ≡ ∑g(p ̃fh,g − pfh,g)qfh,g, can be expressed as

∂RM (p ̃; w) ∂ ln p ̃fh,k

∂lnp ̃fh,k ∑

(p ̃fh,g − pfh,g)qfh,g

=

g

= p ̃fh,qqfh,k +∑

g

∂ ln qfh,g ∂ ln p ̃fh,k − pfh,gqfh,g

∂ ln pfh,g ∂ ln qfh,g

∂ ln qfh,g ∂ ln p ̃fh,k

(p ̃fh,g − pfh,g)qfh,g

,

where ∂ ln qfh,g/∂ ln p ̃fh,k accounts for the overall effect p ̃fh,k on final and intermediate input demand. More specifically, letting D ̃ fh,gs(.) denote the demand for intermediate input variety fh, g from industry s,

qCfh,g qfh,g

∂ ln qfh,g ∂ ln p ̃fh,k

=

qIfh,g qfh,g

+

∂ ln Dfh,g(.) ∂ ln p ̃fh,k

∂ ln Dfh,g(.) ∂ lnYh

dlnYh dln p ̃fh,k

+

∂ ln D ̃ fh,g(.) ∂ ln p ̃fh,k

∂ ln D ̃ fh,gs(.) ∂ ln Yh,s

dln Yh,s dln p ̃fh,k

##### +∑

αh,gs

s

dln Eh,g dln p ̃fh,k

= εfh,g + ηfh,g

where, αh,gs ≡ qIfh,gs/qIfh,g, and Ei,g ≡ ωiC,gYi + ∑s ωiI,gsYi,s denotes demand-weighted gross expenditure on final plus intermediate inputs fro industry g, with the last line fol-

D ̃ fh,g(.)

D ̃ fh,g(.)

lowing for the implicit assumption that ∂ln

∂lnYh,k = ∂ln∂DlnfhY,g(.)

and ∂ln

∂lnp ̃fh,k = ∂ln∂lnDp ̃fh,g(.)

fh,k

h

for all s. Third, the effect of import taxes on export tax revenues, RhX ≡ ∑g(p ̃hf,g − phf,g)qhf,g, can be expressed as

∂RhX (p ̃; w) ∂ ln p ̃fh,k

∂lnp ̃fh,k ∑

(p ̃hf,g − phf,g)qhf,g

=

g

##### = ∑

g

∂ ln Ef,g ∂ ln p ̃fh,k − phf,gqhf,g

(p ̃hf,g − phf,g)qhf,gηhf,g

∂ ln phf,g ∂ ln p ̃Ifh,k

∂ ln phf,g ∂ ln qhh,g

∂ ln qhh,g ∂ ln p ̃fh,k

+

.

Fourth, the effect of import taxes on domestic tax revenues, RhD ≡ ∑g(p ̃hh,g − phh,g)qhh,g, can be expressed as

∂RD (p ̃; w) ∂ ln p ̃fh,k

∂lnp ̃fh,k ∑

(p ̃hh,g − phh,g)qfh,g

=

g

#### =−∑

g

phh,gqhh,g

∂ ln phh,g ∂ ln p ̃Ifh,k

∂ ln phh,g ∂ ln qhh,g

∂ ln qhh,g ∂ ln p ̃fh,k

+

where the second line implicitly assumes that by the second welfare theorem, p ̃∗hh,k = phh,k. Fifth, the effect of taxes on the consumer prices can be simplified using Roy's identity, ∂∂VVi/∂p ̃ji,g

i/∂Yi = −qji,g, as follows

∂Vh/∂ ln p ̃fh,k ∂Vh/∂Yh

###### = −p ̃fh,kqC

fh,k.

Finally, the effect of tariffs on wages can be determined by applying the implicit function theorem to the balanced trade condition, Dh (p ̃, w) = ∑g pfh,gqfh,g − p ̃hf,gqhf,g . Doing so, implies ddwp ̃ h

= −∂∂Dp ̃hfh(,.k)/∂D∂wh(.)

. Hence, defining τ ̄ ≡ ∂ ∂wVhh/∂∂VYh

/∂∂Dwh

,

fh,k

h

h

h

∂Vh/∂wh ∂Vh/∂Yh

dwh d ln p ̃fh,k

= −τ ̄ ∑

g

∂ ln qfh,g ∂ ln p ̃fh,k

pfh,gqfh,g

###### +

∂ ln pfh,g ∂ ln qhh,g

∂ ln qhh,g ∂ ln p ̃fh,k − p ̃hf,gqhf,gηhf,g

∂ ln Ef ∂ ln p ̃fh,k

+ pfh,gqfh,g

Combing the above expression as well as noting that ∂Vh/∂Yh > 0 and qji,k = qIji,k + qCji,k, the F.O.C. can be simplified expressed as follows:

##### ∑

g

##### +∑

g

p ̃fh,g pfh,g − (1 + τ ̄) pfh,gqfh,g

∂ ln qfh,g ∂ ln p ̃fh,k − (1 + τ ̄)

phf,g p ̃hf,g

1 − (1 + τ ̄)

p ̃hf,gqhf,gηhf,g

∂ ln Ef,g ∂ ln p ̃fh,k

###### = 0

∂ ln qfh,g ∂ ln p ̃fh,k

∂ ln pfh,g ∂ ln qfh,g

pfh,gqfh,g

Given that 1 + tfh,g ≡ pp ̃fhfh,,gg and 1 + xhf,g ≡ pp ̃hfhf,,gg, we can further simplify the above expression if we divide it by 1 + τ ̄ and ∑g,i pfi,gqfi,g:27

##### ∑

g

1 + tfh,g 1 + τ ̄ − 1 −

∂ ln pfh,g ∂ ln qfh,g

∂ ln qfh,g ∂ ln p ̃fh,k

∂ ln Eh,g ∂ ln p ̃fh,k

r ́fh,g + ∆(xh)

###### = 0

λ ́ hf,gηhf,g . Since (a) Foreign labor is assigned as the numeraire, wf = 1, (b) RXf,k/Ef ≈ 0, and (c) the share of intermediate inputs in Foreign's production is invariant to p ̃fh,k, then ∂ ln Ef,g/∂ ln p ̃fh,k = 0. Finally, we can determine ∂ ln pfh,k/∂ ln qfh,k by applying the Implicit Function theorem to pfh,k = τfh,kCf,k(...; ∑i τfi,kqfi,k), which yields ∂ ln pfh,k/∂ ln qfh,k = γf,krfh,k/ 1 − γf,krf f,kεf f,k .

where ∆(xh) ≡ ∑g 1 − (1+τ ̄)(11+xhf,g)

27Since Foreign does not collect tax revenue, gross expenditure in Foreign, Ef, also equals gross output revenue.

Combining these results reduces the F.O.C. to

∂RXf,k ∂p ̃fh,k ≈ 0

RXf,k Ef

##### ∑

g

1 + tfh,g 1 + τ ̄ − 1 −

γf,krfh,k 1 − γf,krf f,kεf f,k

∂ ln qfh,g ∂ ln p ̃fh,k

r ́fh,g = 0. (12)

Step 2. Characterizing the F.O.C. w.r.t. Export Taxes. The F.O.C. with respect to sector k's export tax can be stated as

dWh (p ̃; w) d ln p ̃hf,k

∂Vh ∂Yh

=

∂Πh ∂ ln p ̃hf,k

∂Vh/∂ ln p ̃hf,k ∂Vh/∂Yh

+

+

∂RhX ∂ ln p ̃hf,k

+

∂RhM ∂ ln p ̃hf,k

+

∂Vh/∂wh ∂Vh/∂Yh

dwh d ln p ̃hf,k

∂RhD ∂ ln p ̃hf,k

+

###### = 0,

In the above expression, ∂V∂h/V∂ln p ̃hf,k

h/∂Yh = 0. The remaining non-zero elements can be expressed as follows. First, The effect of export taxes on producer surplus, ∂Πh/∂p ̃hf,k, can be expressed as

∂Πh (p ̃; w) ∂ ln p ̃hf,k

##### = ∑

##### ∑

g

i=h,f

##### = ∑

##### ∑

g

i=h,f

∂Πhi,g ∂phi,g

∂phi,g ∂ ln p ̃hf,k

∂Πh ∂ ln p ̃Ihf,k

+

∂ ln qhf,g ∂ ln p ̃hf,k

∂phi,g ∂ ln qhf,g

,

phi,gqhi,g

where the last line follows from Hotelling's lemma that ∂Πhi,g/∂phi,g = qhi,k and ∂Πh/∂pIhf,g = 0. Second, the effect of export taxes on import tax revenues, RhM ≡ ∑g(p ̃fh,g − pfh,g)qfh,g, can be expressed as

∂RM (p ̃; w) ∂ ln p ̃hf,k

∂lnp ̃hf,k ∑

(p ̃fh,g − pfh,g)qfh,g

=

g

#### = −∑

g

∂ ln pfh,g ∂ ln p ̃Ihf,k

∂ ln Eh ∂ ln p ̃hf,k

(p ̃fh,g − pfh,g)ηfh,g

+ pfh,gqfh,g

Third, the effect of export taxes on export tax revenues, RhX ≡ ∑g(p ̃hf,g − phf,g)qhf,g, can be expressed as

∂RX (p ̃; w) ∂ ln p ̃hf,k

∂lnp ̃hf,k ∑

(p ̃hf,g − phf,g)qhf,g

=

g

##### =∑

g

∂ ln qhf,g ∂ ln p ̃hf,k − phf,gqhf,g

∂ ln phf,g ∂ ln qhf,g

∂ ln qhf,g ∂ ln p ̃hf,k

p ̃hf,g − phf,g qhf,g

Fourth, the effect of import taxes on domestic tax revenues, RhX ≡ ∑g(p ̃hh,g − phh,g)qhh,g, can be expressed as

∂RD (p ̃; w) ∂ ln p ̃hf,k

∂lnp ̃hf,k ∑

(p ̃hh,g − phh,g)qfh,g

=

g

#### =−∑

g

∂ ln phh,g ∂ ln qhf,g

∂ ln qhf,g ∂ ln p ̃hf,k

phh,gqhh,g

where the second line implicitly assumes that by the second welfare theorem, p ̃∗hh,k = phh,k. Finally, the effect of export taxes on wages can be determined by applying the implicit function theorem to the balanced trade condition, Dh = ∑g pfh,gqfh,g − p ̃hf,gqhf,g. Doing so, implies ddwp ̃ h

= −∂∂p ̃Dhfh,k/∂∂Dwh

. Hence, adopting our earlier definition, τ ̄ ≡

hf,k

h

∂Vh ∂wh/∂∂VYh

/∂∂Dwh

,

h

h

∂ ln pfh,g ∂ ln p ̃Ihf,k

∂Vh/∂wh ∂Vh/∂Yh

∂ ln Eh ∂ ln p ̃hf,k

dwh d ln p ̃hf,k

= −τ ̄ ∑

pfh,gqfh,g

+ ηfh,g

g

∂ ln qhf,g ∂ ln p ̃hf,k

− p ̃hf,kqhf,k −∑

p ̃hf,gqhf,g

g

Combing the above expression as well as noting that ∂Vh/∂Yh > 0 and qji,k = qIji,k + qCji,k, the F.O.C. can be simplified expressed as follows:

phf,g p ̃hf,g

pfh,gqfh,gαhf,,kg +∑

(1+τ ̄)∑

(1 + τ ̄) −

g

g

p ̃fh,g pfh,g − (1 + τ ̄) pfh,gqfh,gηfh,g

∂ ln Eh ∂ ln p ̃hf,k

##### + ∑

,

g

∂ ln qhf,g ∂ ln p ̃hf,k

p ̃hf,gqhf,g

. Given that 1 + tfh,g ≡ pp ̃fhfh,,gg and 1 + xhf,g ≡

= εhfhf,,kg + ηhf,g∂∂lnlnp ̃Ef

where recall that ∂∂lnlnqp ̃hf,g

hf,k

hf,k

p ̃hf,g phf,g, we can further simplify the above expression if we divide it by 1 + τ ̄ and Ef (noting

that pfh,kqfh,k = r ́fh,kEf and p ̃hf,gqhf,g = λ ́ hf,gEf):

λ ́hf,k +∑

g

##### −∑

g

1 (1 + xhf,g)(1 + τ ̄)

λ ́ hf,gεhfhf,,kg

1 −

∂ ln Ef ∂ ln p ̃hf,k

Yf Ef

∂ ln Eh ∂ ln p ̃hf,k

r ́fh,gαhf,,kg

+ ∆h(th)

+∆f(x)

###### = 0,

where ∆h(th) ≡ ∑g 1+1+tfhτ ̄,g − 1 r ́fh,gηfh,g is a uniform term. As noted before, given our choice of numeraire, ∂ ln Ef/∂ ln p ̃hf,k = 0. Given the Lerner symmetry and the multiplicity of the optimal trade tax, there always exists a solution to the above problem where ∆h(t) = 0. Henceforth, we restrict our attention to solving for this particular solution. Once we do that, the remaining solutions can be identified with a basic multiplicative transformation of the import and export tax vectors. yields the following optimality condition:

λ ́hf,k +∑

g

1 (1 + xg)(1 + τ ̄)

1 −

λ ́hf,gεhfhf,,kg −∑

g

r ́fh,gαhf,,kg = 0. (13)

Step 3: Simultaneously Solving the System of F.O.C. As a final step, we simultaneously solve the system of F.O.C.s for all tax instruments:

##### ∑

g

##### ∑

g

∂ ln qfh,g ∂ ln p ̃fh,k

1 + tg 1 + τ ̄ − 1 − γfh,g

rfh,g , ∀k ∈ K

λ ́ hf,g λ ́ hf,k

r ́fh,g λ ́ hf,k

1

##### εhf,g = 1−∑

(1 + xg)(1 + τ ̄) − 1

g

αhffh,,kg, ∀k ∈ K

First, note that by the Lerner symmetry, the value of τ ̄ is redundant. In other words, replacing 1 + τ ̄ with any 1 + t ̄ ∈ R+ identifies an optimal tax schedule. To derive a simplified expression for optimal export taxes, define Ωhf,k ≡ 1 − ∑g λ rfhhf,,gkαhf,,kg and suppose the optimal export tax has the following formulation:

εhf,k εhf,k + ξhf,k + Ωhf,k

###### 1 + x∗

.

hf,k =

Plugging the above expression back into the F.O.C. implies that that ∑g ξhf,g + Ωhf,g

λ ́ hf,gεhfhf,,kg λ ́hf,kεhf,k = Ωhf,k. This equation can be written in matrix notation

as Ξ (ξ + Ω) = Ω, where ξ ≡ [ξhf,k]k and Ω ≡ [Ωhf,k]k are K × 1 vectors and Ξ ≡ [

λ ́ hf,gεhfhf,,kg λ ́hf,kεhf,k ]k,g and IK are K × K matrixes. Inverting the above system implies that

[ξhf,k]k = Ξ−1 − IK Ω. (14)

So, altogether the following tax schedule corresponds to an optimal policy for an arbitrary choice of 1 + t ̄ ∈ R+:

γf,krfh,k 1 − γf,krf f,kεf f,k

###### (1 + t ̄)

1 + t∗

fh,k = 1 +

εhf,k

###### (1 + t ̄)−1,

###### 1 + x∗

hf,k =

1 + εhf,k + ξhf,k − ∑g λr ́ ́fh,g

αhffh,,kg

hf,k

where ξhf,k is given by Equation 14.

C Proof of Theorem 2

The F.O.C. with respect to sector k's tariff can be expressed as

dWh (p ̃; w) dln p ̃fh,k

∂Vh(.) ∂Yh

=

∂Yh ∂ ln p ̃fh,k

dwh dln p ̃fh,k

∂Yh ∂wh

+

∂Vh(.) ∂p ̃fh,k

+

It should be noted upfront that the difference between the present setup and the pure Ricardian case, is that the (conditional) tariff pass-through σjhfh,g,k = ∂ ln p ̃jh,g/∂ ln (1 + tk) can be non-zero (even if g k) due to (i) the upward sloping supply curve in industry g, plus (ii) the cross-substitutability between industry k and industry g goods. Plugging Yh = whLh + RhM, into the F.O.C. yields the following:

dWh (p ̃; w) dln p ̃fh,k

∂Vh ∂Yh

=

∂RhM ∂ ln p ̃fh,k

∂Vh/∂p ̃fh,k ∂Vh/∂Yh

+

∂Vh/∂wh ∂Vh/∂Yh

dwh dln p ̃fh,k

+

###### = 0,

By Roy's identity

∂ ln p ̃Chh,g ∂ ln p ̃Ifh,k

∂Vh/∂ ln p ̃fh,k ∂Vh/∂Yh

fh,k −∑

= −p ̃fh,kqC

p ̃hh,gqC

hh,g

g

hh,gα ̃hf,,gk

fh,k −∑

= −p ̃fh,kqC

p ̃hh,gqC

g

where α ̃if,k,k ≡ ∂ ln p ̃ij,g/∂ ln p ̃Ifh,k denotes the general equilibrium effect of a change in p ̃fh,k on the price industry g's output in country i.28 The K × K matrix for α ̃ ff,,kk can be calculated by applying the Implicit Function Theorem to p ̃fi,k = (1 + tfi,k)τfi,kCf,k(wf, p ̃If f, p ̃Ihf), which implies that

α ̃ ff,,kg

= IK − αff,,kg

k,g

k,g

αhf,,kg

k,g

αhf,,gk

,

k,g

Likewise, α ̃ ff,,kk can be calculated by applying the Implicit Function Theorem to p ̃hi,k = τhi,kCh,k(wf, p ̃Ifh, p ̃Ihh), which yields the following:

α ̃hf,,gk

= IK − αhh,,kg

k,g

k,g

αhf,,gk

.

k,g

With the above definition in mind, the effect of import taxes on import tax revenues, RhM ≡ ∑g(p ̃fh,g − pfh,g)qfh,g, can be expressed as

∂RM (p ̃; w) ∂ ln p ̃fh,k

∂lnp ̃fh,k ∑

(p ̃fh,g − pfh,g)qfh,g

###### =

g

= p ̃fh,qqfh,k +∑

g

(p ̃fh,g − pfh,g)qfh,g

∂ ln qfh,g ∂ ln p ̃fh,k

∂ ln qfh,g ∂ ln p ̃hh,g

α ̃hf,,sk − pfh,gqfh,gα ̃ ff,,ks ,

###### +

where as before (with a slight abuse of notation) ∂∂lnlnqp ̃fh,g

= ∂∂lnlnqp ̃fh,g

+ ηfh,g∂∂lnlnpE ̃h,g

. The income driven term, however, can be dropped as it has a second-order effect, since

ih,k

ih,k

ih,k

28To keep things simple, we are exercising a slight abuse of notation here. A a more elaborate choice of notation would be α ̃ifh,k,k ≡ ∂ ln p ̃ij,g/∂ ln p ̃Ifh,k, which clarifies that α ̃ifh,k,k α ̃if f,k,k.

∂ ln Eh,g ∂lnp ̃ih,k ∝ (RhM/Yh)λih,k, where δh ≈ λih,k ≈ 0.

∂Vh/∂wh ∂Vh/∂Yh

dwh d ln p ̃fh,k

= −τ ̄ ∑

g

pfh,gqfh,g

∂ ln qfh,g ∂ ln p ̃fh,k

∂ ln qfh,g ∂ ln p ̃hh,g

α ̃hf,,sk +

+

∂ ln qhf,g ∂ ln p ̃hf,g

+ pfh,gqfh,gα ̃ ff,,kg − p ̃hf,gqhf,g

α ̃hf,,gk

Finally, we have to plug Equations back into the F.O.C.. To simplify some terms we can use the zero profits condition, which entails that

∂Πh (p ̃; w) ∂ ln p ̃fh,k

##### = ∑

##### ∑

g

i=h,f

##### = ∑

##### ∑

g

i=h,f

∂ ln phi,g ∂ ln p ̃Ifh,k

∂Πh ∂ ln p ̃Ifh,k

∂Πh ∂ ln phi,g

##### +∑

+

g

phi,gqhi,gα ̃hf.,gk − pfh,kqI

fh,k −∑

phh,gqI

g

Combining the above expressions, the F.O.C. can be stated as

∂ ln p ̃Ihh,g ∂ ln p ̃Ifh,k

∂Πh ∂ ln p ̃Ihh,k

hh,gα ̃hf.,gk = 0,

##### ∑

g

##### +∑

g

p ̃fh,g pfh,g − (1 + τ ̄) εfh,g + εhhfh,,ggα ̃hf,,gk − (1 + τ ̄)α ̃ ff,,kg pfh,gqfh,g

phf,gqhf,g (1 + τ ̄)α ̃hf,,gk + τε ̄ hf,gα ̃hf,,gk = 0

where note that p ̃fh,g/pfh,g = 1 + tfh,k. To further simplify the above expression, we can divide by 1 + τ ̄ and gross expenditure in Foreign (which equal gross production revenue

since Foreign collects no tax revenue, i.e., ∑k Ef,k = ∑k Yf,k), which yields

##### ∑

g

1 + tfh,g 1 + τ ̄ − 1 εfh,g + εhhfh,,ggα ̃hf,,gk − α ̃ ff,,kg rfh,g

+ 1 +

τ ̄ 1 + τ ̄

εhf,g α ̃hf,,gk λhf,k = 0. (15)

(Wf/∂wf)

The next step is to characterize τ ̄ ≡ −∂

∂∑g(pfh,gqfh,g−phf,gqhf,g)/∂wf , where the partial derivative denotes the derivative with respect to wf holding p ̃fh,k fixed.

− ∑g pfh,kqfh,g∂∂lnlnpwfh,g

f

τ ̄ =

∑g pfh,gqfh,g∂∂lnlnpwfh,g

− phf,gqhf,g ∂ ∂lnlnqp ̃hf,g

∂ ln pf f,k

∂lnwf + ∂∂lnlnqEhf,g

∂ ln Ef,g ∂ ln wf

f f,g

f,g

f

− ∑g pfh,kqfh,g∂∂lnlnpwfh,g

f

=

∑g pfh,gqfh,g∂∂lnlnpwfh,g

− ∑g phf,gqhf,g εhf,g + ηhf,g δf,g

f

= − ∑g pfh,kqfh,gδf,k

1 1 + ∑g ωhf,gεhf,g

1 1 + ε ̄hf

,

= −

= −

∑g pfh,gqfh,gδf,k + ∑g phf,gqhf,gεhf,kδf,k

where the second line follows from the well-known result in consumer theory that ∑i εhffi,g,g + ηhf,g = 0; ε ̄hf denotes the elasticity of foreign demand for labor as defined under Definition D5; and ωhf,g ≡ δf,kλ ́ hf,k/ ∑k δf,kr ́fh,k is the weight assigned to industry g when calculating the elasticity of labor demand. δf,g, meanwhile, denotes the total contribution of Foreign labor to industry-level output. As discussed in Section 3.1, the vector δf ≡ [δi,k]k cam be calculated as

δf = IK − αf f −1 α ̄ f,

where recall that α ̄ f≡ [α ̄ f,k]k is a K × 1 vector and αf f ≡ [αff,,kg]k,g is a K × K input-output matrix. Also, note that if Home is sufficiently small compared to Foreign (which is often

the case since Foreign is an aggregate of the rest of the world), then δf,k ≈ 1 in which case ωhf,g ≈ λ ́ hf,g/λ ́ hf. Finally, plugging the all the above expressions back into the Equation 15, and inverting the system specified by this equation implies that

###### 1 + t∗

where τ ≡ [τk] is given by

 1k=g +

 

εhhfh,,gg εfh,k

α ̃hf,,gk

τ =

ε ̄hf 1 + ε ̄hf

fh,k =

τk εfh,k

1 +

,

−1

k,g

##### ∑

g∈K

εhf,g ε ̄hf

α ̃ ff,,kg − [1 +

λhf,g rfh,g

α ̃hf,,gk

]

.

k

D The Multiple Country Case

Suppose there are arbitrarily many countries. Then, the effect of country i's import tax on own welfare can be expressed as

dWi(.) dln(1 + tji,k)

∂Wi(.) ∂ ln(1 + tji,k)

=

dln wȷ dln(1 + tji,k)

∂Wi(.) ∂ ln wȷ

##### +∑

.

ȷ j

dln wȷ

It is straightforward to show that ∂∂Wlni(w.)

dln(1+tji,k) ∝ λȷirȷi,kλji,k, and that based on actual trade data, λȷirȷi,k/λiirii,k ≈ 0 for ȷ i. So, treating labor in country j as the numeraire, changes in welfare can be approximated to a first-order as

ȷ

∂Wi(.) ∂ ln(1 + tji,k)

dWi(.) dln(1 + tji,k) ≈

∂Wi(.) ∂ ln wi

dln wi dln(1 + tji,k)

.

+

The same applies to export taxes. Wee can, thus, cast the Country i's optimal policy problem as one that maximizes Wi(p ̃, wi) by choosing {p ̃ij,k} and {p ̃ji,k}subject to feasibility constraints. Following the same steps taken in Appendix B, the F.O.C. corresponding to price p ̃ji,k (i.e., import tax, tji,k) can be expressed as:

##### ∑

##### ∑

g

ȷ

1 + tji,g 1 + τ ̄i − 1 − γȷi,g

∂ ln qȷi,g ∂ ln p ̃ji,k

pȷi,gqȷi,g = 0.

γji,k can be derived by applying the Implicit Function Theorem to pji,k = τji,kCi(..., ∑g ∑ι τjι,gqjι,g), which implies the following:

γj,krji,k 1 − ∑ι i γj,krjι,kεjι,k

.

γji,k =

Likewise, the F.O.C. corresponding price p ̃ij,k (i.e., export tax, xij,k) can be expressed as:

##### ∑

##### ∑

g

ȷ

1 + tȷi,g 1 + τ ̄ − 1 − γji,g

##### ∑

g

1

(1 + xij,g)(1 + τ ̄i) − 1

∂ ln qȷi,g ∂ ln p ̃ji,k

pȷi,gqȷi,g p ̃ij,gqij,g

, ∀k ∈ K

λ ́ ij,g λ ́ ij,k

Λȷi,g Λ ̃ ij,k

α ̃ij,ȷk,g, ∀k ∈ K

εij,g = 1−∑

##### ∑

g

ȷ

where Λȷi,g ≡ pji,gqji,g/ ∑ȷ i ∑k pȷi,kqȷi,k; Λ ̃ ij,g ≡ p ̃ij,gqij,g/ ∑ȷ i ∑k p ̃iȷ,kqiȷ,k; and α ̃ij,ȷk,g = αijαjȷ if ȷ j, with α ̃ijj,k,g = αij. Combining the above expressions implies

γj,krji,k 1 − ∑ι i γj,krjι,kεjι,k

###### (1 + t ̄)

1 + t∗

ji,k = 1 +

εij,k

###### (1 + t ̄)−1.

###### 1 + x∗

ij,k =

1 + εij,k + ξij,k − ∑g ∑ȷ ΛΛ ̃ ȷi,g

α ̃ij,ȷk,g

ij,k

Uniformity of Import Taxes. Here, we establish the uniformity of import taxes (when γi,k = 0) without invoking any first-order approximation. Analogous to the two-country model, welfare in country i can be expressed as Wi = ∂Vi (Yi, p ̃i), where Yi = wiLi + ∑k tji,kpji,kqji,k + xij,kpij,kqij,k . Correspondingly, Wi is uniquely determined by the vector of import and export taxes, ti = {tji,k} and xi = {xij,k}, plus the vector of country-level wages, w = {wj}:

Wi (ti, xi; w) = Vi (Yi (ti, xi; w) , p ̃i (ti, xi; w))

Defining Di (ti, xi; w) = wiLi − ∑k ∑l (pil,kqil,k), the equilibrium vector of aggregate wages, w, solves the following system of equations:

 

D1 (t1, x1; w) = 0

(16)

. DN (tN, xN; w) = 0



Keeping the above observation in mind, we can write the F.O.C. with respect to tji,k as

dWi (ti, xi; w) d 1 + tji,k

∂Vi (Yi, p ̃i) ∂Yi

dwi

∂Yi

∂Yi ∂wi

=

+

∂ 1 + tji,k

d 1 + tji,k

∂p ̃li,g ∂wl

∂p ̃li,g

∂Vi (Yi, p ̃i) ∂p ̃li,g

dwl

##### + ∑

##### ∑

+

∂ 1 + tji,k

d 1 + tji,k

g∈K

l∈C

∂Vi (Yi, p ̃i) /∂p ̃ji,k ∂Vi (Yi, p ̃i) /∂Yi

∂p ̃ji,g

∂Vi (Yi, p ̃i) ∂Yi

∂Yi

##### +∑

=

+

∂ 1 + tji,k

∂ 1 + tji,k

l

∂Wi (.) ∂wl

dwl

d 1 + tji,k

###### = 0

Invoking Roy's identity, ∂∂VVi(Yi,p ̃i)/∂p ̃ji,k

i(Yi,p ̃i)/∂Yi = qji,k, and noting that

∂Yi

= p ̃ji,kqji,k + ∑

##### ∑

∂ ln 1 + tji,k

g∈K

l∈C

∂ lnYi

tli,kpli,kqli,k εjili,,kg + ηli,g

∂ ln 1 + tji,k

Plugging the above equation back into the F.O.C. and defining ∆ljii,,kg ≡ εjili,,kg + ηli,g∂ln∂lnYi

, will yield the following optimality condition

(1+tji,k)

dWi (ti, xi; w) dln 1 + tji,k

∂Vi (Yi, p ̃i) ∂Yi

=

p ̃ji,kqji,k − p ̃ji,kqji,k + ∑

##### ∑

g∈K

l∈C

tli,kpli,kqli,k∆ljii,,kg − ∑

l∈C

dln wl

νil

dln 1 + tji,k

where νil ≡ ∂Wi/∂ ln wl. Applying the implicit function theorem to the System of Equations 16, we can solve for d ln w/d ln1 + ti as follows:

d ln w1

∂ ln D1

dln(1+t1i,k) · · · ∂lnd(1ln+wtNi1 ,k)

∂ln(1+t1i,k) · · · ∂ln∂(1ln+DtNi1 ,k)

−1

∂ ln D ∂ ln w

... .

... .

,

=

.

.

 

 

 

 

dln(1+t1i,k) · · · dlnd(ln1+wtNiN ,k)

d ln wN

∂ln(1+t1i,k) · · · ∂ln∂(ln1+DtNiN,k)

∂ ln DN

Letting τli denotes element li of matrix (∂ ln D/∂ ln w)−1, the above system implies that for every l ∈ C

dln wl

pni,gqni,g∆niji,,kg .

##### = ∑

##### τln ∑

dln 1 + tji,k

g∈K

l∈C

Plugging the above expression back into the F.O.C. implies the following

,

tni,gpni,gqni,g∆niji,,kg − ∑

##### ∑

##### ∑

##### νl ∑

g∈K

n∈C

n∈C

l∈C

The above expression can in turn be rearranged as

pni,gqni,g∆niji,,kg = 0.

##### τln ∑

g∈K

τlnνil pni,gqni,g∆niji,,kg

##### ∑

##### ∑

tni,g − ∑

g∈K

n∈C

l∈C

tni,g − τ ̄ni pni,gqni,g∆niji,,kg = 0,

##### = ∑

##### ∑

g∈K

n∈C

where τ ̄ni ≡ ∑l∈C τlnνil, or, equivalently

###### Ti∆i = 0,

where Ti = tni,g − τ ̄ni ng∈C×K and ∆i = pni,gqni,g∆niji,,kg

are respectively 1 × N ·

ng,jk∈C×K

K and N · K × N · K matrixes. If det∆i 0, then Ti = 0 is the unique solution to the above system, which implies that the optimal import tax is uniform across products originating from the same exporting country:

t∗

ji,k = τ ̄ji, ∀j, k.

E Quantitative Implementation with Many Countries

This appendix extends the quantitative procedure outlined in Section 6 to environments that feature arbitrarily many countries.

Baseline Model without IO Networks. Without input-output network (i.e., α ̃ij,ȷk,g = 0) and with CES-Cobb-Douglas preferences (i.e., εij,k = −1 − εk(1 − λij,k)), Equation 8 implies the an optimal export tax on good ji, k that is equal to

1 εk(1 − λˆ ij,kλij,k)

1 + x∗

,

ij,k = 1 +

where for for each variable, xxˆ denotes the counterfactual value in the non-cooperative equilibriumCombining the above formula with the equilibrium conditions specified in Section 6 yields the following system of equations:

1 + xij∗,k = 1 + ε 1

k(1−λˆij,kλij,k); 1 + t∗ji,k = 1 λˆ ji,k = (1+t

−εk Pˆ ̃iε,kk

∗ ji,k)(1+x∗ji,k)

(1+tji,k)(1+xji,k) wˆj



−εk

(1+t∗ji,k)(1+x∗ji,k) (1+tji,k)(1+xji,k) wˆj

Pˆ ̃i−,kεk = ∑j

λji,k

λ ˆ ij,kλij,k

(1+xij∗,k)(1+tij∗,k)ej,kYˆjYj Y ˆiYi = wˆiwiLi + ∑k ∑j

w ˆiwiLi = ∑k ∑j

t∗ji,k 1+t∗ji,k

λˆ ji,kλji,kei,kYˆiYi + x

ij,k



1+xij∗,k

λˆ ij,kλij,kej,kYˆjYj

.

Solving the above system determines the welfare consequences of dissolving the existing trade agreement: Wˆ i = Yˆi/ ∏k Pˆ ̃i,k.

Main Model with IO Networks. Now, consider the main model with input-output networks. By Equation 8, the optimal export tax on good ji, k can be expressed as

1 + εk(1 − λˆ ij,kλij,k) εk(1 − λˆ ij,kλij,k) − ∑g ∑ι i

###### 1 + x∗

,

ij,k =

λˆ ιj,gλιj,gEˆj,gEj,g

(λˆij,kλij,kEˆj,kEj,k)(1+tιj,g)(1+xιj,g)α ̃ij,ȷk,g

where for for each variable, xxˆ denotes the counterfactual value in the non-cooperative equilibrium; while α ̃ijj,k,g = λˆ ij,kλij,kαj,kg and

α ̃ij,ȷk,g = λ ˆ ij,kλij,kαj,kg k,g λ ˆ jȷ,gλiȷ,gαȷ,gk k,g ,

if ȷ j. Combining the above formula with the equilibrium conditions specified in Appendix 6 yields the following system of equations:

1 + xij∗,k = 1+εk(1−λˆij,kλij,k) εk(1−λˆ ij,kλij,k)−∑g ∑ι i

###### ; 1 + t∗ji,k = 1

λˆ ιj,gλιj,g

Eˆj,gEj,g

α ̃ij,ȷk,g

(λˆij,kλij,k

Eˆj,kEj,k)(1+tιj,g)(1+xιj,g)

−εk Pˆ ̃iε,kk

∗ ji,k)(1+x∗ji,k)

λˆ ji,k = (1+t

(1+tji,k)(1+xji,k)wˆαj ̄i,k ∏g Pˆ ̃iα,gi,gk

−εk

(1+t∗ji,k)(1+x∗ji,k) (1+tji,k)(1+xji,k)wˆαj ̄i,k ∏g Pˆ ̃iα,gi,gk

Pˆ ̃i−,kεk = ∑j

λji,k



.

r ˆji,krji,k = (1λˆ+ji,xkλ∗ji,kEˆi,kEi,k

λ ˆ jn,kλjn,kEˆn,gEn,g (1+x∗jn,k)(1+t∗jn,k) E ˆi,kEi,k = ei,kYˆiYi + ∑g ∑j αi,gk

ji,k)(1+t∗ji,k)/ ∑n

λˆ ij,gλij,gYˆj,gYj,g (1+xij∗,g)(1+tij∗,g) wiwiLi = ∑k ∑j α ̄i,k

λˆ ij,kλij,kYˆj,kYj,k (1+xij∗,k)(1+tij∗,k) YˆiYi = wˆiwiLi + ∑k ∑j

t∗ji,k 1+t∗ji,k

λˆ ji,kλji,kEˆi,kEi,k + x

λˆ ij,kλij,kEˆj,kEj,k

ij,k



1+xij∗,k

Solving the above system determines the welfare consequences of dissolving the existing trade agreement: Wˆ i = Yˆi/ ∏k Pˆ ̃i,k

###### Table 4: List of countries in quantitative analysis

Country name WIOD code Basic aggregation Australia AUS Australia Brazil BRA Brazil Canada CAN Canada China CHN China Indonesia IDN Indonesia India IND India Japan JPN Japan Korea KOR Korea Mexico MEX Mexico Russia RUS Russia Turkey TUR Turkey Taiwan TWN Taiwan United States USA United States Austria AUT

Belgium BEL Bulgaria BGR Cyprus CYP Czech Republic CZE Germany DEU Denmark DNK Spain ESP Finland FIN France FRA United Kingdom GBR Greece GRC Hungary HUN Ireland IRL Italy ITA Netherlands NLD Poland POL Portugal PRT Romania ROM Slovakia SVK Slovenia SVN Sweden SWE Estonia EST Latvia LVA Lithuania LTU Luximburg LUX Malta MLT Rest of the World RoW Rest of the World

European Union