Download Report

Large Firms and Within Firm Occupational
Reallocation
Theodore Papageorgiou
McGill University, Department of Economics y
April 2015
Abstract
This paper considers the equilibrium interaction between within …rm and across
…rm reallocation in the presence of labor market frictions. While a sizable literature has investigated frictional labor markets, it has ignored within …rm mobility.
Nonetheless, every year a sizable fraction of workers switch occupations without
changing …rms. Employees in large …rms can sample from a larger selection and
across …rm mobility is replaced by within …rm mobility. Bringing together within
and across …rm reallocation along with labor market frictions, naturally accounts
for the observed di¤erences in worker ‡ows and wages across …rms of di¤erent sizes.
Keywords: Frictional Labor Markets, Occupational Mobility, Within Firm Occupational Reallocation, Firm Size-Wage Premium, Separation Rates
JEL Classi…cation: E24, J24, J31
I thank Joe Altonji, Dimitris Cheliotis, Jed DeVaro, Mike Elsby, Manolis Galenianos, Ed Green,
William Hawkins, Christian Hellwig, Erik Hurst, Philipp Kircher, Fabian Lange, Ed Lazear, Alex Monge,
Giuseppe Moscarini, Elena Pastorino, Andrés Rodríguez-Clare, Richard Rogerson, Rob Shimer, Russell
Toth, Neil Wallace, as well as numerous seminar participants for useful comments. I am grateful to
Anders Frederiksen for generating results from the IDA dataset.
y
855 Sherbrooke St West, Room 519, Montreal, QC H3A 2T7, Canada.
E-mail:
[email protected]. Web: http://www.theodorepapageorgiou.com
1
Introduction
A key role of labor markets is to assign workers to the task they perform best. A large
literature investigates this assignment process in the presence of labor market frictions.
This literature focuses on moves of workers across …rms,1 but ignores an important empirical phenomenon: every year 8% of workers switch occupations within their …rm.2 These
moves within the …rm allow workers to allocate without being subject to labor market
frictions and thus facilitate the assignment of workers to occupations. This is the …rst
paper to consider how between and within …rm mobility interact in the presence of labor
market frictions.
In the …rst part of the paper, I examine empirically worker reallocation within and
across …rms. Large …rm workers switch occupations more often within their …rm. In
addition, they separate less often, even controlling for wages. Workers with high wages
switch occupations less often within the …rm and also separate from their …rm less often.
I also …nd that within a …rm, occupational ‡ows are largely o¤setting, so that for every
worker going from one occupation to another, there is another worker switching in the
opposite direction.
I also con…rm the well-known size-wage premium, whereby workers in …rms with more
employees earn higher wages. This wage premium is higher for longer tenured workers.
In addition, using matched employer-employee data from Denmark, I examine whether
the wage premium is due to the number of …rm employees per se or the increased number
of occupations of large …rms. I …nd that, controlling for the number of …rm employees,
workers in …rms with more occupations have substantially higher earnings than workers
in …rms with fewer occupations.
In the second part of the paper, motivated by these …ndings, I introduce an equilibrium
model of within …rm and across …rm reallocation. The setup is tractable and I am able
to show analytically that it can replicate the abovementioned facts. It is also amenable
to aggregation and general equilibrium analysis and therefore I am able to characterize
analytically the wage distributions both within the …rm, as well as the entire economy.
In my baseline setup, a …rm constitutes a labor market without search frictions and
workers can costlessly switch occupations within a …rm. In contrast, reallocation across
…rms is subject to frictions. The quality of a match between a worker and an occupation
is uncertain and revealed gradually. Workers and …rms observe output realizations and
1
Examples include Mortensen and Pissarides (1994), Burdett and Mortentsen (1998), Postel-Vinay
and Robin (2002) and Moscarini (2005).
2
1996 Panel of the Survey of Income and Program Participation.
1
update their beliefs in Bayesian fashion. At any point a worker has the option of leaving
his current occupation and going to another one within his …rm or to unemployment.
The optimal behavior of the worker is one of a reservation strategy: if his belief regarding the quality of his match with his current occupation falls below a certain threshold,
the worker moves on to another one; this threshold is increasing in the number of occupations within the …rm that the worker has not tried. Workers in large …rms have more
occupations remaining and are therefore willing to abandon unpromising matches more
easily. In equilibrium, workers in large …rms are better matched.
The model correctly predicts that workers in larger …rms are less likely to separate
from their …rm both because they have more occupations available and also because they
are better matched. Moreover, the …rm size remains important even when conditioning on
wages. In my model, as well as the data, workers in large …rms replace across …rm mobility
with within …rm mobility: conditional on wage, workers in larger …rms are more likely to
switch occupations within their …rm, consistent with the data. In addition, workers with
higher wages are less likely to switch occupations, both within as well as across …rms.
Workers who switch occupations within a …rm see their wages rise. I also prove that the
setup replicates the observed wage premium.
Furthermore, I am able to derive analytically the within-…rm and overall wage distributions. In particular, the shape of the within …rm wage distribution is the same as
that of the overall wage distribution of my economy. This is consistent with empirical
evidence from several countries presented in Lazear and Shaw (2009). Moreover, both
theoretical wage distributions have an empirically accurate shape and can replicate the
fat Pareto-type tail that we observe in the data.
In addition, I discuss a number of extensions, including human capital accumulation,
on-the-job search, search frictions inside the …rm and wage posting. Finally, I calibrate
the model, by targeting moments related to within and across …rm mobility and examine
its predictions regarding the wage premium, as well as other non-targeted moments.
A key assumption of the above setup is that frictions are lower within …rms compared
to across. From the beginning, the search literature has emphasized that search frictions
are a shortcut for the costly acquisition of information regarding both the location of
vacancies and job-seekers, as well as the characteristics of a particular job and worker.3
It is reasonable to expect that both types of information disseminate faster within …rms
than in the market: for instance, a …rm is better informed about the characteristics of its
current workers, rather than other applicants. Similarly, an employed job-seeker is more
3
See for instance Mortensen (1978) or the survey by Pissarides (2000).
2
likely to learn about a job opening in his own …rm rather than another one.
The contribution of the present paper is that it considers the equilibrium interaction
between within …rm and across …rm reallocation in the presence of labor market frictions
and how it a¤ects worker outcomes. Idson (1993) investigates empirically labor turnover
inside the …rm and argues that large …rms provide more …rm-speci…c training to their
workers. Novos (1992) investigates worker turnover and …rm scope in the presence of
learning-by-doing and asymmetric information. Moscarini and Thomsson (2007), using
data from the Current Population Survey, document that a signi…cant fraction of occupational switches do not involve an employer switch. The idea of match uncertainty between
a worker and a …rm dates back to Jovanovic (1979).4;5 The importance of occupations in
labor market outcomes has been emphasized by a sizable literature.6
In a related paper, Pastorino (2014) estimates a model of learning, job assignment
and human capital accumulation within the …rm. She …nds that the impact of learning
is particularly important, especially when taking into account its role in job assignment
within the …rm. See also Kahn and Lange (2014) and Pastorino (2015).
Since this paper …rst appeared, some new papers have followed the approach introduced in the present paper and also started investigating the equilibrium interaction
between within …rm and across …rm reallocation in the presence of labor market frictions:
In ongoing work, Kramarz, Postel-Vinay and Robin (2014) use French data to study the
importance of within-…rm reallocation in the assignment process of workers to occupations in the presence of frictions. Their approach is in many ways complementary to mine:
My setup is very tractable and I am able to obtain results analytically, whereas their approach relies on solving the model numerically, while also allowing for ‡exible worker and
…rm heterogeneity and focusing on the quantitative exercise.7 Their approach builds on
Postel-Vinay and Robin (2002) with commitment on the part of the …rm and Bertrand
4
The present paper is also related to Neal (1999) who argues that workers follow a two-stage search
strategy: …rst they search for a career (occupation), and then shop for jobs within the chosen career. In
his setup however workers cannot search for another occupation/career within an employer.
5
In addition, there’s an extensive empirical literature that documents the observed size-wage premium.
Brown and Medo¤ (1989), Idson and Feaster (1990) and Schmidt and Zimmermann (1991) …nd that the
positive relationship between …rm size and wage holds even conditional on observed and unobserved
labor quality and therefore cannot be explained away by worker selection. Brown and Medo¤ (1989)
also demonstrate that it remains even when considering …rms that o¤er a piece-rate system, therefore
indicating that it is not just the result of wage backloading.
6
Examples include Kambourov and Manovskii (2009), Alvarez and Shimer (2009) and (2011), Eeckhout and Weng, (2010), Antonovics and Golan (2012), Hsieh et al. (2013), Carrillo-Tudela and Visschers
(2014), Gervais, Jaimovich, Siu and Yedid-Levi (2014), Groes, Kircher and Manovskii (2014), Papageorgiou (2014).
7
It is possible to extend the setup introduced here to allow for some ex-ante heterogeneity on the
worker and the …rm side (e.g. assume a certain number of types).
3
competition between employers; my setup instead builds on the matching theory of Jovanovic (1979) and the equilibrium unemployment theory of Mortensen and Pissarides
(1994) with a lack of commitment on the part of the …rm (see also Moscarini, 2005).
While their work is still ongoing, it is ambitious and consistent with the idea put forward
in the present paper that within and across …rm reallocation should no longer be viewed
in isolation. Similarly, Li and Tian (2013) modify the setup introduced in the present
paper to allow for wage posting by the …rms and directed, rather than random search on
the part of the workers.
The next section investigates empirically worker mobility within and across …rms, as
well as how it’s a¤ected by the size of the …rm and wages. I also examine the wage premium
and how it’s a¤ected by the number of occupations of the …rm. Section 3 introduces a
model of within …rm and across …rm reallocation that is consistent with the above facts.
I solve the model, consider several extensions and calibrate it. Section 4 concludes.
2
Empirical Evidence
In this section I empirically examine within …rm and across …rm mobility and how they
depend on …rm size and wages. I focus on occupational matching and use data from the
1996 panel of the Survey of Income and Program Participation (SIPP). The SIPP is a
nationally representative sample of households in the civilian non-institutionalized U.S.
population. Interviews were conducted every four months for four years and included
approximately 36,000 households. It contains information on the worker’s wage, 3-digit
occupation, current employer and employer size. The SIPP reports employer size in three
size bins: fewer than 25 employees (22% of the sample), between 25 and 99 employees
(13% of the sample) and more than 100 employees (65% of the sample). Because the
SIPP reports both an employer identi…er as well as an occupation identi…er, I am able
to distinguish between within and across …rm mobility. The 1996 SIPP uses dependent
interviewing, which is found to reduce occupational coding error (Hill, 1994). Hourly
wages are de‡ated to real 1996 dollars using the Consumer Price Index.
2.1
Within Firm Occupational Reallocation
I begin by investigating whether there is a typical career progression or career-ladder
within …rms when considering occupations. For instance, in the case of 3 occupations, A,
B and C, a typical ladder might involve workers switching from occupation A to B and
4
Occupation i
A
A
C
A
A
I
A
J
I
D
C
C
D
C
Occupation j
D
C
D
G
I
J
B
L
L
G
G
L
I
I
F low(i; j) + F low(j; i)
968
541
330
300
274
262
219
167
164
161
136
133
119
103
jF low(i;j) F low(j;i)j
F low(i;j)+F low(j;i)
0.141
0.117
0.055
0.193
0.183
0.107
0.26
0.054
0.098
0.044
0.132
0.173
0.025
0.01
Table 1: Within Firm Net Reallocation. 1996 Panel of Survey of Income and Program
Participation. 4-month transitions. Occupational groups de…ned as A. Managerial and
Professional Specialty Occupations (003-199); B. Technical Support (203-235); C. Sales
(243-285); D. Administrative Support (303-389); E. Private Household Occupations (403407); F. Protective Service Occupations (413-427); G. Service (433-469); H. Farming
(473-499); I. Precision Production (503-699); J. Machine Operators (703-799); K. Transportation (803-859); L. Handlers (864-889)
then from B to C (from the bottom to the top). We would not expect to see ‡ows from B
to A or from C to B or to A. Put di¤erently gross ‡ows across these occupations should
equal net ‡ows.
To test this prediction of the career-ladder model, I focus on 12 major occupational
groups and examine the ‡ows of workers who switch occupations within their …rms. It’s
worth noting that 64% of 3-digit switches within …rms involve a switch among major
occupations as well. Table 1 presents the pairs of occupations with the most ‡ows. I’m
interested in whether these ‡ows are ‡ows in one direction only, or whether they are
bidirectional. With that in mind, I compute the ratio of the absolute di¤erence between
the two ‡ows over the sum of the two ‡ows, in other words the ratio of net ‡ows over
gross ‡ows. The career-ladder model would predict that this ratio should be close to 1.
Instead, for all occupational pairs shown in Table 1 the ratio is far from 1 and for several
pairs it is close to zero. This pattern is not driven by the pairs of occupations with the
most ‡ows presented in the table. When I compute the weighted average of the above
ratio for all pairs (5743 observations), I …nd that it’s equal to 0.137 (the ratios for all
5
Occ. Switching
Occ. Switch
Occ. Switch
Occ. Switch
Occ. Switch
Prob. (Probit) Prob. (Probit) Prob. (Probit) Prob. (Probit) Prob. (Probit)
Same Employer
Same Empl
Same Empl
Same Empl
Same Empl
All
Services
Manufact
High School
College
100 employees
0.0122
0.0129
0.0093
0.0125
0.0158
(0:0014)
(0:0021)
(0:0047)
(0:0018)
(0:0049)
ln(wage)
-0.0046
(0:0013)
-0.0056
(0:002)
-0.0068
(0:0035)
-0.0048
(0:0017)
Table 2: Wage and Size Impact on Within Firm Reallocation. 1996 Panel of Survey of
Income and Program Participation. 4-month probabilities. 3-digit occupations. Controls
include gender, race, education, medium size …rm dummy, quartic in age, marital status,
population density, 11 industry dummies, 13 occupation dummies. Services includes
industry codes 700 through 893. Manufacturing includes industry codes 100 through
392. Standard errors clustered by individual. Coe¢ cients represent marginal e¤ects
evaluated at the average value of the 4-month probability, which equals 0.028 (all), 0.026
(services), 0.033 (manufacturing), 0.029 (high school), 0.03 (college) respectively. 130,839
(all), 47,345 (services), 26,983 (manufacturing), 93,669 (high school), 13,439 (college)
observations respectively.
‡ows are presented in the Online Appendix). Put di¤erently, gross ‡ows are on average
almost an order of magnitude greater than net ‡ows and therefore I conclude that most
‡ows within …rms are bidirectional.8
I next examine how within …rm mobility is a¤ected by …rm size and wage. Table
2 shows the impact of wages and …rm size on 3-digit occupational switching within a
…rm. Among workers who remain with the same employer for two consecutive waves
(4 month period), workers earning higher wages are less likely to switch occupations.
Furthermore, conditional on wages, workers in large …rms are signi…cantly more likely to
switch occupations within a …rm. The size e¤ect is quantitatively large, as working in a
large …rm increases the four-month probability of an occupational switch within a …rm by
almost 45%.9 The results remain quantitatively large when focusing on manufacturing,
services, high school graduates or college graduates.
8
I reach the same conclusion when looking at within-…rm ‡ows between 3-digit occupations. While
there are a lot fewer ‡ows for each pair, when I focus on pairs that have at least 30 ‡ows between them,
the weighted ratio of net over grows ‡ows is equal to 0.191.
9
If workers in small …rms perform multiple tasks, whereas in large they specialize (Lazear, 2004), this
might arti…cially generate more mobility in small …rms. Then the true di¤erence between large and small
…rms would be larger.
6
-0.0053
(0:0031)
Occ Switch Dummy
ln(wage)t
1
ln(wage)t
Same
Employer
All
0.0242
(0:0029)
0.8914
(0:005)
ln(wage)t
Same
Employer
Services
0.0244
(0:0052)
ln(wage)t
Same
Employer
Manufact
0.0189
(0:0051)
ln(wage)t
ln(wage)t
Same
Same
Employer
Employer
High School
College
0.0219
0.0559
(0:0033)
(0:0106)
0.8868
(0:0091)
0.9009
(0:0088)
0.8942
(0:0054)
0.8943
(0:0215)
Table 3: Impact of Occupation Switch Within a Firm on Wage. 1996 Panel of Survey
of Income and Program Participation. 4-month intervals. 3-digit occupations. Controls
include gender, race, education, quartic in age, marital status, population density, 11
industry dummies, 13 occupation dummies. Services includes industry codes 700 through
893. Manufacturing includes industry codes 100 through 392. Standard errors clustered by
individual. 122,536 (all), 43,714 (services), 25,797 (manufacturing), 88,135 (high school)
and 11,645 (college) observations respectively.
It’s also worth noting that 88% of within …rm ‡ows are switches to occupations that
the worker has not worked in, during the past 3 years.
Finally, I study the impact of within …rm mobility on wages. As shown in Table 3,
workers who switch occupations within their …rm experience a wage increase of 2.42% on
average. Again, this result holds when the two industries and two education groups are
considered separately.
2.2
Across Firm Occupational Reallocation
I now turn to the across …rm reallocation and consider how …rm separation probability is
a¤ected by wages and …rm size. The …rst column of the top panel of Table 4 shows that
workers in large …rms are signi…cantly less likely to separate from the …rm. In particular,
the …rm separation probability is 26% lower in large …rms. In the second column of
the top panel of Table 4 it’s worth noting that workers in large …rms are less likely to
separate, even conditional on the worker’s wage. Even though the size coe¢ cient falls in
magnitude as expected, it continues to be economically large and statistically signi…cant:
indeed, the impact of size on the separation probability is as large as that of a 28% wage
increase. Again these facts hold when considering workers in manufacturing or services,
7
100
empl
Separation
Probability
All
-0.0283
(0:0023)
ln(wage)
Separation
Probability
All
-0.0202
(0:0024)
Separation Separation
Probability Probability
Services
Services
-0.0197
-0.015
(0:0038)
(0:0038)
-0.0721
(0:0029)
Separation
Probability
Manufact
-0.0437
(0:005)
-0.0578
(0:0045)
Separation
Probability
Manufact
-0.031
(0:0051)
-0.0762
(0:0063)
Separation
Separation Separation Separation
Probability Probability Probability Probability
High School High School
College
College
100 empl
-0.0287
-0.0205
-0.0405
-0.0333
(0:0028)
(0:0028)
(0:0074)
(0:0074)
ln(wage)
-0.0721
(0:0035)
-0.0558
(0:0068)
Table 4: Wage and Size Impact on Firm Separation Probability (Probit). 1996 Panel of
Survey of Income and Program Participation. 4-month probabilities. Controls include
gender, race, education, medium size …rm dummy, quartic in age, marital status, population density, 11 industry dummies, 13 occupation dummies. Services includes industry
codes 700 through 893. Manufacturing includes industry codes 100 through 392. Standard
errors clustered by individual. Coe¢ cients represent marginal e¤ects evaluated at the average value of the 4-month probability, which equals 0.1097 (all), 0.113 (services), 0.071
(manufacturing), 0.104 (high school) and 0.103 (college) respectively. 150,936, 55,027,
29,580, 107,435 and 15,618 observations respectively.
and when considering workers with a high school or college degree.10 In addition, the
overall occupational switching behavior (i.e. both within and across …rms) of workers in
large …rms is di¤erent than those in smaller …rms. More speci…cally, workers in larger …rms
are signi…cantly less likely to switch occupations overall (results available upon request).
2.3
Firm Size Wage Premium
Table 5 presents the well-known empirical regularity that workers in …rms with more
employees earn higher wages. In particular workers in …rms with more than 100 employees
10
The results are qualitatively and quantitatively similar in magnitude if I focus on separating and
switching occupations. In addition, wages increase upon separating and switching occupations (results
available upon request).
8
100 empl
25-99 empl
ln(wage)
All
0.1311
(0:0047)
ln(wage)
Services
0.0932
(0:0073)
0.0298
(0:0054)
0.016
(0:0086)
ln(wage)
ln(wage)
ln(wage)
Manufacturing High School
College
0.1878
0.13
0.1588
(0:0126)
(0:0056)
(0:0179)
0.0215
(0:0139)
0.0359
(0:0066)
0.0316
(0:0221)
Table 5: Wage Premium. 1996 Panel of Survey of Income and Program Participation.
Controls include gender, race, education, quartic in age, marital status, population density, 11 industry dummies, 13 occupation dummies. Services includes industry codes 700
through 893. Manufacturing includes industry codes 100 through 392. Standard errors
clustered by individual. 169,536 (all), 61,968 (services), 32,408 (manufacturing), 119,383
(highschool) and 17,203 (college) observations respectively.
earn on average 13% higher wages than workers in …rms with fewer than 25 employees.
Moreover, the wage premium is present for workers employed in the service industry, in
manufacturing, as well as when focusing on workers with a high school degree and also
those with a college degree.11
I next ask whether the observed wage premium is due to the number of …rm employees
or the larger selection of occupations in large …rms and thus I investigate how the wage
premium depends on the number of occupations in the …rm, separately from …rm size.12
The number of occupations in each …rm can only be obtained through the use of
matched employer-employee datasets and, to my knowledge, the existing U.S. matched
employer-employee datasets do not contain detailed information on workers’occupations.
This is not true however for the matched employer-employee datasets of other countries,
such as Denmark. The Integrated Database for Labour Market Research (IDA) created
by Statistics Denmark contains detailed information on all employers and all employees in
the Danish economy. The interested reader should refer to Frederiksen and Kato (2014)
for more details on the IDA dataset.13
The …rst column of Table 6 presents the wage premium for Denmark in 2007. As in
11
The wage premium is also present if I consider establishment size instead of employer size.
The model presented in the next section predicts that …rms with more occupations have more employees. In the data the correlation between number of workers and number of occupations, while positive,
is less than one (see Online Appendix), as there are other factors leading to variation in …rm size. The
fundamental mechanism in the model that generates the wage premium is the number of occupations,
rather than the number of …rm employees.
13
I am grateful to Anders Frederiksen for running the regressions below on the IDA dataset.
12
9
ln(labor income) ln(labor income)
ln(# of employees)
0.0124
-0.0114
(0:0002)
(0:0006)
4-6 occupations
0.0562
(0:0020)
7-10 occupations
0.1123
(0:0022)
11-20 occupations
0.1616
(0:0024)
21-40 occupations
0.1784
(0:0031)
>40 occupations
0.1958
(0:0042)
Table 6: Log Annual Labor Income on Firm Size and Number of Occupations in 2007.
Integrated Database for Labour Market Research - Denmark. Controls include gender,
years of schooling, quadratic in age and labor market experience and 9 industry code
dummies. Actual labor market experience. Labor market experience is constructed using
pension data from 1964 onwards. Log of annual labor income include both base pay and
variable pay components such as bonuses. 1,196,522 observations.
the U.S. data, an increase in …rm size, as measured by the number of employees, leads
to economically and statistically large increases in earnings. In the second column, I run
the same regression, but now ‡exibly control for the number of 3-digit occupations of
the …rm, by creating six groups. Increasing the number of occupations inside the …rm
leads to large and statistically signi…cant earnings increases for workers. In particular, all
else equal, workers in …rms with more than 40 occupations have almost 20% higher labor
income compared to workers in …rms with 3 occupations or less, conditional on …rm size.
The coe¢ cient on …rm size is now negative.14
Thus the results in Table 6 are consistent with the notion advanced in this paper that
a key mechanism behind the wage premium is the presence of more occupations in large
…rms rather than the number of …rm employees per se.
Summarizing, the patterns of within …rm mobility suggest that there is no typical
14
Brown and Medo¤ (1989) showed that the size premium persists even when controlling for a number
of …rm and worker characteristics. They did not control however for the number of occupations.
One explanation for the negative coe¢ cient is that in …rms with many employees, search frictions
are still present within the …rm, and that information dissemination is harder than in small …rms. For
example, in very large …rms it may be di¢ cult for a department that has an opening to have good
information about workers in other departments. In that case, workers who are in small …rms but with
many occupations are at an advantage compared to workers in big …rms with same number of occupations,
consistent with the regression results.
10
career-ladder that workers follow within the …rm. Instead, ‡ows within a …rm are o¤setting. Workers with low wages, as well as workers in large …rms are more likely to switch
occupations within their …rm. Moreover, workers in larger …rms are less likely to separate from their …rm. The size e¤ect is quantitatively large and holds even conditioning
on wages. In addition, I replicate the well-known size wage premium and also …nd that
workers in …rms with more occupations have substantially higher earnings, even when
holding constant the number of …rm employees.
3
Model
Motivated by the facts documented in the previous section, I develop a tractable, equilibrium model of within and across …rm reallocation. Given my …ndings above, I build on
the matching theory of Jovanovic (1979) and assume that workers learn about the quality
of their occupational match within a …rm. This is consistent with o¤setting ‡ows within
a …rm, documented above. There are no search frictions inside the …rm.15 Search frictions across …rms are modeled in a similar fashion to the search and matching literature:
matches between workers and …rms are determined by a matching function and the rents
to realized matches are split between workers and …rms using Nash bargaining.
In equilibrium, …rms with more occupations employ more workers. Workers in larger
…rms are more selective and abandon unpromising matches faster than workers in smaller
…rms. As a result they are more likely to switch occupations within their …rm, conditional
on wages. In addition, workers in larger …rms have more options, so the separation rate
is lower, even conditional on wages. In large …rms, across …rm reallocation is replaced by
within …rm reallocation. Workers with higher wages are less likely to switch occupations
both within, as well as across …rms. Workers in larger …rms are better matched and earn
higher wages. The within-…rm wage distribution has a similar shape as the economy-wide
wage distribution and both are empirically accurate.
I …rst describe the environment, solve for the equilibrium, then describe the implications and …nally derive the within-…rm and overall wage distributions.
3.1
The Economy
Time is continuous. There is a large mass of ex ante identical potential …rm entrants.
Firms enter the market by paying c. Each …rm, upon entering, draws the number of its
15
In the Online Appendix, I extend the model to allow for search frictions inside the …rm as well.
11
occupations, m, from a known distribution with support f1; 2:::; M g and corresponding
probabilities qm .16 Firms are destroyed exogenously according to a Poisson process with
parameter > 0, in which case all of its employees become unemployed.
There is a population of risk neutral workers of mass one. A worker can be either
employed or unemployed. Both workers and …rms have a discount rate equal to r.
There are search frictions: workers and …rms need to search for each other. Search is
undirected and an unemployed worker meets a …rm according to a Poisson process with
parameter > 0.17 All unemployed workers are identical and an unemployed worker
earns b.
When employed, a worker works in one task at a time. When he begins employment at
a …rm with m tasks, he randomly picks one of those m tasks which are ex ante identical.
The ‡ow output for worker i, in task j, in …rm l at time t is given by
dYtijl =
ijl
dt + dZtijl ;
G
where dZtijl is the increment of a Wiener process, ijl 2
; B is mean output per
unit of time and > 0. In addition G > B and productivities, ijl , are independently
distributed across tasks, …rms and workers.18
Furthermore ijl is unknown. Before starting work at a task, a worker draws his
productivity parameter, ijl , from a Bernoulli distribution, with parameter p0 , the probability that ijl = G .19 To avoid a situation where a worker never chooses to be employed,
I assume that p0 G + (1 p0 ) B > b.
At any point, a worker can leave his current task and go to another one or to unemployment. In the Online Appendix, I extend the model to allow for search frictions
inside the …rm, whereby a new task becomes available to the worker at some rate (see also
discussion in Section 3.5). There are no restrictions in the number of workers employed
in a task, i.e. there are no congestion externalities. A worker cannot return to a task.20
A worker separates from his …rm either endogenously or exogenously if his …rm exits the
16
I use the terminology “occupations” or “tasks”, but the reader may choose to think of these as
departments, teams, locations etc.
17
is the implied job …nding rate from an increasing, constant returns-to-scale matching function
m (u; f ), where u is the mass of unemployed workers and f is the mass of operating …rms.
18
In the Online Appendix, I also consider a setup where workers are hierarchically ranked, so that
abilities are positively correlated across tasks and workers learn about their unobserved productivity.
In addition, it is straightforward to extend the model to allow for hierarchies within occupations.
ijl
19
The results that follow also hold for the case where pijl
0 is not the same for all matches, but p0 is
distributed according to a known distribution with support [0; 1].
20
As noted in the Section 2, the vast majority of within …rm ‡ows, are switches to new occupations.
12
market.
Search frictions generate rents to realized matches which are split between the worker
and the …rm through the wage setting mechanism. Following the literature, I assume
that the wage is determined by generalized Nash bargaining between the …rm and each
worker separately, with 2 (0; 1) denoting the worker’s bargaining power. In Section
3.5 I discuss how the results below go through when …rms post wages. Nash bargaining
implies that the incentives of workers and …rms are perfectly aligned. In what follows,
I describe behavior from the worker’s point of view, e.g. “the worker leaves his current
task and moves to another one.”Alternatively one can also view the outcome as the …rm
reassigning the worker to another task.
While employed, workers observe output and obtain information regarding the quality
of the match in that speci…c task. Let pijl
t denote the posterior probability that the match
of worker i with task j in …rm l is good, i.e. ijl = G . In particular, a worker observes
his ‡ow output, dYtijl , and updates pijl
t , according to (Liptser and Shyryaev, 1977)
dpijl
t
G
=
pijl
t
1
pijl
t
dYtijl
pijl
t
G
+ 1
pijl
t
B
dt
;
(1)
B
where =
. The last term on the right hand side is a standard Wiener process
with respect to the unconditional probability measure used by the agents. To minimize
notation, from now on, I drop the t subscript, as well as the i, j and l superscripts.
The beliefs regarding the quality of the worker-task match follow a Bernoulli distribution. The posterior probability is thus, a su¢ cient statistic of the worker’s beliefs and a
state variable for his value function.
3.2
Equilibrium
I focus on the stationary equilibrium of the above economy. I analyze equilibrium wages
and optimal behavior of workers and …rms. In the next section I examine the model’s
implications and also derive the equilibrium …rm size.
Besides their employment status and beliefs regarding their current task, workers
also di¤er in their opportunities to work in other tasks within the …rm. The number of
remaining tasks, k, available to the worker in his current …rm, therefore, also constitutes
a state variable for the value of an employed worker.21
21
Note that the total number of tasks in the worker’s current …rm is not relevant, as employment
history in previous tasks does not a¤ect current or future payo¤s.
13
Given the process for the evolution of beliefs (equation (1)), the Hamilton-JacobiBellman equation of a worker employed in a task with posterior p and k 0 other tasks
are available to work in, is given by
rW (p; k) = w (p) +
1 2 2
p (1
2
p)2 Wpp (p; k)
(W (p; k)
U) ;
(2)
where W ( ) is his value, Wpp ( ) is the second derivative of W ( ) with respect to p and
w ( ) is his wage, which as shown below, depends only on p. The ‡ow bene…t of the worker
consists of his ‡ow wage, plus a term capturing the option value of learning, which allows
him to make informed decisions in the future (e.g. whether to leave his current task or
whether to quit to unemployment). Finally, at rate the worker exogenously separates
from the …rm.
The ‡ow value of an unemployed worker is given by:
rU = b +
M
P
qm W (p0 ; m
1)
U;
(3)
m=1
where qm is the fraction of …rms with m tasks.
The solution to the generalized Nash bargaining problem results in the linear sharing
rule:
J (p; k) = (1
) (W (p; k) U ) ;
(4)
where J (p; k) is the value to the …rm of employing a worker with posterior p and k tasks
remaining. As shown in Appendix A, using the above equation, I obtain:
Lemma 1 The worker’s wage is given by
w (p) =
where
(p) = p
G
+ (1
p)
B
(p) + (1
) rU;
(5)
is the worker’s expected output.
Note that the wage is an a¢ ne transformation of the posterior, p and does not depend
on k: As in the model of Mortensen and Pissarides (1994), wages only depend on the
current level of output and not on its future path.22
If there is an increase in the number of tasks available to the worker, then the worker
cannot be worse o¤, as he has the option of not trying them out, so W ( ) is non-decreasing
in k. Moreover, it cannot be the case that W (p; k + 1) = W (p; k), as that would imply
22
The wage equation (5) is almost identical to the wage equation in the Mortensen-Pissarides (1994)
model (see for instance equations (2.9) and (2.10) on page 42 of Pissarides, 2000).
14
that a worker can forever ignore one task and obtain the same value. This can not be
true, as the option value of trying out one more task is positive, hence:23
Lemma 2 The value of an employed worker, W ( ), is increasing in k.
A worker needs to decide when to leave his current task and move on to either another
task, or to unemployment.
Take the case of a worker with k 1. The solution to the optimal stopping problem
is given by a trigger p (k), such that the following value matching and smooth pasting
conditions are satis…ed:
W p (k) ; k = W (p0 ; k 1) ;
(6)
and
(7)
Wp p (k) ; k = 0:
Note that I allow the threshold, p (k), to depend on the number of remaining tasks in the
current …rm.
When there are no more tasks available, i.e. k = 0, the worker optimally quits to
unemployment when:
W p (0) ; 0 = U;
(8)
and
(9)
Wp p (0) ; 0 = 0:
In Appendix B, I use the boundary conditions (6) through (9) to solve for the value
function of a worker. The following proposition states the result:
Proposition 3 The value of an employed worker, W ( ), is increasing in p. A worker
optimally leaves his current task when his posterior hits p (k), de…ned as:
p (0) =
(
(
1) (rU
B
1) rU
B ) + ( + 1) (
23
G
rU )
;
(10)
Consider a worker with current posterior p < 1 and k tasks remaining. Assume W ( ) is increasing
in p (this is shown to be true below) and that one more task becomes available. Consider the following
strategy: if his current task proves unsuccessful, which, as shown below, occurs when his belief reaches
a threshold, p (k), instead of moving on to another task, he begins employment in the newly-available
task instead. In this case, he has k tasks remaining again, but a higher posterior p0 > p (k), so his value
must be higher. Since the worker leaves his current task with positive probability, then the option value
of trying out one more task is positive.
15
for k = 0 and:
p (k) =
(
( + 1) (
for k > 0, where
(3).24
=
1) (r + ) W (p0 ; k 1) (r +
G
B)
2 ((r + ) W (p0 ; k 1)
q
8(r+ )
2
r) U
(r +
B
r) U
B)
;
(11)
+ 1 and the value of unemployment is de…ned in equation
The above proposition, along with Lemma 2, immediately results in the following
corollary:
Corollary 4 The triggers p (k), are increasing in k.
In other words, the better their outside option, the more likely are workers to leave
an unpromising match.
Finally the mass of operating …rms is pinned down by the free …rm entry condition
M
P
qm Vm = c;
m=1
where Vm is the value of a …rm with m tasks and no workers.
3.3
Model Implications
In this section I discuss the model’s implications regarding wages, within …rm reallocation
and …rm separation rates. I also derive the equilibrium …rm size.
The equilibrium evolution of beliefs, employment states and remaining tasks is a positive recurrent process: starting from any posterior and any number of remaining tasks k,
(p; k) 2 (p (k) ; 1] [0; M 1], the joint process returns to (p; k) in…nitely many times, as
long as > 0.25 Therefore there exists a stationary distribution of beliefs and remaining
tasks.
I …rst show that workers are more productive, on average, in …rms with more tasks,
m. The proof consists of two steps: I …rst consider all workers with k tasks remaining and
24
A worker quits to unemployment only after exhausting all tasks available to him, i.e. W (p0 ; 0) > U .
To see this note that if a worker were to never work in the last task then this would imply that p (0) p0 .
Since p (0) < p (k) for all k > 0 (see Corollary 4 below), a worker chooses not work in the last task if and
only if he refuses to work in any task (i.e. the worker always prefers to remain unemployed). This does
not happen since by assumption p0 G + (1 p0 ) B > b.
25
An unemployed worker also returns to being unemployed in…nitely many times. The assumption that
> 0, while not necessary for the existence of a stationary equilibrium, ensures that in equilibrium not
all workers have found a good match and thus never switch occupations.
16
show that average productivity is increasing in k. I next establish that there is a higher
percentage of workers that have few tasks remaining (low k), in …rms with fewer tasks
(low m).
The following lemma, proved in Appendix C, states that the cross-sectional posterior
mean of all workers with k tasks remaining, is increasing in k.
Lemma 5 Let Fk (x) be the probability that a worker with k tasks remaining has posterior,
p, less or equal to x, i.e. Fk (x) Pr (p xjtasks remaining = k). Then
Z
1
p(k0 )
when k 0 > k.
Moreover, let
k
Z
pfk0 (p) dp >
Z
1
pfk (p) dp;
p(k)
1
p
G
+ (1
p)
B
fk (p) dp;
p(k)
denote the mean output of tasks in which the employed worker has k tasks remaining.
Then Lemma 5 immediately implies that:
Lemma 6 Workers who have more tasks available to work in, have, on average, higher
output, i.e.
k0
>
k;
whenever k 0 > k.
Intuitively, from Corollary 4, workers with many tasks remaining (high k) are more
selective, i.e. have higher p (k). This implies that when their (expected) output falls, they
are more likely than workers with few tasks remaining to switch to a new task with higher
output. As a result, on average, their output is higher than workers with few tasks.
I next show that in …rms with many tasks (high m), there is a lower percentage of
workers that have few tasks remaining (low k).
Intuitively, consider two …rms, one with m = 10 and one with m = 2. In the former
…rm, all workers start o¤ with k = 9. If their task does not work out, they move on to
k = 8 and so on. In the latter …rm, all workers start o¤ with k = 1 and then might move
on to k = 0. Comparing the steady state distribution across tasks remaining, k, in the
two …rms, it’s natural to expect a much lower percentage of workers to have k = 1 or
17
k = 0 in the …rm with 10 tasks compared to the …rm with 2 tasks. In addition, no workers
have k > 2 in the m = 2 …rm.
Formally, let sm
k be the share of workers with k tasks remaining, in …rms with m tasks
in total. The average productivity of …rms with m total tasks is given by
m
P1
sm
k
k:
k=0
I next examine how the employment share of task k, sm
k , changes as m increases.
Consider a worker who has just been hired by a …rm. Let Pr (m 1jm) denote the
probability that a worker starting o¤ at task m reaches task m 1.26 Then the probability
that a worker entering a …rm with m tasks, reaches task k is given by
Pr (m
1jm)
Pr (m
2jm
1)
:::
Pr (kjk + 1) :
Since Pr (jjj + 1) 2 (0; 1) for all j, the probability of a worker reaching task k m 1,
declines with the number of …rm tasks, m (even though the probability of being reassigned
to another task increases in the number of remaining matches). In other words, the
expected amount of time a worker spends in tasks greater than k is increasing in m.
However, conditional on reaching task k, the expected amount of time he spends in every
one of the remaining k tasks has not changed. The expected share of time a worker spends
in every task, k
m 1, while employed in the …rm, is therefore decreasing in m. By
Birkho¤’s Ergodic Theorem the following lemma holds:
Lemma 7 The steady state …rm employment share, sm
k , of every task, k
decreasing in the total number of …rm tasks, m.
m
1, is
Put di¤erently, the distribution of workers across tasks is better, in a …rst-order stochastic dominance sense, in …rms with more tasks. Lemma 7, along with Lemma 6, imply
that in …rms with a large number of tasks, fewer workers are employed in tasks where
average productivity is low, leading to the following proposition:
Proposition 8 Average …rm productivity,
m
P1
sm
k
k,
is increasing in the number of total
k=0
…rm tasks, m.
26
Ignoring shocks, the probability that a worker’s posterior belief, p0 , reaches p (m 1), before it
reaches 1, unconditionally on match quality, is given by 1 1p(mp0 1) (Karlin and Taylor, 1981). The result
that follows holds for
> 0, since Pr (jjj + 1) 2 (0; 1) for all j, when
18
is …nite.
Since wages are an a¢ ne function of expected output, then:
Corollary 9 Average wages are higher in …rms with more tasks, m.
This is consistent with the …ndings in Table 6 where workers in …rms with more
occupations earn higher wages.
It’s worth noting that since all workers start o¤ with the same wage, w (p0 ), regardless
of …rm size, the wage premium is generated through di¤erential wage growth rates in
large …rms relative to smaller ones.27 Empirically, the …rm-size wage premium is indeed
higher for workers with longer tenure. In particular, it more than doubles when comparing
newly hired workers to workers that have more than 9 years of tenure in the …rm (results
available upon request).
I next explore the framework’s implication regarding within …rm reallocation. The
probability a worker with posterior belief p, switches occupations when he has k tasks
remaining within the …rm, ignoring shocks, is given by (Karlin and Taylor, 1981)
Occ Switch Prob =
1
1
p
:
p (k)
The probability of switching occupations within the …rm is declining in p and therefore
the wage, w (p), but increasing in p (k). Corollary 4 states that p (k) is increasing in the
number of remaining tasks, k, while from Lemma 7 workers in high m …rms have higher
k on average. Thus the following proposition holds:
Proposition 10 (i) Conditional on wages, workers in …rms with more tasks, m, are
more likely to switch occupations within their …rm. (ii) Workers with higher wages are
less likely to switch occupations within their …rm.
Both predictions are consistent with the results of Table 2. Moreover, allowing for
exogenous match destruction, > 0, does not change the above result: It decreases the
probability of an occupational switch for all workers uniformly, since by assumption, it
does not depend on …rm size or wages.
I next explore the impact of occupational switching on wages. The model implies that
workers who switch occupations within a …rm see their wages increase, as they leave an
unproductive match and move on to a (potentially) more productive one. Formally, since
p0 > p (k) for all k, from Lemma 1, w (p0 ) > w p (k) :
27
It is easy to show that relaxing the assumption that p0 is the same for all matches (by allowing it
to be distributed according to a known distribution with support [0; 1]), leads to a positive initial wage
premium for workers in …rms with more tasks, m. All other results continue to hold.
19
Proposition 11 Workers who switch occupations within the …rm experience wage gains.
Indeed, this prediction is consistent with the data: Table 3 reveals that workers switching occupations within a …rm experience wage gains.
I now examine how the …rm separation rate varies with the number of occupations or
tasks, m. The probability that a worker, who just found employment in a …rm with m
tasks, separates endogenously, is given by (ignoring shocks)
Pr (m
1jm)
Pr (m
2jm
1)
:::
Pr (U nj1) ;
where Pr (U nj1) is the probability a worker starting o¤ in the last task quits to unemployment. Since Pr (jjj + 1) 2 (0; 1) 8j, the probability of endogenous …rm separations
declines with the number of …rm tasks, m. It is clear that the above result does not
change if > 0, since that shock is independent of m. The following proposition holds:
Proposition 12 Firms with more tasks, m, have lower separation rates.
Note that Proposition 12 implies that average tenure in …rms with more tasks is higher.
Thus the following corollary immediately follows:
Corollary 13 Firms with more tasks, m, have more employees.
Therefore, all the results where large …rms are de…ned to be those with more tasks,
also hold when considering the number of workers as the measure of size.
The SIPP does not contain information on the number of occupations per …rm, but
only its size (number of workers in reported in three bins). In the Online Appendix,
I use the Danish matched employer-employee data and document that …rms with more
occupations have more workers, consistent with Corollary 13 above.
The …rm separation probability of two workers with the same posterior p and therefore
the same wage, is lower for the one with higher k, based on the derivation above. From
Lemma 7, the following proposition is true:
Proposition 14 Conditional on the wage, the …rm separation probability declines in the
total number of …rm tasks m.
Again, this is consistent with the results presented in Table 4, where, conditional on
wages, the annual separation probability is approximately 6% lower for workers in large
20
…rms. Moreover, consistent with the data, wages negatively a¤ect the …rm separation
probability.
Comparing two workers who are similarly matched (and therefore earn the same wage),
the worker in the larger …rm is less likely to separate; indeed if their match doesn’t work
out, the worker employed in the larger …rm has more alternatives within his …rm and
therefore is less likely to leave his employer (Proposition 14). Indeed, as shown above
(Proposition 10 and Table 2), conditional on wages workers in large …rms are more likely
to move within the …rm, which is consistent with them having more options available.28
3.4
Within-Firm and Economy-Wide Wage Distributions
I next consider the distribution of wages in the above economy, as well as the distribution
of wages with each …rm. Both distributions are derived analytically and the resulting
expressions are presented in Appendix D, while the detailed derivations can be found in
the Online Appendix. The following proposition summarizes the …ndings:
Proposition 15 The unique distribution of wages for the above economy is given by
equations (19) and (20) of Appendix D. In addition, the distribution of wages within a
…rm is given by equations (21) and (22) also in the Appendix. Both wage distributions are
decreasing in the region [w (p0 ) ; w (1)] and feature a fat right Pareto-type tail if > 2 .
The functional form of the within …rm wage distribution is the same as the wage
distribution of the economy, though the coe¢ cients are di¤erent. This is consistent with
empirical evidence from several countries presented in Lazear and Shaw (2009). Moreover
both wage distributions have an empirically accurate shape and can replicate the fat
Pareto-type tail that we observe in the data (see also discussion in Moscarini, 2005).
3.5
Extensions
I next discuss some extensions/modi…cations of the current setup:
Search frictions inside the …rm: I relax the assumption that there are no frictions
inside the …rm. Consider instead the case where a task becomes available inside the
…rm at some rate and a worker needs to decide whether to switch to a new task. If his
beliefs regarding his current task are low enough, he is willing to switch to a new task
if one becomes available. Whether he is willing to switch is captured by some threshold
28
The model also implies that workers see their wages decline prior to an occupational switch. This is
indeed true in the data (results available upon request).
21
level of beliefs. If however a task does not become available and his belief continues to
fall, he may decide that it’s not worth waiting for a new task and instead separate to
unemployment. This will be determined by a di¤erent belief threshold. Both thresholds
depend on the number of remaining tasks, k. Alternatively, it could be the case that
the worker always prefers to remain in his current task waiting for another one, rather
than quit to unemployment. In the Online Appendix I solve for these thresholds and
characterize optimal worker behavior when there are search frictions inside the …rm.
Human capital accumulation: I also extend the model to allow for the possibility
of human capital accumulation. In particular, there are now two types of workers: workers
who have accumulated human capital and workers without human capital. Workers who
do not have human capital, accumulate it at some rate, . Expected output produced by
(p), where > 1, while workers without human
workers with human capital is now
capital produce (p) as before. In the case of a separation, a worker loses the human
capital he has accumulated.
Workers behave di¤erently depending on their human capital level. More speci…cally,
there are now two threshold, pnohc (k) and phc (k). The solution of the model in this case,
which is substantially more complicated, is presented in Appendix E, whereas detailed
derivations are contained in the Online Appendix.
On-the-job search: The model can be extended to allow for on-the-job search. In
particular, assume that, following an appropriate modi…cation of the matching function,
employed workers are allowed to contact other …rms at rate , where > 0 captures
the e¤ectiveness of employed workers’search technology relative to unemployed workers.
Search is costless and unobserved by the …rm.
If an employed worker meets another …rm, he chooses the …rm where his value is the
highest when receiving the wage resulting from Nash bargaining. In the Online Appendix, I show that this is the equilibrium outcome of an ascending auction in which the
current and poaching …rms place bids to attract the worker. I also discuss there how the
assumptions behind the auction preserve the convexity of the payo¤ set, thus overcoming
Shimer’s (2006) critique.
An employed worker with current belief p and k tasks remaining, who contacts a
…rm with m tasks, switches to the new …rm if and only if W (p0 ; m 1) > W (p; k) or
p < potj (k; m), where the implicit threshold potj (k; m), depends on both k and m. An
employed worker’s value function now becomes
22
1 2 2
p (1 p)2 Wpp (p; k)
(W (p; k) U )
2
M
P
qm max fW (p0 ; m 1) ; W (p; k)g W (p; k) ;
rW (p; k) = w (p) +
+
m=1
where the last term captures the discrete gain to the worker if he meets a …rm where his
value is higher. The value of the …rm, J (p; k), is similarly modi…ed to capture its loss
when a worker moves to another …rm.
Following similar calculations as in the model without on-the-job search (see steps in
Appendix A), the wage function now becomes
w (p) =
(p) + r (1
)U
M
P
qm I fW (p0 ; m
(12)
1) > W (p; k)g ((1
) (W (p0 ; m
1)
W (p; k)) + J (p; k)) :
m=1
Note that the worker’s wage now has an additional term compared to the case with no
on-the-job search (equation (5)). E¤ectively, his wage is reduced by an amount proportional to his search intensity. When the worker leaves his current …rm for another …rm,
the separation is no longer bilaterally e¢ cient, as there are lost rents for the incumbent
…rm. The worker compensates his …rm by an amount equal to the weighted average of
M
P
the expected worker’s gains,
qm I fW (p0 ; m 1) > W (p; k)g W (p0 ; m 1) W (p; k),
m=1
and the …rm’s losses, J (p; k), multiplied by his job …nding probability.
Unlike the model without on-the-job search, it is not possible to solve for the decision thresholds p (k) and potj (k; m) analytically, but instead they need to be computed
numerically.
Directed search and wage posting: In the main model above wages are determined
by generalized Nash bargaining between the worker and the …rm. Li and Tian (2013)
modify the setup developed in present paper, to allow for …rms to post wages (rather
than bargain with the workers). In addition, workers now choose to which …rm they
should apply (directed search). In equilibrium, workers are indi¤erent across …rms with
di¤erent number of occupations, m. That setup as well can generate the observed sizewage premium and also a negative relationship between …rm size and worker separation
rates.
23
mL = 2 mL = 3 mL = 3 mL = 4 mL = 4 mL = 5 mL = 5
Parameters: mS = 1 mS = 1 mS = 2 mS = 1 mS = 2 mS = 1 mS = 2
p0
0.21
0.296
0.118
0.261
0.192
0.309
0.255
0.304
0.139
0.28
0.108
0.245
0.12
0.172
L
q
53.81% 51.97% 58.55% 50.18% 57.12% 50.38% 56.73%
0.058
0.072
0.072
0.073
0.08
0.079
0.085
G
27.61
41.56
46.34
60.16
37.57
44.20
41.52
B
4.37
-4.07
4.15
-8.81
2.37
-5.94
-1.98
Table 7: Calibrated Parameters
3.6
Calibration
I next calibrate the model to show that it can replicate some of the key facts presented in
this paper. I use information on worker reallocation and the level of wages to calibrate the
model’s parameters and then check some additional moments, such as the wage premium.
A number of observable characteristics such as education, gender and race have been
found to impact wages (Abowd et al., 1999) and occupational mobility (Kambourov and
Manovskii, 2008). Given that, I restrict my analysis to the largest subsample in my data
along these dimensions: white males with a high school education.
Following the size partition in the data, I divide …rms to those with more than 100
employees and those with fewer than 100 employees. To match the SIPP’s sampling
procedure, I sample the simulated data every four months. I next describe the calibration
procedure. Appendix F contains more details.29
I …rst discuss the parameters that are calibrated independently. The discount rate, r,
is set to 1.64% which equals approximately 5% annually. The job-…nding probability, , is
calibrated as in Shimer (2012), leading to a value of 0.78. Since the SIPP contains weekly
employment information, time aggregation is not a concern. Following Shimer (2005),
the Nash bargaining coe¢ cient, , is set to 0.72. Finally, the value of home production
b, is set to $3.42 which is 30% of average wage in the sample. In what follows I consider
di¤erent combinations for the number of occupations in large and small …rms, mL and
mS . As it turns out, smaller numbers are better able to match the moments.
I need to calibrate the following six parameters: the initial belief, p0 , the signal to noise
ratio, , the exogenous separation rate , the probability an unemployed worker contacts
29
Even though wages in the SIPP are top-coded at $30, this a¤ects less than 1% of observations in the
sample.
24
mL = 2
Data mS = 1
62.94% 62.90%
$11.41 $11.41
$9.50
$9.48
8.75% 8.63%
12.41% 12.58%
-0.032 -0.038
Targeted Moments:
Empl Share Large Firms
Mean Wage
Initial Wage
Separation Prob Large
Separation Prob Small
Large Coe¢ cient
Other Moments:
Wage Gains upon Switch
4.97% 4.90%
Wage Coe¢ cient
-0.0095 -0.0126
Wage Growth after Switch 8.22% 7.09%
% of Wage Pr Explained
mL = 3
mS = 1
62.89%
$11.40
$9.62
8.32%
13.08%
-0.045
mL = 3
mS = 2
62.89%
$11.41
$9.40
9.29%
11.16%
-0.019
mL = 4
mS = 1
63.03%
$11.42
$9.44
7.99%
13.65%
-0.054
4.02%
-0.0109
7.79%
3.38% 2.74%
-0.0056 -0.0091
10.36% 9.76%
mL = 4
mS = 2
62.93%
$11.41
$9.42
9.12%
11.69%
-0.025
mL = 5
mS = 1
62.97%
$11.41
$9.71
8.17%
13.76%
-0.052
mL = 5
mS = 2
62.96%
$11.41
$9.43
9.03%
11.77%
-0.026
2.42% 1.53%
-0.006 -0.0101
11.71% 8.79%
2.32%
-0.0056
11.97%
12.50% 21.79% 13.33% 30.20% 19.13% 34.63% 20.78%
Table 8: Moments
a large …rm rather than a small one, q L ,30 the mean output in a good match, G , and the
mean output in a bad match, B . To do so, I use six moments. In particular, I target
the share of employment in large …rms, the separation rates of large and small …rms, the
average wage level and the average initial wage level. The last moment targeted is the
coe¢ cient of the large …rm dummy in the linear probability regression of the probability
of …rm separation on wages and …rm size. The calibrated parameters are presented in
Table 7.
Informally, identi…cation works as follows: the share of employment in large …rms
pins down q L . G and B are pinned down by the average wage and initial wage levels.
The separation rate of small …rms and the coe¢ cient on large …rms pin down p0 and .
In particular, workers in small …rms separate more often because they are both worse
matched (lower belief) and have fewer options within the …rm; when conditioning on
wages (and thus beliefs), workers in smaller …rms still separate more often because of fewer
options. The initial belief, p0 , determines whether or not separation will (eventually) take
place: if p0 is very high (close to 1), few workers separate; conversely, if p0 is very low
almost all matches eventually dissolve. Thus the overall separation rate of small …rms,
for a given , pins down p0 . On the other hand, , determines how fast a worker learns
and thus separates. The di¤erence in the separation rates, conditional on the wage, for a
30
Note that I don’t assume that workers contact each …rm with the same probability, but allow for the
possibility of workers oversampling large …rms relative to small.
25
35
mL=2, mS=1
mL=4, mS=2
30
25
20
15
10
0.2
0.3
0.4
0.5
Lamda
0.6
0.7
0.8
Figure 1: Fraction of Wage Premium Explained.for Di¤erent Values of Lamda
given p0 , pins down . Finally, the separation rate of large …rms pins down (conditional
on p0 and ).
Table 8 presents the targeted moments, as well as its empirical counterparts. The
model, especially the speci…cations with low values of mL and mS , matches the targeted
moments well. More speci…cally, all speci…cations match the wage levels and the employment shares well. Speci…cations with low values for mL and mS are better able to match
the separation probabilities and the coe¢ cient on the large …rm dummy.
In addition, Table 8 reports three moments that are not targeted in the calibration:
the wage gain that a worker enjoys when switching occupations within the …rm; the wage
coe¢ cient in the separation regression conditional on size; and the annual wage growth
rate in the …rst year following an occupational or …rm switch. Again the speci…cations with
low values of mL and mS match well these non-targeted moments, whereas larger values
are somewhat less successful.31 This suggests that a small number of occupations is a
better approximation.32;33 Finally, in all speci…cations, the fraction of the wage premium
31
Introducing …rm-speci…c human capital in the mL = 2 and mS = 1 calibration leads to similar results.
The solution of the model with human capital is presented in Appendix E and detailed derivations are
contained in the Online Appendix.
32
The Danish data suggests that a small number of occupations is a good approximation for …rms with
fewer than 100 employees.
33
Large …rms, have a number of di¤erent occupations, ranging from janitors to accountants. For the
calibration of this setup however, I am not interested in the total number of occupations in each …rm,
but rather the number of relevant occupations for each worker, which is smaller.
Moreover note that even though the worker does not keep track of past occupations, in equilibrium he
would prefer a new occupation to one he’s already tried out since p0 > p (k) for all k. Thus mL and mS
should be seen as the number of “new”occupations a worker encounters in a …rm, rather than the …rm’s
26
explained ranges from 12.50% to 34.63%. However besides search frictions, there are
arguably other impediments to switching …rms, such as administrative costs and adverse
selection (Acemoglu and Pischke, 1998), whose impact is the same as a decrease in the
job …nding rate. In addition, the job …nding rate ‡uctuates signi…cantly (Elsby et al.,
2009, Shimer, 2012). Given these concerns, it is instructive to consider how the wage
premium varies with the job …nding rate. Figure 1 presents the fraction explained of the
observed wage premium for other values of the job …nding rate, , for the speci…cations
with mL = 2; mS = 1 and mL = 4; mS = 2 which …t the data well. As expected, as
falls, the fraction of the premium explained increases. Workers in larger …rms are now
much better matched than their counterparts in smaller …rms.
4
Conclusion
This paper links the reallocation of workers across occupations but within …rms, and the
reallocation of workers across …rms. Unlike a large literature that has studied frictional
labor markets, while ignoring within …rm mobility, this paper considers the equilibrium
interaction of the two forms of reallocation in the presence of frictions.
I …rst investigate empirically worker mobility within and across …rms and how it’s
a¤ected by the size of the …rm and wages. Workers in large …rms replace across …rm
mobility with within …rm mobility: Workers in large …rms are less likely to separate,
but more likely to switch occupations within their …rm, even conditional on wages. High
wage workers are less likely to reallocate either within a …rm, or across …rms. Moreover, I
…nd that occupational mobility within …rms is not consistent with a career-ladder model.
Flows within a …rm are largely o¤setting. I also show that workers in …rms with more
occupations have substantially higher earnings, even when holding constant the number
of …rm employees.
I then show that bringing together within …rm and across …rm reallocation along with
labor market frictions, naturally accounts for the di¤erences in worker ‡ows and wages
across …rms of di¤erent sizes. I introduce a tractable equilibrium model of within …rm
and across …rm mobility and prove that it is consistent with the observed facts.
The tractability of the model implies that it can be extended in several dimensions: In
the Appendix and Online Appendix, I consider frictions within the …rm, human capital
total number of occupations.
Finally, while in general the level of within …rm mobility is overpredicted in the model, the combinations
of mL = 2 and mS = 1, as well as mL = 3 and mS = 1 are closer to their empirical counterpart than
higher values of mL and mS (3.42% and 4.99% respectively compared to 2.69% in the data).
27
accumulation and on-the-job search. In addition, future work can consider ex-ante worker,
as well as …rm heterogeneity, which are relatively straightforward to introduce in the
model. The contribution of the present paper is that within and across …rm reallocation
should no longer be viewed in isolation.
Appendix
A
Proof of Lemma 1
The value to the …rm of employing a worker with posterior p and k is given by
rJ (p; k) =
(p)
Multiplying eq. (2) by 1
(1
) r (W (p; k)
1 2 2
p (1 p)2 Jpp (p; k)
J (p; k) :
2
and subtracting (1
) rU leads to
(13)
w (p; k) +
U ) = (1
1
(1
) 2 p2 (1 p)2 Wpp (p; k) (14)
2
) (W (p; k) U ) (1
) rU:
) w (p; k) +
(1
Multiplying eq. (13) by , subtracting eq. (14) from it and using the surplus sharing
condition, eq. (4), leads to
w (p; k) =
(p) +
1 2 2
p (1
2
p)2 ( Jpp (p; k)
(1
) Wpp (p; k)) + (1
) rU:
(15)
Finally, by taking the second derivative with respect to p in the surplus sharing condition, eq. (4), and using eq. (15), results in the wage equation, (5).
B
Proof of Proposition 3
The surplus of the match between the …rm and the worker, S (p; k), is given by
S (p; k) = W (p; k) + J (p; k)
U:
Substituting in for W (p; k) and J (p; k) leads to
(r + ) S (p; k) =
(p)
rU +
28
1 2 2
p (1
2
p)2 Spp (p; k) :
The general solution to the above di¤erential equation is given by
S (p; k) =
where
=
q
8(r+ )
2
1
(p) rU
+ K1k p 2
r+
1
2
1
1
1
1
1
p) 2 + 2 + K2k p 2 + 2 (1
(1
1
2
p) 2
;
+ 1 and K1k and K2k are undetermined coe¢ cients that depend on k.
1
1
When p ! 1 however, limK2k p 2 + 2 (1
p!1
1
p) 2
1
2
1
= K2k 1 lim (1
1
2
p) 2
p!1
= +1 which
follows from > 1. Since the present discount sum of the output produced by a good
G
match is given by r+ < 1, it must be the case that K2k = 0, 8k. Thus
1
(p) rU
+ K1k p 2
r+
S (p; k) =
1
2
1
1
p) 2 + 2 ;
(1
where K1k is an undetermined coe¢ cient. Moreover
G
Sp (p; k) =
B
+ K1k
r+
1
2
1
2
p p
1
2
1
2
(1
p)
1
+ 12
2
:
Consider the case where k = 0. Using equation (4), one can rewrite the value matching
and smooth pasting conditions (equations (8) and (9) respectively), in terms of S ( ) and
use them to pin down p (0) and K10 .
From equation (9), one immediately obtains
G
K10
=
B
r+
1
2
1
2
1
1
1
p (0) 2 + 2
p (0)
1
p (0)
1
2
1
2
:
Substituting for K10 into equation (8) and solving leads to equation (10).
Similarly in the case where k > 0, equation (7) leads to
G
K1k =
r+
B
1
2
1
2
1
p (k)
1
1
p (k) 2 + 2
1
p (k)
1
2
1
2
:
Substituting for K1k into equation (6) leads to equation (11).
To show that the value function is increasing in p, note that straightforward calculations show that Spp (p; k) > 0, and therefore Wpp (p; k) > 0, for all p and k. Given that
Wp p (k) ; k = 0, this implies that Wp (p; k) > 0 for all p > p (k).
29
C
Proof of Lemma 5
R1
In this appendix I want to show that p(k) pfk (p) dp is increasing in p (k), and therefore k
(Corollary 4). Intuitively the higher boundary does not let the process move away from
p0 towards zero and shoots it back to p0 sooner. I prove that this holds for the more
general case where each worker draws his prior, p0 from some distribution with support
[0; 1] and known density h ( ). In this case, a worker begins employment at task, as long
as his draw is greater than p (k). This implies that workers who enter the distribution,
h(p0 jp0 >p(k))
do so at various priors according to 1 H p(k) .
( )
First consider the set of all workers with k tasks remaining who draw the same prior,
p0 2 p (k) ; 1 . Let pt be a di¤usion process that starts at p0 2 (0; 1), evolves according
to equation (1), while at a Poisson rate > 0, it returns to p0 . Finally, let p (k) 2 (0; p0 ),
be a re‡ective boundary, such that when the process hits it, it immediately returns to p0 .
Moreover de…ne
Z
1
I p0 ; p (k) =
p(k)
p0
(p) dp;
pgp(k)
p0
where gp(k)
(p) is the steady state density of the above process. Then
Z
1
p(k)
pfk (p) dp =
Z
1
I p0 ; p (k)
h p0 jp0 > p (k)
h(p0 jp0 >p(k))
is higher when p (k), and therefore k
p(k)
For a given p0 > p (k), the weight
1 H (p(k))
1
H p (k)
dp0 :
(Corollary 4), is higher. Moreover, if I p0 ; p (k) is increasing in p (k) and p0 and therefore
k, then the left hand side is increasing in k.
I proceed to show that I p0 ; p (k) is increasing in p (k) and p0 .
I …rst want to prove that if p1 < p2 , then I p0 ; p1 < I p0 ; p2 .
De…ne
Tp = inf ft > 0 : pt = pg :
Call pit the di¤usion with boundary pi . From Athreya and Lahiri (2006), Theorem
30
14.2.10, part (i),34 it is the case that
I p0 ; p =
1
Ep0 Tp
Ep0
Z
Tp
(16)
ps ds :
0
Since
Ep0 Tp1 = Ep0 Tp2 + Ep2 Tp1 ;
and
Ep0
Z
Tp1
p1s ds
Z
= Ep0
0
Tp2
p2s ds
+ Ep2
0
Z
Tp1
p1s ds ;
0
straightforward algebra implies that it su¢ ces to prove that
1
Ep2 Tp1
Z
Ep2
Tp1
p1s ds
< I p 0 ; p2 :
(17)
0
De…ne a process Yt which starts from p2 and evolves like p1t with the only di¤erence
that whenever it reaches p1 , it is resets to p2 , rather than p0 . On the contrary, p2t starts
from p0 . Run Yt and p2t with the same Brownian motion, Wt , and the same Poisson times
of resetting. Now, when Yt and p2t meet, they continue together until p2 is hit. Then p2t
jumps to p0 , while Yt continues. Moreover, since p2t starts above Yt , it is always true that
Yt p2t (in the case of shock, p2t jumps to p0 and Yt jumps to p2 ). However for a positive
proportion of time the inequality is strict. Thus
1
lim
t!1 t
Z
0
t
1
Ys ds < lim
t!1 t
Z
t
p2s ds;
0
which using eq. (16) above, leads to eq. (17).
I also need to prove that if p30 < p40 , then I p30 ; p < I p40 ; p . Now call pit the di¤usion
starting o¤ from pi0 . As before, since
Ep40 Tp = Ep40 Tp30 + Ep30 Tp ;
and
Ep40
Z
0
Tp
p4s ds
= Ep40
Z
Tp3
0
0
34
p4s ds
+ Ep30
Z
Tp
p3s ds ;
0
Here set f (Xj ) = Xj . The theorem states the proof in the case where Xj is a discrete time process.
To prove it in the continuous time I follow the proof of the discrete case. The sums Yi in that proof are
replaced by integrals and the sequence of regeneration times in this case is the times where pit hits pi
(i = 1; 2). In each of these times the di¤usion pit restarts and the trajectories between these times are
i.i.d.
31
straightforward algebra implies that it su¢ ces to prove that
I p30 ; p <
R Tp30
Ep40
0
p4s ds
(18)
:
Ep40 Tp30
Using a similar argument as above, de…ne a process Zt which starts from p40 and evolves
like p4t with the only di¤erence that whenever it resets to p40 whenever it reaches p30 rather
than p. On the contrary, p3t starts from p30 and resets at p. Running Zt and p3t with the
same Brownian motion, Wt , and the same Poisson times of resetting implies as above
that35
Z
Z
1 t
1 t 3
ps ds < lim
Zs ds;
lim
t!1 t 0
t!1 t 0
which using eq. (16) above, leads to eq. (18).
Since shocks are i.i.d. across workers, by Birkho¤’s Ergodic Theorem time averages
equal to space averages. Therefore by proving that the mean of the above system is
increasing in p (k), I also prove that the mean of the cross-sectional distribution of workers
with k task remaining is also increasing in p (k).
D
Within-Firm & Economy-Wide Wage Distributions
Let z (w) denote the wage density of employed workers. Then as derived in the Online
Appendix, the economy-wide steady-state wage distribution is the following:
For w 2 w p (0) ; w (p0 )
2
z (w) =
G
B 2
B
[ w
+ (1
) rU
1
G
w
(1
2
) rU
M
X1
Ck1 (19)
k=0
w
B
+ (1
) rU
2
G
w
(1
) rU
1
M
X1
1
Ck1
k=0
p (k)
p (k)
and for w 2 [w (p0 ) ; w (1)]
z (w) =
2
G
B 2
w
B
+ (1
) rU
1
G
w
(1
) rU
2
M
X1
Ck3 ;
k=0
35
(20)
Essentially I now have p40 in the place of p0 , p30 in the place of p2 , and p in the place of p1 . In the
place of Yt I have p3t and in the place of p2t I have Zt .
32
2
1
];
where
1 2
[p0 (1
2
1 p (k) 2 1
+
p0
p (k)
Z p0
p 1 (1 p)
Ck3 = Ck1 f
1
p0 )
1
(1
2
( 1
p0 )
(
1
dp +
p(k)
1
2
2
1
f
2
(1
2
2 ) p (k)
( 1
1
+2
p (k)
p (k)
1
p (k)
+ 3p0 ) p0 (1
+ 3p0 )
g
1
p0 )
+
3p0 )]
2
Z
1
p0
2
p
(1
p)
1
dp
p(k)
Z
1
1
p
(1
p)
2
dpg 1 ;
p0
and Ck1 is determined by the solution of a M by M system is linear system of equations,
given by equation (5) of the Online Appendix.
Similarly, let zm (w) denote the wage density of workers employed in a …rm with m
tasks. As derived in the Online Appendix, the within-…rm wage distribution, for a …rm
with m tasks, is the following:
For w 2 w p (0) ; w (p0 )
2
zm (w) =
B 2
G
B
[ w
+ (1
) rU
1
G
w
(1
2
) rU
m
X1
Ckm1 (21)
k=0
B
w
+ (1
2
) rU
G
w
(1
1
) rU
m
X1
1
Ckm1
k=0
p (k)
p (k)
and for w 2 [w (p0 ) ; w (1)]
zm (w) =
2
B 2
G
B
w
+ (1
) rU
1
G
w
(1
) rU
2
m
X1
Ckm3 ;
k=0
(22)
where
Ckm3 = Ckm1 f
+
1
Z
p (k)
p (k)
1 2
[p0 (1
2
2
p(k)
p
p0
(1
1
( 1
+ 3p0 )
1
p0
1
p0 )
p)
1
(1
2
p0 )
dp +
(
1
+2
p (k)
p (k)
33
2
3p0 )]
1
Z
p0
p(k)
p
2
(1
p)
1
dp
2
1
];
1
2
2
1
f
2
(1
2
2 ) p (k)
( 1
1
1
p (k)
+ 3p0 ) p0 (1
p0 )
g
1
+
Z
1
p
1
(1
p)
2
dpg 1 ;
p0
and Ckm1 is given by the m by m linear equation system given by equations (11) and
(12) of the Online Appendix. Note that the above expressions give the solution for the
within-…rm wage distribution, for a given m. In other words, one needs to solve a di¤erent
system of linear equations for every m.
It’s worth noting that the functional form of the within-…rm wage distribution is
the same as the wage distribution of the economy, though of course the coe¢ cients are
di¤erent. This is consistent with empirical evidence from several countries in Lazear and
Shaw (2009).
Finally, as demonstrated in the Online Appendix, both the overall wage distribution,
as well as the within-…rm wage distribution are declining from w (p0 ) until the highest
possible wage, w (1), and they also feature a fat Pareto-type tail, as long as > 2 . The
latter is the same condition as in Moscarini (2005). The intuition is that if the speed of
learning ( ) if fast enough relative to the rate at which workers get replaced ( ), then
“too many” workers will …nd out that they’re in a good match and receive very high
wages. Conversely, if learning is relatively slow compared to the replacement rates, then
relatively few workers will reach those high levels.
E
Human Capital
The derivations of the two threshold functions, pnohc (k) and phc (k) can be found in the
Online Appendix. Below are the resulting expressions.
For the case of a worker who has not accumulated human capital, the threshold
nohc
p
(k) is implicitly de…ned by
lim[
p!1
+
1
1
1
1
2q
pq (1
1 q
p (1
2q k3
g (W )
1
q
q
pnohc (k)
1 q
1
p)1
p)
1
[ k1
q
pnohc (k)
pnohc (k)
q
1
!q
1
0
@
34
pnohc (k)
1
q 1
1
!q
pnohc (k)
pnohc (k)
!
q
1 p
p
q
1
pnohc (k) W
1
q 1
1 p
A
p
k2
+
+
1
q (q
1
2q
1
1
@
1)
1
p1
q
nohc
q+p
(k)
pnohc (k)
q
p)q [k1
(1
1
+q
1
1
0
@ 1
1
q (q
1)
g (W ) =
k2 =
q
p
p
G
rU
r+
1
1
2r+ +
r+ +
B
hc
rU +
r+ +
G
hc
=
pnohc (k)
1
G
k1 =
=
s
B
1
q+p
p
1
1
q 1
p
1
A]
p
q W
! q!
;
2
nohc ;
K1k ;
q
1
pnohc (k)
pnohc (k)
!1 q 1
nohc
(k)
1 p
A
nohc
p
(k)
! q
!!
q pnohc (k)
1 pnohc (k)
]]
pnohc (k)
pnohc (k)
p
1 q
q
!q
pnohc (k)
1
p
q
p
nohc
(k)
pnohc (k)
p
rU
+
r+ +
r+ +
k3 =
p
1
p
where
1
1
1
q
p
G
=
1
p)q pnohc (k)q
(1
1 1
p
2q k3
+g (W )
+k2
0
B
r+
8 (r + )
2
hc
rU
r+
B
!
+ 1;
nohc
=
1
; q=
2
G
;
r+ +
r+ +
1
;
1
2
1
2
r
hc ;
4 + k3
k3
!
;
B
=
:
Finally for the case where k = 0, then W = 0, whereas for the case where k > 0,
W = Ep0 S nohc (p0 ; k
1) ;
where
S nohc (p; k) =
1
1
+
2q
1
1
pq (1
p)1
1 q
p (1
2q k3
q
pnohc (k)
q
q 1
pnohc (k)
1 q
0
!q
nohc
1
p
(k)
1
@
p)1 q [ k1
q 1
pnohc (k)
35
1
pnohc (k) W
1
1
p
p
q 1
1
A
g (W )
+
+
1
1
q
0
pnohc (k)
pnohc (k)
q + pnohc (k)
1
@
k2
q (q 1)
pnohc (k)
1
1
2q
1
1
p1
q
1
+q
1
q (q
0
@ 1
1
1
1
p) [k1
q
1
p
1)
pnohc (k)
pnohc (k)
q
1 q
p
q
p
1
q
!
1
q+p
p
pnohc (k)
1
1
p
q 1
1
A]
p
q W
! q!
pnohc (k)
pnohc (k)
!1 q 1
1 pnohc (k)
A
pnohc (k)
! q
!!
1 pnohc (k)
1 pnohc (k)
q nohc
]:
pnohc (k)
p
(k)
p
p
q
!q
pnohc (k)
1
1
p
q
p
p
1
q
(1
1
1
p)q pnohc (k)q
(1
1 1
p
2q k3
+g (W )
+k2
q
!q
p
p
1
For the case of a worker who has accumulated human capital, we have
phc (0) =
(
hc
(
1) (rU
1) rU
B) + (
hc
B
hc
+ 1) (
G
rU )
;
and
( hc 1) (r + ) Ep0 V hc (p0 ; k 1) (r +
r )U
hc
2 (r + ) Ep0 V (p0 ; k 1) + 2 (r +
r ) U + hc ( G
phc (k) =
where:
S hc (p0 ; 0) =
S hc (p0 ; 0) =
(p0 )
r+
1 1
2 2
G
K10 =
=
r+
rU
G
B
rU
1
+ K10 p02
1
2 hc
1
1
1
2
1
2 hc
p0 ) 2 + 2
(1
hc
B)
G
+
B
+
;
;
r+
1
hc
B
r+
G
K1k
(p0 )
r+
B
B
1
2
1
2
1
2
1
2
1
1
phc (0) 2 + 2
phc (0)
hc
1
hc
1
hc
p (k)
36
phc (0)
1
1
1
1
phc (0) 2 + 2
phc (0)
hc
1
phc (k) 2 + 2
1
p02
1
2 hc
hc
1
phc (0)
hc
1
phc (k)
1
1
2
1
2 hc
1
2
1
2 hc
1
p0 ) 2 + 2
(1
;
:
hc
;
F
Calibration Details
The model is calibrated in two stages. First, some parameters are calibrated independently
(r; ; and b). Second, I pick a combination of mL and mS and calibrate the remaining
six parameters (p0 ; ; ; q L ; G and B ) by targeting the six moments described in the
main text.
In order to simulate the model, I …rst need to compute the worker’s optimal decision
rules. In particular, the triggers, equations (10) and (11), are a function of the value of
an unemployed worker, equation (3), which in turn depends on the value of an employed
worker, equation (2). I therefore use a …xed point in order to compute the value of
unemployment and the equilibrium triggers.
I simulate a discrete-time version of the model presented above, by discretizing the
model presented in Section 3. I use the Central Limit Theorem to approximate the Wiener
process of equation (3) by the sum of repeated Bernoulli trials. Similarly the exogenous
destruction shock, , is approximated by a Poisson distribution. I exploit the ergodicity
of the setup and simulate a single worker for 4 million periods each time.
The weighting matrix used to match the six moments is the inverse of the empirical
variance-covariance matrix of these moments. This matrix is obtained by bootstrapping
the data 10,000 times.
References
[1] Abowd, J. M., F. Kramarz and D. N. Margolis (1999): “High Wage Workers and
High Wage Firms,”Econometrica, 67(2), 251-333.
[2] Acemoglu D. and J.-F. Pischke (1998): “Why Do Firms Train? Theory and Evidence,”Quarterly Journal of Economics, 113(1), 79-119.
[3] Alvarez F. and R. Shimer (2009): “Unemployment and Human Capital,” Mimeo,
University of Chicago, Department of Economics.
[4] Alvarez F. and R. Shimer (2011): “Search and Rest Unemployment,”Econometrica,
79(1), 75-122.
[5] Antonovics K. and L. Golan (2012): “Experimentation and Job Choice,”Journal of
Labor Economics, 30(2), 333-366.
37
[6] Athreya, K. B. and S. N. Lahiri (2006): Measure Theory and Probability Theory,
New York: Springer.
[7] Brown C. and J. Medo¤ (1989): “The Employer Size-Wage E¤ect,” Journal of Political Economy, 97(5), 1027-1059.
[8] Burdett K. and D. T. Mortensen (1998): “Wage Di¤erentials, Employer Size and
Unemployment,”International Economic Review, 39(2), 257-273.
[9] Carrillo-Tudela, C. and L. Visschers (2014): “Unemployment and Endogenous Reallocation over the Business Cycle,” mimeo, University of Essex, Department of Economics.
[10] Eeckhout, J. and X. Weng (2010): “Assortative Learning,” Mimeo, University of
Pennsylvania, Department of Economics.
[11] Elsby, M, R. Michaels and G. Salon (2009): “The Ins and Outs of Cyclical Unemployment,”American Economic Journal: Macroeconomics, 1(1), 84-110.
[12] Frederiksen, A. and T. Kato (2014): “Human Capital and Career Success: Evidence
from Linked Employer-Employee Data,”Mimeo, Aarhus University.
[13] Gervais, M., N. Jaimovich, H. E. Siu and Y. Yedid-Levi (2014): “What Should I Be
When I Grow Up? Occupations and Unemployment over the Life Cycle,” Mimeo,
University of British Columbia, Department of Economics.
[14] Groes, F., P. Kircher and I. Manovskii (2014): “The U-Shapes of Occupational Mobility,”Review of Economic Studies, forthcoming.
[15] Hill, D. H. (1994): “The Relative Empirical Validity of Dependent and Independent
Data Collection in a Panel Survey,”Journal of O¢ cial Statistics, 10(4), 359-380.
[16] Hsieh, C.-T., E. Hurst, C. I. Jones and P. J. Klenow (2013): “The Allocation of
Talent and U.S. Economic Growth,”Mimeo, University of Chicago, Booth School of
Business.
[17] Idson, T. L. (1993): “Employer Size and Labor Turnover,”Discussion Paper No. 673,
Columbia University, Department of Economics.
[18] Idson, T. L. and D. J. Feaster (1990): “A Selectivity Model of Employer-Size Wage
Di¤erentials,”Journal of Labor Economics, 8(1), Part 1, 99-122.
38
[19] Jovanovic, B. (1979): “Job Matching and the Theory of Turnover,” Journal of Political Economy, 87(5) Part 1, 972-990.
[20] Jovanovic, B. and Y. Nyarko (1997): “Stepping-Stone Mobility,”Carnegie-Rochester
Conference Series on Public Policy, 46, 289-325.
[21] Kahn, L. B. and F. Lange (2014): “Employer Learning, Productivity and the Earnings Distribution: Evidence from Performance Measures,”Review of Economic Studies, 81(4), 1575-1613.
[22] Kambourov, G. and I. Manovskii (2008): “Rising Occupational and Industry Mobility
in the United States: 1968-1997,”International Economic Review, 49(1), 41-79.
[23] Kambourov, G. and I. Manovskii (2009): “Occupational Mobility and Wage Inequality,”Review of Economic Studies, 76(2), 731-759.
[24] Karlin, S. and H. M. Taylor (1981): A Second Course in Stochastic Processes, New
York: Academic Press.
[25] Kramarz, F., Postel-Vinay, F. and J.-M. Robin (2014): “Occupational Mobility and Wage Dynamics Within and Between Firms,” Mimeo, University
College of London, Department of Economics, https://editorialexpress.com/cgibin/conference/download.cgi?db_name=SED2014&paper_id=168
[26] Lazear, E. P. (2004): “Balanced Skills and Entrepreneurship,” American Economic
Review, 94(2), 208-211.
[27] Lazear, E. P. and K. L. Shaw (2009): “Wage Structure, Wages, and Mobility,”in E. P.
Lazear and K. L. Shaw (ed.), The Structure of Wages: An International Perspective,
University of Chicago, National Bureau of Economic Research.
[28] Li, F. and C. Tian (2013): “Directed Search and Job Rotation,”Journal of Economic
Theory, 148(3), 1268-1281.
[29] Liptser, R. and A. Shyryaev (1977): Statistics of Random Processes, Vol. 2 Berlin:
Springer-Verlag.
[30] Mortensen, D. T. (1978): “Speci…c Capital and Labor Turnover,” The Bell Journal
of Economics, 9(2), 572-586.
39
[31] Mortensen, D. T. and C. A. Pissarides (1994): “Job Creation and Job Destruction
in the Theory of Unemployment,”The Review of Economic Studies, 61(3), 397-415.
[32] Moscarini, G. (2005): “Job Matching and the Wage Distribution,” Econometrica,
73(2), 481-516.
[33] Moscarini G. and K. Thomsson (2007): “Occupational and Job Mobility in the US,”
Scandinavian Journal of Economics, 109(4), 807-836.
[34] Neal, D. (1999): “The Complexity of Job Mobility among Young Men,” Journal of
Labor Economics, 17(2), 237-261.
[35] Novos, I. E. (1992): “Learning by doing, Adverse Selection and Firm Structure,”
Journal of Economic Behavior and Organization, 19(1), 17-39.
[36] Papageorgiou, T. (2014): “Learning Your Comparative Advantages,”Review of Economic Studies, 81(3), 1263-1295.
[37] Pastorino, E. (2014): “Careers in Firms: Estimating a Model of Job Assignment,
Learning and Human Capital Acquisition,” Mimeo, University of Minnesota, Department of Economics.
[38] Pastorino, E. (2015): “Job Matching Within and Across Firms,”International Economic Review, forthcoming.
[39] Pissarides, C. A. (2000): Equilibrium Unemployment Theory, 2nd edition, Cambridge
MA: MIT Press.
[40] Postel-Vinay, F. and J.-M. Robin (2002): “Equilibrium Wage Dispersion with Worker
and Employer Heterogeneity,”Econometrica, 70(6), 2295-2350.
[41] Schmidt, C. M. and K. F. Zimmermann (1991): “Work Characteristics, Firm Size
and Wages,”The Review of Economics and Statistics, 73(4), 705-710.
[42] Shimer, R. (2005): “The Cyclical Behavior of Equilibrium Unemployment and Vacancies,”American Economic Review, 95(1), 25-49.
[43] Shimer, R. (2006): “On-the-Job Search and Strategic Bargaining,” European Economic Review, 50(4), 811-830.
[44] Shimer, R. (2012): “Reassessing the Ins and Outs of Unemployment,” Review of
Economic Dynamics, 15(2), 127-148.
40