2015-11 Rasmus Landersø PhD Thesis Essays in the Economics of Crime DEPARTMENT OF ECONOMICS AND BUSINESS AARHUS UNIVERSITY DENMARK Essays in the Economics of Crime By Rasmus Landersø A PhD thesis submitted to School of Business and Social Sciences, Aarhus University, in partial fulfilment of the requirements of the PhD degree in Economics and Business May 2015 Contents I Does Incarceration Length Affect Labor Market Outcomes? 1 Introduction 1 3 2 Background 2.1 The Reform of the Penal Code . . . . . . . . . . . . . . . . . . . . . . . . . . 2.2 Imprisonment in Denmark . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 10 14 3 Data 16 4 Econometric Framework 21 5 Results 5.1 Macroeconomic Trends . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5.2 Mechanisms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24 29 33 6 Conclusion 36 References 38 A Supplementary Results 43 B Data Appendix 48 II 53 School Starting Age and the Crime-Age Profile 1 Introduction 55 2 Institutional settings and mechanisms 2.1 Educational Institutions and School Starting Age . . . . . . . . . . . . . . . 2.2 Institutions Guarding Juvenile Crime . . . . . . . . . . . . . . . . . . . . . . 58 58 60 3 Methodology 62 4 Data 64 5 Results 5.1 Timing of Birth Within the Calendar Year and School Starting Age 5.2 Crime Results: 2SLS . . . . . . . . . . . . . . . . . . . . . . . . . . 5.3 Heterogeneity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5.4 Potential Mechanisms and Effects on Alternative Outcomes . . . . . 6 Conclusion . . . . . . . . . . . . . . . . . . . . 69 69 70 78 79 83 References 84 A Supplementary Results 88 III The Effects of Admissions to Psychiatric Hospitals 105 1 Introduction 107 2 Background 109 2.1 Institutional framework - mental health care in Denmark . . . . . . . . . . . 111 3 Data 112 4 Econometric Framework 121 5 Results 5.1 2SLS Results . . . . . . . . . . . . . . . 5.2 Gender and age differences . . . . . . . . 5.3 Marginal Treatment Effects . . . . . . . 5.4 Effect of Admittance on Spouses’ Labour 124 125 134 138 144 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Market Outcomes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 Conclusion 145 References 147 A Supplementary Results 152 B Data Appendix 163 Preface Foremost, I wish to thank both of my two supervisors, Marianne Simonsen and Helena Skyt Nielsen, who have guided me safely towards the point where I now can hand in my thesis. I appreciate that you have always been forthcoming while treating me as your peer, and I have gained significantly from being allowed to work and learn independently througout our joint project. I also appreciate that you have been very considerate towards our ‘long distance relationship’. I also need to thank a further number of individuals who have helped me in different ways throughout my PhD. First, I would like to thank James Heckman for hosting my visit to the the University of Chicago. I have learned tremendously from this visit and from our project. Second, I owe thanks to Greg Veramendi, for being an expert tutor in latent factor models and for mediating my contact with the University of Chicago. Third, I wish to thank Christian Dustmann for his guidance and tutoring throughout our joint work. Almost eight years ago, I attended a TA session in Introductory Macroeconomics at 8 in the morning - we were only three students present. After the class, the TA showed a jobposting for a research assistant at the Rockwool Foundation Research Unit (RFRU) with deadline that very day. I went home, wrote an application, and have been working there ever since. I have not always known that I would write a PhD in Economics, and the truth is that I am not certain I would be where I am now, had it not been for that particular TA session. Working for RFRU and research director Torben Tranæs has showed me the crossroad between empirical research, microeconomics, and policy relevance, which quickly turned out to be exactly where my research interests lie. After completing my master’s degree, Torben Tranæs offered to fund my PhD. I am very grateful for the years of support RFRU has given me, while provinding me with the freedom and funding to follow my research interests (whether these took me to Aarhus or Chicago). I During all of the years at RFRU, I have been so fortunate to work alongside Peter, Peer, and Lars who soon became my good friends. Peter and I have shared office and have supported each other through the ups and downs that academic (and non-academic) life brings through those years. Peer and Lars have both taught me very valuable lessons; Peer has been my everyday tutor into protestant work-ethics and economics, while Lars has shown that criminology, fatherhood, and pub-crawling does intersect somewhere. Selma, your love and ever-steady support has helped me through these past years’ endevours. You have listened patiently to my ongoing (and seemingly endless) monolugues on the topics of this thesis, and you joined me on my academic travels around the world. I treasure the life we live together. Finally, however much I may be thrilled by the prospect of completing my PhD, this will not be the most significant event in my life during the past three years. Josephine - my daughter - does not know what PhD and Economics mean. Yet, as her presense makes all of the long and hard working days seem infitesimal, I cannot list all the important people in the making of this thesis without mentioning her. May 20th, 2015 Copenhagen Rasmus Landersø II Summary This PhD dissertation consists of three self-contained papers. The papers are not meant to constitute an entity even though they share a lot of common ground in topics, data souces, methodology, and outcomes of interest. Furthermore, the three essays all focus on aspects of at-risk behavior and on providing a better understanding of the outcomes experienced by marginalized individuals in soceity. In the first paper, titled Does Incarceration Length Affect Labor Market Outcomes?, I investigate the effects of time spent in jail on subsequent unemployment rates, dependency on other public transfers, and earnings. The previous literature on the effects of imprisonment focuses on the effect of imprisonment at the extensive margin and not at the intensive margin: the lengths of incarceration. At the intensive margin, the effect of incarceration length will depend on the marginal costs and benefits of time spent in jail and not the costs and benefits of the jail sentence in total. I use a DD and DDD approach to study the effects of Danish reform in 2002 for offenders who are convicted of simple violence, which constitute around 20% of all imprisonment-sentences in Denmark. The reform mainly affected offenders who serve incarceration spells of one or two months, which are modal incarceration spells in Denmark. I find that the increase in incarceration length resulted in lower rates of unemployment, unchanged dependency on other public transfers, and higher earnings. Using survey data, I show that participation in rehabilitation increase by time spent in jail, especially during the first months. Thus a likely mechanism is through increased participation rates for which an offender’s pre-reform sentence would otherwise have been too short, leaving the offender solely with the possible stigma, job-loss, and general alienation from the labor market which incarceration might involve. Finally, I show that the reform mainly affected young men with relatively short criminal histories. As younger individuals may be more malleable than older offenders, young offenders may benefit significantly more III from rehabilitating programs or change their perception of the payoff to crime to a larger degree compared to their older peers with more extensive criminal histories. The second paper, titled School Starting Age and the Crime-Age Profile (joint with H. S. Nielsen and M. Simonsen) investigates long-term effects of school starting age (SSA) while providing novel insights into the determinants of life-cycle criminal behavior. Parents may manipulate their childs school starting age; in our sample of children born in Denmark in the period from 1981-1993 around 20-25% of boys and 10-13% of girls start school one year later than the law dictated. In order to obviate this non-random selection in SSA, we exploit that Danish children typically start first grade in the calendar year they turn seven, which gives rise to a fuzzy regression discontinuity design that shifts SSA by one year around New Year. First, we present causal evidence of the underlying nature of onset and persistence of criminal behavior across an individuals life-course. We find that the onset of crime can be modified by life-course and is not only determined by age per se while the continuation of criminal behavior is affected by both age, life circumstances, and criminal opportunity. Finally, we find that incapacitation seems to play an important role and that the effects of SSA vary across different parental characteristics as education and labor market attachment. In the third paper The Effects of Admissions to Psychiatric Hospitals (joint with P. Fallesen) we examine the effects of admitting a patient as an inpatient upon first contact with a psychiatric hospital on the patient’s subsequent contacts and admissions to psychiatric hospitals, self-harm/suicide attempts, labor market outcomes, criminal behavior, and on his spouse’s labor market attachment. We address the fundamental differences between the counterfactual outcomes of individuals who are admitted and those who are not by using the intensity of patient contacts to a hospital the weeks before an individual’s first contact (a proxy for a given hospital’s occupancy rate) as an IV. We find that inpatient care has ambiguous effects. In the short run, inpatient care addresses the patients’ immediate needs, to the benefit of both the patients and potential victims of crime. Being admitted lowers IV self-inflicted lesions/cuts and leads to large reductions in crime shortly after the admission. We show that the crime reduction is driven by incapacitation during the period of utmost mental distress. In the longer run people admitted into inpatient care experience a higher degree of institutionalization which also leaves them with poorer long-term labor market outcomes. We also identify large heterogeneity across observable and unobservable characteristics. Males experience the largest reductions to crime whereas females experience the largest increase in re-admissions and reductions to labor market attachment. By estimating Marginal Treatment Effects we show that patients with the most severe disorders experience significant reductions to overdoses of drugs/alcohol and the largest reductions to crime, whereas patients with the least severe disorders are institutionalized to a larger extend. Finally, we identify additional sources of positive externalities of hospital admissions, as we find that it increases the employment rates of patients’ spouses. V Resumé (Danish summary) Denne ph.d. afhandling består af tre separate kapitler. Kapitlerne er ikke tænkt eller skrevet som en enkelt enhed eller sammenhængende tekst, men de har en lang række fællestræk i form af emner, datakilder, metoder og relevante resultater. Hvad mere er, så fokuserer de tre artikler alle på aspekter af risikoadfærd og forsøger at give en bedre forståelse af de livsforløb, som personer, der befinder sig i sammenfundets margin, oplever. I det første kapitel, Does Incarceration Length Affect Labor Market Outcomes? undersøger jeg, hvordan indespærringslængde påvirker kriminelles efterfølgende indtægt, afhængighed af overførselsindkomster og arbejdsløshedsrater. Den eksisterende litteratur om effekterne af fængsling fokuserer primært på den ekstensive margin og ikke på den intensive margin: længden af fængslingen. På den intensive margin vil effekten af indespærringslængde afhænge af de marginale omkostninger og fordele ved den specifikke tid i fængsel, ikke omkostningerne og fordelene ved fængselsdommen som en helhed. Jeg anvender en difference-in-differences (DD) og triple-differences (DDD) tilgang til at undersøge effekterne af en dansk reform fra 2002 for kriminelle, som bliver dømt for simpel vold (Straffelovens §244), hvilket udgør omkring 20% af alle fængselsdomme i Danmark. Reformen påvirkede primært kriminelle, som aftjente domme på 1 eller 2 måneder, hvilket er de hyppigst anvendte domslængder i Danmark. Jeg finder, at den længere tid i fængsel resulterer i lavere arbejdsløshedsrater, samme afhængighed af overførselsindkomster og højere gennemsnitsindtægt. Ved hjælp af data fra en spørgeskemaundersøgelse blandt indsatte i danske fængsler viser jeg, at deltagelse i rehabilitering stiger med tiden brugt i fængsel, specielt i de første måneder. Derfor er øget deltagelse i rehabilitering en sandsynlig mekanisme bag resultaterne. Indespærringslængderne før reformen har sandsynligvis været så korte, at rehabilitering har været umulig, hvorfor de indsatte udelukkende er blevet efterladt med de mulige stigma, tab af job og generel fremmedgørelse fra arbejdsmarkedet, som fængsling kan medføre. Endelig viser jeg, at reformen primært påvirkede unge mænd med en forholdsvis kort kriminel baggrund. VI Yngre individer er muligvis mere fleksible end ældre kriminelle og derfor mere modtagelige overfor rehabiliteringsprogrammer. De har muligvis også større chancer for at skifte opfattelse af værdien af kriminalitet i sammenligning med deres ældre, mere kriminelt erfarne, medindsatte. Det andet kapitel, School Starting Age and the Crime-Age Profile (skrevet sammen med H. S. Nielsen og M. Simonsen) undersøger langsigtede effekter af skolestartsalder (SSA) og bidrager med ny indsigt i sammenhængen mellem kriminalitet og alder. Forældre kan manipulere deres børns skolestartsalder; i vores sample af børn født i Danmark i perioden fra 1981-1993 startede omkring 20-25% af drenge og 10-13% af piger et år senere end loven dikterer. For at omgå denne højst selektive udvælgelse af, hvilke børn der starter senere og tidligere, udnytter vi, at danske børn ofte starter i første klasse i det kalenderår, hvor de fylder syv. Dette resulterer i et fuzzy regression discontinuity-design, der skubber SSA med et år for børn født lige omkring nytår. Herved undersøger vi sammenhænge mellem start og fortsættelse af kriminel opførsel i forhold til et individs livsforløb og alder. Vi finder, at starten på kriminel løbebane er knyttet til et individs livsforløb og ikke til alder som sådan. Til sammenligning er fortsat kriminel adfærd både påvirket af alder såvel som kriminelle muligheder og livsforløb. Vi finder endvidere, at indespærring/fasholdelse i uddannelse lader til at være en vigtig mekanisme heri. Gennem vores undersøgelse leverer vi også ny viden omkring konsekvenserne af skolestartsalder på længere sigt. Endelig finder vi, at effekterne af skolestartsalder varierer mellem forældre med forskellige uddannelser og arbejdsmarkedstilknytning. I det tredje kapitel, The Effects of Admissions to Psychiatric Hospitals (skrevet sammen med P. Fallesen) undersøger vi effekterne af at indlægge en patient på et psykiatrisk hospital ved første kontakt med psykiatrien, på patientens videre kontakt og indlæggelse på psykiatriske hospitaler, skade på sig selv/selvmordforsøg, arbejdsmarkedstilknytning, kriminalitet og på patientens ægtefælles arbejdsmarkedsforhold. Vi tager højde for de fundamentale VII forskelle mellem individer, som bliver indlagt, og dem, der ikke bliver, ved at bruge intensiteten af patientkontakter med hospitalet i ugerne op til et individs første kontakt (en indikator af et specifikt hospitals belægningsgrad) som en instrument variabel. Vi finder, at indlæggelse har et tvetydigt resultat. På kort sigt hjælper indlæggelse på patientens direkte behov, til fordel for både patienten og eventuelle ofre af kriminelle handlinger. Indlæggelse sænker selvpåførte skader såvel som reducerer kriminalitet lige efter indlæggelse. Vi viser, at reduktionen i kriminalitet er resultatet af fastholdelse/indespærring som følge af hospitalsindlægelsen. På længere sigt viser indlagte individer en højere grad af institutionalisering, som giver dem lavere arbejdsmarkedstilknytning. Vi identificerer også en stor heterogenitet på tværs af observerbare såvel som uobserverbare karakteristika. Mænd oplever den største reduktion i kriminalitet, mens kvinder oplever den største stigning i sandsynlighed for genindlæggelse og den største reduktion i arbejdsmarkedstilknytning. Ved at estimere Marginal Treatment Effects viser vi, at patienter med de mest alvorlige mentale lidelser oplever markante reduktioner i sandsynligheden for overdoser af alkohol/stoffer såvel som den største reduktion i kriminalitet. Til sammenligning bliver patienter med de mildeste lidelser institutionaliseret til en større grad. Afsluttende finder vi yderligere positive eksternaliteter af hospitalsindlæggelse, idet vi viser, at indlæggelse hæver ægtefællers beskæftigelsesrater. VIII Part I Does Incarceration Length Affect Labor Market Outcomes? 1 Does Incarceration Length Affect Labor Market Outcomes? Rasmus Landersø Abstract This paper studies how longer incarceration spells affect offenders’ labor market outcomes, by using a reform that increases incarceration lengths by approximately one month. I use detailed register data for offenders who predominantly serve incarceration spells of one to two months. I analyze the sample for several years prior to and after incarceration and show that the reform led to an exogenous increase in incarceration length. I find that the longer incarceration spells result in lower unemployment and higher earnings, possibly because marginal increases in short incarceration spells improve conditions and incentives for rehabilitation, but not the costs of jail related to these outcomes. I show that the estimates are robust to different econometric specifications and further provide evidence that my results are not driven by changes in macroeconomic conditions. Keywords: crime, incarceration length, labor market outcomes. JEL: K4 Acknowledgements: I thank Marianne Simonsen Helena Skyt Nielsen, Joseph Doyle, Anna Piil Damm, Christopher Taber, Bas van der Klaauw, Peter Sandholt Jensen, the participants of the 2011 EALE conference, the 2011 American Association of Criminology conference, Dennis Carlton, and an anonymous referee for useful comments and suggestions. I also thank colleagues Signe Hald Andersen, Peer Skov, and Lars Højsgaard Andersen for helpful discussions and Linda Kjær Minke for providing data from her study of Danish jails. The paper is forthcoming in the Journal of Law and Economics. 2 1 Introduction As prison populations have risen during the past decades both in the U.S. and elsewhere (OECD (2010)), so have the benefits from successful rehabilitation of former inmates. Also, reintegration in the labor market after release from jail is profitable for the offender as well as society in general, e.g., because it lowers the risk of recidivism (e.g., Fougère et al. (2009); Witte & Tauchen (1994)). This paper studies the effect of the length of an incarceration spell on three subsequent labor market outcomes: unemployment rates, dependency on other public transfers, and earnings. The previous literature on the effect of imprisonment on subsequent labor market outcomes mainly focuses on the effect of serving a jail sentence relative to not serving a jail sentence. Yet, policy-makers and judges do not only face choices at the extensive margin on whether to convict offenders to imprisonment or not. They also face choices at the intensive margin, as determining the lengths of incarceration. At the intensive margin, the effect of incarceration length, which is the focus of this paper, will depend on the marginal costs and benefits of time spent in jail and not the costs and benefits of the jail sentence in total. The majority of the earlier studies struggle with the endogenous relationship between crime and labor market outcomes, as offenders with different incarceration lengths differ on a number of observable and unobservable characteristics. One exception is Kling (2006) who estimates the average effects of incarceration length on employment and earnings for a wide range of offenders by using an instrumental variable of randomly assigned judges. I obtain causal inference by examining the effects of a reform that increased violent offenders’ incarceration lengths by roughly one month, independent of individual offender characteristics. The effects of incarceration lengths may depend on whether one investigates the effects of changes in shorter or longer incarceration spells. I.e., the conclusions could rely heavily on the range of incarceration spells one examined, and estimations including offenders who 3 serve short and long incarceration spells may offset opposing effects. This study extends Kling (2006) by focussing on offenders who mainly serve incarceration spells of one or two months, which are modal incarceration spells,1 instead of compiling different offender groups. I use highly detailed Danish register data to construct a panel of the 1,748 individuals who were sentenced to jail for crimes subject to the reform, and analyze the sample from several years before incarceration until three years after release. The sample includes around 20 percent of all men who served time in Danish jails during the period of time in question. I define the control and treatment group by the date of crime relative to the timing of the reform. Those who committed crimes prior to the reform are the control group and those who committed crimes after the reform are the treatment group. Figure I.1 visualizes the main empirical findings. It shows the averages of the three outcomes for the treatment and control group, from 48 months prior to incarceration until 36 months following release. I denote the start of the incarceration as time 0, the month prior to this time −1, and similarly, I denote the first month following release time 1, the subsequent month time 2, and so forth. All points in time from incarceration initiation to the date of release have been deleted for all individuals in order to create a coherent time-line. Figure I.1 shows that unemployment, dependency on other public transfers, and earnings did not differ systematically between the treatment and control group prior to incarceration, neither in trends nor in levels.2 Moreover, the figure shows that the two groups’ unemployment rates and earnings diverge after release from jail. The treatment group has lower average unemployment rates and a higher level of earnings, whereas there are no differences in dependency on other public transfers. When estimating the effects of the reform, using a 1 The sentences that this paper investigates are comparable to most county jail sentences. The U.S. Bureau of Justice Statistics estimates that around 1/3 of all U.S. inmates serve time in county jails. 2 The time paths of the unemployment rates and earnings display a spike/dip prior to incarceration. The spikes/dips could indicate the initiation of a criminal trajectory, while they also display great resemblance to Ashenfelter’s dip (Ashenfelter (1978)). As noted with regard to effect evaluation in labor economics, the spikes/dips here could imply self-selection into incarceration. However, as there is no difference between the treatment and control group’s spikes/dips they do not affect my results. 4 differences-in-differences model and a triple-difference model with property offenders as an additional control group, I confirm these results. The results are robust to differential trends by pre-incarceration characteristics and I show that the estimated effects are not caused by business cycle changes across the timing of the reform. By estimating the average characteristics of those who experienced the largest increase in incarceration length as a result of the reform, I also find that offenders who are younger and have shorter criminal records than the full sample are most likely to drive the results. The findings are in line with the results from Kling (2006) and highlight that offenders who serve short incarceration spells may incur the costs of going to jail without being able to benefit from the time spent behind bars. Especially for young offenders, increases in incarceration length as induced by the reform, improve labor market outcomes with the improvement in the conditions and incentives for rehabilitation, which may take various forms. Some may benefit from being constrained to a highly structured life while others may benefit from couselling, anger-management, transition focussed aid around the before release, or increase labor force participation as the payoff to crime decreases. The remainder of the paper is organized as follows: Section 2 provides the background by introducing the link between incarceration length and subsequent labor market outcomes and reviewing the previous literature and findings. Section 3 introduces the data and the sample. Section 4 introduces the econometric framework and Section 5 presents the results and specification tests. Section 6 concludes. 5 Figure I.1: Average Earnings, Rates of Unemployment, and Dependency on Other Public Transfers for the Control and Treatment Groups 1800 1 1600 0,9 0,8 1400 0,7 1200 0,6 0,5 Rate US $, 2005 1000 800 0,4 600 0,3 400 0,2 200 0,1 0 0 -60 -48 -36 -24 -12 0 12 24 36 48 Months Earnings, control Unemployment, control Other public transfers, control Earnings, treatment Unemployment, treatment Other public transfers, treatment Note: Figure shows monthly earnings, rates of unemployment, and dependency of other public transfers before and after incarceration (time 0) for the control and treatment (pre- and post reform) groups. For earnings there were no significant differences the last 48 months leading up to the incarceration spell, except in one single month (at time -8), for rates of unemployment there were no significant differences between the two groups the last 48 months prior to incarceration, and for dependency of other public transfers there were no significant differences the last 30 months prior to incarceration. 6 2 Background An individual’s labor market outcomes are determined by a wide range of individual skills and characteristics such as level of human capital, work-experience, level of education (e.g., Ben-Porath (1967); Mincer (1974)), social capital (e.g., Granovetter (1995)),3 search behavior (e.g., Holzer (1988)), and non-cognitive skills (Almlund et al. (2011); Cunha & Heckman (2008)). On the one hand, these skills may affect the propensity to commit crime and the type of crime, (cf. Cunha et al. (2010); Lochner (2004); Machin et al. (2011); Moretti & Lochner (2004)) consequently affecting how much time any given offender serves in jail. On the other hand, the skills can also change as a result of a jail sentence and the subsequent time spent behind bars; some skills may erode while other skills may increase and provide society with benefits in relation to an offender’s labor market outcomes after release from jail. Moreover, experiencing incarceration may also deter offenders from recidivism (Abrams (2012); Owens (2009)) and increase their motivation to find employment. The individual costs of jail can manifest themselves as stigma, job-loss, general informal sanctions from society, or depreciation of human capital as offenders lose skills and productivity in general or miss out on potential work-experience (e.g., Waldfogel (1994); Western et al. (2001)). Incarceration spells may also depreciate social capital by eroding personal connections which match workers to employers or provide information about possible job opportunities (Sampson & Laub (1995)). Individual costs of incarceration, in terms of labor market outcomes, may be evaluated at the extensive margin and at the intensive margin. At the extensive margin, costs include the penalty from receiving a jail sentence and consequently setting foot in jail, even for just one second, while at the intensive margin the costs include the effects of marginal increases in time spent in jail. 3 The economic literature on social capital is more sparse than that on human capital. For an introduction e.g., Glaeser et al. (2002). 7 The benefits may be increased possibilities for participation in rehabilitating programs or assistance in job search (a proposition put forward as early as the 19th century by criminologist Arnould Bonneville De Marsangy), academic or vocational training, treatment for substance abuse (e.g., Kling (2006)), and deterrence in the form of more realistic evaluations of the pay-off from crime (Freeman (1996); Sah (1991); Wilson & Abrahamse (1992)). Once an offender has been sentenced to jail, the costs at the extensive margin are sunk. It then follows that the relationship between marginal costs and marginal benefits composes the maximization problem in relation to labor market outcomes after release from jail. Where marginal benefits exceed marginal costs the offender would increase his subsequent labor market affiliation by being imprisoned for a longer period of time. In other words, disregarding deterrence and incapacitation effects of jail sentences, increasing incarceration lengths could increase society’s net benefits from an offender’s jail sentence even though the total costs of incarceration exceed the total benefits. A vast body of literature investigates the effects of incarceration at the extensive margin (for notable examples see Aizer & Doyle (2011); Freeman (1992); Grogger (1995); Nagin & Waldfogel (1995, 1998); Waldfogel (1994); Western et al. (2001)). Most of the studies find that total individual costs of going to jail or prison are substantially larger than total individual benefits, and the literature generally ascribes this finding to stigma from the incarceration, as employers who face imperfect information may use criminal records as signals, revealing otherwise unobserved characteristics. However, these findings do not necessarily imply that marginal costs exceed marginal benefits, but only that the costs at the extensive margin are large. Literature on the effects of incarceration length at the intensive margin is much sparser than the literature on the effects of incarceration at the extensive margin. Lott (1992a) estimates a first-difference model on a sample of convicted drug-offenders and finds no significant association between sentencing length and the difference in earnings before and after 8 jail. In contrast, Lott (1992b) finds a significant monetary penalty to offenders convicted of larceny or theft by as much as a 32 percent reduction in earnings for an additional month in jail while his corresponding estimates for offenders convicted of embezzlement or fraud are insignificant. Needles (1996) uses a quasi-experiment of randomly assigned “Transitional Aids”4 to newly-released prisoners convicted of various types of crime. When examining the marginal changes in incarceration length while controlling for selection into employment, she finds no significant effect on earnings. Kling (2006) uses an instrumental variable of randomly assigned judges as exogenous variation in time incarcerated on a sample of various types of offenders convicted by the federal judicial system in California.5 He finds no significant effects from incarceration length on neither future employment nor on earnings nine years after the beginning of the incarceration spell. The study further finds relatively small positive but significant short-term effects from incarceration length on future employment and earnings (12 and 30 months after release respectively), using data from the Florida state prison system together with the Californian sample (Kling does, however, stress that the lack of exogenous variation in the latter model may bias the estimates). In addition, Kling (2006) suggests that the results arise because longer incarceration spells might enhance the possibility of the individual inmates receiving assistance that increases employability once released, such as treatment for substance abuse. No matter whether one studies the effects at the extensive or intensive margin, the endogenous nature of crime and labor market outcomes must be addressed. Different incarceration lengths may correlate with unobserved individual characteristics that also affect labor market outcomes, which could obscure the results and the causal reading of the estimates. Frameworks such as first difference or fixed effects estimations may obviate time-invariant unobserved individual characteristics that affect both employability and proneness to crime, but these frameworks do not eliminate any probable relationships between unobserved time4 A programme designed to help newly-released prisoners rehabilitate, see Needles (1996). Inmates of federal prisons must have committed a crime defined as ”being within federal jurisdiction”. Kling mentions ”interstate postal fraud and some drug cases” as examples of such crimes. 5 9 varying components and labor market conditions; for example if layoffs, pay-cuts, etc. prior to the incarceration spark crime. In such cases, previous levels of labor market outcomes or shocks will correlate with the length of the subsequent incarceration spell, which means that simply eliminating the individual fixed effects is insufficient. One solution to this problem of endogeneity is to implement an instrumental variable, as Kling (2006) does to estimate the average effects of incarceration length on subsequent earnings and employment for a wide range of offenders. However, estimates of the effects of incarceration length on subsequent labor market outcomes may differ across various incarceration lengths and consequently heterogeneous and non-linear effects may offset when averaging treatment effects in samples of pooled offender types who serve jail sentences of very different lengths. I obtain causal inference by using a reform that increases incarceration lengths exogenously while I avoid issues concerning non-linearities by estimating the effect of incarceration length at a more narrow range of incarceration lengths. 2.1 The Reform of the Penal Code On the 31st of May 2002 the Danish government and parliament made a change to the penal code concerning violent crime. It was done by formally issuing an decree to judges saying that they should increase incarceration lengths by approximately 1/3.6 Further, the maximum sentences was changed accordingly so the reform would not be ineffective to offenders with sentences close to the pre-reform maximum sentences. The bill was immediately put into effect and the aim was explained as follows: The government finds that the previous level of sanctions in cases of crime harming others does not adequately reflect the victims’ suffering. Hence, the government wishes to increase the sanctions for such crimes. [Secretary of Justice Lene Espersen (2002), own translation] 6 Though an internal review (Ministry of Justice, Research Unit (2007)) concluded that the reform did not increase incarceration lengths by 1/3 but only around 1/6. 10 Thus, the reform targeted violent offenders in general and increased the sanctions for a given violent crime. Figure I.2 shows the distributions of incarceration lengths (actual time spent in jail) in this paper’s sample7 divided by whether the crime was committed prior to or after the reform.8 Those who committed crime prior to the reform are the control group and those who committed crime after the reform are the treatment group. Figure I.2: Pre- and Post-Reform Distributions of Incarceration Length Note: Figure shows densities of incarceration length measured in days for the control and treatment (preand post reform) groups. P-value for Kolmogorov-Smirnov test for equality of distributions < 0.001. P-value for a test of differences in means < 0.01. Both distributions are heavily concentrated in the interval 20 to 70 days, with peaks at 30 and 60 days corresponding to sentences of one and two months of incarceration, respectively. 7 I will introduce the sample formally in section 3. The figure has been censored at 150 days. The censored data corresponds to 5.26 percent of the sample prior to the reform and 6.30 after. 8 11 Moreover, the post-reform distribution in general shows fewer sentences in the lower part of the sentence range and a larger number of sentences at the high end of the range. There seems to be no noteworthy difference between the tails of the two distributions which might call in question whether the reform increased the incarceration length for all offenders. However, as a consequence of the few observations on offenders receiving sentences above 70 days any non-extreme changes would be undetectable in practice. Also, figure I.2 shows that much of the increase in average length arose because of increases in incarceration length from one to two months, for a proportion of the sample, i.e., a substantial proportion of offenders served two months instead of one month as a result of the reform. The sample consists of individuals sentenced to jail for ordinary violence (general fights, bar brawl, minor assaults, etc.).9 The upper panel of figure I.3 shows the fraction that violent crimes in general, ordinary violence, and the sample constituted relative to all crimes10 committed by men aged 18 to 45 between December 2000 and November 2003 which resulted in convictions to imprisonment. The vertical solid line marks the timing of the reform. In analogy, the lower panel of figure I.3 shows the fraction of all convictions for violent crimes and ordinary violence committed by men aged 18 to 45 between December 2000 and November 2003 which resulted in either a suspended sentence or a conviction to imprisonment. Figure I.1 in the Appendix shows trends in crime convictions by crime type from 1995 until 2009. The upper panel of figure I.3 shows no change in the proportions of violent crimes or ordinary violence relative to the aggregated crimes across the timing of the reform. Violent crimes accounted for approximately half of aggregated crime and ordinary violence was the most common type of violence as it constituted 50-60 percent of all convictions due to violent 9 For a legal definition of the term see https://www.retsinformation.dk/Forms/R0710.aspx?id=126465. In everyday terms ordinary violence comprises assaults and fights from which the victim does not suffer major permanent injuries. 10 In the following paragraph I only consider individuals sentenced as results of violations of the Penal Code. This includes property crimes, violent crimes, and drug related crimes, but not traffic offences, environmental offences, or military crimes. 12 Figure I.3: Composition of Crimes from Dec. 2000 to Nov. 2003 for Men Aged 18-45 &ƌĂĐƚŝŽŶŽĨĂůůĐŽŶǀŝĐƚŝŽŶƐƚŽŝŵƉƌŝƐŽŶŵĞŶƚ ϭ Ϭ͕ϵ Ϭ͕ϴ Ϭ͕ϳ Ϭ͕ϲ Ϭ͕ϱ Ϭ͕ϰ Ϭ͕ϯ Ϭ͕Ϯ Ϭ͕ϭ Ϭ ĚĞĐͲϬϬ ũƵŶͲϬϭ ĚĞĐͲϬϭ ũƵŶͲϬϮ ĚĞĐͲϬϮ ũƵŶͲϬϯ DŽŶƚŚŽĨĐƌŝŵĞ sŝŽůĞŶĐĞ KƌĚŝŶĂƌLJǀŝŽůĞŶĐĞ ^ĂŵƉůĞ &ƌĂĐƚŝŽŶŽĨĂůůĐŽŶǀŝĐƚŝŽŶƚŝŽŶƐŽĨǀŝŽůĞŶƚĐƌŝŵĞͬ ŽƌĚŝŶĂƌLJǀŝŽůĞŶĐĞ Note: Figure shows the fractions that violent crime, ordinary violence, and the sample constituted of all suspended sentences or convictions to imprisonment across the timing of the reform (marked by the vertical solid line) for males aged 18-45 at the time of crime. ϭ Ϭ͕ϵ Ϭ͕ϴ Ϭ͕ϳ Ϭ͕ϲ Ϭ͕ϱ Ϭ͕ϰ Ϭ͕ϯ Ϭ͕Ϯ Ϭ͕ϭ Ϭ ĚĞĐͲϬϬ ũƵŶͲϬϭ ĚĞĐͲϬϭ ũƵŶͲϬϮ DŽŶƚŚŽĨĐƌŝŵĞ ĚĞĐͲϬϮ ũƵŶͲϬϯ sŝŽůĞŶĐĞ͕ĐŽŶǀŝĐƚŝŽŶƐƵƐƉĞŶĚĞĚƐĞŶƚ͘ŽƌŝŵƉƌŝƐŽŶŵĞŶƚ sŝŽůĞŶĐĞ͕ĐŽŶǀŝĐƚĞĚƚŽŝŵƉƌŝƐŽŶŵĞŶƚ KƌĚŝŶĂƌLJǀŝŽůĞŶĐĞ͕ĐŽŶǀŝĐƚŝŽŶƐƵƐƉ͘ƐĞŶƚ͘ŽƌŝŵƉƌŝƐŽŶŵĞŶƚ KƌĚŝŶĂƌLJǀŝŽůĞŶĐĞ͕ĐŽŶǀŝĐƚĞĚƚŽŝŵƉƌŝƐŽŶŵĞŶƚ Note: Figure shows the fractions of all convictions that resulted in a suspended sentence or a sentence to imprisonment for violent crime, ordinary violence, and the sample across the timing of the reform (marked by the vertical solid line). 13 crimes during the period. In addition, the figure shows that this paper’s sample includes approximately 20 percent of men aged 18 to 45, who served time in jail during that particular period of time. Likewise, the lower panel of figure I.3 shows no change in the proportions of suspended sentences or convictions to imprisonment across the timing of the reform. Approximately 75-80 percent of all violent crime and ordinary violence resulted in either a conviction to imprisonment or a suspended sentence. Around 45-50 percent of all violent crime and ordinary violence resulted in a conviction to imprisonment. In conclusion, figure I.2 shows that the reform resulted in an increase in incarceration lengths at the intensive margin, while figure I.3 shows that the reform did not result in changes at the extensive margin, as it affected neither the types of crime nor to the composition of convictions. 2.2 Imprisonment in Denmark In Denmark, a sentence to imprisonment is either served in local, open, or closed prisons. The penal system in Denmark is generally considered quite lenient both in terms of incarceration lengths and serving conditions (Danish Prison and Probation Service (2012)), in particular when compared to the United States (Guerino & Sabol (2012); Motivans (2012)). Around 85% of sentencing lengths in Denmark are below one year and only 7% are longer than two years (Danish Prison and Probation Service (2012)) which, in terms of time in jail, makes U.S. county jails the best suited comparison. Only offenders with the longest prison sentences or gang affiliation serve in closed prisons, which have more staff and control than open prisons and stricter rules about e.g. money, telephone calls, visits, and other matters. The stark differences between the modal sentencing lengths in Denmark and the U.S. partly stem from differences in the frequency of ’severe crime’. However, they also reflect that the general view towards criminal sactions differs. The most common form of imprisonment in Denmark is in open state prisons which, according to the Danish Prison and Probation Service, reflect the particular view that closed prisons 14 often institutionalize and brutalize offenders compared to open prisons.11 The open prisons seek to mimic a highly structured everyday life, where the inmates have mandatory chores such as manual or vocational labor and also cook their own meals. The prisons also provide possibilities of various lower secondary and vocational educations, in addition to programs such as anger-management and treatment for substance abuse. Traditionally, rehabilitation and counseling has been focussed on offenders who served long incarceration spells. In addition, there were no official guidelines for Danish jails on e.g. rehabilitating and work release programs for short sentences until 2006. Beyond anecdotal evidence, little is known of in-jail rehabilitation for offenders who only served one or two months in jail prior to 2006.12 Minke (2010) presents the most thorough documentation of the inmates’ conditions in Denmark through her survey from 2007-2009 (see Minke (2010) for information on the survey). Table I.1 in the shows the reported rates of participation in rehabilitation programs across time spent in jail using data from this survey. Table I.1: Participation Rates, In-Jail Rehabilitation Time spent in jail Participation rates Less than 1 month 0.202 Between 1 and 2 months 0.244 Between 2 and 3 months 0.414 Between 3 and 6 months 0.500 More than 6 months 0.534 Observations 109 41 29 72 442 Note: Table shows fraction of inmates who report that they had participated in rehabilitation programs while incarcerated by time spent in jail. Rehabilitation programs include: Treatment for drug or alcohol abuse, anger management therapy, cognitive behavior therapy. Based on data from Minke (2010). The data stem from surveys of inmates in 12 of the 16 jails in Denmark. The surveys were conducted between 2007 and 2009. The table shows that participation rates in rehabilitating programs increase in incarcer11 http://www.kriminalforsorgen.dk/Default.aspx?ID=1256, (accessed 04/11-2014) After 2006, all incarcerated offenders are entitled to an interview with a case officer at the time of admission into jail. The Danish Jail and Probation Service cannot specify the content of the admission interviews, except that the “incarceration lengths obviously limit the type of in-jail rehabilitation that can be put into effect.” Nevertheless, Minke (2010), which to my knowledge is the only coherent analysis of rehabilitation possibilities in Danish jails, finds that rehabilitation posibilities during short incarceration spells are very limited, even if the inmates themselves request treatment, assistance or training. 12 15 ation length. Around 20 % of the survey participants who have been incarcerated for less than one month and more than half the survey participants who have been incarcerated for three or more months have participated in rehabilitating programs. Finally, the absence of official guidelines prior to 2006 suggests that the offenders who committed crime prior to the reform experienced similar conditions as those who committed crime after the reform. 3 Data This paper13 uses full population register data from Statistics Denmark14 on criminal records, education, earnings, age, gender, ethnicity, marital status, and children, along with information on recipients of public transfers from the DREAM database.15 The DREAM database contains weekly information on all recipients of public transfers in Denmark along with a specification on type of public transfer (unemployment benefits, social assistance, public pensions, etc.). The various information are linked by an individual-specific social security number. The criminal registers include a unique case-specific code, verdict (guilty, acquitted), sentence type (imprisonment, suspended sentence, fine, warning, detainment), date of crime, type of crime, incarceration date, release date, type of incarceration (e.g., remand, serving term of imprisonment, etc.). As the frequency of violent crime decreases with age and the majority of crimes are committed by men, I only include men who were aged 45 or younger at the date of the 13 See Appendix B for a detailed description of the construction of data. Only Danish residents are included in the registers. 15 A supplementary register used in the Danish Rational Economic Agent Model (DREAM), an general equilibrium model used for forecasting. The database contains information on every Danish citizen who has received public benefits/transfers of any kind. For further information see: http://www.dst.dk/upload/microsoft word - beskrivelse af dream koder - version 22.pdf. 14 16 crime,16 in order to ensure homogeneity between the individuals in the sample. As special conditions apply for individuals below the age of 18, I only include adults of 18 or above. The data are available for several years prior to and after incarceration. Hence, I create a panel with one time series per individual per case. Since individuals in the sample served their specific jail sentence at different points in time, I create a separate time-line that does not include the time a given individual spent in jail, but measures the time to/from the start/end of incarceration for all individuals. Hence, -1 denotes the last observation before the beginning of the incarceration spell and 1 denotes the first observation after release. The sample consists of individuals incarcerated as a consequence of ordinary violence committed within a period of 18 months on each side of the reform, i.e., between December 2000 and November 2003. I only use offenders convicted of ordinary violence in order to obtain a sufficiently large homogenous sample, with respect to both incarceration length and observable characteristics. The time-span is chosen to reach a sample size that on the one hand reduces random variation, and on the other hand minimizes the differences between the earlier and later parts of the sample caused by changing demographics and business cycles. The group that committed crimes before the reform is the control group and the opposite group is the treatment group. No individual in the sample experienced more than one incarceration for ordinary violence during the time span considered in this paper.17 The resulting panel is perfectly balanced with a final sample size of 1,748 individuals, 875 belonging to the control group and 873 belonging to the treatment group. Importantly, though one and two months are the modal incarceration lengths, the sample is not restricted 16 Women and individuals aged 46 or older at the time of the crime made up 3.7% of the sample at this point. 17 I exclude the individuals who are not included in data the for at least 12 months prior to the incarceration in focus. This applies to 9 individuals, 5 and 4 individuals from the treatment and control groups, respectively. Additionally, some individuals e.g., emigrated or died during the three years after release. I exclude these from the sample. This censoring applies to 43 and 40 persons from the treatment and control groups respectively (2.5 and 2.2 percent of the final sample). The estimation results, with various imputations for the attrites’ missing periods instead of the above mentioned censoring, do not differ qualitatively from the results presented throughout the paper. The auxiliary estimation results can be obtained from the author on request. 17 by incarceration length. Limiting the sample by a endogenous variable would likely bias the results. Hence, the paper uses all individuals who were convicted of ordinary violence committed on each side of the reform. This study focuses on three outcomes: unemployment rates, dependency on other public transfers (e.g. pensions, support for education and skills-upgrades, labor market programs), and earnings (gross earnings excluding public transfers). The residual of unemployment and dependency on other public transfers is employment and self-sufficiency. The sum of unemployment and dependency of other public transfers constitute all public transfers. Thus, a reduction to one which is not offset by an increase to the other implies an increase to employment. I use the weekly observations of public transfers from the DREAM database to compute monthly rates of unemployment and dependency on other public transfers. I then combine this information with information on income, to obtain monthly observations of earnings. Table I.2 shows means and standard deviations of the three outcome variables along with observable characteristics. The pre-incarceration outcomes and socio-economic characteristics are measured 12 months prior to the incarceration and indicators of previous criminal history are measured at the time of incarceration. The table also shows means divided by treatment status, in order to investigate whether the reform was unrelated to the two groups’ observed characteristics prior to incarceration, and a random draw of the full male population in 2002 with similar age distribution as main sample of ordinary violence offenders. The table shows that the individuals in the sample suffered from high rates of unemployment, dependency on other public transfers, and low earnings compared to their average equal aged peers. Average monthly earnings were $ 1,473 (monetary units have been transformed using the 2005 exchange rate of DKK 6,003.37 to USD 1,000).18 Further, sizable proportions of the sample - 33 and 15 percent, respectively - were unemployed or depend18 As reported by the Central Bank of Denmark: http://nationalbanken.dk/DNDK/statistik.nsf/side/ Faerdige tabeller - Valutakurser!OpenDocument (accessed 11/03-2011). 18 Table I.2: Summary Statistics by Treatment Status Full sample Variable Measured 12 months prior to incarceration Unemployed Dependent on other public transfers Earnings (USD 2005) Age Married Cohabitant Have children Non-western immigrants or descendants No job-qualifying education Vocational or skilled Upper secondary or higher Convicted before Convicted of a violent crime before Convicted of a property crime before Months since 1st date of crime leading to an indictment Months since 1st date of crime leading to an conviction Months since 1st incarceration Number of previous convictions N Control Treatment Significant Random difference weighted sample 0.33 0.33 0.33 0.05 (0.46) (0.46) (0.45) (0.21) 0.15 0.15 0.15 0.15 (0.48) (0.35) (0.34) (0.36) 1,473 1,510 1,417 2,993 (1,922) (1,980) (1,862) (2,567) 28.13 28.40 27.86 28.61 (7.80) (7.83) (7.76) 0.23 0.26 0.20 (0.42) (0.44) (0.40) (7.70) ∗∗∗ 0.26 (0.44) 0.28 0.28 0.27 0.45 (0.45) (0.45) (0.44) (0.50) 0.36 0.38 0.34 (0.48) (0.49) (0.48) ∗ 0.34 (0.47) 0.12 0.11 0.13 0.07 (0.48) (0.32) (0.34) (0.25) 0.67 0.68 0.67 0.51 (0.47) (0.47) (0.47) (0.50) 0.23 0.23 0.23 0.31 (0.42) (0.42) (0.42) (0.46) 0.10 0.09 0.10 0.18 (0.30) (0.30) (0.30) (0.39) 0.83 0.83 0.83 0.21 (0.37) (0.37) (0.37) (0.41) 0.48 0.50 0.47 0.04 (0.50) (0.50) (0.50) (0.20) 0.67 0.67 0.66 0.15 (0.47) (0.47) (0.48) (0.36) 109 110 108 (83) (81) (84) 98 99 97 (84) (82) (85) 46 47 44 (73) (71) (75) 4.44 4.45 4.43 (5.89) (5.62) (6.15) 1,748 875 873 33,248 Note: Table shows summary statistics for the full sample and divided by treatment status. T-test for mean differences in mean sample by treatment status: + p < 0.10, ∗ p < 0.05, ∗∗ p < 0.01, ∗∗∗ p < 0.001 Random sample has been drawn from the full Danish population in 2002 with similar age distribution as main sample (here labelled Full sample). Significant difference indicate significant difference in means between the control and treatment groups. 19 ent on other public transfers 12 months prior to incarceration. There were no significant differences in any of the outcome variables 12 months prior to incarceration. The table also shows that the sample had a low level of resources as measured by socioeconomic variables, as the majority had no education beyond secondary school and few were married or cohabiting. On average the offenders committed the first crime for which they were charged 109 months prior to the incarceration, they had committed their first crime for which they were convicted at 98 months prior to the incarceration, and 18 percent of these crimes were violent. In addition, they experienced their first incarceration at 46 months prior to the incarceration, and more than 80 percent had received a conviction prior to the one studied in this paper. 43 percent had been convicted of a violent crime, whereas 64 percent had received convictions for a property crime. Finally, the average individual in the sample had more than four convictions prior to the one in question. The table also shows a significantly higher proportion of married individuals in the control group and a lower proportion of individuals with children in the treatment group. Beyond these two covariates, the table shows no other significant differences between the two groups. The use of a reform as identification also rests on the assumption that the pre-reform and post-reform groups were subject to equal trends. Therefore, I also need to consider potential macro-level differences across the reform. Denmark experienced a small recession that began in late 2001 and ended at the beginning of 2004. The recession was followed by a boom that lasted for the remaining part of the data period covered by this paper. However, figure I.1 indicates that the labor market outcomes for the two groups prior to their incarceration were not affected by the business cycles. Also, the official policies on the transition from life in jail to life outside did not change during the period of time covered by this paper. Hence, the two groups most likely faced the same general conditions on the labor market before their incarceration spells and there is little or no sign of any change in the average characteristics of the “violent offender” across the reform, nor any sign of a difference between the two 20 groups as a result of macroeconomic trends. Consequently, the reform most likely provided an exogenous increase in incarceration lengths. 4 Econometric Framework This paper evaluates the effect of a treatment on subsequent labor market outcomes. I define treatment as being incarcerated under the post-reform guidelines. I assess this treatment effect by following the terminology first introduced by Rosenbaum & Rubin (1983); Rubin (1974), and adopted by the general treatment literature, and define the treatment effect on the treated conditional on observable characteristics as: δAT T = E (δ | Di = 1) = E (yi (1) | xi , Di = 1) − E (yi (0) | xi , Di = 1) (1) where Di is a binary treatment indicator equal to 1 if individual i receives treatment and 0 otherwise. yi (1) denotes the outcome for individual i if i is sentenced under the postreform guidelines and yi (0) is the outcome if i is sentenced under the pre-reform guidelines, and xi is a set of observable characteristics. The estimate of δAT T expresses the difference between the expected outcomes for individual i if he is sentenced to incarceration under the post-reform guidelines rather than the pre-reform guidelines, under the condition that he would be incarcerated under the post-reform guidelines. Obviously, in reality I cannot observe individual i in both states at the same point in time. In order to reduce the volatility of the labor market outcomes I use the weekly observation of public transfers from the DREAM and income data to compute 6 months rates of unemployment, dependency on other public transfers, and earnings. I denote every 6 month period since release period s. Moreover, imposing a parametric form on the relationship between the outcome yi , the observable characteristics xi , the reform Di , and the unobserved components over time (defined as periods of six months), I express this as: 21 yis = βxis + δDi + ai + eis (2) where yis is a given labor market outcome for individual i in period s > 0 (that is, period s after the incarceration), xis is a set of observable characteristics, ai is an unobserved fixed effect, and eis is an unobserved idiosyncratic error. δ is the parameter of interest, i.e., the effect on y of the increase in incarceration length induced by the reform. By differencing equation (2) with pre-incarceration characteristics (one year prior to the beginning of the incarceration spell) I define ∆yis = yis − yi,−1year , ∆xis = xis − xi,−1year , ∆eis = eis − ei,−1year . Keeping in mind that Di is a dummy indicator of the treatment that I seek to evaluate over several periods from time of release s, the panel structure of the data allows me to pool every period s = 1, ..., 6 (i.e., the first three years after release) and I obtain the differencesin-differences (DD)19 model: ∆yis = β∆xis + s=6 X γs ds + s=2 s=6 X δsDD Di + ∆eis (3) s=1 where β is a vector measuring the effects of the co-variates,20 and ds is an indicator of the time since release equal to 1 if period = s and 0 otherwise. γs captures the pre-reform group’s (control group’s) labor market trends after release from jail while δsDD captures the difference between the control and treatment group in each of the six month periods since release. Hence, δsDD captures the effect of the reform. In order to obtain consistent results, none of the terms I include in equation (3) may correlate with the unobserved components. Figure I.1 and table I.2 showed that neither the magnitude of the dip nor the trends and levels of the outcome variables prior to incarceration 19 Here, differences-in-differences imply the difference between the pre- and post-incarceration outcomes of the post-reform offenders relative to the difference between the pre- and post-incarceration outcomes of the pre-reform offenders. 20 ∆x includes a constant term, changes in marital status, changes in children, three indicators of changes in education status, and changes in area of residence. 22 differed significantly across the two groups, and the reform provided an exogenous shift in incarceration lengths (so Di is orthogonal to the unobserved factors embedded in the time invariant ai and the idiosyncratic error eis ). If prior and recent levels of characteristics xi· are independent of the idiosyncratic error ei· while the reform provides an exogenous shift in incarceration lengths I can estimate the parameters consistently by OLS . As the observable differences between the two groups are negligible, it seems reasonable to assume that there are no fundamental differences between the unobserved characteristics of the two groups.21 However, the estimation of δsDD also relies on the assumption that the reform did not coincide with changes in labor market conditions. In such a case the pre- and post-reform groups would be subject to different macroeconomic trends after release from jail, which would - wrongfully - be interpreted as effects of the reform. In order to eliminate any macroeconomic trends that may bias the DD estimates I also estimate the effect of the reform by a triple-differences (DDD) model using offenders of property crimes (who were not subject to any reforms and did not experience any changes to the length of their incarceration spells) as additional controls. Hence, the triple-differences model allows me to eliminate any differences in labor market outcomes between the control and treatment group that were not a result of the reform. Appendix B (online appendix) describes the construction of the auxiliary sample. Let Vi be a binary indicator equal to 1 if i belongs to the main sample of violent offenders and equal to 0 if i belongs to the auxiliary sample of property offenders: ∆yis = β∆xis + s=6 X s=2 µs ds + s=6 X γs ds · Vi + s=1 s=6 X s=1 21 ρs D i + s=6 X δsDDD Di · Vi + ∆eis (4) s=1 Using the reform as an instrumental variable is an alternative estimation strategy relying on similar assumptions. If an IV strategy was employed instead, the DD estimates would be normalized by a first stage regression. The DD estimates should be divided by the mean increase as result of the reform if the treatment variable was measured in days of incarceration. This would result in smaller estimates than I obtain with the differences-in-differences strategy. The DD estimates should be rescaled by the increase in the fraction treated, if the treatment variable was defined as a dummy variable of experiencing incarceration below or above a threshold (e.g. 35 days see table IA.5). This would result in larger estimates than I obtain with the differences-in-differences strategy. 23 Here µs captures the property offenders’ pre-reform labor market trends after release from jail, γs captures the violent offenders’ pre-reform labor market trends, ρs captures the property offenders’ post reform groups labor market trends, and δsDDD captures the DDD estimates of the reform. The effect of the reform will be estimated consistently by OLS if prior and recent levels of characteristics xi· are independent of the idiosyncratic error term ei· and if the property offenders’ pre-reform to post-reform change in labor market outcomes equals that of the ordinary violence offenders, had these not been subject to the reform. Furthermore, as noted by Bertrand et al. (2004), differences-in-differences estimates often yield underestimated standard errors, as ordinary standard errors fail to take account of serial correlation often observed in relation to labor market outcomes, e.g. between unemployment rates from on period to the next. Therefore, I bootstrap the standard errors by blocks, drawing all observations for each individual instead of single observations, and implement a wild bootstrap procedure that introduce the variance to the estimates which otherwise would be too low.22 5 Results The following section presents the results of the differences-in-differences and triple-differences estimations of the effect of the reform. In addition, the section will also introduce robustness checks of the estimated effects. For unemployment rates and dependency on other public transfers the parameter estimates are effects in percentage points, as the two variables are measured in percentages. A parameter estimate of e.g., -0.055 at 7-12 months after release from jail, as seen in column 1 of table I.3, implies that those who committed crime after the reform experienced 0.055 percentage points lower unemployment rates during those six months when compared to 22 The wild bootstrap randomly assigns extra noise to the estimates. As proposed n by Davidson & Flachaire with probability 0.5 (2008); Flachaire (2005), I multiply each estimated error ∆eis with ρis = −1 , so 1 with probability 0.5 E (ρis ) = 0 and σρ2 = 1. 24 individuals who committed crime prior to the reform. For earnings, the parameter estimates corresponds to a change in monthly earnings in 2005 USD.23 Hence, a parameter estimate of 150.99 at 7-12 months after release from jail, as seen in column 5 of table I.3, corresponds likewise to an increase in earnings of 150.99 USD during the six months in question. Table I.3 shows the estimated effects of the reform (δs ) for the first three years after release for each of the three outcomes, together with block-bootstrapped standard errors of the estimates in parentheses. The columns labelled DD show the estimates from the differences-in-differences model as specified in equation (3) and the columns labelled DDD show the estimates from the triple-differences model as specified in equation (4). Table I.3: Main Estimation Results Periods 1-6 months after release 7-12 months after release 13-18 months after release 19-24 months after release 25-30 months after release 31-36 months after release R2 N Wald statistic (p-val) Unemployment (1) DD -0.033* (2) DDD -0.105*** Dep. on other transfers (3) (4) DD DDD 0.005 0.044 Earnings (0.019) (0.039) (0.014) (0.029) -0.055*** -0.129*** 0.005 0.009 150.99** (0.021) (0.043) (0.015) (0.033) (76.16) (122.52) -0.067*** -0.121*** 0.04 0.007 126.84 130.85 (0.020) (0.043) (0.016) (0.033) (77.74) (133.53) -0.071*** -0.132*** 0.004 -0.014 255.53*** 303.50** (0.021) (0.039) (0.017) (0.032) (82.18) (138.38) 205.19 (5) DD 87.49 (6) DDD 147.45 (63.00) (113.93) 171.75 -0.076*** -0.074* 0.007 -0.024 236.04*** (0.021) (0.043) (0.018) (0.034) (87.39) (152.65) -0.096*** -0.068 -0.001 0.010 393.04*** 219.33 (0.020) (0.044) (0.019) (0.038) (86.35) (166.58) 0.023 0.025 1,748 2,388 <0.001*** <0.001*** 0.008 1,748 0.998 0.014 2,388 0.801 0.023 0.021 1,748 2,388 <0.001*** 0.044** Significance levels: ∗ : p<10% ∗∗ : p<5% ∗ ∗ ∗ : p<1% Note: Table shows DD and DDD regression results for unemployment rates, dependency of other public transfers, and earnings the first 36 months after release from jail. The block-bootstrapped standard errors are reported in parentheses below the estimates 23 Though it is customary, I do not use log earnings because a large proportion of the sample experience longer periods with full unemployment. For unemployed individuals, earnings are equal to zero by definition. Table IA.1 in the online appendix shows the estimates using log earnings by replacing earnings equal to zero with one. The DD estimates do not differ qualitatively from those shown by table I.3 though the DDD estimates are very noisy with large standard errors. 25 Column 1 and 2 show the estimated effect of the reform on unemployment for the first three years after release. All of the DD estimates are negative and, except for the first six months after release, significant at a one percent significance level. The sizes of the estimates indicate that the reform induced a drop in the unemployment rate of approximately 4-5 percentage points. As time since release increases, so does the numerical size of the estimates, to a level of roughly 7 to 10 percentage points two years after release. The DDD estimates are also negative. However, they are numerically largest in the first two years after release from jail, with estimates around 10 percentage points. The estimates decrease somewhat two years after release both in size and significance and are insignificant three years after release. The table suggests that the reform has resulted in lower unemployment rates, but is ambiguous as to whether the effect increases or decreases in size with time since release. Moreover, the estimates appear to be volatile with large standard errors. Column 3 and 4 show the estimated effects of the reform on dependency on other public transfers the first three years after release. The table shows that most of the estimates are positive, but all numerically small and insignificant even at a ten percent level. The estimates are not significant when tested jointly either. Furthermore, the insignificant estimates of dependency on other public transfers show that there was no substitution between unemployment and other public transfers.24 This finding implies that the change in incar24 In order to investigate the possibility of opposing effects within the composite measure of dependency on other public transfers, I have estimated the model with a subdivision of this outcome into two general categories: first, voluntary efforts revealing an interest in (re-)entering the labor market at some point, e.g., financial support for education, financial aid for upgrading work-oriented skills, specific voluntary labor market programs; and second, mandatory programs required in order to be eligible for the receipt of benefits or passive support without any requirements, such as sick leave, early retirement relating to lack of employability, etc. None of the benefits included in the second group are aimed at obtaining future employment on the labor market. The first outcome category was unaffected by the reform, while there was a weak sign of an increase in the second category as a consequence of the reform. However, this result was neither sufficiently robust nor significant on a sufficient level for any conclusions to be drawn. Additionally, I have estimated the model with total dependency on public transfers (the sum of unemployment and dependency on other public transfers). The results using this outcome did not change any conclusions, as they were not significantly different from those with rate of unemployment as an outcome. Hence, I conclude that the results are not caused by a substitution effect. 26 ceration length following the reform increased employment for the sample, as the reduction to unemployment is not offset by an increase in other types of public dependency. Column 5 and 6 show the estimated effect of the reform on earnings for the first three years after release. The DD estimates show no effect of the reform on subsequent earnings during the first six months after release, but the estimated effect increases with time since release and, except for the time between 1-6 months and 13-18 months after release, it is highly significant. The estimates show a positive and significant effect of the reform of more than $ 300 per month, three years after release from jail. The DDD estimates do not differ in sign from the DD estimates. All DDD estimates are positive but generally the standard errors of the DDD estimates are much larger than those of the DD estimates. Only one of the DDD estimates is significant at a five percent level, while the estimates are jointly significant at a five percent level. The results suggest that the reform was associated with an increase in earnings. The data do not allow me to determine whether the earnings increases appear due to higher levels of productivity or lower levels of unemployment. However, since table I.3 also shows that unemployment decreased for the treated, resulting in a greater number of individuals with none-zero earnings, I suspect this to be the dominant factor. A possible source of bias is different trends according to pre-treatment characteristics, e.g., if offenders with children experience different labor market trajectories. Table I.4 presents the estimation results, while allowing for different trends according to observable pre-incarceration characteristics. The columns labelled DD show the estimates from the differences-in-differences model and the columns labelled DDD show the estimates from the triple-differences model. Column 1 and 2 of the table show the estimated effect of the reform on unemployment, column 3 and 4 show the estimated effects of the reform on dependency on other public transfers, and column 5 and 6 show the estimated effect of the reform on earnings the first three years after release. The table shows that the estimates from table I.3 are robust to differences in trends 27 Table I.4: Estimation Results, Different Trends Periods Unemployment (2) DDD -0.104*** Dep. on other transfers (3) (4) DD DDD 0.005 0.043 Earnings (5) DD 87.49 1-6 months after release (1) DD -0.037* (0.019) (0.040) (0.014) (0.028) (47.80) (115.72) 7-12 months after release -0.060*** -0.128*** 0.005 0.009 150.99** 174.95 (0.021) (0.042) (0.015) (0.032) (90.56) (124.69) 132.30 13-18 months after release 19-24 months after release 25-30 months after release 31-36 months after release R2 N Wald statistic (p-val) -0.072*** -0.122*** 0.04 0.007 126.84 (0.020) (0.041) (0.016) (0.033) -0.075*** -0.134*** 0.004 -0.014 (0.021) (0.039) (0.017) (0.033) -0.080*** -0.075* 0.007 -0.026 (0.021) (0.042) (0.018) (0.036) -0.100*** -0.067 -0.001 0.009 (0.020) (0.043) (0.019) (0.038) (107.99) 255.53*** (109.21) 236.04*** (156.17) 393.04*** (260.28) 0.027 0.033 1,748 2,388 <0.001*** <0.001*** 0.009 1,748 0.999 0.015 2,388 0.775 (6) DDD 149.93 (137.20) 307.99** (144.60) 210.44 (154.61) 223.60 (164.27) 0.032 0.033 1,748 2,388 <0.001*** 0.043** Significance levels: ∗ : p<10% ∗∗ : p<5% ∗ ∗ ∗ : p<1% Note: Table shows DD and DDD regression results for unemployment rates, dependency of other public transfers, and earnings the first 36 months after release from jail with different trends for pre-incarceration characteristics: dummies for completed any education beyond secondary school, completed any qualifying education, maritial and cohabiting status, and parental status. The block-bootstrapped standard errors are reported in parentheses below the estimates according to pre-treatment characteristics, as none of the estimates shown in table I.4 differ much from the estimates shown in table I.3. The estimates on unemployment are negative and highly significant, the estimates on dependency on other public transfers are insignificant, and the estimates on earnings are positive and jointly significant. A different source of bias could arise as the timeline on the one hand is defined relative to the date of release while the reform on the other hand changes the incarceration length and hence the dates of release for the treatment group. If, for example, the reform lead to surge in summer releases, relative to the pre reform period, then the estimated reform effects could include both seasonal differences in unemployment rates as well as the prolonged incarceration effect. However, computing the treatment effect based on the start of incarceration will 28 eliminate any seasonal differences between the treatment and control group that are caused by changes to release dates rather than the actual changes to incarceration lengths. Table IA.225 shows the DD and DDD estimation results when evaluating unemployment rates from the date of incarceration and not from the day of release, setting all in-prison unemployment rates to 0, 1, or the last observed unemployment rate before incarceration, respectively. Generally, the results do not differ in sign, size or significance level compared to the main estimation results from table I.3. In similar vein, longer incarceration spells will also postpone future earnings after release. Table IA.326 shows the DD and DDD estimation results for earnings discounted relative the incarceration length using an annual discount rate of 0.04, in order to investigate whether the effects of the reform on the present value of earnings differ from the overall effect on earnings. The estimation results from table IA.3 do not differ qualitatively from those of table I.3. 5.1 Macroeconomic Trends In the following, I restrict the sample such that I only include persons who committed crime 4 months prior to or after the implementation of the reform27 (rather than 18 months).28 I do this to investigate whether macro-level changes affected the main results. The shorter time-span reduces the sample size to 351 for the differences-in-differences estimation and 484 for the triple-differences. The reduced sample size may affect significance levels but not signs or sizes of the estimates from table I.3, if these are robust. Table I.5 shows the estimated effect of the reform for the three outcomes using the reduced sample(s) (corresponding to table I.3), together with the standard errors of the estimates. 25 In the appendix. In the appendix. 27 Table IA.4 shows balancing tests for this restricted sample. 28 I have also performed this robustness check with persons sentenced to imprisonment for a crime committed 2, 3, 5, 6, 7, 8, and 9 months prior to or after the implementation of the reform. The results can be obtained from the author on request. 26 29 Table I.5: Estimation Results, Reduced Sample Periods 1-6 months after release (1) DD -0.046 (2) DDD -0.146* Dep. on other transfers (3) (4) DD DDD 0.031 0.062 (0.043) (0.086) (0.033) (0.064) (138.82) (253.71) 7-12 months after release -0.092** -0.144 0.044 0.112 238.51 231.10 (0.045) (0.092) (0.034) (0.070) (159.44) (278.80) 13-18 months after release 19-24 months after release 25-30 months after release 31-36 months after release R2 N Wald statistic (p-val) Unemployment Earnings (5) DD 51.54 (6) DDD 145.87 -0.128*** -0.190* 0.023 0.007 287.15 229.77 (0.044) (0.105) (0.038) (0.077) (150.25) (287.04) -0.059 -0.181* 0.004 0.005 205.50 170.64 (0.047) (0.102) (0.041) (0.081) (166.92) (312.15) -0.040 -0.015 0.013 -0.076 -17.75 66.98 (0.050) (0.102) (0.043) (0.087) (187.67) (330.36) -0.034 -0.032 0.014 0.026 95.16 90.28 (0.046) (0.099) (0.042) (0.089) (174.60) (349.71) 0.031 351 0.020** 0.043 484 0.047** 0.013 351 0.832 0.043 351 0.250 0.041 484 0.914 0.021 484 0.608 Significance levels: ∗ : p<10% ∗∗ : p<5% ∗ ∗ ∗ : p<1% Note: Table shows estimation results where the sample has been narrowed to +-4 months around the reform. The block-bootstrapped standard errors are reported in parentheses below the estimates Column 1 and 2 show the estimated effects of the reform on subsequent unemployment. The size of the estimated effects in the short run increases in comparison to the original estimates from table I.3. However, the volatility of the estimates also increases as the span narrows. Hence, the table offers no definitive guidance as to the exact size of the effect of the reform, as the estimates from table I.3 are well within any conventional confidence intervals. Column 3 and 4 show the estimates with dependency on other public transfers as outcome. There are some differences between the two columns the original estimates in column 3 and 4 of table I.3. The numerical size of the estimates increases somewhat compared to the full-sample estimates, especially for one specific DDD estimate. Yet, none of the estimates are close to being significant, neither when tested alone nor jointly.29 Column 5 and 6 show 29 Again, I have estimated the model using total dependency on public transfers. The results were not significantly different from those for unemployment. Therefore, they suggest that the rate of unemployment dropped as a consequence of the reform, while there was no attrition from the labor market, as the reform did not affect dependency on other public transfers. 30 the estimates with earnings as outcome. Comparing the estimates to those from column 5 and 6 of table I.3 the estimates for the first two years after release have similar sign and size, while the remaining reduced sample estimates diverge from the full sample estimates. Also, the estimates from the reduced sample the standard errors increase as the sample is reduced. Overall, the estimated parameters of the effect on the unemployment rate and dependency of other public transfers appeared robust to the reduction in sample size and the significant reduction of the unemployment rates appears robust as it shifted in the opposite direction of the probable macro-trend bias. Still, there is little evidence as to whether the shorter time span affected the estimates for earnings as the sample size reduction more than doubled the standard errors. To confirm the findings for unemployment and elucidate the effect on earnings, I define a series of placebo-reforms for each month from November 1999 to November 2004 and test for joint significance related to the effects on unemployment and earnings. Around each placeboreform I have constructed a new sample as described in section 3, including individuals who committed crimes within an 18 month span on each side of the placebo-reform.30 Figure I.4 shows the p-value for a Wald-test for joint significance for the estimated effect of the placeboreforms from the DDD model31 with unemployment and earnings as outcomes, respectively. If the reform is the causal catalyst driving the estimates, the placebo-reform-samples before January 2001 and after December 2003 should be insignificant as the 18 month span on each side does not coincide with the true timing of the reform at June 2002. Between January 2001 and December 2003, the estimation samples will increasingly coincide with the true 30 The sample size of each placebo-reform is approximately equal to the sample size of the main sample constructed around the actual reform. The individual placebo-control groups and placebo-treatment groups are also approximately of equal size. 31 I have also estimated the effects of the placebo-reform for the DD model. The results and overall pattern do not differ qualitatively form DDD results reported in figure I.4. The DD placebo results can be obtained from the author on request. 31 timing of the reform as the timing moves towards the vertical line at June 2002. I.e., the placebo-reforms should increase in significance as the gap to June 2002 closes from each side. Figure I.4: Significance Level for Joint Tests of Placebo-Reforms Level of significance (p-value) 1 0.1 0.05 Nov-04 Jul-04 Sep-04 Mar-04 May-04 Jan-04 Nov-03 Jul-03 Sep-03 May-03 Jan-03 Mar-03 Nov-02 Sep-02 Jul-02 May-02 Jan-02 Mar-02 Nov-01 Jul-01 Sep-01 Mar-01 May-01 Jan-01 Nov-00 Sep-00 Jul-00 May-00 Jan-00 Mar-00 Nov-99 0 Timing of pseudo-reform Earnings Unemployment Note: Figure shows p-values for pseudo-reforms before and after the actual timing of the reform. The vertical dashed line marks the time of the reform. The data for each placebo-reform includes crimes convicted 18 months prior to and 18 months after the given placebo-reform. Hence, “treatment” and “control” groups of placebo-reforms between January 2001 and December 2003 will coincide with the true timing of the reform in June 2002 to greater or lesser extent. The figure shows insignificant effects on earnings and unemployment for all sets of placeboreforms that do not include June 2002. As the timing of the placebo-reforms approaches the timing of the real reform, from the left and from the right, the levels of significance increase for both models. Only placebo-reforms where around half or more of the placebo-reform data coincide with the data constructed around the true reform are significant. Hence, the figure confirms that the DDD estimates are robust to trends and fluctuations, and that the main estimation results are valid. The figure also shows that the estimated effects on 32 unemployment around the true timing of the reform are highly significant for a longer time span than the estimated effects on earnings. The dominant effect on unemployment relative to the effect on earnings could suggest that the reform causes average earnings to increase because the reform lowers unemployment and not because longer incarceration spells result in higher rates of productivity and labor income. In 2006 the eligibility rules for reception of social assistance where tightened. If this change affected the sample’s unemployment rates, then it should translate into significant effects for the full sample from around 24 months following release and onwards, while it should not affect the estimates from the reduced sample. However, tables I.3, I.4, and I.5 showed that the estimates to unemployment rates were significant from the timing of release. Moreover, the placebo-reforms around the timing of change to the eligibility rules in March 2006 should be significant, which they are not. Therefore, the estimated effects are not results of the tightening of the eligibility rules for reception of social assistance. 5.2 Mechanisms With the range incarceration lengths that this paper focuses on in mind, it seems unreasonable that the results stem from changes in human capital. Instead, the effects of the increase in incarceration length could work through different channels. The survey results from table I.1 discussed previously in the paper shows a positive relationship between incarceration length and participation in rehabilitating programs which introduces a likely mechanism: The reform may affect employment through increased participation rates in jail for which an offender’s pre-reform sentence would otherwise have been too short, leaving the offenders solely with the possible stigma, job-loss, and general alienation from the labor market which incarceration might involve. This proposition is in accordance with Balvig (2006),32 32 Produced by Flemming Balvig, Professor of Criminal Law, in cooperation with The Danish Bar and Law Society. 33 who describes the chances of entering rehabilitation during a short incarceration spell in a Danish jail as follows: “Convicted offenders who are sentenced to the shortest incarceration spells are in practice only able to use the jails’ rehabilitating services to a very limited degree (...) Jails do not draft plans of action for the time of incarceration and for the release of the offenders, as a consequence of the short incarceration spells.” [Balvig (2006, pp. 12), own translation]. Hence, the longer incarceration spells induced by the reform may have increased labor market outcomes simply by giving prison authorities additional time to aid the offender at the time of release. Also, if jail is no longer an exception to everyday life but something more persistent, incarceration may be perceived quite differently, so e.g. an increase from one month of incarceration to two months causes the offender to update his perception of the payoff to crime to a lower level, thereby reducing the likelihood of recidivism (Abrams (2012); Owens (2009); Sah (1991)) and thus increasing chances finding employment. In order to gain further insight into the estimated effects, I calculate the average characteristics of the group whose incarceration lengths the reform affected the most. Almond & Doyle (2011) provide a framework for estimating complier characteristics in estimations of Local Average Treatment Effects. I adapt this technique in order to estimate the average characteristics of those who experienced the largest increase in incarceration length as a result of the reform. Figure I.2 showed that a large proportion of offenders served two months instead of one month. I define a dummy variable P equal to 0 if individual i served 35 days or less in jail and equal to 1 if i served more than 35 days. Let πa be the fraction of always-takers who would have served more than 35 days disregarding of whether they committed their crime before or after the reform. Let πn be the fraction of never-takers who would have served less than 35 days disregarding of whether they committed their crime before or after the reform. πa is estimated as the fraction of pre-reform offenders who served more than 35 days and πn is estimated as the fraction of post-reform offenders who did not serve more than 35 days. 34 Finally, πc is those who only served more than 35 days in jail as a result of the reform. By monotonicity I can estimate πc = 1 − πa − πn and identify the average characteristics for the offenders who served more than 35 days as a result of the reform as: πa πc + πa E (X | P = 1, D = 1) − E (X | P = 1, D = 0) πc π c + πa (5) Table IA.5 in the appendix shows the average characteristics of the group that only served more than 35 days in jail as a result of the reform and the full sample. The table shows that those who experienced the largest effect from the reform were younger, not married, had very little education, and had lower pre-incarceration unemployment rates compared to the full sample. Also, the criminal trajectory of those who served around two months in stead of one month were shorter, and very few had a previous convictions of violent crimes. This finding sheds light on the size of the main estimation results presented earlier. The compliers (younger offenders at the onset of a long criminal trajectory) are likely much more malleable and benefit much more from a change to their life course compared with the average offenders (older offenders with a long criminal history). The results also imply that incarceration may benefit the young offenders with a short criminal record at the intensive margin, as opposed to the extensive margin, where e.g., Aizer & Doyle (2011) show that incarceration has large negative effects for juvenile offenders. I.e. once the offender is convicted to imprisonment, young offenders may benefit more from rehabilitating programs or be more likely to change their perception of the payoff to crime than older offenders. Moreover, the sizes of the estimates on unemployment rates and earnings also suggest a positive long run effect of the reform using a back-of-an-envelope cost-benefit calculation. I derive this by using the estimated effects on unemployment and earnings to proxy savings on social assistance and increased tax revenue while correcting for 35 the average costs per offender of the increased incarceration length caused by the reform.33 The approximation yields a positive net present value within the first two years after release. 6 Conclusion This paper investigates how incarceration length affects unemployment rates, dependency on other public transfers, and earnings for offenders who receive short jail sentences. I use a reform of the Danish penal code in 2002 to facilitate causal inference. The estimates showed that an increase in incarceration length resulted in lower rates of unemployment in a sample of violent offenders. In the first years following release, unemployment rates were reduced by as much as approximately 10 percentage points as an effect of the reform, though the estimates were ambiguous on the persistence of the effect beyond two years after release. The results to unemployment were robust with respect to limitation of the time-span and also to a series of placebo-reforms. In contrast, dependency on other public transfers was not affected by the reform. This implies that the increase in incarceration length increased the residual outcome self-dependency and employment. Furthermore, the initial estimates suggested a positive effect on earnings that increased with time after release. While the estimated effect on earnings was robust to differential time trends according to pre-incarceration characteristics and a series of placebo-reforms, the increases in earnings were not highly significant. This suggests that the effect on average earnings may be a second order effect that works through the reduction to unemployment and not e.g. through higher wage rates. The conclusion that longer incarceration spells lower unemployment is in accordance with the results from Kling (2006). The stronger effects found in this paper may arise from the margin of evaluation. This paper focuses strictly on offenders who received short 33 The average cost of one week of incarceration in 2002 was approximately USD 1,000 in 2005 prices, according to the Danish Prison and Probation Service. 36 incarceration spells, which constitute the majority of Danish offenders who are sentenced to imprisonment, whereas Kling (2006) included a wider range of offenders to estimate average general effects that may have offset heterogeneous effects. The larger effects could also arise due to the characteristics of the group of offenders who experienced the largest increases in incarceration lengths due to the reform. The reform mainly affected men that were younger and had a shorter criminal history compared to the full sample. As younger individuals may be more malleable than older offenders, young offenders may benefit significantly more from rehabilitating programs or change their perception of the payoff to crime to a larger degree compared to their older peers with longer criminal records. The results of this paper rely on the exogeneity of the reform and for the DDD estimates on the use of property offenders as an additional control group. If the reform was not exogenous and caused the average offender to change, it is reasonable to assume that longer sentences would result in a treatment group that has weaker socioeconomic potential than the control group - or in other words, a group with less employment-security and poorer affiliation to the labor market, etc. However, the results (especially for the rate of unemployment) suggest the opposite. If the use of the reform in this setting is endogenous, the likely direction of bias is thus not towards zero. The direction of the bias will be changed indeterminately if the two groups experienced different trends in the outcomes or alternatively if the dips/spikes prior to incarceration differed. The paper shows that this was not the case. Nevertheless, the pre-incarceration dips/spikes call for further study, which should elucidate whether the dips/spikes are a general phenomenon for all offender types and further, how they are related to the criminal acts themselves and the severity of the crimes, in order to uncover any selfselection. It is worthwhile to consider whether very short sentences only serve to stigmatize offenders without providing any proper aid or incentive to rehabilitate. Policy-makers and judges, for instance, may consider whether very short sentences of imprisonment could usefully be 37 replaced by suspended sentences or community service, without violating the victims’ sense of justice. And where imprisonment is deemed necessary, the various stakeholders should perhaps consider the offender’s possibilities of rehabilitation to a greater degree. References Abrams, David S. 2012. Estimating the Detterent Effect of Incarceration Using Sentecing Enhancements. American Economic Journal: Applied Economics, 4(4), 32–56. Aizer, Anna, & Doyle, Joseph J. 2011. Juvenile Incarceration & Adult Outcomes: Evidence from Randomly-Assigned Judges. Unpublished Working Paper. Almlund, Mathilde, Duckworth, Angela L, Heckman, James J, & Kautz, Tim D. 2011. In: Hanushek, Eric, Machin, Stephen & Woesmann, Ludger (eds), Handbook of the Economics of Education vol. 4. Amsterdam: Elsevier. 1–111. Almond, Douglas, & Doyle, Joseph J. 2011. After Midnight: A Regression Discontinuity Design in Length of Postpartum Hospital Stays. American Economic Journal: Economic Policy, 3(3), 1–34. Ashenfelter, Orley. 1978. Estimating the Effect of Training Programs on Earnings. The Review of Economics and Statistics, 60(1), 47–57. Balvig, Flemming. 2006. Ni anbefalinger fra arbejdsgruppen om fremtidens straffe. Tech. rept. Advokatsamfundet. Ben-Porath, Yoram. 1967. The Production of Human Capital and the Life Cycle of Earnings. The Journal of Political Economy, 75(4), 352–365. Bertrand, Marianne, Duflo, Esther, & Mullainathan, Sendhil. 2004. How Much Should We Trust Differences-in-Differences Estimates? Quarterly Journal of Economics, 119(1), 249–275. 38 Cunha, Flavio, & Heckman, James J. 2008. Formulating, identifying and estimating the technology of cognitive and noncognitive skill formation. Journal of Human Resources, 43(4), 738. Cunha, Flavio, Heckman, James J., & Schennach, Susanne M. 2010. Estimating the technology of cognitive and noncognitive skill formation. Econometrica, 78(3), 883–931. Danish Prison and Probation Service. 2012. Statistik 2011. Davidson, Russell, & Flachaire, Emmanuel. 2008. The wild bootstrap, tamed at last. Journal of Econometrics, 146(1), 162–169. Flachaire, Emmanuel. 2005. Bootstrapping heteroskedastic regression models: wild bootstrap vs. pairs bootstrap. Computational Statistics & Data Analysis, 49(2), 361–376. Fougère, D., Kramarz, F., & Pouget, J. 2009. Youth unemployment and crime in France. Journal of the European Economic Association, 7(5), 909–938. Freeman, Richard B. 1992. Crime and the Employment of Disadvantaged Youths. In: Peterson, George, & Vroman, Wayne (eds), Urban Labor Markets and Job Opportunities. Washington D.C.: The Urban Institutes Press. Freeman, Richard B. 1996. Why do so many young American men commit crimes and what might we do about it? The Journal of Economic Perspectives, 10(1), 25–42. Glaeser, Edward L., Laibson, David, & Sacerdote, Bruce. 2002. An Economic Approach to Social Capital. Economic Journal, 112(483), 437–458. Granovetter, Mark S. 1995. Getting a job. 2 edn. Chicago, (IL): University of Chicago Press. Grogger, Jeffrey. 1995. The Effect of Arrests on the Employment and Earnings of Young Men. The Quarterly Journal of Economics, 110(1), 51–71. 39 Guerino, Paul, Paige M. Harrison, & Sabol, William J. 2012. Prisoners in 2010. Tech. rept. Bureau of Justice Statistics. Holzer, Harry J. 1988. Search Method Use by Unemployed Youth. Journal of Labor Economics, 6(1), 1–20. Kling, Jeffrey R. 2006. Incarceration Length, Employment, and Earnings. American Economic Review, 96(3), 863–876. Lochner, Lance. 2004. Education, Work, and Crime: A Human Capital Approach. International Economic Review, 45(3), 811–843. Lott, John R. 1992a. An Attempt at Measuring the Total Monetary Penalty from Drug Convictions: The Importance of an Individual’s Reputation. The Journal of Legal Studies, 21(1), 159–187. Lott, John R. 1992b. Do We Punish High Income Criminals Too Heavily? Economic Inquiry, 30(4), 583–608. Machin, Stephen, Marie, Olivier, & Vujic, Suncica. 2011. The Crime Reducing Effect of Education. Economic Journal, 121(552), 463–484. Mincer, Jacob. 1974. Schooling, Experience, and Earnings. Human Behavior & Social Institutions No. 2. New York, NY: National Bureau of Economic Research, Inc. Ministry of Justice, Research Unit. 2007. Udviklingen i Anmeldelsestallene og i Straffe for Vold. Minke, Linda Kjær. 2010. Fængslets indre liv: med særlig fokus på fængselskultur og prisonisering blandt indsatte. Tech. rept. University of Copenhagen, Faculty of Law. Moretti, Enrico, & Lochner, Lance. 2004. The effect of education on criminal activity: evidence from prison inmates, arrests and self-reports. American Economic Review, 94(1), 2004. 40 Motivans, Mark. 2012. Federal Justice Statistics 2009 - Statistical Tables. Tech. rept. Bureau of Justice Statistics. Nagin, Daniel, & Waldfogel, Joel. 1995. The effects of criminality and conviction on the labor market status of young British offenders. International Review of Law and Economics, 15(1), 109–126. Nagin, Daniel, & Waldfogel, Joel. 1998. The Effect of Conviction on Income Through the Life Cycle. International Review of Law and Economics, 18(1), 25–40. Needles, Karen S. 1996. Go directly to jail and do not collect? A long-term study of recidivism, employment, and earnings patterns among prison releases. Journal of Research in Crime and Delinquency, 33, 471–496. OECD. 2010. OECD Factbook 2010: Economic, Environmental and Social Statistics. OECD Publishing. Owens, Emily G. 2009. More Time, Less Crime? Incapacitative Effect og Sentence Enhancements. Journal of Law and Economics, 52(3), 551–579. Rosenbaum, Paul R., & Rubin, Donald B. 1983. The Central Role of the Propensity Score in Observational Studies for Causal Effects. Biometrika, 70(1), 41–55. Rubin, Donald B. 1974. Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of Educational Psychology, 66(5), 688–701. Sah, Raaj K. 1991. Social Osmosis and Patterns of Crime. Journal of Political Economy, 99(6), 1272–1295. Sampson, Robert J., & Laub, John H. 1995. Crime in the making: pathways and turning points through life. Harvard University Press. Secretary of Justice Lene Espersen. 2002 (Apr.). Betnkning til 2001/2 LF 118. tps://www.retsinformation.dk/Forms/R0710.aspx?id=101010. 41 ht- U.S. Bureau of Justice Statistics. 2012. Crime & Justice Electronic Data Abstract spreadsheets. http://www.bjs.gov/content/dtdata.cfm, accessed 2012.08.22. Waldfogel, Joel. 1994. The Effect of Criminal Conviction on Income and the Trust ”Reposed in the Workmen”. The Journal of Human Resources, 29(1), 62–81. Western, Bruce, Kling, Jeffrey R., & Weiman, David F. 2001. The Labor Market Con- sequences of Incarceration. Crime Delinquency, 47(3), 410–427. Wilson, James Q, & Abrahamse, Allan. 1992. Does Crime Pay. Justice Quarterly, 9(3), 359–377. Witte, Ann D., & Tauchen, Helen. 1994. Work and crime: an exploration using panel data. NBER Working Paper no. 4794. 42 A Supplementary Results Table IA.1: Estimation Results Using Log Earnings Periods 1-6 months after release DD 0.216 (0.173) (0.341) 7-12 months after release 0.489*** 0.416 13-18 months after release 19-24 months after release 25-30 months after release 31-36 months after release R2 N Wald statistic (p-val) (0.193) 0.583*** (0.206) 0.689*** (0.222) 0.609*** (0.221) 0.919*** (0.217) DDD 0.270 (0.368) 0.446 (0.384) 0.740* (0.395) 0.511 (0.406) 0.413 (0.425) 0.015 0.031 1,748 2,388 <0.001*** 0.158 Significance levels: ∗ : p<10% ∗∗ : p<5% ∗ ∗ ∗ : p<1% Note: Table shows DD and DDD regression results until 36 months after release from jail with log earnings as outcome. The block-bootstrapped standard errors are reported in parentheses below the estimates 43 44 (0.043) (0.020) (0.020) -0.098*** (0.021) -0.065*** (0.021) -0.075*** (0.021) -0.048*** (0.021) (0.043) -0.068 (0.042) -0.054 (0.039) -0.121*** (0.041) -0.106*** (0.041) -0.134*** (0.039) 0.016 0.021 0.022 0.025 1,748 2,388 1,748 2,388 <0.001*** <0.001*** <0.001*** <0.001*** -0.066 (0.042) -0.097*** -0.054 -0.065*** (0.021) (0.040) (0.021) (0.041) -0.120*** (0.021) -0.074*** -0.108*** (0.041) (0.021) -0.050*** -0.138*** -0.057*** -0.055** (0.043) -0.067 (0.042) -0.054 (0.039) -0.120*** (0.411) -0.106*** (0.042) -0.124*** (0.043) 0.064 0.056 1,748 2,388 <0.001*** <0.001*** (0.020) -0.098*** (0.021) -0.065*** (0.021) -0.075*** (0.021) -0.049*** (0.021) -0.051*** (0.019) (0.017) (0.018) (0.039) Unem. as before jail Full unem. in jail DD DDD DD DDD -0.032*** -0.094*** -0.001 -0.063 Full emp. in jail DD DDD -0.029 -0.097* Significance levels: ∗ : p<10% ∗∗ : p<5% ∗ ∗ ∗ : p<1% Note: Table shows DD and DDD regression results where time 0 is the start of incarceration instead of release from jail. The table shows the results for unemployment as outcome using three different imputations of in-jail unemployment: Full employment, unemployment rate as before incarceration, and full unemployment. The block-bootstrapped standard errors are reported in parentheses below the estimates R2 N Wald statistic (p-val) 31-36 months after incarceration start 25-30 months after incarceration start 19-24 months after incarceration start 13-18 months after incarceration start 7-12 months after incarceration start 1-6 months after incarceration start Periods Table IA.2: Estimation Results From Start of Incarceration Table IA.3: Estimation Results Discounted Earnings Periods 1-6 months after release DD 86.391 (62.512) (114.950) 7-12 months after release 149.878** 167.508 13-18 months after release 19-24 months after release (75.598) (123.980) 125.534 125.339 (77.224) (136.473) 253.323*** 299.236** (81.590) 25-30 months after release R2 N Wald statistic (p-val) (143.677) 233.744*** 202.546 (86.992) 31-36 months after release DDD 142.046 (153.984) 389.898*** 222.889 (85.764) (163.696) 0.023 1,748 <0.001*** 0.026 2,388 0.059* Significance levels: ∗ : p<10% ∗∗ : p<5% ∗ ∗ ∗ : p<1% Note: Table shows DD and DDD regression results with discounted earnings as outcome. Discount rate is 0.04. The block-bootstrapped standard errors are reported in parentheses below the estimates 45 Table IA.4: Summary Statistics by Treatment Status for Reduced Sample Window Variable Measured 12 months prior to incarceration Unemployed Dependent on other public transfers Earnings (USD 2005) Age Married Cohabitant Have children Non-western immigrants or descendants No job-qualifying education Vocational or skilled Upper secondary or higher Measured at the start of incarceration Have been convicted before Have been convicted of a violent crime before Have been convicted of a property crime before Months since 1st date of crime leading to an indictment Months since 1st date of crime leading to an conviction Months since 1st incarceration Number of previous convictions N Full sample Control Treatment 0.32 (0.45) 0.18 (0.35) 1,532 (1,891) 28.99 (7.91) 0.21 (0.41) 0.29 (0.46) 0.39 (0.49) 0.13 (0.34) 0.67 (0.47) 0.23 (0.42) 0.10 (0.30) 0.33 (0.46) 0.17 (0.37) 1,427 (1,964) 29.27 (7.97) 0.25 (0.43) 0.32 (0.45) 0.41 (0.49) 0.16 (0.37) 0.63 (0.48) 0.26 (0.44) 0.11 (0.31) 0.30 (0.44) 0.16 (0.33) 1,481 (1,925) 28.71 (7.87) 0.18 (0.38) 0.27 (0.47) 0.37 (0.48) 0.10 (0.30) 0.70 (0.46) 0.21 (0.41) 0.09 (0.29) 0.87 (0.33) 0.52 (0.50) 0.68 (0.47) 117 (84) 106 (86) 52 (78) 4.96 (6.06) 351 0.90 (0.31) 0.56 (0.50) 0.67 (0.46) 118 (83) 106 (85) 50 (75) 4.95 (6.01) 172 0.85 (0.35) 0.48 (0.50) 0.69 (0.47) 117 (85) 106 (87) 55 (81) 4.937 (6.12) 179 Sign. diff. ∗ T-test for differences in the means; significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1% Note: Table shows summary statistics for the full sample and divided by treatment status for the reduced sample window. Sign. diff. indicate significant difference in means between the control and treatment groups. 46 Table IA.5: Summary Statistics for the Compliers and the Full Sample Variable Increased incarceration Full sample length to +35 days Measured 12 months prior to incarceration Unemployed Dependent on other public transfers Earnings (USD 2005) Age Married Cohabitant Have children Non-western immigrants or descendants No job-qualifying education Vocational or skilled Upper secondary or higher Measured at the start of incarceration Have been convicted before Have been convicted of a violent crime before Have been convicted of a property crime before Months since 1st date of crime leading to an indictment Months since 1st date of crime leading to an conviction Months since 1st incarceration Number of previous convictions 0.24 0.12 1,406 24.89 0.00 0.42 0.25 0.17 0.72 0.18 0.10 0.33 0.15 1,473 28.13 0.23 0.28 0.36 0.12 0.67 0.23 0.10 0.83 0.05 0.56 101 83 25 2.90 0.83 0.48 0.67 109 98 46 4.44 Note: Table shows summary statistics for the full sample and compliers if the reform is used as an IV for jail¿35 days and not reduced form as in the remainder of the paper. 47 Figure I.1: Trends in Number of Crime Convictions, Males Aged 18-45 at Time of Crime ((a)) All crime, excl. traffic ((b)) Property crime ((c)) Ordinary violence Note: Figures show monthly number of crimes, excluding crimes against the Traffic Act, number of property crimes (Penal Law), number of simple violent crimes (as in the paper) indexed to the number of crimes in each category in May 2002 for males aged between 18 and 45 at the time of the crime. The vertical red lines mark the period where the paper’s sample is drawn from. B Data Appendix This section describes the construction of the main sample and variables as outlined in section 3. The construction of the auxiliary sample (for the DDD estimations) is described in the appendix. As noted previously, the paper uses full population Danish register data. I make use of 48 the criminal registers for incarcerations, convictions and charges, the DREAM database, the educational registers, the demographics registers, and the earnings registers. Construction of Sample and Crime Variables First, I select all incarceration spells between 1980 and 2007. Second, I merge the relevant case specific variables to the individual observations of incarceration spells, by using the unique combination of the case and social-security numbers. I then exclude individuals who were not sentenced to imprisonment. This includes individuals who were convicted to detainment (as these are de facto mentally ill) and all individuals who were not found guilty. If a given case has been appealed, I only include the information from the final trial. At this point, the sample includes an individual specific entry for all incarceration spells as results of convictions to imprisonment. To this information, I merge information former and subsequent charges and convictions from the various criminal registers. I also merge information on former and subsequent incarceration spells, by merging the information from the first and second step, as described above, to this data set. I also merge information on demographic characteristics to the data. Limiting the data to men between the age of 18 and 45 and dividing by type of crime allows me to create the upper panel of figure I.3. I create the lower panel of figure I.3 by combining the data of incarceration spells with data on all crimes, together with type of crime and type of conviction (suspended sentence, imprisonment, acquitted, etc.). I only keep individuals who were convicted of ordinary violence as defined by Statistics Denmark’s coding34 in the sample. I also limit the sample to men aged 18 to 45. Finally, I limit the data according to the desired time span (e.g., 18 or 4 months) around the timing of the reform. By plotting the lengths of the various incarceration spells I create figure I.2. 34 See http://dst.dk/tilsalg/forskningsservice/Dokumentation/hkt4forsker/hkt4 variabel liste forsker/ hkt4 variabel.aspx?fk=13217, for a list of codes for the different types of crime, 49 Construction of Outcome Variables The DREAM database contains weekly information on all recipients of public transfers in Denmark along with a specification on type of public transfer (unemployment benefits, social assistance, public pensions, etc.) from 1991 until 2011. I merge the DREAM database to the data introduced above by the individualspecific social security number. I summarize the weekly entries to monthly entries. I also merge information on yearly wage earnings (which is available from 1980 to 2010) to the data. I deflate earnings to 2005 dollars. The individual information on reception of public transfers (e.g., when a given individual received unemployment benefits, social assistance, educational support, public pensions, etc.) allows me to identify the months where a given individual was earning his wages. Hence, I compute monthly earnings. From the information on monthly earnings, unemployment rates, and reception of other public transfers I estimate the content of figure I.1. Construction of Covariates I merge information on educational attainment and various demographics to the data by the individual-specific social security number. The data is reported on yearly basis. However, it also includes information on the date of last entry change, e.g., date of last change to marital status. From this I compute monthly variables. At this point, each individual incarceration spell is a panel of monthly observations of earnings, unemployment, dependency of other public transfers, educational attainment, and marital status together with information on ethnicity, and prior and future crimes. From this information I can estimate the contents of table I.2. Also, I can use the information on each individual’s date of crime relative to the timing of the reform, incarceration length, and covariates in order to estimate the content of table IA.5. Finalizing the Data and Placebo Reforms Finally, I limit the sample only to include information from 12 months prior to beginning of and 36 months following release from the incarceration in question. I discard all observations between the first and the last date 50 of incarceration and pool the monthly observations to biannual observations. The data construction is complete and I can estimate the effects of the reform as reported in section 5. To estimate the content of figure I.4, I construct the data as described in this section, but change the time span conditions to the relevant dates around the timing of the placebo reform in question. I then create a new dummy variable indicating the timing of the placebo reform, which is used for the estimations. I do this for each of the placebo reforms reported in figure I.4. Construction of Auxiliary Sample This auxiliary sample has been constructed as described in section B. First, I construct a sample that consists of men who were aged 18 to 45 at the date of the crime for individuals convicted of various types of fraud, robbery, theft, and tax evasion. Second, I limit the sample to include individuals who committed the crime in focus between December 2000 and November 2003. Table IB.1 shows summary statistics for the auxiliary sample. The table shows that the auxiliary sample also had a low level of resources as measured by socioeconomic variables. The majority were either unemployed or not in the labor force. Most had not completed any education beyond secondary school and few were married or cohabiting. The table also shows that the only significant difference across the reform is the number of immigrants and descendants. 51 Table IB.1: Summary Staticstics of Auxiliary Sample Variable Unemployment Depedendency of other public transfers Earnings Age Married Cohabitant Have children Non-western immigrants or descendants*** No job-qualifying education Vocational or skilled Upper secondary or higher N Full sample 0.37 Control 0.36 Treatment 0.39 (0.47) (0.47) (0.47) 0.17 0.19 0.15 (0.36) (0.38) (0.34) 864 802 926 (1,521) (1,454) (1,585) 26.84 27.28 26.41 (7.62) (7.67) (7.55) 0.19 0.20 0.19 (0.40) (0.40) (0.39) 0.25 0.25 0.25 (0.43) (0.43) (0.43) 0.28 0.30 0.26 (0.45) (0.46) (0.44) 0.20 0.17 0.24 (0.40) (0.37) (0.43) 0.73 0.73 0.73 (0.45) (0.44) (0.45) 0.15 0.15 0.16 (0.36) (0.36) (0.37) 0.12 0.12 0.11 (0.32) (0.32) (0.32) 640 318 320 T-test for differences in the means; significance levels: ∗ : p<10% ∗∗ : p<5% ∗ ∗ ∗ : p<1% Note: Table shows summary statistics for the full sample and divided by treatment status. 52 Part II School Starting Age and the Crime-Age Profile 53 School Starting Age and the Crime-Age Profile Rasmus Landersø, Helena Skyt Nielsen? , and Marianne Simonsen? Abstract This paper uses register-based data to investigate the effects of school starting age on crime. Through this, we provide insights into the determinants of crime-age profiles. We exploit that Danish children typically start first grade in the calendar year they turn seven, which gives rise to a discontinuity in school starting age for children born around New Year. Our analysis speaks against a simple invariant crime-age profile as is popular in criminology: we find that higher school starting age lowers the propensity to commit crime at young ages and that this to some extent is driven by incapacitation. We also find persistent effects on the number of crimes committed for boys. JEL: I21, K42 Keywords: old-for-grade, school start, criminal charges, violence, property crime. Acknowledgments: Comments from Matthew Lindquist, Lance Lochner, Naci Mocan, Magne Mogstad, Bas van der Klaauw, Peter Sandholt Jensen, Anna Piil Damm, as well as seminar participants at BeNA Humboldt University, ISER, University of Essex, University of Sussex, KORA, CAM, University of St. Gallen, Xiamen University, the joint University of Bergen & UCL workshop 2013, the workshop on “Economics of Successful Children” at Aarhus University 2013, the University of Chicago’s Life Cycle and Inequality working group as well as participants at ESPE 2013, EALE 2013, ASSA 2014, the IZA YSW at Georgetown University 2014, and at the DGPE workshop are appreciated. Financial support from the Danish Council for Strategic Research (CSER, 09-070295) and CIRRAU (Simonsen) is gratefully acknowledged. The usual disclaimer applies. Revise and resubmit requested from the Economic Journal. ? Department of Business and Economics, Aarhus University 1 Introduction This paper investigates long-term effects of school starting age on crime while exploiting a discontinuity in school starting age for children born around New Year. Through this, we provide novel insights into the determinants of life-cycle criminal behavior. The crime-age profile refers to an almost universally observed relationship between crime rates and age, where crime rates increase continuously until around age 18-20 and then decrease for the remainder of the life. We use the mechanical relationship between delayed school entry and delayed life-course to address whether the crime-age relationship is entirely caused by age (Gottfredson and Hirschi (1990)) or can be mediated by the timing of key life experiences (Sampson and Laub (1995)). A large literature is concerned with effects of school starting age and subsequent educational outcomes and has convincingly shown that starting school later leads to improved test scores (e.g. Bedard and Dhuey (2006)). Black et al. (2011) and Crawford et al. (2010) refine this type of analysis and show that this result is completely driven by an age-at-test effect: children who start school later are simply older when they perform tests and this leads to better performance. Just as these authors show in the case of test scores, we show that it is important for our analysis whether crime is aligned in terms of age or life-course. In contrast to existing studies, our paper is concerned with crime outcomes. School starting age may affect crime through several channels. Firstly, to some extent, enrollment leads to locking-in or incapacitation: when youth are in school, they simply have less time to commit crime (Lochner (2011)). Previous studies confirm this mechanism: Jacob and Lefgren (2003) and Luallen (2006) exploit plausibly exogenous changes in number of school days,1 1 They study urban schools and find that increasing the number of school days reduce arrests for property crimes but increases arrests for violent crimes. While the effect on property crime is thought to be due to incapacitation, the effect on violent crime is thought to be a network effect (spending more time with criminal peers). 55 while Anderson (2014) uses changes in compulsory schooling laws.2,3 Secondly, postponing school start delays graduation, which may in turn affect opportunity costs of crime at a given age (Grogger (1998), Uggen (2000)). Thirdly, skill formation and behavior may play a role. Cunha and Heckman (2008) show that cognitive and (especially) non-cognitive skills at preschool ages are key determinants of later skill acquisition, behavior, and adult outcomes. If different school starting ages are associated with different levels of skills (school readiness), then these differences may be amplified and affect other outcomes such as the tendency to engage in criminal activities. Lubotsky and Kaestner (2014) find some support for such complementarity as cognitive skills of pupils who are old-for-grade grow faster in kindergarten and first grade, but the gap fades away after first grade. Black et al. (2011) show that higher school starting age leads to improved mental health (for boys) and a lower risk of teenage pregnancies (for girls), while there is conflicting evidence regarding the risk of receiving ADHD diagnoses (Dalsgaard et al. (2012); Elder (2010); Evans et al. (2010)). A final channel is the individuals placement in the age hierarchy. Increasing school starting age by one year will most likely move the individual from being one of the youngest to being one of the oldest children in the classroom (e.g., Gaviria and Raphael (2001) and Sacerdote (2001)). However, the potential effects of such a change are ambiguous. On the one hand, having older peers who are more likely to engage in risky behavior may spark risky behavior at an earlier stage. On the other hand, having older peers might also increase skill acquisition and maturity, thus lowering delinquent behavior and improving educational outcomes. Fredriksson and Öckert (2005) and Black et al. (2013) find no substantial impact of the age composition of peers on educational and labour market outcomes or on teenage pregnancies among girls, and thus rule out that the relative age composition in class explains the impact of school starting age. 2 Anderson estimates that a minimum dropout age of 18 decreases arrest rates for 16-18 year-olds by 17%. The effect is present for both violent and property crime. 3 One note of caution is that crime reports to the police may differ according to whether youth are in school or not. If criminal events taking place in school are treated differently than criminal events taking place outside school, this would lead to similar findings. This issue is likely more relevant for violent crime than for property crime or traffic incidents. 56 To the best of our knowledge, Cook and Kang (forthcoming) is the only other study of the relationship between school starting age and crime. They use administrative data for five cohorts of public school children in North Carolina to show that school starting age decreases juvenile delinquency but increases serious adult crime. As opposed to our study, their analysis is, however, complicated by grade retention and the existence of compulsory school leaving age legislation that creates a mechanical relationship between school starting age and length of compulsory education. Our empirical analyses rely on exogenous variation in school starting age generated by administrative rules. In particular, we exploit that Danish children typically start first grade in the calendar year they turn seven, which gives rise to a fuzzy regression discontinuity design. By comparing children born in December with children born in January we investigate the effects of starting first grade at the age of 6.6 compared to 7.6. Our analysis uses Danish register-based data for children born in the period from 1981-1993 with crucial information on exact birth dates, a range of crime outcomes, and a rich set of background characteristics. We find that higher age at school start lowers the propensity to commit crime at young ages but the effects begin to fade out in the 20s. Hence the crime-age profile can be modified by life-course and is not only determined by age per se. In addition to a delay in the onset of a criminal trajectory, for boys a higher school starting age also causes a persistent reduction in the number of crimes committed, indicating that the persistence of criminal behavior is affected by age and criminal opportunity in unison. Furthermore, we investigate potential mechanisms behind the crime reductions during the teen-years and find that incapacitation does seem to play an important role. For boys, a higher school starting age reduces criminal charges significantly until age 19, and the effect is mainly driven by property crime. For girls, a higher school starting age postpones the initiation of crime, and the effect is driven by violent crime. Finally, we find that the relative age of classroom peers does not seem to be behind the reduction in crime. 57 The paper is structured as follows: Section 2 briefly reviews the Danish institutional set-up and discusses mechanism through which child behavior may be affected by school starting age and Section 3 describes the methodology. Section 4 presents our data, Section 5 the results and finally Section 6 concludes. 2 2.1 Institutional settings and mechanisms Educational Institutions and School Starting Age During the period relevant for this study, Danish law stipulated that education was compulsory from the calendar year of the child’s 7th birthday and until completion of 9th grade.4 This school system is fortunate for a study like ours because there is no automatic relationship between school starting age and minimum required schooling as there would be in the US and the UK systems, for instance. After 9th grade, education was voluntary and could follow an academic path (starting with high school) or a vocational path (starting with vocational school).5 The year before entering first grade, children could enroll in a voluntary preschool class. The preschool class, compulsory schooling from 1st to 9th grade and post-compulsory schooling were free of charge. Furthermore, most children below the age of six were inscribed into some form of public child care, which was heavily subsidized.6 Parents and administrators have considerable leeway in deciding when children should 4 The school starting regulations are not strictly enforced and exemptions are granted based on applications from the parents. Exemptions are granted by the local municipality if it is considered beneficial for the child’s development. School start can only be delayed by one year, and school is no longer compulsory from July 31 in the calendar year of the child’s 17th birthday even if 9th grade has not been completed. School children do not pass or fail grades, but in collaboration with the parents, the school principal can decide that child repeats a grade or jumps a grade again if this is considered beneficial for the childs development. For more details consult the Danish Law of public schools. 5 It was also possible to complete a voluntary 70th grade before continuing on to a vocational or academic path. 6 A minimum of 67 % of the expenses is covered by the local authorities, c.f. the Danish Law of day care. 58 start school.7 Therefore, school starting age is not random and is most likely affected by a range of factors that may also correlate with the child’s outcomes, behavioral as well as academic. For example children’s overall school readiness and behavior in preschool are likely to affect the timing of school start.8 But other factors may impact on the decision as well: as shown by the previous literature, starting later is likely to increase test scores. While this has not been found to impact significantly on long term outcomes such as earnings,9 higher grades may improve the consumption value of attending school and allow for a more extensive educational choice set. Finally, there is considerable variation in school starting age culture across municipalities even conditional on a rich set of observable characteristics. For completeness, we will investigate some of these hypotheses towards the end of the paper. To meaningfully address consequences of school starting age, our empirical analysis will make use of the following observation: because the formal age at school start is defined by the year of birth, each January 1st provides a cut-off point at which children born on each side are subject to a one year difference in timing of administratively determined school start, even though they are born very close in time. Section 3 will formalize this idea. Some parents of children born close to this cut-off date do choose to manipulate their childrens actual school starting age: children born at the end of the year are more likely to postpone school start one year, whereas children born early in the year are more likely to start school one year earlier than the law stipulates.10 In consequence, some children born in December will start school one year later than they are supposed to - approximately at age 7.6 years whereas the remainder of the children born in December will start when their age is around 6.6 years. Likewise, some children born in January will start school at age 6.6, which is 7 UNI-C (2009) documents this and describes background characteristics of children across school starting age. 8 This pattern is clear from Figure A1 that shows the distributions of social and emotional difficulties at age 4 among punctual and late school starters, drawn from an auxiliary data source (the Danish Longitudinal Survey of Children). 9 Fredriksson and Öckert (2013) do find earnings gains for individuals with low-educated parents. 10 A recent white-paper on school start concluded that ’many parents worry whether their children are ready to start school, and these concerns are supported by the preschool staff’, cf. God Skolestart (2006). 59 Figure II.1: Fraction Punctual, Early, and Late School Start, by Birth Date (selected cohorts) ((a)) Girls ((b)) Boys Note: Figure shows the school starting pattern of the full population of children born January 1 1994 January 1 1995. Early school start refers to school start the calendar year the child turns 6, punctual school start refers to school start the calendar year the child turns 7, and late school start refers to school start the calendar year the child turns 8. one year earlier than the law stipulates, while the remainder will start school at age 7.6. As shown in Figure II.1, school starting age for children born around the cut-off date is effectively reduced to a binary outcome: either children start at age 6.6 or they start at age 7.6. If children born around the cut-off date are 7.6 years old at school start, we label them old-for-grade. Figure II.2 shows the fraction of children who are old-for-grade by date of birth for each gender. We see that there is a smooth upward trend in the fraction of girls and boys who are old-for-grade in December followed by a large discontinuity around New Year. The figure also shows that boys are much more likely than girls to be old-for-grade. 2.2 Institutions Guarding Juvenile Crime Below we describe the institutions that may be relevant for understanding the potential impact of schooling and school starting age, in particular, on criminal activity of teenagers. In Denmark, the age of criminal responsibility is 15, which is high in an international 60 Figure II.2: Fraction who are Old-for-Grade, by Date of Birth ((a)) Girls ((b)) Boys Note: Figure shows the fraction of children who are old-for-grade by date of birth around New Year (marked by the vertical line). Being old-for-grade implies that the child starts school at age 7.6 instead of at age 6.6. Averages for population of children born in December or January from December 1981 to January 1993. comparison; England has an age of criminal responsibility of 10, while only few US states have a formal limit and in those cases the limit is 6-12 years.11 Before their 15th birthday, Danish children cannot be arrested, brought to court or imprisoned, although they may be withheld up to 6 hours by the police in which case a social worker must be present during interrogation. This is true regardless of the severity of the crime, and there is no such thing as a youth court.12 At ages 15-17, youths are considered fully responsible for their criminal acts, and may be imprisoned, though this should be separate from adult prisoners.13 Thus, the focus is on prevention and rehabilitation rather than prosecution and punishment. All local authorities have an interdisciplinary framework for prevention of juvenile crime involving the schools, the social services and the police (denoted SSP). This is a network of relevant players who collaborate to understand and prevent juvenile crime in the local area. They are concerned with general, specific as well as individual-oriented policies and interventions. Reported victimisation rates in Denmark are falling like in the rest of the OECD. However, 11 http://www.unicef.org/pon97/p56a.htm. The question of guilt is, in fact, never determined for children below the age of criminal responsibility. The severity of the case is solely considered by the Attorney General. 13 See the Danish Service Act. 12 61 victimisation rates in Denmark are somewhat higher than in Norway and Sweden and also higher than the OECD average (19 v. 16 %) while they are almost on par with the US (18 %) and the UK (21 %) rates.14 Therefore, we have no particular reason to expect that the effects of school starting age on crime should be substantively different in Denmark compared to other countries. 3 Methodology Our goal is to estimate the effect school starting age (SSA) on associated crime outcomes. Our equation of interest is the following: Yi = α + βSSAi + Xi0 γ + εi (1) where Y denotes the outcome, X observable characteristics,15 and ε unobservable characteristics. ε is likely related to the choice of school starting age and would bias results if ignored. To circumvent the problem that SSA is not randomly allocated, we formally employ a strategy similar to Black et al. (2011), Evans et al. (2010), Elder (2010), and Fredriksson and Öckert (2013). In particular, we exploit that school starting rules imply that children born just prior to January 1st are on average younger when they enroll in school than children born immediately after January 1st. In some sense, we can think about administrative school starting age rules as imposing time and effort costs on parents who choose to enroll their child later (or earlier) than prescribed. We can therefore instrument SSA with a dummy for being born immediately after 14 The reported victimisation rates reveal what proportion of a sample of 2000 individuals report that they themselves or persons in their households had experienced one of ten types of conventional crimes (such as vehicle-related crime, theft of personal property and contact crime), see OECD (2009). 15 X includes a constant and child and parental characteristics predictive of SSA and Crime: APGAR score, birth weight, gestation length for children, mothers’ age at the birth of first child, both parents’ education and labour market participation, a flexible function of distance in days to the cut-off, and a constant. 62 January 1st. As argued by the previous literature, such cut-off dates constitute valid instruments in the sense of being uncorrelated with unobserved characteristics of child outcomes. Yet in order to estimate the local average treatment effect - the average effect of being old-for-grade for the group of children who would be inclined to increase their school starting age solely because they were born in January and not December - we also require that the monotonicity assumption is satisfied. Barua and Lang (2012), Aliprantis (2012), and Fiorini et al (2013) argue, however, that monotonicity is likely violated if the school starting age distribution of children born just after the cut-off date does not stochastically dominate the corresponding distribution for children born just before the cut-off date. In the US example given by Barua and Lang (2012) for children born in the 1950s, children born in the last quarter of the year were on average younger at school start than children born in the first quarter of the year but the underlying choices were not monotonically related to the cut-off date: while children born in the last quarter of the year were less likely to start school at age 5 compared to children born in the first quarter of the year, they were at the same time less likely to be very young (4 years or younger) and more likely to be very old (6 years or older). Hence it seems that being born after the cut-off date increases school starting age for some but reduces it for others. We do not find this to be an issue in our setting because no children start more than one year before/after the date at which they are supposed to start. This is illustrated by Figure 1 where we show that school starting age in our case is effectively reduced to a binary variable that indicates whether the child enrolls at age 6.6 or 7.6. Always-takers will start at age 7.6 regardless of when they were born, never-takers will start at age 6.6, and compliers will start at age 6.6 if born in December and 7.6 if born in January. For these groups, the instrument monotonically increases school starting age. We assume away defiers: a child whose parents choose to incur the costs of enrolling him earlier (at age 6.6) than at the age specified by administrative rules if born in January but are at the same time also willing 63 to bear the additional costs of enrolling him later (at age 7.6) than at the age specified by administrative rules if born in December. This type of behavior would both be inconsistent with parental preferences for the child being among the youngest in his classroom as well as preferences for him being the oldest. In practice, we consider a short bandwidth with children born ± 30 days around January 1st. In our main specification, we model SSA as a binary variable indicating a school starting age of 7.6 as opposed to 6.6. Along with various standard robustness tests, we show that results are robust to modelling SSA as a continuous variable. See more discussion in Section 5 below. 4 Data We use Danish register-based data for children born in the period from mid-1981 until mid1993 with crucial information on exact birth dates, charges16 for property crime, violence and other types of crime (in particular traffic incidents), together with the specific dates of crime, and a rich set of background characteristics. Crime Outcomes As is true for most of the existing literature on school starting age, choosing the right outcome is a challenge: on the one hand, one wants to align children in terms of age. This is particularly relevant because crime is positively correlated with age in the age range considered in this paper. On the other hand, one wants to align children in terms of length of education because the agents that decide a child’s school staring age may focus on these outcomes or because education may have a direct effect on the tendency to commit crime. To address these issues, our main outcomes consist of age-specific measures but we also separately consider criminal charges at a given point in the educational cycle. 16 Using charges instead of convictions enables us to use three additional years of data because the time involved in processing cases through courts and subsequent appeals is obviated. Conclusions are robust to using convictions instead of charges. 64 We consider two types of age-specific crime measures: one outcome measures whether an individual has been charged with a crime at a given age from age 15 and onwards. This is a memoryless measure, which simply informs about the tendency to commit crime at any given age (i.e. from one birthday to the next). It is particularly useful for detecting sudden changes in the crime-age profile caused by school starting age. Our other type of outcome measures whether an individual has been charged with a crime at or before a given age and in this way keeps track of earlier incidences. This is convenient if one wants to address more permanent effects on crime. Because of considerable recidivism, both measures are required to give a full picture of the consequences of school starting age on the crime-age curve. We might see negative effects of school starting age on crime at a given age but not at crime at or before a given age if those committing the crime are simply the same individuals. Conversely, we could see effects on crime at or before a given age and not on crime at a given age if school starting age has a longer-lasting effect on criminal behavior. It is clearly important to be able to distinguish between these scenarios. Due to space considerations some of our descriptive analyses and all robustness tests will focus on the accumulated outcome but the full set of descriptives and formal results are available on request. In addition to our main analyses, sub-analyses show results for types of crime (’property crime’, ’violent crime’, and the residual ’other crime’) and number of crimes to address differential effects on the intensive and extensive margin. Figure IIA.2 illustrates means of our main outcome variables. The figure replicates the well-known age pattern where criminal activity peaks at ages 19-20 (Gottfredson and Hirschi (1990)). For girls, 2% are charged with a crime at ages 19-20, while for boys 11% are charged with a crime at ages 19-20, after which age the fraction declines. All over the age range, the proportion charged with a crime is higher for individuals who are old-for-grade compared to individuals who are young-for-grade. Our empirical analysis will reveal to what extent this reflects a causal relationship. 65 To give a better sense of the nature of the crime committed, Table II.1 summarizes the distribution of crime at or before a given age across three types of crime: property crime, violent crime, and other crime.17 Throughout the age distribution, boys are three times more likely to have been charged with a crime than girls. At the youngest ages, property crimes tend to be most prevalent, but after age 18 when the individuals in the sample gradually acquire a drivers license, other crimes including traffic incidents accumulate. For girls, other crime dominates from age 22 onwards, while for boys it dominates already from age 18. Table II.1: Means of selected outcome variables, by types of crime Age 15 16 17 18 19 20 21 22 23 24 25 26 27 All 0.019 0.032 0.044 0.054 0.069 0.083 0.096 0.106 0.116 0.124 0.133 0.138 0.143 Property 0.018 0.029 0.037 0.043 0.049 0.053 0.055 0.056 0.056 0.057 0.057 0.057 0.057 Girls Violence 0.002 0.003 0.005 0.007 0.009 0.010 0.010 0.011 0.011 0.012 0.012 0.012 0.011 Other 0.001 0.003 0.006 0.010 0.021 0.034 0.045 0.057 0.068 0.075 0.086 0.092 0.100 N 48,546 48,546 48,546 48,546 43,668 39,037 34,559 30.209 26,093 22,125 18,240 14,630 11,045 All 0.039 0.081 0.140 0.189 0.243 0.290 0.327 0.358 0.383 0.402 0.417 0.429 0.439 Property 0.033 0.059 0.081 0.101 0.118 0.131 0.141 0.147 0.151 0.154 0.156 0.156 0.157 Boys Violence 0.006 0.014 0.025 0.036 0.047 0.057 0.065 0.070 0.074 0.076 0.079 0.081 0.082 Other 0.007 0.029 0.077 0.119 0.174 0.222 0.262 0.295 0.323 0.345 0.362 0.375 0.388 N 50,383 50,383 50,383 50,383 45,368 40,606 36,012 31,405 26,937 22,781 18,723 14,949 11,273 Note: Table shows fraction of sample who have commited crime until a given age, by age, gender, and crime type. Violent crimes comprise the most severe types. The most common examples are ordinary assaults, aggravated assaults, threats, and violence towards public servants. 80% of convictions for violence result in imprisonment or a suspended sentence for boys and 67% for girls. Property crimes and other crime are typically less severe crime. The most frequent examples of property crime are shoplifting, burglary, and vandalism. A quarter of all convictions for property crime result in imprisonment or a suspended sentence for boys, while this number falls below 10% for girls. The category of other crimes is dominated by traffic related crime (50% for girls and 90% for boys) such as driving a car without a license, while the second 17 Due to space considerations, we have chosen only three broad categories of crimes. Different classifications would be possible. 66 largest category is drug or weapon related crime (e.g. selling drugs or possession of illegal weapons). Convictions for other crimes rarely lead to imprisonment. Table IIA.1 shows mean crime outcomes by birth-month and gender. Those born in December tend to be more likely to have been charged with a crime compared to those born in January. When we consider whether an individual has been charged with a crime at a given age, we see that boys born in December are more likely to have been charged with a crime at each age from 15 to 21, while the pattern is more scattered for girls (top panel). This outcome will be important for our analysis of incapacitation. When we consider the accumulated measure, namely whether or not the individual has been charged with a crime at or before a given age, we see that the difference is significant up until age 24 (bottom panel). As argued above, the accumulated outcome is more informative about potential catching up effects and other long run effects. Figure IIA.3 shows the standard regression-discontinuity plots for the accumulated crime measures at ages 19 and 27. The figure reveals a discontinuity in the raw outcome variable which is only statistically significant for boys at age 19. Measuring School Starting Age Unfortunately, we do not have information on the specific timing of school starting age for the cohorts in question. Instead we use age in 8th grade as an approximation. We do observe childrens exact ages at all grade levels from 2007 and onwards and we use this data to check that the approximation of school starting age by age in 8th grade works very well (see Table IIA.2). The vast majority of children who are old-for-grade at the end of elementary school are old-for-grade already in preschool class, while very few children are redshirted from the first grade and onwards.18 In addition, there is no relationship between the cut-off and being held back or skipping grades during primary school. 18 As an additional check we report results from the first stage regression at various grade levels by use of these recent data. Measurement errors in school starting age will impact on our results to the extent that 67 Background Characteristics Using the registers we combine information on the childrens birth weight, gestational length, APGAR score,19 demographic variables, educational variables, and crime by the unique individual identification number. In addition, we link these data to information about parents’ education and labour market outcomes as measured one year prior to the child’s birth. Importantly, we center all covariates and outcome variables on the cut-off dates instead of by calendar year. Hence, we compare background information on children born in January year t to the information on children born in December year t − 1 instead of comparing information on children born in January year t to the information on children born in December year t.20 Similarly, outcome variables aligned by age for individuals born on each side of the cut-off date are measured at the same point in time: exactly one’s birthday. Table II.2 shows joint F-tests from a regression of the instrument on the rich set of background variables for children born ± 30 days around January 1st for girls and boys separately. These tests clearly suggest that the sample is balanced across the cut-off. Table A3 in the Appendix shows mean background characteristics for December and January born children. These are not jointly significantly different either (p=0.19). We see that for some variables, means are significantly different but the differences are all very small in size21 and the sign of the difference often varies by gender. We include all variables as covariates and bound potential biases by the approach by Nevo and Rosen (2012); see discussion in Section 5. they are correlated with the instrument. If children born in December are more likely to repeat a grade as suggested by Elder and Lubotsky (2009), our results will be biased towards zero. 19 The APGAR score ranges from zero to 10 and summarizes the health of a newborn child based on five simple criteria: Appearance, Pulse, Grimace, Activity and Respiration. 20 For children born in December 1981 or January 1982 we use parental characteristics measured in 1980, while we for children born in December 1982 or January 1983 use parental characteristics measured in 1981 etc. 21 The difference in birth weight is for example 16 grams, which corresponds to 0.03 point difference in IQ according to Black et al. (2007). Figure IIA.4 shows the variation of birthweight on either side of the cut off. Even though the (numerically small) difference is significantly different for boys and girls, there is no systematic pattern across dates. 68 Table II.2: Balancing Tests: Regression of Instrument on Background Characteristics Girls 0.53 0.94 48,546 X X F-statistic P-value N Distance to cut-off Covariates Boys 0.70 0.81 50,383 X X Note: Table shows F-statistics and associated pvalues from regressions of birth-month (January=1) on the full set of covariates (background characteristics presented in Table IIA.3 and cohort fixed effects) as well as distance to cut-off in days. 5 5.1 Results Timing of Birth Within the Calendar Year and School Starting Age Table II.3 presents our first stage results, using a dummy for birth in January as instrument for school starting age. The table shows the first stage results estimated both with and without controls. All specifications include cohort fixed effects (indicator variables for being born Dec 1981-Jan 1982, Dec 1982-Jan 1983 etc.) and the distance in days to the cut-off linearly.22 Remaining estimates may be found in Table IIA.4 in the Appendix. In line with Figure II.2, we see that the instrument strongly predicts whether children start school at age 7.6 or 6.6: children born in January are significantly more likely to be relatively old when they start school compared to children born in December and the effect is large. This is despite the tendency for some children born in December to delay enrolment and start at age 7.6 instead. The instrument is highly significant and the associated Fstatistic easily passes the Staiger-Stock rule-of-thumb.With one endogenous variable and 1 instrument, F should be greater than 10,23 22 Results are robust to including more flexible polynomials of the running variable. As discussed above, our measure of school starting age is based on age in grade 8 rather than actual age at school start. In Table IIA.5, we use the richer data from 2007 onwards to demonstrate that the first stage is literally unaffected by this approximation; for individuals born Dec 2000-Jan 2001, we can compare first stage as measured by being old-for-grade in preschool versus 2nd grade, while for individuals born Dec 1996-Jan 1997, we can compare first stage as measured by being old-for-grade in 4th versus 8th grade. While the first stage differs slightly across cohorts, it does not differ by grade level and thus supports the approximation 23 69 Table II.3: First Stage Estimation Results: Children Born in December and January January=1 Days to cut-off, December Days to cut-off, January Constant N Cohort Fixed Effects Remaining covariates F-value for January dummy Girls 0.245∗∗∗ 0.244∗∗∗ -0.005∗∗∗ -0.005∗∗∗ 0.003∗∗∗ 0.003∗∗∗ 0.714∗∗∗ 2.282∗∗∗ 48,546 50,383 X X X 877.29 891.59 Boys 0.172∗∗∗ 0.171∗∗∗ -0.005∗∗∗ -0.005∗∗∗ 0.001∗∗∗ 0.001∗∗∗ 0.872∗∗∗ 1.540∗∗∗ X 574.01 X X 576.22 Note: Table shows results from linear regressions of indicators for starting school at age 7.6 instead of 6.6 while conditioning on the cut-off dummyies (January=1), distance to cut-off, cohort fixed effects and background characteristics (see Table IIA.3). p<0.05: ∗ , p<0.01: ∗∗ , p<0.001: ∗∗∗ . 5.2 Crime Results: 2SLS The Propensity to Commit Crime Figure II.3 shows our main estimation results; see also the corresponding Tables IIA.6 and IIA.7. The left hand side figures show the estimation results for crime at a given age and the right hand side figures show the estimation results for crime at or before a given age (conditional on the full set of covariates). We find that being old-for-grade leads to a significant reduction in the propensity to commit crime at each age until age 19 for boys but only at age 15 for girls. Point estimates at older ages are primarily negative for girls but become very close to zero for boys.24 We cannot formally detect statistically significant differences in coefficient estimates across ages because confidence bands are too wide. Note that individuals who are young-for-grade turn 15 during their final year in comprehensive school, while individuals who are old-for-grade turn 16. Individuals who are young-for-grade turn 18 or 19 during their final year in high school, while individuals who are old-for-grade turn 19 or 20 (depending on whether they took the optional 10th grade or not). Formal analyses of the impact of being old-for-grade on enrolment and completed years of schooling at a given age show that a main effect of being old-for-grade is postponement of the educational cycle (see 24 This is not driven by the smaller sample for which we observe outcomes at all ages. The estimate profile is in fact similar if we only include individuals born in 1985 or before. 70 Figure IIA.5). While short run effects are large and fluctuate widely at different steps of the education-ladder, long-term effects of school starting age on educational outcomes are small or zero (in line with Black et al. (2011)). We therefore interpret the age pattern in our crime results as supportive of the incapacitation hypothesis. It suggests that compulsory school is protective against crime for girls, while also high school is protective against crime for boys. When we instead look at the propensity to commit crime at or before a given age in Figure II.3, we find a statistically significant effect for girls until age 19. After age 19, estimates are again primarily negative but imprecise. Hence it seems that for girls, a higher school starting age initially reduces crime and we see no catching up at older ages. Estimates for boys are significant until age 22, after which they become very close to zero. For boys therefore, we see a longer-lasting initial effect that eventually fade. The fading effects suggest that crime at the extensive margin is aligned to key life events rather than age. If criminal behavior instead was fixed to age, any effects of school starting age on crime should shift the crime-age profile in vertical direction. However, Figure II.3 show that the effects fade in the long run when old-for-graders and young-for-graders educational attainment and life-course converge, which is consistent with a horizontal shift in the crime-age profile. Moreover, the fact that the effects only approach zero from below suggest that the crime-age profile is shifted in both vertical and horizontal direction. The ’delay’ in crime for late starters is large relative to the mean. The share of girls with any criminal charges at or before age 18, for example, is 0.054 among children born 30 days around January first. The effect of starting school at age 7.6, in comparison, is 1.5 percentage points reduction, or just below 30 % of the mean. For boys, the effect of school starting at age 7.6 on criminal charges at age 18 is a 4 percentage point reduction, which should be seen relative to a share of boys with criminal charges of 0.19. Appendix A, Table A6 shows detailed estimation results where we gradually add control variables. Our results are robust to standard robustness checks: extended bandwidth, ’donut hole’ 71 strategies or including polynomials of the assignment variable. In addition, we perform a robustness check computing bounds according to Nevo and Rosen (2012), which allows for imperfect instruments. Our results are robust although the confidence bands are much wider. Finally, results are unchanged by modelling SSA as a continuous variable instead of a binary variable indicating late (7.6 years) as opposed to early (6.6 year) school start. Appendix Figures IIA.7-IIA.9 report a selection of these robustness checks for the outcome crime at or before a given age. Types of Crime In Figure II.4 we distinguish between types of crime. For girls the significant effects of school starting age on crime at or before a given age are mainly driven by the impact on violent crimes, while for boys the effects are primarily driven by the impact on property crimes, although the effects on the two other categories of crime are significant for some ages. 72 Figure II.3: Estimation Results: Crime Across Age ((a)) Girls, at age ((b)) Girls, at or before age ((c)) Boys, at age ((d)) Boys, at or before age Note: Figures show the estimated effects of being old-for-grade based on 2SLS regressions on the probability of crime at (left) and at or before (right) a given age. Cut-off dummy (January=1) used as instrument. Conditioning set includes distance to cut-off, cohort fixed effects, and background characteristics (see Table IIA.3). Dashed lines indicate 95% confidence intervals. 73 74 ((e)) Boys, violence ((d)) Boys, property ((f)) Boys, other crime ((c)) Girls, other crime Note: Figures show the estimated effects of being old-for-grade based on 2SLS regressions on the probability of property, violent, and other crime at (left) and at or before (right) a given age. Cut-off dummy (January=1) used as instrument. Conditioning set includes distance to cut-off, cohort fixed effects, and background characteristics (see Table IIA.3). Dashed lines indicate 95% confidence intervals. ((b)) Girls, violence ((a)) Girls, property Figure II.4: Estimation Results: Crime At or Before Age by Types of Crime Grade Alignment Just as Black et al. (2011) show in the case of test scores, the way we align crime is extremely important for our conclusions. In particular, when age is held constant, the differences between individuals born before or after the cut-off reflect differences in age at school start, educational attainment and the probability to be enrolled in school. On the other hand, when outcomes are aligned by grade levels, the differences reflect the effects of age at school start, age at measurement, and time of measurement. Here we use the grade level that individuals are expected to attend given their timing of school start. Figure II.5 shows results that align children in terms of grades instead of age. The top figures show the estimation results for crime at a given grade and the bottom figures show the estimation results for crime at or before a given grade (conditional on the full set of covariates). If the effects of school starting age only originated from delayed life-course, aligning grade should nullify the effects. The figure shows that the age-gradient is indeed smaller than for the age-aligned crime (though not significantly so); it nullifies the effect at some but not all grade levels (top part of figure). For girls, the effect of school starting age on crime is significantly negative for the two final years in comprehensive school (grade levels 8 and 9). This corresponds to the significantly negative results at age 15 in our main analysis above. For boys the effect is only significant at the transitions from one level to the next in the educational cycle (grade levels 10 and 13). For none of the genders the effects on the accumulated crime measure are significant (bottom part of figure). These results speak in favor of our interpretation of the results presented in Figure II.3; being old-for-grade primarily lowers crime due to changes in the timing of life events, though we also see some signs of lower crime per se around the transitions. 75 Figure II.5: Estimation Results: Crime Across School Grade ((a)) Girls, p(crime) at grade ((b)) Boys, p(crime) at grade ((c)) Girls, p(crime) at or before grade ((d)) Boys, p(crime) at or before grade Note: Figures show the estimated effects of being old-for-grade based on 2SLS regressions on the probability of crime at a given grade level (years since actual school start). Cut-off dummy (January=1) used as instrument. Conditioning set includes distance to cut-off, cohort fixed effects, and background characteristics (see Table IIA.3). Dashed lines indicate 95% confidence intervals. Number of Criminal Charges In Figure II.6, we consider the effects on crime at the intensive margin. The figure presents the effects of increasing school starting age on the number of charges at or before a given age. The estimates for girls are significant in the same age range as the results for the indicator variable above. This is likely because the majority of girls only commit very few crimes and it suggests that the school starting age mainly influences the extensive margin. For boys, however, this 76 Figure II.6: Estimation Results: Number of Crimes At or Before Age ((a)) Girls ((b)) Boys Note: Figures show the estimated effects of being old-for-grade based on 2SLS regressions on the number of crimes before or at a given age. Cut-off dummy (January=1) used as instrument. Conditioning set includes distance to cut-off, cohort fixed effects, and background characteristics (see Table A3). Dashed lines indicate 95% confidence intervals. exercise reveals interesting additional insights that were not clear from the simple indicator analysis: estimates are much larger and effects last long into the twenties. In their midtwenties young men who started school later as a consequence of being born in January have been charged with half a crime less than those who did not. This is a substantial effect which has large consequences for both the offenders and potential victims. Moreover, these large and persistent effects also show that on the intensive margin, the crime-age profile for boys is more strongly related to age than the extensive margin crime-age profile which we investigated earlier.25 Criminal behavior is thus not only determined by either age or key events but by both in interaction; it matters at what age one is exposed to different key events. 25 Even when we align outcomes by grade levels, we see a negative significant effect of being old-for-grade on the accumulated number of crimes at grade levels 11 to 14 for boys (available on request). 77 5.3 Heterogeneity In this section we first investigate characteristics of those individuals who comply with the instrument and then study whether effects vary by subgroups defined by parental background. Complier Characteristics In Table II.4 we summarize the average characteristics of the compliers (those who shift to being old-for-grade because of a change in the value of the instrument; see Abadie (2003)) along with the average characteristics of the full sample as comparison. Families who change the school start decision as a consequence of being born in January rather than December tend to have more favorable characteristics: parents are more often living together, birth weight is higher, and parents have higher education and stronger attachment to the labour market. Heterogeneity by Observable Characteristics Figure II.7 presents heterogeneous effects. When we divide the sample according to mother’s education, we find that the effect of school starting age on crime is numerically smaller and insignificant when mothers have completed at least 12 years of education, which is what we would expect. However, when we divide the sample according to the employment status of the father, the picture is different: for girls, the effect of school starting age on crime is numerically smaller and insignificant when fathers are not employed, while for boys, the effects are of the same magnitude in either case. 78 Table II.4: Complier Characteristics Variable Immigrant Parents married/cohabiting Apgar score=9 Apgar score=8 Apgar score lower Birth weight, grams Gestational length, weeks Mother: Months of schooling Completed HS or equivalent Unemployed Out of the labour force Age at birth of first child Father: Months of schooling Completed HS or equvalent Unemployed Out of the labour force Girls Sample Compliers 0.04 0.02 0.79 0.81 0.18 0.17 0.07 0.06 0.08 0.09 3349.43 3414.11 39.55 39.61 T-value 4.99∗∗∗ -2.72∗∗ 1.39 2.74∗∗ -0.12 -5.90∗∗∗ -1.59 Boys Sample Compliers 0.04 0.02 0.79 0.83 0.19 0.17 0.07 0.08 0.10 0.08 3473.43 3589.40 39.47 39.62 T-value 3.91∗∗∗ -3.45∗∗∗ 1.25 -2,39∗ 1.78 -7.21∗∗∗ -2.67∗∗ 137.41 0.29 0.13 0.11 24.85 139.08 0.29 0.11 0.10 24.91 -2.49∗ 0.08 3.00∗∗ 1.29 -0.76 137.75 0.30 0.13 0.11 24.92 142.92 0.34 0.10∗ 0.09 25.18 -5.87∗∗∗ -3.51∗∗∗ 2,31∗ 1.59 -2.18∗ 140.15 0.19 0.08 0.06 143.26 0.18 0.07 0.04 -3.85∗∗∗ 2.58∗∗ 1,67 4.51∗∗∗ 140.38 0.20 0.07 0.06 146,86 0.23 0.06 0.04 -5.71∗∗∗ -3.29∗∗∗ 2.28∗ 3.26∗∗ Note: Table shows selected mean characteristics of the full sample and compliers (i.e. those who are induced to be old-for-grade because they are born on January 1st and not December 31st) following Abadie (2003). Standard errors calculated from 100 bootstraps. p<0.05: ∗ , p<0.01: ∗∗ , p<0.001: ∗∗∗ 5.4 Potential Mechanisms and Effects on Alternative Outcomes This section first attempts to shed light on some of the different channels through which school starting age may affect crime outcomes. Specifically, we further investigate the importance of incapacitation and also consider the role played by the relative age of peers as in Black et al. (2013). We next address effects on an alternative range of outcomes. We discussed above that parents may choose to enroll their child later in school even if there are no long run effects on income, for example. Because school starting age is linked to grades, it may also be linked to the quality of and consumption value associated with the type of degree. Finally, municipality based variation in culture may impact on parents choices. We first address incapacitation. In Figure IIA.6, we investigate how school starting age affects crime across the week. For boys, the effect is driven by crime committed during weekdays (Mon-Fri) and to a smaller extent by crime committed during weekends (SatSun). For girls, the effect during the weekdays is not statistically different from zero, while 79 80 ((f)) Boys, mother ≥12yos ((e)) Boys, mother <12yos ((g)) Boys, father employed ((c)) Girls, father employed ((h)) Boys, father not empl. ((d)) Girls, father not empl. Note: Figures show the estimated effects of being old-for-grade based on 2SLS regressions on the probability of crime at or before a given age by mothers education (≥12 years of education (40%) v. <12 years of education (60%)) and fathers employment status (employed (86%) v. not employed (14%)). Cut-off dummy (January=1) used as instrument. Conditioning set includes distance to cut-off, cohort fixed effects, and background characteristics (see Table A3). Dashed lines indicate 95% confidence intervals. ((b)) Girls, mother ≥12yos ((a)) Girls, mother <12yos Figure II.7: Estimation Results: Crime At or Before Age by Types of Crime it is for weekends until age 19. We interpret these findings as supportive of incapacitation effect for boys throughout high school, which the age pattern of the main results already pointed at above. For girls the effects are smaller and the mechanism is more subtle: the effect is driven by violent crimes, crimes taking place on weekends and the effect dies out at a younger age. Thus is appears that girls who are old-for-grade are less likely to be involved in this type of crime while they are still in school. Table IIA.8 analyzes the effect of the age of peers in line with Fredriksson and Öckert (2005) and Black et al. (2013). Formally, we include the average age of peers in one’s school in 8th grade as an additional control variable in our models of crime outcomes. To handle endogeneity of the average age of peers, we instrument with the predicted average age of peers had everybody started on time.26 We see that the mean age of peers has no statistically significant effect on crime outcomes and that the effect of own school starting age is completely unaffected by the inclusion of this extra control variable.27 This is in line with the findings in the mentioned previous studies for Norway and Sweden. We finally investigate other mechanisms that may explain why parents choose to postpone school start even when effects on childrens primary long-term outcomes are moderate in size. Table II.5 shows the estimated effects of being old-for-grade on alternative outcomes such as grades and type of degree that may enhance the consumption value of school for children and parents. We find that the impact of school starting age on standardized math grades is statistically significant and large in magnitude.28 This is in line with previous studies supporting large age-at-test effects (Crawford et al. (2010) and Black et al. (2011)). If such grades make a difference for educational preferences or feasible choices, they may influence long term outcomes. Indeed we do see that girls are more likely to enroll in one of the selective and competitive Medical Schools, while boys tend to obtain a slightly longer 26 We impute average age of peers for observations with fewer than 10 other children enrolled at the school in grade 8. 27 Compare the results presented in Table IIA.6 to those in Table IIA.8. 28 Danish grades are not affected (not shown). 81 Table II.5: Effects of School Starting Age on Other Outcomes Variable Grades Math Effort N Years of schooling, age 27 N College enrollment Med School Law School N Distance to cut-off Covariates OLS Girls 2SLS 2SLS OLS Boys 2SLS 2SLS -0.028∗ (0.011) -0.116∗∗∗ (0.011) 27,909 0.502∗∗∗ (0.101) 0.036 (0.090) 27,909 0.510∗∗∗ (0.093) 0.065 (0.088) 27,909 -0.052∗∗∗ (0.014) -0.093∗∗∗ (0.015) 27,974 0.356∗ (0.145) 0.201 (0.150) 27,974 0.295∗∗∗ (0.135) 0.195 (0.146) 27,974 -0.448 (0.050) 11,045 -0.048 (0.136) 11,046 -0.045 (0.124) 11,047 -0.335 (0.052) 11,273 0.234 (0.147) 11,274 0.165 (0.135) 11,275 -0.007∗∗∗ (0.001) -0.007∗∗∗ (0.002) 30,209 0.026∗ (0.011) -0.013 (0.013) 30,209 X 0.026∗ (0.011) -0.013 (0.013) 30,209 X X -0.006∗∗∗ (0.001) -0.003∗∗∗ (0.001) 31,405 0.019 (0.011) -0.006 (0.012) 31,405 X 0.019 (0.011) -0.006 (0.012) 31,405 X X X X Note: Table shows the estimated effects of being old-for-grade based on 2SLS regressions of alternative education outcomes. Cut-off dummy (January=1) used as instrument. Conditioning set includes distance to cut-off, cohort fixed effects, and background characteristics (see Table II.A3). p<0.05: ∗ , p<0.01: ∗∗ , p<0.001: ∗∗∗ education if they are old-for-grade as a consequence of being born on the other side of the cut-off. Organisation (or effort) grades are not affected for boys or girls. To investigate the variation in the enforcement of the stipulated school starting age across municipalities we look at the distribution of predicted school starting age using a rich set of observable characteristics against the actual school starting age across municipalities. Results are available upon request. We find little relationship between the predicted and actual school starting pattern on municipal level. Moreover, in 10% of all municipalities, less than 68% (49%) of all boys (girls) born ±30 days from the cut-off are old-for-grade, while in another 10% of all municipalities more than 84% (67%) are old-for-grade conditional of observables. These numbers would be 50% if all families followed the stipulated school starting rule. This suggests that local school start culture and legal enforcement of the regulations may play a role for the parents decision. 82 6 Conclusion This paper uses Danish register-based data to investigate the effect of school starting age on crime-age profiles while using exogenous variation in school starting age generated by administrative rules. We find that a higher school starting age lowers the propensity to commit crime in youth. In addition, boys experience a persistent reduction in the number of crimes committed. We show that crime at the extensive margin is largely driven by life events whereas crime at the intensive margin is a complex function of both age and life-course. Detailed studies of the age-profile of the effects indicate that the reductions to crime are likely to be caused by an incapacitating effect of schooling, as those who start school later graduate later. Although not directly testable, the pattern of results supports this hypothesis: Boys who are old-for-grade are less likely to be charged during the period until age 19 years, and this effect mainly stems from property crime. Girls who are old-for-grade are less likely to be charged until age 16, and this effect stems from violent crimes. For boys we find significant effects of school starting age on the accumulated number of crimes at or before a given age throughout the twenties. For girls the effects on accumulated crime measures die out which suggests that school starting age only influences the criminal debut. For boys mainly property crime is reduced while for girls violent crime is reduced. We also find that the effects are not caused by relative age of peers but by ones own school starting age. Our results suggest that increasing school starting age could lower crime more so for boys than for girls. Yet, our findings do not necessarily suggest that school starting age should be increased for everybody: our heterogeneity analyses show, for example, that children born to parents with favorable characteristics gain relatively little from an increase in school starting age. More fundamentally, we only analyze school start choice at the individual level and not a policy change influencing all children. 83 References Abadie, A., (2003): Semiparametric instrumental variable estimation of treatment response models. Journal of Econometrics, 113(2): 231-263. Aliprantis, D., (2012): Redshirting, Compulsory Schooling Laws, and Educational Attainment. Journal of Educational and Behavioral Statistics, 37(2): 316-338. Anderson, M. (2014): In School and Out of Trouble? The Minimum Dropout Age and Juvenile Crime. Review of Economics and Statistics 96(2): 318-331. Barua, R. and K. Lang (2012): : School Entry, Educational Attainment and Quarter of Birth: A Cautionary Tale of LATE. Manuscript. Bedard, K. and E. Dhuey (2006): The Persistence of Early Childhood Maturity: International Evidence of Long-Run Age Effects. Quarterly Journal of Economics 121 (4): 1437-1472. Black, S. E., P. J. Devereux and K. G. Salvanes (2007): From the Cradle to the Labor Market? The Effect of Birth Weight on Adult Outcomes. Quarterly Journal of Economics, 122(1), 409-439. Black, S. E., P. J. Devereux and K. G. Salvanes (2011): Too young to leave the nest? The effects of school starting age. Review of Economics and Statistics 93, 455-467. Black, S. E., P. J. Devereux and K. G. Salvanes (2013): Under Pressure? The Effect of Peers on Outcomes of Young Adults. Journal of Labor Economics 31(1), 119-153. Cook, P. J. and S. Kang (forthcoming): Regression-discontinuity Analysis of School Performance, Delinquency, Dropout, and Crime Initiation. American Economic Journal: Applied Economics. 84 Crawford, C., L. Dearden and C. Meghir (2010): When you are born matters: the impact of date of birth on educational outcomes in England. DoQSS WP 10-09. Cunha, F., and J. Heckman (2008): Formulating, Identifying and Estimating the Technology of Cognitive and Noncognitive Skill Formation. Journal of Human Resources, 43(4), 738-782. Dalsgaard, S., M. K. Humlum, H. S. Nielsen and M. Simonsen (2012): Relative standards in ADHD Diagnoses: The role of specialist behavior. Economics Letters 117, 663-665. Elder, T. E. and D. Lubotsky (2009): Kindergarten Entrance Age and Children’s Achievement Impacts of State Policies, Family Background, and Peers. Journal of Human Resources 44(3), 641-683. Elder, T. E., (2010): The importance of relative standards in ADHD diagnoses: Evidence based on exact birth dates. Journal of Health Economics 29, 641-656. Evans, W. N., M. S. Morrill, and S. T. Parente (2010): Measuring excess medical diagnosis and treatment in survey data: the case of ADHD among school-age children. Journal of Health Economics 29, 657-673. Fiorini, M., K. Stevens, M. Taylor and B. Edwards (2013): Monotonically Hopeless? Monotonicity in IV and fuzzy RD designs. Manuscript. Fredriksson, P. and B. Öckert (2005): Is Early Learning Really More Productive? The Effect of School Starting Age on School and Labor Market Performance. IZA DP #1659. Fredriksson, P. and B. Öckert (2013): Life-Cycle Effects of Age at School Start.Economic Journal 124, 977-1004. Gaviria, A., and S. Raphael (2001): School Based Peer Effects and Juvenile Behavior. Review of Economics and Statistics, 83, 257-268. 85 Gottfredson, M. R., and T. Hirschi (1990): A General Theory of Crime. Stanford, CA: Stanford UP. vspace0.2cm Grogger, J. (1998): Market Wages and Youth Crime, Journal of Labor Economics, 16(4), 756-791. Jacob, B. and L. Lefgren (2003): Are Idle Hands the Devil’s Workshop? Incapacitation, Concentration, and Juvenile Crime. American Economic Review 93 (5), 1560-1577. Lochner, L. (2011): Non-Production Benefits of Education: Crime, Health, and Good Citizenship. in Hanushek, Machin and Woessman, eds., Handbook of Economics of Education, vol.4. Elsevier Luallen, J. (2006): School’s Out...Forever: A Study of Juvenile Crime, At-Risk Youths and Teacher Strikes. Journal of Urban Economics, 59:75-103. Lubotsky, D. and R. Kaestner (2014): Effects of Age at School Entry on Child Cognitive and Behavioral Development. Unpublished Manuscript. Nevo, A. and A. M. Rosen (2012): Identification with Imperfect Instruments. Review of Economics and Statistics 94(3): 659-671. OECD (2009): Society at a Glance 2009. OECD Social Indicators. OECD, Paris, France. Sacerdote, B. (2001): Peer Effects With Random Assignment: Results from Dartmouth Roommates. Quarterly Journal of Economics, 116, 681-704. Sampson, R. J. and J. H. Laub (1995): Crime in the Making: Pathways and Turning Points through Life. Harvard UP. Uggen. C. (2000): Work as a Turning Point in the Life Course of Criminals: A Duration Model of Age, Employment, and Recidivism. American Sociological Review 65: 529-46. 86 UNI-C (2009): Age at school start in first grade (In Danish: Alder ved skolestart i frste klasse). UNI-C Aug 6th, 2009. 87 A Supplementary Results Table IIA.1: Means of Selected Outcome Variables by Month of Birth Criminal charge (0/1) at age 15 16 17 18 19 20 21 22 23 24 25 26 27 December 0.021 0.016 0.015 0.014 0.021 0.021 0.021 0.022 0.020 0.020 0.023 0.019 0.019 Girls January Difference N 0.017 0.003∗∗ 48,546 0.015 0.001 48,546 0.013 0.002∗ 48,546 0.013 0.001 48,546 0.020 0.001 43,668 0.022 -0.001 39,037 0.019 0.002 34,559 0.020 0.001 30,209 0.019 0.001 26,093 0.018 0.003 22,125 0.018 0.005∗ 18,240 0.016 0.002 14,630 0.020 -0.001 11,045 December 0.042 0.058 0.093 0.095 0.114 0.117 0.112 0.105 0.101 0.093 0.085 0.081 0.080 Boys January Difference 0.036 0.007∗∗∗ 0.051 0.007∗∗∗ 0.081 0.012∗∗∗ 0.084 0.010∗∗∗ 0.102 0.012∗∗∗ 0.108 0.009∗∗ 0.103 0.008∗∗ 0.097 0.008 0.097 0.005 0.092 0.002 0.082 0.003 0.077 0.004 0.071 0.009 N 50,383 50,383 50,383 50,383 45,368 40,606 36,012 31,405 26,937 22,781 18,723 14,949 11,273 Criminal charge (0/1) at or before age 15 16 17 18 19 20 21 22 23 24 25 26 27 December 0.021 0.035 0.047 0.057 0.073 0.086 0.098 0.109 0.116 0.125 0.136 0.140 0.142 Girls January Difference 0.017 0.003∗∗∗ 0.030 0.004∗∗ 0.041 0.006∗∗∗ 0.051 0.006∗∗∗ 0.065 0.008∗∗∗ 0.081 0.005 0.093 0.005 0.104 0.005 0.115 0.001 0.123 0.003 0.130 0.006 0.136 0.004 0.144 0.001 December 0.042 0.086 0.147 0.196 0.252 0.298 0.336 0.368 0.391 0.410 0.423 0.433 0.446 Boys January Difference 0.036 0.007∗∗∗ 0.076 0.010∗∗∗ 0.133 0.014∗∗∗ 0.181 0.015∗∗∗ 0.234 0.018∗∗∗ 0.282 0.017∗∗∗ 0.318 0.018∗∗∗ 0.348 0.020∗∗∗ 0.374 0.018∗∗∗ 0.395 0.016∗∗ 0.410 0.013 0.424 0.009 0.433 0.013 N 50,383 50,383 50,383 50,383 45,368 40,606 36,012 31,405 26,937 22,781 18,723 14,949 11,273 N 48,546 48,546 48,546 48,546 43,668 39,037 34,559 30,209 26,093 22,125 18,240 14,630 11,045 Table shows fraction of individuals who have been charged for crime at (upper panel) and at or before (lower panel) a given age by gender. Population of children born December 1981 to January 1993. T-test for difference in means across month of birth: p<0.05: ∗ , p<0.01: ∗∗ , p<0.001: ∗∗∗ 88 Table IIA.2: Fraction of Students Being Retained at Each Grade Level Grade level Kindergarten 1st grade 2nd grade 3rd grade 4th grade 5th grade 6th grade 7th grade 8th grade 9th grade Fraction being delayed/retained 0.136 0.014 0.003 0.004 0.003 0.003 0.003 0.002 0.003 0.005 Note: Table shows fraction of children who are retained / red-shirted at each grade level. Calculation based of grade level data from 2007 and onwards. 89 Table IIA.3: Summary Statistics of the Sample Variable December Immigrant 0.043 0.001 Parents married/cohabiting 0.788 0.003 Apgar score=9 0.181 0.002 Apgar score=8 0.071 0.002 Apgar score lower 0.084 0.002 Birth weight. grams 3341 3.987 Gestational length, weeks 39.564 0.012 Father: Months of schooling 137.276 0.230 Completed HS or equvalent 0.292 0.003 Unemployed 0.130 0.002 Out of the labour force 0.104 0.002 Age at birth of first child 24.819 0.027 Father: Months of schooling 139.736 0.281 Completed HS or equvalent 0.191 0.003 Unemployed 0.078 0.002 Out of the labour force 0.065 0.002 Observations 24,279 Girls January 0.037 0.001 0.784 0.003 0.184 0.002 0.067 0.002 0.085 0.002 3358 3.813 39.543 0.011 T-value 0.006∗∗ 0.004 -0.002 0.004 -0.001 -16.59∗∗ 0.021 137.502 0.235 0.287 0.003 0.125 0.002 0.111 0.002 24.886 0.027 -0.226 140.507 0.277 0.190 0.003 0.077 0.002 0.062 0.002 24,267 -0.771 0.005 0.005 -0.006∗ -0.068 0.002 0.001 0.003 December 0.042 0.001 0.789 0.003 0.187 0.002 0.066 0.002 0.097 0.002 3466 4.152 39.482 0.012 Boys January 0.034 0.001 0.792 0.003 0.184 0.002 0.070 0.002 0.094 0.002 3481 3.909 39.464 0.012 136.986 0.229 0.290 0.003 0.128 0.002 0.105 0.002 24.851 0.026 138.480 0.226 0.302 0.003 0.122 0.002 0.105 0.002 24.990 0.026 139.544 0.277 0.195 0.002 0.076 0.002 0.064 0.064 25,157 141.215 0.273 0.204 0.003 0.073 0.002 0.061 0.001 25,226 T-value 0.008∗∗∗ -0.003 0.004 -0.004 0.003 -15.02∗∗ 0.019 -1.494∗∗∗ -0.013∗∗ 0.006∗ 0.000 -0.139∗∗∗ -1.671∗∗∗ -0.009∗ 0.004 0.003 Note: Table shows summary statistics of the sample by month of birth and gender. Std. errors shown below sample means. Columns T-value shows results from t-test of difference in sample means. p<0.05: ∗ , p<0.01: ∗∗ , p<0.001: ∗∗∗ 90 Table IIA.4: First Stage Results (suppl. to Table II.3 in main text) Variable January=1 Days to cut-off, December Days to cut-off, January 0.245∗∗∗ (0.008) -0.005∗∗∗ (0.000) 0.003∗∗∗ (0.000) Apgar score=9 Apgar score=8 Apgar score lower than 8 Immigrant Birth weigth Gestation length Gestation length squared Mother’s months of schooling Mother’s months of schooling sq. Father’s months of schooling Father’s months of schooling sq. Mother has compl. high school Father has compl. high school Mother unemployed Mother out of labour force Father unemployed Father out of labour force Mother’s age at first child Parents are married Girls 0.245∗∗∗ 0.244∗∗∗ (0.008) (0.008) -0.005∗∗∗ -0.005∗∗∗ (0.000) (0.000) 0.003∗∗∗ 0.003∗∗∗ (0.000) (0.000) 0.005 0.005 (0.005) (0.005) 0.014 0.012 (0.008) (0.008) 0.015 0.011 (0.008) (0.008) ∗∗ -0.033 -0.041∗∗∗ (0.011) (0.012) -0.000∗∗∗ -0.000∗∗∗ (0.000) (0.000) -0.065∗∗∗ -0.070∗∗∗ (0.019) (0.019) 0.001∗∗ 0.001∗∗ (0.000) (0.000) 0.001∗∗∗ (0.000) -0.000∗∗∗ (0.000) 0.001∗∗ (0.000) -0.000∗∗∗ (0.000) -0.026∗∗∗ (0.006) -0.065∗∗∗ (0.007) 0.020∗∗ (0.006) 0.002 (0.007) 0.017∗ (0.008) 0.009 (0.009) 0.002∗∗∗ (0.001) 0.001 (0.005) Cut-off 1981-1982 Cut-off 1982-1983 Cut-off 1983-1984 Cut-off 1984-1985 Cut-off 1985-1986 Cut-off 1986-1987 Cut-off 1987-1988 Cut-off 1988-1989 Cut-off 1989-1990 Cut-off 1990-1981 Cut-off 1992-1993 Constant N 0.714∗∗∗ (0.006) 48,546 2.294∗∗∗ (0.369) 48,546 2.282∗∗∗ (0.368) 48,546 0.246∗∗∗ (0.008) -0.005∗∗∗ (0.000) 0.003∗∗∗ (0.000) 0.005 (0.005) 0.012 (0.008) 0.013 (0.008) -0.067∗∗∗ (0.012) -0.000∗∗∗ (0.000) -0.068∗∗∗ (0.019) 0.001∗∗ (0.000) 0.001∗∗∗ (0.000) -0.000∗∗∗ (0.000) 0.000∗ (0.000) -0.000∗∗ (0.000) -0.022∗∗∗ (0.006) -0.060∗∗∗ (0.007) 0.017∗∗ (0.006) -0.003 (0.007) 0.020∗ (0.008) -0.002 (0.009) 0.000 (0.001) 0.004 (0.005) -0.143∗∗∗ (0.010) -0.132∗∗∗ (0.010) -0.135∗∗∗ (0.010) -0.145∗∗∗ (0.010) -0.117∗∗∗ (0.010) -0.088∗∗∗ (0.010) -0.045∗∗∗ (0.010) -0.029∗∗ (0.010) -0.032∗∗∗ (0.010) 0.006 (0.009) 0.006 (0.009) 2.372∗∗∗ (0.365) 48,546 0.172∗∗∗ (0.007) -0.005∗∗∗ (0.000) 0.001∗∗∗ (0.000) 0.872∗∗∗ (0.005) 50,383 Boys 0.171∗∗∗ 0.171∗∗∗ (0.007) (0.007) -0.005∗∗∗ -0.005∗∗∗ (0.000) (0.000) 0.001∗∗∗ 0.001∗∗∗ (0.000) (0.000) -0.001 -0.001 (0.005) (0.005) 0.003 0.001 (0.007) (0.007) 0.006 0.004 (0.006) (0.006) ∗∗∗ -0.097 -0.070∗∗∗ (0.009) (0.011) -0.000∗∗∗ -0.000∗∗∗ (0.000) (0.000) -0.035∗ -0.034∗ (0.017) (0.017) 0.000 0.000 (0.000) (0.000) 0.001∗∗∗ (0.000) -0.000∗∗∗ (0.000) 0.001∗∗∗ (0.000) -0.000∗∗ (0.000) -0.005 (0.005) -0.051∗∗∗ (0.006) 0.007 (0.006) -0.025∗∗∗ (0.006) 0.001 (0.007) 0.009 (0.008) 0.004∗∗∗ (0.001) 0.002 (0.005) 1.707∗∗∗ (0.321) 50,383 1.540∗∗∗ (0.321) 50,383 0.171∗∗∗ (0.007) -0.005∗∗∗ (0.000) 0.001∗∗∗ (0.000) -0.003 (0.005) -0.002 (0.007) 0.004 (0.006) -0.097∗∗∗ (0.011) -0.000∗∗∗ (0.000) -0.031 (0.017) 0.000 (0.000) 0.001∗∗∗ (0.000) -0.000∗∗∗ (0.000) 0.000∗∗ (0.000) -0.000∗ (0.000) -0.002 (0.005) -0.048∗∗∗ (0.006) 0.005 (0.006) -0.029∗∗∗ (0.006) 0.004 (0.007) -0.000 (0.008) 0.002∗∗∗ (0.000) 0.005 (0.005) -0.147∗∗∗ (0.009) -0.136∗∗∗ (0.009) -0.117∗∗∗ (0.009) -0.109∗∗∗ (0.009) -0.095∗∗∗ (0.009) -0.070∗∗∗ (0.009) -0.042∗∗∗ (0.009) -0.010 (0.008) -0.012 (0.008) 0.016∗ (0.008) 0.002 (0.008) 1.590∗∗∗ (0.318) 50,383 Note: Table shows results from linear regressions of indicators for starting school at age 7.6 instead of 6.6 while conditioning on cut-off dummies (January=1), distance to cut-off, cohort fixed effects and background characteristics (see Table IIA.3). p<0.05: ∗ , p<0.01: ∗∗ , p<0.001: ∗∗∗ 91 92 8th grade Born December 1996 January 1997 0.231∗∗∗ 0.215∗∗∗ 0.129∗∗∗ (0.023) (0.023) -0.005∗∗∗ -0.005∗∗∗ (0.001) (0.001) 0.003∗∗∗ 0.003∗∗∗ (0.001) (0.001) 0.795∗∗∗ 0.797∗∗∗ (0.017) (0.017) 5,264 5,264 275.32 262.96 Girls 4th grade Boys Preschool 2nd grade 4th grade 8th grade Born Born December 2000 December 1996 January 2001 January 1997 0.134∗∗∗ 0.112∗∗∗ 0.109∗∗∗ (0.017) (0.018) (0.018) (0.018) -0.004∗∗∗ -0.004∗∗∗ -0.004∗∗∗ -0.003∗∗∗ (0.001) (0.001) (0.001) (0.001) 0.001∗∗∗ 0.001∗∗∗ 0.001∗∗∗ 0.001∗∗∗ (0.001) (0.001) (0.001) (0.001) 0.941∗∗∗ 0.941∗∗∗ 0.923∗∗∗ 0.928∗∗∗ (0.013) (0.013) (0.013) (0.013) 5,328 5,328 5,433 5,433 162.94 173.46 146.03 132.19 Note: Table shows results from linear regressions of indicators for starting school at age 7.6 instead of 6.6 while conditioning on cut-off dummies (January=1), distance to cut-off, cohort fixed effects and background characteristics (see Table A3). School starting age is measured at different grade levels in the period 2007-2013. p<0.05: ∗ , p<0.01: ∗∗ , p<0.001: ∗∗∗ . N F-value for January dummy Constant Days to cut-off, January Days to cut-off, December January=1 Preschool 2nd grade Born December 2000 January 2001 0.249∗∗∗ 0.251∗∗∗ (0.024) (0.024) -0.005∗∗∗ -0.005∗∗∗ (0.001) (0.001) 0.003∗∗∗ 0.003∗∗∗ (0.001) (0.001) 0.792∗∗∗ 0.809∗∗∗ (0.016) (0.017) 4,977 4,977 280.69 287.42 Table IIA.5: First Stage Result at Different Grade Levels Table IIA.6: Detailed Estimation Results: Crime At or Before Age Criminal charge (0/1) at or before age 15 16 17 18 19 20 21 22 23 24 25 26 27 Distance to cut-off Yearly cut-off FE Covariates OLS -0.005∗∗∗ (0.001) -0.003 (0.002) -0.003 (0.002) -0.003 (0.002) -0.003 (0.002) -0.001 (0.003) -0.004 (0.003) -0.005 (0.004) -0.002 (0.004) -0.001 (0.004) -0.002 (0.005) -0.004 (0.006) 0.001 (0.007) X X Girls 2SLS 2SLS -0.009∗ -0.008∗ (0.003) (0.003) -0.011∗ -0.010∗ (0.004) (0.004) -0.016∗∗ -0.015∗∗ (0.005) (0.005) -0.017∗∗ -0.016∗∗ (0.006) (0.006) -0.020∗∗ -0.019∗∗ (0.007) (0.007) -0.012 -0.011 (0.008) (0.007) -0.012 -0.011 (0.008) (0.008) -0.012 -0.011 (0.009) (0.009) -0.002 -0.000 (0.010) (0.010) -0.006 -0.005 (0.012) (0.011) -0.015 -0.013 (0.013) (0.013) -0.011 -0.010 (0.015) (0.014) 0.004 0.004 (0.017) (0.017) X X X X X N 48,546 48,546 48,546 48,546 43,668 39,037 34,559 30,209 26,093 22,125 18,240 14,630 11,045 OLS -0.005∗ (0.002) -0.009∗∗∗ (0.003) -0.006 (0.004) -0.002 (0.004) 0.001 (0.005) 0.000 (0.005) 0.004 (0.005) 0.004 (0.006) 0.008 (0.006) 0.005 (0.007) 0.003 (0.008) 0.005 (0.009) 0.004 (0.010) X X Boys 2SLS 2SLS -0.025∗∗∗ -0.019∗∗ (0.007) (0.006) -0.035∗∗∗ -0.025∗∗ (0.009) (0.009) -0.051∗∗∗ -0.037∗∗ (0.012) (0.012) -0.054∗∗∗ -0.038∗∗ (0.013) (0.013) -0.066∗∗∗ -0.045∗∗ (0.015) (0.015) -0.058∗∗∗ -0.036∗ (0.016) (0.016) -0.061∗∗∗ -0.039∗ (0.017) (0.017) -0.068∗∗∗ -0.043∗ (0.019) (0.018) -0.057∗∗ -0.035 (0.020) (0.019) -0.049∗ -0.030 (0.021) (0.020) -0.038 -0.018 (0.023) (0.022) -0.026 -0.006 (0.025) (0.024) -0.036 -0.019 (0.028) (0.027) X X X X X N 50,383 50,383 50,383 50,383 45,368 40,606 36,012 31,405 26,937 22,781 18,723 14,949 11,273 Note: Table shows the estimated effects of being old-for-grade based on 2SLS regressions on crime at or before a given age. Cut-off dummy (January=1) used as instrument. Conditioning set includes distance to cut-off, cohort fixed effects, and background characteristics (see Table A3). p<0.10: + , p<0.05: ∗ , p<0.01: ∗∗ , p<0.001: ∗∗∗ 93 Table IIA.7: Detailed Estimation Results: Crime At Age Criminal charge (0/1) at age 15 16 17 18 19 20 21 22 23 24 25 26 27 Distance to cut-off Yearly cut-off FE Covariates OLS -0.005∗∗∗ (0.001) 0.002 (0.001) 0.001 (0.001) 0.001 (0.001) 0.001 (0.001) 0.002 (0.002) -0.001 (0.002) 0.001 (0.002) 0.002 (0.002) -0.001 (0.002) -0.000 (0.002) -0.000 (0.002) -0.000 (0.003) X X Girls 2SLS 2SLS -0.009∗ -0.008∗ (0.003) (0.003) -0.003 -0.003 (0.003) (0.003) -0.006∗ -0.006∗ (0.003) (0.003) -0.002 -0.002 (0.003) (0.003) -0.002 -0.002 (0.004) (0.004) 0.003 0.003 (0.004) (0.004) -0.004 -0.004 (0.004) (0.004) -0.003 -0.002 (0.004) (0.004) -0.002 -0.002 (0.005) (0.004) -0.007 -0.007 (0.005) (0.005) -0.012∗ -0.011∗ (0.005) (0.005) -0.006 -0.006 (0.006) (0.006) 0.002 0.002 (0.007) (0.007) X X X X X N 48,546 48,546 48,546 48,546 43,668 39,037 34,559 30,209 26,093 22,125 18,240 14,630 11,045 OLS -0.005∗ (0.002) -0.007∗∗ (0.002) 0.001 (0.003) 0.004 (0.003) 0.002 (0.003) 0.002 (0.004) 0.011∗∗ (0.004) 0.003 (0.004) 0.006 (0.004) 0.009∗ (0.004) 0.005 (0.004) -0.002 (0.005) 0.002 (0.005) X X Boys 2SLS 2SLS -0.025∗∗∗ -0.019∗∗ (0.007) (0.006) -0.026∗∗∗ -0.019∗ (0.008) (0.008) -0.044∗∗∗ -0.036∗∗∗ (0.010) (0.009) -0.039∗∗∗ -0.030∗∗ (0.010) (0.010) -0.045∗∗∗ -0.033∗∗ (0.011) (0.011) -0.031∗∗ -0.021 (0.011) (0.011) -0.029∗ -0.019 (0.012) (0.011) -0.028∗ -0.018 (0.012) (0.012) -0.015 -0.005 (0.012) (0.012) -0.005 0.003 (0.012) (0.012) -0.009 -0.001 (0.013) (0.013) -0.011 -0.003 (0.014) (0.013) -0.026 -0.020 (0.015) (0.015) X X X X X N 50,383 50,383 50,383 50,383 45,368 40,606 36,012 31,405 26,937 22,781 18,723 14,494 11,273 Note: Table shows the estimated effects of being old-for-grade based on 2SLS regressions of crime at or before a given age. Cut-off dummy (January=1) used as instrument. Conditioning set includes distance to cut-off, cohort fixed effects, and background characteristics (see Table A3). p<0.10: + , p<0.05: ∗ , p<0.01: ∗∗ , p<0.001: ∗∗∗ 94 Table IIA.8: Absolute and Relative Effects of being ’Old-for-Grade’: Crime At or Before Age Criminal charge (0/1) at or before age 15 16 17 18 19 20 21 22 23 24 25 26 27 Distance to cut-off Yearly cut-off FE Covariates Old-for-grade -0.008∗ (0.003 -0.010∗ (0.004) -0.015∗∗ (0.005) -0.016∗∗ (0.006) -0.018∗∗ (0.007) -0.010 (0.008) -0.009 (0.00)8 -0.010 (0.009) 0.001 (0.010) -0.002 (0.012) -0.008 (0.013) -0.006 (0.015) 0.005 (0.018) X X X Girls Peer age 0.014 (0.028) -0.033 (0.036) -0.011 (0.042) -0.006 (0.046) -0.060 (0.054) -0.063 (0.061) -0.113 (0.068) -0.116 (0.076) -0.075 (0.083) -0.124 (0.094) -0.230 (0.102) -0.152∗ (0.113) -0.030 (0.130) X N 48,546 48,546 48,546 48,546 43,668 39,037 34,559 30,209 26,093 22,125 18,240 14,630 11,045 Old-for-grade -0.019∗ (0.006) -0.025∗ (0.009) -0.037∗∗ (0.012) -0.038∗∗ (0.013) -0.046∗∗ (0.015) -0.037∗ (0.016) -0.041∗ (0.017) -0.044∗ (0.018) -0.036 (0.019) -0.030 (0.021) -0.017 (0.022) 0.005 (0.025) -0.009 (0.028) X X X Boys Peer age 0.002 (0.044) -0.017 (0.062) -0.058 (0.078) 0.049 (0.088) 0.116 (0.101) 0.065 (0.112) 0.093 (0.119) 0.027 (0.129) 0.054 (0.136) 0.022 (0.146) -0.058 (0.159) -0.321 (0.177) -0.321 (0.201) N 50,383 50,383 50,383 50,383 45,368 40,606 36,012 31,405 26,937 22,781 18,723 14,949 11,273 X Note: Table shows the estimated effects of being old-for-grade based on 2SLS regressions on crime at or before a given age. Cut-off dummy (January=1) used as instrument for own school starting decision, predicted school starting age of peers if compliant with rules used as instrument for average peer age. In addition to average peer age, conditioning set includes distance to cut-off, cohort fixed effects, and background characteristics (see Table A3). p<0.05: ∗ , p<0.01: ∗∗ , p<0.001: ∗∗∗ 95 Figure IIA.1: Age 4 Social and Emotional Difficulties Among Punctual and Late School Starters Note: Figure shows kernel density densities of social and emotional difficulties scores at age 4 by school starting age. Data stem from the Danish Longitudinal Survey of Children that surveys children born in September and October of 1995. ’Punctual school starters’ obey the rules and start school when they are supposed to start according to the rules, while late school starters have been granted an exemption. 96 97 ((d)) Boys, age 27 ((b)) Girls, age 27 Note: Figures show Crime-Age Profiles: The fraction of individuals who have been charged with a crime at (left) and at or before (right) a given age. Population of children born 1981-1993 ((c)) Boys, age 19 ((a)) Girls, age 19 Figure IIA.2: Crime At or Before Age 98 ((d)) Boys, age 27 ((b)) Girls, age 27 Note: Figures show scatterplots of crime at or before a given age (19 and 27) by date of birth. The solid line is a local polynomial smoothed line and the corresponding dashed lines indicate 95% confidence intervals. ((c)) Boys, age 19 ((a)) Girls, age 19 Figure IIA.3: Crime At or Before Age Figure IIA.4: Mean Birthweight for Girls (bottom) and Boy (top) by Date of Birth Note: Figure shows average birth weight measured in grams, by date of birth and gender. Population of children born in December or January 1981-1993. 99 Figure IIA.5: Estimation Results: Years of Completed Schooling ((a)) Girls ((b)) Boys Note: Figures show the estimated effects of being old-for-grade based on 2SLS regressions on years of completed schooling at a given age. Cut-off dummy (January=1) used as instrument. Conditioning set includes distance to cut-off, cohort fixed effects, and background characteristics (see Table A3). Dashed lines indicate 95% confidence intervals. 100 101 ((d)) Boys, weekend ((c)) Girls, weekend Note: Figures show the estimated effects of being old-for-grade based on 2SLS regressions on the probability of crime by weekdays (Mon-Fri) and weekends (Sat-Sun). Cut-off dummy (January=1) used as instrument. Conditioning set includes distance to cut-off, cohort fixed effects, and background characteristics (see Table A3). Dashed lines indicate 95% confidence intervals. ((b)) Boys, weekday ((a)) Girls, weekday Figure IIA.6: Estimation Results: Crime At or Before Age by Weekday/Weekend Figure IIA.7: Estimation Results: Crime At or Before Age by Continuous School Starting Age ((a)) Girls ((b)) Boys Note: Figures show the estimated effects of continuous school starting age based on 2SLS regressions on the probability of crime at or before a given age. Cut-off dummy (January=1) used as instrument. Conditioning set includes distance to cut-off, cohort fixed effects, and background characteristics (see Table A3). Dashed lines indicate 95% confidence intervals. 102 103 ((e)) Boys, ± 60 days ((d)) Boys, ± 30 days ((f)) Boys, ± 90 days ((c)) Girls, ± 90 days Note: Figures show the estimated effects of being old-for-grade based on 2SLS regressions on the probability of crime at or before a given age using varying sample bandwidths. Cut-off dummy (January=1) used as instrument. Conditioning set includes distance to cut-off, cohort fixed effects, and background characteristics (see Table A3). Dashed lines indicate 95% confidence intervals. ((b)) Girls, ± 60 days ((a)) Girls, ± 30 days Figure IIA.8: Estimation Results: Crime At or Before Age, Extended Bandwidth Figure IIA.9: Estimation Results: Crime At or Before Age with Nevo-Rosen Bounds ((a)) Girls ((b)) Boys Note: Figures show estimated effects of being old-for-grade on crime at or before a given age applying Nevo Rosen bounds (Nevo and Rosen (2012)) and using a uniform cut-off as instrument and no covariates. Because the covariance between the instrument (here defined as January=1) and the treatment (old-for-grade) is positive σOf G,Z , as being born in January predicts a higher probability of being old-for-grade, the bounds are given as Bounds B ∗ = σOf G (σOf G,Z −σZ σOf G ) σOf G σZ,y −σOf G σOf G,y ] σOf G (σOf G,Z −σZ σOf G ) [ σOf G σZ,y −σOf G σOf G,y , βZIV ] [βZIV , if σOf G,Z > 0 if σOf G,Z < 0 Where Z denotes the instrument, y crime at or before a given age ,and Of G a binary indicator of being old-for-grade. Standard errors computed from 50 bootstraps 104 Part III The Effects of Admissions to Psychiatric Hospitals 105 The effects of admissions to psychiatric hospitals Rasmus Landersø and Peter Fallesen? Abstract his paper studies the effects of an admission to a psychiatric hospital on subsequent psychiatric treatments, self-inflicted harm, crime, and labour market outcomes. To circumvent non-random selection into hospital admission we use a measure of hospital occupancy rates the weeks prior to a patient’s first contact with a psychiatric hospital as an instrument. Admission reduces crime rates and self-harming behaviour substantially in the short run, but leads to higher re-admission rates and lower labour market attachment in the long run. Effects are heterogeneous across observable and unobserveable patient characteristics. We also identify positive externalities of admissions on spouses’ employment rates. Keywords: crime, inpatient care, labour market, mental health, treatment effects JEL: I10, J10, K42 The authors thank Richard Breen, Mette Ejrnæs, Jane Greve, Eskil Heinesen, Helena Skyt Nielsen, Marianne Simonsen, Torben Tranæs, Christopher Wildeman, Hanne-Lise Falgreen Eriksen, Erik Roj Larsen, Bas van der Klaauw, Peter Sandholt Jensen, Anna Piil Damm, and seminar participants at the Health Economics Workshop at University of Chicago, EEA Conference 2014, Labour and Public Policy Seminar at Aarhus University, the Danish National Centre for Social Research, SDU Applied Micro Workshop, RES 2015 conference, and the SPP 1764 conference ? Rockwool Foundation Research Unit, University of Copenhagen 106 1 Introduction Psychiatric disorders are costly for society and may have substantial negative consequences for both the patients and their relatives. Psychiatric patients have lower labour market attachment, more somatic health problems, and higher crime rates than the general population, and may burden their next of kin (e.g. Ettner et al. , 1997; Greve & Nielsen, 2013; Kupers & Toch, 1999; Noh & Turner, 1987). Finding the optimal strategy for treating psychiatric disorders will therefore reduce strain on society, patients, and their families. During the past decades, most OECD countries have downsized treatment capacity at psychiatric hospitals substantially (WHO, 2011a), even though little is known about the causal effects of admitting a patient to a psychiatric hospital on the patient’s later outcomes. Hence, the consequences of lowering hospital admission rates are largely unknown. Our study examines the effects of admitting a patient as an inpatient upon first contact with psychiatric health care on the patient’s subsequent contacts and admissions to psychiatric hospitals, criminal and self-harming behaviour, and on the patients’ and their spouses’ labour market outcomes. We use a sample of 24,277 adults aged 18–45 who had their first contact with a psychiatric hospital between 1999 and 2001. We use Danish administrative data containing information on all contacts to mental health facilities for all Danish citizens. We address the fundamental differences between the counterfactual outcomes of individuals who are admitted as inpatients and individuals who are not admitted by using an instrumental variable: the intensity of patient contacts to a hospital during the weeks before an individual’s first contact, which serves as a proxy for a given hospital’s occupancy rate. By exploiting the variation in contact intensity we can identify the causal effect of admitting individuals as inpatients at first contact relative to not providing immediate care. Our results show that immediate hospital admissions have large but ambiguous effects. In the short run, inpatient care reduces the patients’ adverse behaviour. In particular, we 107 find that inpatient treatment leads to large reductions in crime shortly after the admission. Admitting an extra 100 marginal patients to inpatient care at first contact leads to 10 fewer crimes during the subsequent six months. We show that the reduction in crime is caused by incapacitation during the period of the most severe mental distress. In a similar vein, we also find that inpatient admission lowers the risk of hospitalisation for health issues likely resulting from self-harming behaviour in the months following admission. Thus, failing to admit a patient or only offering outpatient care leaves the patient’s immediate needs unaddressed1 resulting in large negative externalities. In the longer run, however, people admitted to inpatient care experience a higher degree of institutionalisation, which is likely to leave them with poorer long term labour market outcomes. In line with the previous medical literature, we find that an inpatient admission increases the probability of subsequent admission to psychiatric treatment facilities by as much as 20 percentage points. There is on average no significant increase in patients’ subsequent number of contacts to the psychiatric system. We therefore attribute the increase in admission rates to an institutionalising effect. In addition, we show that inpatient admission reduces employment and labour market attachment further strengthening the institutionalisation hypothesis. We also identify large effect heterogeneity across both observable and unobservable characteristics. Males experience the largest reductions to crime whereas females experience the largest increase in re-admissions and reductions to labour market attachment. By estimating Marginal Treatment Effects we find that patients with the largest unobservable gains from treatment (more serious conditions) experience the largest reductions to crime and self-harming behaviour, whereas patients with the smallest unobservable gains (less serious conditions) experience the largest increase in the probability of re-admission. Finally, we show that inpatient admission increases the employment rates of spouses, most likely because hospital admissions reduces the strain otherwise experienced when living with and being the main caretaker of a(n) (untreated) psychiatric patient. 1 A concern voiced also in an editorial in The Lancet (2011). 108 We contribute to the existing literature in four ways. First, we consider the effects of the foremost assessment made when a patient contacts a psychiatric hospital; namely whether to immediately admit and treat the patient or not. To our knowledge we are the first to investigate this pivotal point. Patients may be affected by in-, out- or day patient treatments later on, but any subsequent effects of one or the other form of psychiatric treatment are conditional on the foremost decision. Second, we consider a wider range of outcomes, such as labour market outcomes, self-harming behaviour, and crime, compared to the existing literature that focuses on re-admissions and subjective well-being questionnaires. Third, we use a rich data set based on administrative register data which provides us with a sample size that is more than 20 times larger than used by previous studies. Fourth, we consider effects on externalities by studying how admittance affects crime rates and patients’ spouses.2 The article progresses as follows: Section 2 introduces the study’s background, relevant literature, and the institutional framework in Denmark. Section 3 describes the data and the construction of the main variables and section 4 describes the econometric framework. Section 5 presents the results. Finally, section 6 concludes. 2 Background Psychiatric treatment facilities in the Western world have seen substantially downsizing over the past decades.3 Few will cast doubt on the improvements in psychiatric treatments in this period; both with respect to civil rights and quality of treatment.4 However, the vast 2 We do not explicitly address the degree to which various treatment forms reduce core symptoms of psychiatric illnesses or improve the quality of life for patients, but only the derived effects on realised outcomes. 3 Since the 1960s a comprehensive change has taken place in the psychiatric treatment systems in the majority of developed countries.The change is sometimes labelled the third revolution of psychiatry (Castel et al. , 1982). As a result, psychiatric treatment has moved out of the hospitals and into the community (Goodwin, 1997; Killaspy, 2006; Oosterhuis, 2005). Psychiatric health care today has its focus on on treating the patients core symptoms, and no longer only on being a safe haven for the patients (Castel et al. , 1982). Advances in psychopharmacological treatments and an increasing focus on the benefits from ongoing contact with the familiar community, family, and friends during treatment have contributed highly to these changes. See also Knowles (2005) for a discussion of the negative consequences of the downsizing of inpatient care. 4 See Frank & Glied (2006) and Gijswijt-Hofstra et al. (2005) for in-depth discussions. 109 changes raise a central question: Has the move away from treatment in hospitals and the reductions of hospital beds reached its limits or even gone too far? Are patients, whose illnesses are of a nature where immediate inpatient care would be the best option, instead receiving outpatient care, only medical treatment, or no treatment at all? A substantial body of literature has convincingly shown that individuals with psychiatric disorders fare significantly worse compared to the average population, measured on a wide range of outcomes. Psychiatric disorders reduce subsequent employment rates and earnings (Ettner et al. , 1997), and people with mental disorders are heavily overrepresented in crime statistics (Fazel et al. , 2015) and jails and prisons (Kupers & Toch, 1999). A recent Danish study (Greve & Nielsen, 2013) find similar results for schizophrenics in Denmark. Several studies have investigated the demand, cost and determinants of different types of psychiatric care (e.g. Davis & Russell, 1972; Scheffler & Watts, 1986; Vitikainen et al. , 2010).5 We add to the literature by taking a first step in investigating the optimal response to mental illnesses and focusing on the social consequences rather than on demand and supply issues. Several medical studies have investigated the effects of inpatient versus outpatient or day-patient treatment in adult psychiatry on recidivism and well-being (see Marshall et al. , 2009; Shek et al. , 2010, for recent reviews). Except for an increased tendency to higher readmission rates for inpatients, the literature generally finds few or no differences between the effects of inpatient and day treatments. The lack of significant findings may be driven by the small sample sizes investigated,6 which also reduce the external validity of the studies.7 In addition, these studies only consider individuals who actually receive some form of treatment. Instead, we evaluate the effects of inpatient admission for all individuals who seek treatment. 5 See Frank & McGuire (2000) for a review of the literature on mental health economics. One exception is Kallert et al. (2007) who use a cross country RCT with a sample size of 1,055 to investigate the differences between the outcomes of inpatient and day patients. As for the remaining studies, they find few significant differences between the two alternatives, and only consider the outcomes between two types of treatment instead of the full sample of patients seeking treatment. 7 For an example see Creed et al. (1990) who find no differences between inpatient and day treatments using a U.K. RCT with a sample size of 89 individuals. 6 110 Although previous work has found little evidence on the effects of admittance, there are several reasons to suspect that admittance directly and indirectly can affect later health and social outcomes; especially when studying the effect of the first admittance. Goffman (1961) argued that mental patients have careers, just as Dahl et al. (2013) show with regard to welfare recipients. That is, patients experience a trajectory through the mental health system, which marks them as mentally ill. The pivotal moment in such a career may very well be the treatment a patient receives at initial contact, since it marks the beginning of the mental health system’s record of that patient and may affect the threshold for subsequent admittances for that patient. Taking a patient in immediate mental distress to inpatient care might also alleviate public safety risks through incapacitation (see, e.g. Anderson, 2014; Appelbaum, 2001; Jacob & Lefgren, 2003; Winerip, 1999), for example of patients whose mental illness makes them a threat to themselves. If the psychiatric system fails to identify all patients with illnesses that makes them prone to commit crime if untreated, lowering the number of inpatient beds could directly lead to an increase in criminality. 2.1 Institutional framework - mental health care in Denmark In Denmark, basic health care (including treatment at psychiatric hospitals) is fully publicly funded and thus not directly affected by individual credit constraints. A person with psychiatric problems may gain access to psychiatric treatment in two ways. First, the person can contact an emergency room at a hospital. Alternatively, the patient’s general practitioner (GP) may refer him or her (the GP serves as the gatekeeper to the entire public health care sector).8 If the GP assesses that the patient should receive treatment from trained psychiatrists, the GP has three options: (1) refer the patient to a psychiatric practitioner (the 8 GPs may also prescribe most antidepressants themselves, but only do so in mild cases (WHO, 2011b). The Danish prescription practice stands in contrast to e.g. the practice in the U.S. In the U.S. GPs prescribe most psychopharmaceuticals such as antidepressants (Mojtabai & Olfson, 2011). 111 mildest cases); (2) refer the patient to psychiatric hospital (the mild to severe cases); (3) or admit the patient to a psychiatric hospital by force (the most severe cases).9 The publicly funded psychiatric treatment regime consists of inpatient treatments and outpatient treatments. Inpatient treatment takes place at hospitals or at designated facilities and outpatient care in the patient’s home or local community.10 We refer to Appendix B for a brief overview of the trends in number of inpatient beds and patients. 3 Data This section describes the construction of the data and the main variables. We obtain information on contacts with psychiatric care facilities from the Danish national psychiatric register. The register contains information on all contacts with public Danish psychiatric facilities from 1980 to 2011 including date of initial contact, treatment (if any), location, hospital and ward of contact, the date of admission, type of admission if any (inpatient 24-hour, inpatient part time, and from 1996 also information on outpatient), and diagnoses. From the psychiatric register we identify the time of first contact with the psychiatric care system and the corresponding relevant information. Each observation contains a unique individual identifier (and a unique case number), which allows us to link information on gender, date of birth, country of origin, and educational attainment using demographic and educational registers. We also link individuals to the labour market register and the criminal registers. The demographic registers also include unique identifiers of parents and spouses, 9 In 2000, one third of all psychiatric patients received treatment from a specialist, with the last twothirds receiving treatment from a public psychiatric treatment facility (Bengtsson, 2011). An unknown, but relatively small group of patients, received treatment from a privately funded psychiatric practitioner (Bengtsson, 2011). 10 Both types were until 2004 governed and funded by individual counties. The outpatient treatments where the individuals are treated while remaining at home or in residential care - are coordinated by local social services together with psychiatrists and funded and governed by the individual municipalities. During the period in question, Denmark consisted of 279 municipalities and 13 counties. Hoff et al. (2012) provide an overview of the Danish system compared to the English and the Dutch. With the exception of the funding schemes, these two strands of treatment possibilities resemble those of many OECD countries, including the U.S. and the U.K. 112 Table III.1: Average Treatment Length Total length of treatment as inpatient Outpatient treatment at psych. hospitals Total length of treatment as outpatient Observations Not admitted 0.502 (0.500) 68.398 (152.048) 16928 Admitted 30.203 (76.790) 7349 Note: Table shows the means and std. dev. of treatment status. Standard deviation in parentheses. which allows us to identify parental information on educational attainment, age, and previous contacts with psychiatric facilities, as well as spouses’ labour market outcomes. We limit our sample to individuals who have their first contact between the age of 18 and 45 from 1999 to 2001 as we neither wish to focus on child or geriatric psychiatric treatments.11 This results in a final sample of 24,277 individuals.12 Treatment Variable We define our main explanatory variable of interest as a dummy variable which is equal to 1 if an individual is admitted either as a 24-hour or part-time patient to a psychiatric care facility and 0 otherwise. Importantly, 0 comprises all outcomes where the individual is not immediately admitted to the facility but sent home. This includes both scheduled outpatient treatments and complete rejections, as Table III.1 shows by summarizing the treatment characteristics for those who are admitted and the subsequent treatment statuses of those who are not admitted upon first contact. 11 Danish psychiatric facilities have been the focus of gradual budget cuts and capacity reductions during the past two decades, especially with respect to inpatient treatments (Bengtsson, 2011). However, the period between 1998 and 2001 is a relatively stable period without any major reforms and reductions on adult psychiatric treatment possibilities. Different conditions apply for child and geriatric patients relative to the average adult population once in contact with a psychiatric care facility. 12 We also discard individuals who are diagnosed with mental retardation, dementia, or disorders of early psychological development (because these patient groups suffer from chronic disorders and have little or no labour market attachment), and individuals who are diagnosed with eating disorders (mainly teenage girls) or non-organic sexual dysfunctions in order to obtain a more homogeneous sample. Finally, we exclude individuals who contact the few countryside treatment facilities that only treat 10-50 individuals per year. 113 Table III.1 presents data on the 7,349 people who are admitted as inpatients and on the 16,928 who are not admitted as inpatients over the study period. We denote the former the treatment group and the latter the control group. The average admission length is 30.2 days for those admitted as inpatients (treatment=1). Of those who are not admitted as inpatients (treatment=0), around 50 percent receive treatment as outpatients. Table III.1 also shows that the average treatment length is larger for outpatients than inpatients, because many of those who are admitted as inpatients continue into outpatient treatment once deemed ready. Lengths of subsequent outpatient treatments for inpatients are not included in the summary measure in Table III.1. Sample Descriptives The first three columns of Table III.2 summarise the characteristics of the overall sample and the sample divided by whether they individuals received immediate inpatient treatment or not. The fourth column shows the corresponding average characteristics for a random sample of the Danish population between ages 18 and 45.13 In Table III.2, columns 2 and 3 show that more women than men contact psychiatric hospitals, but men are more often admitted to inpatient care at first contact. Admitted patients are also older, more likely to have committed a crime previously, have higher unemployment rates, and lower welfare dependency the year prior. In addition, admitted patients have parents with less education than individuals who are not admitted. Individuals who suffer from either disorders associated with substance use, psychosis or schizophrenia, or adult-onset affective disorders are more likely to be admitted. Individuals who suffer from nervous or stress related disorders, personality disorders, or pre-adult onset affective or emotional disorders are less likely to be admitted. When comparing column 2 to column 5 we see large differences between the individuals who contact psychiatric hospitals and the average 13 The random sample has been weighted by the age-distribution of our main sample for comparison. 114 Table III.2: Descriptive Statistics for Sample Full sample Not admitted Admitted Representative sample 0.463 0.423 0.555 0.509 (0.499) (0.494) (0.497) (0.500) Immigrant 0.126 0.126 0.124 0.081 (0.331) (0.332) (0.330) (0.273) Age at admission 31.141 30.821 31.879 31.995 (7.535) (7.522) (7.515) (7.794) Gross income in year -1 (2000DKK) 194426.591 193501.875 196556.785 264456.310 (130354.872) (131851.746) (126822.629) (207096.723) Committed crime prior to year 0.280 0.255 0.337 0.138 (0.449) (0.436) (0.473) (0.345) Unemployment degree in year -1 0.183 0.179 0.193 0.071 (0.305) (0.304) (0.308) (0.199) Welfare dependency in year -1 0.334 0.344 0.311 0.215 (0.383) (0.387) (0.371) (0.347) Mother’s months of schooling 125.239 126.928 121.348 125.526 (34.248) (34.427) (33.510) (38.394) Father’s months of schooling 135.892 137.235 132.801 137.650 (35.023) (34.924) (35.058) (40.440) Mother’s age at birth 26.206 26.215 26.188 25.360 (5.035) (5.007) (5.099) (5.164) Admitted in own municipality 0.164 0.166 0.157 (0.370) (0.372) (0.364) Admitted in Copenhagen 0.193 0.207 0.161 (0.395) (0.405) (0.368) Admitted in metropolitan area 0.152 0.150 0.158 (0.359) (0.357) (0.365) Disorder associated with substance use 0.152 0.125 0.213 (0.359) (0.331) (0.410) Psychosis, schizophrenia 0.074 0.037 0.161 (0.262) (0.188) (0.368) Affective disorder 0.199 0.180 0.243 (0.399) (0.384) (0.429) Nervous or stress-related 0.444 0.504 0.305 (0.497) (0.500) (0.460) Personality disorder 0.103 0.118 0.071 (0.305) (0.322) (0.256) Affective/emotionel, pre-adult origin 0.028 0.036 0.007 (0.164) (0.187) (0.086) Admitted 0.303 0.000 1.000 (0.459) (0.000) (0.000) Observations 24277 16928 7349 716411 Note: Table shows the means and std. dev. of covariates for the full sample and divided by treatment status. T-tests of differences between inpatient and outpatient sub-samples. Far right coloumn shows summary statistics of a randomly selected sample of individuals aged between 18-45 in 1999-2001. Male 115 population. Individuals who seek access to psychiatric treatment have weaker labour market attachment and lower income, come from more disadvantaged backgrounds, and are more likely to have committed crime. Outcome Variables Below we introduce the outcome variables: later psychiatric inpatient admissions and contacts, criminal convictions, self-harming behaviour, and labour market outcomes. We measure all variables for the first three years after the initial contact with a psychiatric hospital. We use the psychiatric register to identify the probability of subsequent contacts and admissions to psychiatric hospitals. We define subsequent contact as appearing in the psychiatric registers with at least one new entry after the date of discharge for admitted patients or date of first contact for not admitted patients. We define subsequent inpatient admission as at least one admission taking place after first contact (and in the case of inpatient admission after first discharge). This is by construction a subset of contacts. We also link the data to the criminal registers containing criminal convictions, charges, and incarcerations. We combine the exact date of contact with the exact date of committing a criminal act and we define crimes as convictions with no ongoing appeals. The combined information allows us to determine the exact time between the time of contact to psychiatric hospitals and the crime. We measure crimes as an accumulated count variable over the entire three year period following first contact, and later contact and admission as a binary, absorbing indicator. We use two measures of self-harm: (a) hospitalisations for overdoses; and (b) hospitalisations for lesions. We construct the two variables using information on exact dates of somatic hospital-treatments relative to date of first contact to a psychiatric hospital together with the main diagnosis. We obtain labour market outcomes from the national register on labour force statistics (RAS). The register includes annual listings of type of occupation, unemployment status, and dependency on other forms of welfare benefits. We generate three categories: 116 employment, unemployment, and not in the labour force. Contrary to the other outcome variables, we only have access to an annual measure of labour market outcomes measured ultimo November each year. Tables III.3 and III.4 summarise the outcome measures by treatment status (immediate inpatient admission vs. no immediate inpatient admission). Table III.3: Summary of Outcomes for Admission, Contact, Crimes, Overdose, and Lesion 3 months 6 months 9 months 12 months 15 months 18 months 21 months 24 months 27 months 30 months 33 months 36 months Observations Admission Not adm. Adm. 0.017 0.030 (0.145) (0.186) 0.035 0.059 (0.219) (0.285) 0.055 0.086 (0.291) (0.373) 0.072 0.110 (0.355) (0.447) 0.088 0.135 (0.400) (0.511) 0.103 0.158 (0.450) (0.564) 0.117 0.180 (0.496) (0.615) 0.132 0.203 (0.541) (0.665) 0.147 0.228 (0.592) (0.733) 0.160 0.251 (0.634) (0.800) 0.172 0.276 (0.671) (0.865) 0.186 0.301 (0.717) (0.925) 16928 7349 Contact Not adm. Adm. 0.088 0.220 (0.283) (0.414) 0.123 0.284 (0.329) (0.451) 0.143 0.317 (0.350) (0.465) 0.153 0.346 (0.360) (0.476) 0.162 0.366 (0.368) (0.482) 0.169 0.382 (0.375) (0.486) 0.176 0.398 (0.380) (0.489) 0.181 0.408 (0.385) (0.492) 0.187 0.419 (0.390) (0.493) 0.192 0.427 (0.394) (0.495) 0.196 0.433 (0.397) (0.496) 0.200 0.440 (0.400) (0.496) 16928 7349 Crimes Not adm. Adm. 0.226 0.497 (0.418) (0.500) 0.281 0.572 (0.450) (0.495) 0.313 0.602 (0.464) (0.490) 0.333 0.623 (0.471) (0.485) 0.351 0.638 (0.477) (0.481) 0.365 0.649 (0.482) (0.477) 0.381 0.660 (0.486) (0.474) 0.393 0.668 (0.488) (0.471) 0.403 0.675 (0.490) (0.468) 0.413 0.683 (0.492) (0.466) 0.421 0.688 (0.494) (0.464) 0.429 0.693 (0.495) (0.461) 16928 7349 Overdose Not adm. Adm. 0.010 0.019 (0.101) (0.135) 0.016 0.030 (0.127) (0.172) 0.020 0.041 (0.140) (0.199) 0.024 0.049 (0.154) (0.215) 0.028 0.054 (0.165) (0.227) 0.031 0.060 (0.174) (0.237) 0.035 0.064 (0.184) (0.245) 0.038 0.067 (0.190) (0.251) 0.040 0.072 (0.196) (0.258) 0.043 0.077 (0.202) (0.267) 0.045 0.082 (0.208) (0.274) 0.048 0.087 (0.213) (0.281) 16928 7349 Lesion Not adm. Adm. 0.001 0.002 (0.034) (0.042) 0.002 0.003 (0.047) (0.055) 0.003 0.005 (0.054) (0.069) 0.004 0.006 (0.059) (0.079) 0.005 0.007 (0.067) (0.083) 0.005 0.008 (0.073) (0.088) 0.006 0.009 (0.077) (0.094) 0.007 0.010 (0.082) (0.097) 0.007 0.011 (0.085) (0.104) 0.008 0.012 (0.089) (0.107) 0.009 0.013 (0.093) (0.112) 0.009 0.013 (0.096) (0.114) 16928 7349 Note: Table shows means and std. dev. of outcome variables for the sample dependent on treatment status. Time 0 is month of initial contact. Crimes are aggregating count variables. Admission, Contact, Overdose and Lesion are absorbing state dummies. Standard deviation in parentheses. 117 Table III.4: Summary of Labor Market Outcomes 12 months 24 months 36 months Observations Empl. Not adm. Adm. 0.550 0.455 (0.498) (0.498) 0.537 0.444 (0.499) (0.497) 0.532 0.431 (0.499) (0.495) 16928 7349 Unempl. Not adm. Adm. 0.169 0.202 (0.375) (0.402) 0.166 0.190 (0.372) (0.393) 0.160 0.177 (0.367) (0.382) 16928 7349 Out of labor force Not adm. Adm. 0.282 0.343 (0.450) (0.475) 0.297 0.366 (0.457) (0.482) 0.308 0.391 (0.461) (0.488) 16928 7349 Note: Table shows means and std. dev. of outcome variables for the sample dependent on treatment status. Time 0 is month of initial contact. Standard deviation in parentheses. From the tables we see that the average rates of subsequent admissions and crime are substantial and that, overall, the individuals who received inpatient care at first contact have worse outcomes than individuals who did not receive inpatient treatment in terms of readmissions, criminal convictions, and labour market attachment. We do not explicitly address how inpatient treatment upon initial contact reduces core symptoms of psychiatric illnesses or improve patients’ the quality of life, as these are highly subjective variables that may relate spuriously to realised treatment status. Instead, we investigate the derived effects on realized objective outcomes. We implicitly assume that hospital contacts largely reflect patients’ subjective need of treatment. Consequently, we expect that institutional settings predominately drive estimated increases (reductions) in admission rates above (below) estimated increases (reductions) in hospital contacts. Instrumental Variable As an instrument for inpatient admission we use hospital specific contact intensity. We construct our instrumental variable as the fraction, for each date and each hospital, of the number of weekly contacts relative to the maximum number of contacts within a given seven day period during the past year. We use the complete psychiatric register to identify the number of daily contacts and admissions to wards that receive or treat 24-hour or part-time admitted patients at each hospital. We compute the number of contacts with inpatient 118 admission potential within the past week for each individual contact from 1998 to 2001 (excluding the day of an individual’s first contact). This variable will be the numerator in our instrumental variable. It contains information on the total demand for psychiatric treatments at a given date at a given hospital. In order to control for hospital size, we create a similar variable that contains the maximum number of contacts within a given seven day interval during the past year from the date of each individual contact. This will be the denominator in our instrumental variable.14 For individual i at hospital h the instrument equals: lag(1week) Zi lag(2weeks) Zi = = P−1 d=−7 d=−7 Contactshd , P−2 Contactshd P−359 d=−8 Contactshd , ..., d=−365 P−8 d=−14 Contactshd P−1 P−2 P max( d=−7 Contactshd , d=−8 Contactshd , ..., −359 d=−365 max( P−1 Contactshd ) (1) Contactshd ) Figure III.1 shows the probability of being admitted for different values of the instrumental variable. The probability of being admitted is unambiguously downward sloping across the entire interval of hospital specific contact intensity in the two weeks preceding initial contact. The figures also show that the relationship is strongest at low numbers of contacts during the past weeks but monotone across the entire sample space. We therefore assess that the IVs are both relevant and monotonous. Section 5 will present the first stage results formally. Although there is little reason to believe that information related to the IVs is readily available to patients, GPs may observe a hospital’s vacancy rate and react to this (e.g., if the GP delays referring a patient until there is room at a hospital or commences the treatment him- or herself). In this case, we should find correlation between the contact intensity and patient’s number of visits to the GP prior to first contact. Table IIIA.1 in the Appendix shows estimates from regressions of GP and specialist visits on the two IVs. Because patients may postpone contacts to hospital as a result of high contact intensity in 14 Importantly, we only count an individual once within a given week in order to avoid that e.g. high admission thresholds affects the number of contacts as patients keep reappearing. 119 Figure III.1: Probability of Inpatient Admission Across the Instrument ((a)) Contact within last 7 days relative to max within last year ((b)) Contact within last 8-14 days relative to max within last year Note: Figures show observed fraction of inpatient admissions across the values of the two intruments. 120 the two weeks preceding initial contact, we consider the number of visits during the year of initial contact as well as during the previous year. Table IIIA.1 shows that none of the estimates are significant on any conventional significance level; i.e., hospitals’ deviations in contact intensity are independent of patients’ contacts to other sources of treatment for their disorder.15 In addition, systematic relationships between the IVs and covariates could indicate endogeneity. Besides the explanatory variables described in Table III.2 we also include one year lagged information on crime and labour market attachment, as well as indications on whether the patient’s mother was ever admitted. Table IIIA.2 in the Appendix presents estimates of the IVs regressed on these variables. The table shows that there is no significant relationship between the past weeks’ contact intensity and the patients’ diagnoses, prior unemployment, welfare dependency, prior crime, or parents’ psychiatric history. Only mothers’ schooling and region are significant for both of the instruments. We have performed the analyses with metropolitan areas excluded16 finding no qualitative differences from the overall results (see Table IIIA.6 in the Appendix). As all other central estimates are insignificant, we consider our key assumptions met. 4 Econometric Framework This paper investigates the effects of admittance/non-admittance to a psychiatric hospital for an individual i — a person in psychiatric distress — on his or her subsequent psychiatric admissions and contacts, labour market outcomes, self-harming behaviour, and crime Yi : ∆i = Y1i − Y0i 15 (2) If rejected patients turn up at other hospitals with more vacant beds and are admitted there, this could result in a misspecification of the treatment variable and result in attenuation bias. But as only 1.7% of the no-admission group contacts another hospital within the first week of their initial rejection, we do not consider this to be a problem. 16 Major cities also constitute the areas most likely to allow for selection between hospitals according to occupancy rate. 121 Y1i denotes person i’s potential outcome if treated, and Y0i denotes the potential outcome if not treated. ∆i is our parameter of interest. Yet, we face several obstacles in estimating ∆i , foremost that we do not observe both counterfactual treatment states. Let Di = {0, 1} be a dummy variable indicating whether a hospital admits a person upon initial contact. We may describe the potential outcomes Y0 , Y1 (suppressing the subscript i) as functions of observable characteristics X and unobservable characteristics U (Heckman & Honore, 1990; Heckman et al. , 2006): Yk = µk (X) + Uk , k = {0, 1} (3) Individuals “select” into inpatient admission (D = 1) if the total potential net gain of admission is positive: D = 1[Y1 − UC > Y0 ] (4) = 1[µ1 (X) − µ0 (X) > U0 − U1 + UC ] where −V = U1 − U0 − UC is unobserved net gains of selecting into treatment. UC refers to costs; psychic and monetary. The definition implies that the outcomes we observe equal: Y = µ0 (X) + D(µ1 (X) − µ0 (X) + U1 − U0 ) + U0 (5) If ignored, unobserved characteristics will be embedded in the error term when estimating a basic OLS, which will thus be biased. Therefore, to circumvent that the counterfactual unobservable components of the outcomes are likely to differ, we employ an instrumental variable Z = {z1 , z2 }, which is the fraction of the number of contacts one and two weeks prior to initial contact relative to the maximum number of contacts within a given week during the past year at a given hospital. Inserting Z in equation (4) and implementing the instrument in a 2SLS approach yield the following 1st stage equation: D = π10 X + π20 Z − Ṽ 122 (6) A pivotal assumption is that the fraction of the number of weekly contacts relative to the maximum number of contacts within a given week during the past year is independent of other unobservable selection mechanisms captured by (U1 , U0 ). I.e., neither the number of contacts during the past couple of weeks, nor the maximum number of contacts during a week in the past year, is allowed to affect whether a patient chooses a specific day and a specific hospital to have his or her first contact with the psychiatric system. In addition to independence between the instrument and the unobservable characteristics, we also need a monotonic effect of the instrument on the endogenous regressor for all values of the instrument. Figure III.1 confirms that this is indeed the case. Because Z constitutes an exogenous and monotonous instrument for Admitted, we can identify the Local Average Treatment Effect17 of being admitted as: β2LAT E = E[β2 |π10 X + π20 Z < Ṽ < π10 X + π20 Z ∗ ], f or Z > Z ∗ (7) The LATE captures the average effect of being admitted for those who are moved from ’no admission’ to ’admission’ when Z decreases to Z ∗ .18 Consequently, β2LAT E may incorporate heterogeneous treatment effects across individuals with different levels of unobservable characteristics, such as severity of mental disorder. Some may be on the margin of treatment assignment only for very low values of Z and others might be on the margin of treatment assignment at the highest values of Z. To allow for heterogeneity across the different levels of pre-admission levels of unobservable characteristics we estimate the Marginal Treatment 17 We estimate the first stage equation using the instrument linearly. The results are robust to using alternative specifications that allow for non-linear relationships between the probability of admission and previous weeks’ contact intensity. These results are presented in Figure IIIA.3 in the Appendix. 18 Alternatives to treatment (inpatient admission upon initial contact) comprise e.g., no treatment at all, subsequent outpatient treatment, and a new subsequent contact to the hospital that results in an inpatient admission. Our treatment effects will identify the effect of inpatient admission upon initial contact relative to an average of all of these individual alternatives. Identifying each individual alternative would require at least one instrument per margin of treatment. Authors refer to Heckman & Urzua (2010) for a general discussion of this issue and Kirkebøen et al. (2014) for an application (investigating field of study) where all individual treatment margins are identified. 123 Effects (MTE) of admission (Björklund & Moffitt, 1987; Heckman & Vytlacil, 2005): β2M T E = E[β2 |UD = u∗D , X = X ∗ ] (8) where UD = FUD [V ] (see eq. (4)). β2M T E captures the effect of being admitted on the outcomes for those who are on the margin of treatment at each level of unobservable net gains UD (V ).19 The smaller the UD (V ), the larger the unobserved net gains of being selected into inpatient admission. To be on the margin of treatment, a given level of observed probability of admission (observed gains) must be matched by a corresponding inverse level of unobserved net gains. As unobserved gains of admission are arguably an increasing function of the severity of mental disorder, we interpret UD (V ) as an aspect of severity (though interpretations may differ by the content of and interaction with UC ), where low values of UD (V ) correspond to severe mental disorders and vice versa.20 5 Results This section presents the estimation results. First, we show that our instrumental variable is appropriate. Thereafter, we present our main results: the effects of inpatient admission on the probability of subsequently contacting a psychiatric hospital again, the probability of later admissions to psychiatric hospitals, crime, (somatic) hospitalisations due to drug overdoses or lesions, and labour market outcomes. Finally, we present results from Marginal Treatment Effects of admission and the effect of admission on spouses’ LMOs. 19 We estimate the first stage equation of the MTE by a probit. We report estimates of the parametric version of the MTE, see Heckman et al. (2006). 20 The MTE is a generalisation of treatment effects such as the LATE: β2LAT E = 1 UD − UD ∗ Z UD ∗ UD β2M T E dU where UD is the random/latent variable for those who are affected by Z. 124 5.1 2SLS Results Table III.5 presents the results of the first stage regression of our instrumental variables on a dummy for inpatient admission. The first column shows the first stage estimates without any additional control variables, the second column shows the estimates with a set of socioeconomic and demographic control variables, and the third column shows the estimates with additional dummies for the patient’s diagnosis. Table III.5: 1st stage Estimation Results Average occupancy rate, deviation from hospital mean Previous 1-7 days Previous 8-14 days F-value Observations SES and demograpic controls Diagnosis controls Inpatient adm. Inpatient adm. Inpatient adm. -0.154∗∗∗ (0.030) -0.143∗∗∗ (0.030) -0.142∗∗∗ (0.027) -0.103∗∗∗ (0.029) 99.372 24277 -0.092∗∗ (0.028) 82.671 24277 X -0.089∗∗∗ (0.026) 87.776 24277 X X + p < .10;∗ p < .05;∗∗ p < .01; ∗∗∗ p < .001 Note: Table shows OLS regression results of inpatient admission (0/1) on the two instrumental variables; hospital specific contact intensity the two week prior to the individual’s initial contact. Standard errors clustered by hospital and month in parentheses. SES and demographic controls include: Gender (dummy), age at adm., mother’s age at birth, mothers years of schooling, father’ age at birth, father’s years of schooling, mother has prior psych. history (dummy), admitted in own municipality (dummy), greater CPH area (dummy), other metropolitan area (dummy), year dummies. Diagnosis controls include: Dummies for each F. diagnosis category from ICD-10. The relationships are highly significant across the three different specifications when tested individually or jointly (the F-values all exceed the Staiger-Stock rule-of-thumb of 10 (Staiger & Stock, 1997)). This confirms the findings from Figure III.1, that also showed that the probability of inpatient admission decreased unambiguously as both contact intensity rates increased. 125 Subsequent Admissions and Contacts Figure III.2 shows OLS and 2SLS results of the effect of inpatient admission on the probability of later admission to psychiatric care from three months after first contact and until 36 months after first contact. The estimates are also reported in table IIIA.3 in the appendix. From Figure III.2(a) we see that inpatient admission at first contact correlates positively with the likelihood of later admissions—on average inpatients have just below 10%-points higher likelihood of inpatient admission three months after first contact than the patients whom the psychiatric hospital chose not to admit at first contact. The association increases to just below a 20%-points increase at 30 months after first contact. Figure III.2(b) reports the 2SLS estimates of the effect of inpatient admission. For the first half of the period the likelihood of inpatient re-admission is increased by about 15%-points, whereas for the second half of the period the effect size is an increase in the likelihood of about 20%-points. The estimates are at least borderline significant for the entire period, and significant at the 5% level for the first couple of months, and from month 15 and onwards. The 2SLS estimates are of similar sign and larger or as large as the OLS estimates.21 In order to investigate the mechanisms behind this notable result, we consider the effects of admission on the likelihood of subsequently contacting a psychiatric hospital again after first contact. Figure III.3 shows OLS and 2SLS estimates of the effect of inpatient admission on the probability of later contact to psychiatric hospitals from three months after first contact and until 36 months after first contact. The estimates are also reported in Table IIIA.4 in the appendix. From the OLS estimates in Figure III.3(a) we see that inpatient admission is associated with an increase of around 20%-points in the likelihood of subsequent contact with the psychiatric health care system for the following three to 36 months after first contact. Yet, when we examine the causal effect of admitting the patient on the margin, Figure 21 The variance of estimates is large. This could indicate the use of a weak instrument. However, the first stage results from Table III.5 show that this is not the case. 126 III.3(b) shows that there is no significant effect across the period. Again, the estimates retain the same sign as the OLS estimates but are significantly smaller in size. The figures show that admitting people to inpatient treatment leads to a higher re-admission rate for the subsequent 36 months after initial contact, but does not lead to significantly higher re-contact rates in the same period. These findings indicate that institutionalisation is a pivotal mechanism behind the size of the estimates in Figure III.2(b).22 I.e., if a person gets his or her foot in the door at first contact, he/she is much more likely to be re-admitted as an inpatient in subsequent years. Crimes Figure III.4 shows the OLS and 2SLS estimates of inpatient admission on the future number of criminal convictions. The estimates are also reported in Table IIIA.5 in the appendix. Figure III.4(a) shows that individuals admitted to inpatient care at first contact have higher crime rates during all of the first 36 months after initial contact. Moreover, their overrepresentation in the criminal justice system grows as the time from first contact approaches 36 months. When we focus on the causal effect of admitting the marginal person to inpatient care in Figure III.4(b), we see a significant and substantial decrease in crime for the first six months after first contact. Moreover, the effect persists as borderline significant up until 22 months after first contact. Admitting 100 patients on the margin of treatment to inpatient care upon their first contact leads to ten fewer crimes during the subsequent six months. Inpatient admittance reduces subsequent crime, either because of incapacitation or because inpatient treatment better meets the immediate medical needs of crime-prone patients at first contact. 22 Note, however, that the 2SLS estimates for admittance and contacts are significantly different from each other at the start of the period only (6 months). 127 Figure III.2: Effect on Probability of Subsequent Inpatient Admission by Months Since 1st Contact ((a)) OLS estimates ((b)) 2SLS estimates Note: Figure shows OLS and 2SLS regression results of inpatient admission (0/1) on subsequent re-admission (0/1). The dashed lines indicate 95% confidence intervals and the dotted line indicate 90% confidence intervals. Standard errors clustered by hospital and month in parentheses. Time 0 is month of initial contact. SES and demographic controls include: Gender (dummy), age at adm., mother’s age at birth, mothers years of schooling, father’ age at birth, father’s years of schooling, mother has prior psych. history (dummy), admitted in own municipality (dummy), greater CPH area (dummy), other metropolitan area (dummy), year dummies. Diagnosis controls include: Dummies for each F. diagnosis category form ICD-10. 128 Figure III.3: Effect on Probability of Subsequent Contact to a Psychiatric Hospital by Months Since 1st Contact ((a)) OLS estimates ((b)) 2SLS estimates Note: Figure shows OLS and 2SLS regression results of inpatient admission (0/1) on subsequent contact to psych. hospitals (0/1). The dashed lines indicate 95% confidence intervals and the dotted line indicate 90% confidence intervals. Standard errors clustered by hospital and month in parentheses. Time 0 is month of initial contact. SES and demographic controls include: Gender (dummy), age at adm., mother’s age at birth, mothers years of schooling, father’ age at birth, father’s years of schooling, mother has prior psych. history (dummy), admitted in own municipality (dummy), greater CPH area (dummy), other metropolitan area (dummy), year dummies. Diagnosis controls include: Dummies for each F. diagnosis category form ICD-10. 129 Figure III.4: Effect on Subsequent Crimes by Months Since 1st Contact ((a)) OLS estimates ((b)) 2SLS estimates Note: Figure shows OLS and 2SLS regression results of inpatient admission (0/1) on subsequent crimes. The dashed lines indicate 95% confidence intervals and the dotted line indicate 90% confidence intervals. Standard errors clustered by hospital and month in parentheses. Time 0 is month of initial contact. SES and demographic controls include: Gender (dummy), age at adm., mother’s age at birth, mothers years of schooling, father’ age at birth, father’s years of schooling, mother has prior psych. history (dummy), admitted in own municipality (dummy), greater CPH area (dummy), other metropolitan area (dummy), year dummies. Diagnosis controls include: Dummies for each F. diagnosis category form ICD-10. 130 As earlier results showed that inpatient admissions increase the probability readmissions, we suspect that the crime reductions are caused by an incapacitating effect of being admitted as an inpatient. To examine whether incapacitation drives the effect of inpatient treatment, we also estimate models where we set time 0 at time of discharge, such that e.g. 12 months refer to one year after end of treatment and not 12 months after initial contact. The figure shows that the estimates are now close to zero and insignificant (see Figure IIIA.1 in Appendix), so part of the effect of inpatient treatment on subsequent crime comes from incapacitation of patients at the immediate time of need. Self-harming Behavior Figure III.5 shows the effect of admission on the subsequent risk of at least once experiencing hospitalisation due to drug overdose or lesions. Individuals admitted at first contact are more likely to experience hospitalisation due to drug overdose for the three following years as seen from Figure III.5(a). Yet, when we focus on the 2SLS estimates, there is no significant effect while the point estimates are negative. For lesions there is no significant association with admission at first contact (Figure III.5(c)). However, the 2SLS estimates show that admittance leads to a significant reduction in the risk of hospitalisation due to lesions in the six months following first contact, after which the effect fades and becomes insignificant.23 23 This reduction is not driven by incapacitation. The estimates do not differ qualitatively when we center outcomes around day of discharge. 131 Figure III.5: Effect on Subsequent Probability of Being Hospitalized for Self-harm by Months Since 1st Contact ((a)) OLS overdose ((b)) 2SLS overdose ((c)) OLS lesion ((d)) IV lesion Note: Figure shows OLS and 2SLS regression results of inpatient admission (0/1) on the risk of ever experiencing hospitalization through overdoses (0/1) or lesions (0/1) the three years following first contact. The dashed lines indicate 95% confidence intervals and the dotted line indicate 90% confidence intervals. Standard errors clustered by hospital and month in parentheses. Time 0 is month of initial contact. SES and demographic controls include: Gender (dummy), age at adm., mother’s age at birth, mothers years of schooling, father’ age at birth, father’s years of schooling, mother has prior psych. history (dummy), admitted in own municipality (dummy), greater CPH area (dummy), other metropolitan area (dummy), year dummies. Diagnosis controls include: Dummies for each F. diagnosis category form ICD-10. 132 Labor Market Outcomes To further investigate whether institutionalisation is more likely to happen when people enter inpatient care at first contact, we also consider how entering inpatient care affects subsequent labour market outcomes. If inpatient treatment has an lock-in effect, we should expect inpatient admittance to also affect labour market outcomes. Table III.6 reports the OLS and 2SLS results for admitting people to inpatient treatment at first contact on labour market position the following three years. A person can either be in employment, registered as unemployed, or registered as outside the labour force. Column 2 presents the OLS results and columns 3-5 show the 2SLS estimates. Table III.6: Estimation Results on Subsequent Labor Market Outcomes OLS Employment First year since contact Second year since contact Third year since contact Unemployment First year since contact Second year since contact Third year since contact Out of Labor Force First year since contact Second year since contact Third year since contact N SES and demograpic controls Diagnosis controls 2SLS 2SLS 2SLS -0.054∗∗∗ (0.007) -0.049∗∗∗ (0.007) -0.053∗∗∗ (0.007) -0.053 (0.098) -0.053 (0.088) -0.168∗ (0.083) -0.112 (0.097) -0.143 (0.088) -0.253∗∗ (0.085) -0.096 (0.095) -0.126 (0.085) -0.240∗∗ (0.083) 0.016∗∗ (0.006) 0.013∗ (0.006) 0.009+ (0.005) -0.069 (0.069) -0.020 (0.066) 0.017 (0.056) -0.045 (0.073) 0.009 (0.068) 0.059 (0.060) -0.053 (0.072) 0.003 (0.068) 0.054 (0.060) 0.038∗∗∗ (0.007) 0.036∗∗∗ (0.007) 0.044∗∗∗ (0.007) 24277 X X 0.122 (0.075) 0.073 (0.080) 0.151∗ (0.072) 24277 0.158∗ (0.080) 0.134 (0.086) 0.194∗ (0.076) 24277 X 0.149+ (0.079) 0.124 (0.085) 0.186∗ (0.076) 24277 X X + p<.10;∗ p<.05;∗∗ p<.01;∗∗∗ p<.001 Note: Table shows 2SLS regression results of inpatient admission (0/1) on subsequent employment (0/1), unemployment (0/1), and being out of the labor force (0/1). Standard errors clustered by hospital and month in parentheses. SES and demographic controls include: Gender (dummy), age at adm., mother’s age at birth, mothers years of schooling, father’ age at birth, father’s years of schooling, mother has prior psych. history (dummy), admitted in own municipality (dummy), greater CPH area (dummy), other metropolitan area (dummy), year dummies. Diagnosis controls include: Dummies for each F. diagnosis category from ICD-10. 133 We see that inpatients are more often unemployed, more often outside the labour force, and less often employed for the three years after initial contact than people who did not enter inpatient care. When we examine the causal effect of admitting the marginal person to inpatient care at initial contact (columns 3 to 5), we see that patients admitted to inpatient treatment become gradually less employed for the following three years, and instead are outside the labour force. Three years after first contact the marginal individual who is admitted has 25 percentage point lower employment level, and are about 20 percentage point more likely to be outside the labour force. Although the estimates for labour market outcomes appear very large at first, they should be viewed in the light of the large effects of inpatient admission on likelihood of later admission.24 Psychiatric hospital records are a strong indicator for whether social services provide individuals with disability pension, early retirement, or other forms of social assistance without labour market requirements. Hence, by construction, being admitted at first contact with a psychiatric hospital likely makes the patient eligible for permanent welfare benefits; if not directly from first admission then from subsequent derived admissions. 5.2 Gender and age differences Men and women might seek psychiatric help for different reasons and different illnesses, and respond to admissions at different margins. Figure III.6 shows the LATE estimates for admissions, contacts, and crime for men and women separately. Table III.7 reports the effects on labour market outcomes by gender. We only report 2SLS results and only for the models where we control for SES, demographics, and diagnosis types.25 24 Also, the changes from OLS to 2SLS appear counterintuitive if one has the likely selection bias of an average population in mind. Yet, as we have shown, our sample is not drawn from the average population and very different selection mechanisms may play a role here. For example, stronger patients may be better at manipulating institutions (see Moustsen et al. (2015) for a similar example from cancer-treatment). 25 We also examine whether there exist heterogeneous effects across types of diagnoses, but find no significant or substantial differences. Results available on request. 134 Figure III.6: 2SLS Estimates of Effects of Inpatient Admission on Likelihood of Subsequent Admission, Contact, and Number of Crimes by Months Since 1st Contact and by Gender ((a)) Males, admission ((b)) Females, admission ((c)) Males, contact ((d)) Females, contact ((e)) Males, crimes ((f)) Female, crimes Note: Figures shows 2SLS regression results of inpatient admission (0/1) on subsequent re-admission (0/1), contact to psych. hospitals (0/1), and crimes for males and females separately. The dashed lines indicate 95% confidence intervals and the dotted line indicate 90% confidence intervals. Time 0 is month of initial contact. Standard errors clustered by hospital and month in parentheses. SES and demographic controls include: Age at adm., mother’s age at birth, mothers years of schooling, father’ age at birth, father’s years of schooling, mother has prior psych. history (dummy), admitted in own municipality (dummy), greater CPH area (dummy), other metropolitan area (dummy), year dummies. Diagnosis controls include: Dummies for each F. diagnosis category form ICD-10. 135 Figure III.6 shows a clear gender disparity in the way men and women react to inpatient treatment. Men, across all ages, experience a borderline significant increase in the likelihood of re-admissions until 24 months after first contact. Additional analyses across age (not shown here) indicate that men below the age of 30 at the time of initial contact experience a corresponding increase in the likelihood of subsequent contact, while men above the age of 30 experience a significant institutionalisation effect, as they suffer from significant increases in re-admission rates but does not change contacting-behaviour correspondingly. Women significantly increase re-admission and contact rates as time approaches 36 months after initial contact. The estimate of the effect of inpatient treatment on probability of re-admission is almost three times as high for women as for men at month 30 (just below 0.30 for women and around 0.12 and insignificant for men), while the likelihood of contacting a psychiatric hospital again has increased in similar magnitude for women. The results indicate that (older) men are more likely to experience further institutionalisation if admitted to inpatient care at first contact than is the case for women. The effects of inpatient treatment by gender also differ for the crime outcome. 21 months after first contact, the effect of inpatient admission on crime for men is around 45 criminal acts less per 100 men admitted to inpatient treatment. For women, there is no significant effect. As seen from Table III.7 there is no effect of inpatient treatment on men’s labour market attachment, but there are significant effects for women. Three years after first contact, female labour market attachment has dropped drastically, with women moving from employment out of the labour force. The drop in employment dovetails with women’s higher admission rates. Comparatively, men who enter the psychiatric treatment system have generally lower SES, worse labour market attachment, and higher crime rates than their female counterparts at time of first contact (see Table IIIA.7 in appendix), which could suggest that the gender disparities in treatment effects of inpatient treatment originate from gender differences in the pre-treatment levels. 136 Table III.7: 2SLS Estimation Results on Subsequent Labor Market Outcomes by Gender Employment First year since contact Second year since contact Third year since contact Unemployment First year since contact Second year since contact Third year since contact Out of Labor Force First year since contact Second year since contact Third year since contact Observations SES and demographic controls Diagnosis controls + Male Female 0.022 (0.113) -0.006 (0.114) -0.044 (0.112) -0.231 (0.141) -0.256+ (0.136) -0.470∗∗∗ (0.136) -0.017 (0.095) -0.040 (0.103) 0.034 (0.083) -0.093 (0.107) 0.045 (0.0900) 0.080 (0.090) -0.004 (0.102) 0.045 (0.117) 0.010 (0.105) 0.324∗ (0.130) 0.211 (0.130) 0.390∗∗ (0.132) 11246 X X 13031 X X p < .10;∗ p < .05;∗∗ p < .01; ∗∗∗ p < .001 Note: Table shows 2SLS regression results of inpatient admission (0/1) on subsequent employment (0/1), unemployment (0/1), and being out of the labor force (0/1) for males and females separately. Standard errors clustered by hospital and month in parentheses. SES and demographic controls include: Age at adm., mother’s age at birth, mothers years of schooling, father’ age at birth, father’s years of schooling, mother has prior psych. history (dummy), admitted in own municipality (dummy), greater CPH area (dummy), other metropolitan area (dummy), year dummies. Diagnosis controls include: Dummies for each F. diagnosis category from ICD-10. 137 5.3 Marginal Treatment Effects In the previous section we documented heterogeneous effects across observable characteristics such as gender and age. We now examine heterogeneity across unobserved, or latent, characteristics by estimating the marginal treatment effects of inpatient admission on later admissions, contacts, crime, hospitalisations for drug overdose and lesions, and labour market outcomes.26 Importantly, a sizable fraction of those who are not admitted as inpatients receive some form of outpatient treatment. The counterfactual that people at different margins of treatment face (no admission or outpatient admission) is crucial to our interpretation of the results. However, whether the compliers are on the margin between inpatient admission and outpatient admission or no treatment is inherently unidentified. The differences in Marginal Treatment Effect M T E = δ∆i /δp(Z, X) |UD =UD∗ across UD may be a composite of two different effects which may not be of equal sign. On the one hand, individuals with low UD ’s are only at the margin of treatment when the contact intensity is high and vice versa. Hence UD may be interpreted as an inverse scale of severity of the psychiatric disorder.27 The patients with most severe disorders are likely to benefit more from an inpatient admission, which would result in an upward sloping MTE for subsequent admissions, contacts, crime, hospitalisations due to drug overdose or lesions, and unemployment and a downward sloping MTE for employment. On the other hand, we cannot rule out that various levels of Z also coincide with different counterfactuals to inpatient admission. There is no a-priori knowledge about the ordering of the vast multitude of every imaginable treatment state. Consequently, we cannot identify margins within the alternative to treatment without making further (strong) assumptions about one or both of our instrumental variables (Heckman & Urzua, 2010), 26 We abstain from presenting results from MTEs for spousal labour market outcomes as estimates are inconclusive due to large variance. The results can be obtained from the authors. 27 Or as an alternative formulation in the Roy-model framework, UD = U0 − U1 assuming no costs of admission (UC = 0), see Heckman & Vytlacil (2005). 138 but only note that the weights attached to each alternative embedded in the category No treatment (D=0) may change across UD . Figure III.7 reports the estimated MTEs for the likelihood of subsequent admission to, and contact with, a psychiatric hospital, and crime. Figure III.8 reports the results for drug overdose and lesions, and Figure III.9 reports the effects for the labour market outcomes. Because we estimate the effects on a quarterly basis for admissions, contacts, and crimes, we report the marginal treatment effects jointly across time after first contact and across the latent variable UD (see section 4). We can only identify the marginal treatment effect across the interval of common support, which ranges from .05-.80 (cf. Figure IIIA.2 in Appendix A). As we cannot report confidence intervals in the three axis sub-figures shown in Figure III.7, we instead colour-code the estimates based on whether they are significant and positive at a 10 % level (dark blue), insignificant and positive (light blue), insignificant and negative (light red), or significant and negative (dark red). For labour market outcomes we show 10% confidence intervals for each of the three years after first contact. Figure III.7(a) shows the marginal treatment effect of inpatient admission at first contact on likelihood of later psychiatric hospital admission. The MTE is positive across the entire distribution, but only significant for patients with a high value of UD . The MTE also has a strong upward slope in UD , which suggests that the negative effect of being admitted as an inpatient decreases in severity of disorder. These results also supports the institutionalisation hypothesis – patients with more severe disorders would likely experience re-admission to psychiatric hospitals in either case, whereas patients with milder conditions need to already have a record of admittance to get re-admitted. Figure III.7(b) shows the MTE of inpatient admission on the likelihood of contacting a psychiatric hospital again. Results are insignificant for most values, though negative for the lowest values of UD , while individuals with high values of UD have significantly higher likelihood of contacting a psychiatric hospital again following an inpatient admission at first 139 contact. This support the notion that the benefits of inpatient admission increase as UD decreases. Figure III.7(c) shows the MTE of inpatient admission on number of crimes. The effect is negative across the entire period and all values of latent characteristics. The effects are largest and most persistent for patients with the lowest values of UD . The effects are insignificant for all values of UD after 30 months. There is an overall crime reducing effect of admission upon first contact with a psychiatric hospital that originates during the first months after initial contact. Figure III.8 shows the MTEs for self-harming behaviour through overdoses or lesions. For overdoses (Figure III.8(a)) we find that patients with low values of UD decrease their risk following an inpatient admission, and that this reduction is persistent across the three years we study. In contrast, we find that the risk for hospitalisation due to lesions is significantly lower for all values of UD , albeit the effect is only short-term (Figure III.8(b)). Thus, inpatient admission lowers the likelihood of one dimension of self-harm or suicide attempts persistently for the most at-risk patients, while another dimension of self-harm is uniformly but only temporarily reduced. 140 Figure III.7: Marginal Treatment Effects of Inpatient Admission on Likelihood of Subsequent Admission, Contacts, and Number of Crimes ((a)) Admission ( ((b)) Contact ( / % ( ! " # ( $ % & # $ ( ! % ' ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●●● ● ●● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●●● ●●● ● ● ●● ● ●● ● ●● ● ● ● ● ● ● ● ● ● ● ● ●●● ●●● ● ●● ● ●● ● ● ● ● ● ●● ● ● ● ● ● ● ● ●●●● ● ● ●● ●●● ● ● ● ● ● ● ●● ●●●● ● ●●● ● ● ● ● ● ● ●●● ● ● ● ● ● ● ● ● ●● ●● ●● ●●●● ●●●●● ● ● ● ● ● ● ● ● ●● ●● ●● ●●●●●●● ● ● ● ● ● ● ● ● ● ● ● ● ●● ●● ●●● ●●●●●●●● ● ● ● ● ● ● ● ● ● ●● ●● ●● ●●● ● ● ● ● ● ● ● ● ● ● ● ● ●● ●● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ●● ●●● ● ● ● ● ● ● ● ● ● ●● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ●●● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● # ( ! " # ( $ % & ( # $ ( ( ! % ' ( 0 - . + , ) * ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ●● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ●● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ●● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ●● ●● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ●● ●● ● ● ●● ● ●● ● ● ● ● ● ● ● ●● ●● ● ● ● ● ●● ● ●● ● ● ● ● ● ● ● ● ●● ●● ● ● ● ●● ●● ● ● ● ● ● ●● ● ● ● ● ●● ●● ● ● ● ●● ●● ● ● ● ● ●● ● ● ● ● ● ●● ● ●● ●● ● ●● ● ● ● ● ●● ● ●● ● ● ● ● ● ●● ● ●● ● ● ●● ●● ● ●● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ●● ● ● ●● ● ● ● ● ● ●● ● ●● ●● ● ● ●● ● ● ● ● ● ●● ●●● ● ●● ● ● ● ● ●● ● ● ● ● ● ● ● ● ●● ●● ● ● ● ●● ● ● ●● ● ●● ● ● ● ● ● ● ● ● ● ● ● ●● ●● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ●● ●● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ((c)) Crimes ( ( 6 0 1 2 ( 3 0 & 4 50 ( & # 6 ( 4 ● ● ● ● ● ● ● ● ● ● ● ● ●●●●● ● ●●●● ● ● ●●●●● ● ● ● ●●●● ● ● ● ●●●● ● ● ● ● ●●●● ● ● ● ● ● ● ●●● ● ● ● ● ● ● ●●●● ● ● ● ● ● ● ●●● ●●● ● ● ● ● ● ● ● ● ●● ● ●●● ● ● ● ● ● ●● ● ● ● ●●● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ●●● ● ● ● ● ● ● ● ●● ● ● ● ● ●●● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ●●● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ●●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ●●● ●● ● ● ● ● ● ● ● ● ● ● ●● ● ●●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ●● ● ● ● ● ● ● ● ● ● ● ● ●● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ●● ● ● ● ● ●● ● ● ● ●● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ●● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ●● ● ● ● ●● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ●● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ●● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ●● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ●● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ●● ● ● ● ● ● ● ● ● ● ● ● ● ●● ●● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ●● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ●● ● ● ● ● ● ● ● ● ● ● ●● ●● ● ● ● ●● ● ● ● ● ● ● ● ● ● ●● ● ●● ● ● ● ● ● ● ●● ● ●● ●● ● ● ● ● ● ● ● ● ● ●● ● ●● ●● ● ● ● ● ● ● ● ●● ● ●● ●● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● 0 0 ( 0 6 ( )* + , - . Note: Figures show estimated Marginal Treatment Effects of inpatient admission (0/1) on subsequent re-admission (0/1), contact to psych. hosptial (0/1), and crimes. Time 0 is month of initial contact. Covariates include: Gender (dummy), age at adm., mother’s age at birth, mothers years of schooling, father’ age at birth, father’s years of schooling, mother has prior psych. history (dummy), admitted in own municipality (dummy), greater CPH area (dummy), other metropolitan area (dummy), year dummies, dummies for each F. diagnosis category form ICD-10. Confidence levels are color-coded. Dark blue = positive estimate, p < 0.10; light blue = positive estimate, p > 0.10; light red = negative estimate, p > 0.10; dark red = negative estimate, p < 0.10. 141 Figure III.8: Marginal Treatment Effects of Inpatient Admissions on Subsequent Probability of Being Hospitalized for Overdoses or Lesions During the Subseq. Three Years ((a)) Overdose ( ((b)) Lesion 6 ( $ ! % ( 6 0 ' 7 $ ( 0 & 8 4 4 0 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●●● ●●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●●●●● ●●●●●●●●●●●●●● ● ● ● ● ● ● ● ● ● ● ● ●●● ●●● ● ● ●●●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●●● ●●●● ● ● ● ● ● ●●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●●● ●●● ●●● ● ● ● ● ● ●●● ● ● ● ● ● ● ● ● ● ● ●●● ●●● ●●●● ● ● ● ● ● ● ● ● ● ● ●●● ● ● ● ● ● ● ● ● ● ● ● ● ●●● ● ● ● ● ● ● ● ● ● ● ●●● ●●● ● ● ● ●●● ● ● ● ● ●●● ●●●●●● ● ● ● ● ● ● ● ● ● ● ● ● ●●● ● ● ●●● ● ● ●●● ●●● ● ● ● ● ● ● ● ● ● ● ● ● ●●● ● ● ● ● ● ● ● ● ● ●●● ●●●●● ● ●●● ● ● ●●● ● ●● ● ● ● ●●● ● ● ● ● ●●● ● ● ● ●●● ● ●●● ●●● ● ● ● ●● ● ● ● ● ●● ● ● ● ● ●● ● ● ● ● ● ● ● ●● ● ● ●●● ●● ● ●● ●● ●●● ● ● ● ● ● ●● ● ●● ● ● ●● ● ●● ●● ● ●●● ●● ● ● ● ● ● ● ●● ● ●● ● ● ● ● ● ● ● ● ● ● ●● ● ●● ●● ● ●● ●● ● ● ● ● ● ●● ● ●● ●● ●● ●● ● ● ● ● ● ●● ●● ● ● ● ● ● ● ● ● ● ● ● ●● ● ●● ●● ●● ● ●● ● ● ●● ● ● ● ● ● ● ● ● ●● ●● ●● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ●● ● ●● ● ● ●● ● ● ● ●● ● ● ● ● ● ● ●● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ●● ●● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● K ( M N O O N P Q ( R P P S O ( N ( L R U T V W N R ( 6 S 0 O V X ) * N O 0 P Q ( + , ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●●● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●●● ●●●●●● ●●●●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ●●● ● ● ● ● ●●● ●●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●●●●●●●●●●● ●●●●●●●● ●●●●●●● ●●●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●●● ●●● ● ●●●● ● ● ●●●●● ●●●● ● ● ● ● ● ● ● ● ● ● ● ● ● ●●● ●●●●● ●●●●● ●●●●●●● ●●●●●●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ●● ● ● ● ● ● ● ●●● ● ●●●● ● ● ●●● ●●● ●●●●● ● ● ● ● ● ● ● ●●● ● ● ● ● ●●● ●●●● ● ● ●● ●●● ● ● ● ● ● ● ● ● ● ● ● ● ●●● ●●● ● ● ●● ● ● ●● ●●● ● ● ● ● ● ● ● ● ● ● ●●● ●●● ● ●●● ● ● ●● ●● ● ● ● ●● ●●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ●● ● ●● ●● ● ● ●● ●●● ● ●●● ●●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ●● ● ●● ●●● ● ● ●● ●●● ● ● ●● ● ● ● ● ●● ● ●● ● ● ● ●● ●● ● ● ● ● ● ●● ● ●● ● ● ● ●● ●● ● ●● ●● ● ● ● ● ● ● ●● ● ● ● ● ●● ● ● ●● ●● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ●● ● ● ●● ● ●● ●● ●● ● ● ● ● ● ● ●● ● ●● ●● ● ● ●● ● ●● ● ● ● ● ●● ● ● ● ●● ● ●● ● ● ● ●● ● ● ● ●● ● ● ● ● ● ●● ● ● ● ● ● ● ● ●● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ●● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● = . ● ● ● ● ● ● ● ● ● ● ; - > < 9 ? @ A B : C A F D D E B G H F E I D @ A B F B J Note: Figures show estimated Marginal Treatment Effects of inpatient admission (0/1) on the risk of ever being hospitalized the next three years for self-harm due to overdose (0/1), or lesion (0/1). Time 0 is month of initial contact. Covariates include: Gender (dummy), age at adm., mother’s age at birth, mothers years of schooling, father’ age at birth, father’s years of schooling, mother has prior psych. history (dummy), admitted in own municipality (dummy), greater CPH area (dummy), other metropolitan area (dummy), year dummies, dummies for each F. diagnosis category form ICD-10. Confidence levels are color-coded. Dark blue = positive estimate, p < 0.10; light blue = positive estimate, p > 0.10; light red = negative estimate, p > 0.10; dark red = negative estimate, p < 0.10. Finally, Figure III.9 shows the MTEs for labor market outcomes one, two, and three years after first contact. The patients with the lowest values of UD experience the largest drop in employment three years after first contact, whereas the effect for patients with high values of UD are negative but insignificant. The patients with low values of UD also have higher risk of being out of the labor force following an inpatient admission at first contact. 142 Figure III.9: Marginal Treatment Effects of Inpatient Admissions on Labour Market Outcomes ((a)) Employment in year 1 ((b)) Employment in year 2 ((c)) Employment in year 3 ((d)) Unemployment in year 1 ((e)) Unemployment in year 2 ((f)) Unemployment in year 3 ((g)) Out of labor force in year 1 ((h)) Out of labor force in year 2 ((i)) Out of labor force in year 3 Note: Figures show estimated Marginal Treatment Effects of inpatient admission (0/1) on subsequent employment (0/1), unemployment (0/1), and being out of the labor force (0/1). Time 0 is month of initial contact. Covariates include: Gender (dummy), age at adm., mother’s age at birth, mothers years of schooling, father’ age at birth, father’s years of schooling, mother has prior psych. history (dummy), admitted in own municipality (dummy), greater CPH area (dummy), other metropolitan area (dummy), year dummies, dummies for each F. diagnosis category form ICD-10. The dashed lines indicates 90% confidence intervals. 143 Taken together, the MTE estimates suggest two very different dimensions through which inpatient admission may affect the patients’ later life. One is institutionalizing and results in increased admission rates and lower labour market attachment for patients with less severe disorders. The other arises as patients with more serious disorders are given immediate help and thus also incapacitated. This lowers the risks of overdoses and crime. 5.4 Effect of Admittance on Spouses’ Labour Market Outcomes Mental illness may also have effects beyond the life of the patient. Therefore, we also consider how an inpatient admission affects spouses to people who contact the psychiatric care system. Table III.8 reports the effect of inpatient admission at first contact on the spouse’s labour market outcomes. 5,763 people in the data had an identifiable spouse at their first contact with the psychiatric system. Columns 2, 4, and 6 present outcomes from OLS models, and columns 3, 5, and 7 report 2SLS results. We follow the spouses for three years, regardless of whether they remain in the relationship across the period or not. The spouses of inpatients have slightly higher employment rates (column 2) and are less out of the labour force (column 6) one year after first contact than spouses of people who do not receive inpatient treatment. The 2SLS results show that inpatient admission at first contact has substantial and at least borderline significant effect on spouses’ labour market outcomes, even though the table also shows that the smaller sample results in large standard errors, as could be expected. Three years after first contact, spouses are significantly less unemployed (23.9 percentage points drop) and borderline significantly more employed (26.6 percentage points increase). There is no effect on the probability of being outside the labour force. The effect sizes in Table III.8 should be viewed in the light of the possible strain posed by living with an untreated psychiatric patients. The counterfactual to life with an untreated patient—admission as an inpatient—will not only result in short run relief to the spouse, but also in subsequent admissions and consequently further relief, cf. Figure III.2. 144 Table III.8: Estimation Results on Spouses’ Labor Market Outcomes First year since contact Second year since contact Third year since contact Observations Employment OLS 2SLS 0.030∗ 0.246+ (0.013) (0.146) 0.008 0.221 (0.013) (0.145) 0.004 0.266+ (0.013) (0.156) 5763 5763 Unemployment OLS 2SLS -0.004 -0.195+ (0.009) (0.106) -0.006 -0.161+ (0.009) (0.092) 0.009 -0.239∗ (0.009) (0.107) 5763 5763 Out of labor force OLS 2SLS -0.026∗ -0.052 (0.012) (0.113) -0.002 -0.059 (0.011) (0.119) -0.013 -0.028 (0.011) (0.119) 5763 5763 + p < .10;∗ p < .05;∗ ∗p < .01; ∗ ∗ ∗p < .001 Note: Table shows 2SLS regression results of inpatient admission (0/1) on his/her spouse’s subsequent employment (0/1), unemployment (0/1), and being out of the labor force (0/1). Standard errors clustered by hospital and month in parentheses. SES and demographic controls include: Gender (dummy), age at adm., mother’s age at birth, mothers years of schooling, father’ age at birth, father’s years of schooling, mother has prior psych. history (dummy), admitted in own municipality (dummy), greater CPH area (dummy), other metropolitan area (dummy), year dummies. Diagnosis controls include: Dummies for each F. diagnosis category from ICD-10. 6 Conclusion Treatment of psychiatric patients in the Western world has gone through radical changes during the 20th century. For several decades, the main function of psychiatric treatment has been downsized and partly moved away from hospitals towards community-based care. Although the quality of psychiatric treatment has increased substantially during the last century, medical professionals and the medical research community have recently voiced their concern that the move away from admitting patients into inpatient care may have been too substantial. In this paper we have used hospital specific contact intensity during the weeks prior to a patient’s first contact with the mental health system to identify the effects of admitting patients immediately to inpatient care at first contact relative to not admitting them at first contact. Using a sample of all first contacts to psychiatric hospitals by individuals aged 18-45 between 1999 and 2001, we showed that admitting a first time patient on the margin of being admitted had large but ambiguous effects. We found that in the short run, inpatient treatment leads to less crime and self-harming behaviour, especially among men 145 and the patients who suffer the most severe conditions. We also found significant effects on the probability of subsequent admissions and labour market outcomes, especially for women and, in the case of admission, for people with less severe disorders. For labour market outcomes, we found that women and people with the most severe disorders left employment and decreased labour force participation. Both the effects on admission and labour market outcomes increased in magnitudes across the subsequent three years after first contact. We found little or no effect on the likelihood of contacting a psychiatric hospital again, which indicates that admission at first contact affects how the mental health system treats patients after initial contact, but does not affect the patients’ demand for subsequent treatment. We also found persistent effects on the labour market outcomes for patients’ spouses—inpatient admission led to significant lower unemployment levels and borderline significant higher employment levels for spouses the three subsequent years after first contact. Mental health issues are causes of strain and distress for the patients, the immediate family, and for society as a whole. To make the adverse impact of mental disorders as small as possible, society should aim at optimizing the available treatment possibilities for patients. Yet our study has shown that this is not a simple task. When only considering whether to admit more patients as inpatients or not, we found that inpatient admission has very different effects across the dimensions of social life and also depends on the severity of a patient’s condition. If the aim is to lower crime rates among mental health patients with more severe conditions, it would make sense to increase the number of available inpatient beds. If the aim, however, is to increase labour market affiliation, we should decrease the number of beds. 146 References Anderson, D Mark. 2014. In school and out of trouble? The minimum dropout age and juvenile crime. Review of Economics and Statistics, 96(2), 318–331. Appelbaum, Paul S. 2001. Thinking carefully about outpatient commitment. Psychiatric Services, 52(3), 347–350. Bengtsson, Steen. 2011. Danmark venter stadig på sin psykiatrireform [Denmark still awaits its mental health care reform]. SFI - The Danish National Centre for Social Research. Björklund, Anders, & Moffitt, Robert. 1987. The estimation of wage gains and welfare gains in self-selection models. The Review of Economics and Statistics, 1, 42–49. Castel, Françoise, Castel, Robert, & Lovell, Anne. 1982. The psychiatric society. New York, NY: Columbia University Press. Creed, Francis, Black, Dawn, Anthony, Philip, Osborn, Madeline, Thomas, Philip, & Tomenson, Barbara. 1990. Randomised controlled trial of day patient versus inpatient psychiatric treatment. BMJ: British Medical Journal, 300(6731), 1033. Dahl, Gordon B, Kostol, Andreas Ravndal, & Mogstad, Magne. 2013. Family welfare cultures. Tech. rept. National Bureau of Economic Research. Davis, Karen, & Russell, Louise B. 1972. The substitution of hospital outpatient care for inpatient care. The Review of Economics and Statistics, 54(2), 109–120. Ettner, Susan L, Frank, Richard G, & Kessler, Ronald C. 1997. The Impact of psychiatric disorders on labor market outcomes. Industrial and Labor Relations Review, 51(1), 64–81. Fazel, Seena, Wolf, Achim, Chang, Zheng, Larsson, Henrik, Goodwin, Guy M, & Lichtenstein, Paul. 2015. Depression and violence: a Swedish population study. The Lancet Psychiatry, 2(3), 224–232. 147 Frank, Richard G., & Glied, Sherry A. 2006. Better But Not Well: Mental Health Policy in the United States Since 1950. Baltimore: JHU Press. Frank, Richard G., & McGuire, Thomas G. 2000. Chapter 16: Economics and mental health. Pages 893 – 954 of: Culyer, Anthony J., & Newhouse, Joseph P. (eds), Handbook of Health Economics. Handbook of Health Economics, vol. 1, Part B. Amsterdam: Elsevier. Gijswijt-Hofstra, Marijke, Oosterhuis, Harry, Vijselaar, Joost, & Freeman, Hugh. 2005. Psychiatric Cultures Compared: Psychiatry and Mental Health Care in the Twentieth Century: Comparisons and Approaches. Amsterdam: Amsterdam University Press. Goffman, Erving. 1961. Asylums: Essays on the Social Situation of Mental Patients and Other Inmates. New York: Anchor Books. Goodwin, Simon. 1997. Comparative mental health policy: from institutional to community care. New York: Sage. Greve, Jane, & Nielsen, Louise Herrup. 2013. Useful beautiful minds: An analysis of the relationship between schizophrenia and employment. Journal of Health Economics, 32(6), 1066 – 1076. Heckman, James J, & Honore, Bo E. 1990. The Empirical Content of the Roy Model. Econometrica, 58(5), 1121–1149. Heckman, James J, & Urzua, Sergio. 2010. Comparing IV with structural models: What simple IV can and cannot identify. Journal of Econometrics, 156(1), 27–37. Heckman, James J, & Vytlacil, Edward. 2005. Structural Equations, Treatment Effects, and Econometric Policy Evaluation. Econometrica, 73(3), 669–738. Heckman, James J, Urzua, Sergio, & Vytlacil, Edward. 2006. Understanding instrumental variables in models with essential heterogeneity. The Review of Economics and Statistics, 88(3), 389–432. 148 Hoff, F, Knipsel, A, Schneider, J, Beeley, C, Aagaard, Jørgen, Putten, M, Keet, R, & Snuverik, S. 2012. Outpatient care and community support for persons with severe mental health problems. A comparison of national policies and systems in Denmark, England and the Netherlands. Trimbos Institut, Utrecht. Jacob, Brian A., & Lefgren, Lars. 2003. Are Idle Hands the Devil’s Workshop? Incapacitation, Concentration, and Juvenile Crime. American Economic Review, 93(5), 1560–1577. Kallert, Thomas W, Priebe, Stefan, McCabe, Rosemarie, Kiejna, Andrzej, Rymaszewska, Joanna, Nawka, Petr, Ocvar, Ladislav, Raboch, Jiri, Starkova-Kalisova, Lucie, Koch, Rainer, et al. . 2007. Are Day Hospitals Effective for Acutely III Psychiatric Patients? A European Multicenter Randomized Controlled Trial. Journal of Clinical Psychiatry, 68(2), 278–287. Killaspy, Helen. 2006. From the asylum to community care: learning from experience. British Medical Bulletin, 79(1), 245–258. Kirkebøen, Lares, Leuven, Edwin, & Mogstad, Magne. 2014. Field of study, earnings, and self-selection. NBER Working Paper. Knowles, Caroline. 2005. Bedlam on the Streets. New York: Routledge. Kupers, Terry Allen, & Toch, Hans. 1999. Prison madness: The mental health crisis behind bars and what we must do about it. San Francisco, CA: Jossey-Bass. Marshall, Max, Crowther, Ruth, Almaraz-Serrano, Ana M, & Tyrer, Peter. 2009. Day hospital versus out-patient care for psychiatric disorders. In: Cochrane Database of Systematic Reviews. John Wiley & Sons, Ltd. Mojtabai, Ramin, & Olfson, Mark. 2011. Proportion of antidepressants prescribed without a psychiatric diagnosis is growing. Health Affairs, 30(8), 1434–1442. 149 Moustsen, Ida R, Larsen, Signe B, Vibe-Petersen, Jette, Trier, Karen, Bidstrup, Pernille E, Andersen, Klaus K, Johansen, Christoffer, & Dalton, Susanne O. 2015. Social position and referral to rehabilitation among cancer patients. Acta Oncologica, 1–7. Noh, Samuel, & Turner, R Jay. 1987. Living with psychiatric patients: Implications for the mental health of family members. Social Science & Medicine, 25(3), 263–272. Oosterhuis, Harry. 2005. Outpatient psychiatry and mental health care in the twentieth century: International perspectives. In: Gijswijt-Hofstra, Marijke, Oosterhuis, Harry, Vijselaar, Joost, & Freeman, Hugh (eds), Psychiatric Cultures Compared: Psychiatry and Mental Health Care in the Twentieth Century: Comparisons and Approaches. Amsterdam: Amsterdam University Press. Scheffler, Richard M., & Watts, Carolyn A. 1986. Determinants of Inpatient Mental Health Use in a Heavily Insured Population. The Journal of Human Resources, 21(3), pp. 338– 358. Shek, Elena, Stein, Airton T, Shansis, Flavio M, Marshall, Max, Crowther, Ruth, & Tyrer, Peter. 2010. Day hospital versus outpatient care for people with schizophrenia. In: Cochrane Database of Systematic Reviews. John Wiley & Sons, Ltd. Staiger, Douglas, & Stock, James H. 1997. Instrumental Variables Regression with Weak Instruments. Econometrica: Journal of the Econometric Society, 65(3), 557–586. The Lancet. 2011. Editorial address: The need for asylum. The Lancet, 378(9785), 1. Vitikainen, Kirsi, Linna, Miika, & Street, Andrew. 2010. Substituting inpatient for outpatient care: What is the impact on hospital costs and efficiency? The European Journal of Health Economics, 11(4), 395–404. WHO. 2011a. Mental Health Atlas. Geneva, CH: WHO. WHO. 2011b. Mental Health Atlas country profiles ”Denmark”. Geneva, CH: WHO. 150 Winerip, Michael. 1999. Bedlam on the Streets. New York Time Magazine, May 23, 12,13,45–49,56,65,70. 151 A Supplementary Results Table IIIA.1: Regression of GP and Specialists Visits on the Instruments GP visits, year of admission IV, day 1-7 IV, day 8-14 GP visits, year prior to admission IV, day 1-7 IV, day 8-14 Specialists visits, year of admission IV, day 1-7 IV, day 8-14 Specialists visits, year prior to admission IV, day 1-7 IV, day 8-14 SES controls Diagnosis controls Observations Model 1 Model 2 Model 3 -0.558 (0.681) -0.507 (0.653) -0.495 (0.652) -0.389 (0.704) 0.223 (0.672) 0.205 (0.673) 0.220 (0.593) 0.137 (0.577) 0.152 (0.580) -0.582 (0.617) -0.146 (0.600) -0.153 (0.600) 0.239 (0.214) 0.132 (0.212) 0.127 (0.213) 0.333 (0.219) 0.259 (0.217) 0.259 (0.218) 0.122 (0.189) 0.033 (0.188) 0.030 (0.187) 0.008 (0.207) -0.058 (0.204) X 24,277 24,277 -0.054 (0.204) X X 24,277 Note: Table shows OLS regression results of the two intrumental variables (hospital specific contact intensity the two week prior to the individual’s initial contact) on number of visits to general practioners and specialists the year of and the year prior to initial contact. Standard errors clustered by hospital and month in parentheses. SES and demographic controls include: Gender (dummy), age at adm., mother’s age at birth, mothers years of schooling, father’ age at birth, father’s years of schooling, mother has prior psych. history (dummy), admitted in own municipality (dummy), greater CPH area (dummy), other metropolitan area (dummy), year dummies. Diagnosis controls include: Dummies for each F. diagnosis category from ICD-10. + p < .10;∗ p < .05;∗∗ p < .01; ∗∗∗ p < .001 152 Table IIIA.2: Estimation Results of Covariates Regressed on the Instruments Male Age at admission Mother’s age at birth Mother’s months of schooling Year 1999 Year 2000 Mother has been admitted Mother missing Father’s months of schooling Father missing Admitted in own municipality Admitted in Copenhagen Admitted in metropolitan area Disorder associated with substance use Psychosis, schizophrenia Nervous or stress-related Personality disorder Affective/emotionel, pre-adult origin Previous crime (0/1) Out of labor force in year -1 Unemployed in year -1 Observations IV day 1-7 IV day 8-14 -0.006∗ -0.003 (0.002) (0.002) -0.000 -0.000 (0.000) (0.000) 0.000 -0.000 (0.000) (0.000) 0.000∗ 0.000∗ (0.000) (0.000) -0.023 -0.026+ (0.015) (0.015) 0.014 0.011 (0.015) (0.015) 0.001 0.000 (0.004) (0.004) 0.002 0.000 (0.005) (0.005) 0.000 0.000∗ (0.000) (0.000) 0.009∗ 0.010∗ (0.004) (0.004) -0.002 -0.005 (0.005) (0.005) 0.040∗∗∗ 0.043∗∗∗ (0.009) (0.009) 0.036∗∗ 0.036∗∗ (0.012) (0.012) -0.001 -0.000 (0.005) (0.005) -0.000 -0.001 (0.006) (0.006) 0.000 0.000 (0.005) (0.005) -0.006 -0.007 (0.005) (0.006) 0.009 0.014 (0.009) (0.009) -0.001 -0.004 (0.003) (0.003) -0.005 -0.007∗ (0.003) (0.003) -0.003 -0.000 (0.004) (0.004) 24,277 24,277 Note: Table shows OLS regression results of the two intrumental variables (hospital specific contact intensity the two week prior to the individual’s initial contact) on gender (dummy), age at adm., mother’s age at birth, mothers years of schooling, father’ age at birth, father’s years of schooling, mother has prior psych. history (dummy), admitted in own municipality (dummy), greater CPH area (dummy), other metropolitan area (dummy), year dummies, and dummies for each F. diagnosis category form ICD-10. Standard errors clustered by hospital and month in parentheses. + p < .10;∗ p < .05;∗∗ p < .01; ∗∗∗ p < .001 153 Table IIIA.3: Estimation Results on Likelihood of Subsequent Admission to a Psychiatric Hospital 3 months from 1st contact 6 months from 1st contact 9 months from 1st contact 12 months from 1st contact 15 months from 1st contact 18 months from 1st contact 21 months from 1st contact 24 months from 1st contact 27 months from 1st contact 30 months from 1st contact 33 months from 1st contact 36 months from 1st contact Observations SES and demograpic controls Diagnosis controls OLS 0.115∗∗∗ (0.005) 0.140∗∗∗ (0.005) 0.150∗∗∗ (0.006) 0.167∗∗∗ (0.006) 0.175∗∗∗ (0.006) 0.181∗∗∗ (0.006) 0.188∗∗∗ (0.006) 0.191∗∗∗ (0.006) 0.195∗∗∗ (0.006) 0.196∗∗∗ (0.006) 0.198∗∗∗ (0.006) 0.200∗∗∗ (0.006) 24,277 X X 2SLS 0.152∗ (0.064) 0.134+ (0.078) 0.129 (0.081) 0.115 (0.081) 0.146+ (0.078) 0.148+ (0.078) 0.188∗ (0.078) 0.184∗ (0.078) 0.176∗ (0.078) 0.179∗ (0.079) 0.172∗ (0.078) 0.186∗ (0.078) 24,277 2SLS 0.160∗ (0.066) 0.162∗ (0.077) 0.158∗ (0.079) 0.146+ (0.080) 0.180∗ (0.078) 0.181∗ (0.077) 0.222∗∗ (0.078) 0.216∗∗ (0.079) 0.205∗ (0.080) 0.210∗∗ (0.081) 0.199∗ (0.080) 0.214∗∗ (0.080) 24,277 X 2SLS 0.160∗ (0.068) 0.160∗ (0.080) 0.155+ (0.082) 0.142+ (0.083) 0.176∗ (0.079) 0.176∗ (0.078) 0.217∗∗ (0.070) 0.211∗∗ (0.080) 0.199∗ (0.080) 0.204∗ (0.081) 0.193∗ (0.080) 0.208∗∗ (0.079) 24,277 X X Note: Table shows OLS and 2SLS regression results of inpatient admission (0/1) on subsequent re-admisison (0/1) corresponding to Figure III.2. Time 0 is month of initial contact. Standard errors clustered by hospital and month in parentheses. SES and demographic controls include: Gender (dummy), age at adm., mother’s age at birth, mothers years of schooling, father’ age at birth, father’s years of schooling, mother has prior psych. history (dummy), admitted in own municipality (dummy), greater CPH area (dummy), other metropolitan area (dummy), year dummies. Diagnosis controls include: Dummies for each F. diagnosis category from ICD10. + p < .10;∗ p < .05;∗∗ p < .01; ∗∗∗ p < .001 154 Table IIIA.4: Estimation Results on Likelihood of Subsequent Contact with Psychiatric Hospitals 3 months from 1st contact 6 months from 1st contact 9 months from 1st contact 12 months from 1st contact 15 months from 1st contact 18 months from 1st contact 21 months from 1st contact 24 months from 1st contact 27 months from 1st contact 30 months from 1st contact 33 months from 1st contact 36 months from 1st contact Observations OLS 0.247∗∗∗ (0.006) 0.261∗∗∗ (0.007) 0.257∗∗∗ (0.0067) 0.256∗∗∗ (0.007) 0.251∗∗∗ (0.007) 0.247∗∗∗ (0.007) 0.243∗∗∗ (0.007) 0.236∗∗∗ (0.007) 0.233∗∗∗ (0.007) 0.228∗∗∗ (0.007) 0.225∗∗∗ (0.007) 0.223∗∗∗ (0.007) 24,277 2SLS 0.079 (0.091) 0.117 (0.096) 0.105 (0.092) 0.063 (0.091) 0.0790 (0.086) 0.0806 (0.086) 0.124 (0.087) 0.0977 (0.087) 0.109 (0.086) 0.109 (0.087) 0.0913 (0.088) 0.072 (0.088) 24,277 2SLS 0.093 (0.096) 0.160 (0.099) 0.147 (0.094) 0.098 (0.094) 0.110 (0.090) 0.110 (0.090) 0.157+ (0.090) 0.127 (0.091) 0.139 (0.091) 0.137 (0.091) 0.118 (0.092) 0.097 (0.093) 24,277 2SLS 0.096 (0.097) 0.160 (0.103) 0.145 (0.099) 0.094 (0.097) 0.105 (0.093) 0.105 (0.093) 0.152 (0.093) 0.122 (0.094) 0.133 (0.093) 0.130 (0.093) 0.111 (0.094) 0.090 (0.094) 24,277 Note: Table shows OLS and 2SLS regression results of inpatient admission (0/1) on subsequentcontact to psych. hospitals (0/1) corresponding to Figure III.3. Time 0 is month of initial contact. Standard errors clustered by hospital and month in parentheses. SES and demographic controls include: Gender (dummy), age at adm., mother’s age at birth, mothers years of schooling, father’ age at birth, father’s years of schooling, mother has prior psych. history (dummy), admitted in own municipality (dummy), greater CPH area (dummy), other metropolitan area (dummy), year dummies. Diagnosis controls include: Dummies for each F. diagnosis category from ICD-10. + p < .10;∗ p < .05;∗∗ p < .01; ∗∗∗ p < .001 155 Table IIIA.5: Estimation Results on Subsequent Crimes 3 months from 1st contact 6 months from 1st contact 9 months from 1st contact 12 months from 1st contact 15 months from 1st contact 18 months from 1st contact 21 months from 1st contact 24 months from 1st contact 27 months from 1st contact 30 months from 1st contact 33 months from 1st contact 36 months from 1st contact Observations OLS 0.008∗∗∗ (0.002) 0.023∗∗∗ (0.005) 0.036∗∗∗ (0.010) 0.055∗∗∗ (0.015) 0.078∗∗∗ (0.020) 0.106∗∗∗ (0.027) 0.138∗∗∗ (0.034) 0.174∗∗∗ (0.041) 0.217∗∗∗ (0.050) 0.266∗∗∗ (0.059) 0.323∗∗∗ (0.069) 0.388∗∗∗ (0.079) 24,277 2SLS -0.034 (0.026) -0.101+ (0.061) -0.182+ (0.110) -0.275 (0.172) -0.363 (0.242) -0.468 (0.322) -0.580 (0.409) -0.678 (0.503) -0.758 (0.602) -0.786 (0.706) -0.787 (0.821) -0.781 (0.943) 24,277 2SLS -0.054+ (0.0285) -0.153∗ (0.0675) -0.270∗ (0.121) -0.405∗ (0.189) -0.541∗ (0.265) -0.706∗ (0.353) -0.886∗ (0.449) -1.057+ (0.551) -1.223+ (0.660) -1.339+ (0.774) -1.437 (0.899) -1.536 (1.033) 24,277 2SLS -0.056+ (0.0291) -0.156∗ (0.0690) -0.277∗ (0.124) -0.416∗ (0.193) -0.557∗ (0.270) -0.729∗ (0.360) -0.916∗ (0.456) -1.096+ (0.561) -1.270+ (0.673) -1.396+ (0.789) -1.504 (0.916) -1.615 (1.051) 24,277 Note: Table shows OLS and 2SLS regression results of inpatient admission (0/1) on subsequent crimes corresponding to Figure III.4. Time 0 is month of initial contact. Standard errors clustered by hospital and month in parentheses. SES and demographic controls include: Gender (dummy), Age at adm., mother’s age at birth, mothers years of schooling, father’ age at birth, father’s years of schooling, Mother has prior psych. history (dummy), Admitted in own municipality (dummy), Greater CPH area (dummy), Other metropolitan area (dummy), year dummies. Diagnosis controls include: Dummies for each F. diagnosis category form ICD10. + p < .10;∗ p < .05;∗∗ p < .01; ∗∗∗ p < .001 156 Table IIIA.6: Regional Robustness Checks Without Copenhagen OLS 2SLS Crime 3 months after first contact 6 months after first contact 9 months after first contact 12 months after first contact 15 months after first contact 18 months after first contact 21 months after first contact 24 months after first contact 27 months after first contact 30 months after first contact 33 months after first contact 36 months after first contact Admissions 3 months after first contact 6 months after first contact 9 months after first contact 12 months after first contact 15 months after first contact 18 months after first contact 21 months after first contact 24 months after first contact 27 months after first contact 30 months after first contact 33 months after first contact 36 months after first contact Contacts 3 months after first contact Without major cities OLS 2SLS 0.008∗∗ (0.003) 0.013∗∗∗ (0.004) 0.013∗ (0.005) 0.0123∗ (0.006) 0.016∗ (0.007) 0.020∗∗ (0.007) 0.024∗∗ (0.008) 0.028∗∗ (0.009) 0.032∗∗∗ (0.010) 0.040∗∗∗ (0.011) 0.046∗∗∗ (0.011) 0.054∗∗∗ (0.012) −0.040 (0.026) −0.084∗ (0.040) −0.096+ (0.056) −0.118+ (0.068) −0.115 (0.075) −0.137 (0.086) −0.152+ (0.091) −0.146 (0.099) −0.151 (0.105) −0.096 (0.109) −0.070 (0.119) −0.065 (0.126) 0.010∗∗∗ (0.003) 0.014∗∗ (0.004) 0.012∗ (0.005) 0.0132∗ (0.006) 0.017∗ (0.007) 0.022∗∗ (0.008) 0.026∗∗ (0.009) 0.030∗∗ (0.010) 0.034∗∗ (0.011) 0.040∗∗∗ (0.012) 0.045∗∗∗ (0.012) 0.054∗∗∗ (0.013) 0.116∗∗∗ (0.005) 0.143∗∗∗ (0.006) 0.158∗∗∗ (0.006) 0.173∗∗∗ (0.006) 0.182∗∗∗ (0.006) 0.189∗∗∗ (0.006) 0.197∗∗∗ (0.006) 0.200∗∗∗ (0.007) 0.205∗∗∗ (0.007) 0.207∗∗∗ (0.007) 0.208∗∗∗ (0.007) 0.210∗∗∗ (0.007) 0.171∗∗ (0.066) 0.168∗ (0.077) 0.164∗ (0.079) 0.148+ (0.079) 0.185∗ (0.075) 0.181∗ (0.074) 0.212∗∗ (0.074) 0.214∗∗ (0.074) 0.202∗∗ (0.074) 0.204∗∗ (0.076) 0.195∗∗ (0.075) 0.211∗∗ (0.074) 0.115∗∗∗ (0.006) 0.137∗∗∗ (0.007) 0.152∗∗∗ (0.007) 0.167∗∗∗ (0.007) 0.175∗∗∗ (0.007) 0.182∗∗∗ (0.007) 0.188∗∗∗ (0.007) 0.192∗∗∗ (0.007) 0.196∗∗∗ (0.007) 0.197∗∗∗ (0.007) 0.199∗∗∗ (0.008) 0.202∗∗∗ (0.008) 0.124∗ (0.053) 0.133∗ (0.061) 0.140∗ (0.064) 0.140∗ (0.064) 0.153∗ (0.063) 0.143∗ (0.064) 0.180∗∗ (0.065) 0.184∗∗ (0.065) 0.176∗∗ (0.066) 0.180∗∗ (0.067) 0.172∗∗ (0.067) 0.179∗∗ (0.065) 0.252∗∗∗ (0.007) 0.121 (0.097) 0.251∗∗∗ (0.009) 0.121 (0.074) 157 −0.034 (0.021) −0.053 (0.033) −0.049 (0.046) −0.0559 (0.055) −0.059 (0.062) −0.088 (0.070) −0.090 (0.075) −0.086 (0.081) −0.089 (0.087) −0.054 (0.095) −0.023 (0.103) −0.011 (0.110) 6 months after first contact 9 months after first contact 12 months after first contact 15 months after first contact 18 months after first contact 21 months after first contact 24 months after first contact 27 months after first contact 30 months after first contact 33 months after first contact 36 months after first contact Employment 12 months after first contact 24 months after first contact 36 months after first contact Unemployment 12 months after first contact 24 months after first contact 36 months after first contact Out of Labor Force 12 months after first contact 24 months after first contact 36 months after first contact Observations 0.268∗∗∗ (0.007) 0.264∗∗∗ (0.007) 0.264∗∗∗ (0.007) 0.259∗∗∗ (0.008) 0.256∗∗∗ (0.008) 0.252∗∗∗ (0.008) 0.246∗∗∗ (0.008) 0.243∗∗∗ (0.008) 0.239∗∗∗ (0.008) 0.236∗∗∗ (0.008) 0.233∗∗∗ (0.008) 0.160 (0.102) 0.148 (0.098) 0.0993 (0.095) 0.106 (0.092) 0.105 (0.092) 0.148 (0.092) 0.128 (0.092) 0.133 (0.091) 0.132 (0.091) 0.116 (0.091) 0.0980 (0.091) 0.263∗∗∗ (0.008) 0.260∗∗∗ (0.008) 0.259∗∗∗ (0.008) 0.253∗∗∗ (0.008) 0.249∗∗∗ (0.008) 0.245∗∗∗ (0.009) 0.239∗∗∗ (0.009) 0.236∗∗∗ (0.009) 0.232∗∗∗ (0.009) 0.229∗∗∗ (0.009) 0.227∗∗∗ (0.009) 0.153∗ (0.076) 0.138+ (0.072) 0.080 (0.073) 0.070 (0.072) 0.067 (0.072) 0.096 (0.074) 0.084 (0.074) 0.087 (0.074) 0.090 (0.074) 0.081 (0.074) 0.073 (0.073) −0.0525∗∗∗ (0.008) −0.0468∗∗∗ (0.008) −0.0483∗∗∗ (0.008) −0.0834 (0.092) −0.130 (0.080) −0.238∗∗ (0.077) −0.0517∗∗∗ (0.009) −0.0496∗∗∗ (0.009) −0.0526∗∗∗ (0.009) −0.160∗ (0.0726) −0.189∗∗ (0.0658) −0.272∗∗∗ (0.0669) −0.039 (0.070) 0.066 (0.065) 0.056 (0.056) 0.019∗∗ (0.007) 0.016∗ (0.007) 0.007 (0.006) 0.050 (0.056) 0.111∗ (0.053) 0.109∗ (0.048) 0.033∗∗∗ (0.008) 0.034∗∗∗ (0.008) 0.046∗∗∗ (0.008) 0.110+ (0.061) 0.078 (0.064) 0.163∗∗ (0.060) 0.015∗ (0.006) 0.012∗ (0.006) 0.007 (0.006) 0.038∗∗∗ (0.007) 0.035∗∗∗ (0.007) 0.041∗∗∗ (0.007) 19595 0.123+ (0.074) 0.064 (0.081) 0.182∗∗ (0.070) 19595 15898 15898 ) Note: Table shows 2SLS regression results of inpatient admission (0/1) on subsequent re-admisison (0/1), contact to psych. hospitals (0/1), and crimes while excluding the greater Copenhagen area. Time 0 is month of initial contact. Standard errors clustered by hospital and month in parentheses. SES and demographic controls include: Gender (dummy), age at adm., mother’s age at birth, mothers years of schooling, father’ age at birth, father’s years of schooling, mother has prior psych. history (dummy), admitted in own municipality (dummy), greater CPH area (dummy), other metropolitan area (dummy), year dummies. Diagnosis controls include: Dummies for each F. diagnosis category from ICD-10. + p < .10;∗ p < .05;∗∗ p < .01; ∗∗∗ p < .001 159 Not Admitted, Female 0.120 (0.325) 30.401 (7.497) 179003.786 (107239.963) 0.123 (0.329) 0.168 (0.299) 0.386 (0.401) 128.149 (34.394) 137.994 (34.794) 26.275 (4.993) 0.170 (0.376) 0.214 (0.410) 0.145 (0.352) 0.050 (0.217) 0.022 (0.148) 0.203 (0.402) 0.571 (0.495) 0.123 (0.328) 0.031 (0.174) 9763 Admitted, Female 0.122 (0.327) 31.778 (7.593) 180252.508 (103035.132) 0.159 (0.366) 0.187 (0.309) 0.353 (0.387) 122.925 (33.387) 133.954 (34.734) 26.249 (4.935) 0.156 (0.363) 0.161 (0.368) 0.166 (0.372) 0.107 (0.309) 0.146 (0.353) 0.309 (0.462) 0.340 (0.474) 0.088 (0.284) 0.010 (0.100) 3268 Not Admitted, Male 0.135 (0.342) 31.394 (7.518) 213254.889 (157252.309) 0.435 (0.496) 0.192 (0.308) 0.300 (0.368) 125.263 (34.406) 136.200 (35.076) 26.132 (5.025) 0.161 (0.367) 0.196 (0.397) 0.156 (0.363) 0.228 (0.420) 0.056 (0.230) 0.148 (0.355) 0.413 (0.492) 0.111 (0.314) 0.043 (0.204) 7165 Note: Table shows means and std. dev. of covariates for the full sample divided by treatment status and gender. Observations Affective/emotionel, pre-adult origin Personality disorder Nervous or stress-related Affective disorder Psychosis, schizophrenia Disorder associated with substance use Admitted in metropolitan area Admitted in Copenhagen Admitted in own municipality Mother’s age at birth Father’s months of schooling Mother’s months of schooling Welfare dependency in year -1 Unemployment degree in year -1 Previous crime (0/1) Gross income in year -1 Age at admission Immigrant Table IIIA.7: Summary statistics by gender and treatment status Admitted, Male 0.126 (0.332) 31.959 (7.451) 209616.192 (141716.768) 0.479 (0.500) 0.196 (0.307) 0.284 (0.359) 120.085 (33.558) 131.877 (35.292) 26.138 (5.227) 0.158 (0.365) 0.161 (0.367) 0.152 (0.359) 0.298 (0.458) 0.174 (0.379) 0.189 (0.392) 0.276 (0.447) 0.057 (0.232) 0.005 (0.073) 4081 Figure IIIA.1: Effect on Subsequent Crimes by Months Since 1st Contact for Outpatients and 1st Discharge for Inpatients Note: Figure shows 2SLS regression results of inpatient admission (0/1) on subsequent crimes where time 0 is defined as hospital discharge instead of initial contact. The dashed lines indicate 95% confidence intervals and the dotted line indicate 90% confidence intervals. Standard errors clustered by hospital and month in parentheses. SES and demographic controls include: Gender (dummy), age at adm., mother’s age at birth, mothers years of schooling, father’ age at birth, father’s years of schooling, mother has prior psych. history (dummy), admitted in own municipality (dummy), greater CPH area (dummy), other metropolitan area (dummy), year dummies. Diagnosis controls include: Dummies for each F. diagnosis category from ICD-10. 160 Figure IIIA.2: Common Support for Treated and Untreated Note: Figure shows areas of common support for treatment and control groups from a probit model of treatment on two instruments (hospital specific contact intensity the two week prior to the individual’s initial contact) and SES and demographic controls: Gender (dummy), age at adm., mother’s age at birth, mothers years of schooling, father’ age at birth, father’s years of schooling, mother has prior psych. history (dummy), admitted in own municipality (dummy), greater CPH area (dummy), other metropolitan area (dummy), year dummies and diagnosis controls: Dummies for each F. diagnosis category from ICD-10. 161 Figure IIIA.3: Effect on Admission on Outcomes using Non-Linear Slope in First Stage, 2SLS Estimates ((a)) Admission ((b)) Contact ((c)) Crimes ((d)) Crimes ((e)) Overdose ((f)) Lesion Note: Figures show main results on probability of re-admission, contact, admission for lesions, admission for overdoses, and number of crimes re-estimated with a non-linear function in the first stage of the form: D = γ1 z1 + γ2 z2 + γ3 1[z1 > 0.4] + γ4 1[z2 > 0.4] + γ5 1[z1 > 0.4]z1 + γ6 1[z2 > 0.4]z2 X 0 Π The dashed lines indicate 95% confidence intervals and the dotted line indicate 90% confidence intervals. Time 0 is month of initial contact. Standard errors clustered by hospital and month in parentheses. SES and demographic controls include: Age at adm., mother’s age at birth, mothers years of schooling, father’ age at birth, father’s years of schooling, mother has prior psych. history (dummy), admitted in own municipality (dummy), greater CPH area (dummy), other metropolitan area (dummy), year dummies. 162 B Data Appendix First, this section provides a brief overview of the trends in the number of patients and inpatient capacity at psychiatric hospitals during the period studied. Second, the section describes the data construction in detail, as a supplement to section 3 from the main text. Trends in Psychiatric Hospital Treatments Figure IIIB.1 shows the number of inpatient beds in Danish psychiatric hospitals per 1,000 inhabitants and the number of 20 to 64 year old patients per 1,000 persons in that age group between 1996 and 2011. Figure IIIB.1: Psychiatric beds and admissions, 1996-2011 ϭ ϰ Ϭ͕ϳϱ ϯ Ϭ͕ϱ Ϯ Ϭ͕Ϯϱ ϭ Ϭ Ϭ ϭϵϵϲ ϭϵϵϳ ϭϵϵϴ ϭϵϵϵ ϮϬϬϬ ϮϬϬϭ ϮϬϬϮ ϮϬϬϯ ϮϬϬϰ ϮϬϬϱ ϮϬϬϲ ϮϬϬϳ ϮϬϬϴ ϮϬϬϵ ϮϬϭϬ ϮϬϭϭ EŽ͘ĞĚƐͬϭϬϬϬŝŶŚĂďŝƚĂŶƚƐ ĚŵŝƐƐŝŽŶƐͬϭϬϬϬϮϬͲϲϰLJĞĂƌŽůĚƐ Source: Own calculations on data from Statistics Denmark. 163 The Danish psychiatric system gradually downsized the number of inpatient beds from 1996–2011. The reductions from 1997 to 2001 were relatively small, from 0.80 in 1997 to 0.75 in 2001. From 2002 and onwards, the reductions increased in magnitude. In 2011 there were 0.53 beds per 1,000 inhabitants. The figure also shows that the number of adult inpatients between the age of 20 and 64 increased from 2.36 in 1996 to 3.05 in 2004, only to stagnate at around 2.90 until 2011. The downward trend in the number of psychiatric beds resembles that of many Western countries (WHO, 2011a). The reductions to inpatient capacities relative to the number of treated patients moved treatment from a hospital setting to the patients’ home environments while also reducing the costs of treatment. Construction of Mental Health Data We constructed the mental health data using the Danish national psychiatric register (a subset of the Danish patient register (LPR)) made available by Statistics Denmark. We study the period 1999 to 2001, and only consider patients who had their first contact in this period. This yields 50,439 patients. Next, we only include individuals who had their first contact between the ages of 18 and 45 as we do not wish to focus on child or geriatric psychiatric treatment as admittance or non-admittance of these patients is not likely to influence the outcomes we are interested in. Also, once in contact with a psychiatric care facility, different conditions apply for these two groups relative to the average adult population. The age limitation reduces the sample from 50,439 to 31,248. We discard individuals who are diagnosed with mental retardation, dementia, or disorders of early psychological development because these patient groups suffer from chronic disorders and have little or no labor market attachment. We also discard individuals who are diagnosed with eating disorders or non-organic sexual dysfunctions in order to obtain a more homogeneous sample. Individuals who enter treatment via the criminal justice system are also discarded. Finally, a few countryside treatment facilities only treat 10-50 individuals per year and we discard individuals who contact these facilities in order to not skew our instrument. Limiting the 164 diagnoses used, disregarding criminal justice court ordered psychiatric treatments, and setting a lower threshold for facility size leave us with a final sample of 24,277 individuals. The final sample contains the following ICD-10 diagnosis categories: (a) F10-F19: Mental and behavioral disorders due to psychoactive substance use (b) F20-F29: Schizophrenia, schizotypal and delusional disorders (c) F30-F39: Mood (affective) disorders (d) F40-F48: Neurotic, stress-related and somatoform disorders (e) F60-F69: Disorders of adult personality and behavior (f) F90-F98: Behavioral and emotional disorders with onset usually occurring in childhood and adolescence (g) F99: Unspecified mental disorder We control for each diagnosis category. Only 52 people in the data receives a diagnosis in the range F90-F98, so they are grouped together with F99 diagnoses. Results do not change if we group them apart. We define the outcome of a contact on the basis of a patient’s experiences across an entire day. I.e., if a patient contacts a psychiatric emergency ward and later in the day is admitted to a normal psychiatric ward, that is treated as one incident (contact) leading to an admission. In order not to exclude people who show up late in the day, we use the same procedure for patient who have their first contact at an emergency ward at day one and are admitted to a normal word at day two. We distinguish between admittance to inpatient care, admittance into outpatient care, and no admittance. We define the treatment variable as admittance to inpatient care. 165 Construction of the Instrument We used the Danish National Psychiatric Register to obtain the number of unique individual contacts to each hospital each day for the period 1998-2001 for all ages and types of diagnoses. We included the year 1998 in order to construct the instrument as a contact intensity measure as shown in Equation 1. We then computed the number of weekly contacts separately for the two seven day periods prior to a person’s first contact by aggregating the number across day -1 to -7, and day -8 to -14. This gave us our numerators. In order to obtain the denominator, which measures the seven day period with the highest number of contacts within the last 365 days, we calculated the aggregated number for all successive seven days combination from the day prior to an individual’s contact and going backward 365 days. We then used the highest number of contacts measured across a seven day period as the denominator. The two fractions made up the instruments. Construction of Covariates We constructed covariate data using a number of databases. Unique individual identification numbers allowed us to directly link observations across registers, and also link information on patients with information on their parents and potential spouse. We used the following databases: (a) Danish demographic database (FAIN), from where we obtained age, ethnicity, and municipality of residence for year of first contact. (b) The register based labor force database (RAS), unemployment register (CRAM), and the Danish Rational Economic Agents Model (DREAM) to obtain information on unemployment degree and public dependency degree the year prior to first contact. (c) The database of individual usage of health services (SYIN) to obtain information on 166 admittances to hospital for somatic reasons as well as visits to GP and specialist doctors the year prior to first contact. (d) The educational database (UDDA) to obtain information on parental schooling. We obtain the information for each year since 1981, and use the information recorded closest to the year of first contact. (e) The criminal justice databases on convictions (KRAF) and indictments (KRIN) to obtain information on prior criminality. We use information of whether an individual ever has received a conviction for a non-traffic related offense or felony from 1980 and until the year before first contact. (f) The income database (INDK) to obtain information on gross income reported the year before first contact. Construction of Outcomes We measured all outcomes for the following three years after first contact. Some were measured quarterly, others annually. The following paragraphs provide details on how we constructed the different measures. Contacts and Admissions We followed each patient in the Danish national psychiatric register data for the 36 subsequent months after first contact measured quarterly. Contact was constructed as an absorbing state dummy, indicating if the patient had contact again to a psychiatric hospital during this period. Admission was constructed as a subset of the contact dummy, indicating if the patient ever was admitted as an inpatient following a contact after first contact. Crime Using the criminal justice databases on convictions (KRAF) and indictments (KRIN) we constructed the crime variable as a quarterly count variable that aggregated all criminal 167 convictions received by an individual in the 36 months following first contact. We excluded traffic violations. Hospitalizations for Self-harm We constructed two quarterly outcomes for self-harm: whether an individual was hospitalized due to (a) an overdose; and (b) lesions. We constructed both measures as absorbing state dummies over a 36 month period. We used data from the Danish national patient register (LPR). Labor Market Outcomes We use the register based labor force database (RAS) to obtain annual information on individual’s labor market position for the three years after contact. We define three categories: employed, unemployed, and outside the labor force. Employed entails any form of paid work, either as an employee or as self-employed. Unemployed entails receiving either social assistance or unemployment insurance while being available for the labor market or undertaking workfare. Outside the labor force entails receiving public welfare without work or workfare requirements. We also obtain the same information for patients’ spouses. The information is obtained at the end of November for each year. 168 DEPARTMENT OF ECONOMICS AND BUSINESS AARHUS UNIVERSITY SCHOOL OF BUSINESS AND SOCIAL SCIENCES www.econ.au.dk PhD Theses since 1 July 2011 2011-4 Anders Bredahl Kock: Forecasting and Oracle Efficient Econometrics 2011-5 Christian Bach: The Game of Risk 2011-6 Stefan Holst Bache: Quantile Regression: Three Econometric Studies 2011:12 Bisheng Du: Essays on Advance Demand Information, Prioritization and Real Options in Inventory Management 2011:13 Christian Gormsen Schmidt: Exploring the Barriers to Globalization 2011:16 Dewi Fitriasari: Analyses of Social and Environmental Reporting as a Practice of Accountability to Stakeholders 2011:22 Sanne Hiller: Essays on International Trade and Migration: Firm Behavior, Networks and Barriers to Trade 2012-1 Johannes Tang Kristensen: From Determinants of Low Birthweight to Factor-Based Macroeconomic Forecasting 2012-2 Karina Hjortshøj Kjeldsen: Routing and Scheduling in Liner Shipping 2012-3 Soheil Abginehchi: Essays on Inventory Control in Presence of Multiple Sourcing 2012-4 Zhenjiang Qin: Essays on Heterogeneous Beliefs, Public Information, and Asset Pricing 2012-5 Lasse Frisgaard Gunnersen: Income Redistribution Policies 2012-6 Miriam Wüst: Essays on early investments in child health 2012-7 Yukai Yang: Modelling Nonlinear Vector Economic Time Series 2012-8 Lene Kjærsgaard: Empirical Essays of Active Labor Market Policy on Employment 2012-9 Henrik Nørholm: Structured Retail Products and Return Predictability 2012-10 Signe Frederiksen: Empirical Essays on Placements in Outside Home Care 2012-11 Mateusz P. Dziubinski: Essays on Financial Econometrics and Derivatives Pricing 2012-12 Jens Riis Andersen: Option Games under Incomplete Information 2012-13 Margit Malmmose: The Role of Management Accounting in New Public Management Reforms: Implications in a Socio-Political Health Care Context 2012-14 Laurent Callot: Large Panels and High-dimensional VAR 2012-15 Christian Rix-Nielsen: Strategic Investment 2013-1 Kenneth Lykke Sørensen: Essays on Wage Determination 2013-2 Tue Rauff Lind Christensen: Network Design Problems with Piecewise Linear Cost Functions 2013-3 Dominyka Sakalauskaite: A Challenge for Experts: Auditors, Forensic Specialists and the Detection of Fraud 2013-4 Rune Bysted: Essays on Innovative Work Behavior 2013-5 Mikkel Nørlem Hermansen: Longer Human Lifespan and the Retirement Decision 2013-6 Jannie H.G. Kristoffersen: Empirical Essays on Economics of Education 2013-7 Mark Strøm Kristoffersen: Essays on Economic Policies over the Business Cycle 2013-8 Philipp Meinen: Essays on Firms in International Trade 2013-9 Cédric Gorinas: Essays on Marginalization and Integration of Immigrants and Young Criminals – A Labour Economics Perspective 2013-10 Ina Charlotte Jäkel: Product Quality, Trade Policy, and Voter Preferences: Essays on International Trade 2013-11 Anna Gerstrøm: World Disruption - How Bankers Reconstruct the Financial Crisis: Essays on Interpretation 2013-12 Paola Andrea Barrientos Quiroga: Essays on Development Economics 2013-13 Peter Bodnar: Essays on Warehouse Operations 2013-14 Rune Vammen Lesner: Essays on Determinants of Inequality 2013-15 Peter Arendorf Bache: Firms and International Trade 2013-16 Anders Laugesen: On Complementarities, Heterogeneous Firms, and International Trade 2013-17 Anders Bruun Jonassen: Regression Discontinuity Analyses of the Disincentive Effects of Increasing Social Assistance 2014-1 David Sloth Pedersen: A Journey into the Dark Arts of Quantitative Finance 2014-2 Martin Schultz-Nielsen: Optimal Corporate Investments and Capital Structure 2014-3 Lukas Bach: Routing and Scheduling Problems - Optimization using Exact and Heuristic Methods 2014-4 Tanja Groth: Regulatory impacts in relation to a renewable fuel CHP technology: A financial and socioeconomic analysis 2014-5 Niels Strange Hansen: Forecasting Based on Unobserved Variables 2014-6 Ritwik Banerjee: Economics of Misbehavior 2014-7 Christina Annette Gravert: Giving and Taking – Essays in Experimental Economics 2014-8 Astrid Hanghøj: Papers in purchasing and supply management: A capability-based perspective 2014-9 Nima Nonejad: Essays in Applied Bayesian Particle and Markov Chain Monte Carlo Techniques in Time Series Econometrics 2014-10 Tine L. Mundbjerg Eriksen: Essays on Bullying: an Economist’s Perspective 2014-11 Sashka Dimova: Essays on Job Search Assistance 2014-12 Rasmus Tangsgaard Varneskov: Econometric Analysis of Volatility in Financial Additive Noise Models 2015-1 Anne Floor Brix: Estimation of Continuous Time Models Driven by Lévy Processes 2015-2 Kasper Vinther Olesen: Realizing Conditional Distributions and Coherence Across Financial Asset Classes 2015-3 Manuel Sebastian Lukas: Estimation and Model Specification for Econometric Forecasting 2015-4 Sofie Theilade Nyland Brodersen: Essays on Job Search Assistance and Labor Market Outcomes 2015-5 Jesper Nydam Wulff: Empirical Research in Foreign Market Entry Mode 2015-6 Sanni Nørgaard Breining: The Sibling Relationship Dynamics and Spillovers 2015-7 Marie Herly: Empirical Studies of Earnings Quality 2015-8 Stine Ludvig Bech: The Relationship between Caseworkers and Unemployed Workers 2015-9 Kaleb Girma Abreha: Empirical Essays on Heterogeneous Firms and International Trade 2015-10 Jeanne Andersen: Modelling and Optimisation of Renewable Energy Systems 2015-11 Rasmus Landersø: Essays in the Economics of Crime ISBN: 9788793195196
© Copyright 2025