PROTOCOL: Probation Intensity Effects on Probationers' Criminal Conduct
2010; The Campbell Collaboration; Volume: 6; Issue: 1 Linguagem: Inglês
10.1002/cl2.64
ISSN1891-1803
AutoresCharlotte Gill, Jordan M. Hyatt, Lawrence W. Sherman,
Tópico(s)Crime Patterns and Interventions
ResumoProbation is one of the most frequently-used criminal sanctions in the United States (American Correctional Association, 2006). At the end of 2008, nearly 5.1 million adults were on probation alone – 84 per cent of all adults under community supervision. In all, one in forty-five U.S. adults is on probation or parole.1 Although growth slowed slightly in 2008, the population under community supervision has been steadily rising for some time, increasing by more than half a million between 2000 and 2008 (Glaze & Bonczar, 2009). Despite the extent of its use, probation has suffered from image problems, particularly a public perception that it is a 'soft' approach to crime for often serious offenders who are highly likely to recidivate.2 Subsequently, many probation agencies have struggled to access sufficient funding (Petersilia, 1997). This highlights a clear need for probation agencies to identify supervision practices that are effective at reducing recidivism, and at the same time represent an efficient use of scarce resources. Taxman (2002) notes that considerable research has been dedicated to programming and services that are often provided in conjunction with or on referral from probation, such as cognitive-behavioral therapy, drug courts, and skill-building programs (see also MacKenzie, 2006a; 2006b). Yet comparatively little attention has been paid to the impact of probation supervision itself on crime: the number of cases a probation officer handles, the frequency of contact between officer and client, and the nature of the interaction. Supervision is perhaps considered an uninteresting part of the probation process, "in the background of other programming" and therefore "inconsequential to effectiveness" (Taxman, 2002, p. 179). On the contrary, supervision is a crucial aspect of probation not only because it is the bedrock of programming, but also because in a chronically under-funded enterprise it may constitute the only interaction between client and agency. In this regard it may directly impact the client's future criminal behavior. If a probation officer with a caseload of 150 clients has inadequate time to spend with each one, s/he may find it impossible to build an accurate picture of individuals' needs in order to target programming most effectively. Supervision levels vary widely, from weekly or twice-weekly meetings for high-risk or delinquent probationers, to telephone reporting for those near the end of their sentences. In some busy agencies 'supervision' may constitute nothing more than a mail-in contact detail confirmation card (Petersilia & Turner, 1993, p. 285). It is not always clear whether supervision intensity is related to the client's needs or risk, or whether it is simply determined by operational capabilities. Intensive supervision probation (ISP) is one aspect of probation that has received considerable research attention. ISP programs usually consist of small caseloads and enhanced reporting requirements. However, interest in the practice has evolved from a need to find punitive alternatives to imprisonment rather than a general desire to understand more about supervision practices. As a result, there has been very little articulation of the theoretical basis for its hypothesized effectiveness beyond the assumption that 'more is better.' Indeed, Bennett (1988) described ISP as "a practice in search of a theory." Skeem and Manchak (2008) propose that probation supervision may follow one of three broad guiding philosophies: control/surveillance, treatment, or a hybrid of both. ISP programs developed over the last fifty years have fallen into all three of these categories, but the 'classic' model has been a surveillance strategy designed to keep track of serious offenders who would otherwise be incarcerated. As such, ISP appears rooted in traditional theories of formal social control and deterrence. Offenders are offered the opportunity to remain in the community on the understanding that they are being constantly monitored, and the consequence of failure is the loss of liberty. Several qualitative studies have noted that most offenders express a preference for incarceration over intermediate sanctions like ISP (e.g., Crouch, 1993; Petersilia & Deschenes, 1994), which perhaps suggests that ISP is a more unpleasant prospect than prison for adjudicated offenders and could therefore have a strong deterrent effect against future offending. MacKenzie and Brame (2001) suggested an alternative mechanism by which social controls operate through ISP. They proposed that increased supervision intensity could lead to increased involvement in conventional and therapeutic activities, and found some support for that hypothesis through empirical testing. Overall, ISP studies have usually focused on the field testing of programs and avoided any explication of the theoretical foundations of probation supervision. Clear and Hardyman (1990) describe two waves of interest in ISP research: the first in the 1960s, and another in the mid-1980s. More recently, a third wave of research has refined the application of increased supervision intensity, considering its relationship with carefully matched programming and treatment. The earliest set of field studies of what may be characterized as ISP programs focused on the impact of reducing probation officers' caseload sizes, and followed the 'treatment' philosophy. At the time, the rehabilitative ideal prevailed in corrections, and it was believed that smaller caseloads allowed probation officers more time to help their clients (Petersilia & Turner, 1990). However, these initiatives appeared to make little impact on recidivism, and even increased probation failures and technical violations. Clear and Hardyman (1990) suggest that one important reason for the lack of effectiveness of these initiatives was a lack of insight into how probation supervision activity could best serve the treatment goal. Probation officers simply did not know how to use the additional time made available to them. The collapse of the rehabilitative ideal and the subsequent 'nothing works' paradigm of the 1970s, along with a sharp rise in crime, led to an exponential increase in prison growth (and the cost of corrections) that has persisted ever since (e.g., Ruth & Reitz, 2003). The probation population was also growing at a similar pace, and probation officer caseloads were becoming too large to allow them to serve the increasing number of serious and high-need offenders being granted probation or parole (Petersilia & Turner, 1993). By the 1980s there was renewed interest in ISP as part of a battery of 'intermediate sanctions' that sought to alleviate prison overcrowding and save money, while maintaining the appearance of being tough on offenders who would otherwise have been incarcerated. The focus was on surveillance and control of the offender through small caseloads, frequent contacts, increased drug testing, and mandatory employment. The new ISP was rooted in the classical theory of deterrence through swift, certain punishment, effected by close supervision (Petersilia & Turner, 1990). Georgia was the first state in the U.S.A. to implement this new generation of ISP program. Participants had very low recidivism rates, maintained employment, and paid probation fees that helped offset the cost of supervision. The Georgia model was subsequently adopted elsewhere in the United States, with mixed results. The Bureau of Justice Assistance (BJA) responded to the interest in and uncertainty about the Georgia model by funding a large, multi-site randomized controlled trial in the mid-1980s, which was evaluated by the RAND Corporation. Twelve of the fourteen experiments compared ISP to routine supervision, while two compared ISP to incarceration. By and large, the results of the evaluations were disappointing, again showing little impact on new crimes and an increase in technical violations compared to usual practice. Furthermore, a program intended to reduce the strain on the prison system actually resulted in more incarcerations, as increased surveillance and drug testing raised the likelihood of probation failure (Petersilia & Turner, 1993). The inability of ISP to demonstrate potential as a crime prevention program under the scrutiny of a rigorous research design largely killed off interest in the surveillance/control model of probation supervision by the 1990s. ISP was listed in the influential University of Maryland report to the United States Congress, Preventing Crime: What Works, What Doesn't, What's Promising, as a program that did not work (Sherman et al., 1997; MacKenzie, 2006b). However, the 'what works' movement also led to an increased focus on the factors that influence successful programming. Andrews, Bonta, and Hoge (1990) introduced what are now commonly described as the 'principles of effective intervention' (PEI), which posit that programs should be designed to be responsive to offenders' specific risk and need levels (the risk-need-responsivity, or RNR, model: see also Taxman & Thanner, 2006). The risk principle in particular suggests that more intensive supervision and treatment should be targeted at higher-risk offenders, an idea that is strongly supported by empirical research (see Lowenkamp, Latessa, & Holsinger, 2006, for a summary). The PEI suggest that ISP might be more effective if, through increased contact and control, the probation officer were able to establish offenders' risk and need levels and direct them into appropriate treatment. Treatment provision was not a priority of the BJA/RAND-evaluated programs, and few participants received such services (Latessa et al., 1998). However, results from some of the study sites indicated that intensive supervision combined with treatment might have a positive effect on crime, which led the evaluators to call for more research into such interaction effects (Petersilia, Turner, & Deschenes, 1992; Petersilia & Turner, 1993). Several more recent studies also suggest that ISP programs that adhere to the PEI and offer a balance of treatment and surveillance (the 'hybrid' philosophy) show promise in improving offender outcomes (e.g., Latessa et al., 1998; Paparozzi & Gendreau, 2005). A recent meta-analysis of a wide range of correctional interventions also supports the contention that modern treatment-focused ISPs are more effective at reducing recidivism than surveillance-based programs (Aos, Miller, & Drake, 2006). MacKenzie (2006b), in a detailed update to the University of Maryland report, lists intensive supervision with a treatment component as a 'promising' strategy in corrections, which means that further rigorous research is needed but several studies have produced encouraging results. Uncertainty about the effectiveness of ISP indicates a clear need for work to unpack the complex relationships between surveillance and treatment, probation officer and client. Taxman (2008a) notes that efforts are now under way to effect organizational change in probation departments that will allow for greater rapportbuilding between officers and offenders, which is intended to lead to behavioral change. She is currently leading experimental research into "proactive" and "seamless" criminal justice supervision and treatment programs that embody these new directions and have so far shown substantial reductions in recidivism for participants (Taxman, 2008b). A recent randomized controlled trial in Hawaii indicated that intensive probation programs rooted in the classical deterrence tradition may be effective when a consistent, incentive-based structure is implemented. The Hawaii HOPE program combined increased drug testing with swift, certain adjudication and shock incarceration for violations. A novel aspect of the program was the handling of violations. Non-compliant offenders continued their supervision with probation officers trained in therapeutic techniques, and repeat violators were directed to treatment services as well as being punished (Hawken & Kleiman, 2009). Taken together, the research on ISP to date suggests a complex dynamic that goes beyond earlier assertions that the programs do not work. Furthermore, even less is known about the converse of ISP: increasing caseloads and reducing contacts ('low-intensity' supervision). The PEI would suggest that ISP be reserved for the highest-risk offenders, with reduced surveillance and services for those at the lowest end of the risk-need spectrum. There is some speculation that increased caseloads can lead to harmful reductions in supervision, putting society at risk from offenders whose probation officers have too many clients to ensure that each one is not a threat to public safety (e.g., Worrall et al., 2004;3 Lemert, 1993). However, Glaser (1983) speculated that reduced frequency of contact would not adversely affect low-risk or low-need clients. This suggestion is supported empirically, notably by a recent randomized experiment (Barnes et al., 2010; also Johnson, Austin, & Davies, 2003; Wilson, Naro, & Austin, 2007). Additionally, several studies have indicated that more intensive supervision can have unfavorable effects on the recidivism of low-risk offenders (Erwin, 1986; Hanley, 2006; Lowenkamp, Latessa, & Holsinger, 2006). We still have much to learn about probation and parole supervision, and the circumstances under which its use is effective in reducing crime. We will undertake a comprehensive review and synthesis of the most rigorous research available on the effects of probation supervision intensity on recidivism. The focus of the review is programs that include among their primary features a change in the ratio of probationers to probation officers (caseload size), frequency of contact between officers and clients, or other 'frontline' supervisory behavior, such as drug testing. The effects of these changes are tested against a counterfactual of 'supervision as usual' – offenders who remained part of standard probation caseloads. The primary outcome measure is recidivism, as measured by arrests, charges, or convictions. We also examine the impact of probation intensity on technical violations. As we have seen, there is conflicting evidence about the effectiveness of increasing the intensity of probation supervision. It may depend on the specific philosophies and components of the programs and how they interact with supervision levels. The risk and need levels, and other characteristics, of offenders who participated in ISP research studies may also impact the relative effectiveness of the programs. We will systematically code the characteristics of each program and sample to examine which, if any, of these characteristics moderate the overall effect of the change in intensity. Eligible studies will test the effect of a change in intensity of probation supervision on subsequent crime. A change in intensity could be brought about by increasing or decreasing the ratio of clients to probation officers (changing caseload size); increasing or decreasing the frequency of contact between clients and their officers; or increasing or decreasing the frequency of other forms of supervisory control effected by probation officers, such as drug testing.4 Studies in which the primary purpose of the research design is to estimate the impact of these specific measures on recidivism and/or technical violations will be considered. Most studies have tested increases in intensity rather than decreases, but changes in both directions are eligible for inclusion in the review. We impose a number of restrictions on program type in order to preserve comparability between what we already know will be a highly diverse set of studies. Some programs have examined the provision of supervision as part of a 'team' approach; for example, multi-agency collaboration between probation officers, police officers, and treatment providers. Evaluations of these programs will be eligible as long as the probation officer is the primary supervisor. This limitation will allow us to maintain a degree of equivalence between treatment providers and settings, and between treatment and control group conditions. We will also restrict our analysis to the study of adjudicated offenders sentenced to probation or granted parole. Probation services may also be provided at the pretrial stage, or as part of diversion strategies for first-time juvenile arrestees or 'predelinquent' adolescents. We hypothesize that there may be substantial differences in the offending propensities of participants in these programs compared to adjudicated offenders, particularly because offenders at the pretrial stage are not guaranteed to receive any conviction or sentence. There is also no straightforward comparison condition to pretrial probation in the same way that 'supervision as usual' simply involves more or less of the same intervention. We attempt to maximize internal validity in our selection of studies by limiting our sample to randomized controlled trials (RCTs) and highly rigorous quasi-experiments involving subject-level matching and, pre- and post-program measures of offending behavior. We justify these strict inclusion criteria on the basis of a priori knowledge of a large body of the highest-quality research on ISP. The BJA/RAND studies alone were the largest randomized experiment in corrections undertaken in the United States at the time (Petersilia & Turner, 1993, p. 292). Thus, we expect to find sufficient numbers of experimental and quasi-experimental studies meeting our other eligibility criteria to permit a meta-analysis to be conducted. The control condition must be regular probation or parole supervision ('supervision as usual'). This may vary widely between studies in terms of number and type of contacts, caseload size, and so on, as long as the control group participants are exposed to the regular practices of the probation agency. The specific components of the control group will be coded. In some evaluations, ISP programs based on the 'Georgia model' were compared to the agency's existing intensive supervision program, rather than 'routine' probation (e.g., Ventura County, California: Petersilia & Turner, 1990). We will consider these studies for inclusion as long as there are differences between the existing and experimental ISPs that meet the requirements set out in the previous section. Evaluations in which ISP is compared to incarceration or a different program (e.g., a boot camp) are excluded. The aim of this review is to investigate the impact of changing probation/parole supervision intensity, so our baseline for assessing such change must be probation/parole supervision of a different intensity than that received by the treatment group. We will include both juvenile and adult probationers in the review. Since probation agencies supervise a broad range of offenders, most studies will include mixed caseloads of male and female offenders with different risk and need levels and varying offending histories. However, we expect that most participants will be the moderate to high-risk male offenders usually targeted in high-intensity probation programs. Some experimental ISPs are directed at specific offending problems (e.g., focusing on drug-involved offenders), while others accept a range of offender types. Many probation and parole agencies do not have different policies for the supervision of probationers as compared to parolees, so studies may include mixed caseloads. Specific details about all these variations will be coded. Eligible studies will measure recidivism in terms of new arrests and/or convictions. Technical violations of probation, such as absconding or failing a drug test, will also be included as a separate outcome measure. While technical violations do not inevitably result in a recorded arrest or charge for a new offense, they represent a failure to comply with probation conditions that could be affected by the intensity of supervision. Outcome data will most likely be drawn from official records, but we will also include self-reported data if available. The use of technical violations as an outcome measure comes with the caveat that increased supervision intensity could increase the likelihood of a violation being detected through increased surveillance, rather than simply a failure to comply. This caveat applies to new criminal cases too, but to a lesser extent. New crimes are more likely to be detected by the police than by probation officers, so future arrests are less likely to be affected by the offender's probation status. This also makes arrest a preferable outcome measure to charges or convictions that come further along the criminal justice process and may be more affected by disclosure of prior sentences. Of course, police officers in smaller beat areas probably know the repeat offenders too and will adjust their discretion to arrest accordingly. All recidivism measures suffer from inherent limitations. Studies will not be excluded on the basis of language or geography. Studies from the late 1950s, when the earliest wave of research on intensive probation began, to the present day will be considered for inclusion. We will use several strategies to conduct a comprehensive search for literature on probation intensity. The primary literature search will involve keyword searches of online abstract databases and the websites of research organizations and government agencies (see below). Specialist search engines like Google Scholar will also provide a rich source of 'grey literature.' We will also consulte lists of references from existing reviews of probation supervision and intensity, and of randomized trials in general (Petersilia & Turner, 1993; Phipps et al., 1999; Taxman, 2002; Weisburd, Sherman, & Petrosino, 1990), and library book and microfilm collections. Online searches will be supplemented with hand searches of key journals in the field.5 Every effort will made to locate unpublished material where possible. Eligibility of studies will be assessed by reading titles and abstracts, and obtaining the full text of documents that appeared to be relevant. The following search strings of key words will be used to search the databases and websites, adapted as necessary to meet the requirements of the different search engines. The search terms are deliberately broad (they do not include limiting terms such as 'evaluation,' 'experiment,' 'trial') so that relevant background literature may also be systematically obtained through the searches. '*' indicates where terms are truncated to find all possible variants of the word: probation* AND supervis* AND case* AND (intens* OR frequen* OR ratio) AND (recidiv* OR *arrest* OR *convict*) The 'classic' model of ISP was tested in the BJA/RAND experiments from the 1980s (Petersilia & Turner, 1993), which serve as a convenient illustration of a typical study design. The BJA/RAND studies were a fourteen-site randomized controlled trial of largely surveillance/control-oriented ISP programs. Two of the study sites compared ISP to incarceration (so were not eligible for inclusion in this review), while the remaining twelve contrasted ISP with supervision as usual (SAU) or existing intensive supervision models. Enhancements of both probation and parole supervision were tested. The exact nature of the program depended on the study site – each jurisdiction selected components of the Georgia ISP model for inclusion as it saw fit. Key common features of all the evaluations included smaller caseloads of around 25–30 offenders per officer (usually compared to 100 or more in SAU), increased frequency of contact (usually at least once a week at first, gradually decreasing in phases), drug testing, and mandated employment. Participants in the ISP evaluations had to be adults. Their risk levels varied, but they were generally more serious offenders. Petersilia and Turner (1993) state: "People placed on enhancement ISPs [as opposed to prison diversion or early release] are generally deemed too serious to be supervised on routine caseloads" (p. 292). However, persons convicted of homicide, robbery, or sex crimes were excluded as a matter of policy from the experiment. Participants were primarily males in their late twenties to early thirties, with extensive criminal records. A substantial proportion of participants were drug dependent. The study sites set their own eligibility criteria for participants beyond these initial requirements. Participants were randomly assigned to treatment and control conditions by RAND researchers. The study sites implemented the randomization sequence. Data collection occurred in several waves. A baseline assessment of demographic characteristics and criminal history was conducted shortly after assignment. Supervision details and services received were recorded at six and twelve months; and recidivism (proportion with new technical violations, arrests, convictions, and incarcerations) was recorded at twelve months. Data on drug testing were collected monthly. Cost data and calendars for assessing time at risk were also collected. Each site obtained its own data, and procedures were checked for validity by RAND staff. Recidivism data came from official records rather than self-reports. Many ISP studies report data on multiple outcome measures, which cannot be considered independent treatment effects for the purposes of quantitative meta-analysis because they are taken from the same sample of participants. In this analysis we will not attempt to pool outcome measures. As described above, the different outcome measures can be affected in different ways by the offenders' probation status. We will initially take the more conservative approach of handling different types of outcome measure separately. However, we will combine arrests and convictions in some analyses. In these cases, arrest outcomes will take precedence over convictions so that multiple outcomes from the same study are not used. We prioritize arrest because a successful conviction is dependent on many external factors and may not represent the most accurate picture of the offender's actual behavior. We will analyze technical violations separately because of the strong likelihood that they will be related to the treatment condition due to the increased surveillance inherent in ISP programs. In the event that samples or outcomes are broken down by subgroups (e.g., new arrests are reported for the full sample and then broken out into drug, property, and violent crime arrests), we will use the data for the full sample or outcome only. Where enough studies provide results broken down by the same types of subgroups, we will analyze those outcomes separately. A related threat to the independence of findings is the measurement of follow-up outcomes for the same sample at multiple time periods. In such cases, the longest follow-up period is preferred. However, sample sizes may decrease significantly over time as cases are lost to follow-up. In these cases we will select the follow-up period with the closest number of cases to the original sample size to minimize bias from attrition. Where multiple reports are based on the same dataset or sample, we will count the sample as one study. The report containing the longest follow-up period and/or the most detail is considered the primary study, and other reports will be used to supplement the data from the primary study where necessary. We will ensure that each re-analyses of the same datasets are not inadvertently included with primary evaluation data from the same research project. The coding protocol developed for this study is reproduced in an appendix to this protocol. It is designed to capture the hierarchical nature of evaluation data: a single study may report separate effect sizes for multiple outcome constructs for multiple samples in multiple treatment-comparison contrasts or study sites ('modules'). We will record a range of methodological details about each study to assist in decision-making about eligibility and study quality. A host of items capturing information about program, setting, and participant characteristics will serve as both determinants of eligibility and potential moderator variables. We do not expect all these factors to influence outcomes and will not test them all to avoid the problem of finding statistically significant results merely by chance. However, we also aim to be as inclusive as possible so that potentially relevant information is not missed. The main items of interest relate to the four research questions set out above. The first and second authors will double-code a subset of studies and separately code those remaining to ensure that both coders share the same understanding of the coding protocol. Qualitative research studies are not included in the systematic review results, but relevant qualitative data are used to inform the background, framing, and analysis of our questions. The broad definition of our search terms allows qualitative studies to be systematically identified in the literature searches. Our search strategy covers a range of databases that will enable us to identify unpublished literature (e.g., dissertations and technical reports) as well as published works. We will compare results from published and unpublished studies to estimate reporting bias, and if data are sufficient we will statistically test for publication bias using the funnel plot and trim-and-fill methods (Duval & Tweedie, 2000). Meta-analytic procedures will be used to quantitatively combine effect size data from the eligible studies where appropriate (i.e., where two or more studies are available that measure a common outcome, such as arrests, and contained sufficient information to calculate an effect size). Effect sizes for each outcome measure in the studies will be encoded according to the procedures outlined in
Referência(s)