|
|
||||||||
Regular Article |
Received November 27, 2000; revised March 17, 2001; accepted March 20, 2001. From the Department of Psychiatry, University of Pennsylvania, Room 650, 3535 Market Street, Philadelphia, PA 19104. Address correspondence to Dr. Crits-Christoph.
| ABSTRACTS |
|---|
|
|
|---|
Key Words: Cognitive Therapy Psychotherapy, Psychodynamic Early Sessions
| INTRODUCTION |
|---|
|
|
|---|
An alternative to predicting outcome from pre-treatment characteristics is to examine early response as a predictor. If likely treatment failures can be identified early in the treatment process, alternative treatment strategies can be pursued. For example, an individual case could be reviewed so that techniques could be altered or increased clinical supervision added. If a patient has a disorder for which there are efficacious medications, early identification of likely therapy failures could lead to a decision to add medication to the psychotherapy treatment or to switch to medication. The identification of early failures during the treatment of major depressive disorder would be especially important given the risk for suicide that accompanies depression and the potential legal implications of not considering alternative or enhanced treatments when psychotherapy is failing. Moreover, identification of early failures has important cost implications.
Identification of potential failures in manual-guided psychotherapy, in particular, has both clinical and research implications. Knowledge of the limits of these approaches (i.e., when they are likely to fail) would be highly useful in conveying a more realistic sense of the application of manual-guided treatments in clinical practice. Although therapists may be aware of published treatment response rates for manual-based psychotherapies, therapists probably begin, and continue, therapy with most new patients by taking the view that success will occur with the implementation of an empirically supported treatment, rather than by taking a probabilistic view of success. A method indicating, for example, that at session 4 the current patient has an 80% chance of failure is more likely to raise consideration of alternative treatments.
In a research context, the early identification of likely nonresponders could be used to design trials that would allow investigators to evaluate the efficacy of alternative treatments for nonresponders without having to wait until a full course of treatment is administered. This approach would be cost-effective for research as well as reduce the amount of time that symptomatic patients have to endure an ongoing ineffective treatment before an alternative is considered.
Two studies have examined the relationship of early treatment response to final outcome for cognitive therapy (CT) of depression. Fennell and Teasdale3 report that 8 of 17 patients treated with CT decreased 50% or more in Beck Depression Inventory4 (BDI) scores from baseline to session 4. All 8 of these patients received a BDI score less than 10 at the termination of treatment. Beckham5 describes the results of 23 patients in CT for depression who were classified as early responders or nonresponders at session 6 based on greater than or less than 50% improvement from baseline to session 6 on the BDI. The average BDI score for early responders at termination was 6.9, whereas nonresponders had an average BDI at termination of 27.2. Although these studies suggest a strong relationship between early improvement and final treatment response in CT for depression, both studies had small sample sizes. In addition, whether or not this relationship applies to other manual-based psychotherapies, other disorders besides depression, and equally to patients in longer-term and short-term treatment has not been investigated. A number of additional articles have recently addressed the issue of rapid early improvement in manual-based cognitive therapy for major depressive disorder, but these articles address the issue of average improvement levels, not individual differences in early response to treatment, as an indicator of eventual treatment response.69
No studies have examined whether early nonresponders to manual-guided psychotherapy might respond to an alternative treatment. However, a recent small study found that nonresponders (assessed at termination) to manual-based cognitive-behavioral and interpersonal psychotherapy for bulimia nervosa subsequently responded more to fluoxetine than to placebo.10 Early nonresponse has been investigated in psychopharmacological studies, in which week 2 nonresponse to fluoxetine treatment of major depression was found to be associated with a 0.45 probability of nonresponse at week 8.11
The examination of conditional probabilities of final response or remission given early response or nonresponse, as reported in the study of fluoxetine,11 has particular clinical appeal. Such conditional probabilities can be further enhanced by employing receiver-operator characteristic (ROC) curve analyses12,13 that examine sensitivity (probability of a true positive response) and specificity (probability of a true nonresponse) at varying levels of the predictor variable. Most applications of ROC analysis in medicine have been confined to the context of evaluating how well a screening test predicts the presence/absence of a disorder. Our purpose in using this tool here is to supplement significant statistical findings with clinically useful guidelines for treatment decisions. These additional analyses help translate dimensionless quantities, such as effect size estimates or P-values, into a more interpretable form, such as the likelihood of eventual remission of symptoms based on early treatment response or nonresponse. A similar approach has been previously applied to the pharmacological treatment of obsessive-compulsive disorder,14 although no other studies have applied ROC analysis to psychotherapeutic treatment studies to address the relationship of early response to final remission.
The purpose of the present study was to examine early change in symptoms as a predictor of subsequent remission status during manual-guided cognitive and psychodynamic treatment of patients with elevated levels of anxiety and depressive symptoms. Specifically, we combined data from eight studies to ask the following question: to what extent does improvement in depressive and anxiety symptoms from baseline to weeks 2, 3, and 4 predict clinical remission at 4 months? To this end, we applied logistic regression to predict remission status from baseline and weekly symptom measures. The obtained logistic regression equation was then cross-validated in a second sample of patients. Derived predicted remission scores from the logistic regression analyses were employed as a "test," and sensitivity and specificity values were examined using ROC analyses. Potential clinical guidelines for decision-making are provided, based on cutoff scores obtained empirically from the ROC analyses. In examining this question of early change as a predictor of remission, we focused our attention on two instruments, the Beck Depression Inventory and the Beck Anxiety Inventory,15 that are easy to administer on a session basis and that assess symptoms relevant to the bulk of patients presenting with anxiety, affective, and personality disorders. In addition, a priori definitions of clinical remission (i.e., the relative absence of symptoms) exist for these instruments.
We also examined, on an exploratory basis, the extent to which successful prediction of 4-month symptom remission varied as a function of type of treatment (cognitive or dynamic therapy), diagnosis, and eventual length of treatment (16 weeks or 1 year). We hypothesized: 1) that the relationship between early response and outcome seen for cognitive therapy in previous studies3,5 would be stronger than that for less-structured psychodynamic therapy, in which the important insights that drive change may occur at any point in treatment; 2) that patients with chronic disorders (e.g., Axis II disorders) would demonstrate less connection between early symptom change and 16-week remission compared with those with transient (Axis I) disorders; and 3) that the connection between early response and 16-week remission was hypothesized to be less robust in long-term treatment that does not press as strongly for change early in the treatment process compared with brief therapy.
| METHODS |
|---|
|
|
|---|
The second sample was used to cross-validate the predictive equations obtained in the first sample. The CT group from the National Institute of Mental Health Treatment of Depression Collaborative Research Program (TDCRP)19 was used as this cross-validation sample. (Although the TDCRP study involved other treatments as well, only the CT group received weekly assessments of depressive symptoms.)
Patients
In the first sample, each patient initially participated in a 20- to 30-minute telephone screening to determine likely eligibility for one of the 8 open trials. Patients who did not meet any of the medical or psychiatric exclusion criteria were scheduled for an initial evaluation. Patients were diagnosed by using the Structured Clinical Interview for the DSM-III-R20 and the Structured Clinical Interview for the DSM-III-R Personality Disorders.21
In order to limit the analyses to patients with clinically meaningful initial anxiety and depression symptoms, the current study focused on two subgroups of the total number of patients enrolled in the 8 trials. The first subgroup consisted of only the 105 patients who received a score of 16 or greater on the BDI (indicating at least moderate symptoms of depression) at baseline and attended at least 10 treatment sessions (18 patients having dropped out prior to session 10). The second subgroup consisted of those patients (n=79) who received a score of 16 or greater on the BAI (indicating at least moderate symptoms of anxiety) at intake and attended at least 10 treatment sessions (18 also having dropped out prior to session 10). For analyses of BDI and BAI at specific treatment sessions, the samples are slightly lower than the respective full samples of 105 and 79 because of missing data.
A full description of the patients in the TDCRP (the validation sample for the current study) is given elsewhere.19
Treatments
For the derivation sample, patients with diagnoses of chronic depression or generalized anxiety received 16 weekly sessions followed by 3 monthly booster sessions of either SE psychotherapy or CT. The SE psychotherapy was based on a general SE treatment manual22 in conjunction with diagnostically specific manuals for the treatment of generalized anxiety disorder23 and of depression.24 The CT therapy was conducted according to manuals for depression25 and anxiety.26
The patients with avoidant or obsessive-compulsive personality disorders received 52 weekly sessions of SE or CT. SE again was based on a general SE manual, as well as unpublished manuals addressing issues specific to patients with personality disorders. CT was conducted according to a personality disorder manual.27
Patients in the cross-validation sample (TDCRP) received 20 sessions of manual-based CT25 within 16 weeks.
Therapists and Supervision
For the derivation sample, there were 18 therapists who delivered SE therapy (5 with M.D.; 11 Ph.D., Psy.D., or Ed.D.; 2 M.A.) and 16 therapists (primarily graduates of a postdoctoral clinical fellowship in cognitive therapy) who delivered CT (15 with Ph.D., Psy.D., or Ed.D.; 1 M.A.). The CT and SE supervisors were experienced psychotherapists with extensive experience in training therapists in manual-based therapies. All therapists received 1 hour of individual supervision for every 2 hours of therapy provided.
Description of therapists, training, and supervision for the TDCRP (cross-validation sample) are provided elsewhere.19
Measures
All patients in the derivation sample received an extensive diagnostic battery as well as a self-report battery at treatment intake, at 4 and 8 months for the longer-term therapy patients, at treatment termination, and at treatment follow-up. In addition, selected self-report measures were completed at the beginning of each treatment session. The present study focused only on the month 4 assessments in order to equate for time across the short-term (16 weeks) and long-term (1 year) psychotherapy studies. However, in order to maximize sample size and generalize to a larger population, patients who dropped out of treatment (BDI: n=8 for CT, n=6 for SE; BAI: n=3 for CT, n=3 for SE) between sessions 10 and 16 were included in the sample, with their last session assessment determining their remission status.
The BDI consists of 21 items designed to assess the common symptoms of depression. The BDI was completed by all patients in the derivation sample at intake (baseline score), at the beginning of each session, and at the 4-month evaluation. Remission status on the BDI (minimal to no symptoms of depression) was defined as a 4-month score less than 10.28
The BAI is a 21-item self-report inventory designed to assess the common features of anxiety. The BAI was completed by all patients in the derivation sample at intake, at the beginning of each session, and at the 4-month evaluation. As on the BDI, remission status was defined on the BAI as a 4-month score less than 10.29
In the cross-validation sample, the BDI was administered at intake, at each treatment session, and at termination (16 weeks). Remission was defined in the same way as in the derivation sample. As in the derivation sample, only patients who had BDI scores for session 10 or thereafter were included (n=46 out of 59 patients who began treatment).
Statistical Analyses
Using the derivation sample, logistic regression analyses were conducted to examine the relationship between early improvement and 16-week remission status. Each logistic regression analysis had two predictors: the baseline BAI or BDI score and one of the weekly (session 2, 3, or 4) BAI or BDI scores. Separate logistic regressions were performed for the BAI and BDI scores at weeks 2, 3, and 4 (i.e., a total of 6 logistic regression analyses). When baseline scores are included in the logistic regression, the statistical significance of the logistic regression parameter estimate for the week 2, 3, or 4 BAI or BDI measure reflects the effect of early change, corrected for initial level, on 16-week remission status.
If the logistic regression equation is to be implemented in a clinical context, two additional statistics are useful. The first is a cutoff score for the index of early improvement that would inform a clinician as to whether a given patient was likely to be a remitter or nonremitter. The second is the probability of a patient's being a remitter/nonremitter if he or she scores above or below the cutoff. To derive these statistics, the actual baseline and specified weekly BAI or BDI scores for all patients are entered into the logistic regression equation to yield "predicted remission scores." These predicted remission scores are the particular combination of baseline and specified weekly BAI or BDI scores that best predicts 16-week remission status. Conceptualizing this index of early improvement as a predictive "test" of remission status, we applied ROC analysis. Within the context of the present study, ROC curves provide plots of estimates of the conditional probability of final remission given early improvement (sensitivity) or relative nonimprovement (1 minus specificity) for each cutoff in the range of early treatment improvement. Thus, the curve provides a convenient visual display of the predictive outcomes as well as the tradeoffs between sensitivity and specificity for a given cutoff. In addition, the ROC curve has an appealing analytical property: the area under the ROC curve (AUC) also provides an indication of classification accuracy. An AUC of 0.50 indicates chance levels of prediction of the outcome criteria, and a larger AUC indicates better classification or decision rules.
In addition to evaluating the AUC, we determined an optimal cutoff for the predictive measure by simultaneously maximizing both sensitivity and specificity.30 Sensitivity and specificity values were also calculated by using a cutoff score that increased the accuracy of identification of nonresponders (i.e., sensitivity of approximately 0.80). In addition to sensitivity and specificity, the predictive value of a positive test (PVP) and predictive value of a negative test (PVN) were calculated. The former is the percentage of subjects with a positive test (i.e., "test," as defined above, indicates likely remission) that actually do remit, while the latter is the percentage of subjects with a negative test (i.e., likely not to remit) who actually do not remit.
The logistic regression equation coefficients (and constant) calculated on the original sample were applied to data from the cross-validation sample. The two most useful clinical statistics for identifying nonremitters, specificity and PVN, were then also calculated for the cross-validation sample by using the optimal cutoff from the derivation sample that achieved a specificity of 0.80.
| RESULTS |
|---|
|
|
|---|
The mean age of the derivation sample was 37 years (SD=10, range 19 to 65). Fifty-four percent of patients were females. Eleven percent of the sample were African American, 2% were Asian, 2% were Hispanic, and 81% were white. Forty-seven percent of the sample were married or cohabiting, 37% were single, separated, or widowed, and 14% were divorced. Seventy-eight percent of patients had at least some college education. At treatment intake, the mean BDI score of those with initial BDIs of 16 or greater was 25.2 (SD=7.3). The mean intake BAI score for the subsample with initial BAIs of 16 or greater was 26.9 (SD=8.2). Sixty-one patients had both an intake BDI and BAI of 16 or greater, and therefore there was considerable overlap between the two samples.
Regarding outcome, 44.8% of patients met criteria for remission on the BDI at 4 months, and 60.8% met remission criteria on the BAI.
Logistic Regressions
Table 1 presents the results of the logistic regression analyses from the derivation sample. The results indicated that change on the BDI from baseline to each of weeks 2, 3, and 4 significantly predicted remission at week 16 (all P<0.005). Similarly, change on the BAI from baseline to each of weeks 2, 3, and 4 significantly predicted remission at week 16 (all P<0.005).
|
|
|
|
2=5.2, df=1, P=0.02), but not for the BAI (
2=0.8, df=1, P=0.38). Separate logistic regression analyses and ROC analyses for those patients with and without an MDD diagnosis revealed very high prediction (AUC=0.90) for patients without a diagnosis of MDD and good prediction (AUC =0.72) for patients with a diagnosis of MDD.
Cross-Validation of Predictive Equation
Individual patients' BDI scores at baseline and session 3 in the cross-validation sample (TDCRP) were input into the logistic regression equation that was determined on the full derivation sample using baseline and session 3 as predictors. This equation was: [predicted remission score=1.9+(0.03xbaseline BDI)-(0.16x session 3 BDI)]. Using the optimal cutoff of 0.0007 obtained in the derivation sample for a specificity of 0.80, a specificity of 0.71 and a PVN of 0.86 was obtained in the cross-validation sample. A logistic regression calculated within the TDCRP sample resulted in a very similar equation: [1.5+(0.03xbaseline BDI)-(0.13xsession 3 BDI)].
Implementation of Predictive Model
To implement the predictive model clinically, an individual patient's "predicted remission" score is calculated by using that patient's baseline and weekly scores on the BDI or BAI. For example, using the above derivation sample equation for baseline and session 3 BDI scores and the cutoff score for 0.80 specificity, a patient with a baseline BDI of 25 and a week 3 BDI of 18 would have a predicted remission score of 0.23 [1.9+(0.03x25)-(0.16x18)]. Since the score for this patient is below the cutoff of 0.0007, the patient is predicted to not remit by week 16. The probability of not remitting is given by the predicted value negative (PVN) statistic, which is 0.76 (see PVN for week 3 BDI in Table 3).
| DISCUSSION |
|---|
|
|
|---|
The mechanism for the strong relationship between early improvement and final remission status is not clear. This relationship is not necessarily linked to a specific psychotherapy "technique" effect. Studies of the relationship of early response to pharmacological treatment outcome with major depression have suggested that the effect is more related to the nonspecific treatment factors than to a specific drug effect.32,33 Thus, it may be largely "placebo" responders who evidence marked improvement in the first few treatment sessions and then continue to improve to the point of remission by week 16. Another possibility is that early improvement reflects the natural course of the disorder, rather than anything to do with treatment per se. Disorders that have a more transient nature might show rapid early improvement (continuing through to termination), while chronic disorders might not. Arguing against this explanation is the fact that the natural course of untreated GAD, MDD, AVPD, and OCPD (the primary disorders in the sample) is not likely to be rapid improvement within two weeks. Moreover, the predictions did not vary by presence/absence of Axis II disorders, and Axis II disorders are by definition chronic.
The clinical implications of these results relate to the potential for modifying treatment for those patients who early on are highly predicted to be nonresponders. If a patient is likely to be a nonresponder, alternative treatments (other psychosocial treatments or medications) can be considered, either instead of the current treatment or as an augmentation strategy (combined treatment). If a clinician has data that suggest with a high degree of probability that a given patient will not respond to the current treatment, it may be worth considering other options that may yield greater benefit. However, no empirical research exists in regard to the efficacy/effectiveness of alternative treatments for nonresponders to psychotherapy. It may be that such nonresponders will fail in all treatments, and clinicians may be reluctant to disrupt the current therapeutic relationship without hope that a different treatment will be better. Augmentation strategies may be most practical here, allowing the current therapeutic relationship to continue while adding another treatment (e.g., medication) that has some possibility of improving response.
Several limitations of these analyses should be mentioned. Although the sensitivity and specificity values obtained in this study were generally good, there was still room for improvement, and some clinicians may prefer values at or above 90% before relying on a statistical model to guide clinical decision-making. However, the predictive accuracy of a "test" needs to be evaluated relative to the reproducibility of the "gold standard" outcome. Within depressed samples, the testretest reliability of the BDI has ranged from 0.6534 to 0.86.35 Short-term testretest of the BAI has been reported to be 0.75.15 Considering the reliability constraints of these instruments, there is only minimal room for improving the sensitivity and specificity values of the predictive equation. However, to the extent that prediction success can be increased, future research could include additional (e.g., pre-treatment) predictors of outcome such as those identified by Howard et al.1 in order to enhance the identification of responders and nonresponders. Additional process variables, such as the therapeutic alliance, could also be considered as candidates for evaluating treatment progress in order to predict final remission.36 However, it is likely that there is a limit on the predictive power of pre-treatment, early process, and early improvement measures in predicting final outcome. Recent research, for example, has shown that "sudden gains" occur throughout the course of CT for depression.37 The existence of such sudden gains beyond the early phase of treatment would tend to decrease the relationship of early improvement to final remission status.
Another limitation is that the analyses focused solely on symptom-based outcome measures. In making the decision about whether to change or augment treatment, it may be important to consider other types of outcomes (i.e., functioning and quality of life). However, functioning and quality of life indices do not change as markedly within a short duration of psychotherapy,38,39 and most managed care agencies allow for only a relatively short duration of treatment (e.g., 10 to 20 sessions). Furthermore, some managed care agencies define the need for psychotherapy in terms of "medical necessity," which translates into a focus on diagnosis or symptoms related to diagnosis.
A further limitation is that the number of weeks until remission was not fully controlled in regard to the assessment of remission status. A small number of patients in the sample had their endpoint (remission) evaluation between 10 and 16 weeks. Kraemer13 has explained that misleading ROC results can be found if prevalence rates vary over time. In the current context, if remission rates varied significantly between sessions 10 and 16, this confound of time would be a problem. However, within the derivation sample most of the change in BDI (85%) and BAI (86%) had occurred by session 10, so it appears this issue would have little effect on the results.
Although derivation sample sizes were reasonably large (n=105 for BDI analyses; n=79 for BAI analyses), the study may have had limited statistical power for the detection of subgroup differences (interactions). Power would be more of an issue, however, if the overall results were weak or negative, masking underlying subgroup differences. Our results showed strong prediction for the sample as a whole. When subgroup differences were detected (i.e., for presence/absence of MDD), they revealed very high prediction for one subgroup versus moderately high for the other. However, it is possible that for diagnoses not included or infrequent within the sample, a different pattern of prediction would emerge. Furthermore, there may be other patient characteristics besides diagnosis that influence the degree of relationship between early response and final remission of symptoms.
The current results may not generalize to other forms of manual or nonmanual-based psychotherapies. Further research is necessary to examine the generalizability of these findings to other forms of psychotherapy, outcome assessments, and patient populations.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
Early Response to Psychotherapy Predicts 4-Month Outcome Journal Watch Psychiatry, August 7, 2001; 2001(807): 1 - 1. [Full Text] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ALL ISSUES | SEARCH | TABLE OF CONTENTS |