The British Journal of Psychiatry (2008) 192: 52-58. doi: 10.1192/bjp.bp.106.032532
© 2008 The Royal College of Psychiatrists
OPEN ACCESS ARTICLE
This Article
This is an Open Access article
Right arrow Abstract
Right arrow Full Text (PDF)
Right arrow Submit an eLetter
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Citing Articles
Right arrow Citing Articles via CrossRef
Right arrow Citing Articles via Google Scholar
Right arrow Citing Articles via Scopus
Google Scholar
Right arrow Articles by Williams, J. B. W.
Right arrow Articles by Kobak, K. A.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Williams, J. B. W.
Right arrow Articles by Kobak, K. A.

Development and reliability of a structured interview guide for the Montgomery–Åsberg Depression Rating Scale (SIGMA)

Janet B. W. Williams, DSW

Biometrics Research Unit, Columbia University, New York, New York

Kenneth A. Kobak, PhD

MedAvante, Inc., Madison, Wisconsin, USA

Correspondence: Janet B. W. Williams, MedAvante, Inc., 100 American Metro Blvd., Suite 106, Hamilton, NJ 08619, USA. Email: jwilliams{at}medavante.net

Declaration of interest

None. Funding detailed in Acknowledgements.


arrow
ABSTRACT
 
Background

The Montgomery–Åsberg Depression Rating Scale (MADRS) is often used in clinical trials to select patients and to assess treatment efficacy. The scale was originally published without suggested questions for clinicians to use in gathering the information necessary to rate the items. Structured and semi-structured interview guides have been found to improve reliability with other scales.

Aims

To describe the development and test–retest reliability of a structured interview guide for the MADRS (SIGMA).

Method

A total of 162 test–retest interviews were conducted by 81 rater pairs. Each patient was interviewed twice, once by each rater conducting an independent interview.

Results

The intraclass correlation for total score between raters using the SIGMA was r=0.93, P<0.0001. All ten items had good to excellent interrater reliability.

Conclusions

Use of the SIGMA can result in high reliability of MADRS scores in evaluating patients with depression.


arrow
INTRODUCTION
 
Observer-rated depression rating scales are used in clinical trials of antidepressants to select patients for study, and to assess the efficacy of the treatment being tested. The importance of the quality of ratings in clinical trials has recently been emphasised.1 Accumulating evidence suggests that the quality of ratings can make the difference between a failed trial and one in which drug separates from placebo.2 Therefore, any method that improves the quality of clinical trial ratings may improve our ability to conduct successful antidepressant trials. We describe the development of a structured interview guide for the Montgomery–Åsberg Depression Rating Scale (MADRS) and its test–retest reliability, assessed in a sample of 51 persons with varying levels of depressive symptoms.

Montgomery–Åsberg Depression Rating Scale
The MADRS was developed in the late 1970s from items that were found in several studies to be sensitive to change with anti-depressant treatment.3 Since its publication the scale has become increasingly popular worldwide. Dissatisfaction with the leading alternative, the Hamilton Rating Scale for Depression (HRSD), has further contributed to the popularity of the MADRS.4

The importance of reliability of assessments in a clinical trial cannot be overestimated. Without good interrater agreement the chances of detecting a difference in effect between drug and placebo are significantly reduced. Muller & Szegedi demonstrated that as the reliability of a rating scale decreases from 0.8 to 0.5, the power of the test to detect a significant difference between drug and placebo drops from 71% to 51%, increasing the risk of type II error.5 In general, total scale score reliability of the most commonly used depression rating scales such as the MADRS and HRSD is high, with or without the use of a structured interview guide.3,6 However, as compounds have become targeted to specific symptoms and clinical trials have revealed specific drugs’ effects on clusters of symptoms,7,8 it has become more important to be able to depend on the reliable measurement of an individual symptom or a subgroup of symptoms. Self-report versions of clinician-administered scales have been developed9,10 that show high degrees of correlation with the clinician versions; however, there are limited empirical data on their signal detection relative to the clinician in placebo-controlled trials.

There is minimal information available concerning the interrater reliability of the MADRS. The original article4 reported excellent agreement between rater pairs, but only as ‘conjoint’ reliability, and in only 11 patients. Maier et al reported total score intraclass coefficients (ICCs) of 0.73, 0.66 and 0.82 in three separate samples, using joint interviews in the first sample, and independent interviews in the second and third samples (which were actually the same patients pre- and post-treatment).11 Unfortunately item reliability was not provided, although the authors did report that three of the MADRS items (inner tension, lassitude and suicidal thoughts) had ICCs lower than 0.60 in all three samples. Davidson et al tested the reliability of the MADRS in 44 people receiving in-patient treatment for depression, using an experienced research nurse and a psychiatrist as joint interviewers.12 Overall agreement was ‘acceptable’ and ranged from ‘fair’ to ‘good’ on individual items. More recently, a Japanese version was developed and tested in Japan in joint interviews on a sample of seven patients with DSM–IV major depressive disorder.13 Individual-item ICCs were in the ‘very good’ to ‘excellent’ range; however, the weakness of the testing method (small sample size and repeated assessment of the same patients by the same three raters in joint interviews) compromised the significance of the results. Therefore, there is reason to believe that the interrater reliability of the MADRS in a typical research study could be improved.

The MADRS was originally published without suggested questions for clinicians to use in gathering the information necessary to rate the ten items. However, several studies have found that using a structured or semi-structured interview guide improves reliability on similar rating scales.14,15 Moberg et al compared independent interviews using the standard unstructured HRSD with the Structured Interview Guide for the HRSD (SIGH–D) and concluded that the SIGH–D ‘produced uniformly higher item- and summary scale reliabilities than the unstructured HDRS’.16 Further, in one placebo-controlled antidepressant trial, raters who more closely adhered to a semi-structured interview guide were found to have better signal detection than raters who did not.2 Such an interview guide provides some assurance that raters across clinical trial sites administer the scale in approximately the same way. Structured interview guides also facilitate training in the use of a scale by providing new raters with explicit instructions and specific interview questions that have been derived from expert interviews. Structured interview guides have become fairly standard for diagnostic interviews,16 as well as for many rating scales, including the Hamilton scales for depression (SIGH–D)14 and anxiety (SIGH–A).17,18 In general, they are designed to approximate an expert administration of the scale.

Development of SIGMA probes and conventions
A semi-structured interview guide similar to the SIGH–D was originally developed by J.B.W.W. for the MADRS in 1988 and has undergone several revisions since then, based on user experience and feedback from raters. More recently, K.A.K. joined as co-author in a major overhaul of the interview guide. The Structured Interview Guide for the MADRS (SIGMA) provides structured probes to ensure standardisation of administration and comprehensiveness of coverage of the ten items of the scale. The SIGMA questions were developed to obtain the information needed to assess each of the items’ anchor points (see Appendix). Each item begins with questions in bold type that should be asked exactly as written. Often these questions will elicit enough information about the severity and frequency of a symptom for the item to be rated with confidence. Follow-up questions are provided, however, for use when further exploration or additional clarification of symptoms is necessary. The specified questions should be asked until the rater has enough information to rate the item confidently. Raters are also encouraged to add their own probes as necessary to obtain enough information to rate each item accurately.

In the SIGMA the original MADRS appears on the right-hand side of the page and the structured interview guide questions appear on the left. The interview guide begins with the ‘overview’, which is a brief explanation of the time period to be covered, and initial questions to allow some rapport to develop and to give the interviewer some sense of the context of the interviewee’s current situation. The interview guide then follows, with questions for each of the ten MADRS items.

In the SIGMA the only change that was made to the original MADRS was to reverse the order of administration of the first two items (apparent sadness and reported sadness). There was consensus from users that asking about reported sadness first made for a more logical flow to the interview. Direct probes were added to the apparent sadness item to supplement the rater’s observation (e.g. ‘In the past week, do you think you have looked sad or depressed to other people?) The rationale for these additional probes was that without the aid of an informant who has seen the patient over the past week it is difficult to rate the persistence and depth of this item based on observation during the interview alone. This technique has been used successfully in self-report and telephone-administered versions of the MADRS,10,19 as well as in computerised and paper-and-pencil self-report versions of the HRSD20,21 and the Hamilton Rating Scale for Anxiety.22 Raters are instructed to consider both sources of information (direct observation and self-report) in rating this item.

In the interview guide there is an emphasis on open-ended questions, to encourage respondents to describe their experience in their own words, and to avoid raters’ ‘putting words in the person’s mouth’. Thus, for example, instead of asking the person at the beginning of the interview, ‘Have you been feeling sad or unhappy?’, the enquiry begins, ‘How have you been feeling since last [day of week]?’ Likewise, instead of asking whether the person has had trouble sleeping in the past week, the sleep item begins, ‘How has your sleeping been in the past week?’ Some items are assessed more directly, to improve the efficiency of the interview. For many responses the person is asked to provide examples; for instance, if there is a positive response to the question, ‘Have you had trouble concentrating or collecting your thoughts in the past week?’ the interviewer is instructed to ask, ‘Can you give me some examples?’ Once the person has described the symptom in his or her own words, the interviewer can then decide whether concentration difficulty is truly present, which would be rated in this item.

Once the revised SIGMA was completely drafted, revisions were made based on feedback from a number of users in the field, and the instrument was finalised. This report describes a formal assessment of the test–retest reliability of the 2006 version of the SIGMA, which is presented in full in the Appendix.


arrow
Method
 
To test the interrater reliability of the SIGMA, 162 test–retest interviews (81 dyads) were conducted. Within each dyad each patient was interviewed twice, once by each rater. Interviews were conducted independently from each other, with raters masked to the results of the other rater’s interview. Conducting independent (v. joint) evaluations of the same patient mitigates the artificial inflation of reliability coefficients that occurs when one rater interviews the patient and the second rater simply observes the first rater’s interview. Conducting independent evaluations is thus a more rigorous test of interrater reliability, and addresses the question of whether you would achieve the same result if the person were interviewed by a different rater using the same instrument. Interviews were conducted on the same day, in order to control for changes in participants’ conditions due to time. A brief (5–15 min) distracting task was given between interviews to minimise memory effects.

There is growing interest in the use of remote technologies for delivering assessments in clinical trials.23 Therefore, of the 81 pairs of interviews, 30 pairs were done using two face-to-face interviews, 30 pairs were done using one face-to-face and one videoconference interview, and 21 pairs were done using one face-to-face and one telephone interview. To control for the confounding influence of participant differences on mode of administration, the same 30 people were used in the ‘face-to-face v. face-to-face’ and the ‘video v. face-to-face’ cohorts. To minimise memory effects these cohorts were interviewed on different days (1–3 days apart) and no rater ever saw the same patient twice.

Fifty-one participants (14 men and 37 women; mean age 43 years, s.d.=12.35, range 20–72) with a mood disorder diagnosed according to DSM–IV criteria were included.24 The diagnoses were major depression, n=27; major depression in partial remission, n=15; minor depression, n=2; dysthymia, n=1; bipolar disorder (depressed), n=2; and depression not otherwise specified, n=4. Diagnoses were determined using a modified version of the mood module of the Mini International Neuropsychiatric Interview (MINI)25 and the overview section of the Structured Clinical Interview for DSM–IV (SCID).16 Since previous versions of the SIGMA are already widely used in clinical trials, a range of mood disorders was included in order to evaluate the reliability of the SIGMA across a wide range of symptom severity, including patients in partial recovery. The sample was 82% White, 10% African American, 2% Hispanic and 6% ‘other’. About half (47%) had a college degree. Participants were recruited from the Madison, Wisconsin area in response to advertisements in a local newspaper looking for people who were currently or had recently experienced symptoms of depression. Respondents were screened over the telephone by a research assistant for gross exclusions (current or lifetime schizophrenia, current psychosis or acute suicidal ideation) and were then scheduled for further follow-up screening with a clinician. All participants signed informed consent statements approved by the Allendale institutional review board, and were paid US $50.

The rater cohort consisted of two male and four female interviewers. Five had doctoral degrees (four in psychology and one in social work), and one a master’s degree in counselling psychology. Prior to the study, raters underwent reliability training on the scale, consisting of a didactic review of the scale and scoring conventions, followed by at least three practice interviews observed by a trainer (two group and one individual). Raters also observed each other’s training sessions in order to enhance learning. Raters had a range of prior experience with the MADRS, ranging from extensive (J.B.W.W., K.A.K.) to minimal. All raters’ interviewing skills were evaluated using the Raters Applied Performance Scale (RAPS)26 and all raters scored at least ‘good’ on all dimensions before the study began. Raters were paired using numerous permutations of dyads, to maximise generalisability. Order was counterbalanced, so that raters conducted an equal number of first and second interviews.


arrow
Results
 
The sample represented a wide range of depression severity, with scores on the MADRS ranging from 0 to 38. Overall, the level of depression in the sample was moderately severely (mean MADRS score 20.5, s.d.=10.35). Half the sample had a MADRS score over 22.

The intraclass correlation for total score between raters conducting MADRS interviews using the SIGMA was r=0.93 (P<0.0001, 95% CI 0.89–0 .95). In addition, there was no significant difference between the mean MADRS scores obtained by the first interviewer and the second interviewer: 20.49 (s.d.=10.5) v. 20.65 (s.d.=10.6); mean difference 0.16 points (t(80)=0.36, P=0.72). Internal consistency reliability (coefficient alpha) was also examined. Similar levels of internal consistency were found for interviews administered by the first interviewer (r=0.90) and those done by the second interviewer (r=0.90), z=–0.19, P=0.85; the 95% confidence interval for the difference (d(r)=0.0057) was –0.50 to 0.12. An examination of the interrater reliability (ICC) at the item level is presented in Table 1. As Blacker & Endicott have indicated, ‘it is sometimes said that an [ICC] above 0.8 can be considered excellent, 0.7–0.8 good, 0.5–0.7 fair, and less than 0.5 poor’ (page 9).27 All of the ten MADRS items using the SIGMA had good to excellent interrater reliability, with more than half of them in the excellent range, as was the total score.


View this table:
[in this window]
[in a new window]

 
Table 1 Intraclass correlations between raters on individual items of the Montgomery—Åsberg Depression Rating Scale, by mode of administration

The correlation (ICC) between the SIGMAs administered by videoconference and those administered face-to-face was r=0.95 (P<0.0001, 95% CI 0.89–0.97). The correlation between SIGMAs administered by telephone and those given face-to-face was r=0.90 (P<0.0001, 95% CI 0.78–0.96) and that between the 30 pairs of face-to-face interviews was r=0.93 (P<0.0001, 95% CI 0.87–0.97). There was no statistically significant difference among these correlations: z=0.45, P=0.66, 95% CI for the difference (d(r)=0.0141) –0.09 to 0.98 for two face-to-face v. video and face-to-face; z=0.07, P=0.95, 95% CI for the difference (d(r)=0.0048) –0.59 to 0.729 for two face-to-face v. telephone and face-to-face.

We also took this opportunity to examine the diagnostic accuracy of the MADRS in differentiating major depression from the other diagnoses. Using a cut-off score of 18 on the MADRS, the sensitivity of the scale for the diagnosis of major depression was 87%, its specificity was 61%, the positive predictive value was 74% and the negative predictive value 79%.

Interview length
The mean length of interviews conducted with the SIGMA was 25.8 min (s.d.=10.04, range 5–56). The mean interview length for the interview conducted second was 2.4 min shorter than the mean length of the interview conducted first (26.9 min v. 24.5 min; t(78)=2.38, P=0.020).


arrow
Discussion
 
In our development and testing of a structured interview guide for the MADRS, the most stringent test of interrater reliability was used: independent test–retest interviews. This method best approximates the interrater agreement that would be achieved in many clinical trials in which patients are assessed by a different rater at each visit. A strength of our study is that six different raters participated. This suggests that the positive results were not due to a single pair of raters who work closely together; rather, these results are likely to be generalisable to a pool of similarly trained raters. Although the training was fairly rigorous, it is replicable. All raters in this study were experienced mental health clinicians, although they had varying degrees of prior experience with the MADRS scale. The level of agreement that can be achieved with raters who have less clinical experience is unknown, although it is likely that the structured interview guide facilitates the reliable use of this scale by less-experienced raters because it standardises the questions used to elicit the information necessary to rate the items.

To our knowledge, this is the first assessment of the reliability of the MADRS in which all test–retest interviews were independent, and in which agreement at the item level was reported. In this study agreement on the total MADRS score was in the excellent range, and the reliability of all ten of the items was good to excellent. Our results also support the equivalence of remote administration of the MADRS using the SIGMA, by both telephone and videoconference, to face-to-face interviews. This finding has important implications for the way in which clinical trial assessments are conducted: remote assessments can offer more efficient and flexible administration paradigms than face-to-face assessments.

This study has demonstrated that with the use of the SIGMA, a group of interchangeable raters can achieve high reliability of MADRS total and item scores in a range of patients with depression. The extent to which this improvement in interrater agreement translates into improved signal detection in trials using the MADRS remains to be demonstrated.

Go


View this table:
[in this window]
[in a new window]

 
Appendix Structured Interview Guide for the Montgomery–Åsberg Depression Rating Scale (SIGMA)


arrow
ACKNOWLEDGMENTS
 
The authors would like to acknowledge the contributions of Drs Elizabeth Jeglic, Ian Sharp and Karen Rybechenko, and Donna Salvucci, MEd, for conducting the clinical interviews in the study and for feedback on the items; Drs Alan Feiger and Norm Rosenthal for suggestions on the probes; Dr Kia Crittenden and Kristy Harris, MA, for patient screening; and Don Borque for assistance with study coordination. Support for this study was provided by MedAvante, Inc.


arrow
REFERENCES
 
    1
  1. Kobak KA, Engelhardt N, Williams JB, Lipsitz JD. Rater training in multicenter clinical trials: issues and recommendations. J Clin Psychopharmacol 2004; 24: 113 –17.[CrossRef][Medline]
  2. 2
  3. Kobak KA, Feiger AD, Lipsitz JD. Interview quality and signal detection in clinical trials. Am J Psychiatry 2005; 162: 628 .[Free Full Text]
  4. 3
  5. Montgomery SA, Åsberg M. A new depression scale designed to be sensitive to change. Br J Psychiatry 1979; 134: 382 –9.[Abstract/Free Full Text]
  6. 4
  7. Bagby RM, Ryder AG, Schuller DR, Marshall MB. The Hamilton Depression Rating Scale: has the gold standard become a lead weight? Am J Psychiatry 2004; 161: 2163 –77.[Abstract/Free Full Text]
  8. 5
  9. Muller MJ, Szegedi A. Effects of interrater reliability of psychopathologic assessment on power and sample size calculations in clinical trials. J Clin Psychopharmacol 2002; 22: 318 –25.[CrossRef][Medline]
  10. 6
  11. Hedlund JL, Vieweg BW. The Hamilton Rating Scale for Depression: a comprehensive review. J Oper Psychiatry 1979; 10: 149 –61.
  12. 7
  13. Gibbons RD, Clark DC, Kupfer DJ. Exactly what does the Hamilton Depression Rating Scale measure? J Psychiatr Res 1993; 27: 259 –73.[CrossRef][Medline]
  14. 8
  15. Faries D, Herrera J, Rayamajhi J, DeBrota D, Demitrack M, Potter WZ. The responsiveness of the Hamilton Depression Rating Scale. J Psychiatr Res 2000; 34: 3 –10.[CrossRef][Medline]
  16. 9
  17. Kobak KA, Greist JH, Jefferson JW, Mundt JC, Katzelnick DJ. Computerized assessment of depression and anxiety over the telephone using interactive voice response. MD Comput 1999; 16: 64 –8.[Medline]
  18. 10
  19. Mundt JC, Katzelnick DJ, Kennedy SH, Eisfeld BS, Bouffard BB, Greist JH. Validation of an IVRS version of the MADRS. J Psychiatr Res 2006; 40: 243 –6.[CrossRef][Medline]
  20. 11
  21. Maier W, Philipp M, Heuser I, Schlegel S, Buller R, Wetzel H. Improving depression severity assessment: I. reliability, internal validity and sensitivity to change of three observer depression scales. J Psychiatr Res 1988; 22: 3 –12.[Medline]
  22. 12
  23. Davidson J, Turnbull CD, Strickland R, Miller R, Graves K. The Montgomery–Åsberg Depression Scale: reliability and validity. Acta Psychiatr Scand 1986; 73: 544 –8.[Medline]
  24. 13
  25. Takahashi N, Tomita K, Higuchi T, Inada T. The inter-rater reliability of the Japanese version of the Montgomery–Asberg depression rating scale (MADRS) using a structured interview guide for MADRS (SIGMA). Hum Psychopharmacol 2004; 19: 187 –92.[CrossRef][Medline]
  26. 14
  27. Williams JBW. A structured interview guide for the Hamilton Depression Rating Scale. Arch Gen Psychiatry 1988; 45: 742 –7.[Abstract/Free Full Text]
  28. 15
  29. Moberg PJ, Lazarus LW, Mesholam RI, Bilker W, Chuy IL, Neyman I, Markvart V. Comparison of the standard and structured interview guide for the Hamilton Depression Rating Scale in depressed geriatric inpatients. Am J Geriatr Psychiatry 2001; 9: 35 –40.[CrossRef][Medline]
  30. 16
  31. Spitzer RL, Williams JBW, Gibbon M, First MB. The Structured Clinical Interview for DSM–III–R (SCID). II. Multisite test–retest reliability. Arch Gen Psychiatry 1992; 49: 630 –6.[Abstract/Free Full Text]
  32. 17
  33. Shear MK, Vander Bilt J, Rucci P, Endicott J, Lydiard B, Otto MW, Pollack MH, Chandler L, Williams J, Ali A, Frank DM. Reliability and validity of a structured interview guide for the Hamilton Anxiety Rating Scale (SIGH–A). Depress Anxiety 2001; 13: 166 –78.[CrossRef][Medline]
  34. 18
  35. Williams JBW. Structured Interview Guide for the Hamilton Anxiety Scale (SIGH–A). New York State Psychiatric Institute, 1996.
  36. 19
  37. Hermens ML, Ader HJ, van Hout HP, Terluin B, van Dyck R, de Haan M. Administering the MADRS by telephone or face-to-face: a validity study. Ann Gen Psychiatry 2006; 5: 3.[CrossRef][Medline]
  38. 20
  39. Kobak KA, Reynolds WM, Rosenfeld R, Greist JH. Development and validation of a computer-administered version of the Hamilton Depression Rating Scale. Psychol Assess 1990; 2: 56 –63.[CrossRef]
  40. 21
  41. Reynolds WM, Kobak KA. Reliability and validity of the Hamilton Depression Inventory: a paper-and-pencil version of the Hamilton Depression Rating Scale clinical interview. Psychol Assess 1995; 7: 472 –83.[CrossRef]
  42. 22
  43. Kobak KA, Reynolds WM, Greist JH. Development and validation of a computer-administered version of the Hamilton Anxiety Scale. Psychol Assess 1993; 5: 487 –92.[CrossRef]
  44. 23
  45. Kobak KA, Kane JM, Thase ME, Nierenberg AA. Why do clinical trials fail? The problem of measurement error in clinical trials: time to test new paradigms? J Clin Psychopharmacol 2007; 27: 1 –5.[CrossRef][Medline]
  46. 24
  47. American Psychiatric Association. Diagnostic and Statistical Manual of Mental Disorders (4th edn) (DSM–IV). APA, 1994.
  48. 25
  49. Sheehan DV, Lecrubier Y, Sheehan K, Amorim P, Janavs J, Weiller E, Baker R, Dunbar G. The mini international neuropsychiatric interview (MINI): the development and validation of a structured diagnostic psychiatric interview for DSM–IV and ICD–10. J Clin Psychiatry 1998; 59: 22 –3.[Medline]
  50. 26
  51. Lipsitz J, Kobak K, Feiger A, Sikich D, Moroz G, Engelhard A. The Rater Applied Performance Scale: development and reliability. Psychiatry Res 2004; 127: 147 –55.[CrossRef][Medline]
  52. 27
  53. Blacker D, Endicott J. Psychometric properties: concepts of reliability and validity. In Handbook of Psychiatric Measures (ed. AJ Rush): 7–14. American Psychiatric Association, 2000.
Received for publication October 18, 2006. Revision received February 28, 2007. Accepted for publication March 13, 2007.





This Article
This is an Open Access article
Right arrow Abstract
Right arrow Full Text (PDF)
Right arrow Submit an eLetter
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Citing Articles
Right arrow Citing Articles via CrossRef
Right arrow Citing Articles via Google Scholar
Right arrow Citing Articles via Scopus
Google Scholar
Right arrow Articles by Williams, J. B. W.
Right arrow Articles by Kobak, K. A.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Williams, J. B. W.
Right arrow Articles by Kobak, K. A.