Incidence

In medical contexts, incidence is most often used in its epidemiologic sense, ie, the number of new cases of a disease occurring over a defined period among persons at risk for that disease. When thus used, incidence may be expressed as a percentage (new cases divided by number of persons at risk during the period) or as a rate (number of new cases divided by number of person-years at risk).

Reporting several incidence values in the same sentence can nearly always be accomplished using the singular form (eg, “the incidence of nonfatal myocardial infarction during follow-up was 10% at 6 months, 19% at 12 months, and 26% at 18 months” or “the incidence of clinical stroke decreased significantly, from 7.6 to 5.3 per 1000 person-years in men and from 6.2 to 5.1 per 1000 person-years in women). However, in rare instances, sentence construction may necessitate the use of the plural, which of course is… what, exactly? The understandable urge to simply add an “s” at the end of the word to form the plural results in incidences — a form not found in most dictionaries and a clunker of a word if ever there was one. Writers wishing for a more mellifluous plural sometimes use incidence rates, a valid term but one perhaps best reserved for reporting incidence values expressed as actual rates rather than simple percentages. Moreover, incidences is sometimes used when reporting values either as percentages or as rates, in the latter case missing a valuable opportunity to emphasize that rates rather than percentages are being reported.

Thus, it is perhaps best to use incidences, awkward as it may be, when reporting multiple incidence values as percentages and incidence rates when reporting such values as rates, eg, “at first follow-up, the incidences of falls resulting from frailty, neuromuscular disorders, or improper use of mobility devices were 15% (95% CI, 10%-20%), 12% (95% CI, 7%-17%), and 12% (5%-19%), respectively” or “the incidence rates for falls resulting from frailty, neuromuscular disorders, or improper use of mobility devices were 5.1, 6.3, and 4.6 per person-year, respectively.” Incidentally, these 2 examples report occurrences (falls) rather than diseases or conditions, and so represent 2 instances reporting the incidence of incidents.

To further muddy the waters, incidence is sometimes confused with prevalence, defined as the proportion of persons with a disease at any given time (ie, total number of cases divided by total population). Thus, whereas incidence describes how commonly cases are diagnosed, prevalence describes how widespread the disease already is; on a more personal level, incidence describes one’s risk of developing the disease, whereas prevalence describes the likelihood that one already has it. The confusion between the terms is perhaps attributable to the occasional use of prevalence in place of incidence in the study of rare, chronic diseases for which few newly diagnosed cases are available; however, this circumstance is unusual, and incidence and prevalence should always be distinguished from one another and used appropriately. (See also §20.9, Glossary of Statistical Terms, in the AMA Manual of Style, p 872 in print.)

Whereas prevalence is often used in general contexts to indicate predominance or general acceptance, the circumstances calling for the use of incidence in general contexts are quite few and become fewer still when one takes into account that incidence is often used when incidents (the simple plural of incident) or instance (again denoting an occurrence) would be the better choice. Perhaps incidents or instances was intended but never made it to the page — as is so often the case with homophones and near-homophones, even the careful writer who usually would not confuse incidence, incidents, and instance might one day look back over a hastily typed passage only to see that a wayward incidence has crept in; if the passage is hastily edited to boot, the error might well go unnoticed until the passage is in print and a discerning reader takes pains to point it out in a letter or e-mail. The plural form, incidences, has virtually no use outside of the epidemiologic discussed above, although it has been used to subtly disorienting effect by translators rendering the Kafkaesque works of Russian writer Daniil Kharms (1904-1942) into English, most notably when rendering the 1-word title of Incidences, Kharms’ 1934 collection of absurdist critiques on life in the Soviet Union under Stalin. However, writers who are not political dissidents aiming for absurdist effect — presumably all medical writers — would do well to proofread carefully and often. — Phil Sefton, ELS

Questions From Users of the Manual

Q: If a product name appears in all caps in a company’s product literature (with or without a trademark symbol or registered symbol), must the editor retain the all caps in a journal article? Companies use caps for graphic impact or emphasis, but caps can be distracting and can make the text difficult to read. Would it be acceptable to substitute only an initial cap for an all-cap product name, particularly if the product is the main subject of the manuscript and occurs frequently?

A: Our journals do not require use of the trademark symbol (™) or the registered symbol (®) as the use of the initial cap frequently used on proprietary names indicates the proprietary nature of the name (see 5.6.16, Legal and Ethical Considerations, Intellectual Property:  Ownership Access, Rights, and Management, Trademark). There are exceptions to the use of the initial cap (eg, pHisoHex; see section 10.8, “Intercapped” Compounds) and in these cases, as in all others, we advise using the name according to the presentation of the legal trademark. To avoid a plethora of caps—which certainly can be distracting—we would suggest varying the way in which the product is referred to (eg, “this product,” “it”) as long as the meaning remains clear.

Q: Your manual indicates that references should be numbered consecutively with arabic numerals in the order in which they are cited in the text. But what about the distinction between references cited in a range and references cited individually? If an author cites references 1 through 5, does this count as only the citation of reference 1, as the first number in the range, or does it count as citation of all 5 references included in the range?

A: It matters not if the references are cited as part of a range or cited individually. Even if a reference is cited as part of a range, when any one of those references is cited later, it retains the same reference number.  This is not specifically stated in the Manual and, perhaps wrongly, we assumed that it would be understood. Thank you for allowing us to clarify this point.

Q: Convention seems to be to use the leading zero in P values, but why is this necessary since P cannot be greater than 1?

A: JAMA and the Archives Journals do not use a zero to the left of the decimal point, since statistically it is not possible to prove or disprove the null hypothesis completely when only a sample of the population is tested (P cannot equal 1 or 0, except by rounding). If convention dictates otherwise, we are unconventional!

Q: I have been unable to find specific rules on the use of nonbreaking hyphens and spaces. Do you have any suggestions for the correct and preferred use of nonbreaking hyphens and spaces?

A: You are right. We do not have any section devoted to this. However, there is information about line breaks scattered throughout the Manual. For example:

  • On page 29 (section 1.20.4), there is information on how to break an e-mail address. The same guidelines apply to breaking URLs.
  • On page 646 (section 15.6.4), there is information on breaking long karyotypes.
  • On page 910 (section 21.5), there is information on breaking long formulas.

There may be other instances like this scattered throughout the Manual where specific guidance is needed. However, individual publishers or clients may have their own preferences that require attention when editing material for their publications.

Q: I am working on a manuscript in which one of the authors has listed the degree MAS (Master of Advanced Studies). This abbreviation is not included in the Manual. Is it acceptable?

A:  This is a perfectly acceptable abbreviation. We simply did not have space to list all possible degrees and their abbreviations in the Manual and attempted to list some of the more common ones.—Cheryl Iverson, MA

Statistical Rounding and the (Mis)Leading Zero

Sometimes editors (not you or I, of course) obey the rules of their institution’s preferred style manual without fully understanding, or really thinking about, why some of these rules exist. For example, some editors (not you or I, of course) automatically delete (or, if they’re lucky, their editing program deletes for them) the leading zero in a few statistics, but not all. They know exactly when and where to delete the leading zero, but not why. Or they round some statistics, but not all, assuming that all of this has something to do with saving space. It does, of course,1(p830) but this isn’t the only reason we do it.

The AMA Manual of Style defines a P value as “The probability of obtaining the observed data (or data that are more extreme) if the null hypothesis were exactly true.”1(p888) Per AMA style, P values greater than .01 are expressed to a maximum of 2 decimal places and those less than .01 are expressed to a maximum of 3 decimal places. I set out in search of the complicated statistical reason why we use this specific number of decimal places and found that, in addition to saving space, we do it for one simple reason: it’s all we need. Yep, that’s it. It’s all we need to know. P < .00000001 doesn’t tell us any more of value than P < .001. Both tell us that the probability is very low, and that’s good enough. Of course, if the author protests or rounding will make P appear nonsignificant, an exception is made (for example, if P = .046 and significance is set at P = .05).1(pp851-852) Also, studies such as genome-wide association studies report P values of P < .00001 or smaller, often in scientific notation, to address the issue of multiple comparisons; it is essential not to round these. So every rule has exceptions, I guess (remember Spanish class, anyone?).

Why then, you ask, do we not save ourselves the confusion and simply round P < .001 to P = .00? There’s a reason for that, too, and it’s the same reason we don’t use leading zeros with certain probability statistics (ah, you say, it all comes together). If probability is the chance that a given event will occur,2 and we have only surveyed a sample of a given population, probability cannot equal 1.0 or 0 because we can’t say absolutely that a null hypothesis will definitely or definitely not happen in that population.1(p889) And if P can’t equal 1.0 or 0, why include a zero that doesn’t tell us anything new? For this reason, we use P > .99 and P < .001 as the highest and lowest P values. For the same reason, and because they are used often, the leading zero rule applies to α and β probabilities as well. Why? To save space, of course.–Roya Khatiblou, MA

1. Iverson C, Christiansen S, Flanagin A, et al. AMA Manual of Style: A Guide for Authors and Editors. 10th ed. New York, NY: Oxford University Press; 2007.

2. Merriam-Webster’s Collegiate Dictionary. 10th ed. Springfield, MA: Merriam-Webster Inc; 1997.

Quiz Bowl: Statistical Terms

What’s the difference between an α level and a β level? Do you know your y-axis from your x-axis from your z-axis? What term means the spread or dispersion of data? This month’s quiz, which subscribers can find at http://www.amamanualofstyle.com/, can help you learn the answers to these and other questions on statistical terms. On the basis of your understanding of section 20.9 of the AMA Manual of Style, select the correct answer from the choices listed in the following sample quiz question.

Which of the following terms means the correlation coefficient for bivariate analysis?
r
R
r2
R2

So, how did you do? Here’s the answer (use your mouse to highlight the blank line):

Which of the following terms means the correlation coefficient for bivariate analysis?
r

R is the correlation coefficient for multivariate analysis. r2 is the coefficient of determination for bivariate analysis. R2 is the coefficient to determination for multivariate analysis.

If you want to further test your knowledge of statistical terms, subscribe to the AMA Manual of Style online and take the full quiz. Stay tuned next month for another edition of Quiz Bowl.—Laura King, MA, ELS