Which of the following is (are) among the ways that epidemiologists can contribute to health policy?

With regular news updates about the current COVID-19 pandemic, you may have noticed the use of different scientific terms, ‘epidemiology’ being one of them.

Epidemiology is the study of diseases in populations, investigating how, when and why they occur. The diseases studied are wide-ranging, including infectious diseases like coronavirus and non-infectious diseases like arthritis.

People who work in this field are referred to as epidemiologists. When a disease occurs in a population, epidemiologists help us to understand where the disease is coming from, and who it is most likely to impact. The information gathered can then be used to control the spread of the disease and prevent future outbreaks.

Professor Will Dixon, Director of the Centre for Epidemiology Versus Arthritis:

“As a rheumatologist, some of the commonest questions my patients ask in clinic are ‘Why did I get this?’ ‘Now that I have it, what does the future hold for me?’ and ‘What can I do to control the disease?’.

The discipline of epidemiology allows us to answer all these questions. It’s interesting detective work, requiring us to gather all of the relevant information, to piece it together, and to analyse it thoughtfully in order to come to the right conclusions. And it’s rewarding, given that it can answer questions of real importance for people living with arthritis and other diseases.”

What can epidemiology tell us about arthritis?

Epidemiology can help to explain the possible causes and, progression of musculoskeletal conditions and help to direct treatment.

Our Centre for Epidemiology Versus Arthritis is based at the University of Manchester and has a particular focus on inflammatory arthritis (such as rheumatoid arthritis and psoriatic arthritis) and osteoarthritis.

The work they do includes:

  • Understanding how common musculoskeletal conditions are, what influences their occurrence, how they progress in the short and long-term, and what is the impact of them on people’s lives.
  • Evaluating the effectiveness and safety of treatments for musculoskeletal conditions, including biologic drugs such as anti-TNF, pain relieving drugs and non-pharmacological treatments.
  • Bringing together large, digital data sets to study arthritis in the population.
  • Developing new methods and using existing methods to make sense of epidemiology data.
  • Ensuring research influences policy and practice by partnering with others.

Examples of our epidemiological funded research and what we’ve learnt about arthritis

NOAR is the largest community-based study in the world to look at the onset, development and long-term impact of inflammatory arthritis, with over 4,000 participants.

Over the last 30 years, NOAR has helped to rewrite textbooks and inform healthcare professionals on the management and treatment of rheumatoid arthritis.

The study has helped us to understand how widespread inflammatory arthritis is, factors that might be related to the onset of arthritis (e.g. smoking), people’s responses to treatment, and the long-term impact of the condition.

Going forward, NOAR aims to explore the genetic basis of inflammatory arthritis, the risk of cardiovascular disease, and investigating the link between cognition (how our mind processes information, for example, thinking, problem-solving and planning) and rheumatoid arthritis.

Cloudy with a Chance of Pain

This study used a smartphone app to examine the relationship between local weather and daily pain in people living with long-term pain conditions like arthritis.

Around three-quarters of people living with arthritis believe their pain is affected by the weather, but no study had been able to prove the link before this.

Over a 15-month period, 2658 people recorded their daily pain in the app, which was combined with local weather data. Researchers found that days with higher humidity, lower pressure, and stronger winds are more likely associated with high pain days.

Temperature and rain, however, were not associated with pain. These results could allow scientists to better understand pain mechanisms and support better pain treatments and management in the future.

BRAGGSS (Biologics in Rheumatoid Arthritis Genetics and Genomics Study Syndicate)

Based at our Centre for Genetics and Genomics, BRAGGSS is the largest worldwide study of rheumatoid arthritis patients undergoing treatment with biological therapies for the first time.

Using blood samples and questionnaires, the study aims to evaluate the role of multiple factors in a patient’s response to treatment, including genetic variation, psychological status and environmental factors.

A recent publication from the BRAGGSS study found rheumatoid arthritis patients taking biologics for the first time still face significant limitations regarding their ability to work.

Over the 12-month study, approximately 11% of patients left work, 16% reported sick-leave, and those with higher rates of disability predicted poorer work outcomes.

Fatigue, psychological distress and job type were also connected to different work outcomes.

Furthermore, good treatment response was associated with people being absent from work or not.

Musculoskeletal Health and Work in the General Population

Our Centre for Musculoskeletal Health and Work has a dedicated theme of research which seeks to understand the prevalence and impact of musculoskeletal disorders (MSDs) on work and populations.

These studies include The Health and Employment After Fifty (HEAF) Study, which assesses the impact of musculoskeletal diseases on ability to work, employment status and job retention in older ages, exploring the factors, risks, and impacts of this relationship.

COVID-19 and rheumatic diseases

At the moment, little is known about how patients with rheumatic diseases or autoimmune conditions are affected by COVID-19.

To address this and inform care we are supporting two global COVID-19 initiatives, organised by the COVID-19 Global Rheumatology Alliance which was formed in response to the coronavirus pandemic.

Natalie Carter, Head of Research Engagement "These initiatives will help clinicians to collect and share information about how COVID-19 is affecting people with inflammatory arthritis and other rheumatological conditions.

We hope the information gathered will help to inform the way doctors to advise and care for their patients and evaluate the risk of COVID-19 infection on patients who may be on immunosuppressant treatment."

Read more about our research

We’ll keep you posted on progress and updates, find out more about our research.

More chapters in Epidemiology for the uninitiated

Epidemiology is the study of how often diseases occur in different groups of people and why. Epidemiological information is used to plan and evaluate strategies to prevent illness and as a guide to the management of patients in whom disease has already developed.

Like the clinical findings and pathology, the epidemiology of a disease is an integral part of its basic description. The subject has its special techniques of data collection and interpretation, and its necessary jargon for technical terms. This short book aims to provide an ABC of the epidemiological approach, its terminology, and its methods. Our only assumption will be that readers already believe that epidemiological questions are worth answering. This introduction will indicate some of the distinctive characteristics of the epidemiological approach.

All findings must relate to a defined population
A key feature of epidemiology is the measurement of disease outcomes in relation to a population at risk. The population at risk is the group of people, healthy or sick, who would be counted as cases if they had the disease being studied. For example, if a general practitioner were measuring how often patients consult him about deafness, the population at risk would comprise those people on his list (and perhaps also of his partners) who might see him about a hearing problem if they had one. Patients who, though still on the list, had moved to another area would not consult that doctor. They would therefore not belong to the population at risk.

The importance of considering the population at risk is illustrated by two examples. In a study of accidents to patients in hospital it was noted that the largest number occurred among the elderly, and from this the authors concluded that “patients aged 60 and over are more prone to accidents.” Another study, based on a survey of hang gliding accidents, recommended that flying should be banned between 11 am and 3 pm, because this was the time when 73% of the accidents occurred. Each of these studies based conclusions on the same logical error, namely, the floating numerator: the number of cases was not related to the appropriate “at risk” population. Had this been done, the conclusions might have been different. Differing numbers of accidents to patients and to hang gliders must reflect, at least in part, differing numbers at risk. Epidemiological conclusions (on risk) cannot be drawn from purely clinical data (on the number of sick people seen).

Implicit in any epidemiological investigation is the notion of a target populationabout which conclusions are to be drawn. Occasionally measurements can be made on the full target population. In a study to evaluate the effectiveness of dust control measures in British coal mines, information was available on all incident (new) cases of coal workers’ pneumoconiosis throughout the country.

More often observations can only be made on a study sample, which is selected in some way from the target population. For example, a gastroenterologist wishing to draw general inferences about long term prognosis in patients with Crohn’s disease might extrapolate from the experience of cases encountered in his own clinical practice. The confidence that can be placed in conclusions drawn from samples depends in part on sample size. Small samples can be unrepresentative just by chance, and the scope for chance errors can be quantified statistically. More problematic are the errors that arise from the method by which the sample is chosen. A gastroenterologist who has a special interest in Crohn’s disease may be referred patients whose cases are unusual or difficult, the clinical course and complications of which are atypical of the disease more generally. Such systematic errors cannot usually be measured, and assessment therefore becomes a matter for subjective judgement.

Systematic sampling errors can be avoided by use of a random selection process in which each member of the target population has a known (non-zero) probability of being included in the study sample. However, this requires an enumeration or censusof all members of the target population, which may not be feasible.

Often the selection of a study sample is partially random. Within the target population an accessible subset, the study population, is defined. The study sample is then chosen at random from the study population. Thus the people examined are at two removes from the group with which the study is ultimately concerned:

Target population – study population – study sample

This approach is appropriate where a suitable study population can be identified but is larger than the investigation requires. For example, in a survey of back pain and its possible causes, the target population was all potential back pain sufferers. The study population was defined as all people aged 20-59 from eight communities, and a sample of subjects was then randomly selected for investigation from within this study population. With this design, inference from the study sample to the study population is free from systematic sampling error, but further extrapolation to the target population remains a matter of judgement.

The definition of a study population begins with some characteristic which all its members have in common. This may be geographical(“all UK residents in 1985” or “all residents in a specified health district”); occupational(“all employees of a factory,” “children attending a certain primary school”, “all welders in England and Wales”); based on special care(“patients on a GP’s list”, “residents in an old people’s home”); or diagnostic (“all people in Southampton who first had an epileptic fit during 1990-91”). Within this broad definition appropriate restrictions may be specified – for example in age range or sex.

Oriented to groups rather than individuals
Clinical observations determine decisions about individuals. Epidemiological observations may also guide decisions about individuals, but they relate primarily to groups of people. This fundamental difference in the purpose of measurements implies different demands on the quality of data. An inquiry into the validity of death certificates as an indicator of the frequency of oesophageal cancer produced the results shown in Table 1.1

Inaccuracy was alarming at the level of individual patients. Nevertheless, the false positive results balanced the false negatives so the clinicians’ total (53 + 21 = 74 cases) was about the same as the pathologists’ total (53 + 22 = 75 cases). Hence, in this instance, mortality statistics in the population seemed to be about right, despite the unreliability of individual death certificates. Conversely, it may not be too serious clinically if Dr. X systematically records blood pressure 10 mm Hg higher than his colleagues, because his management policy is (one hopes) adjusted accordingly. But choosing Dr. X as an observer in a population study of the frequency of hypertension would be unfortunate.

Table 1.1 Cause of death diagnosed clinically compared with at necropsy
Diagnosis of oesophageal cancerNo.
Diagnosed by clinician and confirmed by pathologist53
Diagnosed by clinician and not confirmed by pathologist21
First diagnosed post mortem22

Conclusions are based on comparisons
Clues to aetiology come from comparing disease rates in groups with differing levels of exposure – for example, the incidence of congenital defects before and after a rubella epidemic or the rate of mesothelioma in people with or without exposure to asbestos. Clues will be missed, or false clues created, if comparisons are biased by unequal ascertainment of cases or exposure levels. Of course, if everyone is equally exposed there will not be any clues – epidemiology thrives on heterogeneity. If everyone smoked 20 cigarettes a day the link with lung cancer would have been undetectable. Lung cancer might then have been considered a “genetic disease”, because its distribution depended on susceptibility to the effects of smoking.

Identifying high risk and priority groups also rests on unbiased comparison of rates. The Decennial Occupational Supplement of the Registrar General of England and Wales(1970-2) suggested major differences between occupations in the proportion of men surviving to age 65:

Table 1.2 Men surviving to 65, by occupation
Farmers (self employed)82%
Professionals77%
Skilled manual workers69%
Labourers63%
Armed forces42%

These differences look important and challenging. However, one must consider how far the comparison may have been distorted either by inaccurate ascertainment of the deaths or the populations at risk or by selective influences on recruitment or retirement (especially important in the case of the armed forces).

Another task of epidemiology is monitoring or surveillance of time trends to show which diseases are increasing or decreasing in incidence and which are changing in their distribution. This information is needed to identify emerging problems and also to assess the effectiveness of measures to control old problems. Unfortunately, standards of diagnosis and data recording may change, and conclusions from time trends call for particular wariness.

The data from which epidemiology seeks to draw conclusions are nearly always collected by more than one person, often from different countries. Rigorous standardisation and quality control of investigative methods are essential in epidemiology; and if an apparent difference in disease rates has emerged, the first question is always “Might the comparison be biased?”