Methodology and definitions
Published 5 December 2024
Applies to England
Summary of data sources and analytical datasets used in this report
This report uses the following analytical datasets to produce the chapter outputs of the Tuberculosis Annual Report, England, 2024 (data up until 31 December 2023):
- National Tuberculosis Surveillance (NTBS) is used in:
- TB incidence and epidemiology, England 2023
- TB prevention, England, 2023; section on contact tracing of close contacts of people notified with active TB in NTBS
- TB diagnosis and microbiology, England, 2023
- TB treatment and outcomes, England, 2023
- TB in children, England, 2023
- Pre-entry TB Screening dataset is used in:
- TB prevention, England, 2023; section on pre-entry TB screening of active pulmonary TB in people applying for long-term visas to the UK
- National NHS England (NHSE) Latent TB Infection Testing dataset (NHSE LTBI) is used in:
-
TB prevention, England, 2023; section on LTBI testing of new-entrant migrants to the UK with linkage to:
- (i) GP registration data for people assigned a type 4 flag to identify the population eligible for NHSE LTBI testing
- (ii) linkage to NTBS dataset for follow-up of people who subsequently develop active TB disease
-
TB prevention, England, 2023; section on LTBI testing of new-entrant migrants to the UK with linkage to:
NTBS dataset
TB notifications
People who are diagnosed with TB in England, Wales and Northern Ireland must be notified through NTBS. This report only includes data for individuals with TB who are resident in England or are treated in England (including individuals who are homeless or visiting from abroad).
Only individuals with disease caused by Mycobacterium tuberculosis complex (MTBC) are reported. Individuals were denotified and removed from the data set if the infective agent was identified as non-MTBC or M. bovis Bacillus Calmette-Guerin (BCG) subspecies.
Data production
In 2021, NTBS was launched and replaced 2 historical surveillance systems:
- the Enhanced Tuberculosis Surveillance system (ETS)
- the London TB Register (LTBR)
Datasets from 2018 onwards were extracted from ETS and LTBR and were merged with NTBS following a series of data migrations between July and December 2021. Data reported here were obtained from the merged data sets (NTBS, ETS, LTBR) and the final extract was on 19 June 2024.
Data cleaning to improve data quality
Denotifications
People with BCGosis, on chemoprophylaxis for latent TB infection or with a non-tuberculous mycobacterial infection who were notified in error were identified using comments fields, and denotified. People with culture confirmed TB who had been denotified were queried with clinics, and lab contaminations were removed, or people were renotified if they were found to have been denotified in error.
In addition, a probabilistic matching process was carried out for notifications between January 2021 and December 2023 to identify people with more than one notification within a 12-month period. Identified duplicates were denotified with any missing information transferred from the duplicate to the original notification.
Geography
The postcode field (used to map postcodes to geographic areas) was cleaned by identifying invalid postcodes based on matching to the May 2023 Postcode Directory from ONS. Where cleaning was necessary, the correct postcode was identified using the address fields.
For people who were homeless or who had a residence outside the UK, but were notified in England, the postcode of the clinic or hospital at which they were treated was assigned to the notification. For people with no postcode or treatment clinic or hospital, the local authority and UK Health Security Agency (UKHSA) centre were updated using the local authority field recorded based on the area that the notifying case manager was located in.
UKHSA region was derived from UKHSA region of residence based on the individual’s residential postcode. If missing, UKHSA region in which treatment occurred (most recently, as care may have been transferred) was used, for example if a person had no fixed abode.
Cleaned postcodes were assigned boundary layers and merged with boundaries for clinical commissioning groups, integrated care boards, upper-tier local authorities (UTLAs) and local authorities sourced from the Central Lookups Database within the UKHSA Data Lake which is managed by the Public Health Data Science (PHDS) team. These are available in the UKHSA layers of the map software (GIS).
Site of disease
The site of disease was reclassified to pulmonary if a positive sputum smear (microscopy) sample was recorded or if a positive culture was grown from a pulmonary laboratory specimen. People with laryngeal TB were included in pulmonary breakdowns, and people with miliary TB were included in both pulmonary and extra-pulmonary breakdowns. Site of disease for people with culture confirmation was reclassified based on the site in the body from which the specimen was taken. Site of disease classifications were also updated using the free text field for site of disease.
Social risk factors including prison and asylum status
The presence or absence of the social risk factors (current or a history of drug misuse, current alcohol misuse, current or history of homelessness, current or history of prison, current mental health needs and current asylum status; including if remanded in an immigration detention centre) were updated from missing or unknown if relevant information was found in the free text comments fields within NTBS.
Homelessness was updated to ‘yes’ if mentioned in the comments fields or if the address given was ‘no fixed abode’ or a shelter or hostel for homeless people was named.
Prison (current or in the past) was updated to ‘yes’ if mentioned in the comment’s fields, if HMP or a prison name was recorded as the address or if the residential postcode corresponded with a prison. Up until 2020, data on incident TB individuals reported to the Public Health in Prisons (PHiP) log were used to further identify people who had been imprisoned, but this was not conducted in 2022 or 2023.
The immigration detainee variable was updated if the address given at notification, comments fields or occupation field showed the person to be an immigration detainee.
The asylum seeker variable (newly introduced in NTBS) was updated as asylum seeker if recorded in the occupation field sub-category under ‘no occupation’. For analysis, asylum seeker was then recoded as ‘yes’ if either asylum seeker variable. The asylum seeker variable was further updated so that all UK-born individuals with a missing value for this variable was updated to ‘no’.
Demographic characteristics
Sex is reported as male or female. Where missing from the raw data, it was derived from the name of the individual where names were unambiguous.
Age and age groups were derived from the date of notification and date of birth. Notification demographics were used for tracing against the Personal Demographics Service (PDS). The Demographics Batch Service (DBS) enables a user to submit a file of patient demographics for tracing against the Personal Demographics Service (PDS), providing back the NHS Number and most up to date demographics where an exact match is found. Those with conflicting values for age or inconsistent mortality information were cross-referenced against the matched PDS data to resolve and checked with case managers.
UK and non-UK born status occurs in the raw data. It was amended if missing and the country of birth indicated non-UK birth.
Entry to the UK is entered as year only by NTBS users. Time since entry is derived as year of notification minus entry year.
HIV co-infection
HIV test result data is not collected within user-entered NTBS. Instead, HIV test results are obtained from the UKHSA HIV and AIDS Reporting Section (HARS). The data includes people aged 15 or over who have been diagnosed with HIV in England up to 31 December 2023. HIV data was extracted on 28 May 2024. The TB dataset used for matching was extracted from NTBS on 31 May 2024. The data included people aged 15 or over with a TB notification between 2000 and 2023 (up to 31 December 2023). A combination of probabilistic matching followed by deterministic matching was carried out using country of birth, surname soundex, initial, date of birth, gender, ethnicity, postcode, lower super output area (LSOA), hospital, TB flag within HIV dataset and date of death. Probabilistic matching was undertaken by the TB unit using SQL on a secure SQL cluster, with code saved in a UKHSA github repository. Deterministic matching was carried out by HARS to provide the final matched dataset.
Pre-entry screening dataset
This dataset comprises person-level data of the results of screening for active pulmonary TB in people when applying for long-term (more than 6 months) visas to visit the UK.
Data sources, collection and dataset production
The pre-entry screening dataset comprises data collected from International Organisation for Migration (IOM) and non-IOM clinics. IOM data was collected by IOM panel physicians, entered via a secure web-based IOM system and collated by the central office in Manila. This data was then securely transferred to UKHSA. Data from non-IOM providers was collected by the clinics, collated via the Home Office United Kingdom Visas and Immigration (UKVI) unit and securely transferred to UKHSA.
Data from IOM clinics was updated via their web-based portal prior to submission to UKHSA.
Data from the period up until 31 December 2023, as received by 7 July 2024 was used in this report. Number of screening episodes were reported for the period of January 2014 to December 2023. The number of people with confirmed TB and confirmed TB case detection rates were reported for the period January 2018 to December 2023 only due to poor data quality in the period of January 2014-December 2017 to determine people with confirmed TB.
Data production and cleaning
Data for IOM and non-IOM data was analysed separately due to the different methods through which this data is recorded and differences in data quality.
In both non-IOM and IOM clinics, an individual person was defined as a unique passport number, but this may not be accurate as one individual may have more than one passport number.
In data from non-IOM clinics, one screening episode was defined as one unique passport number and examination date. This variable was used to link to the sputum updates data. In data from IOM clinics, one screening episode was defined as one unique system identification number.
NHSE LTBI dataset
This comprises 2 data subsets, a dataset of people eligible for testing and a dataset of individuals for whom LTBI test data was received and who appeared to be eligible. The LTBI test dataset was matched to the eligible population dataset to determine the proportion of the eligible population tested. This was then matched to the cleaned NTBS analytical dataset to derive datasets of; (i) eligible individuals who were not tested through the programme and who developed active TB disease, and; (ii) eligible individuals who were tested through the programme who subsequently developed active TB disease.
Data sources, collection and processing
New-entrant migrant population eligible for NHSE LTBI testing (Type 4) dataset
The Patient Registration Data System (PRDS), managed by the NHS, holds records of all patients registered with GPs in England and Wales. When individuals whose previous address was outside the UK and who have spent more than three months abroad register with a local GP, they are assigned a specific code (Flag 4). This data is also known as Type 4 GP data.
This data is processed by TB Unit for the purpose of providing TB services in participating Integrated Care Board (ICB) subsections and formerly clinical commissioning groups (CCGs) lists of eligible new entrant migrants in their areas. The Flag 4 data is obtained from the Spine Demographic Service (SDS) on a quarterly basis by NHS Digital via a secure encrypted user interface and provided to TB unit. The data is then deduplicated and cleaned including country of birth and age as key eligibility criteria. TB unit then send data to participating ICB subsections in an encrypted format in encrypted emails. A few (5) TB services choose other methods to determine newly registered patients.
As of 1 of July 2022, CCGs were replaced by ICBs as implemented by the Health and Care Act 2022. Therefore this report presents data by ICBs. The linkage between ICBs to previous CCGs is reported in Supplementary Table 10 of the TB prevention in England data set.
Data cleaning to improve data quality of the eligible population dataset (Type 4 data)
Data extracted from the LTBI portal was deduplicated by accepting the first valid test result and NHS number. This data is then added to the database the LTBI programme queries. If the first test result was valid but indeterminate the subsequent retest result was used, if available. To improve the completeness and accuracy of patient identifiable information, records were matched to NHS Spine, NTBS and Type 4 GP datasets using NHS number. Where no NHS number was available, the forename, surname and date of birth were used. Data for NHS Spine were sent with the unique dataset identifier to match the data to the correct record. Postcodes were matched to those in the ONS NSPL (National Statistics Postcode Lookup) look-up to obtain geographical locations. After cleaning, records of individuals who were not born in or travelled from an eligible country or not aged 16 to 35 years at the time of testing (based on date of birth and first LTBI test date) or those that were not tested in programmatic ICBs were removed from this analytical dataset.
Records with implausible dates including: treatment date being earlier than test date, retest date before test date and treatment start date after treatment completion date were excluded from the dataset.
Matching of datasets to NTBS analytical dataset
After cleaning, single person records were deterministically matched from the eligible population dataset with those from the LTBI tested dataset and the cleaned NTBS analytical dataset using various combinations of NHS number (including PDS matched NHS Number), forename, initial, family name, family name soundex, date of birth, gender and postcode. This was undertaken using SQL and on a secure server and the code saved in a UKHSA internal GitHub repository.
Limitations of the data
The estimate of the eligible population for NHSE LTBI testing is based on the Flag 4 GP data for participating services. Readers should be aware that not all new entrant migrants will register with a GP and will thus be missed by the programme. It does not capture new entrant migrants who would be eligible but who register with a GP in a non-participating, typically a low TB incidence area of England.
The LTBI test dataset is also likely an underestimate of the total tests completed in people meeting the eligibility criteria; due to some regions not uploading data – for example Manchester within the Greater Manchester ICB or incomplete data. In 2023, place of birth was missing for 6,811 test records from laboratories, leading to these being excluded as eligibility could not be determined.
Data on prophylactic treatment for LTBI is poorly completed within the LTBI portal. TB services receive the programmatic incentive payment based on the number of valid test results entered, but not treatment data. Variation in local human resources have resulted in variable data entry on treatment completion over time and location.
Report methodology and definitions
TB notifications
Individuals with TB are reported by area of residence and by calendar year of notification.
Social risk factors
People with TB are reported as having at least one social risk factor (SRF) (‘yes’) if any of the 6 social risk factors (current alcohol misuse, current or a history of homelessness, drug misuse, imprisonment, current asylum seeker status and current mental health needs) had ‘yes’ recorded. As a result, the denominator is all notifications. This assumes that people for whom no data were recorded for individual SRFs were a ‘no’ and may result in under-estimation.
Data for individual social risk factors reported are limited to those with recorded data, for example a ‘yes’ or a ‘no’. As a result, the denominators for these are smaller than all notifications due to missing data. If there is significant under-reporting of SRFs in those with missing data, this should result in a better estimate of the true proportion of the people with each SRF. However, if data is more likely to be recorded if the response is a ‘yes’ this could result in an over-estimates. This may be the case for the asylum seeker SRF, especially in years before 2021.
Mental health is recorded by TB case managers and is based on their judgement if mental health concerns are likely to affect the person’s ability to adhere to treatment. This was added to surveillance in London UKHSA centre in 2018 and is a simple ‘yes’ or ‘no’ response. It was introduced nationally in 2021 with the introduction of NTBS. Here we report this as the person has need of support for mental health and therefore has ‘mental health needs’.
Asylum seeker status and immigration removal centre were first added to national surveillance as discrete variables in 2020. Prior to this, ‘asylum seeker’ status was extracted from free-text comment fields and user entered values within occupation (LTBR). As a result, more complete data on this exposure is apparent from 2020 to and after complete roll out of NTBS in 2021 when compared with previous years.
Alcohol misuse is as recorded by case managers and is based on their judgement if current alcohol misuse is likely to affect adherence to treatment.
History of drug misuse, homelessness and prison are self-reported by individuals and are first asked as a ‘yes’ or ‘no’ response and then with additional information on duration; as current, within last 5 years or more than 5 years ago. Unless indicated otherwise, analyses here present these SRFs as ‘yes’ if either history of, or a duration value, was recorded.
Pre-entry screening active TB case definitions
In both non-IOM and IOM clinics, a person with laboratory confirmed TB met the following criteria: had an abnormal chest x-ray (CXR) consistent with TB and a positive sputum culture result or positive culture result in the absence of a valid CXR result (for example in pregnant women or young children).
In IOM clinics, in the absence of sputum test confirmation, TB was clinically confirmed if the following were documented: a clinician’s judgement that the patient’s clinical and/or radiological signs and/or symptoms are compatible with tuberculosis, and a clinician’s decision to treat the patient with a full course of anti-tuberculosis therapy.
In non-IOM clinics, in the absence of sputum test confirmation, TB was clinically confirmed if the reason for no certificate given was referred for treatment and/or TB was recorded as confirmed and a clearance certificate was not issued. For the latter criteria, applicants from 2 clinics were excluded due to poor data quality submitted from the clinics.
In non-IOM data, in the absence of reported sputum culture results where there was an abnormal CXR suggestive of TB, an applicant was considered a suspected TB case if the following criteria was additionally met: TB suspected, grade 4 CXR code and TB clearance certificate not issued. This was because most of all sputum culture positive cases showed this combination of variables.
Previously published reports used both possible and confirmed TB cases to determine the TB case detection rate, but the inclusion of possible cases is likely to result in an overestimate of the true number of cases detected. In the current report the TB case detection rate was determined by the number of people with laboratory or clinically confirmed TB only out of the number of people screened.
Diagnostic and laboratory tests
Data for TB isolates from the National Mycobacterial Reference Service (NMRS) are matched to TB notifications. Isolates are de-duplicated and summarised to only report one isolate per TB notification per notification period.
NTBS also includes user-entered fields to record whether a culture sample and other diagnostic tests, such as PCR, were undertaken and the results of these tests. These data fields are combined to generate a final test status variable for the different tests for all the notified cases.
Culture and other diagnostic test results are then reported as below.
Any test performed:
- Yes: any value recorded in NTBS of any test type variables (culture, PCR, microscopy, histology, or chest X-ray), test result and date of test regardless of result
- No: no recorded value of variables of test type, test result and date of test
Any test positive:
- Yes: positive test result recorded for any test type (culture, PCR, microscopy, histology or chest x-ray)
- No: no results or negative test result recorded to all test types (as above)
Culture confirmed:
- Culture confirmed: supported by NMRS laboratory result of a positive culture for MTBC
- Culture unconfirmed: negative culture, or no NMRS results for culture, surveillance system states no culture undertaken, no other supporting information
Speciation
Species defined as M. tuberculosis, M. bovis, M. microti, M. africanum or Mycobacterium tuberculosis complex.
MTBC is assigned to those not fully speciated by previous PCR based methods or WGS. The introduction of WGS has decreased the number of notifications in this category.
Drug resistance definitions
The resistance reported follows this classification. Most resistance proportions use the number of culture positive cases as the denominator. The only exception to this is the proportion of quinolone resistance (see TB diagnosis and microbiology in England, 2023, Supplementary Table 19) and the phenotypic testing of 2nd line drugs for RR-MDR TB where the denominator is test performed (see TB diagnosis and microbiology in England, 2023, Supplementary Table 20).
Definitions:
- rifampicin resistance or multidrug resistant TB (RR or MDR TB) is defined as resistance to rifampicin with or without isoniazid resistance
- isoniazid mono-resistance is defined as resistant to isoniazid but not reported as resistant to rifampicin
- pre-extensively drug resistant TB (pre-XDR TB) are TB strains which fulfil the definition of multidrug resistant or rifampicin-resistant TB (MDR or RR TB) and which are also resistant to any fluoroquinolone (levofloxacin and or moxifloxacin plus historically used levofloxacin and ofloxacin)
- extensively drug resistant TB (XDR TB) are strains that fulfil the definition of MDR or RR TB and which are also resistant to any fluoroquinolone and at least one additional Group A drug in the WHO updated classification (Group A drugs are the most potent group of drugs in the ranking of second-line medicines for the treatment of drug-resistant forms of TB using longer treatment regimens and comprise levofloxacin, moxifloxacin, bedaquiline and linezolid)
Isolates may be resistant to other antibiotics in addition to those described above.
Laboratory confirmed resistance
Resistance is reported as either resistant or sensitive. Testing is by whole genome sequencing alone or in combination with phenotypic testing. Discordances between the 2 testing methods were resolved by the reference laboratory and the reported value is used for this data analysis. The denominator for all resistance proportions is culture positive notifications, for example known resistance reported as a proportion of culture positive cases (regardless of whether the others were sensitive or unknown resistance).
M. bovis is intrinsically resistant to pyrazinamide. The designation of resistance using genomics relies on a database of known resistance. A change in the pncA gene (gene encoding pyrazinamidase) associated with lineage 1 confers resistance to pyrazinamide. Isolates that are lineage 1 with changes in above gene are checked using phenotypic methods and coded as resistant if resistant by phenotype and sensitive by WGS.
Where ethambutol and pyrazinamide results are missing or unknown (and not lineage 1 for pyrazinamide) but results are known to be sensitive for isoniazid and rifampicin, these are coded as sensitive for ethambutol and pyrazinamide.
In July 2024, the manufacturer of culture media for pyrazinamide pDST issued a field safety notice, mentioning the risk of false resistance results from June 2023. Therefore, in this report we report on all 4 first line drugs up to 2022 but omit pyrazinamide resistance results in 2023. In agreement, the category resistance to any first line drug in 2023 does not include pyrazinamide.
Quinolone resistance is determined by detection of mutations detected by WGS in the quinolone resistance determining region genes: gyrA, gyrB and parC, or if resistant to moxifloxacin or levofloxacin when tested pDST.
Currently, WGS mutations are not used to confirm resistance to the following group A drugs and novel agents: linezolid, bedaquiline and delamanid. Isolates with rifampicin resistance and others meeting particular clinical criteria are sent to a specialised laboratory for pDST for these drugs.
Acquired resistance
This is resistance in a person with more than one sample over time where the first sample shows sensitivity to a given drug and second sample is resistant.
Treated as resistant
This includes notifications that have no culture result but are recorded in NTBS, or the multidrug resistance database, or comments in NTBS indicate the individual has been treated as MDR with a second line drug regimen (for example, contacts of MDR individuals with active TB treated for MDR, or those diagnosed and or started treatment abroad, or people intolerant to rifampicin).
Total MDR or RR cohort
This includes both those with culture confirmed MDR or RR TB and those who were treated as resistant with second line drug regimen.
Clustering isolates
WGS was implemented for all of England in 2018. Results are available only if the isolate was successfully cultured. An isolate is defined as being in a cluster if it has 12 or fewer genetic differences (known as single nucleotide polymorphisms or SNPs) between it and another isolate that has previously been sequenced.
More detail on UKHSA’s approach to WGS-based typing is found in the WGS handbook.
The current database includes samples from devolved nations and research samples; therefore, we report positive clusters where there is more than one person in the cluster from England. The definition for clustered is:
- Yes: 12 SNPs or fewer from another person’s sample and there is more than 1 person resident in England in the cluster
- No: the sample is 12 SNPs or more from any other person’s sample that has been sequenced in the UKHSA database
However, the proportions of notifications clustered are reported as the percentage of clustered isolates (corresponding to a single notification) as a percentage of all culture positive cases. This is also used for the risk ratio analysis of risk of a notification being in a cluster.
Note that contacts may be identified and assumed to be in clusters based on epidemiological information obtained through contact tracing. However, only those with active disease and WGS information are reported here.
TB treatment, diagnostic and treatment delays
Enhanced Case Management and directly observed treatment
Numbers and proportions of people with Enhanced Case Management (ECM) per level, and those receiving directly observed treatment (DOT) were calculated for all of those with information on ECM, and DOT available. People who had information on DOT but were missing ECM data, were coded as “Yes” for any ECM and coded into level 3 of ECM. Those who had missing information on any ECM but were recorded as being in level 0 of ECM (equivalent to standard treatment) were recoded as having “No” in the ECM binary variable of ECM required, thereby considerably reducing proportion of notification with missing information.
The percentage of any ECM was calculated as the proportion of cases that reported “Yes” (1) to ECM, or (2) to DOT offered or (3) DOT received out of all cases with information. The percentage of ECM per level was calculated as the proportion of cases with a known level of ECM out of all cases with information on ‘any ECM required’ (“Yes” or “No”). The percentage missing data is calculated as the proportion of all TB notifications for each year with no information recorded in (1) ECM required, (2) ECM level required or (3) DOT offered or (4) DOT received.
Diagnostic delays
Delays to TB diagnosis is calculated as the days difference between self-reported date of TB symptom onset and the date of TB diagnosis as recorded in NTBS. Diagnostic delays are not calculated for those who were diagnosed with TB at post-mortem and those with missing data, so these are not included in the denominator for the proportion of people with delays to TB diagnosis. Negative diagnostic delays, resulting from symptoms presenting post diagnosis, were also excluded from the analysis as these are likely to indicate data errors or treatment side effects as opposed to disease symptoms.
Reporting delays
Reporting delay is calculated as the days difference between TB diagnosis date and date of TB notification to NTBS/ETS. Reporting delays are not calculated for those with missing data for date of diagnosis, so these are not included in the denominator for the proportion of people with reporting delays. People where the notification date was before the diagnosis date (that is, clinical diagnosis happened before lab confirmed diagnosis) were reported as having 0 days of report delay.
Treatment delays
Treatment delay is calculated as the days difference between self-reported date of TB symptom onset and the date treatment started as recorded in UKHSA’s surveillance systems (NTBS/ETS). Treatment delays are not calculated for those who have not started treatment, those who were diagnosed with TB at post-mortem and those with missing data, so these are not included in the denominator for the proportion of people with treatment delays.
Treatment delays exceeding 2 years (730 days) are excluded from analysis as symptoms lasting for over 2 years are thought to relate to another episode of TB. Negative treatment delays, resulting from symptoms presenting post treatment start, were also excluded from analysis as these are likely to indicate data errors or treatment side effects as opposed to disease symptoms.
TB cohort definitions
For the purposes of reporting treatment outcomes for people with TB, 2 mutually exclusive cohorts are defined. They are:
- MDR/RR-TB cohort: people with TB who were diagnosed with MDR or RR-TB and or were treated with a second line drug regimen for MDR or RR TB
- Non-MDR/non-RR-TB cohort: people who were not identified as MDR or RR-TB and were treated with a first line treatment regimen for non-MDR or non-RR TB
Under this definition, people with TB resistance to isoniazid, ethambutol and/or pyrazinamide but without resistance to rifampicin are included in the non-MDR / non-RR-TB cohort.
Outcomes are reported for the non-MDR/non-RR-TB cohort according to the year of notification up to, and including, 2021. This is to ensure that at least one year of data is available to report treatment outcome by the expected standard treatment duration of less than 12 months. In this cohort, outcomes are reported separately for persons with CNS disease, or in those in whom CNS disease cannot be excluded, which includes those with spinal, cryptic disseminated or miliary disease. For this sub-group, the last recorded treatment outcome is reported as standard treatment is a minimum of 12 months.
Outcomes are reported for the MDR or RR-TB cohort according to the year of notification, up to, and including, 2020. This is to ensure availability of data for the expected standard treatment duration of up to 24 months.
TB treatment outcomes were extracted from NTBS (2020 to 2022) and ETS (2001 to 2020) and cleaned and validated using comment fields, post-mortem diagnoses, date of key events and case-manager follow-up. TB diagnoses that were recorded at post-mortem were excluded from TB treatment outcomes as these cases were not treated. These deaths are reported separately and added to TB treatment deaths to report total TB deaths. This is a change from methodology in reports earlier than the 2021 data annual report. Therefore, please note that all treatment outcome results in this report are not directly comparable with reports prior to this.
LTBI
Completion of prophylactic treatment for LTBI was defined as the presence of a treatment completion date, with or without the presence of a treatment start date.
Disclosure control methods
Only aggregate data is reported. Aggregated data values less than 5 are suppressed except if it is:
- the aggregate number of notifications within a single year for England for children aged under 5 years for each sex as the risk of disclosure is considered very low compared with the importance of monitoring changes in young children
- the aggregated number across multiple years for large geographic areas (England or UKHSA centre)
- the average notifications over multiple years for a geographical area, the smallest of which (by population) is lower local authority
Data analysis
TB rates
TB rates per 100,000 population are calculated using the mid-year population estimates from ONS.
Average annual rates per 100,000 for the 3-year period are calculated by dividing the numerator (the number of TB notifications in the 3-year period) by the denominator (the sum of the mid-year population estimates for the same 3-year period) and multiplying by 100,000.
Confidence intervals
95% confidence intervals are model derived and were calculated using assumptions of the Poisson distribution for rates and the binomial distribution for proportions.
Risk ratios
Risk ratios are model derived using the binomial distribution for proportions.
Software packages and code
Data cleaning and analyses were undertaken using R (R4.3.1) and Stata 17 SE. The code is reviewed and output quality assured using a standard template. Code is held in UKHSA internal GitHub repositories.
Glossary
95% confidence interval
A confidence interval is a measure of the degree of uncertainty in an estimate based on a sample distribution. 95% confidence intervals indicates that if we repeated the study many times, 95% of the confidence intervals would contain the true population value. Wider confidence intervals indicate more uncertainty in the estimate. Overlapping confidence intervals indicate that there may not be a true difference between estimates.
Diagnostic delay
The diagnostic delay represents the time (in days) from when a person self-reported TB symptom onset to when they are diagnosed with TB.
Directly observed treatment (DOT)
DOT is a treatment strategy which refers to the patient taking treatment under direct in-person observation of a trained health care worker or designated individual to ensure treatment adherence for patients requiring ECM.
Enhanced case management
ECM is defined as the increased level of patient monitoring for people with (complex) clinical or social issues or both affecting treatment. There are 3 levels of ECM depending on the complexity of the clinical or social issues or both and the intensity of patient monitoring required, ranging from fortnightly or weekly visits to necessitating DOT or VOT. ECM may be required for children with TB, those with HIV and taking antiretrovirals, people with complex side effects or single drug resistance and those with complex contact tracing or cases in which the involvement of social services is required. For more information see the Nurse guidance document.
International migrant
An international migrant is classified as the movement of a person across international borders to seek temporary or permanent residence in another country.
INH resistant
TB that is resistant to isoniazid, a first-line anti-TB drug, and not other drugs.
Monoresistant to a drug other than INH
Resistance to a first-line treatment drug other than INH, for example, ethambutol.
Multi-drug resistant TB (MDR TB)
Multi-drug resistant TB (MDR TB) is defined as resistance to at least isoniazid and rifampicin, with or without resistance to other drugs.
Pansensitive
Fully sensitive to all first line drugs, for example, isoniazid.
Poly-drug resistant
Poly-drug resistance refers to resistance to 2 or more first-line drugs but not to both isoniazid and rifampicin.
Post-mortem diagnosis
A person diagnosed at post-mortem is defined as having TB which was not suspected before death, but a TB diagnosis was made at post-mortem, with pathological and/or microbiological findings consistent with active TB that would have warranted anti-TB treatment if discovered before death.
Pulmonary TB
A person with pulmonary TB is defined as having TB involving the lungs and/or tracheo-bronchial tree, with or without extra-pulmonary TB diagnosis. In this report, in line with the WHO’s recommendation and international reporting definitions, miliary TB is classified as pulmonary TB due to the presence of lesions in the lungs, and laryngeal TB is also classified as pulmonary TB.
pDST
Phenotypic Drug Sensitivity Testing.
Quinolone resistant TB
TB bacilli that have either mutations detected by WGS in the quinolone resistance determining region genes: gyrA, gyrB and parC, or when tested with pDST were found resistant to moxifloxacin or levofloxacin.
Social risk factors for TB
These include current alcohol misuse, current or history of homelessness, current or history of imprisonment, current or history of drug misuse, current mental health needs, or current status as an asylum seeker or detainee in an immigration removal centre. Please see relevant section under reporting methodology for further details of these variables.
Risk Ratios (RR)
RRs quantify the relative risk of the outcome we are interested in between 2 different groups. For example, the relative risk of pulmonary disease in males compared with females. This is calculated as the proportion of males with pulmonary disease divided by the proportion of females with pulmonary disease, which is a RR of 1.18, (95% CI: 1.11 to 1.25). This is interpreted that males have an 18% increased risk of pulmonary disease compared with females and we have 95% confidence that the true increased risk lies within the range of 11% to 25%. If a 95% CI for a RR includes the value of 1.0, then we cannot infer that the true RR is different from 1.
As a result, we would say that these results are not providing any evidence that the observed magnitude of the RR is ‘statistically important’. If an RR of less than 1.0 is reported, such as RR 0.85, this is interpreted that the group of interest have a 15% reduced risk of the outcome.
RR TB
Resistant to rifampicin, a first-line drug, and not other drugs.
Pre-XDR TB
Resistant to rifampicin, isoniazid, and any quinolone.
XDR TB
Resistant to rifampicin, isoniazid, any quinolone and one additional group A drug (bedaquiline, linezolid).