Accredited official statistics

Data quality and methodology

Published 29 October 2024

Applies to England

Introduction

This report describes the quality assurance arrangements for the private registered provider (PRP) social housing stock and rents in England statistics, providing more detail on the regulatory and operational context for data collection and the safeguards that aim to maximise data quality.

Background

The statistics we publish are based on data collected directly from PRPs through the Statistical Data Return (SDR) survey. We use the SDR data extensively as a source of administrative data within the delivery of our operational approach to regulating the economic and consumer standards (see regulatory context on page 5). The United Kingdom Statistics Authority (UKSA) encourages public bodies to use administrative data for statistical purposes, and, as such, we publish these data.

Accredited Official Statistics status

The statistics derived from the SDR data and published as PRP social housing stock and rents in England are considered by the Office for Statistics Regulation (OSR), the regulatory arm of the UKSA, to have met the highest standards of trustworthiness, quality and public value.

Designation history

These statistics were designated as a National Statistic by the OSR in  2014 following an assessment against the Code of Practice for Statistics. Following the designation as a National Statistic, producers of the statistics must comply with the statutory requirement to ensure the Code of Practice continues to be observed. We keep the trustworthiness, quality and value of the statistics under constant review.

Change of designation name

On 7 June 2024, the Office for Statistics Regulation introduced the new Accredited Official Statistics[footnote 1] badge, to denote official statistics that have been independently reviewed by the Office for Statistics Regulation (OSR) and judged to meet the standards in the Code of Practice for Statistics. The new badge and naming convention replace the National Statistics badge.

Publication schedule

We intend to publish these statistics in Autumn each year, with the data pre-announced in the release calendar.

All data, supplementary tables, data tools and additional information (including a list of individuals (if any) with 24hour pre-release access) are published on our statistics pages.

These statistics are also presented in the Ministry for Housing, Communities and Local Government (MHCLG), formally the Department for Levelling Up, Housing and Communities (DLUHC), live tables and our registered provider social housing statistics.

Improvements since designation

Since being designated a National Statistic, the following improvements have been made:

Accessibility

Excel based “look up tools”, allowing the easy interrogation of data at a PRP and local authority (LA) level, are created annually.

Additional supplementary notes, guidance and documents to provide simple access to information to understand the statistics in greater detail (including this data quality and methodology note) are provided.

In line with the Web Content Accessibility Guidelines, we seek to make more of our publication accessible to users by introducing html versions (or html summaries) of documents.

In 2021, we ensured that our additional table and raw data files were published in spreadsheets which employed the principles of accessibility, as recommended in the Government Statistical Service guidance.

Timeliness

We keep the publication timescales under close review. In 2021, we pushed back the publication date from September to October and we believe this later publication date still provides timely access to our data for our key users.

Transparency

The revisions policy and processes have been enhanced to improve transparency in data changes.

Relevance to users

A briefing note style was adopted following user feedback on use and value.

A ‘quick feedback’ link was introduced across all published documents to allow users an easy facility to provide feedback.

User feedback

We are always keen to increase the understanding of the data, including the accuracy and reliability, and the value to users. Please email feedback, including suggestions for improvements, or queries as to the source data or processing to enquiries@rsh.gov.uk.

Quality assurance of administrative data

The data used in the production of these statistics are classed as administrative data. In 2015, the UKSA published a regulatory standard for the quality assurance of administrative data. As part of our compliance to the Code of Practice, and in the context of other statistics published by the UK Government and its agencies, we have determined that the statistics drawn from the SDR are likely to be categorised as low quality risk – medium public interest (with a requirement for basic/enhanced assurance).

The publication of these statistics can be considered as medium public interest, as there has been mainstream media interest, but they have only moderate economic and/or political sensitivity. Concerns over data quality are considered low given the data checks by providers and our data quality checks conducted on the submitted data and analytical processes.

Notwithstanding this, we aim for the highest standards of data quality possible within the constraints of available resources and the existing regulatory and operational context. Through ongoing internal analysis, we seek to understand the strengths and limitations of the data, the overall quality of the data and to identify potential means by which it may be improved.

Regulatory context

The regulatory framework for social housing in England provides both the basis for collecting SDR data and the framework which ultimately underpins data quality. For more information about the Regulator of Social Housing (RSH) and the regulatory framework, please see our website.

Regulatory framework

We collect SDR data to facilitate our operational approach to regulating the economic and consumer standards set out in the regulatory framework for social housing in England. The regulatory framework consists of three elements:

  • Regulatory requirements – the requirements with which PRPs need to comply (including our economic and consumer standards, and requirements on data and information submission).
  • Codes of practice – to assist PRPs in understanding how compliance can be achieved.
  • Regulatory guidance – further explanatory information on the regulatory requirements, including how we will carry out our role of regulating the requirements.

As part of the regulatory framework, PRPs are required to submit SDR data by 31 May each year, along with other data returns and regulatory documents at various points during the year.

The regulatory framework and data quality

The regulatory framework places the onus for data quality on PRPs and their Boards. The Governance and Financial Viability Standard sets out the specific expectation that “providers shall communicate with the regulator in an accurate and timely manner”. This expectation is amplified in guidance on the regulatory approach, Regulating the Standards. This states that we will consider that “the submission of late and incomplete or inaccurate regulatory data may be indicative of a weak control environment” and “failure to provide accurate and timely data may be reflected in [the regulator’s] judgement of a provider’s compliance with the regulatory standards”.

Addressing issues with data quality

We have a range of statutory and enforcement powers and act proportionately to address issues of data quality through our regulatory approach. We publish regulatory judgements relating to compliance with the Consumer, and Governance and Financial Viability Standard (which includes data quality) for PRPs that own 1,000 or more social housing units.

Governance of data and statistics at RSH

The statistician responsible for the publication of these statistics is also responsible for the SDR data collection and the cleansing of incoming SDR data. This entails working with PRPs to directly address anomalies within the data submissions and producing the final dataset on which the statistics are produced.

All SDR data are stored and analysed within secure, access-controlled networks and access to the sector level analysis work undertaken on the data is restricted until after publication (PRP level data are accessed by our staff as part of operational work). Further information on the data quality assurance processes we employ is provided on page 7.

Data submitted by PRPs are redacted within the public release to remove all contact information submitted alongside the return. This contact information is not publicly available. There is no other administrative data held by us which can be made available for use in statistics. However, we publish a range of summary data from other information collected which are available on our website’s analysis and statistical reports page.

SDR collection

All PRPs are expected to complete the SDR. The size of return completed is based on the size of the PRP, with those owning 1,000 or more units of social stock providing more information.

History

The first collection occurred in 2012 and it has been collected annually from PRPs since. The SDR collects data on stock size, types, location and rents as at 31 March each year, and data on sales, acquisitions and Decent Homes Standard activity during the 12 months up to the 31 March each year.

Systems

The SDR is collected via a web-based system called NROSH+. We control the requirements for data input processes, storage, verification, sign-off and extraction of submitted data and produce the statistical releases. Data are either imported or entered directly into the NROSH+ system by PRPs.

In 2023, a revised NROSH+ application was launched. The principles of the site remained the same, but new features such as on-screen live validation updates and better navigation were introduced. We do not believe the new system has had any impact on providers and the response rates to the survey since the revision have remained similar to previous years.

Communication with data suppliers

We work closely with PRPs, through email messages and phone discussions, to ensure there is a common understanding of the data collection requirements throughout the data collection process. Guidance materials are also promoted to users and published on NROSH+.

Quality assurance processes

We do not have oversight of the systems and data quality assurance processes employed by the PRPs before submitting data to SDR. However, we do provide clear guidance and documentation on the NROSH+ system and subject SDR submissions to a series of checks to identify potential quality issues before each data return is signed off.

The final SDR data file that supports the statistical release is only created once all outstanding queries are resolved. Any returns not meeting our quality standards are excluded from the final dataset.

Submission checks

SDR data submitted to us are subject to both automated validation checks and manual inspection.

Automated validations are programmed into NROSH+ and check the SDR data at the point of submission for correct formatting, consistency and logical possibility (within expected limits). For example, ensuring numbers of units are consistent across different parts of the SDR and that chains of follow-up questions are completed. Automated validations are either ‘hard’ or ‘soft’.

Hard validations – PRPs cannot submit without the issue being resolved (e.g. when a rent value is provided but the number of units it applies to is missing).

Soft validations – PRPs can submit but are required to check their information and, if correct, to submit a supporting document (e.g. when a value appears to be outside of a normally expected range, such as higher than expected rents).

Manual inspections are systematically undertaken on all data submitted. All returns are checked for basic consistency and likely errors, for example where proportions of stock recorded as particular excepted categories are outside the expected range or where rents are particularly high or low. This includes comparison to previously submitted data with unexpected movements in stock levels or changes outside those anticipated in reported rent values being queried with PRPs. Where we identify a potential anomaly with the submitted SDR data, a query is raised with the submitting PRP. The sign-off of an SDR submission is dependent on the resolution of all queries which could materially impact the quality of the published data. This overall checking process is outlined in the diagram in Annex A.

Submission checks and sign-off

It should be noted that the process of signing off data is distinct from our ongoing regulatory work. The sign-off of data confirms that we have investigated areas of potential data error, as highlighted in the validation and checking work, and accepted that the provider has submitted data they believe to be accurate. However, during the course of our regulatory activity, the data will be reviewed alongside other evidence, and we may subsequently challenge classification of stock, rent setting or any other aspect covered by the data with providers. As such, the sign-off and publishing of these data does not constitute our agreement that the provider has appropriately classified units or complied with our standards.

Post publication checks

SDR data (excluding contact details or optional pilot year questions) are published at a disaggregated level as part of the statistical release. Releasing data into the public domain serves as an additional route through which erroneous data may be identified by the PRP or third parties.

Misreporting

There are no numerical measures of misreporting of SDR data by PRPs. However, one source of possible quality weakness is inconsistent interpretation of guidance with providers not applying this consistently across the sector. This issue is most likely to arise where there are technical or legal definitions that are complex or, to some degree, ambiguous. It is more likely to arise among PRPs owning fewer than 1,000 units due to the reduced level of contact with regulation staff and their specialisms in certain types of activities (e.g. supported housing/leasehold). Please see technical notes and definitions for more information on other factors which impact on the data collected.

Corrections

Where errors in the SDR data are discovered within a survey year, either through regulatory activity or through provider contact, we do allow providers to resubmit SDR data through the NROSH+ portal. Returns can be amended until mid-March the year following their launch.

Under the revisions policy (see technical notes and definitions), errors identified will be investigated and revised data gathered. Some corrections may be only minor changes to the data, with little or no impact on the published statistics. These changes will be published at the next scheduled release with no specific announcement.

However, if we become aware of substantial errors in the submitted data, statistical process or other methodology and where a major revision to the published data is required, a non-scheduled revision of the statistical release will be published. This will include full details of the revisions, clearly marked data amendments and summary tables showing the overall impact of the changes.

We seek to ensure transparency in processes to maximise user confidence in the quality of our statistical releases.

Data quality regulation

If, through either manual checks or subsequent information, SDR data have been submitted with significant material errors that may reasonably have been found by a PRP during their internal quality control process, we will consider the extent to which this offers evidence of failure to meet requirements for data quality and timeliness under the Governance and Financial Viability Standard. Within the regulatory framework set out on page 5, we will consider the most appropriate response.

Statistical release methodology

The data presented in the PRP social housing stock and rents statistics are drawn from the SDR data.

Accounting for missing data

A list of late/missing returns has been published alongside the release since 2018.

In 2014, following consideration of alternative methods and discussions with the National Statistician’s Office and (now) MHCLG, formally known as DLUHC, weighting was selected to account for the small proportion of missing data. This method was chosen given the incomplete prior year data for some of the non-submitting PRPs and the relative simplicity of the dataset.

2024 responses

All PRPs are required to complete the SDR. However, due to non-submission or exclusion due to unresolved errors, there is a small level of known SDR non-response. In 2024, the overall non-response rate was 3.6%. This is similar to that seen in previous years. Using other administrative data held by us, we have identified that all the PRPs excluded from the SDR data are those which own fewer than 1,000 social housing units/bedspaces. The response rate for these ‘small’ providers in 2024 was 96% (with the response rate for providers owning 1,000 or more units being 100%). Data are weighted to account for this small proportion of census respondents for which data are not available (see weighting section below).

Weighting

SDR data (2012 to 2024) have been weighted. Weighted data are highlighted in the release and are covered in the supporting notes for the relevant tables.

2024 weighting

The impact of weighting the 2024 data is shown in published supplementary tables. As in previous years the effect of weighting on totals is relatively minor (0.1% to 2.0%) given the minimal missing information.

Data are weighted in the following categories:

Categories Large PRPs Small PRPs
General needs  
Supported housing  
Housing for older people  
LCHO  
Social leasehold  
Non-social rented/leasehold  
Affordable Rent (all categories) not weighted not weighted
Evictions (all categories) ✔(some1) N/A
Mutual exchanges 2 N/A
No. of Decent Home Standard failures  

1 - Excepting 2012 data, where an accurate response rate for this question cannot be determined and from 2020 where this question became mandatory for all large PRPs.

2 - Excepting 2012 data, where an accurate response rate for this question cannot be determined.

Caution should be used when viewing weighted results for evictions and mutual exchanges. Given the optional nature of the questions (in some or all years), non-responding PRPs may not have the same trends as responding PRPs. It may be that those with high rates of evictions were more likely to choose not to disclose that information in an optional question or those with a low rate may have felt it unnecessary. For more information on our prior weighting of evictions, please see our 2020 release.

Basic method

Weighting based on the response rate to the SDR has been applied to the categories shown above. It assumes that the trends in the data submitted by PRPs would also apply to the PRPs who did not submit (e.g. that the proportion of social housing stock owned in each region of England is the same for the small providers that did not respond as it is for the small providers that did).

The general formula used to conduct this weighting was as shown:

weighted result = unweighted result x 1/response rate

For regional and national totals, weighting was carried out at the LA level and aggregated upwards. Due to the discrete nature of the data (number of units), all data were rounded up (i.e. a weighted result of 10.1 units would be recorded as 11 units as it represented a figure greater than ten units).

Exceptions

The number of Affordable Rent units was not weighted. To own this type of stock, PRPs must be signed up to the Affordable Homes Programme which requires additional reporting and active engagement with Homes England/Greater London Authority and with us. The likelihood that any Affordable Rent stock has not been captured in the SDR is therefore considered to be very small. Accordingly, these units were removed from the dataset before weighting occurred, then added back in.

Affordable Rent data cannot always be split by stock type. The number of supported housing and housing for older people units cannot be separated so it was assumed that the distribution of Affordable Rent stock followed the distribution of social rent stock (e.g. supported housing and housing for older people). Affordable Rent was assumed to be divided between the two component stock types in the same proportion as the units not designated as Affordable Rent supported housing and housing for older people stock.

Average rent and service charge calculations

Rent data for large and small providers are collected on a slightly different basis. Large PRPs report detailed rent information on general needs and supported housing rents, by geography and bedsize. Small PRPs only report an average figure for general needs and one for supported housing across stock owned at a provider level. All PRPs with Affordable Rent stock are required to submit rental information for that stock regardless of the total number of units owned.

Calculation of averages

All averages relating to rents for large PRPs in this statistical release are fully weighted by stock owned by PRPs for the appropriate geography and/or sub-group. Small PRP data is a single average calculated from provider averages of stock number and type.

Average service charges and gross rents

The average service charges presented in the rent sections relate only to the stock where there is a ‘housing benefit or Universal Credit eligible’ service charge present. Therefore, zero service charges are excluded from this calculation.

However, gross rents presented in these tables do include stock without a service charge. Because of this, the sum of the average net rent and average service charge will not equal the average gross rent.

Calculation of formula rents

PRPs are required to follow the guidance we set out when calculating formula rents. It should be noted that formula rents are not applicable to homes let under the Affordable Rent programme, those classified as temporary social housing or intermediate rent properties.

Unit sizes for which rent data are collected

PRPs owning 1,000 or more units are required to submit LA level breakdowns for rent and service charges for the following unit sizes. PRPs owning fewer than 1,000 units submit rent figures at a PRP level only, combining all unit sizes and locations. Analysis presented in the statistical release focuses on the detailed rent data submitted by larger PRPs only.

General needs and
Affordable Rent general needs Supported housing/housing for older people and Affordable Rent supported housing/ housing for older people
Bedspaces/non-self-contained Bedspaces/non-self-contained
Bedsit Bedsit
1 bedroom 1 bedroom
2 bedroom 2 bedroom
3 bedroom 3 bedroom
4 bedroom 4 or more bedrooms
5 bedroom  
6 or more bedrooms  

For all material in the 2024 release (including briefing notes, supplementary tables and the 2024 dataset) visit Statistics at RSH - Regulator of Social Housing - GOV.UK (www.gov.uk).

Quality assurance of the published statistics

The data, briefing note, look up tools and tables are quality assured by analysts within our statistics production team. This process ensures the figures are consistent across the release, and match the raw data submitted through the SDR. Each check is signed off and recorded by the responsible statistician when it has been completed.

Revisions

Under the revisions policy (see technical notes and definitions), errors identified will be investigated and revised data gathered. Some corrections may be only minor changes to the data, with little or no impact on the published statistics. These changes will be published at the next scheduled release with no specific announcement.

However, if we become aware of substantial errors in the submitted data, statistical process or other methodology and where a major revision to the published data is required, a non-scheduled revision of the statistical release will be published. This will include full details of the revisions, clearly marked data amendments and summary tables showing the overall impact of the changes.

In 2024, through our regulatory activity, we became aware, that a provider had submitted some data in 2023 which needed revision. The impacted their reporting of general needs and supported housing social rent information, However, the impact of this was relatively minimal with the maximum impact on general needs service charges (£0.18 or 2.4%).

In addition, during the 2024 cleaning, a single provider was unable to provide assurance on the accuracy of their rent data in the 2024 year. As such we took the decision to exclude their data from the publication of these statistics for both the 2023 (baseline) and 2024 (current) year. This allows for comparability in the year-on-year change analysis. The impact of this was relatively minimal with the greatest impact on the 2023 figures being a £1.20 (or 0.5%) decrease in supported housing Affordable Rent gross rent in London (note this impact is amplified due to the relatively small number of overall units).

A breakdown of the impact these changes have had on the 2023 data is provided in the PRP rent briefing note.

Annual cycle of regulatory activity

The SDR dataset is used to inform our engagement on registered providers’ compliance with the Rent Standard. As part of their response to any issues raised, providers subject their data to increased validation and may identify errors in the data submitted. We are committed to ensuring the quality of the SDR data and will gather corrected data from PRPs as part of this work.

We will republish these statistics in the April of the year following the initial publication if the aggregate changes made by providers require a major revision. If a major revision to published data is not required, the changes will be incorporated (and clearly marked) in the published baseline data for the following years’ release.

Why not have your say on our statistics in 2024/25?

We want to hear your views on how the format and range of documents in this statistical release meet your needs. Please email feedback, including suggestions for improvements to enquiries@rsh.gov.uk or click below to quickly rate how this document meets your needs.

Our statistical practice is regulated by the Office for Statistics Regulation (OSR).

OSR sets the standards of trustworthiness, quality and value in the Code of Practice for Statistics that all producers of official statistics should adhere to.

These accredited official statistics were independently reviewed by the Office for Statistics Regulation in 2014. They comply with the standards of trustworthiness, quality and value in the Code of Practice for Statistics and should be labelled ‘accredited official statistics’.

You are welcome to contact us directly with any comments about how we meet these standards.

Alternatively, you can contact OSR by emailing regulation@statistics.gov.uk or via the OSR website.

Annex A: Quality assurance processes – process map

Data import, entry and system validation (pre-submission to RSH)

Manual data processing and outlier checking (post submission to RSH)

Contact information

  • Responsible statistician:    Amanda Hall

  • Public enquiries:  enquiries@rsh.gov.uk or 0300 124 5225

  1. Accredited official statistics are called National Statistics in the Statistics and Registration Service Act 2007. https://osr.statisticsauthority.gov.uk/accredited-official-statistics/