Restricted fields

#CheadleDA UK Biobank Data Analysts
#CheadleDA UK Biobank Data Analysts The helpers that keep the community running smoothly. UKB Community team Data Analyst
  • Updated

Fields such as home location, facial images, job code at visit as entered and date of birth are restricted in UK Biobank (UKB).   Example restricted field_ids are listed at the end of this article.   If a field is restricted, it will not be available in a UK Biobank Research Analysis Platform (UKB-RAP) project of any cost tier unless special permission has been granted.

 

Finding restricted fields

Restricted fields are marked as # in UKB Showcase field lists, and say “This information is regarded as restricted …” on the UKB Showcase field page

For example, Category 100094 holds 6 data fields, including field 33 which is restricted.

 

Requesting access 

To request access to a restricted field, researchers should contact the UKB Access team by submitting a ticket in the UK Biobank Community website.  To avoid unnecessary delay, please include full details as described below.  

For Covid-19 vaccination fields or for geographical fields such as home locations the Principal Investigator (PI) of the AMS application should submit the request.

For a few restricted fields, such as field 132 which is scheduled for removal, permission will not be granted at all.  Field 20277 is an unrestricted alternative to field 132.   Some phased restricted fields are temporarily restricted, and will only be available when the restriction is removed.   For the majority of restricted fields, permission may be granted if the proposed use of the field is within the scope of the AMS application and is for health-related research in the public interest.   This will be assessed by the UKB Epidemiology team.   Permission will not be granted if there are unrestricted alternatives that should be sufficient.   For example, the date of birth in field 33 is restricted.  Most researchers are expected to use a combination of year of birth in field 34 and month of birth in field 52.   Any request for field 33 should include an explanation of why year and month are not sufficient.

 

Derived data from particularly sensitive fields

For restricted fields that are particularly sensitive, such as date of birth or 100m grid home locations, permission is not granted directly.   Permitted research groups are provided with a separate project inside their UKB-RAP application, containing a different set of EIDs, the restricted fields, and a set of data fields that are deemed essential for creating derived fields.   Researchers derive the fields they need.  UKB map the derived fields back to the original project EIDs and provide the rebadged data in the dedicated project, once the restricted data has been deleted.   For example, an application to investigate the effects of fine particulate matter might request 100m grid locations and use it to generate a derived field stating  the particulate matter level for each participant.   There will be a negligible cost for storing the file containing the derived fields.   The process will entail delay. 

Particularly sensitive fields requiring a separate project and derived fields include the following:

33 Date of birth
132 Job code at visit - entered
22686 Home location at assessment - east co-ordinate (100m resolution)
22687 Home location at assessment - north co-ordinate (100m resolution)
22701 Home location - east co-ordinate (100m resolution) [obsolete]
22703 Home location - north co-ordinate (100m resolution) [obsolete]           
28172 Home location, east coordinate (100m resolution), wave 1
28173 Home location, north coordinate (100m resolution), wave 1
28177 Home location, east co-ordinate (100m resolution), wave 7
28178 Home location, north co-ordinate (100m resolution), wave 7
20269 Home location at assessment - Output Area (2001 census, old code format)
20270 Home location at assessment - Output Area (2001 census)
20273 Home location at assessment - Output Area (2011 census)
32221 Home location history - east co-ordinate (100m resolution)
32222 Home location history - north co-ordinate (100m resolution)

 

Requests for particularly sensitive restricted fields should include a description of the proposed derived fields, together with a justification of the value of the research and an explanation of how it relates to the scope of the application.

 

Covid-19 research

Record-table field 32040 COVID-19 vaccination records and derived tabular field 32041 When COVID19 vaccinations administered have been provided by national NHS data sources for Covid-19 research only.   When requesting these Covid-19 vaccination fields, the PI should explicitly confirm that they will ensure that this data will only be used for Covid-19 research.

 

Home location data

Participant addresses are not held in the main UKB database and are not available to any researchers in any circumstances.   Home location data fields have been created from participant address data.   For more details, see the category descriptions and the resources associated with each field in Showcase, such as  Category 150 , Category 100024  and Resource 2060

Researchers who only require a very general idea of location, such as “Wales” or “London”, may find that instance 0 of field 54 UK Biobank assessment centre is sufficient.   At the baseline visit instance 0 each participant was invited to an assessment centre within 30 miles of their home.   This is not true for the imaging visits instance 2 and 3, where the distance from home address to centre location can be much greater.

There are four sets of home location fields.

     1km grid

22688 Home location at assessment - east co-ordinate (1km resolution)
22689 Home location at assessment - north co-ordinate (1km resolution)
22702 Home location - east co-ordinate (1km resolution) [obsolete]
22704 Home location - north co-ordinate (1km resolution) [obsolete]             
28170 Home location, east coordinate (1km resolution), wave 1
28171 Home location, north coordinate (1km resolution), wave 1
28175 Home location, east co-ordinate (1km resolution), wave 7
28176 Home location, north co-ordinate (1km resolution), wave 7
32223 Home location history - east co-ordinate (1km resolution)
32224 Home location history - north co-ordinate (1km resolution)

 

100m grid (particularly sensitive)

22686 Home location at assessment - east co-ordinate (100m resolution)
22687 Home location at assessment - north co-ordinate (100m resolution)
22701 Home location - east co-ordinate (100m resolution) [obsolete]
22703 Home location - north co-ordinate (100m resolution) [obsolete]           
28172 Home location, east coordinate (100m resolution), wave 1
28173 Home location, north coordinate (100m resolution), wave 1
28177 Home location, east co-ordinate (100m resolution), wave 7
28178 Home location, north co-ordinate (100m resolution), wave 7
32221 Home location history - east co-ordinate (100m resolution)
32222 Home location history - north co-ordinate (100m resolution)

 

Output area (particularly sensitive)

20269 Home location at assessment - Output Area (2001 census, old code format)
20270 Home location at assessment - Output Area (2001 census)
20273 Home location at assessment - Output Area (2011 census)

 

Admin or super area

20271 Home location at assessment - Lower layer Super Output Area/Data Zone (2001 census)
20272 Home location at assessment - Middle layer Super Output Area/Intermediate Zone (2001 census)
20274 Home location at assessment - Lower layer Super Output Area/Data Zone (2011 census)
20275 Home location at assessment - Middle layer Super Output Area/Intermediate Zone (2011 census)
20276 Home location at assessment - Local Authority District/Council Area (2011 boundaries)

 

Research groups will not be granted both grid fields and area fields in the same AMS application, because of potential risk of identification within overlap regions.   Research groups will not be granted fields from any two different home location sets simultaneously.   It is possible to receive 32223/4 (1km grid) together with 22702/4 (1km grid).   It is not possible to receive 32223/4 together with 20271.   It is not possible to receive 32223/4 simultaneously with 32221/2.   For this reason, we ask the PI to confirm any request for home location fields.

 

Updated grid fields

Fields 22702/4 are being replaced by fields 32223/4.   Fields 22701/3 are being replaced by fields 32221/2.   For more details, please see the article “Replacement of Category 150 location history fields”  which is available from the UKB community forum.

Researchers are encouraged to wait for fields 32223/4, which will be available in UKB-RAP version 19.  The UKB-RAP version 19 update is scheduled to happen in the first half of 2025.   The current UKB-RAP version can be seen here.   Researchers with a tight timeline may request fields 22702/4, if they confirm that they understand the limitations of these fields.   Fields 22702/4 will be available for dispense to new UKB-RAP projects during UKB-RAP versions 18 and 19, but will be unavailable for dispense to new projects during UKB-RAP version 20 onwards.   Once dispensed, the fields will continue to be available within that UKB-RAP project until deletion of the project or expiry of the AMS application.   It is acceptable to request both old 22702/4 and new 32223/4.

 

Assessment visit or history

Fields 22688/9 and 22686/7 contain only the grid locations for the addresses that were correct at the time of a UKB assessment centre visit.   For most participants (~80%) this is a single grid location for the home address that was correct at the baseline visit instance 0.  

Fields 22702/4, 22701/3, 32221/2 and 32223/4 include the grid locations for the addresses that were correct at the time of a UKB assessment centre visit, and also the grid locations for any later participant addresses where these have been notified to UKB.  In general, there is no need to request 22688/9 as well as 32223/4 or 22702/4, as they will not provide any extra data, but it is acceptable to request 22688/9, 32223/4 and 22702/4.

Similarly, fields 28170 - 28178 are convenient subsets of grid location data that is relevant to the participants in the serology study.  They do not provide any extra data compared with 32221 - 32224.

 

Dispense a new UKB-RAP project

When access to a restricted field has been granted, and you have been notified of this, please create a new UKB-RAP project and dispense data to it to receive the restricted fields.

The restricted fields will only be available for processing within the UKB-RAP.   It is no longer permissible to download these fields.    If a researcher has previously been using basket downloads, and makes a new request to access any of the restricted fields, they will need to start using the UKB-RAP.

Example restricted fields

33 Date of birth
115 Father's month of birth
117 Mother's month of birth
118 Mother's day of birth
132 Job code at visit - entered
146 Father's day of birth
20215 Scout images for brain scans - DICOM
20216 T1 structural brain images - DICOM
20220 T2 FLAIR structural brain images - DICOM
20221 T2/PD brain images - DICOM
20269 Home location at assessment - Output Area (2001 census, old code format)
20270 Home location at assessment - Output Area (2001 census)
20271 Home location at assessment - Lower layer Super Output Area/Data Zone (2001 census)
20272 Home location at assessment - Middle layer Super Output Area/Intermediate Zone (2001 census)
20273 Home location at assessment - Output Area (2011 census)
20274 Home location at assessment - Lower layer Super Output Area/Data Zone (2011 census)
20275 Home location at assessment - Middle layer Super Output Area/Intermediate Zone (2011 census)
20276 Home location at assessment - Local Authority District/Council Area (2011 boundaries)
22686 Home location at assessment - east co-ordinate (100m resolution)
22687 Home location at assessment - north co-ordinate (100m resolution)
22688 Home location at assessment - east co-ordinate (1km resolution)
22689 Home location at assessment - north co-ordinate (1km resolution)
22701 Home location - east co-ordinate (100m resolution) [obsolete]
22702 Home location - east co-ordinate (1km resolution) [obsolete]
22703 Home location - north co-ordinate (100m resolution) [obsolete]
22704 Home location - north co-ordinate (1km resolution) [obsolete]
28170 Home location, east coordinate (1km resolution), wave 1
28171 Home location, north coordinate (1km resolution), wave 1
28172 Home location, east coordinate (100m resolution), wave 1
28173 Home location, north coordinate (100m resolution), wave 1
28175 Home location, east co-ordinate (1km resolution), wave 7
28176 Home location, north co-ordinate (1km resolution), wave 7
28177 Home location, east co-ordinate (100m resolution), wave 7
28178 Home location, north co-ordinate (100m resolution), wave 7
32040 Records in COVID-19 vaccination dataset
32041 When COVID19 vaccinations administered
32221 Home location history - east co-ordinate (100m resolution)
32222 Home location history - north co-ordinate (100m resolution)
32223 Home location history - east co-ordinate (1km resolution)
32224 Home location history - north co-ordinate (1km resolution)
41265 Records in HES inpatient birth dataset
41288 Date of birth of baby

 

When researchers request access to any of these restricted fields, the request ticket should include the researcher's AMS application_id, the field_ids of the restricted fields being requested, and a justification of the need for the restricted data.   For some requests (see text above) the request should come from the PI.    When requesting Covid-19 vaccination fields, the PI should explicitly confirm that they will ensure that this data will only be used for Covid-19 research.

 

Related to

Was this article helpful?

0 out of 0 found this helpful

Have more questions? Submit a request

Comments

0 comments

Article is closed for comments.