Machine learning derived retinal pigment score from ophthalmic imaging shows ethnicity is not biology | Nature Communications
Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain
the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in
Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles
and JavaScript.
Advertisement
Machine learning derived retinal pigment score from ophthalmic imaging shows ethnicity is not biology
Download PDF
Download PDF
Subjects
Genome-wide association studies
Machine learning
Outcomes research
Abstract
Few metrics exist to describe phenotypic diversity within ophthalmic imaging datasets, with researchers often using ethnicity as a surrogate marker for biological variability. We derived a continuous, measured metric, the retinal pigment score (RPS), that quantifies the degree of pigmentation from a colour fundus photograph of the eye. RPS was validated using two large epidemiological studies with demographic and genetic data (UK Biobank and EPIC-Norfolk Study) and reproduced in a Tanzanian, an Australian, and a Chinese dataset. A genome-wide association study (GWAS) of RPS from UK Biobank identified 20 loci with known associations with skin, iris and hair pigmentation, of which eight were replicated in the EPIC-Norfolk cohort. There was a strong association between RPS and ethnicity, however, there was substantial overlap between each ethnicity and the respective distributions of RPS scores. RPS decouples traditional demographic variables from clinical imaging characteristics. RPS may serve as a useful metric to quantify the diversity of the training, validation, and testing datasets used in the development of AI algorithms to ensure adequate inclusion and explainability of the model performance, critical in evaluating all currently deployed AI models. The code to derive RPS is publicly available at:
Similar content being viewed by others
Automated OCT-tailored and same-visit biomarker-targeted microperimetry in geographic atrophy
Article
Open access
27 March 2026
Phenotypic characterization of autosomal dominant progressive cone dystrophies associated with a heterozygous variant c.2512C>T of
GUCY2D
gene in a large kindred
Article
12 December 2022
Relative ellipsoid zone reflectivity and its association with disease severity in age-related macular degeneration: a MACUSTAR study report
Article
Open access
02 September 2022
Introduction
Retinal diseases are a significant global cause of vision loss, but not all populations are affected equally. In 2020, there were estimated to be 103.1 million adults worldwide with diabetic retinopathy (DR) and 196 million people with age-related macular degeneration (AMD)
. Studies have found DR prevalence is highest in Africa (35.90%), then North America and the Caribbean (33.30%). In contrast, AMD has a significantly higher prevalence in people of European than in those of Asian or African ancestry
. In response to the overwhelming global burden of disease, many artificial intelligence (AI) algorithms have been developed to enable more efficient care delivery. These AI algorithms have been widely published, and several of them are already in clinical practice for providing automated diagnoses of diseases such as DR, AMD, and glaucoma
The success of AI algorithms in ophthalmology is partly due to the availability of large imaging datasets that have been collected from routine clinical practice
. Describing the demographic characteristics of these large datasets used for training is critical for understanding algorithm generalisability, limitations and overall performance. Despite this need, less than 20% of publicly available retinal imaging datasets contain patient characteristics such as age, sex, or ethnicity
10
. Studies that compare model performance across different populations are limited due to methodological limitations and unlabelled data, making bias evaluation challenging
11
12
Previous studies assessing bias in general AI algorithms have found that the performance of image-based algorithms is often worse among people with a greater degree of skin pigmentation, for instance, in skin cancer classification, facial recognition and object detection
13
14
15
. In these studies, image pixel values are measured and converted into a categorical pigmentation scale by human labellers or an algorithm. These categories are then used to estimate the relative performance of algorithms within subcategories of pigmentation, either in the presence or absence of additional demographic data. Additionally, these scales allow for the decoupling of race/ethnicity from the biological differences in skin pigmentation
16
In the eye, melanin is present in the uvea (iris, retina, and choroid) and is responsible for blue or brown iris colour and retinal pigmentation
17
18
19
20
21
22
. We aimed to develop a continuous scale called the Retinal Pigment Score (RPS) to quantify the background pigmentation of retinal colour fundus photographs captured in UK Biobank. We then sought to validate the RPS by comparing against the self-reported ethnicity of the UK Biobank participants and by performing both genome-wide (GWAS) and phenome-wide (PheWAS) association studies. Mechanistic insight was attained through gene prioritisation and functional annotation whilst causal associations with clinically relevant outcomes were tested using Mendelian randomisation. We validated our results with a replication GWAS study in an independent cohort (European Prospective Investigation into Cancer and Nutrition [EPIC]--Norfolk) and reproduced the RPS method in three non-white populations (Tanzanian, Australian and Chinese datasets).
Results
Retinal Pigment Score
We designed an algorithm to extract the background retinal pigmentation from colour fundus photographs. Briefly, the algorithm uses published open-access deep learning models
23
to identify and exclude ungradable images, create a mask of the tissue that excludes the retinal vasculature and optic nerve, then finds the average chromaticity of this background tissue. Chromaticity in this study is the quality of colour independent of illumination measured in the CIELAB colour space
24
. The chromaticity is then converted into a single continuous metric: the RPS (Fig.
, Methods Retinal Pigmentation Score). A greater RPS correlates with a greater degree of pigmentation in the retina. A total of 135,592 colour fundus photographs (67,982 right eyes, 67,610 left eyes) from 68,504 participants in the UK Biobank study were available for analysis. From these, 74,851 images (40,329 right eyes, 34,388 left eyes) from 44,320 participants (55% female) were deemed gradable by our pipeline and included in the analysis (Supplementary Table
). A previous study of this dataset deemed only 11% of the images as “Good” quality when assessed by human graders. Additionally, this study described a similar imbalance in laterality
25
. Supplementary Fig.
shows the percentage area of the image identified as vessels and optic disc was comparable across ethnic groups. Moreover, the approximate area of the optic nerve head would fall in line with previous reports by ethnicity
26
. Additionally, small differences in the size of the segmentation masks of the vessels and optic disc have a minimal effect on the RPS, as shown in Supplementary Fig.
. The patient-level characteristics are summarised in Supplementary Table
. The median age was 56 years (Interquartile Range[IQR]: 49-63) and 92% (40,704/44,320) of participants self-described their ethnicity as white. The proportional eigenvalues of the Principal Component Analysis used to normalise the RPS of the first two dimensions were 0.899 and 0.101. The median (IQR) RPS was −0.82 (−9.89, 10.39). Each ethnic group had images in every quintile of the RPS range, except for Chinese participants in the lowest (less pigmented) quintile (Fig.
).
Fig. 1: Schematic showing the method to generate the retinal pigmentation score (RPS) from a colour fundus image.
The alternative text for this image may have been generated using AI.
Full size image
Input images are fed into the deep learning algorithm to generate segmentation masks. These are added together to make a retinal background mask, which is then transformed into L,a,b colorspace. The chromaticity vectors are then extracted and transformed by the principal component analysis model to create the RPS. Created with Biorender.com.
Fig. 2: Representative fundus photos with associated RPS.
The alternative text for this image may have been generated using AI.
Full size image
Randomly sampled colour fundus photographs from each UK Biobank self-reported ethnicity and from the Tanzanian, Australian, and Chinese (ODIR) datasets, sorted by quintiles of retinal pigment score (RPS) across the entire distribution of RPS for the UK Biobank cohort. The RGB colour of the pixel value that is converted into RPS as well as the RPS is shown at the bottom of each fundus photograph. Black spaces represent when there are no suitable images within the respective ethnicity subgroup and quintile
Normalised kernel density estimation plot of the distribution of RPS for all participants grouped by self-reported ethnicity as reported in the UK Biobank as well as the Tanzanian, Australian, and Chinese (ODIR) datasets. Relative frequencies are normalised so the area under each curve is equal for each ethnicity subgroup. The subpanel consists of examples where for a given RPS and the a,b values in the CIELAB colour space are constant but the L vector changes. The
-axis is shared in both subpanels. Source data are provided as a Source Data file.
Our RPS method was reproduced in three independent non-white populations from cohorts outside of the UK (described here by their geographical location as “Tanzanian, Australian, and Chinese” datasets. Self-described ethnicity data was not available for the Tanzanian and Chinese datasets and was described as Indigenous Australian for the Australian dataset). To aid in comparison, we elected to fit the RPS model trained on the UK Biobank to these datasets to create RPS with a unified scale. A total of 1150 colour fundus photographs of 675 eyes from 348 people were included in the Tanzanian dataset, 715 colour fundus photographs of 715 eyes from 439 people were included in the Australian dataset, and 2088 colour fundus photographs of 2088 eyes from 1431 people were included from the Chinese dataset. The Chinese dataset was from the publicly available Ocular Disease Intelligent Recognition (ODIR) dataset
27
. Supplementary Table
provides details on the number of images analysed, included, and deemed as inadequate quality, hence excluded by the pipeline. The median RPS were 40.24 (IQR: 36.38–44.94) in the Tanzanian dataset, 27.92 (18.67–35.17) in the Australian dataset, and 29.79 (21.94–35.41) in the Chinese dataset. Figure
shows the RPS distribution from Tanzanian, Australian, and Chinese datasets in relation to the RPS distribution of self-described ethnicity groups from the UK Biobank.
Retinal pigment score reliability
We assessed RPS reliability in an independent dataset of 27 eyes from 24 patients with two images per eye. The one-way consistency intraclass correlation coefficient (ICC) was 0.920 (95% CI 0.834, 0.963,
-value < 0.001).
Among 30,407 participants in the UK Biobank that had available imaging, the mean and standard deviation of the difference in RPS between the right and left eye was −1.36 (8.30) and the one-way ICC was 0.788 (95% CI: 0.784, 0.792). In contrast, the use of the L, a, b vectors to calculate RPS among the same group of participants yielded a mean score of −2.40 (10.70) and a one-way ICC of 0.757 (95% CI: 0.753, 0.762).
Associations of RPS with clinical variables
We first examined associations of mean RPS (average score between right and left eyes per participant) with sociodemographic and clinical variables. Supplementary Fig.
shows the association of RPS with the covariates of interest (deciles of continuous variables) adjusted for age, sex, and UK Biobank centre. Non-white self-described ethnicities were associated with increased RPS when compared with white individuals. A positive graded association was observed with increased skin pigmentation, hair pigmentation, and deprivation. There was an inverse linear association between RPS and height.
Next, the associations were tested with multivariable linear regression adjusting for age, sex, height, self-described ethnicity, self-described hair and skin colour, Townsend deprivation index (TDI), refractive status, and UK Biobank assessment centre (Supplementary Table
). The RPS was modelled as a z-score. Coefficients represent the standard deviation (SD) change in RPS per specified increase in covariates or the standardised difference between groups. Formal variance inflation factor testing on the final model with adjusted generalised standard error inflation factors showed no strong collinearity (Supplementary Table
) When compared with very fair skin colour, darker skin tones showed a graded increase in RPS (p for linear trend 2.5 × 10
−231
). People with black skin colour showed a 0.83 SD increase in RPS (95%CI 0.72, 0.94;
5.7 × 10
−48
) when compared with people of very fair skin colour. Similarly, when compared with people with blonde hair, darker hair colours showed a graded positive association with RPS (
for linear trend 2.2 × 10
−155
). People with black hair colour showed a 0.53 SD increase in RPS when compared with people with blonde hair colour (
4.8 × 10
−122
).
There was a strong association of ethnicity with RPS. When compared with white individuals, Chinese (1.49, [1.35, 1.62];
4.6 × 10
−103
) and black (1.15, [1.07, 1.23];
1.6 × 10
−158
) people showed the largest effect sizes. However, within ethnic groups, there was a wide spread of overlapping RPS values (Fig.
). Every 5-year rise in age was associated with a small 0.02 SD increase in RPS (
1.3 × 10
−8
), and every 5 cm increase in height conferred a small −0.02 SD change in RPS (
3.6 × 10
−8
). However, sensitivity analyses with stratified linear regression models across the three main ethnic groups showed an association in different direction for age in white ethnic groups when compared with models from Black and Asian ethnic groups (Supplementary Table
). The association with height remained significant and in the same direction for white and Asian ethnic group models, and was not significant for Black ethnic groups. Supplementary Fig.
shows mean RPS adjusted for sex, and UK Biobank centre by deciles of age and height for the three main ethnic groups. A non-linear association was evidenced for refractive status. A higher RPS was observed in people with emmetropia (0.16 [95%CI 0.11, 0.20];
1.1 × 10
−12
), and hyperopia (0.11, [0.06, 0.15];
1.1 × 10
−6
) when compared with people with high myopia. The most deprived TDI quintile showed a 0.06 SD increase in RPS when compared with the least deprived TDI (
for linear trend 3 × 10
−4
). Townsend Deprivation Index showed, however, an association in a different direction in sensitivity analysis in the white ethnic group model when compared with the Black ethnic group model (Supplementary Table
). Sex was not associated with RPS.
Genome-wide association study discovery analysis
A genome-wide association study (GWAS) was performed to assess potential associations with standardised mean RPS (average score between right and left eyes per participant); robust associations would potentially support the biological plausibility of the metric. The discovery analysis included 37,067 individuals of European ancestry from the UK Biobank cohort. The genomic inflation factor was 1.071, and the linkage disequilibrium score regression intercept was 1.013 with a ratio of 0.09. Conditional analysis identified 20 independent autosomal genomic loci reaching genome-wide significance (
< 5 × 10
−8
), the majority of which (17/20) have previously been shown to associate with hair, skin and/or iris colour (Table
, Fig.
).
Table 1 Genome-wide significant associations with retinal pigment score in the UK Biobank cohort
Full size table
Fig. 3: Manhattan plot of GWAS results from the discovery cohort (UKBiobank,
= 37067).
The alternative text for this image may have been generated using AI.
Full size image
The
-axis represents the two-sided
-values from the linear mixed effects model. Lead variants identified by GCTA-COJO are annotated with the nearest gene. Points are truncated at −log10(
) = 70 for clarity. The dashed red line indicates genome-wide significance (
= 5 × 10
−8
) which is adjusted for multiple comparisons and the
-values are two-sided and calculated with the z-statistic. Source data are provided as a Source Data file.
Positional and expression quantitative trait locus mapping in retina, skin and dermal fibroblasts were performed to identify candidate causal genes at each independent risk locus. This produced a set of 100 prioritised genes (Supplementary Table
), which were then annotated in biological context. A number of these had existing entries in the GWAS Catalog
28
, with enrichment for traits including hair, eye and skin colour, as well as various skin malignancies (Supplementary Fig.
). Enrichment for several Gene Ontology entries was also apparent, especially those related to melanin and pigmentation biological processes (Supplementary Fig.
).
Genome-wide association study replication analysis
A replication GWAS was conducted in the independent EPIC Norfolk Eye Study cohort, which is a predominantly white (99.7%), longitudinal cohort from Norfolk, England
29
. This replication analysis included 4273 individuals of European ancestry. Due to differences in genotyping platforms and imputation methods, three of the lead variants highlighted in the discovery GWAS were either unavailable or did not pass quality control in the replication dataset (rs173273, rs762948237, and rs766338951). Replication was therefore assessed for 17 out of the 20 lead variants.
The direction of effect was concordant for all 17 variants and highly correlated with estimates from the discovery analysis (Pearson’s rho = 0.986 [95% CI: 0.961, 0.995]) (Fig.
). Of the 17 variants, 15 variants were significant at
< 0.05, 8 remained significant after adjusting for multiple testing (
< 0.05/17), and 2 achieved genome-wide significance (Supplementary Table
).
Fig. 4: Comparison of betas for lead variants identified from the discovery and replication cohort.
The alternative text for this image may have been generated using AI.
Full size image
Comparison of betas expressed as change in standard deviation of mean RPS for lead variants identified from the discovery (UK Biobank,
= 37067) genome-wide association study (GWAS) with their corresponding betas in the replication (EPIC-Norfolk,
= 4273) analysis, with 95% confidence intervals. Betas in the cohort were calculated using a generalised linear mixed model, adjusting for age, sex and the first ten principal components.
-values are two-sided, calculated from the z-statistic and corrected for multiple comparisons. Variants meeting the Bonferroni-adjusted replication significance threshold (
= 0.05/ variants) in the EPIC-Norfolk GWAS are shaded black. The nearest gene is annotated for variants achieving genome-wide significance. Source data are provided as a Source Data file.
Phenome-wide association study
A phenome-wide association study (PheWAS) was performed within the discovery UK Biobank GWAS sub-cohort (
= 37,067) to assess potential associations between standardised mean RPS (average score between right and left eyes per participant) with 308 diseases. After correction for multiple testing (
< 0.05/308), significant associations were observed for higher RPS (indicating more pigmentation of the retina) with decreased odds of ‘Actinic keratosis’ (OR: 0.87, 95% CI: [0.81, 0.93]), ‘Primary Malignancy - Other Skin and subcutaneous tissue’ (0.90 [0.86, 0.94]), and ‘Migraine’ (0.91, [0.87, 0.95]). Higher RPS was associated with a greater risk of chronic obstructive pulmonary disease (COPD); (1.11, [1.05, 1.16]). A further 26 diseases were significantly associated at
< 0.05, including decreased odds for ‘Primary Malignancy - Malignant Melanoma’ (
= 0.02) (Supplementary Fig.
, Supplementary Table
).
Mendelian randomisation
Two-sample Mendelian randomisation (MR) analyses were performed to probe potential causal relationships between genetically predicted retinal pigmentation with outcomes of particular interest, as highlighted by the PheWAS analysis, using outcome summary statistics from FinnGen
30
(Supplementary Table
). 13 variants were included in the instrumental variable following clumping of the 20 conditionally independent lead variants from the discovery RPS GWAS, and harmonisation with FinnGen (see Methods). All variants had an F-statistic >10 (mean 139.9). Inverse-variance weighted (IVW) MR estimates provided evidence for protective causal effects on actinic keratosis (OR 0.44 per SD RPS [0.24, 0.83];
= 0.01), basal cell carcinoma of the skin (OR 0.59 per SD RPS [0.38, 0.92];
= 0.02), squamous cell carcinoma of the skin (OR 0.38 per SD RPS [0.20, 0.73];
= 0.003), non-melanoma skin cancer (OR 0.40 per SD RPS [0.22, 0.73];
= 0.03) and malignant melanoma of the skin (OR 0.60 per SD RPS [0.38, 0.94];
= 0.003). These findings were supported by weighted median, weighted mode, and MR-Egger sensitivity analyses (Supplementary Table
10
, Supplementary Fig.
12
). Despite the presence of global heterogeneity for the MR instrument (Cochran’s Q statistic
< 0.001), the MR-Egger intercept test did not indicate average directional pleiotropy (
> 0.05) (Supplementary Table
11
). There was no evidence for a causal relationship with either COPD or migraine (Supplementary Table
10
, Supplementary Figs.
13
14
).
Discussion
We introduce a metric, the RPS, which quantifies the background pigmentation of the retina from colour fundus photographs along a continuous scale and is strongly associated with genetic variants linked to human skin, eye, and hair phenotypes with replication in an additional cohort. We studied datasets derived from people living in the UK, including more than 3000 people who describe their own ethnicity as not white, and from people with diabetes living in Tanzania (with a single self-reported ethnicity), people living in Australia who describe their ethnicity as indigenous Australian or Aboriginal and Torres Strait Islander, and more than 1,400 people living in China. The RPS captures the biological variability of retinal colour without recourse to the distinct, social and political constructs of race and ethnicity. In our study, there is a significant overlap in the distribution of RPS among a variety of ethnic groups and a wide range of RPS within each ethnicity (Fig.
). Many self-described ethnicities have RPS that fall within each quintile of the RPS distribution, a feature that makes it impossible to determine ethnicity from RPS alone.
Pigmentation is found everywhere in the body and is often studied in relation to skin tone. In the field of dermatology, many scales describe the degree of pigmentation in the skin. The Fitzpatrick classification of skin types (FST), developed in 1975, categorises skin colour into 6 types based on the skin’s reaction to UV radiation exposure, from type I (fair skin, always burns, never tans) to type VI (deeply pigmented, never burns)
31
32
. This scale has been adopted in the field of computer science to describe the diversity in imaging datasets, and expose underlying biases within AI algorithms. Studies have found that facial recognition AI performs worse on individuals with darker skin colour measured by the FST scale
14
and object detection software is worse at detecting darker skinned pedestrians by FST scale from images of street traffic
13
. This led to the Google Ethical AI team recommending that all computer vision models report their performance across a range of FST scales
33
. Recent work to widen the range of skin tones has also been adopted as an industry standard
34
There is some evidence that retinal colour affects model performance. A deep learning model trained to predict age-related macular degeneration (AMD) found that patients with the minor allele at the rs12913832/
HERC2
locus, associated with retinal pigmentation in our study, were more likely to have false positives for age-related macular degeneration
35
. Another study postulated that drusen, a pathological feature of AMD, may be more noticeable against a darker fundus background
36
. Hirsch et al. showed saturation values from retinal oximetry vary according to retinal pigmentation
37
38
. We found associations between the RPS and multiple genetic loci previously associated with skin, hair and iris colour, providing strong biological evidence that the RPS does indeed reflect the degree of retinal pigmentation. Of the 20 genome-wide significant loci identified by conditional analysis in the discovery GWAS analysis, 17 had pre-existing evidence for being associated with hair, skin or iris pigmentation, including 3 that are known to be associated with oculocutaneous albinism (
TYR
OCA2
and
TYRP1
39
. Furthermore, despite differences in study populations and cameras, we observed robust replication for these loci in the EPIC-Norfolk cohort and a strong correlation between beta coefficients in the two cohorts. This suggests that despite a range of input data characteristics, the RPS is still estimating retinal pigmentation. Post-GWAS analyses for a set of 100 prioritised causal genes demonstrated enrichment for various melanin and pigmentation pathways, as well as enrichment for pigmentation-related traits in the GWAS Catalog
28
The two most significantly associated loci in the discovery GWAS analysis were at
HERC2
(rs12913832), and
SLC45A2
(rs16891982). These also reached genome-wide significance in the replication analysis. The former is known to influence melanin production via effects on OCA2 expression, and iris colour
40
41
while rs16891982 is a missense mutation in the
SLC45A2
gene
42
. rs12913832 modulates human pigmentation by affecting chromatin-loop formation between a long-range enhancer and the
OCA2
promoter, leading to decreased expression of OCA2 and lighter pigmentation
40
. This variant is strongly associated with brown iris colour in European populations
41
. The
SLC45A2
gene encodes a membrane protein involved in the transport of solutes including tyrosine (a precursor to melanin synthesis), which is implicated in the regulation of skin, hair and iris colour
43
44
45
. rs16891982 encodes a missense mutation in
SCL45A2
, and has been associated with skin pigmentation as well as a strong association with risk for cutaneous malignant melanoma
46
Interestingly, the lead variants at
PDE3A
SIK1
and
IFT122
have not been previously associated with skin, hair or iris pigmentation, thus these new variants may be specifically related to retinal pigmentation.
PDE3A
has been previously associated with arteriolar tortuosity
47
SIK1
with the regulation of circadian rhythms
48
, and in vitro work has implicated alternative splicing for
IFT122
to play a role in PRPF31 retinitis pigmentosa pathogenesis
49
The PheWAS analysis suggested that some diseases associated with skin pigmentation were also associated with retinal pigmentation. Both actinic keratosis and cutaneous malignancy were inversely associated with increased RPS at phenome-wide significance. Malignant melanoma was also inversely associated with increased RPS, albeit only nominally, possibly due to the limited number of participants (n = 643) with malignant melanoma. MR analyses furthermore provided evidence that genetic predisposition to increased retinal pigmentation is causally protective for skin malignancies, including malignant melanoma. Migraine and COPD also showed phenome-wide significant associations with RPS. The observational association with migraine is consistent with a recent study showing an increased risk of migraine with lightly pigmented skin
50
. Similarly, the effect direction of the association of RPS with COPD and allergic rhinitis is concordant with previous literature postulating skin pigmentation and vitamin D relationships
51
52
53
. However, further investigation with MR did not yield any evidence for causality between retinal pigmentation with either migraine or COPD.
The RPS may be utilised for several indications in the future. Reporting the RPS range for any AI training datasets could offer an immediate description of the phenotypic diversity within the datasets. Training or test sets could incorporate the RPS in the descriptive summaries along with other important biological and demographic covariates. The RPS could be used as a standard metric in evaluating differences in AI algorithm performance across a diverse set of input images, similar to how FST has been used in dermatology. This would help ensure that real-world biological variability in health and disease remains included in the generation of new algorithms. This has implications not just for ophthalmic diseases, but also for other diseases using retinal imaging biomarkers, such as Alzheimer’s disease
54
55
and cardiovascular disease
56
57
. With the genetic associations demonstrated in this study, the RPS may be used to study retinal pigmentation itself in relation to normal eye and brain development, and a myriad of ophthalmic and systemic diseases.
There are several limitations in this study. Both the UK Biobank and EPIC-Norfolk participants are predominantly self-reported white and European. The UK Biobank has a higher proportion of non-white participants compared to EPIC-Norfolk cohort, but this number comprises only 7.5% in the UK Biobank. Fortunately, this number still equates to over 3000 people reporting non-white ethnicity. We also showed that the RPS generated separately in an ethnically distinct Tanzanian, Australian, and Chinese dataset is feasible when fit to the UK Biobank RPS. It is our hope that the open-source availability of the RPS method will allow this technique to be applied to other datasets to address this limitation. Secondly, RPS is currently dataset-specific, so that absolute RPS values from different cohorts cannot be directly compared if they are not fit to the same RPS scale. In this work, we elected to fit the Chinese, Australian and Tanzanian datasets to the UK Biobank RPS scale to aid in comparison. This may be resolved with standardisation of the metric between camera types, using device-specific raw RGB values, and is the subject of future work. Thirdly, large datasets that link genetic data with retinal images are only available in limited geographic areas in the world. Finally, the performance of the RPS in retinal disease states will need to be assessed in future work.
In conclusion, the RPS is a continuous metric of retinal pigmentation directly derived from a non-invasive retinal image with associations with genes implicated in hair, eye and skin colour. Although race and ethnicity are believed to determine the biological phenotype of retinal pigmentation, it is likely that the RPS is a more precise measure of pigmentation. This may have implications for AI algorithm development, testing, and for inclusion and algorithmic fairness across all fields of medicine that use retinal imaging as a biomarker.
Methods
Ethics
We analysed data from UK Biobank participants who as part of their examinations underwent enhanced ophthalmic review. Ethics approval was obtained by the Northwest Multi-centre Research Ethics Committee (REC reference number 06/MRE08/65; approved project number 28541), our research adhered to the tenets of the Declaration of Helsinki. Informed consent was obtained from all study participants and all participants were free to withdraw from the study at any time
58
The EPIC-Norfolk Eye Study was carried out following the principles of the Declaration of Helsinki and the Research Governance Framework for Health and Social Care and was approved by the Norfolk Local Research Ethics Committee (identifier: 05/Q0101/191) and the East Norfolk and Waveney National Health Service Research Governance Committee (identifier: 2005EC07L). All participants gave written informed consent. The study protocol is available online at
The Tanzanian imaging data was collected following review and approval by the Tanzanian National Institute for Medical Research (Reference id: NIMR/HQ/R.8a/Vol.IX/2402), the Kilimanjaro Christian Medical Centre (Reference number: 776), and the London School of Hygiene & Tropical Medicine Ethics Committees (Reference number: 10172).
The Australian dataset was approved by the Western Australian Aboriginal Health Ethics Committee (Reference number: 864).
The Chinese dataset was collected as part of a private dataset and ethical collection was enforced by the original dataset creators. The authors state the publishing of the dataset follows the ethical and privacy rules of China
27
Study population
The UK Biobank is a national research resource aiming to improve prevention, diagnosis, and treatment of a wide range of diseases. More than 500,000 people aged 37–73 were recruited at 22 study assessment centres across the UK between January 2006 and October 2010. Further details of the overall study protocol and protocols for individual tests are available online (
).
All participants completed a detailed touchscreen questionnaire on demographic, clinical and lifestyle-related information. Sex was acquired from a central NHS registry at recruitment, but in some cases updated by the participant, hence this field contained a mixture of the sex the NHS had recorded for the participant and self-reported sex. Analysis of sex on RPS was performed in supplemental figures. The choices for ethnic background were categorised as white, mixed, Asian or Asian British, Black or Black British, Chinese, or other ethnic group. Participant postcode at the time of recruitment was used to determine Townsend Deprivation Index (TDI), based on the corresponding output area from the preceding national census; a higher positive score implies a greater degree of deprivation. Medical history was obtained through verbal interview with a trained nurse, including the date of first diagnosis for non-cancer and cancer illnesses, as well as any major operations. Participants gave broad consent for prospective data linkage to national electronic health records (EHR) and registries, including hospital episode statistics, death register and cancer register. Linkage to primary care records is currently available for approximately 45% of the cohort (~230,000 participants, up to 2016 or 2017 depending on data supplier). Further details of the overall study protocol and protocols for individual tests are available online (
).
The EPIC-Norfolk Eye Study is a study of 8623 participants from Norfolk, England and was added onto the EPIC Cohort
29
. The EPIC study is a collaborative study involving 10 countries that began participant recruitment in 1989
59
. The EPIC-Norfolk, a United Kingdom branch of this study, comprises a population-based cohort of 25 639 participants between 40 and 79 years of age at enrolment recruited from 35 participating general practices in Norfolk, United Kingdom. Baseline examinations were carried out between 1993 and 1997
60
The Tanzanian dataset consisted of images acquired from people attending a diabetic eye screening service in the Kilimanjaro region of northern Tanzania between June 2017 and August 2018. All participants were African and had a diagnosis of diabetes mellitus. A total of 2076 retinal photographs of 1345 eyes from 690 people with diabetes comprise the dataset. Only images that were graded as having no, or mild, retinopathy were included in the analyses.
The Australian dataset consisted of images acquired from a single Aboriginal Community Controlled Health Service located within a metropolitan area of Perth, Western Australia. Participants were Aboriginal people with diabetes mellitus attending a retinal screening service. Retinal photographs of 1682 eyes of 864 people were acquired consecutively between July 2013 and October 2020. Only images that were graded as having no retinopathy or mild retinopathy were included in this study.
The Chinese dataset is the publicly available ODIR dataset. In brief, it is a labelled collection of manually curated colour fundus photos with a wide range of disease and pathology. There are 10,000 images from 5000 individuals from 487 clinical hospitals in 26 provinces across China
27
. From this dataset, we included only the 3098 normal fundus images.
Ophthalmic assessment
In the UK Biobank, more than 133,000 participants underwent an enhanced ophthalmic assessment between 2009 and 2010 at 6 assessment centres, including visual acuity (with the LogMAR scale), refractive error and intraocular pressure (IOP) measurements, as well as ophthalmic imaging
61
. Baseline best corrected VA was measured using a computerised semi-automated system at 3 metres distance. Autorefraction was performed using an RC5000 Auto refractor keratometer (Tomey, Nagoya, Japan). The spherical equivalent was calculated by adding the sum of the spherical power and half of the cylindrical power. Single-field undilated colour fundus photographs (45° field of view, centred to include both optic disc and macula) and macular optical coherence tomography (OCT) scans were captured using a digital Topcon-1000 integrated ophthalmic camera (Topcon 3D OCT-1000 Mark II, Topcon Corp., Tokyo, Japan). Imaging was performed after visual acuity, non-cycloplegic autorefraction, and intraocular pressure (IOP) measurement. The right eye was imaged first.
An ophthalmic examination for the EPIC-Norfolk participants was performed on 8623 of participants as part of the third health examination, carried out between 2004 and 2011. Fundus photography was acquired using a TRC-NW6S non-mydriatic retinal camera with a 10 megapixel Nikon D80 camera (Nikon corporation, Tokyo, Japan) via the IMAGEnet Telemedicine System (Topcon Corporation, Tokyo, Japan)
29
The Tanzanian population underwent colour fundus photography using a Topcon TRC-NW8 non-mydriatic retinal camera with a 12.3 megapixel Nikon D90 camera (Nikon corporation, Tokyo, Japan). All images were acquired after dilation with 1% tropicamide and followed a standardised protocol whereby two 45° images (disc centred and macular centred) were collected per eye.
The Indigenous Australian cohort underwent colour fundus photography and OCT imaging acquired using a Topcon ophthalmic camera (Topcon 3D-OCT1 Maestro, Topcon Corp., Tokyo, Japan). Images were undilated, 45°, single-field, macular-centred photographs.
The Chinese dataset was acquired with multiple different cameras with varying fields-of-view. The information about camera type and field-of-view was not publically available in the data labels.
Retinal pigment score
Each fundus image was run through the AutoMorph pipeline
23
to create a segmentation mask of the retinal vasculature and optic disc. The deep learning models used for each step of the segmentation pipeline are described in detail in their manuscript. In summary, each image is pre-processed and then passed through an image quality classifier that was pre-trained on the EyePACs dataset. All images of insufficient quality are excluded from the subsequent segmentation steps. The binary vessel segmentation module is an ensembled UNet architecture and the optic disc segmentation is a cascaded UNet with a ResNet backbone. We modified the AutoMorph code by changing the file system organisation between input and output nodes, adding in error handling, and reducing output file size.
We took the segmentation masks for the disc segmentation and the binary vessel segmentations from the AutoMorph modules and added them together to make a combined disc and vessel mask. The background of the fundus image was identified by finding all pixels that were at or below the 0.5 percentile of the distribution of all grayscale pixels from the input image. The background mask was added to the combined vessel and disc mask. This mask was then successively dilated using a 2-dimensional binary structuring kernel with connectivity of two. The number of dilation iterations was four multiplied by the pixel width of the image divided by 600 rounded to the nearest integer, which was derived empirically. All pixels not contained in dilated masks of the background/disc/segmentation mask were used to create a new retinal background mask.
From the retinal background mask, we found the median RGB pixel value and converted it into the CIELAB colour space
24
which is a colour space designed to have luminance (L vector) stored in a separate vector from chromaticity (a,b vectors). To reduce the effect of illumination on the measured colour from the retina, we only used the a,b coordinates from the CIELAB space and ignored the L vector. To transform the two-dimensional a,b chromaticity vectors for each eye, we used a principal component analysis model to perform dimensionality reduction. For each dataset, a two-component PCA model was fitted to the median a,b value of the retinal background for all images in the dataset. Then, each eye’s median a,b value was transformed with the PCA model along the eigenvector with the greatest eigenvalue. This new transformed vector was stored as the 1-dimensional RPS vector. Figure
represents a schematic of the pipeline. Sensitivity analyses were conducted to test for RPS performance across different background colour image pigmentation by calculating the vessel and optic disc mask area from the total pixel image area by ethnicity and by RPS quintiles. The effect of increasing or decreasing the vessel and disc segmentation area on the RPS was examined with a random selection of either a binary erosion or binary dilation for 1 iteration with a 3×3 kernel to the combined vessel and disc masks for a random selection of 100 fundus images from each of the white, Black, Asian, Chinese and Mixed ethnic groups of the UK Biobank.
The image analysis to derive the Retinal Pigment Score (RPS) was performed with Python, version 3.8
62
and PyTorch, version 1.7.0
63
. The code to derive RPS is publicly available at:
RPS reliability was assessed in an independent dataset of 27 eyes from 24 patients (22 healthy patients and 5 patients with mild non-proliferative diabetic retinopathy). The images were captured on Topcon 3D OCT-1 Maestro. One-way intraclass correlation (ICC) was calculated between 2 repeat images from the same eye.
We assessed the mean difference, standard deviation and ICC between the RPS of right and left eyes from the same patients within the UK Biobank dataset. Additionally, we assessed the mean, standard deviation and ICC of the RPS when the L, a and b vectors were used to fit the principal component analysis (PCA).
Genome-wide association study
Genome-wide association study (GWAS) was performed to assess potential genetic associations with standardised mean RPS (average score between right and left eyes per participant). A beta coefficient of 1 therefore corresponds to a 1 standard deviation increase in standardised mean RPS. Analyses were conducted using a generalised linear mixed model, adjusting for age, sex and the first ten principal components. The initial discovery GWAS analysis was performed in the UK Biobank cohort. Lead variants reaching genome-wide significance (
< 5 × 10
−8
) were re-evaluated in a replication GWAS analysis, conducted in the EPIC-Norfolk cohort. A Bonferroni-adjusted replication significance threshold was set at
= 0.05/17.
Lead variants were furthermore investigated for previously identified associations with hair, skin and eye colour, by manually referring to the Open Targets Genetics
64
65
and PhenoScanner
66
67
databases and the results are listed in Table
Full details for genotyping and imputation in the UK Biobank cohort have been described previously
68
. In brief, genotype calling was performed using two arrays: the UK BiLEVE Axiom array (~50,000 participants) and the UK Biobank Axiom array (~450,000 participants). Marker positions are in GRCh37 coordinates. There were 805,426 markers available in the released data after quality control. Genotype imputation was then performed using a combined Haplotype Reference Consortium and UK10K reference panel, expanding the number of testable variants to ~96 million. The majority of the EPIC-Norfolk cohort were also genotyped using the Affymetrix UK Biobank Axiom Array, however, genotyping for a small subset was undertaken using the Affymetrix GeneChip Human Mapping 500 K Array Set.
The following exclusions were applied for sample quality control: individuals with relatedness corresponding to third-degree relatives or closer, excess of missing genotypes or more heterozygosity than expected. The GWAS analysis was furthermore restricted to individuals of European ethnicity only. The following exclusions were applied for variant-level quality control: call rate <95%, Hardy–Weinberg equilibrium
< 1 × 10
−6
, posterior call probability <0.9, INFO score <0.9 and minor allele frequency <0.01.
The SNP2GENE function within FUMA
69
was used to perform positional and expression quantitative trait locus (eQTL) gene mapping/prioritisation. SNPs in high LD (
> 0.6) with any independent lead variant were positionally mapped to genes located within 10 kb. Variants were also mapped to a set of prioritised genes within 1 Mb if associated with the expression of those genes in retina (reported in EyeGEx), skin (reported in TwinsUK, GTEx v8) and dermal fibroblasts (reported in GTEx v8). To test the prioritised genes for enrichment in biological pathways, FUMA’s GENE2FUNC function was applied, using hypergeometric mean pathway analysis against gene sets obtained from MsigDB and WikiPathways. Multiple testing correction (Benjamini-Hochberg method) per data source of tested gene sets was performed and gene sets with adjusted
≤ 0.05 and >1 overlapping genes were reported by default.
Phenome-wide association study
A phenome-wide association (PheWAS) analysis was conducted within the discovery UK Biobank GWAS sub cohort using 308 CALIBER codelists drawing on the following diagnostic records: verbal interview responses, linked hospital episode statistics, death register and primary care records. Read 2, ICD-10 and OPCS-4 clinical codelists were minimally adapted from the CALIBER Portal
70
. The former two coding systems were expanded to Read 3 and ICD-9 equivalents, respectively, using the mapping files provided by UK Biobank Resource
592
The PheWAS analysis performed logistic regression to assess potential disease associations with standardised mean RPS (average score between right and left eyes per participant), adjusting for age at baseline assessment and self-reported sex. All available diagnostic records, both before and after the date of attendance for retinal imaging, were included. Conditions with fewer than 200 cases were excluded. Associations meeting the Bonferroni-corrected
-value threshold (
= 0.05/308) were considered phenome-wide significant.
Mendelian randomisation
Mendelian randomisation (MR) is a technique for evaluating causality between exposure and outcome variables by utilising genetic variants as instrumental variables (IV)
71
. In comparison to traditional epidemiological methods, MR is relatively immune to bias from confounding and reverse causation. The random allocation of genetic variants at conception is analogous to the random allocation of an intervention in a randomised controlled trial. For the results from an MR analysis to be valid, the IV must satisfy three critical assumptions: (1) the IV must be associated with the exposure; (2) the IV must not be associated with confounders of the exposure-outcome association; (3) the IV must only affect the outcome via the exposure and not through alternative pathways. Provided these criteria are met, the estimates from a MR analysis reflect the causal association between a genetically determined risk factor (here, a genetic predisposition to increased retinal pigmentation) and the development of a particular outcome over the course of a lifetime.
Two-sample MR analyses were conducted drawing on the discovery RPS GWAS results (exposure) and summary statistics from the independent FinnGen cohort
72
(data release 8) for selected outcomes of interest, as highlighted by the RPS PheWAS analysis. The IV for retinal pigmentation was constructed by selecting all conditionally independent genome-wide significant single nucleotide polymorphisms (SNPs) from the discovery RPS GWAS analysis, and performing clumping to exclude those with linkage disequilibrium
> 0.01 and within 10,000 kb, using the 1000 Genomes Project European reference population
73
. Palindromic SNPs with minor allele frequency >0.46 were excluded. Effect alleles were harmonised across exposure and outcome datasets.
The main MR analyses were performed using a multiplicative random-effects inverse-variance weighted (IVW) approach
74
. This method provides precise and efficient estimates but is sensitive to invalid IVs and pleiotropy
75
. We therefore conducted sensitivity analyses using three alternative MR methods: weighted median
76
, weighted mode
77
and MR-Egger
78
. Each method makes different assumptions about the nature of pleiotropy and consistent estimates across methods strengthens causal inferences
79
Under the IVW method, we calculated the mean F-statistic as an indicator of instrument strength (a value > 10 is usually considered a strong instrument)
80
. We assessed for heterogeneity with the
and Cochran’s
statistics in the IVW model and with Rucker’s
Q’
statistic in MR-Egger regression. The
GX
statistic is an indicator of expected relative bias (or dilution) of the MR-Egger causal estimate
81
. In MR-Egger regression, a significant difference of the intercept from zero is evidence for average directional horizontal pleiotropy
78
Statistical analysis
All measurements were repeated from the same sample unless otherwise reported. Linear regression models with standardised RPS (z-score) adjusting for age, sex, self-reported ethnicity (categorised as white, Black, Asian, mixed, Chinese, or other), hair colour (categorised as blonde, red, light brown, dark brown, black and other), skin colour (categorised as very fair, fair, light olive, dark olive, brown and black), spherical equivalent, height, TID (scores categorised in quintiles where a higher quintile implies a greater degree of deprivation), and UK Biobank assessment centre were used to examine associations with RPS. Collinearity was examined using variance inflation factor testing on the final model with adjusted generalised standard error inflation factors
82
. Missing data points were categorised as “Missing” within each variable. Formal two-way tests for interaction were examined for ethnicity and height, and ethnicity and age. Sensitivity analyses were conducted with three stratified linear regression models for the three main ethnic groups (white, Black, and Asian ethnic groups).
The GWAS analysis was performed using REGENIE software
83
. GCTA-COJO was used to prioritise lead variants
84
. Genomic inflation factor and heritability estimates were calculated using the LDSC tool
85
and pre-calculated LD scores for European ancestry (
). All other statistical analyses and were performed in R (R for GNU macOS, Version 4.2.0, The R Foundation for Statistical Computing, Vienna, Austria)
86
. R packages used included PheWAS
87
, TwoSampleMR
88
, MendelianRandomization
89
, ukbwranglr
90
, codemapper
91
, targets
92
, tarchetypes
93
, tidyverse
94
, workflowr
95
, flextable
96
, gtsummary
97
and knitr
98
Inclusion and ethics statement
All collaborators of this study have fulfilled the criteria for authorship required by Nature Portfolio journals have been included as authors. Roles and responsibilities were agreed among collaborators ahead of the research. This research was not severely restricted or prohibited in the setting of the researchers, and does not result in stigmatisation, incrimination, discrimination or personal risk to participants. Local and regional research relevant to our study was considered in citations.
Reporting summary
Further information on research design is available in the
Nature Portfolio Reporting Summary
linked to this article.
Data availability
Access to the UK Biobank is restricted to safeguard the privacy of the participants and requires an application. The restrictions depend on the level of access granted. One can apply for access on their website. You cannot share the UK Biobank data with researchers who are not registered with the UK Biobank. Registrations are reviewed within 10 working days of submission. The length of access depends on the access granted. Access to the EPIC-Norfolk Eye study is restricted and requires an application because of the desire to safeguard the privacy of participants. One can request access via the EPIC-Norfolk Management Committee. The data is available to researchers with relevant scientific and ethics approvals for their research, including those in other countries and in commercial companies who are looking for new treatments or laboratory tests. Applications are generally reviewed within 1 month. The Tanzanian fundus photo dataset was transferred to LSHTM under a formal data transfer agreement with the National Institute for Medical Research in Tanzania. This agreement stipulates that the dataset be used for teaching or academic research purposes only. Requests for access to the dataset can be made to Charles Cleland (charles.cleland@lshtm.ac.uk) and replies will be within ten working days. If access to the dataset is granted it will be for a period of six weeks. The Australian dataset is available under restricted access in order to safeguard participant privacy. This dataset, also known as the Derbarl Yerrigan Health Service data, is a First Nations of Western Australia diabetic screening dataset. This dataset is subject to ethical approval for use by the Ethics Committee operated by the Aboriginal Health Council of Western Australia (AHCWA). A written request will be considered and responses should be returned in less than one month. Access to the data is subject to further ethics applications to AHCWA and the duration of access will depend on the applications. Please reach out to angus.turner@uwa.edu.au for inquiries. The Chinese dataset is a subset of the publically available ODIR dataset and can be accessed as described in the corresponding manuscript.(Li et al. 2021). The raw data used in this study are protected and are not available due to data privacy laws. The data generated for figures presented in this study are provided in the Source Data file. Retinal pigment scores for UK Biobank participants will be made available to approved UK Biobank researchers as a returned dataset (
). FinnGen genome-wide association study (GWAS) summary statistics are publicly available online (
). Summary statistics from the GWAS analyses presented in this study will be made publicly available from the NHGRI-EBI Catalog of human genome-wide association studies (
).
Source data
are provided with this paper.
Code availability
The code to derive RPS is publicly available at
References
Flaxman, S. R. et al. Global causes of blindness and distance vision impairment 1990-2020: a systematic review and meta-analysis.
Lancet Glob. Health
, e1221–e1234 (2017).
Article
PubMed
MATH
Google Scholar
Wong, W. L. et al. Global prevalence of age-related macular degeneration and disease burden projection for 2020 and 2040: a systematic review and meta-analysis.
Lancet Glob. Health
, e106–e116 (2014).
Article
PubMed
MATH
Google Scholar
Teo, Z. L. et al. Global prevalence of diabetic retinopathy and projection of burden through 2045: Systematic review and meta-analysis.
Ophthalmology
128
, 1580–1591 (2021).
Article
PubMed
MATH
Google Scholar
Lee, A. Y. et al. Multicenter, head-to-head, real-world validation study of seven automated artificial intelligence diabetic retinopathy screening systems.
Diabetes Care
44
, 1168–1175 (2021).
Article
PubMed
PubMed Central
MATH
Google Scholar
Tufail, A. et al. Automated diabetic retinopathy image assessment software: diagnostic accuracy and cost-effectiveness compared with human graders.
Ophthalmology
124
, 343–351 (2017).
Article
PubMed
MATH
Google Scholar
Gunasekeran, D. V. et al. Acceptance and Perception of Artificial Intelligence Usability in Eye Care (APPRAISE) for ophthalmologists: a multinational perspective.
Front. Med.
, 875242 (2022).
Article
Google Scholar
Liu, H. et al. Development and validation of a deep learning system to detect glaucomatous optic neuropathy using fundus photographs.
JAMA Ophthalmol.
137
, 1353–1360 (2019).
Article
PubMed
PubMed Central
MATH
Google Scholar
Keane, P. A. & Topol, E. J. With an eye to AI and autonomous diagnosis.
NPJ Digit Med.
, 40 (2018).
Article
PubMed
PubMed Central
MATH
Google Scholar
Ting, D. S. W. et al. Deep learning in ophthalmology: the technical and clinical considerations.
Prog. Retin. Eye Res.
72
, 100759 (2019).
Article
PubMed
MATH
Google Scholar
Khan, S. M. et al. A global review of publicly available datasets for ophthalmological imaging: barriers to access, usability, and generalisability.
Lancet Digit Health
, e51–e66 (2021).
Article
CAS
PubMed
MATH
Google Scholar
Bellemo, V. et al. Artificial intelligence using deep learning to screen for referable and vision-threatening diabetic retinopathy in Africa: a clinical validation study.
Lancet Digit Health
, e35–e44 (2019).
Article
PubMed
MATH
Google Scholar
Zhelev, Z. et al. Test accuracy of artificial intelligence-based grading of fundus images in diabetic retinopathy screening: a systematic review.
J. Med. Screen
30
, 97–112 (2023).
Wilson, B., Hoffman, J. & Morgenstern, J. Predictive inequity in object detection. Preprint at
(2019).
Buolamwini, J. & Gebru, T. Gender shades: intersectional accuracy disparities in commercial gender classification.
81
, 77–91 (2018).
Gong, S., Liu, X. & Jain, A. K. Jointly de-biasing face recognition and demographic attribute estimation. Preprint at
(2019).
Flanagin, A., Frey, T., Christiansen, S. L. & Manual of Style Committee, A. M. A. Updated guidance on the reporting of race and ethnicity in medical and science journals.
JAMA
326
, 621–627 (2021).
Article
PubMed
Google Scholar
Mackey, D. A. What colour are your eyes? Teaching the genetics of eye colour & colour vision. Edridge Green Lecture RCOphth Annual Congress Glasgow May 2019.
Eye
36
, 704–715 (2022).
Article
PubMed
Google Scholar
Currant, H. et al. Genetic variation affects morphological retinal phenotypes extracted from UK Biobank optical coherence tomography images.
PLoS Genet.
17
, e1009497 (2021).
Article
CAS
PubMed
PubMed Central
Google Scholar
Simcoe, M. et al. Genome-wide association study in almost 195,000 individuals identifies 50 previously unidentified genetic loci for eye color.
Sci. Adv.
(2021).
Larsson, M. et al. GWAS findings for human iris patterns: associations with variants in genes that influence normal neuronal pattern development.
Am. J. Hum. Genet.
89
, 334–343 (2011).
Article
CAS
PubMed
PubMed Central
MATH
Google Scholar
Mackey, D. A., Wilkinson, C. H., Kearns, L. S. & Hewitt, A. W. Classification of iris colour: review and refinement of a classification schema.
Clin. Exp. Ophthalmol.
39
, 462–471 (2011).
Article
PubMed
MATH
Google Scholar
Lona-Durazo, F. et al. Investigating the genetic architecture of eye colour in a Canadian cohort.
iScience
25
, 104485 (2022).
Article
ADS
CAS
PubMed
PubMed Central
MATH
Google Scholar
Zhou, Y. et al. AutoMorph: automated retinal vascular morphology quantification via a deep learning pipeline.
Transl. Vis. Sci. Technol.
11
, 12 (2022).
Article
PubMed
PubMed Central
MATH
Google Scholar
Robertson, A. R. The CIE 1976 color-difference formulae.
Color Res. Appl.
, 7–11 (1977).
Article
ADS
MATH
Google Scholar
Warwick, A. N. et al. UK Biobank retinal imaging grading: methodology, baseline characteristics and findings for common ocular diseases.
Eye
37
, 2109–2116 (2022).
Lee, R. Y. et al. Ethnic variation in optic disc size by fundus photography.
Curr. Eye Res.
38
, 1142–1147 (2013).
Article
PubMed
MATH
Google Scholar
Li, N., Li, T., Hu, C., Wang, K. & Kang, H. A benchmark of ocular disease intelligent recognition: one shot for multi-disease detection. Preprint at
(2021).
Sollis, E. et al. The NHGRI-EBI GWAS Catalog: knowledgebase and deposition resource.
Nucleic Acids Res.
51
, D977–D985 (2023).
Article
CAS
PubMed
MATH
Google Scholar
Khawaja, A. P. et al. The EPIC-Norfolk Eye Study: rationale, methods and a cross-sectional analysis of visual impairment in a population-based cohort.
BMJ Open
(2013).
Kurki, M. I. et al. Author Correction: FinnGen provides genetic insights from a well-phenotyped isolated population.
Nature
615
, E19 (2023).
Article
CAS
PubMed
PubMed Central
MATH
Google Scholar
Sachdeva, S. Fitzpatrick skin typing: applications in dermatology.
Indian J. Dermatol. Venereol. Leprol.
75
, 93–96 (2009).
Article
PubMed
MATH
Google Scholar
Fitzpatrick, T. B. Soleil et peau [Sun and skin].
J. Med. Esthet.
33
, 34 (1975).
MATH
Google Scholar
Mitchell, M. et al. Model cards for model reporting.
Proceedings of the Conference on Fairness, Accountability, and Transparency
(2019).
Skin Tone Research @ Google.
Guenther, F. et al. Chances and challenges of machine learning-based disease classification in genetic association studies illustrated on age-related macular degeneration.
Genet. Epidemiol.
44
, 759–777 (2020).
Article
PubMed
MATH
Google Scholar
Age-Related Eye Disease Study Research Group. Risk factors associated with age-related macular degeneration. A case-control study in the age-related eye disease study: Age-Related Eye Disease Study Report Number 3.
Ophthalmology
107
, 2224–2232 (2000).
Google Scholar
Hirsch, K., Cubbidge, R. P. & Heitmar, R. Dual wavelength retinal vessel oximetry—influence of fundus pigmentation.
Eye
37
, 2246–2251 (2022).
Yin, G. et al. Ocular axial length and its associations in Chinese: the Beijing Eye Study.
PLoS ONE
, e43172 (2012).
Article
ADS
CAS
PubMed
PubMed Central
Google Scholar
Arveiler, B., Lasseaux, E. & Morice-Picard, F. Clinique et génétique de l’albinisme.
Presse Med
46
, 648–654 (2017).
Article
PubMed
Google Scholar
Visser, M., Kayser, M. & Palstra, R.-J. HERC2 rs12913832 modulates human pigmentation by attenuating chromatin-loop formation between a long-range enhancer and the OCA2 promoter.
Genome Res.
22
, 446–455 (2012).
Article
CAS
PubMed
PubMed Central
Google Scholar
Eiberg, H. et al. Blue eye color in humans may be caused by a perfectly associated founder mutation in a regulatory element located within the HERC2 gene inhibiting OCA2 expression.
Hum. Genet.
123
, 177–187 (2008).
Article
CAS
PubMed
Google Scholar
Branicki, W., Brudnik, U., Draus-Barini, J., Kupiec, T. & Wojas-Pelc, A. Association of the SLC45A2 gene with physiological human hair colour variation.
J. Hum. Genet.
53
, 966–971 (2008).
Article
CAS
PubMed
Google Scholar
Wiriyasermkul, P., Moriyama, S. & Nagamori, S. Membrane transport proteins in melanosomes: regulation of ions for pigmentation.
Biochim. Biophys. Acta Biomembr.
1862
, 183318 (2020).
Article
CAS
PubMed
PubMed Central
Google Scholar
Sturm, R. A. Molecular genetics of human pigmentation diversity.
Hum. Mol. Genet.
18
, R9–R17 (2009).
Article
CAS
PubMed
MATH
Google Scholar
Karawita, A. C. et al. The swan genome and transcriptome, it is not all black and white.
Genome Biol.
24
, 13 (2023).
Article
PubMed
PubMed Central
Google Scholar
Duffy, D. L. et al. Multiple pigmentation gene polymorphisms account for a substantial proportion of risk of cutaneous malignant melanoma.
J. Invest. Dermatol.
130
, 520–528 (2010).
Article
CAS
PubMed
MATH
Google Scholar
Jiang, X. et al. GWAS on retinal vasculometry phenotypes.
PLoS Genet.
19
, e1010583 (2023).
Article
CAS
PubMed
PubMed Central
Google Scholar
Jagannath, A. et al. The CRTC1-SIK1 pathway regulates entrainment of the circadian clock.
Cell
154
, 1100–1111 (2013).
Article
CAS
PubMed
PubMed Central
MATH
Google Scholar
Buskin, A. et al. Disrupted alternative splicing for genes implicated in splicing and ciliogenesis causes PRPF31 retinitis pigmentosa.
Nat. Commun.
, 4234 (2018).
Article
ADS
PubMed
PubMed Central
MATH
Google Scholar
Kobus, M. et al. Risk of migraine in europeans with low melanin levels—a population based case-control study.
Brain Sci.
12
(2022).
Majewski, S. et al. Skin condition and its relationship to systemic inflammation in chronic obstructive pulmonary disease.
Int. J. Chron. Obstruct. Pulmon. Dis.
12
, 2407–2415 (2017).
Article
CAS
PubMed
PubMed Central
MATH
Google Scholar
Kokturk, N., Baha, A., Oh, Y.-M., Young Ju, J. & Jones, P. W. Vitamin D deficiency: what does it mean for chronic obstructive pulmonary disease (COPD)? a compherensive review for pulmonologists.
Clin. Respir. J.
12
, 382–397 (2018).
Article
CAS
PubMed
Google Scholar
Clemens, T. L., Adams, J. S., Henderson, S. L. & Holick, M. F. Increased skin pigment reduces the capacity of skin to synthesise vitamin D3.
Lancet
, 74–76 (1982).
Article
CAS
PubMed
MATH
Google Scholar
Liao, H., Zhu, Z. & Peng, Y. Potential utility of retinal imaging for Alzheimer’s disease: a review.
Front. Aging Neurosci.
10
, 188 (2018).
Article
PubMed
PubMed Central
MATH
Google Scholar
Cheung, C. Y. et al. Retinal imaging in Alzheimer’s disease.
J. Neurol. Neurosurg. Psychiatry
92
, 983–994 (2021).
Article
PubMed
MATH
Google Scholar
Ho, H. et al. Retinopathy signs improved prediction and reclassification of cardiovascular disease risk in diabetes: a prospective cohort study.
Sci. Rep.
, 41492 (2017).
Article
ADS
CAS
PubMed
PubMed Central
Google Scholar
Wong, T. Y., Knudtson, M. D., Klein, R., Klein, B. E. K. & Hubbard, L. D. A prospective cohort study of retinal arteriolar narrowing and mortality.
Am. J. Epidemiol.
159
, 819–825 (2004).
Article
PubMed
MATH
Google Scholar
Ethics.
Riboli, E. & Kaaks, R. The EPIC Project: rationale and study design. European Prospective Investigation into Cancer and Nutrition.
Int. J. Epidemiol.
26
, S6–S14 (1997).
Article
PubMed
MATH
Google Scholar
Day, N. et al. EPIC-Norfolk: study design and characteristics of the cohort. European Prospective Investigation of Cancer.
Br. J. Cancer
80
, 95–103 (1999).
PubMed
MATH
Google Scholar
Chua, S. Y. L. et al. Cohort profile: design and methods in the eye and vision consortium of UK Biobank.
BMJ Open
, e025077 (2019).
Article
PubMed
PubMed Central
MATH
Google Scholar
Python Software Foundation. Python Language Reference, Version 3.8. Available at
Paszke, A. et al. Pytorch: An imperative style, high-performance deep learning library.
Adv. Neural Inf. Process. Syst
32
(2019).
Mountjoy, E. et al. An open approach to systematically prioritize causal variants and genes at all published human GWAS trait-associated loci.
Nat. Genet.
53
, 1527–1533 (2021).
Article
CAS
PubMed
PubMed Central
MATH
Google Scholar
Ghoussaini, M. et al. Open Targets Genetics: systematic identification of trait-associated genes using large-scale genetics and functional genomics.
Nucleic Acids Res.
49
, D1311–D1320 (2021).
Article
CAS
PubMed
Google Scholar
Staley, J. R. et al. PhenoScanner: a database of human genotype–phenotype associations.
Bioinformatics
32
, 3207–3209 (2016).
Article
CAS
PubMed
PubMed Central
MATH
Google Scholar
Kamat, M. A. et al. PhenoScanner V2: an expanded tool for searching human genotype–phenotype associations.
Bioinformatics
35
, 4851–4853 (2019).
Article
CAS
PubMed
PubMed Central
MATH
Google Scholar
Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data.
Nature
562
, 203–209 (2018). 10.
Article
ADS
CAS
PubMed
PubMed Central
MATH
Google Scholar
Watanabe, K., Taskesen, E., van Bochoven, A. & Posthuma, D. Functional mapping and annotation of genetic associations with FUMA.
Nat. Commun.
, 1826 (2017).
Article
ADS
PubMed
PubMed Central
Google Scholar
Kuan, V. et al. A chronological map of 308 physical and mental health conditions from 4 million individuals in the English National Health Service.
Lancet Digital Health
, e63–e77 (2019). 06.
Article
PubMed
MATH
Google Scholar
Davies, N. M., Holmes, M. V. & Davey Smith, G. Reading Mendelian randomisation studies: a guide, glossary, and checklist for clinicians.
BMJ
362
, k601 (2018).
Article
PubMed
PubMed Central
Google Scholar
Kurki, M. I. et al. FinnGen provides genetic insights from a well-phenotyped isolated population.
Nature
613
, 508–518 (2023).
Article
ADS
CAS
PubMed
PubMed Central
MATH
Google Scholar
1000 Genomes Project Consortium. et al. A global reference for human genetic variation.
Nature
526
, 68–74 (2015).
Article
Google Scholar
Burgess, S., Butterworth, A. & Thompson, S. G. Mendelian randomization analysis with multiple genetic variants using summarized data.
Genet. Epidemiol.
37
, 658–665 (2013).
Article
PubMed
PubMed Central
MATH
Google Scholar
Burgess, S., Bowden, J., Fall, T., Ingelsson, E. & Thompson, S. G. Sensitivity analyses for robust causal inference from mendelian randomization analyses with multiple genetic variants.
Epidemiology
28
, 30–42 (2017).
Article
PubMed
MATH
Google Scholar
Bowden, J., Davey Smith, G., Haycock, P. C. & Burgess, S. Consistent estimation in mendelian randomization with some invalid instruments using a weighted median estimator.
Genet. Epidemiol.
40
, 304–314 (2016).
Article
PubMed
PubMed Central
Google Scholar
Hartwig, F. P., Davey Smith, G. & Bowden, J. Robust inference in summary data Mendelian randomization via the zero modal pleiotropy assumption.
Int. J. Epidemiol.
46
, 1985–1998 (2017).
Article
PubMed
PubMed Central
MATH
Google Scholar
Bowden, J., Davey Smith, G. & Burgess, S. Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression.
Int. J. Epidemiol.
44
, 512–525 (2015).
Article
PubMed
PubMed Central
Google Scholar
Lawlor, D. A., Tilling, K. & Davey Smith, G. Triangulation in aetiological epidemiology.
Int. J. Epidemiol.
45
, 1866–1886 (2016).
PubMed
MATH
Google Scholar
Burgess, S., Thompson, S. G. & CRP CHD Genetics Collaboration. Avoiding bias from weak instruments in Mendelian randomization studies.
Int. J. Epidemiol.
40
, 755–764 (2011).
Article
PubMed
MATH
Google Scholar
Bowden, J. et al. Assessing the suitability of summary data for two-sample Mendelian randomization analyses using MR-Egger regression: the role of the I2 statistic.
Int. J. Epidemiol.
45
, 1961–1974 (2016).
PubMed
PubMed Central
MATH
Google Scholar
Fox, J. & Monette, G. Generalized collinearity diagnostics.
J. Am. Stat. Assoc.
87
, 178–183 (1992).
Article
Google Scholar
Mbatchou, J. et al. Computationally efficient whole-genome regression for quantitative and binary traits.
Nat. Genet.
53
, 1097–1103 (2021). 07.
Article
CAS
PubMed
MATH
Google Scholar
Yang, J. et al. Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits.
Nat. Genet
44
(03 2012).
Bulik-Sullivan, B. K. et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies.
Nat. Genet.
47
, 291–295 (2015). 03.
Article
CAS
PubMed
PubMed Central
Google Scholar
R Core Team. R: A Language and Environment for Statistical Computing.
(2022).
Carroll, R. J., Bastarache, L. & Denny, J. C. R. PheWAS: data analysis and plotting tools for phenome-wide association studies in the R environment.
Bioinformatics
30
, 2375–2376 (2014).
Article
CAS
PubMed
PubMed Central
Google Scholar
Hemani, G. et al. The MR-Base platform supports systematic causal inference across the human phenome.
Elife
(2018).
Yavorska, O. O. & Burgess, S. MendelianRandomization: an R package for performing Mendelian randomization analyses using summarized data.
Int. J. Epidemiol.
46
, 1734–1739 (2017).
Article
PubMed
PubMed Central
MATH
Google Scholar
Warwick, A. Ukbwranglr: Functions to Load and Wrangle UK Biobank Data.
(2022).
Warwick, A. Codemapper: Functions for Mapping between Clinical Coding Systems.
(2022).
Landau, W. M. The targets R package: a dynamic Make-like function-oriented pipeline toolkit for reproducibility and high-performance computing.
J. Open Source Softw.
, 2959 (2021).
Article
ADS
MATH
Google Scholar
Landau, W. M. Tarchetypes: Archetypes for targets.
(Accessed Jan 1st, 2023).
Wickham, H. et al. Welcome to the tidyverse.
J. Open Source Softw.
, 1686 (2019).
Article
ADS
MATH
Google Scholar
Blischak, J. D., Carbonetto, P. & Stephens, M. Creating and sharing reproducible research code the workflowr way [version 1; peer review: 3 approved].
F1000Res
(2019).
Gohel, D. Flextable: Functions for Tabular Reporting
(2022).
Sjoberg, D., Whiting, K., Curry, M., Lavery, J. & Larmarange, J. Reproducible summary tables with the gtsummary package.
R. J.
13
, 570 (2021).
Article
Google Scholar
Xie, Y. Knitr A. General-Purpose Package for Dynamic Report Generation in R.
(2022).
Download references
Acknowledgements
The authors would like to thank the 500,000 UK Biobank participants, the participants and the investigators of the FinnGen study, the EPIC-Norfolk study, the ODIR dataset, the Australian dataset and the Tanzanian dataset; and the funders of this project. We thank Topcon Healthcare, particularly Mary Durbin, Ted Spaide, and Mark Evans, UK & Ireland Service and Operations Manager, and Leigh Maher, IT and Systems, for their invaluable contribution in providing an independent dataset for standardisation techniques. The EPIC-Norfolk study (DOI 10.22025/2019.10.105.00004) has received funding from the Medical Research Council (MR/N003284/1 and MC-UU_12015/1) and Cancer Research UK (C864/A14136). The genetics work in the EPIC-Norfolk study was funded by the Medical Research Council ((MC_PC_13048)). We are grateful to all the participants who have been part of the project and to the many members of the study teams at the University of Cambridge who have enabled this research. The authors would like to acknowledge Professor Nick Wareham for his assistance in access to the EPIC-Norfolk Eye study. C.E. and A.T. received a proportion of their financial support from the UK Department of Health through an award made by the National Institute for Health Research to Moorfields Eye Hospital NHS Foundation Trust and UCL Institute of Ophthalmology for a Biomedical Research Centre for Ophthalmology. C.E. receives a proportion of financial support to Moorfields Eye Hospital NHS Foundation Trust from the Lowy Medical Research Institute. C.S.L. receives grant funding from NIH/NIA R01AG060942, U19AG066567, OT2OD032644, and the Klorfine Family Endowed Chair, and the Karalis Johnson Retina Center. A.Y.L. has received an unrestricted and career development award from Research to Prevent Blindness, Latham Vision Science Awards, NEI/NIH K23EY029246, OT2OD032644, the C. Dan and Irene Hunter Endowed Professorship, and the Karalis Johnson Retina Center. A.O.-B. is supported by the Lowy Medical Research Institute, La Jolla, California. A.P.K. is supported by a UK Research and Innovation Future Leaders Fellowship, an Alcon Research Institute Young Investigator Award and a Lister Institute for Preventive Medicine Award. This research was supported by the NIHR Biomedical Research Centre at Moorfields Eye Hospital and the UCL Institute of Ophthalmology. A.N.W. is supported by the Wellcome Trust (220558/Z/20/Z; 224390/Z/21/Z). P.A.K. is supported by a UK Research & Innovation Future Leaders Fellowship (MR/T019050/1) and The Rubin Foundation Charitable Trust. M.J.B. is supported by a Wellcome Trust Senior Research Fellowship (207472/Z/17/Z), and a Wellcome Trust Collaborative Award (206275/Z/17/Z). K.V.S. is supported by a UCL Overseas Research Scholarship and grants from Fight for Sight (1956A) and the Desmond Foundation. The research was supported by the National Institute for Health and Care Research (NIHR) Biomedical Research Centre based at Moorfields Eye Hospital NHS Foundation Trust, UCL Institute of Ophthalmology, and the NIHR Birmingham Biomedical Research Centre. The views expressed are those of the author(s) and not necessarily those of the NHS, the NIHR or the Department of Health and Social Care.
Author information
Author notes
These authors contributed equally: Anand E. Rajesh, Abraham Olvera-Barrios
Authors and Affiliations
Department of Ophthalmology, University of Washington, Seattle, WA, USA
Anand E. Rajesh, Yue Wu, Cecilia S. Lee & Aaron Y. Lee
The Roger and Angie Karalis Johnson Retina Center, Seattle, WA, USA
Anand E. Rajesh, Yue Wu, Cecilia S. Lee & Aaron Y. Lee
NIHR Biomedical Research Centre, Moorfields Eye Hospital NHS Foundation Trust & University College London Institute of Ophthalmology, London, UK
Abraham Olvera-Barrios, Alasdair N. Warwick, Kelsey V. Stuart, Mahantesh I. Biradar, Anthony P. Khawaja, Robert Luben, Paul J. Foster, Matthew J. Burton, Pearse A. Keane, Mark A. Chia, Adnan Tufail, Catherine Egan, Konstantinos Balaskas, Michelle Chan, Sharon Chua, Alexander Day, Parul Desai, David Ted Garway-Heath, Alison Hardcastle, Phil Luthert, Tony Moore, Praveen Patel, Axel Petzold, Nikolas Pontikos, Sobha Sivaprasad, Nicholas Strouthidis, Zihan Sun, Dhanes Thomas & Ananth Viswanathan
University College London Institute of Cardiovascular Science, London, UK
Alasdair N. Warwick
Guy’s and St Thomas’ NHS Foundation Trust, London, UK
Chuin Ying Ung & Tasanee Braithwaite
MRC Epidemiology Unit, University of Cambridge, Cambridge, UK
Anthony P. Khawaja & Robert Luben
International Centre for Eye Health, Faculty of Infectious and Tropical Diseases, London School of Hygiene & Tropical Medicine, London, UK
Charles R. Cleland & Matthew J. Burton
Eye Department, Kilimanjaro Christian Medical Centre, Moshi, United Republic of Tanzania
Charles R. Cleland, William U. Makupa & Andrew Bastawrous
NIHR Birmingham Biomedical Research Centre, Birmingham, UK
Alastair K. Denniston
PEEK Vision, Berkhamsted, UK
Andrew Bastawrous
Lions Eye Institute, University of Western Australia, Nedlands, WA, Australia
Angus W. Turner
Nuffield Department of Population Health, University of Oxford, Oxford, UK
Naomi Allen, Roxana Carare, Jane Gibson & Thomas Littlejohns
Manchester Royal Eye Hospital, The University of Manchester, Manchester, UK
Tariq Aslam
Bristol Eye Hospital, University of Bristol, Bristol, UK
Denize Atan, Andrew Dick & Cathy Williams
Department of Computer Science and Mathematics, Kingston University, London, UK
Sarah Barman
School of Medicine, University of Leeds, Leeds, UK
Jenny Barrett, Sarah Mackie & Martin McKibbin
Division of Evolution, Infection and Genomics, The University of Manchester, Manchester, UK
Paul Bishop, Graeme Black & Panagiotis Sergouniotis
Centre for Public Health, School of Medicine, Dentistry and Biomedical Sciences, Queen’s University Belfast, Belfast, UK
Usha Chakravarthy, Ruth Hogg, Bernadette McGuinness, Gareth McKay, Euan Paterson, Tunde Peto & Jayne Woodside
Centre for Clinical Brain Sciences, University of Edinburgh, Edinburgh, Scotland
Baljean Dhillon & Tom MacGillivray
Pat Macpherson Centre for Pharmacogenomics and Pharmacogenetics, Division of Population Health & Genomics, School of Medicine, University of Dundee, Dundee, UK
Alexander Doney & Emanuele Trucco
Human Development and Health, Faculty of Medicine, University Hospital Southampton, Southampton, Hampshire, UK
Sarah Ennis
Dementias Platform UK, Oxford, UK
John Gallacher
School of Optometry & Vision Sciences, Cardiff University, Cardiff, UK
Jeremy Guggenheim & James Morgan
Department of Twin Research and Genetic Epidemiology, King’s College London, London, UK
Chris Hammond, Pirro Hysi & Katie Williams
Department of Eye and Vision Science, Institute of Life Course and Medical Sciences, University of Liverpool, Liverpool, UK
Simon Harding, Savita Madhusudhan & Yalin Zheng
Department of Ophthalmology, King’s College Hospital NHS Foundation Trust, London, UK
Gerassimos Lascaratos & Eoin O’Sullivan
Clinical and Experimental Sciences, Faculty of Medicine, University of Southampton, Southampton, UK
Andrew Lotery & Jay Self
University of Exeter College of Medicine & Health, Exeter, UK
Richard Oram & Mike Weedon
Population Health Research Institute, St George’s, University of London, London, UK
Chris Owen & Alicja Rudnicka
Population, Policy and Practice Research and Teaching Department, Great Ormond Street Institute of Child Health, University College London, London, UK
Jugnoo Rahi
School of Cardiovascular and Metabolic Health, University of Glasgow, Glasgow, UK
Naveed Sattar
Biosciences Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, UK
David Steel
Gloucestershire Retinal Research Group, Cheltenham General Hospital, Gloucestershire Hospitals NHS Foundation Trust, Cheltenham, UK
Irene Stratton
Centre for Medical Informatics, Usher Institute of Population Health Sciences and Informatics, University of Edinburgh, Edinburgh, UK
Cathie Sudlow & Veronique Vitart
Research Centre for Intelligent Health Care, Coventry University, Coventry, UK
Robyn Tapp
Norwich Epidemiology Centre, Norwich Medical School, University of East Anglia, Norwich, UK
Max Yates
Authors
Anand E. Rajesh
View author publications
Search author on:
PubMed
Google Scholar
Abraham Olvera-Barrios
View author publications
Search author on:
PubMed
Google Scholar
Alasdair N. Warwick
View author publications
Search author on:
PubMed
Google Scholar
Yue Wu
View author publications
Search author on:
PubMed
Google Scholar
Kelsey V. Stuart
View author publications
Search author on:
PubMed
Google Scholar
Mahantesh I. Biradar
View author publications
Search author on:
PubMed
Google Scholar
Chuin Ying Ung
View author publications
Search author on:
PubMed
Google Scholar
Anthony P. Khawaja
View author publications
Search author on:
PubMed
Google Scholar
Robert Luben
View author publications
Search author on:
PubMed
Google Scholar
Paul J. Foster
View author publications
Search author on:
PubMed
Google Scholar
Charles R. Cleland
View author publications
Search author on:
PubMed
Google Scholar
William U. Makupa
View author publications
Search author on:
PubMed
Google Scholar
Alastair K. Denniston
View author publications
Search author on:
PubMed
Google Scholar
Matthew J. Burton
View author publications
Search author on:
PubMed
Google Scholar
Andrew Bastawrous
View author publications
Search author on:
PubMed
Google Scholar
Pearse A. Keane
View author publications
Search author on:
PubMed
Google Scholar
Mark A. Chia
View author publications
Search author on:
PubMed
Google Scholar
Angus W. Turner
View author publications
Search author on:
PubMed
Google Scholar
Cecilia S. Lee
View author publications
Search author on:
PubMed
Google Scholar
Adnan Tufail
View author publications
Search author on:
PubMed
Google Scholar
Aaron Y. Lee
View author publications
Search author on:
PubMed
Google Scholar
Catherine Egan
View author publications
Search author on:
PubMed
Google Scholar
Consortia
UK Biobank Eye and Vision Consortium
Naomi Allen
, Tariq Aslam
, Denize Atan
, Konstantinos Balaskas
, Sarah Barman
, Jenny Barrett
, Paul Bishop
, Graeme Black
, Tasanee Braithwaite
, Roxana Carare
, Usha Chakravarthy
, Michelle Chan
, Sharon Chua
, Alexander Day
, Parul Desai
, Baljean Dhillon
, Andrew Dick
, Alexander Doney
, Sarah Ennis
, John Gallacher
, David Ted Garway-Heath
, Jane Gibson
, Jeremy Guggenheim
, Chris Hammond
, Alison Hardcastle
, Simon Harding
, Ruth Hogg
, Pirro Hysi
, Gerassimos Lascaratos
, Thomas Littlejohns
, Andrew Lotery
, Phil Luthert
, Tom MacGillivray
, Sarah Mackie
, Savita Madhusudhan
, Bernadette McGuinness
, Gareth McKay
, Martin McKibbin
, Tony Moore
, James Morgan
, Eoin O’Sullivan
, Richard Oram
, Chris Owen
, Praveen Patel
, Euan Paterson
, Tunde Peto
, Axel Petzold
, Nikolas Pontikos
, Jugnoo Rahi
, Alicja Rudnicka
, Naveed Sattar
, Jay Self
, Panagiotis Sergouniotis
, Sobha Sivaprasad
, David Steel
, Irene Stratton
, Nicholas Strouthidis
, Cathie Sudlow
, Zihan Sun
, Robyn Tapp
, Dhanes Thomas
, Emanuele Trucco
, Ananth Viswanathan
, Veronique Vitart
, Mike Weedon
, Katie Williams
, Cathy Williams
, Jayne Woodside
, Max Yates
& Yalin Zheng
Contributions
A.E.R. wrote the code for generating the RPS. A.O.B. performed statistical tests of clinical variables. A.N.W. performed the initial GWAS on the UKBiobank cohort. A.N.W., K.V.S., M.I.B., A.P.K., R.L., P.J.F. performed additional GWAS analysis on the UKBiobank cohort and the EPIC-Norfolk cohort. C.R.C., W.U.M., M.J.B., A.B. provided, evaluated, and maintained the Tanzanian dataset. P.A.K., M.A.C., A.W.T. provided, evaluated and maintained the Australian dataset. A.E.R., A.O.B., C.S.L. A.Y.L., A.T., C.S.L., Y.W. wrote the manuscript along with input from all other coauthors.
Corresponding author
Correspondence to
Catherine Egan
Ethics declarations
Competing interests
A.P.K. has acted as a paid consultant or lecturer to Abbvie, Aerie, Allergan, Google Health, Heidelberg Engineering, Novartis, Reichert, Santen,Thea and Topcon. A.Y.L. reports support from the US Food and Drug Administration, grants from Santen, Carl Zeiss Meditec, and Novartis, personal fees from Genentech, Topcon, and Verana Health, outside of the submitted work; This article does not reflect the opinions of the Food and Drug Administration. A.T. report grants from Bayer and Novartis and personal fees from Abbvie, Allegro, Annexon, Apellis, Bayer, Heidelberg Engineering, Iveric Bio, Kanghong, Novartis, Oxurion, Roche/Genentech, Thea. C.E. reports personal fees from Heidelberg Engineering, Boehringer Ingelheim, and Inozyme pharmaceuticals outside of the submitted work. P.A.K. has acted as a consultant for Retina Consultants of America, Topcon, Roche, Boehringer-Ingleheim, and Bitfount and is an equity owner in Big Picture Medical. He has received speaker fees from Zeiss, Novartis, Gyroscope, Boehringer-Ingleheim, Apellis, Roche, Abbvie, Topcon, and Hakim Group. He has received travel support from Bayer, Topcon, and Roche. He has attended advisory boards for Topcon, Bayer, Boehringer-Ingleheim, RetinAI, and Novartis. P.J.F. has acted as a consultant for Alphasights, GLG, Google Health, Guidepoint, PwC, Santen. A.B. is Founder and CEO of not-for-profit Peek Vision and receives a salary. The remaining authors declare no competing interests.
Peer review
Peer review information
Nature Communications
thanks Kamanasish Bhattacharjee, Brandon Grossardt and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Supplementary Information (download PDF
Reporting Summary (download PDF
Peer Review File (download PDF
Source data
Source Data (download CSV
Rights and permissions
Open Access
This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit
Reprints and permissions
About this article
Cite this article
Rajesh, A.E., Olvera-Barrios, A., Warwick, A.N.
et al.
Machine learning derived retinal pigment score from ophthalmic imaging shows ethnicity is not biology.
Nat Commun
16
, 60 (2025). https://doi.org/10.1038/s41467-024-55198-7
Download citation
Received
29 June 2023
Accepted
05 December 2024
Published
02 January 2025
Version of record
02 January 2025
DOI
Share this article
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative
Download PDF
Advertisement
Advanced search
Quick links
Explore articles by subject
Find a job
Guide to authors
Editorial policies
Sign up for the
Nature Briefing
newsletter — what matters in science, free to your inbox daily.
Get the most important science stories of the day, free in your inbox.
Sign up for Nature Briefing
US