Next Article in Journal
Effects of a Telerehabilitation Program in Women with Fibromyalgia at 6-Month Follow-Up: Secondary Analysis of a Randomized Clinical Trial
Next Article in Special Issue
Epigenetic Mechanisms and Nephrotic Syndrome: A Systematic Review
Previous Article in Journal
Changes in Alprazolam Metabolism by CYP3A43 Mutants
Previous Article in Special Issue
MicroRNA Associated with the Invasive Phenotype in Clear Cell Renal Cell Carcinoma: Let-7c-5p Inhibits Proliferation, Migration, and Invasion by Targeting Insulin-like Growth Factor 1 Receptor
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Genome-Wide Association Study into the Aetiology of Congenital Solitary Functioning Kidney

by
Sander Groen in ’t Woud
1,2,
Carlo Maj
3,
Kirsten Y. Renkema
4,
Rik Westland
5,
Tessel Galesloot
1,
Iris A. L. M. van Rooij
1,
Sita H. Vermeulen
1,
Wout F. J. Feitz
6,
Nel Roeleveld
1,
Michiel F. Schreuder
2,* and
Loes F. M. van der Zanden
1,*,† on behalf of the SOFIA Study Group
1
Radboud Institute for Health Sciences, Department for Health Evidence, Radboud University Medical Center, 6500 HB Nijmegen, The Netherlands
2
Radboud Institute for Molecular Life Sciences, Department of Paediatric Nephrology, Radboudumc Amalia Children’s Hospital, 6500 HB Nijmegen, The Netherlands
3
Centre for Human Genetics, University of Marburg, 35037 Marburg, Germany
4
Department of Genetics, University Medical Center Utrecht, Utrecht University, 3584 CS Utrecht, The Netherlands
5
Department of Pediatric Nephrology, Emma Children’s Hospital, Amsterdam UMC, University of Amsterdam, 1105 AZ Amsterdam, The Netherlands
6
Division of Pediatric Urology, Department of Urology, Radboud Institute for Molecular Life Sciences, Radboudumc Amalia Children’s Hospital, 6500 HB Nijmegen, The Netherlands
*
Authors to whom correspondence should be addressed.
Collaborators/Membership of the SOFIA Study Group is provided in the Acknowledgments.
Biomedicines 2022, 10(12), 3023; https://doi.org/10.3390/biomedicines10123023
Submission received: 5 October 2022 / Revised: 15 November 2022 / Accepted: 18 November 2022 / Published: 23 November 2022
(This article belongs to the Special Issue Genetics and Epigenetics of Kidney Diseases)

Abstract

:
Congenital solitary functioning kidney (CSFK) is a birth defect that occurs in 1:1500 children and predisposes them to kidney injury. Its aetiology is likely multifactorial. In addition to known monogenic causes and environmental risk factors, common genetic variation may contribute to susceptibility to CSFK. We performed a genome-wide association study among 452 patients with CSFK and two control groups of 669 healthy children and 5363 unaffected adults. Variants in two loci reached the genome-wide significance threshold of 5 × 10−8, and variants in 30 loci reached the suggestive significance threshold of 1 × 10−5. Of these, an identified locus with lead single nucleotide variant (SNV) rs140804918 (odds ratio 3.1, p-value = 1.4 × 10−8) on chromosome 7 was most promising due to its close proximity to HGF, a gene known to be involved in kidney development. Based on their known molecular functions, both KCTD20 and STK38 could explain the suggestive significant association with lead SNV rs148413365 on chromosome 6. Our findings need replication in an independent cohort of CSFK patients before they can be established definitively. However, our analysis suggests that common variants play a role in CSFK aetiology. Future research could enhance our understanding of the molecular mechanisms involved.

1. Introduction

Living with a solitary functioning kidney from childhood predisposes to kidney injury later in life, with up to 80% of children with a solitary functioning kidney showing signs of kidney injury at 18 years of age [1]. Moreover, 20–40% of these children may develop end-stage kidney disease by 30 years of age [2]. Even in less severely affected individuals, signs of kidney injury that develop during childhood can result in a higher risk of cardiovascular disease later in life [3].
A congenital solitary functioning kidney (CSFK) is most often the result of developmental kidney anomalies, such as unilateral kidney agenesis (UKA), multicystic kidney dysplasia (MCDK), or kidney hypo/dysplasia (KHD). These anomalies fall within a spectrum of anomalies referred to as congenital anomalies of the kidney and urinary tract (CAKUT), which are thought to partly have a shared aetiology [4]. Several causative mechanisms may play a role in the aetiology of CAKUT: environmental risk factors, monogenic causes, pathogenic copy number variants (CNVs), and common genetic variants [5].
Environmental risk factors, such as maternal diabetes, overweight, and the use of artificial reproductive technologies, have been shown to play a role in CAKUT aetiology [6,7,8]. For CSFK specifically, our group identified important roles for maternal stress and infections during pregnancy, as well as a protective effect of folic acid supplementation, in addition to previously identified factors such as conception using in vitro fertilization/intracytoplasmic sperm injection, maternal smoking during pregnancy, and older maternal age [9]. Monogenic causes for CAKUT have been discovered in over 150 different genes, and this number keeps rising due to increased availability and decreased costs of exome sequencing [10,11]. Approximately one-third of these genes have been associated with isolated forms of CAKUT, whereas the other genes with defects in multiple organ systems [11]. As CSFK has been reported to be familial in ±10%, and a CSFK may be part of a specific syndrome, several monogenic causes have been found (for a review, please see Westland et al. [12]). Studies performing whole exome sequencing (WES) in patients with sporadic and isolated CSFK are rare, and the yield is relatively low compared to non-isolated or familial CSFK [13]. Wu et al. and Lei et al. identified causative variants in 11% of mostly sporadic and isolated patients with UKA and MCDK, respectively, and Liu et al. solved 7% of patients with isolated UKA or MCDK using a combination of chromosomal microarray analysis and WES [13,14,15]. CNVs also play an important role in the aetiology of CAKUT. This was first identified in a cohort of 192 patients with KHD and later extended to other CAKUT phenotypes, including CSFK [16,17,18]. As CAKUT is between common and rare disorders, low-frequency variants with intermediate effects may explain an important fraction of its genetic aetiology [19]. Nonetheless, genome-wide association studies (GWAS) have rarely been performed for CAKUT because of the large cohorts needed. A GWAS including almost 1400 patients with vesicoureteral reflux (VUR) identified three loci that reached genome-wide significance, with odds ratios ranging from 1.4 to 3.7 [20], whereas a previous GWAS on 500 VUR patients and a GWAS including 756 patients with posterior urethral valves (PUV) failed to identify genome-wide significant loci [21,22]. In contrast, two genome-wide significant variants were identified in a GWAS on whole genome sequencing data from 132 PUV patients: a common variant with a minor allele frequency (MAF) of 0.2 and odds ratio (OR) of 0.4 near TBX5 and a rare variant with a MAF of 0.05 and OR of 7.2 near PTK7 [23]. Up to now, no GWAS has been published for CSFK patients. To fill this knowledge gap and investigate the role of common genetic variants in the aetiology of CSFK, we performed a genome-wide association study in a Dutch cohort of patients with CSFK.

2. Patients and Methods

2.1. Patients

Patients with CSFK were derived from the AGORA (Aetiologic research into Genetic and Occupational/environmental Risk factors for Anomalies in children) data- and biobank [24]. This data- and biobank is coordinated from the Radboud university medical centre and Amalia children’s hospital, and contains clinical information, DNA samples, and questionnaire data from patients with congenital malformations and their parents. Patients were recruited prospectively during a visit to the Department of Paediatric Nephrology or Division of Paediatric Urology of the Amalia children’s hospital or University Medical Centre Utrecht or retrospectively in one of 36 Dutch hospitals participating in the SOFIA (Solitary functioning kidney: aetiology and prognosis) study [1]. For the current GWAS, we selected patients born with a solitary functioning kidney (defined as <20% differential function on MAG-3 or DMSA scans and/or unilateral absence of kidney tissue on kidney ultrasound) resulting from UKA, MCDK or KHD. Patients were selected if they were born between 1 January 1993 and 31 December 2020 and if their DNA was available in the AGORA data- and biobank. Patients with either a known genetic cause or a syndrome (with or without a molecular diagnosis) were excluded from the analyses.
Two healthy control populations were used. In 2011, 39 Dutch municipalities selected random samples of 150 or 300 inhabitants born between 1990–2010 for the AGORA data- and biobank. These children and their parents were invited to participate in AGORA via regular mail. Over 2000 families filled out a questionnaire, while mothers and children from a subsample of 748 families also donated a saliva sample for DNA isolation. Due to this relatively small control population available in the AGORA data- and biobank, we included a second control population of 6468 participants who donated blood samples for the Nijmegen Biomedical Study (NBS) [25]. NBS is a population-based study carried out by the Departments for Health Evidence, Laboratory Medicine, and Human Genetics of the Radboud university medical centre together with the community health service and the municipality of Nijmegen. Participants were recruited via random sampling of inhabitants of Nijmegen who were 18 years or older. Participants filled out one or more questionnaires and donated blood samples. All study protocols were approved by the Regional Committee on Research Involving Human Subjects, and informed consent was obtained from all participants and/or legal representatives.

2.2. Genotyping

Genotyping of all patients’ DNA samples was performed by deCODE genetics (Reykjavik, Iceland) using Infinium Global Screening Array deCodeGenetics_V3 (Illumina, San Diego, CA, USA), which contains variants present on the Infinium Global Screening Array (GSA) v3.0 BeadChip as well as 50,000 high-quality markers from the OmniExpress bead chips in order to make data derived from these two chips more compatible. The AGORA controls were genotyped using Infinium Global Screening Arrays, while the NBS controls were genotyped using OmniExpress bead chips (Illumina, San Diego, CA, USA).

2.3. Quality Control

Quality control (QC) was performed separately for all three cohorts using the PLINK toolset (https://www.cog-genomics.org/plink/2.0/ accessed on 30 July 2021) [26]. First, a sex check was performed to identify sex discrepancies. Next, variant QC was carried out, removing variants with a call rate <98%, a deviation from Hardy-Weinberg equilibrium with a p-value < 1 × 10−10 (for patients) or <1 × 10−6 (for controls), or a MAF < 0.01. Samples were removed if the call rate was <98%. We used the Kinship-based INference for Gwas (KING) toolset to check family relationships within the cohort and to remove individuals related up to the 3rd degree [27].

2.4. Imputation

Before imputation, the datasets containing patients and controls were merged. We decided to use a two-stage approach for the analyses. Since the patients and AGORA controls were genotyped using similar chips, we started our analyses with the patients and AGORA controls. Due to a limited number of controls, however, the power to detect an association in this dataset was limited. We had 80% power to detect variants with an allelic OR of 3.2 for a MAF of 0.05, an OR of 2.5 for a MAF of 0.10, and an OR of 2.1 for a MAF of 0.25 with genome-wide significance. In contrast, the NBS cohort had more power given the higher sample size (80% power to detect variants with an OR of 2.9 for MAF of 0.05, OR of 2.3 for MAF 0.10, and OR of 1.9 for MAF 0.25), but could be more prone to array-specific batch effects.
To enable this approach, we created two datasets: the patients were merged with the controls from the AGORA data- and biobank to form the AGORA dataset, and the patients were merged with the controls from the NBS to form the NBS dataset. Imputation was based on the overlap in variants genotyped in patients and controls (n = 524,412 in the AGORA cohort and n = 207,692 in the NBS cohort). Genotypes of the samples were phased using Eagle (v. 2.4.1) and imputed using Minimac (v. 4) with 1000 Genomes Project Phase 3 data as a reference panel [28,29,30]. After imputation, ancestry estimation was performed using data from 1000 Genomes Project Phase 3 by assigning individuals to closest reference super- and subpopulations according to the geometric distance in the ten principal component space [31]. Further analyses were performed using individuals with predicted GBR or CEU ancestry only to create a homogeneous study population [29].

2.5. Statistical Analyses

Using PLINK, ORs were calculated for each SNV using logistic regression models. Analyses were limited to variants with a Minimac imputation quality score (r2) of 0.6 and a minor allele count of 20 (for patients and controls combined). The first four principal components and sex were included as confounders. Inflation was visually inspected using quantile-quantile (QQ) plots, and results were depicted in Manhattan plots, which were created in Rstudio using the ‘qqman’ package. Variants with a p-value < 5 × 10−8 were considered statistically significant on a genome-wide level, while variants with a p-value < 1 × 10−5 were considered suggestively significant. FUMA (https://fuma.ctglab.nl/ accessed on 15 April 2022) was used to infer independent genomic risk loci, and we selected loci with at least 5 SNVs showing suggestive significance [32]. To further filter our results and select only the most promising loci, we checked the MAF of the lead SNP in the AGORA and NBS controls and compared these with the MAF in openly available reference sets, such as the Dutch GoNL population [33] or, if GoNL data was not available, data for individuals with European ancestry from the 1000 Genomes Project [29]. We restricted to loci with a lead SNV that: (1) reached genome-wide statistical significance or (2) had a MAF > 0.05 in our cases, and this MAF in our cases was closer to the MAF in the controls of our discovery cohort than to the MAF in the reference database (i.e., using the MAF of the reference database as control would result in a stronger effect of the SNV). For the selected loci, we used LocusZoom to visualise results [34], checked the location (intergenic, intronic, or exonic), and used the Open Target Genetics platform (https://genetics.opentargets.org/ accessed on 19 August 2022) to estimate relatedness with nearby genes. Furthermore, we obtained information on topologically associated domains from TADKB (http://dna.cs.miami.edu/TADKB/ accessed on 12 September 2022) [35], while the RegulomeDB (https://regulomedb.org/ accessed on 12 September 2022) [36] was used to assess the functional importance of loci. Lastly, spatiotemporal and species-specific gene expression was visualised using the web-based application developed by the Kaessmann group (https://apps.kaessmannlab.org/evodevoapp/ accessed on 12 September 2022) [37].

3. Results

3.1. Participants

The AGORA data- and biobank contained DNA from 184 prospectively collected patients with a solitary functioning kidney and saliva samples from 660 patients collected for the SOFIA study (Figure 1). For 10 participants, both a DNA sample and a saliva sample were retrieved, meaning that 844 samples from 834 unique patients were available for genotyping. Of these 834 patients, 32 were excluded due to failed genotyping, 35 patients were excluded because of a known genetic cause or syndrome and 315 patients presented with a phenotype other than UKA, MCDK, or KHD. This resulted in 452 successfully genotyped CSFK patients. For 669 AGORA controls and 5363 NBS controls, genotyping was performed successfully at an earlier stage.

3.2. AGORA Dataset Results

After quality control (see Supplementary Materials) and imputation, the AGORA GWAS dataset consisted of 9,956,431 SNVs in 403 patients and 622 controls. The lambda values (1.03) and visual inspection of the QQplot (Figure 2) showed no signs of inflation. None of the SNVs reached genome-wide significance, but several peaks reached suggestive significance (Figure 3). In total, 43 SNVs had a p-value below 1 × 10−5, and 11 genomic risk loci were identified. After restricting to loci with at least five SNVs, six were visualized using LocusZoom and listed in Table 1.

3.3. NBS Dataset Results

In the NBS dataset, 403 patients and 4366 controls were available for analysis after QC, and 9,313,318 SNVs were present after imputation. Again, no signs of inflation were found based on the lambda value (1.04) and QQplot (Figure 4). Sixteen SNVs from two independent loci reached genome-wide significance (Figure 5). Additionally, 335 SNVs from 42 loci had a p-value below 1 × 10−5. Twenty-four of these loci contained at least five SNVs and were further investigated (Table 1).

3.4. Loci with a Genome-Wide Statistically Significant p-Value

Two loci on chromosomes 7 and 18, reached genome-wide significant p-values (Table 1). The locus on chromosome 7, with lead SNV rs140804918, was found to have a MAF of 0.05 in the patients, 0.02 in the NBS controls, and 0.03 in the AGORA controls and the GoNL database, and a corresponding p-value of 1.4 × 10−8 in the NBS dataset. The resulting ORs in the NBS and AGORA datasets were 3.1 and 1.9, respectively, which are the usual effect sizes for an SNV associated with a congenital malformation [19,38]. Although the SNV is in an intergenic region, the location of this SNV is promising, given its proximity to the Hepatocyte Growth Factor (HGF) gene (Figure 6A). Moreover, this genomic risk locus is in the same topologically associated domain as the HGF gene, and several of the variants in this locus are located in the regulatory region of HGF, with RegulomeDB scores of 0.75–1.0, indicating that they are likely to be regulatory variants [36]. The SNV that we found to be associated could be a marker for such a regulatory variant. HGF gene expression in human kidney tissue was found to be highest in weeks 5 and 6 post-conception [37], which aligns with two important events in embryonic kidney development: the appearance of the ureteric buds on day 28 and invasion of the newly developed metanephric mesenchyme on day 32, followed by the start of branching morphogenesis [39].
The lead variant on chromosome 18 was rs184382636, which had allelic odds ratios of 5.9 and 12.9 in AGORA and NBS controls, respectively, and a p-value of 1.2 × 10−10 in the NBS dataset. The MAF in cases was 0.017, but only 0.002 in the NBS controls and the GoNL database and 0.003 in the AGORA controls, which increases the likelihood of a false positive association. This variant is located in an intergenic region, approximately 500 kbp from the SMIM21 gene, and has a RegulomeDB score of 0.07 (Figure 6B). SMIM21 mRNA tissue expression is highest in the testis, with no expression in developing or mature kidney tissue. The SMIM21 protein is thought to be part of the cell membrane [37]. Five other genes are located in the same topologically associated domain, but none could be linked to kidney development.

3.5. Other Selected Loci

Of the thirty other loci (6 from the AGORA dataset and 24 from the NBS dataset, Table 1), we selected the six loci with a lead SNV with a MAF >0.05 in the cases and in which the effect estimate using the reference population MAF would be stronger than the effect estimate identified in the discovery dataset. The MAFs varied between 0.05 and 0.50, and odds ratios ranged between 0.5 and 2.1 (Table 1). Four of the six loci were located in an intergenic region, whereas one was intronic, and one was located just upstream of a gene (Table 2, Figure 6C,D and Figure 7A–D).
Although two of the intergenic SNVs shared a topologically associated domain with genes involved in kidney development (rs9547854 with FREM2 and rs148251525 with VRK1), these SNVs had low RegulomeDB scores (<0.20, Table 2), suggesting no regulatory effect on the nearby genes. The other two intergenic SNVs could not be linked to genes involved in kidney development. Only two variants were located in or close to protein-coding genes: rs148413365 on chromosome 6 is an intronic variant of KCTD20, but also in close proximity to STK38 (also known as NDR1) (Table 2, Figure 6C). Therefore, rs148413365 could be a marker for other variants influencing either of these genes. Both genes show high expression in embryonic kidney tissue and are involved in pathways linked to kidney development. Lastly, rs111283115 on chromosome 15 is a 2kbp upstream variant of ACAN (Table 2, Figure 6D). The encoded protein is an important part of the extracellular matrix in cartilage, and causal ACAN gene variants have been found in patients with short stature. [41] It has relatively low expression in kidney tissue compared to other genes and could not be linked to kidney development, making it unlikely that variants in this gene are involved in the aetiology of CSFK.

4. Discussion

We performed the first GWAS in a phenotypically homogeneous population of patients with CSFK and identified several loci that may be linked to kidney development. Despite the relatively small number of patients included in the GWAS, variants in two loci reached genome-wide statistical significance. In addition, variants in thirty loci reached suggestive significance, of which six were selected as most promising based on the minor allele frequencies seen in our study population and acquired from reference databases. Although we lacked an independent replication cohort, we were able to identify two genomic loci which are most likely to be involved in kidney development based on the result of our GWAS and in silico evaluation.
Of the two loci with genome-wide significance, the locus on chromosome 7 with lead SNV rs140804918 is most interesting because of its MAF, OR, and location close to the HGF gene. The involvement of HGF and its receptor Met in kidney development was already suggested by Santos et al. in 1994 [42] and by Woolf et al. in 1995 [43]. Both Hgf and Met proteins were identified in mouse kidneys from embryonic day 11.5 onwards, and branching morphogenesis in cultured kidney cells could be decreased using anti-HGF serum [42,43]. Although complete Hgf or Met knockout is lethal [44], conditional knockout models showed decreased ureteric bud branching and nephron numbers [45]. Indeed, HGF has been identified as the candidate gene in the Munich Wistar Frömter inbred rat that has been shown to have an inherited nephron deficit [46]. Furthermore, Met protein was shown to be involved in branching morphogenesis together with the epidermal growth factor receptor (EGFR) [45], while activation of Met through glial cell line-derived neurotrophic factor (GDNF) was hypothesized to create a positive feedback loop further stimulating branching morphogenesis [33]. Variants affecting HGF protein levels could thus influence branching morphogenesis through activation of Met.
Only one of the loci with suggestive significance could be linked to kidney development based on in silico analyses. The lead SNV rs148413365 of this locus on chromosome 6 is an intronic variant of the KCTD20 gene. The molecular function of the encoded KCTD20 protein is not well-characterised but is thought to be within the AKT-mTOR-p70 S6k signalling cascade because of protein-protein-interactions with MAP3K8, PPP1CA, and MARK4 [47,48]. In rodents with reduced ureteric bud branching due to intrauterine growth restriction, AKT-mTOR is one of the pathways that is downregulated [49,50]. In addition, this signalling cascade is crucial for compensatory kidney hypertrophy after a reduction in nephron number [51]. As such, it may also be involved in an adequate response to the maldevelopment of one kidney by the contralateral kidney. In normal human embryonic kidney tissue, activated AKT, mTOR, and P70 S6 kinase are constantly expressed in the ureteric bud [52]. Embryonic tissue from MCDK patients showed different expression patterns in a recent study, indicating that inappropriate mTOR pathway activation may be involved in cyst formation in MCDK [52]. Whether this could be a primary cause of MCDK or a response secondary to, e.g., ureteral blockage is still unclear. The other gene that could possibly explain an association between rs148413365 and CSFK is STK38. The STK38 protein was recently identified as part of the Hippo pathway and is able to inhibit YAP/TAZ signalling [53]. Deletion of LATS1/2, which has a similar effect on YAP/TAZ expression as STK38, leads to upregulated YAP/TAZ signalling and kidney agenesis, which could be rescued by reducing YAP/TAZ levels [54]. On the other hand, YAP deletion is known to limit nephrogenesis [55], and YAP and TAZ have been suggested to have dosage-sensitive roles in kidney development [56]. This suggests an important role for YAP/TAZ signalling in kidney development, which could be affected by STK38 gene variants. Both KCTD20 and STK38 are expressed throughout embryonic kidney development and are in the 10% of genes with the strongest expression in foetal kidney tissue [57]. Combined with the evidence of functional relevance, both may be involved in the aetiology of CSFK and should be topics of further investigation.
An important feature of a GWAS is the unbiased testing of many common variants across the genome. This approach has the advantage that it enables the identification of genes or pathways not previously linked to a certain trait. However, it has the inherent risk of false positive results as well. To reduce this risk, a strict threshold for genome-wide significance (5 × 10−8) is generally applied, and replication in an independent cohort is required before an association is established. In our study, two variants reached the genome-wide significance threshold. Unfortunately, we were unable to find an independent cohort with a large enough number of CSFK patients to replicate our findings, which is an important limitation of our study. Although replication in other populations remains needed, the fact that the MAFs of the identified SNVs were consistent across different control populations reduces the chance of type one errors.
Ideally, our analyses would have been further stratified based on subtypes (i.e., UKA, MCDK, and KHD analysed separately), but this would have led to insufficient power to identify relevant associations. Although patients with known disease-causing mutations or syndromes were excluded, we were unable to investigate the presence of monogenic causes or pathogenic CNVs in most patients (~75%), which would have increased our ability to identify SNVs involved in CSFK aetiology. A strength of our study was the large and homogeneous study population, including only patients with CSFK due to UKA, MCDK or KHD, which are phenotypes likely sharing a common aetiology. Furthermore, we restricted our analyses to participants with the two most common ancestral backgrounds and were able to use two large geographically matched control populations.
Our study provides important knowledge about the role of common variants in the aetiology of CSFK. This may ultimately provide us with new insights into the pathophysiology of unilateral kidney maldevelopment and help explain the CSFK to patients and their parents in clinical practice. The limitations of our study illustrate, however, that establishing large, well-characterized patient cohorts remains important to replicate GWAS findings and identify additional variants. As an example, at least 3000 genotyped patients would be needed for 80% power to identify a variant with an OR of 1.5 and a MAF of 0.05 in the case of a 1:5 case-control ratio. As such, unravelling the aetiology of relatively rare congenital anomalies, such as CSFK, will only be possible if large international collaborations are created. The establishment of the European Reference Networks (e.g., ERKNet and eUROGEN) and the intended foundation of a European Health Data Space will hopefully facilitate such initiatives.

5. Conclusions

In conclusion, we identified several variants that reached or approached genome-wide significance in our GWAS of CSFK. Variants in two loci reached the genome-wide significance threshold of 5 × 10−8. Of these, the identified locus with lead SNV rs140804918 on chromosome 7 was most promising due to its close proximity to HGF, a gene of which the role in kidney development could be larger than expected so far. In addition, variants in 30 loci reached the suggestive significance threshold of 1 × 10−5. Of these, SNV rs148413365 on chromosome 6 was most promising as both KCTD20 and STK38 could explain the suggestive significant association based on their molecular functions. Due to the lack of an independent replication cohort, these findings need replication before they can be established, and future research is warranted to enhance our understanding of the molecular mechanisms involved in the origin of CSFK.

Supplementary Materials

The supporting information and can be downloaded at: https://www.mdpi.com/article/10.3390/biomedicines10123023/s1, it contains a detailed description of the numbers of patients and controls excluded in the different quality control steps.

Author Contributions

Conceptualization, S.G.i.’t.W., I.A.L.M.v.R., M.F.S. and L.F.M.v.d.Z.; Data curation, S.G.i.’t.W., K.Y.R., T.G., S.H.V. and L.F.M.v.d.Z.; Formal analysis, S.G.i.’t.W. and C.M.; Funding acquisition, S.G.i.’t.W., W.F.J.F., N.R., M.F.S. and L.F.M.v.d.Z.; Methodology, S.G.i.’t.W., C.M., M.F.S. and L.F.M.v.d.Z.; Project administration, S.G.i.’t.W.; Resources, K.Y.R., R.W., T.G., I.A.L.M.v.R., S.H.V., W.F.J.F. and M.F.S.; Software, C.M.; Supervision, W.F.J.F., N.R., M.F.S. and L.F.M.v.d.Z.; Visualization, S.G.i.’t.W.; Writing—original draft, S.G.i.’t.W.; Writing—review & editing, C.M., K.Y.R., R.W., T.G., I.A.L.M.v.R., S.H.V., W.F.J.F., N.R., M.F.S. and L.F.M.v.d.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by a Junior Investigator Grant from the Radboud Institute for Health Sciences. Loes van der Zanden is funded by a Veni grant (91618036) from the Dutch Research Council (NWO). Loes van der Zanden and Sander Groen in ’t Woud are funded by a Consortium grant from the Dutch Kidney Foundation (20OC002). Michiel Schreuder is funded by a Vidi grant (016.156.454) from the Dutch Research Council (NWO). Kirsten Renkema is funded by a Kolff Senior grant (15OKG18) from the Dutch Kidney Foundation.

Institutional Review Board Statement

The study was conducted according to the guidelines of the Declaration of Helsinki and approved by the Regional Committee on Research Involving Human Subjects Arnhem-Nijmegen (registration number 2018-4524, date of approval 23 March 2020).

Informed Consent Statement

Informed consent was obtained from all subjects and/or their legal caregivers involved in the study.

Data Availability Statement

The data presented in this study are available on request from the corresponding author. The data are not publicly available to protect the privacy of the study participants.

Acknowledgments

We would like to thank all participating children and their parents for their contributions to this study. Furthermore, we are thankful for the support concerning the identification of patients and collection of data provided by many paediatricians, paediatric nephrologists, urologists, and other colleagues. The Nijmegen Biomedical Study is a population-based survey conducted at the Department for Health Evidence and the Department of Laboratory Medicine of the Radboud university medical centre. Principal investigators of the Nijmegen Biomedical Study are L.A.L.M. Kiemeney, A.L.M. Verbeek, D.W. Swinkels, and B. Franke. The following physicians are included as non-author collaborators in the SOFIA study group: J.A.E. van Wijk (Emma Children’s Hospital, Amsterdam UMC), M.R. Lilien and M.G. Keijzer-Veen (Wilhelmina Children’s Hospital, University Medical Center Utrecht), F.J. Kloosterman and M.G. Steffens (Isala Klinieken), V. Gracchi (Beatrix Children’s Hospital, University Medical Center Groningen), B. Zegers (Máxima MC), P.E. Jira (Jeroen Bosch Ziekenhuis), H. van der Deure (Deventer Ziekenhuis), R.W.G. van Rooij (LUMC Willem Alexander Children’s Hospital), E. Wijnands-van den Berg (Medisch Spectrum Twente (currently Isala Klinieken)), M. Breukels (Elkerliek ziekenhuis), S.M.H.B. de Pont (Amphia Ziekenhuis), E. Harnisch (Franciscus Gasthuis & Vlietland), C.M.L. van Dael (VieCuri/Maastricht UMC+), D. Creemers (Rijnstate (currently Deventer Ziekenhuis)), R. de Moor (Elisabeth-TweeSteden Ziekenhuis), A.Y. Konijnenberg (Ziekenhuis St Jansdal), E. Knots (Catharina Ziekenhuis Eindhoven), E.C. van der Kuur (Streekziekenhuis Koningin Beatrix), M.J. Jacobs (Maasziekenhuis Pantein), M. Koppejan-Stapel (Ziekenhuis Gelderse Vallei), A. Pijning (Slingeland Ziekenhuis), E. Dorresteijn (Erasmus MC), R.W.J. Leunissen (Haaglanden Medisch Centrum), R. Rijlaarsdam (ZGT), R. del Canho (Maasstad Ziekenhuis), B. Semmekrot (CWZ), A. Dings-Lammertink (Gelre ziekenhuizen loc Zutphen), I.J.M. Nijhuis (Wilhelmina Ziekenhuis Assen), M.J. van Ledden-Klok (Van Weel-Bethesda), L.M. van den Broek (St Jans Gasthuis), C. Meine Jansen (Groene Hart Ziekenhuis), M.C.G. Beeren (St Anna Zorggroep), H.E. Blokland-Loggers and C. Dorrepaal (St Antonius Ziekenhuis), L.J.W.M. Pierik (Ommelander Ziekenhuis Groningen), and A.L. Tanja (Martini Ziekenhuis).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Groen in ’t Woud, S.R.; Westland, R.; Renkema, K.Y.; Steffens, M.G.; Gracchi, V.; Lilien, M.R.; van Wijk, J.A.E.; Feitz, W.F.J.; Schreuder, M.F.; van der Zanden, L.F.M.; for the SOFIA Study Group. Risk factors for kidney injury in children with solitary functioning kidney. Kidney Int. 2022, in press. [Google Scholar] [CrossRef] [PubMed]
  2. Sanna-Cherchi, S.; Ravani, P.; Corbani, V.; Parodi, S.; Haupt, R.; Piaggio, G.; Innocenti, M.L.; Somenzi, D.; Trivelli, A.; Caridi, G.; et al. Renal outcome in patients with congenital anomalies of the kidney and urinary tract. Research Support, Non-U.S. Gov’t. Kidney Int. 2009, 76, 528–533. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Jacobs, D.R., Jr.; Woo, J.G.; Sinaiko, A.R.; Daniels, S.R.; Ikonen, J.; Juonala, M.; Kartiosuo, N.; Lehtimaki, T.; Magnussen, C.G.; Viikari, J.S.A.; et al. Childhood Cardiovascular Risk Factors and Adult Cardiovascular Events. N. Engl. J. Med. 2022, 386, 1877–1888. [Google Scholar] [CrossRef] [PubMed]
  4. Knoers, N. The term CAKUT has outlived its usefulness: The case for the defense. Pediatr. Nephrol. 2022, 37, 2793–2798. [Google Scholar] [CrossRef]
  5. Nicolaou, N.; Renkema, K.Y.; Bongers, E.M.H.F.; Giles, R.H.; Knoers, N.V.A.M. Genetic, environmental, and epigenetic factors involved in CAKUT. Nat. Rev. Nephrol. 2015, 11, 720–731. [Google Scholar] [CrossRef] [PubMed]
  6. Dart, A.; Ruth, C.; Sellers, E.; Au, W.; Dean, H. Maternal diabetes mellitus and congenital anomalies of the kidney and urinary tract (CAKUT) in the child. Am. J. Kidney Dis. 2015, 65, 684–691. [Google Scholar] [CrossRef] [PubMed]
  7. Groen in ’t Woud, S.; Renkema, K.; Schreuder, M.; Wijers, C.; van der Zanden, L.F.M.; Knoers, N.V.A.M.; Feitz, W.F.J.; Bongers, E.; Roeleveld, N.; van Rooij, I.A.L.M. Maternal risk factors involved in specific congenital anomalies of the kidney and urinary tract: A case-control study. Birth Defects Res. Part A Clin. Mol. Teratol. 2016, 106, 596–603. [Google Scholar] [CrossRef] [PubMed]
  8. Macumber, I.; Schwartz, S.; Leca, N. Maternal obesity is associated with congenital anomalies of the kidney and urinary tract in offspring. Pediatr. Nephrol. 2017, 32, 635–642. [Google Scholar] [CrossRef] [PubMed]
  9. Groen in ’t Woud, S.R.; van Rooij, A.L.M.; Feitz, W.F.J.; Schreuder, M.F.; van der Zanden, L.F.M. Environmental risk factors for congenital solitary functioning kidney—A case-control study. 2022; submitted for publication. [Google Scholar]
  10. Sanna-Cherchi, S.; Westland, R.; Ghiggeri, G.M.; Gharavi, A.G. Genetic basis of human congenital anomalies of the kidney and urinary tract. J. Clin. Investig. 2018, 128, 4–15. [Google Scholar] [CrossRef]
  11. Seltzsam, S.; Wang, C.; Zheng, B.; Mann, N.; Connaughton, D.M.; Wu, C.W.; Schneider, S.; Schierbaum, L.; Kause, F.; Kolvenbach, C.M.; et al. Reverse phenotyping facilitates disease allele calling in exome sequencing of patients with CAKUT. Genet. Med. 2022, 24, 307–318. [Google Scholar] [CrossRef]
  12. Westland, R.; Schreuder, M.F.; van Goudoever, J.B.; Sanna-Cherchi, S.; van Wijk, J.A.E. Clinical implications of the solitary functioning kidney. Clin. J. Am. Soc. Nephrol. 2014, 9, 978–986. [Google Scholar] [CrossRef] [Green Version]
  13. Liu, L.; Li, J.; Li, Y.; Li, H.; Yang, B.; Fan, H.; Wang, J.; Gu, Y.; Yu, H.; Bai, M.; et al. Genetic diagnosis of common fetal renal abnormalities detected on prenatal ultrasound. Prenat. Diagn. 2022, 42, 894–900. [Google Scholar] [CrossRef] [PubMed]
  14. Lei, T.Y.; Fu, F.; Li, R.; Yu, Q.X.; Du, K.; Zhang, W.W.; Deng, Q.; Li, L.S.; Wang, D.; Yang, X.; et al. Whole-exome sequencing in the evaluation of fetal congenital anomalies of the kidney and urinary tract detected by ultrasonography. Prenat. Diagn. 2020, 40, 1290–1299. [Google Scholar] [CrossRef] [PubMed]
  15. Wu, H.; Xu, Q.; Xie, J.; Ma, J.; Qiao, P.; Zhang, W.; Yu, H.; Wang, W.; Qian, Y.; Zhang, Q.; et al. Identification of 8 Novel Mutations in Nephrogenesis-Related Genes in Chinese Han Patients with Unilateral Renal Agenesis. Am. J. Nephrol. 2017, 46, 55–63. [Google Scholar] [CrossRef] [PubMed]
  16. Sanna-Cherchi, S.; Kiryluk, K.; Burgess, K.E.; Bodria, M.; Sampson, M.G.; Hadley, D.; Nees, S.N.; Verbitsky, M.; Perry, B.J.; Sterken, R.; et al. Copy-Number Disorders Are a Common Cause of Congenital Kidney Malformations. Am. J. Hum. Genet. 2012, 91, 987–997. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  17. Verbitsky, M.; Westland, R.; Perez, A.; Kiryluk, K.; Liu, Q.X.; Krithivasan, P.; Mitrotti, A.; Fasel, D.A.; Batourina, E.; Sampson, M.G.; et al. The copy number variation landscape of congenital anomalies of the kidney and urinary tract. Nat. Genet. 2019, 51, 117–127. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  18. Westland, R.; Verbitsky, M.; Vukojevic, K.; Perry, B.J.; Fasel, D.A.; Zwijnenburg, P.J.G.; Bokenkamp, A.; Gille, J.J.P.; Saraga-Babic, M.; Ghiggeri, G.M.; et al. Copy number variation analysis identifies novel CAKUT candidate genes in children with a solitary functioning kidney. Kidney Int. 2015, 88, 1402–1410. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  19. Manolio, T.A.; Collins, F.S.; Cox, N.J.; Goldstein, D.B.; Hindorff, L.A.; Hunter, D.J.; McCarthy, M.I.; Ramos, E.M.; Cardon, L.R.; Chakravarti, A.; et al. Finding the missing heritability of complex diseases. Nature 2009, 461, 747–753. [Google Scholar] [CrossRef] [Green Version]
  20. Verbitsky, M.; Krithivasan, P.; Batourina, E.; Khan, A.; Graham, S.E.; Marasa, M.; Kim, H.; Lim, T.Y.; Weng, P.L.; Sanchez-Rodriguez, E.; et al. Copy Number Variant Analysis and Genome-wide Association Study Identify Loci with Large Effect for Vesicoureteral Reflux. J. Am. Soc. Nephrol. 2021, 2, 805–820. [Google Scholar] [CrossRef]
  21. Darlow, J.M.; Dobson, M.G.; Darlay, R.; Molony, C.M.; Hunziker, M.; Green, A.J.; Cordell, H.J.; Puri, P.; Barton, D.E. A new genome scan for primary nonsyndromic vesicoureteric reflux emphasizes high genetic heterogeneity and shows linkage and association with various genes already implicated in urinary tract development. Mol. Genet. Genom. Med. 2014, 2, 7–29. [Google Scholar] [CrossRef]
  22. Van der Zanden, L.F.M.; Borisov, O.; van Rooij, I.A.L.M.; Quaedackers, J.S.L.T.; Steffens, M.G.; Schierbaum, L.; Schneider, S.; Waffenschmidt, L.; Kiemeney, L.A.L.M.; de Wall, L.L.L.; et al. Genome-wide association study in patients with posterior urethral valves. Front. Pediatr. 2022, 10, 988374. [Google Scholar] [CrossRef] [PubMed]
  23. Chan, M.M.Y.; Sadeghi-Alavijeh, O.; Lopes, F.M.; Hilger, A.C.; Stanescu, H.C.; Voinescu, C.D.; Beaman, G.M.; Newman, W.G.; Zaniew, M.; Weber, S.; et al. Diverse ancestry whole-genome sequencing association study identifies TBX5 and PTK7 as susceptibility genes for posterior urethral valves. eLife 2022, 11, E74777. [Google Scholar] [CrossRef] [PubMed]
  24. Van Rooij, I.A.; van der Zanden, L.F.; Bongers, E.M.; Renkema, K.Y.; Wijers, C.H.; Thonissen, M.; Dokter, E.M.; Marcelis, C.L.; de Blaauw, I.; Wijnen, M.H.; et al. AGORA, a data- and biobank for birth defects and childhood cancer. Birth Defects Res. A Clin. Mol. Teratol. 2016, 106, 675–684. [Google Scholar] [CrossRef]
  25. Galesloot, T.E.; Vermeulen, S.H.; Swinkels, D.W.; de Vegt, F.; Franke, B.; den Heijer, M.; de Graaf, J.; Verbeek, A.L.M.; Kiemeney, L. Cohort Profile: The Nijmegen Biomedical Study (NBS). Int. J. Epidemiol. 2017, 46, 1099–1100j. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Purcell, S.; Neale, B.; Todd-Brown, K.; Thomas, L.; Ferreira, M.A.; Bender, D.; Maller, J.; Sklar, P.; de Bakker, P.I.; Daly, M.J.; et al. PLINK: A tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 2007, 81, 559–575. [Google Scholar] [CrossRef] [Green Version]
  27. Manichaikul, A.; Mychaleckyj, J.C.; Rich, S.S.; Daly, K.; Sale, M.; Chen, W.M. Robust relationship inference in genome-wide association studies. Bioinformatics 2010, 26, 2867–2873. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  28. Das, S.; Forer, L.; Schonherr, S.; Sidore, C.; Locke, A.E.; Kwong, A.; Vrieze, S.I.; Chew, E.Y.; Levy, S.; McGue, M.; et al. Next-generation genotype imputation service and methods. Nat. Genet. 2016, 48, 1284–1287. [Google Scholar] [CrossRef] [Green Version]
  29. The 1000 Genomes Project Consortium; Auton, A.; Brooks, L.D.; Durbin, R.M.; Garrison, E.P.; Kang, H.M.; Korbel, J.O.; Marchini, J.L.; McCarthy, S.; McVean, G.A.; et al. A global reference for human genetic variation. Nature 2015, 526, 68–74. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  30. Loh, P.R.; Danecek, P.; Palamara, P.F.; Fuchsberger, C.; Reshef, Y.A.; Finucane, H.K.; Schoenherr, S.; Forer, L.; McCarthy, S.; Abecasis, G.R.; et al. Reference-based phasing using the Haplotype Reference Consortium panel. Nat. Genet. 2016, 48, 1443–1448. [Google Scholar] [CrossRef] [Green Version]
  31. Privé, F. Ancestry inference and grouping from principal component analysis of genetic data. bioRxiv 2020, 109, 12–23. [Google Scholar]
  32. Watanabe, K.; Taskesen, E.; van Bochoven, A.; Posthuma, D. Functional mapping and annotation of genetic associations with FUMA. Nat. Commun. 2017, 8, 1826. [Google Scholar] [CrossRef] [PubMed]
  33. Boomsma, D.I.; Wijmenga, C.; Slagboom, E.P.; Swertz, M.A.; Karssen, L.C.; Abdellaoui, A.; Ye, K.; Guryev, V.; Vermaat, M.; van Dijk, F.; et al. The Genome of the Netherlands: Design, and project goals. Eur. J. Hum. Genet. 2014, 22, 221–227. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Pruim, R.J.; Welch, R.P.; Sanna, S.; Teslovich, T.M.; Chines, P.S.; Gliedt, T.P.; Boehnke, M.; Abecasis, G.R.; Willer, C.J. LocusZoom: Regional visualization of genome-wide association scan results. Bioinformatics 2010, 26, 2336–2337. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Liu, T.; Porter, J.; Zhao, C.; Zhu, H.; Wang, N.; Sun, Z.; Mo, Y.Y.; Wang, Z. TADKB: Family classification and a knowledge base of topologically associating domains. BMC Genom. 2019, 20, 217. [Google Scholar] [CrossRef] [Green Version]
  36. Boyle, A.P.; Hong, E.L.; Hariharan, M.; Cheng, Y.; Schaub, M.A.; Kasowski, M.; Karczewski, K.J.; Park, J.; Hitz, B.C.; Weng, S.; et al. Annotation of functional variation in personal genomes using RegulomeDB. Genome Res. 2012, 22, 1790–1797. [Google Scholar] [CrossRef] [Green Version]
  37. Cardoso-Moreira, M.; Halbert, J.; Valloton, D.; Velten, B.; Chen, C.; Shao, Y.; Liechti, A.; Ascencao, K.; Rummel, C.; Ovchinnikova, S.; et al. Gene expression across mammalian organ development. Nature 2019, 571, 505–509. [Google Scholar] [CrossRef] [PubMed]
  38. Van der Zanden, L.F.; van Rooij, I.A.; Feitz, W.F.; Knight, J.; Donders, A.R.; Renkema, K.Y.; Bongers, E.M.; Vermeulen, S.H.; Kiemeney, L.A.; Veltman, J.A.; et al. Common variants in DGKK are strongly associated with risk of hypospadias. Nat. Genet. 2011, 43, 48–50. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  39. Schoenwolf, G.C.; Bleyl, S.B.; Brauer, P.R.; Francis-West, P.H. Larsen’s Human Embryology, 5th ed.; Churchill Livingstone: London, UK, 2015. [Google Scholar]
  40. McLaren, W.; Gil, L.; Hunt, S.E.; Riat, H.S.; Ritchie, G.R.; Thormann, A.; Flicek, P.; Cunningham, F. The Ensembl Variant Effect Predictor. Genome Biol. 2016, 17, 122. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  41. Nilsson, O.; Guo, M.H.; Dunbar, N.; Popovic, J.; Flynn, D.; Jacobsen, C.; Lui, J.C.; Hirschhorn, J.N.; Baron, J.; Dauber, A. Short stature, accelerated bone maturation, and early growth cessation due to heterozygous aggrecan mutations. J. Clin. Endocrinol. Metab. 2014, 99, E1510–E1518. [Google Scholar] [CrossRef] [PubMed]
  42. Santos, O.F.; Barros, E.J.; Yang, X.M.; Matsumoto, K.; Nakamura, T.; Park, M.; Nigam, S.K. Involvement of hepatocyte growth factor in kidney development. Dev. Biol. 1994, 163, 525–529. [Google Scholar] [CrossRef] [PubMed]
  43. Woolf, A.S.; Kolatsi-Joannou, M.; Hardman, P.; Andermarcher, E.; Moorby, C.; Fine, L.G.; Jat, P.S.; Noble, M.D.; Gherardi, E. Roles of hepatocyte growth factor/scatter factor and the met receptor in the early development of the metanephros. J. Cell Biol. 1995, 128, 171–184. [Google Scholar] [CrossRef] [PubMed]
  44. Uehara, Y.; Minowa, O.; Mori, C.; Shiota, K.; Kuno, J.; Noda, T.; Kitamura, N. Placental defect and embryonic lethality in mice lacking hepatocyte growth factor/scatter factor. Nature 1995, 373, 702–705. [Google Scholar] [CrossRef] [PubMed]
  45. Ishibe, S.; Karihaloo, A.; Ma, H.; Zhang, J.; Marlier, A.; Mitobe, M.; Togawa, A.; Schmitt, R.; Czyczk, J.; Kashgarian, M.; et al. Met and the epidermal growth factor receptor act cooperatively to regulate final nephron number and maintain collecting duct morphology. Development 2009, 136, 337–345. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  46. Schulte, L.; Schulz, A.; Unland, J.; Schulz, H.; Hubner, N.; Schmidt-Ott, K.M.; Kreutz, R. MWF rats with spontaneous albuminuria inherit a reduced efficiency of nephron induction during early nephrogenesis in comparison to SHR rats. J. Hypertens. 2012, 30, 2031–2038. [Google Scholar] [CrossRef] [PubMed]
  47. Skoblov, M.; Marakhonov, A.; Marakasova, E.; Guskova, A.; Chandhoke, V.; Birerdinc, A.; Baranova, A. Protein partners of KCTD proteins provide insights about their functional roles in cell differentiation and vertebrate development. BioEssays 2013, 35, 586–596. [Google Scholar] [CrossRef] [PubMed]
  48. Nawa, M.; Matsuoka, M. KCTD20, a relative of BTBD10, is a positive regulator of Akt. BMC Biochem. 2013, 14, 27. [Google Scholar] [CrossRef] [Green Version]
  49. Awazu, M.; Hida, M. Maternal nutrient restriction inhibits ureteric bud branching but does not affect the duration of nephrogenesis in rats. Pediatr. Res. 2015, 77, 633–639. [Google Scholar] [CrossRef] [Green Version]
  50. De Barros Sene, L.; Lamana, G.L.; Schwambach Vieira, A.; Scarano, W.R.; Gontijo, J.A.R.; Boer, P.A. Gestational Low Protein Diet Modulation on miRNA Transcriptome and Its Target During Fetal and Breastfeeding Nephrogenesis. Front. Physiol. 2021, 12, 648056. [Google Scholar] [CrossRef]
  51. Chen, J.K.; Chen, J.; Neilson, E.G.; Harris, R.C. Role of mammalian target of rapamycin signaling in compensatory renal hypertrophy. J. Am. Soc. Nephrol. 2005, 16, 1384–1391. [Google Scholar] [CrossRef] [Green Version]
  52. Apostolou, A.; Poreau, B.; Delrieu, L.; Thevenon, J.; Jouk, P.S.; Lallemand, G.; Emadali, A.; Sartelet, H. High Activation of the AKT Pathway in Human Multicystic Renal Dysplasia. Pathobiology 2020, 87, 302–310. [Google Scholar] [CrossRef]
  53. Martin, A.P.; Aushev, V.N.; Zalcman, G.; Camonis, G.H. The STK38-XPO1 axis, a new actor in physiology and cancer. Cell Mol. Life Sci. 2021, 78, 1943–1955. [Google Scholar] [CrossRef] [PubMed]
  54. Reginensi, A.; Enderle, L.; Gregorieff, A.; Johnson, R.L.; Wrana, J.L.; McNeill, H. A critical role for NF2 and the Hippo pathway in branching morphogenesis. Nat. Commun. 2016, 7, 12309. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  55. Reginensi, A.; Scott, R.P.; Gregorieff, A.; Bagherie-Lachidan, M.; Chung, C.; Lim, D.S.; Pawson, T.; Wrana, J.; McNeill, H. Yap- and Cdc42-dependent nephrogenesis and morphogenesis during mouse kidney development. PLoS Genet. 2013, 9, e1003380. [Google Scholar] [CrossRef] [Green Version]
  56. Drake, K.A.; Chaney, C.; Patel, M.; Das, A.; Bittencourt, J.; Cohn, M.; Carroll, T.J. Transcription Factors YAP/TAZ and SRF Cooperate To Specify Renal Myofibroblasts in the Developing Mouse Kidney. J. Am. Soc. Nephrol. 2022, 33, 1694–1707. [Google Scholar] [CrossRef] [PubMed]
  57. Holloway, E.M.; Wu, J.H.; Czerwinski, M.; Sweet, C.W.; Wu, A.; Tsai, Y.H.; Huang, S.; Stoddard, A.E.; Capeling, M.M.; Glass, I.; et al. Differentiation of Human Intestinal Organoids with Endogenous Vascular Endothelial Cells. Dev. Cell. 2020, 54, 516–528.e7. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Flowchart of the number of included patients and controls after recruitment, genotyping and quality control. GWAS: genome-wide association study; UKA: unilateral kidney agenesis; MCDK: multicystic dysplastic kidney; KHD: kidney hypo/dysplasia.
Figure 1. Flowchart of the number of included patients and controls after recruitment, genotyping and quality control. GWAS: genome-wide association study; UKA: unilateral kidney agenesis; MCDK: multicystic dysplastic kidney; KHD: kidney hypo/dysplasia.
Biomedicines 10 03023 g001
Figure 2. Quantile-quantile plot with lambda value for the AGORA dataset (403 patients and 622 controls).
Figure 2. Quantile-quantile plot with lambda value for the AGORA dataset (403 patients and 622 controls).
Biomedicines 10 03023 g002
Figure 3. Manhattan plot for the AGORA dataset (403 patients and 622 controls). The blue line indicates the threshold for suggestive statistical significance (p < 1 × 10−5).
Figure 3. Manhattan plot for the AGORA dataset (403 patients and 622 controls). The blue line indicates the threshold for suggestive statistical significance (p < 1 × 10−5).
Biomedicines 10 03023 g003
Figure 4. Quantile-quantile plot with lambda value for the NBS dataset (403 patients and 4366 controls).
Figure 4. Quantile-quantile plot with lambda value for the NBS dataset (403 patients and 4366 controls).
Biomedicines 10 03023 g004
Figure 5. Manhattan plot for the NBS dataset (403 patients and 4366 controls). The blue line indicates the threshold for suggestive statistical significance (p-value < 1 × 10−5) and the red line indicates the threshold for genome-wide statistical significance (p-value < 5 × 10−8).
Figure 5. Manhattan plot for the NBS dataset (403 patients and 4366 controls). The blue line indicates the threshold for suggestive statistical significance (p-value < 1 × 10−5) and the red line indicates the threshold for genome-wide statistical significance (p-value < 5 × 10−8).
Biomedicines 10 03023 g005
Figure 6. LocusZoom plots of: (A) SNV rs140804918 on chromosome 7. (B) SNV rs184382636 on chromosome 18. (C) SNV rs148413365 on chromosome 6. (D) SNV rs111283115 on chromosome 15.
Figure 6. LocusZoom plots of: (A) SNV rs140804918 on chromosome 7. (B) SNV rs184382636 on chromosome 18. (C) SNV rs148413365 on chromosome 6. (D) SNV rs111283115 on chromosome 15.
Biomedicines 10 03023 g006
Figure 7. LocusZoom plots for: (A) rs148251525. (B) rs10433490. (C) rs9547854. (D) rs2830456.
Figure 7. LocusZoom plots for: (A) rs148251525. (B) rs10433490. (C) rs9547854. (D) rs2830456.
Biomedicines 10 03023 g007
Table 1. Genomic risk loci with at least five SNVs with a p-value below 1 × 10−5 identified in a genome-wide association study of congenital solitary functioning kidney, with the lead SNV for each locus being shown.
Table 1. Genomic risk loci with at least five SNVs with a p-value below 1 × 10−5 identified in a genome-wide association study of congenital solitary functioning kidney, with the lead SNV for each locus being shown.
ChrPositionrsIDDiscovery DatasetMAF PatientsMAF ReferenceAGORA DatasetNBS Datasetn SNVs
MAF ControlsOR p-ValueMAF ControlsORp-Value
Genome-wide significant SNVs
781228890rs140804918NBS0.0480.0310.0291.918.4 × 10−30.0203.101.4 × 10−851
1873636290rs184382636NBS0.0170.0020.0035.932.4 × 10−30.00212.91.2 × 10−1042
SNVs selected based on MAF in discovery and reference populations
3189912222rs10433490NBS0.0920.1690.1530.594.0 × 10−40.1500.534.3 × 10−66
636451716rs148413365NBS0.1060.188 *0.1560.947.0 × 10−10.1660.511.9 × 10−684
1338000591rs9547854NBS0.0870.0470.0521.831.6 × 10−30.0511.972.1 × 10−666
1498514525rs148251525AGORA0.1270.0480.0752.085.2 × 10−60.1061.228.2 × 10−25
1589344863rs111283115AGORA0.3330.4490.4990.657.8 × 10−60.3870.713.5 × 10−424
2128122146rs2830456NBS0.2810.3700.4100.642.5 × 10−50.3500.634.5 × 10−646
Other suggestively significant SNVs
179317005rs140198115NBS0.0250.0090.0112.371.9 × 10−20.0103.797.8 × 10−617
192663276rs11406716AGORA0.3190.283 *0.2461.648.1 × 10−60.2771.278.4 × 10−376
322488640rs182382581NBS0.0250.0120.0094.101.5 × 10−30.0085.048.6 × 10−87
36328767rs7629003NBS0.0090.0030.0034.684.9 × 10−20.00214.487.6 × 10−620
4106051876rs190397903NBS0.0200.0070.0131.978.5 × 10−20.0075.232.0 × 10−69
470836149rs146356251NBS0.0180.0080.0082.792.1 × 10−20.0064.847.4 × 10−67
578674386rs138487999NBS0.0340.0190.0162.335.5 × 10−30.0142.847.4 × 10−6125
5139646076rs145922969NBS0.0300.0250.0133.151.6 × 10−30.0153.202.5 × 10−68
596015902rs181945740NBS0.0180.0010.0084.207.0 × 10−30.0055.901.4 × 10−67
5134199058rs73282857NBS0.0190.0060.0054.689.4 × 10−30.0067.005.6 × 10−8150
710014656rs190937828NBS0.0900.0630.0661.761.3 × 10−30.0542.301.7 × 10−614
8134595232rs10112722AGORA0.4370.3660.3471.557.5 × 10−60.3701.342.3 × 10−413
1044117049rs60001082NBS0.0850.069 *0.0651.464.3 × 10−20.0542.007.3 × 10−617
112275888rs1325019515AGORA0.380n/a0.3051.614.9 × 10−60.3121.401.1 × 10−419
1158767505rs11229723NBS0.0220.0060.0112.233.4 × 10−20.0074.411.8 × 10−664
1195161244rs192152837NBS0.0180.0040.00410.22.4 × 10−40.0056.353.6 × 10−78
1243256387rs181072352NBS0.0120.0050.0081.444.8 × 10−10.00210.401.2 × 10−65
12385846rs554290784NBS0.0150.0050.0044.231.1 × 10−20.0035.596.8 × 10−65
1375949902rs144419778AGORA0.2670.212 *0.191.712.7 × 10−60.2051.471.9 × 10−524
1460728107rs146557102NBS0.041n/a0.0182.754.8 × 10−40.0172.965.5 × 10−78
1850196707rs77493404NBS0.0220.0090.0142.251.2 × 10−20.0084.237.5 × 10−622
2044453790rs201841698NBS0.0330.0290.0202.426.1 × 10−30.0133.402.1 × 10−67
2051885312rs6126804NBS0.0300.0080.0162.587.3 × 10−30.0133.306.0 × 10−68
2232945612rs73172143NBS0.0950.0780.0681.472.7 × 10−20.0591.887.0 × 10−66
* MAF from European reference population due to absence in the GoNL database. Chr: chromosome; rsID: Reference SNV cluster ID; MAF: minor allele frequency; OR: odds ratio; n SNVs: amount of single nucleotide variants in associated genomic risk locus; n/a: not available.
Table 2. A selection of genomic risk loci with candidate affected protein-coding genes.
Table 2. A selection of genomic risk loci with candidate affected protein-coding genes.
ChrrsIDDatasetCandidate Affected GeneDistance from SNV to TSSRegulome Score [36]Variant Effect Predictor [40]Gene Function *
Genome-wide significant SNVs
7rs140804918NBSHGF171 kbp0.247IntergenicRegulates cell growth, motility and morphogenesis in various cell types and tissues
18rs184382636NBSSMIM21497 kbp0.071IntergenicIntegral membrane component
SNVs selected based on MAF in discovery and reference populations
3rs10433490NBSP3H272 kbp0.154IntergenicPosttranslational modifier of collagen IV
6rs148413365NBSKCTD2041 kbp0.163IntronicMember of AKT-mTOR-p70 S6k signalling cascade
STK3864 kbp YAP/TAZ inhibitor in hippo pathway
13rs9547854NBSPOSTN172 kbp0.112IntergenicInduces cell attachment and spreading and plays a role in cell adhesion
14rs148251525AGORABCL11B1224 kbp0.000IntergenicT cell transcription factor
15rs111283115AGORAACAN1.8 kbp0.145Upstream gene variantMajor component of extracellular matrix of cartilaginous tissues
21rs2830456NBSADAMTS196 kbp0.184IntergenicHas antiangiogenic activity, involved in inflammation and cancer cachexia
* Based on https://www.ncbi.nlm.nih.gov/gene/. Chr: chromosome; rsID: Reference SNV cluster ID; SNV: single nucleotide variant; TSS: transcription start site; kbp: kilo base pair; MAF: minor allele frequency.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Groen in ’t Woud, S.; Maj, C.; Renkema, K.Y.; Westland, R.; Galesloot, T.; van Rooij, I.A.L.M.; Vermeulen, S.H.; Feitz, W.F.J.; Roeleveld, N.; Schreuder, M.F.; et al. A Genome-Wide Association Study into the Aetiology of Congenital Solitary Functioning Kidney. Biomedicines 2022, 10, 3023. https://doi.org/10.3390/biomedicines10123023

AMA Style

Groen in ’t Woud S, Maj C, Renkema KY, Westland R, Galesloot T, van Rooij IALM, Vermeulen SH, Feitz WFJ, Roeleveld N, Schreuder MF, et al. A Genome-Wide Association Study into the Aetiology of Congenital Solitary Functioning Kidney. Biomedicines. 2022; 10(12):3023. https://doi.org/10.3390/biomedicines10123023

Chicago/Turabian Style

Groen in ’t Woud, Sander, Carlo Maj, Kirsten Y. Renkema, Rik Westland, Tessel Galesloot, Iris A. L. M. van Rooij, Sita H. Vermeulen, Wout F. J. Feitz, Nel Roeleveld, Michiel F. Schreuder, and et al. 2022. "A Genome-Wide Association Study into the Aetiology of Congenital Solitary Functioning Kidney" Biomedicines 10, no. 12: 3023. https://doi.org/10.3390/biomedicines10123023

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop