Quantifying mean, variability, and uncertainty in indoor radon exposure in Pennsylvania using random forest and quantile regression forest models.
Radon is a naturally occurring radioactive gas that poses a serious health risk as the primary cause of lung cancer in non-smokers.
- 표본수 (n) 718,111
APA
Lee H, Maguire D, et al. (2026). Quantifying mean, variability, and uncertainty in indoor radon exposure in Pennsylvania using random forest and quantile regression forest models.. Scientific reports. https://doi.org/10.1038/s41598-026-37891-3
MLA
Lee H, et al.. "Quantifying mean, variability, and uncertainty in indoor radon exposure in Pennsylvania using random forest and quantile regression forest models.." Scientific reports, 2026.
PMID
41786814
Abstract
Radon is a naturally occurring radioactive gas that poses a serious health risk as the primary cause of lung cancer in non-smokers. Despite the well-known adverse association with health outcomes, current radon exposure assessments are limited to county-level or average-level estimates, which fail to capture regional variability. This study uses Machine Learning models, including Random Forest (RF) and Quantile Regression Forest (QRF), to estimate the indoor radon concentrations at the ZCTA (Zip code tabulation area)-level and characterize uncertainties in model estimates. Incorporating geological, meteorological, and building-specific data, the models aim to improve radon risk assessment by capturing mean exposure, variability, and extreme concentration levels. Processed radon test data (n = 718,111) were analyzed using average, variability, and quantile prediction methods. Models that estimate the average radon exposure at the ZCTA-level can yield promising model-fit results, but they do not capture the underlying variability of indoor radon exposure within a ZCTA. We utilize volatility analyses to identify characteristics indicative of high variability of indoor radon exposure. We also show that a QRF model can be used to estimate upper quantiles of residential radon exposure, thereby uncovering localized areas of elevated exposure that were not apparent in mean estimates. The results highlighted the need for a deep characterization of exposure risk and show that regions with moderate average exposure levels could still harbor extreme outliers with implications for evaluating health risks. Utilizing multiple radon exposure models allows for a deeper characterization of radon risk within a geographic area and can better identify high-risk areas. The results from this study provide a foundation for developing mitigation strategies and examining associations between radon exposure and health outcomes at fine scales. Future research should extend the geographic scope and incorporate additional environmental risk factors to establish a comprehensive framework for risk assessment.
같은 제1저자의 인용 많은 논문 (5)
- Superior Efficacy of Bismuth-Containing Rifabutin Quadruple Therapy over Rifabutin Triple Therapy as Salvage Treatment.
- [Preventive Surgery for Hereditary Gastric Cancer].
- Characteristics of Germline and Somatic Mutations of DNA Repair Genes in Korean Men with Prostate Cancer.
- Identification and Validation of a Novel Theranostic Target in Triple Negative Breast Cancer with Transcriptomics and Protein Analyses.
- Structural insights into disease-associated mutations in the microRNA processing machinery.