TY - JOUR
T1 - Application of Gaussian Mixture Regression for the Correction of Low Cost PM2.5Monitoring Data in Accra, Ghana
AU - McFarlane, Celeste
AU - Raheja, Garima
AU - Malings, Carl
AU - Appoh, Emmanuel K.E.
AU - Hughes, Allison Felix
AU - Westervelt, Daniel M.
N1 - Publisher Copyright:
© 2021 The Authors. Published by American Chemical Society
PY - 2021/9/16
Y1 - 2021/9/16
N2 - Low-cost sensors (LCSs) for air quality monitoring have enormous potential to improve air quality data coverage in resource-limited parts of the world such as sub-Saharan Africa. LCSs, however, are affected by environment and source conditions. To establish high-quality data, LCSs must be collocated and calibrated with reference grade PM2.5monitors. From March 2020, a low-cost PurpleAir PM2.5monitor was collocated with a Met One Beta Attenuation Monitor 1020 in Accra, Ghana. While previous studies have shown that multiple linear regression (MLR) and random forest regression (RF) can improve accuracy and correlation between PurpleAir and reference data, MLR and RF yielded suboptimal improvement in the Accra collocation (R2= 0.81 andR2= 0.81, respectively). We present the first application of Gaussian mixture regression (GMR) to air quality data calibration and demonstrate improvement over traditional methods by increasing the collocated PM2.5correlation and accuracy toR2= 0.88 and MAE = 2.2 μg m-3. Gaussian mixture models (GMMs) are a probability density estimator and clustering method from which nonlinear regressions that tolerate missing inputs can be derived. We find that even when given missing inputs, GMR provides better correlation than MLR and RF performed with complete data. GMR also allows us to estimate calibration certainty. When evaluated, 95% confidence intervals agreed with reference PM2.5data 96% of the time, suggesting that the model accurately assesses its own confidence. Additionally, clustering within the GMM is consistent with climate characteristics, providing confidence that the calibration approach can learn underlying relationships in data.
AB - Low-cost sensors (LCSs) for air quality monitoring have enormous potential to improve air quality data coverage in resource-limited parts of the world such as sub-Saharan Africa. LCSs, however, are affected by environment and source conditions. To establish high-quality data, LCSs must be collocated and calibrated with reference grade PM2.5monitors. From March 2020, a low-cost PurpleAir PM2.5monitor was collocated with a Met One Beta Attenuation Monitor 1020 in Accra, Ghana. While previous studies have shown that multiple linear regression (MLR) and random forest regression (RF) can improve accuracy and correlation between PurpleAir and reference data, MLR and RF yielded suboptimal improvement in the Accra collocation (R2= 0.81 andR2= 0.81, respectively). We present the first application of Gaussian mixture regression (GMR) to air quality data calibration and demonstrate improvement over traditional methods by increasing the collocated PM2.5correlation and accuracy toR2= 0.88 and MAE = 2.2 μg m-3. Gaussian mixture models (GMMs) are a probability density estimator and clustering method from which nonlinear regressions that tolerate missing inputs can be derived. We find that even when given missing inputs, GMR provides better correlation than MLR and RF performed with complete data. GMR also allows us to estimate calibration certainty. When evaluated, 95% confidence intervals agreed with reference PM2.5data 96% of the time, suggesting that the model accurately assesses its own confidence. Additionally, clustering within the GMM is consistent with climate characteristics, providing confidence that the calibration approach can learn underlying relationships in data.
KW - Africa
KW - Gaussian mixture regression
KW - air quality
KW - low-cost sensors
KW - particulate matter
UR - http://www.scopus.com/inward/record.url?scp=85114701095&partnerID=8YFLogxK
U2 - 10.1021/acsearthspacechem.1c00217
DO - 10.1021/acsearthspacechem.1c00217
M3 - Article
AN - SCOPUS:85114701095
SN - 2472-3452
VL - 5
SP - 2268
EP - 2279
JO - ACS Earth and Space Chemistry
JF - ACS Earth and Space Chemistry
IS - 9
ER -