NMMAPSdata R Package

Publications  |  Software   |  Data  |   Rweb  |  iHAPSS    

 

Frequently Asked Questions about the NMMAPS Data

  1. Why are these data here?

    The NMMAPS database is made available here for the purpose of reproducing published findings about air pollution and health. Users are free to download the data and conduct their own analyses and explore their own models.

    Unfortunately, we are unable to distribute more detailed or finer grained data than that which is available on the website.

  2. Why are there negative values in the pollution series?

    In short, the pollutant data have been detrended—more details can be found in the document PollutantProcess.pdf. Before the pollution data are averaged across monitors they have a very smooth trend subtracted off. That is why variables with the "tmean" suffix have negative values (the same is true for variables with the "mean" suffix).

    The median of the trends is stored in a variable with suffix "mtrend". Adding a variable ending in "tmean" with its corresponding "mtrend" variable should get you something resembling the original averaged values. There is a basic flowchart describing the processing of the pollutant data.

    Adding the "tmean" and "mtrend" variables adds the average detrended series with the median of the long term trends from each monitor. It is not an exact reconstruction of any particular series.

    Variables ending with the "mean" suffix have been processed similarly, but instead of trimmed mean, a standard arithmetic mean is used to combine data across monitors.

  3. What types of monitors were used in NMMAPS?

    The pollution data here are taken from the "population-oriented" monitors.


Please send comments to Roger D. Peng (rpeng at jhsph.edu)