Index
DELIVERABLE 1
DELIVERABLE 2
DELIVERABLE 2 Revised
The main goal of deliverable 3 is to revise and improve upon deliverables 1 and 2. I will also discuss possible continuation studies, and how we can benefit from the models that were generatated from this study.
The only revision to Phase 1 was adding additional comments to the lines of code in order to ensure better understanding for inexperienced users.
After looking over the dataset obtained in phase 2, I realized that the populations only go back as far as 2010, while the dataset from phase 1 goes back as far as 2003. This is a significant amount of data that has been left out of our model. In order to fix this, I will use a new dataset which provides county populations in California as far back as FIND DATE. To view this revised version of phase 2, Click here
Our model gives us the unexpected result that human population (although significant), is not a strong enough predictor to say that places in highly populated areas have less harmful molecules in the air. This could be caused by several factors.
1. It could be that locations that produce the most of the harmful air particles are located in places with smaller populations (farming communities, large factories, etc)
2. Counties are not in a bubble, therefore air pollution from one area may spread to another area.
3. Other untested variables may come into play, such as environmental region, enforced CA Clean Air Acts, weather, etc.
Furthermore, our models low ability to explain the variation could be due to a flaw in our model. A potential reason for this flaw could be that we have so few locations throughout the state that the air quality was tested. This severely limits the number of comparisons that our model will make with Concentration and Population. In order to fix this, we should gather air quality data from a much larger pool of locations.
For future studies, it would be a good idea to not only gather more data on air pollution for comparisons, but to also include other variables, such as weather patterns of each area, elevation, and regional information that could influence air pollution, such as whether or not the location is surrounded by mountains, or flatlands. This could provide a basis for more accurate preditions with our model. However, because our model does show some significance between population and air pollution, we can take this study further by looking into what activites involving humans has the biggest effect on the level of air pollution. For example, maybe driving cars, running factories, or raising animals could be the reason that people directly influence air pollution.
By determining exactly what the causes are of significant air pollution, we would be able to cut back on the amount of pollution that we as a species are directly influencing. For example, creating more economically friendly vehicles for people to drive at reduced costs and increasing the cost of gasoline, coming up with new ways to dispose of harmful chemicals, and limiting the use of large powerplants and factories would be a good start.
Unintended consequences that could arise by the operationalization of this study could be an unseen, negative effect on the environment. For example, one area of study suggests that increases in certain harmful air particles causes a decrease in the ozone layer of the planet, causing global warming. It has also been observed that historically, the Earth follows a cycle of heating up and then cooling down. If we were to mess with the levels of pollution in the air, we may inadvertently trigger a massive environmental change which may not be observed several years later.
In addition, by changing the types of vehicles on the road and the type of fuel that is used, many auto companies could go out of business. Similarly, if other businesses are limited or forced to shut down because of the amount of pollution they are emitting, it could cause a substantial ecomomic change.