Documentation A5

Exploring Datasets and Practice on 311


World Happiness Report

I've seen other datasets from Gallup World Poll before and was interested in how it quantifies happiness and its parameters. Here's the details of this dataset:

    Collector/contributor:  United Nations
    Purpose: Happiness Indicators to inform government's decision makings
    Collection Method: Data are collected from Gallup World Poll
    Dimensions: 156 lines(countries), 12 columns

The dataset quantifies happiness of citizens from different countries from a macro perspectives. It combines individual polls and objective statistics. It's convenient to use machine learning and do a regression to estimate the correlation between Happiness and each parameters(GDP, life expectancy, etc..)

311 p5 sketch exercise

https://editor.p5js.org/roger1mjh/sketches/ZG2BkJS07


Originally I want to make some edits on the existed code to conduct a different use of the simplified 311csv and get to understand the codes. I noticed there are different types of noise complaint a lot. Therefore I focused on noise complaint.
As for the other inputs parameters, I think agency is a bit useless because almost all complaints of the same type are dealt by the same agency. After I got rid of "agency", I simplified borough and community by ZIP code, thought it would be a clear representation.
Moreover, I edit some HTML code in the index because it matters to the interface. And I used list instead of type input for complaint type, because it's more intuitively.

It runs as I expected, without any error. However, there's a big issue. The prediction value is almost always the same, no matter how I change the parameter and noise type. It doesn't seem like a bug here. So I downloaded the csv file that Lydia simplified and inspected it. I found many of the ZIP code only have few lines of report, and none of them are noise reports. Therefore, I assume the problem of my code is because the inputs I chose to use, which is Incident Zip, is not diverse/representative enough.
In response, I replaced Incident Zip with Borough. And it fixed the problem.


    
    

Comments

Popular posts from this blog

Assigment 4

Assignment 2A

A9 - Draw my Drum Machine