Lesson 19: Comparing Distributions with Seaborn
April 2, 2018
Review:
- Quantitative exploratory data analysis
- Term Project
- Vehicle Population predictions
- I assign (or you request) a Make-Model (e.g., Honda Accord or Ford Mustang)
- You estimate population for each County in Colorado (n=64)
- Details Wed
- Vehicle Population predictions
Presentation:
- Visual Comparison of Data Distributions
- Understanding differences among categories (nominal)
- Histograms (and density curves)
- Good for evaluating shape of distribution
- Not as good for comparison of multiple distributions
- Better Options
- Boxplots
- Stripplots (with jitter)
- Other (sort of weird) options
- Swarmplots
- Violinplots
- Sample Code
Activity:
- Compare distributions of Selling Price by neighborhood
- Use Python and Seaborn
- Experiment with Boxplots, Stripplots and Swarmplots
- Select the graphic, or combination of graphics, you think best illustrates neighborhood differences
- Be sure to include a title, axis labels, legend, your name, section
- Print and submit (in class today)
Assignment:
- Complete Ch. 3 Thinking Probabilistically – Discrete Variables in Statistical Thinking in Python (Part 1)