DATA ANALYTICS REFERENCE DOCUMENT |
|
---|---|
Document Title: | Document Title |
Document No.: | 1540309128 |
Author(s): | Rita Raher, Gerhard van der Linde |
Contributor(s): |
REVISION HISTORY
Revision | Details of Modification(s) | Reason for modification | Date | By |
---|---|---|---|---|
0 | Draft release | Document description here | 2018/10/23 15:38 | Gerhard van der Linde |
52446 Fundamentals Project Remit
Project 2018 - Fundamentals of Data Analysis
Due: last commit on or before December 14th
This document contains the instructions for Project 2018 for Fundamentals of Data Analysis. Please be advised that all students are bound by the Quality Assurance Framework [3] at GMIT which includes the Code of Student Conduct and the Policy on Plagiarism. The onus is on the student to ensure they do not, even inadvertently, break the rules. A clean and comprehensive git history (see below) is the best way to demonstrate to the examiner that your submission is your own work. It is, however, expected that you draw on works that are not your own to build your submission and you should systematically reference those works to enhance your submission.
Problem statement
The box plot is common in data analysis for investigating individual numerical variables. In this project, you will investigate and explain box plots and their uses. The boxplot function from the Python package matplotlib.pyplot can be used to create box plots. Your submission should be in the form of a repository containing a Jupyter notebook in which you detail your findings. In your notebook, you should:
- Summarise the history of the box plot and situations in which it used.
- Demonstrate the use of the box plot using data of your choosing.
- Explain any relevant terminology such as the terms quartile and percentile.
- Compare the box plot to alternatives.