~~NOTOC~~ |<100% 25% - >| ^ \\ DATA ANALYTICS REFERENCE DOCUMENT\\ \\ ^^ ^ Document Title:|Document Title| ^ Document No.:|1540309128| ^ Author(s):|Rita Raher, Gerhard van der Linde| ^ Contributor(s):| | **REVISION HISTORY** |< 100% 10% - - 10% 17% 10% >| ^ \\ Revision\\ \\ ^\\ Details of Modification(s)^\\ Reason for modification^ \\ Date ^ \\ By ^ | [[:doku.php?id=assesment:52446:project:one&do=revisions|0]] |Draft release|Document description here| 2018/10/23 15:38 | Gerhard van der Linde | ---- ====== 52446 Fundamentals Project Remit ====== ===== Project 2018 - Fundamentals of Data Analysis ===== **Due:** last commit on or before December 14th This document contains the instructions for Project 2018 for Fundamentals of Data Analysis. Please be advised that all students are bound by the Quality Assurance Framework [3] at GMIT which includes the Code of Student Conduct and the Policy on Plagiarism. The onus is on the student to ensure they do not, even inadvertently, break the rules. A clean and comprehensive git history (see below) is the best way to demonstrate to the examiner that your submission is your own work. It is, however, expected that you draw on works that are not your own to build your submission and you should systematically reference those works to enhance your submission. **Problem statement** The box plot is common in data analysis for investigating individual numerical variables. In this project, you will investigate and explain box plots and their uses. The boxplot function from the Python package matplotlib.pyplot can be used to create box plots. Your submission should be in the form of a repository containing a Jupyter notebook in which you detail your findings. In your notebook, you should: * **Summarise the history** of the box plot and situations in which it used. * **Demonstrate the use** of the box plot using data of your choosing. * Explain any relevant **terminology** such as the terms quartile and percentile. * Compare the box plot to **alternatives**.