DATA ANALYTICS REFERENCE DOCUMENT | |
---|---|
Document Title: | Document Title |
Document No.: | 1540309050 |
Author(s): | Rita Raher, Gerhard van der Linde |
Contributor(s): | |
REVISION HISTORY
Revision | Details of Modification(s) | Reason for modification | Date | By |
---|---|---|---|---|
0 | Draft release | Document description here | 2018/10/23 15:37 | Rita Raher |
52446 Fundamentals Assignment One Remit
Fundamentals of Data Analysis: Assignment 2018
Due: last commit on or before November 11th
This document contains the instructions for Assignment 2018 for Fundamentals of Data Analysis. Please be advised that all students are bound by the Quality Assurance Framework [4] at GMIT, which includes the Code of Student Conduct and the Policy on Plagiarism. The onus is on the student to ensure they do not, even inadvertently, break the rules. A clean and comprehensive git history (see below) is the best way to demonstrate to the examiner that your submission is your own work. It is, however, expected that you draw on works that are not your own to build your submission, and you should systematically reference those works to enhance your submission.
Problem statement
The following assignment concerns the well-known Anscombe’s quartet dataset [2]. You are required to create a Jupyter [5] notebook analysing the dataset. There are four distinct tasks to be carried out in your Jupyter notebook.
- Explain the background to the dataset – who created it, when it was created, and any speculation you can find regarding how it might have been created.
- Plot the interesting aspects of the dataset.
- Calculate the descriptive statistics of the variables in the dataset.
- Explain why the dataset is interesting, referring to the plots and statistics above.
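As a starting point for the descriptive-statistics task, the following is a minimal sketch using only the Python standard library. It assumes the eleven-point values of the quartet as they are commonly reproduced from Anscombe's 1973 paper; the names `QUARTET`, `describe` and `SUMMARY` are illustrative choices, not part of the assignment, and a notebook submission might equally use pandas or NumPy.

```python
from math import sqrt
from statistics import mean, variance

# Anscombe's quartet, typed in from the commonly reproduced table.
# Datasets I, II and III share the same x values.
X123 = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
QUARTET = {
    "I": (X123, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96,
                 7.24, 4.26, 10.84, 4.82, 5.68]),
    "II": (X123, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10,
                  6.13, 3.10, 9.13, 7.26, 4.74]),
    "III": (X123, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84,
                   6.08, 5.39, 8.15, 6.42, 5.73]),
    "IV": ([8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8],
           [6.58, 5.76, 7.71, 8.84, 8.47, 7.04,
            5.25, 12.50, 5.56, 7.91, 6.89]),
}


def describe(x, y):
    """Return basic descriptive statistics for one (x, y) dataset."""
    mx, my = mean(x), mean(y)
    # Sums of squared deviations and cross-products about the means.
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    slope = sxy / sxx  # least-squares slope of y on x
    return {
        "mean_x": mx,
        "mean_y": my,
        "var_x": variance(x),  # sample variance (n - 1 denominator)
        "var_y": variance(y),
        "corr": sxy / sqrt(sxx * syy),  # Pearson correlation
        "slope": slope,
        "intercept": my - slope * mx,
    }


SUMMARY = {name: describe(x, y) for name, (x, y) in QUARTET.items()}

if __name__ == "__main__":
    for name, stats in SUMMARY.items():
        print(name, {key: round(value, 2) for key, value in stats.items()})
```

Comparing the four rows of the printed summary shows how closely the datasets agree on these statistics, which is the observation the plotting and explanation tasks then build on.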
Useful Links
Frank Anscombe's 1973 paper: Graphs in Statistical Analysis
http://www.sjsu.edu/faculty/gerstman/StatPrimer/anscombe1973.pdf