SlideShare ist ein Scribd-Unternehmen logo
1 von 8
Assignment Content
About
This assignment must be completed in a group of minimum 3
students and maximum 4 students.
This assignment is a prelude to the third assignment. It aims at
providing you with an authentic experience in carrying a simple
data science project that covers all essential stages in a data
science lifecycle. Since most professional science projects are
performed by teams, you are therefore required to complete this
assignment in a team.
Tasks
In short, you are required to complete the following tasks:
Pitch a public, open dataset of your choice.
Pitch 3 or 4 initial hypotheses to be pursued later in Assignment
3.
Profile the data using descriptive and/or inferential statistics
techniques (which also requires that you demonstrate proficient
data wrangling skills).
Present items 1, 2, and 3 above via a recorded presentation.
Your tasks are open-ended tasks, similar to most real data
science projects. This means no two teams are likely to go to
the same direction and produce similar results. You will find
that your group will become experts in interpreting your own
data and answering your own problems. Comparing performance
across teams may not be meaningful and your team will be
assessed solely against the rubric.
Your Python code base must be available on your Github repo.
The extent of the group's collaboration and individual
contribution will be evaluated solely based on Github.
General advice:
Select an open (publicly available) data - data that can be freely
downloaded, preferably with an open license, allowing you to
share the data freely. Choosing non-public data is not advisable
as your instructor may be restricted from accessing the data.
Choose data in the domain for which team member(s) has some
background.
Formulate open-ended hypotheses.
Carry out fresh data and/or analysis.
Where possible, choose a dataset and formulate problems
pertaining to practical Australian contexts.
Data
Choose only 1 dataset.
It is fine to choose a dataset that has been analysed by others
outside of the university. This is the natural consequence of
selecting open data. However, you should either show that the
analysis and exploration you plan has not been done before, or
show that there is no code already available to do the analysis
you intend. Your instructor is likely to view highly any original
investigation.
Sources of open datasets include but are not limited to:
https://data.gov.au/
https://data.nt.gov.au/
https://data.worldbank.org/
https://www.data.gov/
https://datasetsearch.research.google.com/
https://www.kaggle.com/datasets
- Be careful. Many Kaggle datasets have published analyses.
Choose something that has not been done before.
Github Classroom
Group work activities must be visible on Github Classroom.
The instructor will send an invitation to all students to join
Github Classroom after all groups are formed. To accept this
invitation, every student must have a free Github account. If
you do not already have it, please
sign up
. This is compulsory.
Marking
You should refer to the detailed marking rubric that appears on
the side panel of this window.
Submission item
Please submit the following via Learnline
by latest 11.59pm
on the due date:
1 URL to a recorded presentation per team published privately
on YouTube. Do not submit multiple recordings and do not
submit recording file unless requested specifically.
(optional) supplementary information, where applicable.
Latest Python code base on GitHub repo must be accessible to
your instructor. Snapshot of the repo will be taken at the time of
submission.
The duration of the presentation is commensurate with the team
size. Inline with the Unit Information, 2 to 3 minutes of
presentation
per team member
is required. Not complying with this requirement may attract a
mark penalty.
Example:
For a team of 3: the minimum duration is 6 minutes (2 minutes
x 3 members) and the maximum duration is 9 minutes (3
minutes x 3 members).
For a team of 4: the minimum duration is 8 minutes (2 minutes
x 4 members) and the maximum duration is 12 minutes (3
minutes x 4 members).
Academic integrity and assessment irregularities
Academic integrity is a core value at CDU and must be upheld
at all times when completing this assignment. You must not
plagirise the work of others. Please be referred to the
Students - Breach of Academic Integrity Procedures
.
Other assessment irregularities are governed by CDU's
Higher Education Assessment Procedures
.
Tips and example
Broadly speaking, your instructor is looking for evidence of
your demonstrative competency in the following key data
science skills implemented in Python:
(1) hypothesis formulation
,
(2) exploratory data analytics
,
(3) data wrangling skills
, and
(4) data visualisations
.
When pitching your dataset, consider addressing the following
concerns:
source of data
accesssibility of data
validity of data
why the dataset matters (in practical or academic terms)
domain knowledge
relevance to you
etc.
In profiling the data, consider addressing the following
concerns:
dimensionality
data types
centrality
spread
shape of data
distributions
etc.
The last task is to pitch 3 to 4 initial hypotheses. Consider
addressing the following concerns:
what might the data tells us
what would you like to explore first based on your initial data
profiling
what would you like to predict
what existing assumption you want to test previous finding
what new idea you want to test
etc.

Weitere ähnliche Inhalte

Ähnlich wie Assignment ContentAboutThis assignment must be complet.docx

Research and Commercialisation Challenges
Research and Commercialisation ChallengesResearch and Commercialisation Challenges
Research and Commercialisation ChallengesDr. Mazlan Abbas
 
Decision support systems
Decision support systemsDecision support systems
Decision support systemsMR Z
 
1. introduction to data science —
1. introduction to data science —1. introduction to data science —
1. introduction to data science —swethaT16
 
MAT115 ProjectFall 201630 pointsDue Monday December 12, 2016.docx
MAT115 ProjectFall 201630 pointsDue Monday December 12, 2016.docxMAT115 ProjectFall 201630 pointsDue Monday December 12, 2016.docx
MAT115 ProjectFall 201630 pointsDue Monday December 12, 2016.docxdrennanmicah
 
How to build a data science project in a corporate setting, by Soraya Christi...
How to build a data science project in a corporate setting, by Soraya Christi...How to build a data science project in a corporate setting, by Soraya Christi...
How to build a data science project in a corporate setting, by Soraya Christi...WiMLDSMontreal
 
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and PredictionUsing ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and Predictionijtsrd
 
Data management plans (dmp) for nsf
Data management plans (dmp) for nsfData management plans (dmp) for nsf
Data management plans (dmp) for nsfBrad Houston
 
Data management plans (dmp) for nsf
Data management plans (dmp) for nsfData management plans (dmp) for nsf
Data management plans (dmp) for nsfBrad Houston
 
Cloudera Data Science Challenge 3 Solution by Doug Needham
Cloudera Data Science Challenge 3 Solution by Doug NeedhamCloudera Data Science Challenge 3 Solution by Doug Needham
Cloudera Data Science Challenge 3 Solution by Doug NeedhamDoug Needham
 
Writing Question.docx
Writing Question.docxWriting Question.docx
Writing Question.docxbkbk37
 
Research data management
Research data managementResearch data management
Research data managementHugo Besemer
 
Modules module5mod5home.htmlmodule 5 homecomparing models
Modules module5mod5home.htmlmodule 5   homecomparing modelsModules module5mod5home.htmlmodule 5   homecomparing models
Modules module5mod5home.htmlmodule 5 homecomparing modelsPOLY33
 
Predicting students' performance using id3 and c4.5 classification algorithms
Predicting students' performance using id3 and c4.5 classification algorithmsPredicting students' performance using id3 and c4.5 classification algorithms
Predicting students' performance using id3 and c4.5 classification algorithmsIJDKP
 
Assignment Title Conducting Primary ResearchDeveloping the ab.docx
Assignment Title Conducting Primary ResearchDeveloping the ab.docxAssignment Title Conducting Primary ResearchDeveloping the ab.docx
Assignment Title Conducting Primary ResearchDeveloping the ab.docxssuser562afc1
 

Ähnlich wie Assignment ContentAboutThis assignment must be complet.docx (20)

Research and Commercialisation Challenges
Research and Commercialisation ChallengesResearch and Commercialisation Challenges
Research and Commercialisation Challenges
 
Decision support systems
Decision support systemsDecision support systems
Decision support systems
 
1. introduction to data science —
1. introduction to data science —1. introduction to data science —
1. introduction to data science —
 
De carlo rizk 2010 icelw
De carlo rizk 2010 icelwDe carlo rizk 2010 icelw
De carlo rizk 2010 icelw
 
MAT115 ProjectFall 201630 pointsDue Monday December 12, 2016.docx
MAT115 ProjectFall 201630 pointsDue Monday December 12, 2016.docxMAT115 ProjectFall 201630 pointsDue Monday December 12, 2016.docx
MAT115 ProjectFall 201630 pointsDue Monday December 12, 2016.docx
 
How to build a data science project in a corporate setting, by Soraya Christi...
How to build a data science project in a corporate setting, by Soraya Christi...How to build a data science project in a corporate setting, by Soraya Christi...
How to build a data science project in a corporate setting, by Soraya Christi...
 
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and PredictionUsing ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
 
Data management plans (dmp) for nsf
Data management plans (dmp) for nsfData management plans (dmp) for nsf
Data management plans (dmp) for nsf
 
Data management plans (dmp) for nsf
Data management plans (dmp) for nsfData management plans (dmp) for nsf
Data management plans (dmp) for nsf
 
Cloudera Data Science Challenge 3 Solution by Doug Needham
Cloudera Data Science Challenge 3 Solution by Doug NeedhamCloudera Data Science Challenge 3 Solution by Doug Needham
Cloudera Data Science Challenge 3 Solution by Doug Needham
 
Webquest
WebquestWebquest
Webquest
 
Turning Information chaos into reliable data
Turning Information chaos into reliable dataTurning Information chaos into reliable data
Turning Information chaos into reliable data
 
Writing Question.docx
Writing Question.docxWriting Question.docx
Writing Question.docx
 
Chapter 1: Introduction to Data Mining
Chapter 1: Introduction to Data MiningChapter 1: Introduction to Data Mining
Chapter 1: Introduction to Data Mining
 
Research data management
Research data managementResearch data management
Research data management
 
Modules module5mod5home.htmlmodule 5 homecomparing models
Modules module5mod5home.htmlmodule 5   homecomparing modelsModules module5mod5home.htmlmodule 5   homecomparing models
Modules module5mod5home.htmlmodule 5 homecomparing models
 
Deep learning for NLP
Deep learning for NLPDeep learning for NLP
Deep learning for NLP
 
Machine Learning - Deep Learning
Machine Learning - Deep LearningMachine Learning - Deep Learning
Machine Learning - Deep Learning
 
Predicting students' performance using id3 and c4.5 classification algorithms
Predicting students' performance using id3 and c4.5 classification algorithmsPredicting students' performance using id3 and c4.5 classification algorithms
Predicting students' performance using id3 and c4.5 classification algorithms
 
Assignment Title Conducting Primary ResearchDeveloping the ab.docx
Assignment Title Conducting Primary ResearchDeveloping the ab.docxAssignment Title Conducting Primary ResearchDeveloping the ab.docx
Assignment Title Conducting Primary ResearchDeveloping the ab.docx
 

Mehr von williejgrant41084

Assignment ContentIn your first meeting, you will have to .docx
Assignment ContentIn your first meeting, you will have to .docxAssignment ContentIn your first meeting, you will have to .docx
Assignment ContentIn your first meeting, you will have to .docxwilliejgrant41084
 
Assignment ContentMany different threats can arise to the .docx
Assignment ContentMany different threats can arise to the .docxAssignment ContentMany different threats can arise to the .docx
Assignment ContentMany different threats can arise to the .docxwilliejgrant41084
 
Assignment ContentMany information security policies cross the.docx
Assignment ContentMany information security policies cross the.docxAssignment ContentMany information security policies cross the.docx
Assignment ContentMany information security policies cross the.docxwilliejgrant41084
 
Assignment ContentMaintaining a healthy work-life balance .docx
Assignment ContentMaintaining a healthy work-life balance .docxAssignment ContentMaintaining a healthy work-life balance .docx
Assignment ContentMaintaining a healthy work-life balance .docxwilliejgrant41084
 
Assignment ContentIn this section, you will be evaluating .docx
Assignment ContentIn this section, you will be evaluating .docxAssignment ContentIn this section, you will be evaluating .docx
Assignment ContentIn this section, you will be evaluating .docxwilliejgrant41084
 
Assignment ContentIssues related prejudice, discrimination.docx
Assignment ContentIssues related prejudice, discrimination.docxAssignment ContentIssues related prejudice, discrimination.docx
Assignment ContentIssues related prejudice, discrimination.docxwilliejgrant41084
 
Assignment ContentIn your first meeting, you will have to pres.docx
Assignment ContentIn your first meeting, you will have to pres.docxAssignment ContentIn your first meeting, you will have to pres.docx
Assignment ContentIn your first meeting, you will have to pres.docxwilliejgrant41084
 
Assignment ContentIn Week 1, you discussed GIG, Inc.s ben.docx
Assignment ContentIn Week 1, you discussed GIG, Inc.s ben.docxAssignment ContentIn Week 1, you discussed GIG, Inc.s ben.docx
Assignment ContentIn Week 1, you discussed GIG, Inc.s ben.docxwilliejgrant41084
 
Assignment ContentIn the health care industry, a variety of st.docx
Assignment ContentIn the health care industry, a variety of st.docxAssignment ContentIn the health care industry, a variety of st.docx
Assignment ContentIn the health care industry, a variety of st.docxwilliejgrant41084
 
Assignment ContentIn the health care industry, a variety o.docx
Assignment ContentIn the health care industry, a variety o.docxAssignment ContentIn the health care industry, a variety o.docx
Assignment ContentIn the health care industry, a variety o.docxwilliejgrant41084
 
Assignment ContentIn the workplace, a team manages many p.docx
Assignment ContentIn the workplace, a team manages many p.docxAssignment ContentIn the workplace, a team manages many p.docx
Assignment ContentIn the workplace, a team manages many p.docxwilliejgrant41084
 
Assignment ContentImagine that your hospital has recently me.docx
Assignment ContentImagine that your hospital has recently me.docxAssignment ContentImagine that your hospital has recently me.docx
Assignment ContentImagine that your hospital has recently me.docxwilliejgrant41084
 
Assignment ContentImagine you have been working for a health c.docx
Assignment ContentImagine you have been working for a health c.docxAssignment ContentImagine you have been working for a health c.docx
Assignment ContentImagine you have been working for a health c.docxwilliejgrant41084
 
Assignment ContentImagine your Learning Team is the human reso.docx
Assignment ContentImagine your Learning Team is the human reso.docxAssignment ContentImagine your Learning Team is the human reso.docx
Assignment ContentImagine your Learning Team is the human reso.docxwilliejgrant41084
 
Assignment ContentImagine you have been hired to conduct a s.docx
Assignment ContentImagine you have been hired to conduct a s.docxAssignment ContentImagine you have been hired to conduct a s.docx
Assignment ContentImagine you have been hired to conduct a s.docxwilliejgrant41084
 
Assignment ContentImagine you are working as a manager in a lo.docx
Assignment ContentImagine you are working as a manager in a lo.docxAssignment ContentImagine you are working as a manager in a lo.docx
Assignment ContentImagine you are working as a manager in a lo.docxwilliejgrant41084
 
Assignment ContentImagine you have just been promoted as.docx
Assignment ContentImagine you have just been promoted as.docxAssignment ContentImagine you have just been promoted as.docx
Assignment ContentImagine you have just been promoted as.docxwilliejgrant41084
 
Assignment ContentImagine you are the office manager at .docx
Assignment ContentImagine you are the office manager at .docxAssignment ContentImagine you are the office manager at .docx
Assignment ContentImagine you are the office manager at .docxwilliejgrant41084
 
Assignment ContentImagine you are asked to design an inf.docx
Assignment ContentImagine you are asked to design an inf.docxAssignment ContentImagine you are asked to design an inf.docx
Assignment ContentImagine you are asked to design an inf.docxwilliejgrant41084
 
Assignment ContentImagine you are an IT manager for an org.docx
Assignment ContentImagine you are an IT manager for an org.docxAssignment ContentImagine you are an IT manager for an org.docx
Assignment ContentImagine you are an IT manager for an org.docxwilliejgrant41084
 

Mehr von williejgrant41084 (20)

Assignment ContentIn your first meeting, you will have to .docx
Assignment ContentIn your first meeting, you will have to .docxAssignment ContentIn your first meeting, you will have to .docx
Assignment ContentIn your first meeting, you will have to .docx
 
Assignment ContentMany different threats can arise to the .docx
Assignment ContentMany different threats can arise to the .docxAssignment ContentMany different threats can arise to the .docx
Assignment ContentMany different threats can arise to the .docx
 
Assignment ContentMany information security policies cross the.docx
Assignment ContentMany information security policies cross the.docxAssignment ContentMany information security policies cross the.docx
Assignment ContentMany information security policies cross the.docx
 
Assignment ContentMaintaining a healthy work-life balance .docx
Assignment ContentMaintaining a healthy work-life balance .docxAssignment ContentMaintaining a healthy work-life balance .docx
Assignment ContentMaintaining a healthy work-life balance .docx
 
Assignment ContentIn this section, you will be evaluating .docx
Assignment ContentIn this section, you will be evaluating .docxAssignment ContentIn this section, you will be evaluating .docx
Assignment ContentIn this section, you will be evaluating .docx
 
Assignment ContentIssues related prejudice, discrimination.docx
Assignment ContentIssues related prejudice, discrimination.docxAssignment ContentIssues related prejudice, discrimination.docx
Assignment ContentIssues related prejudice, discrimination.docx
 
Assignment ContentIn your first meeting, you will have to pres.docx
Assignment ContentIn your first meeting, you will have to pres.docxAssignment ContentIn your first meeting, you will have to pres.docx
Assignment ContentIn your first meeting, you will have to pres.docx
 
Assignment ContentIn Week 1, you discussed GIG, Inc.s ben.docx
Assignment ContentIn Week 1, you discussed GIG, Inc.s ben.docxAssignment ContentIn Week 1, you discussed GIG, Inc.s ben.docx
Assignment ContentIn Week 1, you discussed GIG, Inc.s ben.docx
 
Assignment ContentIn the health care industry, a variety of st.docx
Assignment ContentIn the health care industry, a variety of st.docxAssignment ContentIn the health care industry, a variety of st.docx
Assignment ContentIn the health care industry, a variety of st.docx
 
Assignment ContentIn the health care industry, a variety o.docx
Assignment ContentIn the health care industry, a variety o.docxAssignment ContentIn the health care industry, a variety o.docx
Assignment ContentIn the health care industry, a variety o.docx
 
Assignment ContentIn the workplace, a team manages many p.docx
Assignment ContentIn the workplace, a team manages many p.docxAssignment ContentIn the workplace, a team manages many p.docx
Assignment ContentIn the workplace, a team manages many p.docx
 
Assignment ContentImagine that your hospital has recently me.docx
Assignment ContentImagine that your hospital has recently me.docxAssignment ContentImagine that your hospital has recently me.docx
Assignment ContentImagine that your hospital has recently me.docx
 
Assignment ContentImagine you have been working for a health c.docx
Assignment ContentImagine you have been working for a health c.docxAssignment ContentImagine you have been working for a health c.docx
Assignment ContentImagine you have been working for a health c.docx
 
Assignment ContentImagine your Learning Team is the human reso.docx
Assignment ContentImagine your Learning Team is the human reso.docxAssignment ContentImagine your Learning Team is the human reso.docx
Assignment ContentImagine your Learning Team is the human reso.docx
 
Assignment ContentImagine you have been hired to conduct a s.docx
Assignment ContentImagine you have been hired to conduct a s.docxAssignment ContentImagine you have been hired to conduct a s.docx
Assignment ContentImagine you have been hired to conduct a s.docx
 
Assignment ContentImagine you are working as a manager in a lo.docx
Assignment ContentImagine you are working as a manager in a lo.docxAssignment ContentImagine you are working as a manager in a lo.docx
Assignment ContentImagine you are working as a manager in a lo.docx
 
Assignment ContentImagine you have just been promoted as.docx
Assignment ContentImagine you have just been promoted as.docxAssignment ContentImagine you have just been promoted as.docx
Assignment ContentImagine you have just been promoted as.docx
 
Assignment ContentImagine you are the office manager at .docx
Assignment ContentImagine you are the office manager at .docxAssignment ContentImagine you are the office manager at .docx
Assignment ContentImagine you are the office manager at .docx
 
Assignment ContentImagine you are asked to design an inf.docx
Assignment ContentImagine you are asked to design an inf.docxAssignment ContentImagine you are asked to design an inf.docx
Assignment ContentImagine you are asked to design an inf.docx
 
Assignment ContentImagine you are an IT manager for an org.docx
Assignment ContentImagine you are an IT manager for an org.docxAssignment ContentImagine you are an IT manager for an org.docx
Assignment ContentImagine you are an IT manager for an org.docx
 

Kürzlich hochgeladen

BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991RKavithamani
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 

Kürzlich hochgeladen (20)

BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 

Assignment ContentAboutThis assignment must be complet.docx

  • 1. Assignment Content About This assignment must be completed in a group of minimum 3 students and maximum 4 students. This assignment is a prelude to the third assignment. It aims at providing you with an authentic experience in carrying a simple data science project that covers all essential stages in a data science lifecycle. Since most professional science projects are performed by teams, you are therefore required to complete this assignment in a team. Tasks In short, you are required to complete the following tasks: Pitch a public, open dataset of your choice. Pitch 3 or 4 initial hypotheses to be pursued later in Assignment 3. Profile the data using descriptive and/or inferential statistics techniques (which also requires that you demonstrate proficient data wrangling skills). Present items 1, 2, and 3 above via a recorded presentation.
  • 2. Your tasks are open-ended tasks, similar to most real data science projects. This means no two teams are likely to go to the same direction and produce similar results. You will find that your group will become experts in interpreting your own data and answering your own problems. Comparing performance across teams may not be meaningful and your team will be assessed solely against the rubric. Your Python code base must be available on your Github repo. The extent of the group's collaboration and individual contribution will be evaluated solely based on Github. General advice: Select an open (publicly available) data - data that can be freely downloaded, preferably with an open license, allowing you to share the data freely. Choosing non-public data is not advisable as your instructor may be restricted from accessing the data. Choose data in the domain for which team member(s) has some background. Formulate open-ended hypotheses. Carry out fresh data and/or analysis. Where possible, choose a dataset and formulate problems pertaining to practical Australian contexts.
  • 3. Data Choose only 1 dataset. It is fine to choose a dataset that has been analysed by others outside of the university. This is the natural consequence of selecting open data. However, you should either show that the analysis and exploration you plan has not been done before, or show that there is no code already available to do the analysis you intend. Your instructor is likely to view highly any original investigation. Sources of open datasets include but are not limited to: https://data.gov.au/ https://data.nt.gov.au/ https://data.worldbank.org/ https://www.data.gov/ https://datasetsearch.research.google.com/ https://www.kaggle.com/datasets - Be careful. Many Kaggle datasets have published analyses. Choose something that has not been done before.
  • 4. Github Classroom Group work activities must be visible on Github Classroom. The instructor will send an invitation to all students to join Github Classroom after all groups are formed. To accept this invitation, every student must have a free Github account. If you do not already have it, please sign up . This is compulsory. Marking You should refer to the detailed marking rubric that appears on the side panel of this window. Submission item Please submit the following via Learnline by latest 11.59pm on the due date: 1 URL to a recorded presentation per team published privately on YouTube. Do not submit multiple recordings and do not submit recording file unless requested specifically. (optional) supplementary information, where applicable.
  • 5. Latest Python code base on GitHub repo must be accessible to your instructor. Snapshot of the repo will be taken at the time of submission. The duration of the presentation is commensurate with the team size. Inline with the Unit Information, 2 to 3 minutes of presentation per team member is required. Not complying with this requirement may attract a mark penalty. Example: For a team of 3: the minimum duration is 6 minutes (2 minutes x 3 members) and the maximum duration is 9 minutes (3 minutes x 3 members). For a team of 4: the minimum duration is 8 minutes (2 minutes x 4 members) and the maximum duration is 12 minutes (3 minutes x 4 members). Academic integrity and assessment irregularities Academic integrity is a core value at CDU and must be upheld at all times when completing this assignment. You must not plagirise the work of others. Please be referred to the Students - Breach of Academic Integrity Procedures .
  • 6. Other assessment irregularities are governed by CDU's Higher Education Assessment Procedures . Tips and example Broadly speaking, your instructor is looking for evidence of your demonstrative competency in the following key data science skills implemented in Python: (1) hypothesis formulation , (2) exploratory data analytics , (3) data wrangling skills , and (4) data visualisations . When pitching your dataset, consider addressing the following concerns: source of data accesssibility of data validity of data why the dataset matters (in practical or academic terms) domain knowledge
  • 7. relevance to you etc. In profiling the data, consider addressing the following concerns: dimensionality data types centrality spread shape of data distributions etc. The last task is to pitch 3 to 4 initial hypotheses. Consider addressing the following concerns: what might the data tells us what would you like to explore first based on your initial data profiling what would you like to predict what existing assumption you want to test previous finding what new idea you want to test