Data Analytics for Business Capstone
Semester 2, 2022
Assignment 1 (individual assignment)

1. Key information

Required submissions:
Written report (in pdf, due date: Monday, September 5 by the end of the day).Confidentiality Deep Poll online form (deadline for submission: August 29).ubmission instructions for the report will be posted on Canvas in Week 5.
Weight: 30% of your final grade.
Length: Your written report should have a maximum of 12 pages (single spaced, 11pt). Coverpage, references, and appendix (if any) will not count towards the page limit. Please keep inmind that making good use of your audience’s time is an essential business skill: every
sentence, table or figure should serve a purpose.

2. Problem description

Please start by reading through the Project Outline pdf document posted on the pagecontaining the full dataset for your industry project (you will receive an email with a link tothis page after filling out the Confidentiality Deed Poll online form on the Assignment overviewpage); the Project Outline document is also provided on the last slide of the presentation for
your industry project in Module 1. Focus on the Problem Description section of the ProjectOutline, especially the first and the third bullet points in this section (EDA and Strategy), which
are the most relevant bullet points for Assignment 1.
As a business analyst, you will conduct Exploratory Data Analysis (EDA) of the datacorresponding to your industry project. You should aim to find or reveal all relevant properties,characteristics, patterns, and statistics hidden in the data, supporting your findings witinsightful plots and relevant statistical output.

Your analysis should be in line with the tasks listed under the first and the third bullet points(EDA and Strategy) in the Problem Description section of the corresponding Project Outline.Use the results from your EDA to outline a preliminary strategy or provide preliminaryrecommendations to the management team corresponding to your selected industry project.
You will have a chance to refine these recommendations in Assignment 2. Please refrain fromextensive modelling and model selection – you will do them in Assignment 2. However, feelfree to fit simple models (e.g., linear regression or logistic regression) for the purposes of EDAand understanding the relationships among the variables in the dataset.BUSINESS SCHOOLPage 2 of 4

3. Written report

The purpose of the report is to describe, explain, and justify your findings to the managementteam corresponding to your selected industry project. You may assume that team membershave training in business analytics, however, they are not experts in statistics or machinelearning. The team’s time is important: please be concise and objective.
Suggested outline for the main parts of the report (further details below):

  1. Problem formulation.
  2. Data processing.
  3. Exploratory Data Analysis (EDA).
  4. Conclusions and preliminary recommendations.
    You should consider breaking down the longer parts into smaller sections.

4. Marking Scheme

Business context and problem formulation. 5 marks
Data processing. 30 marksExploratory Data Analysis (EDA). 45 marks
Conclusions and preliminary recommendations. 10 marksWriting and presentation of the report. 10 marksTotal 100 marks

5. Rubric (basic requirements)

Business context and problem formulation. Your report gives a detailed description of theproblem that is being investigated, providing the context and background for the analysis.

Data processing. You describe the data processing steps clearly and in sufficient detail,justifying and explaining your choices and decisions. You handle missing values and otherdata issues appropriately. You describe and explain your data transformations and/or yourfeature engineering process. Your choices and decisions are justified by data analysis, domainknowledge, logic, and trial and error (if necessary).

Exploratory data analysis (EDA). Your report provides a comprehensive description of yourEDA process, presenting selected results. Your analysis is sufficiently rich, and yourvisualizations are insightful. You study key variables and relationships among them usingappropriate plots and descriptive statistics. You note any features of the data that may berelevant for model building in Assignment 2. You note the presence of outliers and any otheranomalies that can affect the analysis. You explain the relevance of the EDA results to theunderlying business problem and your subsequent recommendations. Youclearly describeand justify the methods in your analysis. The choice of methods is logically related to thesubstantive problem, underlying theoretical knowledge, and data analysis. You interpret thestatistical outputs that you provide. You report crucial assumptions and whether they are
potentially violated.BUSINESS SCHOOLPage 3 of 4Conclusions and recommendations. The reasoning from the analysis andresults to yourconclusions and recommendations is logical and convincing. Yourconclusions andrecommendations are written in plain language appropriate for nontechnical audience.

Writing. Your writing is concise, clear, precise, and free of grammatical and spelling errors.You use appropriate technical terminology. Your paragraphs and sentences follow a clear logicand are well connected. If you use an abbreviation or label, you define it first.Report layout. Your report is well organised and professionally presented, as if it had beenprepared for a client later in your career. There are clear divisions between sections andparagraphs.
Tables. Your tables are appropriately formatted and have a clear layout. The tables haveinformative row and column labels. The tables are relatively easy to understand on their own.The tables do not contain information which is irrelevant to the discussion in your report. Thetables are placed near the relevant discussion in your report. There is no text around yourtables, and your tables are not images.

Figures (plots). Your figures are easy to understand and have informative titles, captions,
labels, and legends. The figures are well formatted and laid out. The figures are placed near
the relevant discussion in your report. Your figures have appropriate definition and quality.There is no text around your figures, and your figures are not screenshots.

Numbers. All numerical results are reported to suitable precision (typically no more than threedecimal places, in some cases fewer).

Referencing. You follow the University of Sydney referencing rules and guidelines.ython code. The text of your report should be entirely free of Python code.Note: you are strongly encouraged to use Python for all the steps of your data analysis. Whilethere is no Python code submission for Assignment 1, you should keep your code well-organized, so that you can easily extend/modify/reuse this code for the purposes of
Assignment 2 (which will have a Python code submission requirement).

6. Deductions

Marks may be deducted from each item in the marking scheme in the following cases:The report is disorganised and/or has a poor layout.
There is an excess of abbreviations or labels that the reader may be unfamiliar with.The report has an excessive number of grammatical or spelling mistakes.The tables are difficult to read, for example, due to poor layout or labelling.The figures are difficult to read, for example, due to poor layout or labelling.Numbers are not appropriately rounded.

7. Late Submission of the report

Late submissions are subject to a deduction of 5% of the maximum mark for each calendarday after the due date. After ten calendar days late, a mark of zero will be awarded.

8. Late submission of the Confidentiality Deed Poll online form

It is a requirement of our QBUS6600 unit that all students complete the Confidentiality DeedPoll online form before gaining access to the datasets for the industry projects. The datasetsare highly confidential, and you have responsibility to keep themsecure and only use themfor your QBUS6600 coursework. Submission of the Confidentiality Deed Poll online formafter the August 29st deadline is subject to a penalty of 25 points for Assignment 1.Furthermore, assignments without a submission of the online form will not be marked.
WX:codehelp


pjlgzvaj
1 声望0 粉丝