ORBS7120 - Big Data Analytics and Visualisation
Individual project in struction
1. Assessment structure
The individual project accounts for 80% of the module grade. Please choose one data set in the list below for your project. The length of the report should be of 3000 words, excluding references. All the relevant literature and resources for your project should be properly cited in the Harvard referencing style (you will find this website helpful). For your convenience,a template is provided here. Previous exemplars are available here.
Marks allocated to criteria:** | Criteria |
---|---|
20% | 1. Introduction to data and resea rch question (~1000 words)Please introduce the data set used and its background. The relevant literature (e.g., academic journal articles and textbooks) should be surveyed and properly cited with Harvard referencing style. Moreimportantly, please identify a problem to be addressed with this data set (i.e., the research question). Please note that the problem should bespecific (i.e., relevant in the application domain and linked to the variables available from the data set). |
15% | 2. Data processing and explorat ion (~500 words)Please explain: Which variables are available from the data set? Which variables have been selected for the analysis and why? Any datatransformations have been done and why? |
25% | 3. Data visualisation and interpretation (~800 words)Please provide at* least three data visualisations *as descriptive analytical results (e.g., properties of the variables selected) and advanced analytical results (e.g., relationships between the variables selected, machinelearning results). Please follow best practices taught in the moduleregarding data代 写ORBS7120 Data Analytics and Visualisation visualization. Importantly, please interpret the results and findings with details. Note that the data visualisations should be nontrivial representations of information, yet easy to interpret. |
20% | 4. Data insights and conclusions (~700 words)Please provide the insights drawn from the analytics and summarise the findings. In particular, is the problem (i.e., research question) identified at the beginning addressed by the analytics? How? |
20% | 5. Writing, styling and re ferencesThe clarity, logic and presentation of the report, including spelling,grammar and punctuation. The general styling and references should be clear and consistent. |
---|
2. Recommended datasets
NOTICE: Before starting the individual project, you will need to confirm your choice of data
set withthis link on Moodle . The link will be available from Monday, April 29 th 2024, 12:00 pm (UK time) . Any submissions without data choice confirmation will have the marks reduced accordingly.
While the use of generative AI technologies such as ChatGPT could be helpful for learning Python programming, it is important to note that employing it to produce any portion of a written report is strictly prohibited.
Please find a list of recommended datasets below. All of them are from
https://nijianmo.github.io/amazon/index.htmland have significant textual content **(i.e.,
Amazon reviews). Therefore, text analytics tools should be employed. Please note that each dataset includes reviews (ratings, text, helpfulness votes) as well as product metadata**
(descriptions, category information, price, brand, and image features), which are linked by product ASIN **number. Depending on your device’s capacity, you might use the original
dataset or the alternative 5-core version (which is smaller and easier to be processed) . You
may also use the latest version of the dataset fromhttps://amazon-reviews- 2023.github.io/main.html, which covers a longer period until 2023.
• Amazon review Amazon Fashion**
• Amazon review All Beauty**
• Amazon review Appliances
• Amazon review Arts Crafts and Sewing**
• Amazon review Automotive
• Amazon review Books
• Amazon review CDs and Vinyl
• Amazon review Cell Phones and Acc essories**
• Amazon review Clothing Shoes and Jewelry
• Amazon review Digital Music**
• Amazon review Electronics
• Amazon review Grocery and Gourmet Food**
• Amazon review Home and Kitchen**
• Amazon review Industrial and Scientific
• Amazon review Kindle Store
• Amazon review Luxury Beauty**
• Amazon review Movies a nd TV
• Amazon review Musical Instruments**
• Amazon review Office Products**
• Amazon review Patio Lawn and Garden**
• Amazon review Pet Supplies
• Amazon review Software
• Amazon review Sport s and Outdoors
• Amazon review Tools and Home Impro vement
• Amazon review Toys and Games
• Amazon review Video Games
WX:codinghelp
**粗体** _斜体_ [链接](http://example.com) `代码` - 列表 > 引用
。你还可以使用@
来通知其他用户。