AM11 CNN SVM Text Mining

Individual Assignment AM11

Project Selection: Choose a problem where you will use at least one out of the 5
topics that you have learnt to help solve a problem of your choice (CNN, SVM, Text Mining,
PCA, Recommendation Systems).
Þ The project should have a well-defined goal, such as classification, clustering,
recommendation etc.
Þ Plagiarism will result in 0 marks (e.g. replication of an existing Kaggle notebook).
Your work must be original and well documented to explain your workings.
Þ The complexity of your project should match the time available for submission.
Þ The complexity of your work will reflect your grade (e.g. if you decide to work with a
dataset that requires PCA pre-processing before classifying with SVM, thus utilising
two out of five algorithms that you have learnt).
Dataset: Use an open dataset (e.g., from Kaggle, UCI ML Repository, etc.) or collect
your own, ensuring it has enough samples but that it is not too large (you should be able to
run your analysis on your laptop). For classification problems, ensure to properly balance
your classes.
Methodology:
• Explain why the chosen technique is suitable for the problem.
• Preprocess the data appropriately.
• Train and evaluate the model using appropriate performance metrics.
• Compare with at least one baseline model
Implementation (.py or .ipynb):
• Use Python (with libraries like TensorFlow, Scikit-learn, Pandas, etc.).
• Ensure reproducibility (seed the random number generator where
appropriate, provide a Jupyter Notebook (and its knitted output) or a well-documented .py
script).
Report (pdf):
• Introduction: Explain the problem and dataset. Ensure to supply references. If
you can produce your how to use TeX Studio and LaTeX.
• Methodology: Describe preprocessing, model selection, and training.
• Results & Discussion: Present evaluation metrics, visualizations, and insights.
• Conclusion: 代写AM11 CNN SVM Text MiningSummarize the findings and suggest future improvements.
Your report should be a maximum of 3 pages long, in an Arial 11 font with standard margins.
Demonstrate the art of concise writing (brevity, economy of words, clarity and precision).
Ensure your figure axes labelling and tickers are legible.
Grading Criteria:
You will be evaluated on both the technical execution and on your ability to communicate
your findings.
Category Weight Description
Problem clarity & justification 20% Clearly defines the problem, explains its
relevance, and justifies the chosen ML
technique.
Data preprocessing & exploratory
analysis
20% Properly cleans, preprocesses, and
visualizes the data; identifies key patterns
and challenges.
Model selection, training, and
evaluation
30% Implements an appropriate model, explains
parameter choices, evaluates performance
with meaningful metrics, and compares with
a baseline.
Interpretation & discussion of
results
20% Provides insightful analysis, interprets
results, discusses limitations, and suggests
improvements.
Code quality & reproducibility 10% Code is well-documented, structured, and
reproducible; submission includes a Jupyter
Notebook or well-commented script.
WX：codinghelp

AM11 CNN SVM Text Mining

lzehlzt7

引用和评论

COMP4033 Computer graphics

MyBatis-Plus结合Spring Boot实现数据权限

70k star，取代Postman！这款轻量级API工具，太香了！

大模型时代，后端程序员如何避免被AI卷死？

LSM-TREE从入门到入魔：从零开始实现一个高性能键值存储｜得物技术

C++ 中 VS 项目引入公共配置文件

疯狂推荐！从零开始 Dify 部署全攻略！

AM11 CNN SVM Text Mining

lzehlzt7

引用和评论

COMP4033 Computer graphics

MyBatis-Plus结合Spring Boot实现数据权限

70k star，取代Postman！这款轻量级API工具，太香了！

大模型时代，后端程序员如何避免被AI卷死？

LSM-TREE从入门到入魔：从零开始实现一个高性能键值存储 ｜ 得物技术

C++ 中 VS 项目引入公共配置文件

疯狂推荐！从零开始 Dify 部署全攻略！

LSM-TREE从入门到入魔：从零开始实现一个高性能键值存储｜得物技术