Write a report (max 1,500 words) on the challenges the dataset presents, the solutions, and your findings, which will be assessed as follows: 1) Discuss the following feature extraction techniques and explain how they work and their advantages and disadvantages a) Term Frequency-Inverse Document Frequency (TF-IDF) [10%] b) BERT [10%] 2) Two step Classification: a) Related/Unrelated classification: i) Use TF-IDF features to train a standard […]



