I am not sure of what our comparison between the 'reduced data' and 'original data' is
supposed to show.
Is it that performing data reduction yields a similar accuracy with reduced time and memory
requirements? Is accuracy supposed to improve?
Also, should we use the J48 Decision Tree to Compare the performance of the two models?
You can compare them based on accuracy, F1, etc., or compare them on data set size, etc. Data
reduction usually leads to lower accuracy, but sometimes may lead to higher accuracy.
You can use J48 or other models of your choice.