Master's Thesis : A study of text classification framework in R for classification of functional software requirements in enterprise systems; involves cleaning and pre-processing data, and using classifiers like kNN and ensemble methods to study the performance and accuracy of the model.
Please see the link to the presentation and Thesis document for detailed information on this subject.
https://docs.google.com/presentation/d/1vxnm5LodVoqhHE1YziJW7OZ3TPyavaCmDFBwZD2eUus/edit?usp=sharing
** Data sets not available for public use**