A project for the university course "Knowledge Discovery from Databases".
The following operations are implemented in Python 3.9. They use csv files as training datasets, in which all attributes are numerical except for the class attribute:
- NormalizeValues(inputCsvFile): Normalizes the values of all attributes except the class attribute (i.e., transforms them into the [0,1] range).
- ENN(inputNormalizedCsvFile, K): Takes as input a normalized csv file and the required algorithm parameter K, and applies the ENN (Edited Nearest Neighbor) editing algorithm to it.
- IB2(inputNormalizedCsvFile): Takes as input a normalized csv file and applies the IB2 instance reduction algorithm to it.
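
The normalization step can be illustrated with a minimal min-max scaling sketch. It is not the project's actual code: the function name `normalize_rows` is hypothetical, the class attribute is assumed to be the last column, and rows are assumed to be already read from the csv file (without a header line).

```python
def normalize_rows(rows):
    """Min-max scale every column except the last one (assumed to be the class)."""
    n_attrs = len(rows[0]) - 1
    # Collect each numerical column so its minimum and maximum can be found.
    columns = [[float(r[i]) for r in rows] for i in range(n_attrs)]
    mins = [min(c) for c in columns]
    maxs = [max(c) for c in columns]

    normalized = []
    for r in rows:
        scaled = []
        for i in range(n_attrs):
            span = maxs[i] - mins[i]
            # Assumption: a constant column (no spread) is mapped to 0.0
            # to avoid dividing by zero.
            scaled.append((float(r[i]) - mins[i]) / span if span else 0.0)
        normalized.append(scaled + [r[-1]])  # class value is left untouched
    return normalized


sample = [["5.0", "2.0", "a"], ["7.0", "4.0", "b"], ["6.0", "3.0", "a"]]
print(normalize_rows(sample))
# [[0.0, 0.0, 'a'], [1.0, 1.0, 'b'], [0.5, 0.5, 'a']]
```
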
The code is tested with the "iris.csv" and "letter-recognition.csv" datasets.
Each program also writes its output to a csv file.
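
As a rough illustration of the ENN and IB2 operations listed above, here is a minimal in-memory sketch. It rests on several assumptions that may differ from the project's code: instances are `(features, label)` pairs that are already normalized, distance is Euclidean, ENN removes every instance whose class disagrees with the majority class of its K nearest neighbours (ties resolved by the first class seen), and IB2 makes a single pass in file order. The names `enn` and `ib2` are hypothetical.

```python
import math
from collections import Counter


def enn(points, k):
    """Keep only the instances whose class matches the majority of their k nearest neighbours."""
    kept = []
    for i, (x, label) in enumerate(points):
        # Neighbours are searched in the full original set, excluding the instance itself.
        others = [p for j, p in enumerate(points) if j != i]
        neighbours = sorted(others, key=lambda p: math.dist(x, p[0]))[:k]
        majority = Counter(lbl for _, lbl in neighbours).most_common(1)[0][0]
        if majority == label:
            kept.append((x, label))
    return kept


def ib2(points):
    """Build a condensing set: add an instance only if its nearest stored instance misclassifies it."""
    condensed = [points[0]]  # the first instance is always stored
    for x, label in points[1:]:
        nearest = min(condensed, key=lambda p: math.dist(x, p[0]))
        if nearest[1] != label:
            condensed.append((x, label))
    return condensed


data = [
    ((0.0, 0.0), "a"), ((0.1, 0.0), "a"), ((0.0, 0.1), "a"),
    ((1.0, 1.0), "b"), ((0.9, 1.0), "b"), ((1.0, 0.9), "b"),
    ((0.05, 0.05), "b"),  # a "b" instance surrounded by "a" instances
]
edited = enn(data, 3)
print(len(edited))   # 6: the isolated "b" instance is removed
print(ib2(edited))   # [((0.0, 0.0), 'a'), ((1.0, 1.0), 'b')]
```

Running IB2 on the ENN output, as in the usage lines, is only one possible way to combine the two; each function also works on its own.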