Evaluating-Machine-Learning-Models-for-Disparate-Computer-Systems-Performance-Prediction

Paper available at the following link

Performance prediction is an active area of research due to its applications in the advancements of hardware-software co-development. Several empirical machine-learning models such as linear models, tree-based models, neural network etc are used for performance prediction each having different prediction accuracy. Furthermore, the prediction model’s accuracy may differ depending on performance data collected for different software types (compute-bound, memory-bound) and different hardware (simulation-based or physical systems). We have studied fourteen machine-learning models on simulation-based hardware and physical systems by executing several benchmark programs with different computation and data access patterns. Our results show that the tree-based machine-learning models outperform all other models with median absolute percentage error (MedAPE) of less than 5% followed by bagging and boosting models that help to improve weak learners. We have also observed that prediction accuracy is higher on simulation-based hardware due to its deterministic nature as compared to physical systems. Moreover, in physical systems, prediction accuracy of memorybound algorithms is higher as compared to compute-bound algorithms due to manufacturer variability in processors.

Citation

If you find this repo useful for your research, please consider citing our paper:

@INPROCEEDINGS{9198512,
  author={A. {Mankodi} and A. {Bhatt} and B. {Chaudhury} and R. {Kumar} and A. {Amrutiya}},
  booktitle={2020 IEEE International Conference on Electronics, Computing and Communication Technologies (CONECCT)}, 
  title={Evaluating Machine Learning Models for Disparate Computer Systems Performance Prediction}, 
  year={2020},
  volume={},
  number={},
  pages={1-6},
  doi={10.1109/CONECCT50063.2020.9198512}}

For any enquiries, please contact the main authors.

Folders

Dataset_CSV: Folder Containing input dataset: Please contact the authors for full dataset.

Physical Systems Dataset:

dijkstra_lab.csv
matmul_lab_omp.csv
montecarlo_lab_omp.csv
mser_lab.csv
qsort_actual_lab_omp.csv
runtimes_final_mantevo_miniFE.csv
runtimes_final_npb_ep.csv
runtimes_final_npb_mg.csv
sha_lab.csv
stitch_lab.csv
svm_lab.csv
tracking_lab.csv

Simulated Dataset using Gem5:

dijsktra.csv
matmul.csv
montecarlocalcpi.csv
mser.csv
qsort.csv
sha.csv
stitch.csv
svm.csv
tracking.csv

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
Codes		Codes
Dataset_CSV		Dataset_CSV
Images		Images
Results_CSV/all_csv		Results_CSV/all_csv
Results_Plots_Code		Results_Plots_Code
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Evaluating-Machine-Learning-Models-for-Disparate-Computer-Systems-Performance-Prediction

Citation

Folders

Dataset_CSV: Folder Containing input dataset: Please contact the authors for full dataset.

Physical Systems Dataset:

Simulated Dataset using Gem5:

Codes: Contain all Experimental Codes

data: Contains saved models and cross-folds data, which is available as a link to google drive.

Images: Contains images used for Paper

Results_CSV: Contains Metircs Scores for each Dataset application

Results_Plots_Code: Contains code used for plotting and saving images for paper

About

Releases 1

Packages

Languages

License

rajat-tech-002/Evaluating-Machine-Learning-Models-for-Disparate-Computer-Systems-Performance-Prediction

Folders and files

Latest commit

History

Repository files navigation

Evaluating-Machine-Learning-Models-for-Disparate-Computer-Systems-Performance-Prediction

Citation

Folders

Dataset_CSV: Folder Containing input dataset: Please contact the authors for full dataset.

Physical Systems Dataset:

Simulated Dataset using Gem5:

Codes: Contain all Experimental Codes

data: Contains saved models and cross-folds data, which is available as a link to google drive.

Images: Contains images used for Paper

Results_CSV: Contains Metircs Scores for each Dataset application

Results_Plots_Code: Contains code used for plotting and saving images for paper

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages