Includes: - 41,323 binaries (exe, dll) - legitimate - 96,724 malware files from virusshare.com
An "advanced*" malware analysis tool powered by Machine Learning, designed to help security researchers and professionals analyze and classify malicious software more effectively.
- ML-powered malware detection and classification
- More to come soon!
- Python 3.8+
- pip requirements:
- SciKit Learn
- Numpy
- Pandas
- PEFile
# Clone the repository
git clone https://github.com/Ashthetik/Malware-DataSet.git
# Navigate to the project directory
cd Malware-DataSet
# Install required dependencies
pip install -r requirements.txt
python3 main.py <malwareFile>
Contributions are welcome! Please feel free to submit a Pull Request.
This tool is intended for security research and legitimate malware analysis purposes only. Users should ensure they have appropriate permissions and are following all applicable laws and regulations when analyzing malware.
MIT
VirusShare for their collection of malwares and detailed features of each file
This tool is provided for educational and research purposes only. The author(s) are not responsible for any misuse or damage caused by this tool.
*This tool is only advanced as RandomForests it's developments go, please use with consideration and caution. This does not do you job for you