Paul C's Blog

To be funny,to grow up!

0%

恶意软件变种分类数据集构建01

Decoding the Secrets of Machine Learning in Malware Classification: A Deep Dive into Datasets, Feature Extraction, and Model Performance

数据集的分布,样本数量和家族数量对结果的影响,特征之间的互补性(权重设置以及是否有重复。)

数据集:670家族×每家族100样本

特征提取方法:

VGG-16, RestNet-18, and EfficientNet-B0