Skip to main content

Table 4 Analysis of the datasets and trained model

From: Using deep learning to solve computer security challenges: a survey

Topic

Paper

Source Available

Raw Data Available1

Dataset Available 2

Quality of Dataset

     

Sample Num

Balance

PA

RFNBNN [9]

N/A

N/A

 

EKLAVYA [10]

N/A

N/A

ROP

ROPNN [11]

N/A

N/A

 

HeNet [12]

N/A

N/A

CFI

Barnum [13]

N/A

N/A

 

CFG-CNN [14]

N/A

N/A

Network

50b(yte)-CNN [15]

115835

 

PCCN [16]

1168671

Malware

Rosenber [17]

500000

 

DeLaRosa [18]

100000

LogEvent

DeepLog [8]

P3

P

N/A

 

LogAnom [41]

P

N/A

MemoryFoensic

DeepMem [19]

N/A

 

MDMF [48]

N/A

FUZZING

NeuZZ [20]

N/A

N/A

 

Learn & Fuzz

N/A

N/A

  1. 1“Raw data” refers to the data that used to generate training set but cannot be feed into the model directly. For instance, a collection of binary files is raw file
  2. 2“Dataset” is the collection of data sample that can be feed in to the DL model directly. For instance, a collection of image, sequence
  3. 3“P” denotes that its source code or dataset is partially available to public