Skip to main content

Table 2 Datasets statistics

From: Data and knowledge-driven named entity recognition for cyber security

Datasets

Sentences

Tokens

Entities

Training set

16162

595068

47628

Test set

4635

170736

14002

Validation set

2298

85345

6868

Total

23095

851149

68498