From: Data and knowledge-driven named entity recognition for cyber security
Datasets
Sentences
Tokens
Entities
Training set
16162
595068
47628
Test set
4635
170736
14002
Validation set
2298
85345
6868
Total
23095
851149
68498