Skip to main content

Table 2 Datasets statistics

From: Data and knowledge-driven named entity recognition for cyber security

Datasets Sentences Tokens Entities
Training set 16162 595068 47628
Test set 4635 170736 14002
Validation set 2298 85345 6868
Total 23095 851149 68498