From: Unleashing the power of pseudo-code for binary code similarity analysis
 | Train data | | | | | Test data | | |
---|---|---|---|---|---|---|---|---|
Dataset | sg3utils | findutils | usbutils | coreutils | utillinux | binutils | inetutils | diffutils |
Binary count | 504 | 32 | 16 | 832 | 720 | 118 | 208 | 32 |
Total count | 2104 | | | | | 358 | | |
Pairs | 119272 | | | | | 19177 | | |
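
As a quick consistency check on the table, the per-package binary counts can be summed by split. The grouping used here (first five packages as training data, last three as test data) is an assumption inferred from the reported totals, and the variable names are illustrative only. The sketch does not attempt to reproduce the pair counts, since the table does not say how pairs are formed.

```python
# Binary counts per package, copied from the table.
# Assumption: the first five packages form the training split and the
# last three form the test split (inferred from the reported totals).
train_counts = {
    "sg3utils": 504,
    "findutils": 32,
    "usbutils": 16,
    "coreutils": 832,
    "utillinux": 720,
}
test_counts = {
    "binutils": 118,
    "inetutils": 208,
    "diffutils": 32,
}

train_total = sum(train_counts.values())
test_total = sum(test_counts.values())

# Compare against the "Total count" row of the table.
print(train_total, test_total)  # prints: 2104 358
```

Both sums match the "Total count" row, which supports reading the header as five training packages followed by three test packages.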