Skip to main content

Table 7 Blockchain security data sets summary and analysis

From: Blockchain abnormal behavior awareness methods: a survey

Dataset type Labeled or not Dataset name Dateset description Data size Table name Table description Columns number Columns description Applicable scene Source
Transaction Yes Elliptic datasets (Elliptic 2019; Weber et al. 2019) Transaction graph classifying the illicit and licit nodes collected from the Bitcoin blockchain 200,000 bitcoin transactions Elliptic_ txs_classes Licit transactions or not 2 Transaction id and its class Money laundring detection; Ponzi schemes detection Ellipic
Elliptic_txs _edgelist Nodes and edges 2 Source and destination transaction ids
Elliptic_ txs_features Transactions features 167 Transaction features
Transaction network of phishing nodes (Wu et al. 2020c; Yuan et al. 2020) Phishing account information from Etherscan 1262 phishing accounts Address The node list for phishing detection 7 Part of block head information, transactions flow and contract address Phishing account detection Xblock
Ethereum-network Transaction subgraph 4 Transactions flow with time
The label of phishing account on ethereum (Chen et al. 2020) The different attack tags of phishing account 2881 phishing addresses Phishing_label The different attack tags of phishing account 4 Account classes and its balance with transaction counts
Ethereum phishing transaction network (Chen et al. 2020) A huge Ethereum transaction network extending from phishing nodes reported in Etherscan 2,973,489 nodes, 13,551,303 edges and 1165 labeled nodes MulDiGraph The network attributes 6 Transaction parties are phishing account or not and the transaction flow
Bitcoin partial transaction datasets (Wu et al. 2020b) Snapshots containing partial transaction records of Bitcoin transaction data from 2014 to 2016 22,500,000 Bitcoin transactions Blockhash Information of block 4 Block head information Money laundering detection; Mixing service detection
txhash Transaction ID and hash pairs 2 Transactions id with its hash
Addresses Bitcoin address ID and address pairs 2 Address with its ID
tx Information of transaction 5 Transaction flow with time and its block id
txin List of all transaction inputs 3 Input flow
txout List of all transaction outputs 3 Output flow
Label Addresses of mixing services 1 Addresses of mixing services
No Ethereum on-chain data (Zheng et al. 2020) Ethereum on-chain data getting from the Ehtereum full node 10,999,999 blocks information Block Ethereum block information 14 Complete block head information Extracting and exploring Ethereum
Normal transaction Normal transactions information 10 Transaction flow with block head and gas information
Internal EtherTransaction Smart contract execution transactions information 8 Contracts execution flow with its re-lated transactions and block information
ContractInfo Contract information 11 Contracts information in its whole lifetime
ContractCall Contract calling 11 Contracts calling flow with calling function, type and status
ERC20 transaction ERC20 token transaction information 7 Transaction flow in ERC20
ERC721 transaction ERC721 token transaction information 7 Transaction flow in ERC721
Contract Yes Smart Ponzi scheme labels (Chen et al. 2018) Labels of smart Ponzi contracts by manually check 3794 contracts labels Ponzi_label The labels of whether a contract is a smart Ponzi scheme 2 Contract type Ponzi schemes detection Xblock
DEFIER extended DApp event Dataset (Su et al. 2021) Labels of attack stages on contracts calling flow of DApps 92,644 Labels of attack stages on contracts calling flow Stage_labels The labels of which attack stages (using the kill chain model) the contract calling flow at 4 DApp’s attack stages with its transaction lists DApp events detection Institute of Information Engineering, CAS
No Smart contract attribute dataset (Huang et al. 2019) All open source contracts in Ethereum 14,000 contracts opening source in Ethereum Open source contract info All open source contracts in Ethereum 9 Contract basic information with contract code and transactions in it Ponzi schemes detection Xblock
Market No Ether price and volume dataset (Han et al. 2020) Price and Volume from 2015 to 2019 of Ether 7892 market data about Ether 4 h_data_eth Market data about Ether as the exchange rate is ETH/USDT 8 High and low price and volume of Ether Ether market causality analysis Xblock
Bitcoin price and volume dataset (Han et al. 2020) Price and Volume from 2015 to 2019 of Ether 7892 market data about Bitcoin 4 h_data_btc Market data about Bitcoin as the exchange rate is BTC/USDT 8 High and low price and volume of Bitcoin Bitcoin market causality analysis
Mt.Gox leaked transaction (Chen et al. 2019) Transaction data leaked by Mt.Gox exchange Mt.Gox leaked transactions from 2012 to 2013 Complete_edge_v2 Transactions of bitcoin market 8 Transaction flow with users’ type Bitcoin market user behavior analysis
Activity information of DApps (Zheng et al. 2017) Information about DApps’activity 1,400,000 DApp’s activity information Radar Activity information of DApps on DappRadar 15 DApp’s basic information, transactions and contracts in it and users information DApp activity analysis
State_of_the_dapp Activity information of DApps on State of the Dapps 28 DApp’s state information, transactions and contracts in it and users information