Skip to main content

Table 2 Our collected dataset categorized as malicious, benign, and incompatible PDF files (bracketed)

From: Keeping pace with the creation of new malicious PDF files using an active-learning based detection framework

Dataset source

Year

Malicious files

Benign files

VirusTotal a repository

2012–2014

17,596 (1017)

Srndic and Laskov [6]

2012

27,757 (437)

Contagio project

2010

410 (175)

Internet and Ben-Gurion University (random selection)

2013–2014

0

5145

Total

 

45,763 (1629)

5145

  1. a https://www.virustotal.com/