I have around 200 CSV files containing data from vulnerability assessments, all with similar headings. There are about 20 headings in each file. The most important ones are IP address, Port, Severity and Vulnerability Synopsis. Each file represents one location in a single assessment. My goal is to combine all the files in R and analyze them to find useful trends.
One thing I can't figure out is how to assign a date to each file. None of the headings in the files contains a date. The files cover the period from 2015 until this year, and the assessment is done twice a year, in an April session and an October session. Each session generates around 20 files, so all the files from one session share the same date.
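A rough sketch of one way the date could be attached while reading the files, using base R only. It assumes each session's files are kept in a folder named after the session, such as `assessments/2016-04/`. The folder layout, the `read_session_file` helper and the added `session`, `date` and `location` columns are all assumptions for illustration, not something the files already provide.

```r
# Hypothetical layout: assessments/2016-04/site_a.csv, assessments/2016-10/site_b.csv, ...
# read_session_file() is an illustrative helper, not an existing function.
read_session_file <- function(path) {
  df <- read.csv(path, stringsAsFactors = FALSE, check.names = FALSE)
  session <- basename(dirname(path))                  # folder name, e.g. "2016-04"
  df$session  <- session
  df$date     <- as.Date(paste0(session, "-01"))      # first of the month as a stand-in date
  df$location <- tools::file_path_sans_ext(basename(path))
  df
}

# Find every CSV below the (assumed) top-level folder and stack them.
files <- list.files("assessments", pattern = "\\.csv$",
                    recursive = TRUE, full.names = TRUE)

# rbind() needs identical column names in every file; files whose headings
# differ would have to be harmonised first.
all_data <- do.call(rbind, lapply(files, read_session_file))
```

If the session is encoded in the file name instead of the folder, only the line that derives `session` would need to change.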
These are the trends I want to get from the analysis in R:
- IP addresses that are found in all the files, meaning they are always vulnerable
- IP addresses with the most vulnerabilities
- Vulnerabilities that persist in all the files
- The number of IP addresses that appear in one session's files but not in the next, for example found in the April 2016 files but not in the October 2016 files.
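To make the four trends concrete, here is a minimal base R sketch on a toy data frame. The column names (`session`, `ip`, `vuln`), the sample values and the `in_every_group` helper are made up for illustration, and the sketch groups by session rather than by individual file, which is an assumption about what "all the files" should mean since each file covers a different location.

```r
# Toy stand-in for the combined data; all names and values are hypothetical.
all_data <- data.frame(
  session = c("2016-04", "2016-04", "2016-04", "2016-10", "2016-10"),
  ip      = c("10.0.0.1", "10.0.0.2", "10.0.0.1", "10.0.0.1", "10.0.0.3"),
  vuln    = c("SSL weak cipher", "SMB signing", "SSL weak cipher",
              "SSL weak cipher", "SMB signing"),
  stringsAsFactors = FALSE
)

# Values that occur in every group (illustrative helper).
in_every_group <- function(values, groups) {
  Reduce(intersect, split(values, groups))
}

# 1. IP addresses present in every session.
always_vulnerable <- in_every_group(all_data$ip, all_data$session)

# 2. Number of findings per IP address, largest first.
findings_per_ip <- sort(table(all_data$ip), decreasing = TRUE)

# 3. Vulnerabilities present in every session.
persistent_vulns <- in_every_group(all_data$vuln, all_data$session)

# 4. IP addresses seen in one session but not in the following one.
#    split() orders the groups alphabetically, which is chronological
#    for "YYYY-MM" labels. Needs at least two sessions.
by_session <- split(all_data$ip, all_data$session)
sessions <- names(by_session)
dropped <- sapply(seq_len(length(sessions) - 1), function(i) {
  length(setdiff(by_session[[i]], by_session[[i + 1]]))
})
names(dropped) <- paste(sessions[-length(sessions)], "->", sessions[-1])
```

Swapping `all_data$session` for a file or location column in the calls to `in_every_group` would give the strict per-file version of trends 1 and 3.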
I need some guidance on how to achieve this. Thank you in advance for any help offered.