The Virtual Center for Network and Security Data
Sponsored by the Department of Homeland Security (DHS) Science & Technology (S&T) Directorate
Today's networked systems are being attacked with increasing frequency and intensity. To fully understand the scope and impact of these attacks and to develop defensive mechanisms, researchers require Internet-wide datasets. These datasets go beyond simple single-point packet traces and provide a broad view of events with rich, correlated data.
To meet the demand for such a repository, we have: (1) Engaged potential data providers and secured their commitment to participate; (2) Coordinated the creation of meta-data for the repository; (3) Created a query interface for searching data sources based on the meta-data; (4) Provided access to the data made available by member institutions; (5) Provided centralized aggregation and storage of specific datasets.
Virtual Repository Datasets:
To jump-start Internet-scale research, we have developed a virtual data repository of rich, correlated datasets representing Internet-scale behaviors. Data available from this virtual repository includes both infrastructure-level data and data from distributed forensics tools. A few examples include:
Current and Past Virtual Repository Participants:
As part of this work, we have brought together a diverse set of consortium partners representing tier-1 ISPs, national research networks, and existing global data collection infrastructure. The goal is to offer a wide range of highly relevant data sources that together provide a global perspective on Internet behavior. Some current and past participants include:
DHS Project Portal
Researchers can request access to datasets through the DHS PREDICT portal: https://predict.org