Encrypted Web Traffic Dataset: Event Logs and Packet Traces

Logo poskytovatele
Autoři

ŠPAČEK Stanislav VELAN Petr ČELEDA Pavel TOVARŇÁK Daniel

Rok publikování 2022
Druh Článek v odborném periodiku
Časopis / Zdroj Data in Brief
Fakulta / Pracoviště MU

Ústav výpočetní techniky

Citace
www https://doi.org/10.1016/j.dib.2022.108188
Doi http://dx.doi.org/10.1016/j.dib.2022.108188
Klíčová slova HTTPS dataset; TLS 1.2 encryption; Host-based data collection; Network data collection; Encrypted traffic analysis; Event-flow correlation
Přiložené soubory
Popis We present a dataset that captures seven days of monitoring data from eight servers hosting more than 800 sites across a large campus network. The dataset contains data from network monitoring and host-based monitoring. The first set of data are packet traces collected by a probe situated on the network link in front of the web servers. The traces contain encrypted HTTP over TLS 1.2 communication between clients and web servers. The second set of data is an event log captured directly on the web servers. The events are generated by the Internet Information Services (IIS) logging and include both the IIS default features and custom features, such as client port and transferred data volume. Anonymization of all features in the dataset has been carefully carried out to prevent private information leakage while preserving the information value of the dataset. The dataset is suitable mainly for training machine learning techniques for anomaly detection and the identification of relationships between network traffic and events on web servers. We also add tools, settings, and a guide to convert the packet traces to IP flows that are often preferred for network traffic analysis.
Související projekty:

Používáte starou verzi internetového prohlížeče. Doporučujeme aktualizovat Váš prohlížeč na nejnovější verzi.

Další info