Unsupervised host behavior classification from connection patterns

Abstract : A novel host behavior classification approach is proposed as a preliminary step toward traffic classification and anomaly detection in network communication. Though many attempts described in the literature were devoted to flow or application classifications, these approaches are not always adaptable to operational constraints of traffic monitoring (expected to work even without packet payload, without bidirectionality, on highspeed networks or from flow reports only...). Instead, the classification proposed here relies on the leading idea that traffic is relevantly analyzed in terms of host typical behaviors: typical connection patterns of both legitimate applications (data sharing, downloading,...) and anomalous (eventually aggressive) behaviors are obtained by profiling traffic at the host level using unsupervised statistical classification. Classification at the host level is not reducible to flow or application classification, and neither is the contrary: they are different operations which might have complementary roles in network management. The proposed host classification is based on a nine-dimensional feature space evaluating host Internet connectivity, dispersion and exchanged traffic content. A Minimum Spanning Tree (MST) clustering technique is developed that does not require any supervised learning step to produce a set of statistically established typical host behaviors. Not relying on a priori defined classes of known behaviors enables the procedure to discover new host behaviors, that potentially were never observed before. This procedure is applied to traffic collected over the entire year 2008 on a transpacific (Japan/USA) link. A cross-validation of this unsupervised classification against a classical port-based inspection and a state-of-the-art method provides assessment of the meaningfulness and the relevance of the obtained classes for host behaviors.
Type de document :
Article dans une revue
International Journal of Network Management, Wiley, 2010, 20 (5), pp.317-337
Liste complète des métadonnées

Littérature citée [34 références]  Voir  Masquer  Télécharger

https://hal-ens-lyon.archives-ouvertes.fr/ensl-00488248
Contributeur : Pierre Borgnat <>
Soumis le : mercredi 9 juin 2010 - 09:47:07
Dernière modification le : jeudi 19 avril 2018 - 14:54:04
Document(s) archivé(s) le : jeudi 1 décembre 2016 - 07:07:00

Fichier

ijnm_rev2_bis.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : ensl-00488248, version 2

Citation

Guillaume Dewaele, Yosuke Himura, Pierre Borgnat, Kensuke Fukuda, Patrice Abry, et al.. Unsupervised host behavior classification from connection patterns. International Journal of Network Management, Wiley, 2010, 20 (5), pp.317-337. 〈ensl-00488248v2〉

Partager

Métriques

Consultations de la notice

407

Téléchargements de fichiers

167