HPC is an open dataset of logs collected from System 20 of the high performance computing cluster at the [Los Alamos National Laboratories](http://www.lanl.gov/). But the link (http://institutes.lanl.gov/data/fdata/) to the original data has been out of service. The log has been used for benchmarking log parsing methods in the following papers, where you may find more details about the usage of this dataset.
HPC is an open dataset of logs collected from System 20 of the high performance computing cluster at the [Los Alamos National Laboratories](http://www.lanl.gov/). But the link (http://institutes.lanl.gov/data/fdata/) to the original data has been out of service. The log has been used for benchmarking log parsing methods in the following papers, where you may find more details about the usage of this dataset.
### Download
The raw logs are available for downloading at https://github.com/logpai/loghub.
### Citation
### Citation
If you use this dataset from loghub in your research, please cite the following papers.
If you use this dataset from loghub in your research, please cite the following papers.
+ Adetokunbo Makanju, A. Nur Zincir-Heywood, Evangelos E. Milios. [Clustering Event Logs Using Iterative Partitioning](http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.503.7668&rep=rep1&type=pdf), in Proc. of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 2009.
+ Adetokunbo Makanju, A. Nur Zincir-Heywood, Evangelos E. Milios. [Clustering Event Logs Using Iterative Partitioning](http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.503.7668&rep=rep1&type=pdf), in Proc. of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 2009.