1. Liang Y L, Zhang Y Y, Jette M, et al. BlueGene/L failure analysis and prediction models. Proceedings of the International Conference on Dependable Systems and Networks (DSN’06), Jun 25-28, 2006, Philadelphia, PA, USA. New York, NY, USA: ACM, 2006: 425-434
2. Salfner F, Malek M. Using hidden semi-Markov models for effective online failure prediction. Proceedings of the 26th IEEE Symposium on Reliable Distributed Systems (SRDS’07), Oct 10-12, 2007, Beijing, China. Piscataway, NJ, USA: IEEE, 2007: 161-174
3. Fu S, Xu C Z. Exploring event correlation for failure prediction in coalitions of clusters. Proceedings of the 21st International Conference on High Performance Computing, Networking, Storage and Analysis (SC’07), Nov 10-16, 2007, Reno, NV, USA. Los Alamitos, CA, USA: IEEE Computer Society, 2007: 456-468
4. Hacker T J, Romero F, Carothers C D. An analysis of clustered failures on large supercomputing systems. Journal of Parallel and Distributed Computing, 2009, 69 (7): 652-665
5. Salfner F, Tröger P, Tschirpke S. Cross-core event monitoring for processor failure prediction. Proceedings of the 23rd International Symposium on High Performance Computing and Simulation (HPCS’09), Jun 21-24, 2009, Leipzig, Germany. Los Alamitos, CA, USA: IEEE Computer Society, 2009: 67-73
6. Taerat N, Naksinehaboon N, Chandler C, et al. Using log information to perform statistical analysis on failures encountered by large-scale HPC deployments. Proceedings of the 5th High Availability and Performance Computing Workshop (HAPCW’08), Apr 2-4, 2008, Denver, CO, USA. 2008
7. Solano-Quinde L D, Bode B M. Module prototype for online failure prediction for the IBM BlueGene/L. Proceedings of the IEEE International Conference on Electro/Information Technology (EIT’08), May 18-20, 2008, Ames, IA, USA. Piscataway, NJ, USA: IEEE, 2008: 470-474
8. Zhang Y Y, Sivasubramaniam A. Failure prediction in IBM BlueGene/L event logs. Proceedings of the 7th IEEE International Conference on Data Mining (ICDM’07), Oct 28-31, 2007, Omaha, NE, USA. Los Alamitos, CA, USA: IEEE Computer Society, 2007: 583-588
9. Liang Y L, Zhang Y Y, Sivasubramaniam A, et al. Filtering failure logs for a BlueGene/L prototype. Proceedings of the International Conference on Dependable Systems and Networks (DSN’05), Jun 28-Jul 1, 2005, Yokohama, Japan. Los Alamitos, CA, USA: IEEE Computer Society, 2005: 476-485
10. Schroeder B, Gibson G A. A large-scale study of failures in high-performance computing systems. Proceedings of the International Conference on Dependable Systems and Networks (DSN’06), Jun 25-28, 2006, Philadelphia, PA, USA. Los Alamitos, CA, USA: IEEE Computer Society, 2006: 249-258
11. Williams A W, Pertet S M, Narasimhan P. Tiresias: Black-box failure prediction in distributed systems. Proceedings of the 21st IEEE International Parallel and Distributed Processing Symposium (IPDPS’07), Mar 26-30, 2007, Long Beach, CA, USA. Los Alamitos, CA, USA: IEEE Computer Society, 2007: 8p
12. De Silva V, Tenenbaum J B. Global versus local methods in nonlinear dimensionality reduction. Proceedings of the 16th Annual Conference on Neural Information Processing Systems (NIPS’02), Dec 9-14, 2002, Vancouver, Canada. 2002: 705-712
13. Roweis S T, Saul L K. Nonlinear dimensionality reduction by locally linear embedding. Science, 2000, 290 (5500): 2323-2326
14. De Ridder D, Duin R P W. Locally linear embedding for classification. Technical Report, PH-2002-01. Delft, The Netherlands: Delft University of Technology, 2002
15. Kégl B. Intrinsic dimension estimation using packing numbers. Proceedings of the 16th Annual Conference on Neural Information Processing Systems (NIPS’02), Dec 9-14, 2002, Vancouver, Canada. 2002: 681-688