3.5 本章小结
本章在一定程度上科普了机器学习的常见概念,并介绍了后面章节将频繁使用的一些知名数据集,我们在后面章节将频繁和这些数据打交道。本章并没有详细讲解各种概念的定义,如果对机器学习的基本概念还是很模糊,也没有关系,通过后面章节的学习,我们会不断加深对这些概念的理解。
参考资源
我写作本章时参考了以下网站,读者要想进一步学习,欢迎到这些网站进一步了解更多信息:
http://kdd.ics.uci.edu/databases/kddcup99/kddcup99.html
http://blog.csdn.net/com_stu_zhang/article/details/6987632
http://www.isi.csic.es/dataset/
http://www.schonlau.net
http://www.fx361.com/page/2016/1018/284843.shtml
http://www.freebuf.com/articles/system/97703.html
http://yann.lecun.com/exdb/mnist/
http://www.cs.cornell.edu/People/pabo/movie-review-data/
http://archive.ics.uci.edu/ml/datasets/Spambase
http://www2.aueb.gr/users/ion/data/enron-spam/
还有一些参考文献值得一读:
[1] R Lippmann, J W Haines, D J Fried, J Korba, K Das. Analysis and Results of the The 1999 DARPA Off-Line Intrusion Detection Evaluation. International Symposium on Recent Advances in Intrusion Detection, 2000, 34 (4): 162-182.
[2] J McHugh. Testing Intrusion Detection Systems: A Critique of the 1998 and 1999 DARPA Intrusion Detection System Evaluations as Performed by Lincoln Laboratory. ACM Transactions on Information & System Security, 2000, 3 (4):262-294.
[3] Schonlau M, Couper M. Semi-Automated Categorization of Open-Ended Questions. Survey Research Methods, August 2016.
[4] 张有新,曾华燊,贾磊.入侵检测数据集KDD CUP99研究[J].计算机工程与设计,2010,22.