git*_*oob 2 machine-learning svm
I have an application that needs to distinguish good HTTP GET requests from bad ones.
For example:
http://somesite.com?passes=dodgy+parameter # BAD
http://anothersite.com?passes=a+good+parameter # GOOD
My system can make a binary decision about whether or not a URL is good or bad - but ideally I would like it to predict whether or not a previously unseen URL is good or bad.
http://some-new-site.com?passes=a+really+dodgy+parameter # BAD
I feel the need for a support vector machine (SVM) ... but I need to learn machine learning. Some questions:
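1) Is an SVM appropriate for this task?
2) Can I train it with the raw URLs, without explicitly specifying 'features'?
3) How many URLs will I need for it to be good at predictions?
4) What kind of SVM kernel should I use?
5) After I train it, how do I keep it up to date?
6) How do I test unseen URLs against the SVM again to decide whether they are good or bad?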
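To make question 2 concrete: an SVM operates on numeric vectors, so a raw URL usually has to be broken into tokens first. The snippet below is only a minimal sketch of one possible tokenization, using the example URLs from above. The function name `url_features` and the `host=` / `param=` / `word=` token scheme are assumptions made for illustration, not an established method; the resulting counts would still need to be vectorized before being passed to an SVM library.

```python
from collections import Counter
from urllib.parse import urlsplit, parse_qsl


def url_features(url: str) -> Counter:
    """Turn a raw URL into a bag of tokens (hypothetical feature scheme)."""
    parts = urlsplit(url)
    features = Counter()
    # The host name becomes one token.
    features["host=" + parts.netloc.lower()] += 1
    # parse_qsl decodes '+' as a space, so each parameter value
    # can be split into individual words.
    for name, value in parse_qsl(parts.query, keep_blank_values=True):
        features["param=" + name] += 1
        for word in value.lower().split():
            features["word=" + word] += 1
    return features


# The labelled examples from the question.
examples = [
    ("http://somesite.com?passes=dodgy+parameter", "BAD"),
    ("http://anothersite.com?passes=a+good+parameter", "GOOD"),
]
for url, label in examples:
    print(label, dict(url_features(url)))
```

With a scheme like this, a previously unseen URL that shares tokens such as `word=dodgy` with known bad URLs maps to a nearby feature vector, which is what would let a classifier generalize beyond the URLs it was trained on.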
I think Steve and StompChicken both make excellent points:
It is also useful to look at how other people have tackled similar problems: