CodeGarden 10: The sixth annual Umbraco Developer Conference
June 23-25th 2010 - free ASP.NET MVC pre-conference. Register today!

taking it forward

7/21/2009 4:37:15 PMAvatarSimon DingleyLocation: Plymouth, UKposts: 189Karma: 334

Great work Lee, I was considering something similar which never really got started. I was looking at making use of wwwRobotRules on CodePlex http://robotrules.codeplex.com/ - may be of interest or use to you if you decided to do anything further to this package.

7/22/2009 11:23:00 PMAvatarLee KelleherLocation: Yatton, Bristol, UKposts: 463Karma: 1356
Comment with ID: 10410

(I'd completely forgotten that I'd set-up this forum **yikes**)

Hi Simon,

I looked at WWWRobotRules to do the validation... but it was slightly overkill for our needs.  That library is more targetted towards crawler apps, downloading a remote robots.txt, parsing it and checking/verifying links/URLs against the allow/disallow rules.

If you take a look at the source-code for our Robots.txt Editor - I do a simple validation, looping through each line checking for valid keywords (i.e. "User-Agent", "Disallow", etc).

- Lee

Please login or Sign up To post replies