Hi,
I'm using the web spider in LWE to crawl a ColdFusion site, prior to demonstrating to a customer. There are a lot of errors: inspecting the logs (lucid.log.[datetime]) gives errors such as:
2010-11-05 13:18:38,857 INFO handler.SolrApertureCallbackHandler - accessingObject crawler: org.semanticdesktop.aperture.crawler.web.WebCrawler@ffc3fc url: http://www.somethingsomething/something
2010-11-05 13:18:39,615 INFO handler.SolrApertureCallbackHandler - new http://www.somethingsomething/something
2010-11-05 13:18:39,620 WARN handler.SolrApertureCallbackHandler - Doc failed: http://www.somethingsomething/something
However there's no further information. How do I find out *why* the document failed (and tweak the crawl appropriately)?