Search: Svalue stack overflow
Imported from http://bugzilla.roxen.com/bugzilla/show_bug.cgi?id=6717
Reported by Marcus Wellhardh wellhard@roxen.com
Bad formed documents can cause crawler to abort. Expected behavior is to give a warning and continue indexing. Reported in [RT 20059].
0d 3h 9m : Crawler 24352: Svalue stack overflow. (99624 of 100000 entries on stack, needed 256 more entries)
: Crawler 24352: pike/lib/modules/Search.pmod/Filter.pmod/HTML.pmod:382: Search.Filter.Word->__lambda_66105_21_line_380(Parser._parser.HTML(),"\n \n",([])) : Crawler 24352: pike/lib/modules/Parser.pmod/_parser.so:1: Parser._parser.HTML()->finish("<!DOCTYPE HTML PUBLIC \"-//W3C//DTD HTML 4.0 Transitional//EN\" \"http://www.w3.org/TR/REC-html40/loose.dtd\">\n<html> \n<head> \n<META HTTP-EQUIV=\"Content-Type\" CONTENT=\"text/html; charset=utf-8\">\n<META"+[5124697])
: Crawler 24352: pike/lib/modules/Search.pmod/Filter.pmod/HTML.pmod:407: Search.Filter.Word->filter(URI("sitebuilder://dmrbackend/student/opleidingen/hbo-r/jaar-1-deeltijd/cova-samenwerken-en-proje-1/Stm_2010_HBO-Rechten.doc"),"<!DOCTYPE HTML PUBLIC \"-//W3C//DTD HTML 4.0 Tran"+[5124845],"text/html",([]),"utf-8")
: Crawler 24352: modules/search/pike-modules/Search.pmod/Filter.pmod/Word.pmod:52: Search.Filter.Word->filter(@0,"ÐÏ\21ࡱ\32á\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0>\0\3\0þÿ\t\0\6\0\0\0\0\0\0\0\0\0\0\09\0\0"+[3717074],"application/msword")
17:27:00 : Crawler 24352: modules/search/programs/multiprocess_crawler.pike:496: /main()->low_do_filter("¶ke0\1\0\6\0data\u00867¶üÐÏ\21ࡱ\32á\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0>\0\3\0þÿ\t\0\6\0\0\0\0\0\0\0\0\0\0\09\0\0\0W\34\0\0\0\0\0\0\0\20\0\0Y\34\0\0\1\0\0\0þÿÿÿ\0\0\0\0\36\34\0\0\37\34\0\0 \34\0\0!\34\0\0\"\34\0\0#\34\0\0$\34\0\0%\34\0\0&\34\0\0'\34\0\0(\34\0\0)\34\0\0*\34\0\0+\34\0\0,\34\0\0-\34\0\0.\34\0\0/\34\0\u00000\34\0\u00001\34\0\u00002\34\0\u00003\34\0\u00004\34\0\u00005\34\0\u00006\34\0\u00007\34\0\0"+[3717525])
0d 3h 9m : Crawler 24352: modules/search/programs/multiprocess_crawler.pike:376: /main()->got_data(0,"<\e\0\0=\e\0\0>\e\0\0?\e\0\0@\e\0\0A\e\0\0B\e\0\0C\e\0\0D\e\0\0E\e\0\0F\e\0\0G\e\0\0H\e\0\0I\e\0\0J\e\0\0K\e\0\0L\e\0\0M\e\0\0N\e\0\0O\e\0\0P\e\0\0Q\e\0\0R\e\0\0S\e\0\0T\e\0\0U\e\0\0V\e\0\0W\e\0\0X\e\0\0Y\e\0\0Z\e\0\0[\e\0\0\\\e\0\0]\e\0\0^\e\0\0_\e\0\0`\e\0\0a\e\0\0b\e\0\0c\e\0\0d\e\0\0e\e\0\0f\e\0\0g\e\0\0h\e\0\0i\e\0\0j\e\0\0k\e\0\0l\e\0"+[7998])
: Crawler 24352: -:1: Pike.Backend(0)->`()(3600.0)
: Crawler 24352: pike/lib/master.pike:3895: master()->_main(({"/opt/roxen/server-5.2.200/bin/roxen","-M","modules/search/pike-modules/","modules/search/programs/multiprocess_crawler.pike","single_process"}))
: Crawler 24352: Exiting due to uncaught error.