Skip to content
GitLab
Projects Groups Snippets
  • /
  • Help
    • Help
    • Support
    • Community forum
    • Submit feedback
    • Contribute to GitLab
  • Sign in / Register
  • pike pike
  • Project information
    • Project information
    • Activity
    • Labels
    • Members
  • Repository
    • Repository
    • Files
    • Commits
    • Branches
    • Tags
    • Contributors
    • Graph
    • Compare
  • Issues 104
    • Issues 104
    • List
    • Boards
    • Service Desk
    • Milestones
  • Merge requests 0
    • Merge requests 0
  • CI/CD
    • CI/CD
    • Pipelines
    • Jobs
    • Schedules
  • Deployments
    • Deployments
    • Environments
    • Releases
  • Packages and registries
    • Packages and registries
    • Container Registry
  • Monitor
    • Monitor
    • Incidents
  • Analytics
    • Analytics
    • Value stream
    • CI/CD
    • Repository
  • Wiki
    • Wiki
  • Snippets
    • Snippets
  • Activity
  • Graph
  • Create a new issue
  • Jobs
  • Commits
  • Issue Boards
Collapse sidebar
  • pikelang
  • pikepike
  • Issues
  • #6717
Closed
Open
Issue created Mar 21, 2013 by Peter Bortas@zinoOwner

Search: Svalue stack overflow

Imported from http://bugzilla.roxen.com/bugzilla/show_bug.cgi?id=6717

Reported by Marcus Wellhardh wellhard@roxen.com

Bad formed documents can cause crawler to abort. Expected behavior is to give a warning and continue indexing. Reported in [RT 20059].

 0d 3h 9m : Crawler 24352: Svalue stack overflow. (99624 of 100000 entries on stack, needed 256 more entries)
          : Crawler 24352: pike/lib/modules/Search.pmod/Filter.pmod/HTML.pmod:382: Search.Filter.Word->__lambda_66105_21_line_380(Parser._parser.HTML(),"\n  \n",([]))          : Crawler 24352: pike/lib/modules/Parser.pmod/_parser.so:1: Parser._parser.HTML()->finish("<!DOCTYPE HTML PUBLIC \"-//W3C//DTD HTML 4.0 Transitional//EN\" \"http://www.w3.org/TR/REC-html40/loose.dtd\">\n<html> \n<head> \n<META HTTP-EQUIV=\"Content-Type\" CONTENT=\"text/html; charset=utf-8\">\n<META"+[5124697])
          : Crawler 24352: pike/lib/modules/Search.pmod/Filter.pmod/HTML.pmod:407: Search.Filter.Word->filter(URI("sitebuilder://dmrbackend/student/opleidingen/hbo-r/jaar-1-deeltijd/cova-samenwerken-en-proje-1/Stm_2010_HBO-Rechten.doc"),"<!DOCTYPE HTML PUBLIC \"-//W3C//DTD HTML 4.0 Tran"+[5124845],"text/html",([]),"utf-8")
          : Crawler 24352: modules/search/pike-modules/Search.pmod/Filter.pmod/Word.pmod:52: Search.Filter.Word->filter(@0,"ÐÏ\21ࡱ\32á\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0>\0\3\0þÿ\t\0\6\0\0\0\0\0\0\0\0\0\0\09\0\0"+[3717074],"application/msword")
17:27:00  : Crawler 24352: modules/search/programs/multiprocess_crawler.pike:496: /main()->low_do_filter("¶ke0\1\0\6\0data\u00867¶üÐÏ\21ࡱ\32á\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0>\0\3\0þÿ\t\0\6\0\0\0\0\0\0\0\0\0\0\09\0\0\0W\34\0\0\0\0\0\0\0\20\0\0Y\34\0\0\1\0\0\0þÿÿÿ\0\0\0\0\36\34\0\0\37\34\0\0 \34\0\0!\34\0\0\"\34\0\0#\34\0\0$\34\0\0%\34\0\0&\34\0\0'\34\0\0(\34\0\0)\34\0\0*\34\0\0+\34\0\0,\34\0\0-\34\0\0.\34\0\0/\34\0\u00000\34\0\u00001\34\0\u00002\34\0\u00003\34\0\u00004\34\0\u00005\34\0\u00006\34\0\u00007\34\0\0"+[3717525])
 0d 3h 9m : Crawler 24352: modules/search/programs/multiprocess_crawler.pike:376: /main()->got_data(0,"<\e\0\0=\e\0\0>\e\0\0?\e\0\0@\e\0\0A\e\0\0B\e\0\0C\e\0\0D\e\0\0E\e\0\0F\e\0\0G\e\0\0H\e\0\0I\e\0\0J\e\0\0K\e\0\0L\e\0\0M\e\0\0N\e\0\0O\e\0\0P\e\0\0Q\e\0\0R\e\0\0S\e\0\0T\e\0\0U\e\0\0V\e\0\0W\e\0\0X\e\0\0Y\e\0\0Z\e\0\0[\e\0\0\\\e\0\0]\e\0\0^\e\0\0_\e\0\0`\e\0\0a\e\0\0b\e\0\0c\e\0\0d\e\0\0e\e\0\0f\e\0\0g\e\0\0h\e\0\0i\e\0\0j\e\0\0k\e\0\0l\e\0"+[7998])
          : Crawler 24352: -:1: Pike.Backend(0)->`()(3600.0)
          : Crawler 24352: pike/lib/master.pike:3895: master()->_main(({"/opt/roxen/server-5.2.200/bin/roxen","-M","modules/search/pike-modules/","modules/search/programs/multiprocess_crawler.pike","single_process"}))
          : Crawler 24352: Exiting due to uncaught error.
Assignee
Assign to
Time tracking