GNOME Bugzilla – Bug 303324
on restart, move not indexed directories to head of uncrawled list
Last modified: 2007-01-13 20:29:12 UTC
When beagle is stopped for any reason in the middle of indexing and restarted later, it will recrawl all directories methodically ignoring that it still has directories that weren't even indexed once, and so they're content can't be searched. That could be a problem when the number of directories/files to crawl is very large (hundreds and up) My request is to move all the not yet indexed(dirty) directories to the head of the uncrawled directories list, so that beagle will first finish indexing and only then check the rest of the directories for changes.
There is no way to distinguish between a directory which has not been indexed and a directory in which all the files have been changed. While crawling, beagle chooses the directory with the oldest last_modified_time as the next directory to crawl. Thats the best that can be done.