ProjectWise Full Text Search - Separating the Wheat from the Chaff

As you may know, ProjectWise leverages the Microsoft Indexing Service to provide full text search capabilities. This approach has many benefits, far too many to discuss here, but one worth mentioning is the ability to filter out words that provide no value in text searches. We are unlikely ever to search for all files containing small words like "we", "the", or "it", so why include them in the index? Doing so would only degrade performance with nothing gained.

The good news is that the Microsoft engine excludes most of these words by default. However, you may wish to add words to the exception list, or maybe even remove one or two. As you might expect, exception lists are language specific, so there are actually multiple lists to maintain. Customizing one is a simple matter, though, since the lists are stored as plain text files.

On Windows Server 2003, exception lists are stored in %SystemRoot%\System32. In that location, you should find a series of files named Noise.yyy, where yyy is a three-letter code for the language in use. Noise.eng is the exception list for American English, and contains the following list by default:

about, 1, after, 2, all, also, 3, an, 4, and, 5, another, 6, any, 7, are, 8, as, 9, at, 0, be, $, because, been, before, being, between, both, but, by, came, can, come, could, did, do, does, each, else, for, from, get, got, has, had, he, have, her, here, him, himself, his, how, if, in, into, is, it, its, just, like, make, many, me, might, more, most, much, must, my, never, now, of, on, only, or, other, our, out, over, re, said, same, see, should, since, so, some, still, such, take, than, that, the, their, them, then, there, these, they, this, those, through, to, too, under, up, use, very, want, was, way, we, well, were, what, when, where, which, while, who, will, with, would, you, your, a b c d e f g h i j k l m n o p q r s t u v w x y z.
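If you would rather inspect the lists from a script than open each file by hand, the lookup can be sketched in a few lines of Python. This is only an illustration, not part of ProjectWise or the Indexing Service: the function names are made up for this example, and it assumes the words in a noise file are separated by whitespace and that the file can be read as single-byte text (the `encoding` default below is a guess you may need to change for other languages).

```python
import os
from pathlib import Path


def find_noise_files(directory=None):
    """Return {language_code: path} for every Noise.* file in `directory`."""
    if directory is None:
        # Default location taken from the article: %SystemRoot%\System32
        directory = Path(os.environ.get("SystemRoot", r"C:\Windows")) / "System32"
    found = {}
    for entry in Path(directory).iterdir():
        # Compare case-insensitively, as Windows file names are not case-sensitive
        if entry.is_file() and entry.stem.lower() == "noise" and entry.suffix:
            found[entry.suffix[1:].lower()] = entry
    return found


def read_noise_words(path, encoding="latin-1"):
    """Read a noise file as a set of lowercase words.

    Assumption: entries are separated by whitespace (spaces or newlines).
    """
    text = Path(path).read_text(encoding=encoding)
    return {word.lower() for word in text.split()}
```

Calling `find_noise_files()` with no argument looks in the System32 folder described above; passing a directory lets you try it against a copy of the files first.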

Keep in mind, though, that these lists are applied as each file is processed. Any file indexed before your modifications remains indexed against the original exception list until it is processed again. So open up Notepad or your favorite text editor, and remove the noise from your searches!
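For anyone maintaining several servers, the same edit can be scripted instead of done in Notepad. The sketch below is a rough illustration under a few assumptions: `update_noise_file` is a hypothetical helper written for this post, words are treated as whitespace-separated, the file is read and written as single-byte text, and the result is written back one word per line in sorted order, which discards the original layout. It keeps a `.bak` copy of the original the first time it runs. Writing to System32 normally requires administrator rights.

```python
import shutil
from pathlib import Path


def update_noise_file(path, add=(), remove=(), encoding="latin-1"):
    """Add and remove words in a noise file, returning the new sorted list."""
    path = Path(path)

    # Keep the untouched original next to the file, e.g. Noise.eng.bak
    backup = path.with_name(path.name + ".bak")
    if not backup.exists():
        shutil.copy2(path, backup)

    # Assumption: entries are separated by whitespace
    words = {w.lower() for w in path.read_text(encoding=encoding).split()}
    words -= {w.lower() for w in remove}
    words |= {w.lower() for w in add}

    # Rewrite the list, one word per line
    path.write_text("\n".join(sorted(words)) + "\n", encoding=encoding)
    return sorted(words)
```

A call such as `update_noise_file(r"C:\Temp\Noise.eng", add=["draft"], remove=["will"])` against a copy of the file is a safe way to check the result before touching the live list. As noted above, files that were already indexed are not affected by the change.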