Frequently Asked Questions about Subject Search Scanner™

What does the "Subject Search (fuzzy)" method do?
 

"Subject Search" is an extremely intelligent method invented by Sergey Kryloff, a founder of Kryloff Technologies. Unlike traditional search techniques, this method is capable of finding relevant paragraphs even if the order of words in the query and the text being scanned are not exactly the same, the spelling varies, some words are not present or even misspelled! The "Subject Search" method is based on intensive mathematical investigations in the pattern recognition area Sergey began in 1992, and keeps on carrying out until now. To let you better understand the core idea of the method: during the search, words in the query and the documents are split on phonetic chunks; matched are not entire words but only some parts of them. This approach easily allows finding "Solution" for "solved", "Businessmen" for "business men" (two words are matched to one), etc. not only in English, but also in virtually any human language, and at a tremendously high speed! The "Subject Search" method does not make distinction between capital and small letters; for example, "A" and "a" (in English) are considered to be the same.

This method is especially useful when searching for information you are just getting familiar with. SSScanner accepts natural queries (entering something like "how can I rent a car in Italy" is OK), so it is the most powerful and the convenient one. Despite this fact, in a number of cases selecting one of the traditional "Literal Search" methods SSScanner™ also provides you with, may be preferable. For example, when you are looking for occurrences of a particular person's name, geographical location, or/and digits or numbers.


When I select the "Literal (regular)" method, a number of additional options becomes available. How do they affect my search results?
 

1. The literal search method "Entire phrase as is" selects documents containing your search phrase as you type it in including leading and trailing spaces, special characters, etc. For example, "Kryloff Technologies" gets matched to just "Kryloff Technologies", "KRYLOFF technologies", but not to "KRYLOFF   (two or more spaces or line breaks)   Technologies".

2. The literal search method "Same words in same order" selects documents containing all of the words in your phrase following each other and separated by one or several spaces, non-alphabetical letters or/and line breaks. For example, "Kryloff Technologies" is matched to:
a) "  Kryloff        Technologies  ",
b) "KRYLOFF (end of line)
   TECHNOLOGIES
", etc., but
c) not to "Kryloff Technology".
To match the words "Technology" and "Technologies", use the Subject Search method described above.

3. The literal search method "All words in any order" allows finding documents that contain just all words in your search phrase anywhere within the document. Enter entire words only using this method: it does not select "Kryloff" when you specify just Kryl. If you want to enter some parts of the words only (such as, word roots, etc.), use either the Subject Search method or, the following one.

4. "All patterns in any order" selects documents containing just all patterns (strings) in your Search Phrase in any order. Unlike the previous method, you may enter some part of the words only. As an example, the search phrase "Kryl  Tech" will be matched to "Kryloff  Technologies"; also selected will be any other documents containing, for example, both "Kryloff" and "Technical", "Kryloff" and "Technology", (anywhere in the text or document title), etc.

5. The literal search method "At least one word" selects documents containing at least one (any) word from your search phrase. Specify entire words only: this method does not select "Kryloff" for just Kryl. If you want to enter some parts of the words only (such as, word roots, etc.), use either the Subject Search method or, the following one.

6. And, finally, the method "At least one pattern" selects documents containing at least one (any) pattern (a string) you have specified in your search phrase. For example, issuing the query "Kryl  Tech" will cause SSScanner to select documents which contain either "Kryloff" or "Technologies", or both as well as "Technology", "Technical", etc.


What does the box "Case sensitive" mean?
 

By default, all methods described above are case insensitive: the patterns "Kryloff", "KRYLOFF" and "kryloff" are considered to be the same. To make distinction between uppercase and lowercase characters when searching for occurrences of your search terms, check the box "Case sensitive". Note, that even though some Asian languages include Cyrillic and Greek characters, SSScanner pairs capital and small characters for Latin alphabet only when searching in Asian languages case insensitively. As for the "Subject Search" method, it is always case-insensitive: no distinction between lowercase and uppercase characters is made.


Folder "Root" contains 3 subfolders: "S1", "S2" and "S3". How can I search in "S1" and "S3" only (excluding "S2") in one query?
 

Press the "Browse" button, then point to "Root\S1", and click "OK". After it, press "Browse", again; this time point to "Root\S3", hold down the <Ctrl> key, and click "OK".

As a general rule, when you want to add a new item (i.e., a folder or a file to scan) to the list of items in the "Search in" field, press and hold down <Ctrl> when clicking OK. To replace all items in the "Search in" field with a new one, just click OK. The semicolon character (;) serves as a separator between search items.


Can SSScanner search in folders I select in Windows Explorer?
 

Yes, it can, and this is probably the most convenient way to specify files and folders you want to search in. All you need is to select one or several files or/and folders, then click them using the right mouse button, and select the pop-up menu item "Scan with SSScanner ..." as it is shown on the picture:

How to launch SSScanner from Windows Expolrer
 

Furthermore, when SSScanner™ functions under Windows® Vista or Windows® 7, it places a small box for entering queries in the right upper corner of every window of Windows Explorer that appears. After pressing <Enter>, SSScanner™ immediately starts searching for your search phrase right in the folder that is currently open in Windows Explorer as it is shown on the picture:

Launching SSScanner in Vista Windows Expolrer

Additional information:

  • to disable balloon help messages in the Windows Explorer, select the SSScanner menu items "Help|Hide balloon help messages";
  • to disable the possibility to enter SSScanner queries in the Windows Explorer, uncheck the SSScanner menu item "Tools|Miscellaneous|Integrate into Windows Explorer".

  • I need to search in regular HTML ("*.HTM;*.HTML") files only. Why SSScanner selects files in .ZIP and other archives even though I do not specify "*.ZIP"?
     

    The information you enter in the "File types" field specifies files to be scanned both on your computer drives and those within archives; just consider archives to be drives as well. Unless you want to process archives as binary files, you do not need to enter "*.ZIP" to make SSScanner process archived files -- the product automatically selects archives and scans all enclosed files that match file types, dates, and attributes you've specified.

    If you want SSScanner to skip all or some particular archives, select the tab "Archives", click a desired archive type (i.e., "ZIP") using the right mouse button, and select the pop-up menu item "Never scan files within ZIP archives".

    Just in case, if you want SSScanner to search in ZIP archives on the binary level (treating .ZIP-files as a sequence of bytes and alphabetical letters), specify "*.ZIP" for the "File types" field having disabled processing files within ZIP archives in the SSScanner "Archives" tab.


    Along with regular files, some of my .ZIP files contain other archives, which in turn, contain other ones, etc. Can SSScanner scan all those archives as well?
     

    Yes, it can. Furthermore, this feature is completely unique to SSScanner. By default, the product processes enclosed .ARJ, .RAR;, .TAR, and .ZIP archives only; this is done on purpose to prevent SSScanner from wasting time "digging too deeply". But, in case of necessity, you may easily change the default settings, and make SSScanner recursively search within other nested archives (such as, .CAB files). Just do the following: activate the tab "Filters" on the SSScanner main window, then click any row using the right mouse button, and select the item "Properties" in the corresponding pop-up menu. Look at the .INI-file which should appear after the above actions, and make sure that the property "NESTED ARCHIVES=1". Then, add extensions of the desired archive types in the option "ENCLOSED EXTENSIONS".

    For example, you may enter:
    ENCLOSED EXTENSIONS=.ARJ;.CAB;.GZIP;.GZ;.JAR;.RAR;.TAR;.TAZ;.TBZ;.TGZ;.TPZ;.Z;.ZIP
    to make SSScanner recursively scan any or, just all archives that SSScanner finds within upper-level ones. Separate archive extensions by semicolons (";"), and start each one from the dot (".") symbol.


    When I instruct SSScanner to search using the "Subject Search" method, sometimes it mentions the same file twice or more in its reports. Why? Is there any way to get rid of such duplicates?
     

    Due to the fact that some files may contain answers to your questions in more than just one paragraph, by default, SSScanner selects and displays all such ones. If you want the product to report to you only with one (the best) paragraph from each file, select the tab "File Date and Size", and adjust the value of the field called "Skip".


    Can SSScanner select more than 1,000 quotations?
     

    No, it can't; otherwise, the average user would need spending hours looking at everything SSScanner might have found! The product just reports with no more than 1,000 best quotes (100, by default).


    Can SSScanner perform "batch" searches, i.e. a sequence of searches without my intervention? Does it provide some Application Program Interface using which software developers could create searching programs that would include the SSScanner functionality?
     

    SSScanner is a product for final-end users; therefore, it does not perform "batch" searches nor provides any API. Software developers may use another product of Kryloff Technologies called, Subject Search Sleuth™, which along with ready-to-work utilities for final-end users, also includes a comprehensive Application Program Interface (API). A trial version of this product is available for downloads from the Kryloff Technologies Web site at http://www.kryltech.com.


    When I run SSScanner for the very first time, it recommends installing Microsoft Outlook® to let it search in e-mail messages. But, I've already got Microsoft Outlook Express®, what's the difference?
     

    Microsoft Outlook and Outlook Express are different products: Outlook Express is a simple mailing application which comes with Windows whilst Microsoft Outlook is a part of Microsoft Office® suite of products (which also includes Microsoft Word, Excel, and some other applications). Apart from the fact that Microsoft Outlook is much more convenient for final-end user, it also allows SSScanner to scan e-mail messages in all of its folders, in any encoding tables, etc. Once you install and start using Microsoft Outlook as a replacement for Outlook Express, you'll immediately feel the difference -- SSScanner can search in thousands of its e-mail messages and profile stores per a second in any language of your choice thus providing the best possible search quality and performance.


    What the "Relevancy Percentage" field is for? When I zeroize it, SSScanner selects much more files, but the top hits appear to be the same.
     

    You are right on 100%: this field just allows to cut off files which are unlikely to contain relevant information. However, under some circumstances (for example, when SSScanner does not find anything but you are sure the answer must be somewhere "there"!), you may decrease the "Minimum Relevancy Percentage" field to include into the product report just everything SSScanner has found. Or, on the contrary, you might want to increase it in order to exclude hits you consider to be irrelevant.


    Does SSScanner create file indexes (or, index files)?
     

    Yes, it does (upon your request): along with searches the product performs "on the fly", it can search even faster by processing files in so-called, Virtual Folders. Virtual Folder is a collection of files, which are physically stored in one or several drives and folders either on your computer, your company's or your home local area network. When you add files to a virtual folder, SSScanner indexes them to allow your queries be performed faster; the product maintains indexes between sessions.

    To create, open and manipulate Virtual Folders, click the corresponding pane using the right mouse button. To search in a particular Virtual Folder, go to the tab "Search Phrase and Location", select "Virtual Folders" in the "Search in" drop-down list, after which click the "Browse" button and choose a desired folder.


    I am completely happy with Subject Search Scanner, but my company needs additional functionality such as, the possibility to search in database "memo" fields. Can Kryloff Technologies add this functionality for us?
     

    You are invited to send us a request by visiting the Kryloff Technologies Web site at http://www.kryltech.com. We shall consider it and come back to you with our ideas or/and pieces of suggestion.


    Subject Search Scanner™, SSScanner™, Subject Search Sleuth™, and SSSleuth™ are trademarks of Kryloff Technologies, Inc.
    Other products or companies mentioned in this document are copyright and/or trademarks of the respective companies.