Tutorial 4 > Next tutorial
NRS Tutorial: Crawl and index ODP categories
This tutorial applies to NRS Enterprise
In order to crawl and index an ODP category, you
must create a collection. In this example we will
setup a collection to crawl and index the ODP
category for all publicly traded companies.
Step 1: Create a collection
In admin under "Crawler|Manage Collections",
use the add collection form, select 'category'
for type, and enter an arbitrary name, say 'public
companies'. Click 'Add'.
Step 2: Edit the collection
Click on your new collection name. You can now edit
the properties of the collection. Click the 'category
picker' link, and in the filter box, type 'publicly
traded' and press enter. Select the checkbox for
the 'publicly traded' category, and click 'Add
Categories'. Then click the 'Back to Collection'
button.
You have now setup a collection that will by default
crawl all links found in the publicly traded companies
category. Every link will be crawled as well as
any links found on the pages that point to pages
that share the same path. This will ensure that
all crawled pages under every link in the category
is relevant.
You can consult the online help in admin for more
details on the other properties of collections.
Step 3: Start the Crawl
Click on the crawler tab, and click the 'Start' button
next to start crawl. The crawl will start in a
minute or so. By repeatedly clicking on the crawler
tab you can find out the status of the crawl.
The crawl will probably take hours or days to
complete. You can stop it at any time. By default
the crawler is set to run every night at midnight
for 5 hours.
For the sake of this tutorial, after 20 minutes or
so you might want to stop the crawl and continue
to the next step. The crawl can take up to 2 minutes
to stop.
Step 4: Index full-text
Click on the crawler tab, and click on the 'Start' button
next to reindex full text. The full text indexing
process can take anywhere from an hour or so,
to several days if you have millions of documents.
You can keep tabs on the progress by clicking
on the crawler tab. Another way to see what it
happening is to look at the system log file found
under "Webserver|System log" or in the
NRS application directory.
Step 5: Create a template
After finishing the full-text indexing process you need
to create a template in order to search against
the index. Under "Crawler|Manage Collections|Create
Template", select your collection and click
'Add'. Now click on the 'Templates' tab and you'll
see your new template. Click on it and try some
searches.
FYI: Only templates of 'type' directory let you search
against a full-text index. In the properties of
directory templates, you need to select the 'Full-text
index' checkbox, and specify for roots one or
more categories that have been crawled and full-text
indexed.
Back to support
|