Services Products Purchase Free Trial Partner Support About LoopIP Contact Us Home

Tutorial 4 > Next tutorial

NRS Tutorial: Crawl and index ODP categories

This tutorial applies to NRS Enterprise

In order to crawl and index an ODP category, you must create a collection. In this example we will setup a collection to crawl and index the ODP category for all publicly traded companies.

Step 1: Create a collection

In admin under "Crawler|Manage Collections", use the add collection form, select 'category' for type, and enter an arbitrary name, say 'public companies'. Click 'Add'.

Step 2: Edit the collection

Click on your new collection name. You can now edit the properties of the collection. Click the 'category picker' link, and in the filter box, type 'publicly traded' and press enter. Select the checkbox for the 'publicly traded' category, and click 'Add Categories'. Then click the 'Back to Collection' button.

You have now setup a collection that will by default crawl all links found in the publicly traded companies category. Every link will be crawled as well as any links found on the pages that point to pages that share the same path. This will ensure that all crawled pages under every link in the category is relevant.

You can consult the online help in admin for more details on the other properties of collections.

Step 3: Start the Crawl

Click on the crawler tab, and click the 'Start' button next to start crawl. The crawl will start in a minute or so. By repeatedly clicking on the crawler tab you can find out the status of the crawl. The crawl will probably take hours or days to complete. You can stop it at any time. By default the crawler is set to run every night at midnight for 5 hours.

For the sake of this tutorial, after 20 minutes or so you might want to stop the crawl and continue to the next step. The crawl can take up to 2 minutes to stop.

Step 4: Index full-text

Click on the crawler tab, and click on the 'Start' button next to reindex full text. The full text indexing process can take anywhere from an hour or so, to several days if you have millions of documents. You can keep tabs on the progress by clicking on the crawler tab. Another way to see what it happening is to look at the system log file found under "Webserver|System log" or in the NRS application directory.

Step 5: Create a template

After finishing the full-text indexing process you need to create a template in order to search against the index. Under "Crawler|Manage Collections|Create Template", select your collection and click 'Add'. Now click on the 'Templates' tab and you'll see your new template. Click on it and try some searches.

FYI: Only templates of 'type' directory let you search against a full-text index. In the properties of directory templates, you need to select the 'Full-text index' checkbox, and specify for roots one or more categories that have been crawled and full-text indexed.

Back to support



 
LoopIP search
Web search
Net Research Server
Net Research Server - demonstration website
visit Net Research Server - demonstration website
Demo Links
Web Search
Shopping Engine
Local Search
Directory
Metasearch
Enterprise
Wiki
Integration

Copyright © 2008 LoopIP LLC. All rights reserved | Terms | Privacy