SOURCES SOUGHT
70 -- Search Engine
- Notice Date
- 2/13/2003
- Notice Type
- Sources Sought
- Contracting Office
- Department of Agriculture, Office of Procurement and Property Management, Procurement Operations Division, Mailing: 1400 Independence Ave., S.W., Stop 9307 Location: 300 7th St., SW, Rm. 377, Reporters Building, Washington, DC, 20250
- ZIP Code
- 20250
- Solicitation Number
- RFI-3142-3-2001SD
- Archive Date
- 3/7/2003
- Point of Contact
- Sherri Davis, Contract Specialist, Phone (202) 720-8309, Fax (202) 720-4529,
- E-Mail Address
-
Sherri.Davis@usda.gov
- Description
- The RFI seeks information about potential sources for a commercial-off-the-shelf (COTS)search engine which will integrate into or interface with a Lotus Domino application on an IBM x370 Windows platform. The product can be, but is not required to be, a stand-alone product. The engine must have the capability to earch meta data as well as the text in documents stored in a Lotus Domino (Domino.Doc) environment. The search engine must be able to search for text in documents in standard work processing and spreadsheet formats and within PDF files; however, the capablity to search any database or file format is desirable. Many of the docments stored in the system are tiff files produced by scanning paper documents. The current document management application produces a text version of the documents from these tiff images through the use of an optical character recognition versions of the tiff images (scanned documents). The search engin must be capable of "fuzzy seraches" in order to maximize hits when searching the uncorrected text produced by the OCR engine. This "contaminated data" typically has a large number of errors per page (i.e. misspelled words) that limit the success of searches dependent on exact string matching. A search engine limited to exact string matching is not adequate (even if wild card are supported). The results of a text search should be displayed as a list of "hits" (i.e., occurrences of matches to the search criteria) that are displayed in context, that is, within a snippet of text surrounding each "hit." The results should be displayed in rank order with the highest ranking "hit" listed first. Each document should be displayed once even though there are multiple hits within that document (or that should be a user-selected option). Along with each snippet of text should be a link t the actual file. The link should open the document in a viewer or the native application. When the target document is viewed, the "hits" within that document should be suitably displayed or highlighted, and there should be a provision for the user to step through the hits in a document using a "find next" button. The engine should have the ability to rank the results of the search using internal algorithms and/or criteria provided by the user at time of search. Typical user criteria would include the use of standard Boolean operators and wild cards. A "sound like" for text and "pattern recognition" for a graphics document are examples of desirable capabilities. RESPONSES SHOULD INCLUDE: 1) Technical documentation limited to twenty (20) pages, which addresses each of the above characteristics and any other important technical aspects of the product. 2) Commercial price list or GSA schedule for license and maintenance fees software. 3) Points of contact (including name, email address, phone number, and fax number) from three customers currently using the identified product. This is an RFI and not a request for comment or request for proposals. Nothing shall be construed herein or through the RFI process to commit or obligate the Government to further action as a result of this RFI. Firms responding to this RFI shall bear all risk and expense of any resources used to provide the requested information, and all information submitted in response to this request shall become the property of the Government and will not be returned to the submitter. To be considered, responses must be received within seven (7) calendar days of the publication of this announcement. Written responses to this notice are to be directed to Sally Soule; Strategic Planning & Support Branch; National Information Technology Center; 8930 Ward Parkway; Kansas City, Missouri 64114-3363. Telephone inquiries will not be accepted.****
- Place of Performance
- Address: National Information Technology Center, 8930 Ward Parkway, Kansas City, Missouri 64114-3363
- Record
- SN00258867-W 20030215/030213213354 (fbodaily.com)
- Source
-
FedBizOpps.gov Link to This Notice
(may not be valid after Archive Date)
| FSG Index | This Issue's Index | Today's FBO Daily Index Page |