
Arch Search Engine 1.6b
Arch is an open source extension of Apache Nutch (a popular, highly scalable general-purpose search engine) for intranet search. Not happy with your corporate search engine? That is not surprising; very few people are. To the best of our knowledge, no intranet engine works as well as Google's global Web search. There is a fundamental reason for this: the algorithms that Google and similar engines use on the global Web do not work nearly as well on intranets, because intranets lack the statistical data those algorithms rely on. Arch (finally!) solves this problem: it uses a novel method to deliver high-precision search results. Don't believe it? Blind test evaluation tools are included, so you can deploy Arch and compare its performance to your current search engine and/or Google (on the public part of your site) using a blind test methodology.
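To make the blind test idea concrete, here is a minimal sketch of how such a comparison could be run. It is an illustration only, not the evaluation tool shipped with Arch; the function names `blind_pair` and `tally` and the "A"/"B" engine labels are hypothetical. The two engines' result lists are shown in random left/right order, the judge votes without knowing which is which, and the votes are unblinded afterwards.

```python
import random


def blind_pair(results_a, results_b, rng):
    """Return both result lists in random left/right order, plus the hidden key."""
    flipped = rng.random() < 0.5
    left, right = (results_b, results_a) if flipped else (results_a, results_b)
    # The key maps each screen position back to the engine that produced it.
    key = {"left": "B" if flipped else "A", "right": "A" if flipped else "B"}
    return left, right, key


def tally(votes):
    """Unblind the votes; each vote is (key, choice), choice in 'left'/'right'/'tie'."""
    counts = {"A": 0, "B": 0, "tie": 0}
    for key, choice in votes:
        # Any choice that is not a screen position is counted as a tie.
        counts[key.get(choice, "tie")] += 1
    return counts
```

In use, the judge sees only `left` and `right` for each query; the `key` is stored separately and applied only when the votes are counted.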
In addition to the excellent search quality, Arch has many features critical for corporate environments:
- Document level security. Users can find only documents that they are authorized to see.
- Inexpensive index updates. Arch is able to keep indexes up to date and avoid regular complete site recrawling.
- 24/7 availability. There is always a working index available, even if a crawl fails.
- Support for simultaneous indexing and search of multiple web sites, with ability to search and administer any site separately, if needed. Dynamic adding and removal of web sites is easy.
- An automatically generated site directory.
- Low cost support once deployed.
- Dual interface (PHP and Java) for easy deployment and customization.
- Faceted search "out of the box".
- An extensive and extensible set of parsers for a variety of file formats: HTML, PHP, PDF, MS Office, Open Office, etc.
- A modular, plugin-based architecture that can be easily customized and extended.
- The source code is included.
- High performance and scalability. Arch can run on computer clusters to index very large data sets.
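As an illustration of the document-level security item in the list above, the following sketch filters search hits by group membership before they are shown to the user. It does not reflect how Arch actually implements security; the `Doc` type, its `allowed_groups` field, and the `visible_results` function are hypothetical names chosen for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Doc:
    url: str
    allowed_groups: frozenset  # groups permitted to see this document (assumed model)


def visible_results(hits, user_groups):
    """Keep only the hits that share at least one group with the user."""
    groups = set(user_groups)
    return [doc for doc in hits if doc.allowed_groups & groups]
```

For example, a user in the `staff` group would see a document restricted to `staff` and `hr`, but not one restricted to `board` only.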
The arch-src.zip file was thoroughly tested by our system on Nov 13, 2012 with three antivirus programs and passed all of them. It is absolutely clean, enjoy!
This archive is 100% safe to download and install.
Have a look at the full Arch Search Engine 1.6b antivirus scan reports.
Download an alternative to Arch Search Engine
Look at the free or trial alternatives and similar apps to Arch Search Engine, listed by tag. You can also find substitutes for the most popular titles in the Internet & Network category.
| Web Site Search | Search Engine | Open Source | Nutch | Java Search Engine | Intranet Search Engine | Full Text Search | Corporate Search Engine | Company Search Engine |
History updates (Complete changelogs since the listing on this site)
1.6b [04-17-13]
Other versions : 1.43 1.42 1.41 1.4 1.4b2 1.4b 1.23 1.22 1.21 1.2
v1.43 [06-29-12]
Added a GUI-based configuration manager. Fixed known bugs.
v1.42 [06-26-12]
Added a GUI-based configuration manager. Fixed known bugs.
v1.41 [05-08-12]
Added a GUI-based configuration manager. Fixed known bugs.
v1.4 [03-18-12]
Ported to Nutch 1.4 and tuned query precision
v1.4b2 [01-17-12]
Ported to Nutch 1.4
v1.4b [12-24-11]
Added Windows and Cygwin compatibility.
v1.23 [03-24-11]
Added Windows and Cygwin compatibility.
v1.22 [12-24-10]
Added an alternative way to switch to the new index after a recrawl (using an HTTP request); fixed a bug in the reuse of old index segments.
v1.21 [12-01-10]
Ported to the latest release of Apache Nutch 1.2, changed the default set of parsers to achieve better document parsing, added email notifications, added bookmark indexing, fixed automatic switching to the new index, fixed a few minor bugs.
v1.2 [10-06-10]
Ported to Apache Nutch 1.2, upgraded the PDF parser, added test and tuning tools, resolved issues found in the beta version, enabled use of computer clusters.
Predicted future versions and notices:
doDownload.com constantly monitors updates to all programs, including information from the Arch Search Engine 1.6b changelog file; however, the data may sometimes be incomplete or outdated. We assume that the author continues to develop version 1.6c with further advanced features, and you will be informed soon. We will also continue to monitor any 2.0 upgrades of the program. The full Arch Search Engine description has been compared with the overall software database, and our algorithm has found the following applications (shown below).
(16.04MB, Extension: ZIP)
