CATEGORIES
Internet & Network
Lookup & Search Programs
Current Highlights
Follow us on Facebook
 

Web Data Extractor Pro 3.8

Web Data Extractor Pro is a web scraping tool specifically designed for mass-gathering of various data types. It can harvest URLs, phone and fax numbers, email addresses, as well as meta tag information and body text. Special feature of WDE Pro is custom extraction of structured data. This high-speed and multithreaded program works by using a keyword into search engines, by spidering a website or a list of URLs from a file. You can also allow it to follow external links from the original pages, with the capability to go as deep into the URL paths as you need and actually search the entire Internet. Web Data Extractor is superior for harvesting structured information and specific data types related to the keywords you provide by searching through multiple layers of websites.

User's rating:

  • Currently 2.62/5
  • 1
  • 2
  • 3
  • 4
  • 5
Enlarge the screenshot of Web Data Extractor Pro
[ Enlarge Image ]
Download 7.92MB Web Data Extractor Pro

Download Direct

(7.92MB, Extension: EXE)

Download alternate to Web Data Extractor Pro solution

Look at the free or trial alternatives and similar apps to Web Data Extractor Pro software by the tags. It's possible also to find substitutes for the most popular titles in the Internet & Network category.

| Website Spider | Url Extractor | Shareware | Meta Tag Extractor | Link Extractor | Extraction | Email Extractor | Data Mining | Data Extractor |

History updates (Complete changelogs since the listing on this site)

3.8 [12-29-17]

Added ability to load and extract information from PDF files;Added ability to load the license file directly from the UI form, when the trial period of using the program expires. Alternatively, the license file can be uploaded from the Options -> About form if the trial period has not yet expired;Significantly improved work through the proxy servers;Parser of encoded JS-emails has been improved;The context menu item "Re-start URL" was added to the "Bad URLs" list;Improved work with the software internal data repository;Added the ability to delete sessions along with all it’s data and the service files, also software automatically compress the internal repository of the program to reduce the required disk space;Added "Initial Referrer" text field in UI. Some websites may display different information depending on which external site they come from. The "Initial Referrer" field allows you to specify the web address of such a site;We also made various minor changes and improvements based on feedbacks from our customers

Other versions : 3.7 3.6 3.5 3.4 3.3 3.2 3.1 3.0 2.3 2.2 2.1

v3.7 [03-02-17]

Improved work of "Search Engines" mode;Improved "Remove HTML Tags" and "Page must contain the following text to extract data" filters;Added "Use country IP filter" filter which allows to exclude results of servers which does not related (by geolocation) to country selected in "Search Engines» option;Significantly improved email parser and «Custom Builder» parser;General improvements in data detection and extraction;We also made various minor changes and improvements based on feedbacks from our customers

v3.6 [08-24-16]

Added checkbox "Get redirected URL" on the "Custom Data Editor" form to extract urls (e.g. website addresses) that are presented through a redirect; Added checkbox "Mark Non-Responding Proxies Like Inactive Automatically". If during the session proxy server determined as «bad» (not working), it is automatically marked as inactive, and it’s not used in the session; Added new option "Use single line merge" to merge data into a single string. For example, you can export t-shirt colors like: "T-Shirt", "Black, Yellow, Red, Green»; Significantly improved loading of public proxy servers from the Internet; "Human Factor" option has been improved; Improved a parser of closed by JS email adresses; Improved option of passing Google-captcha when searching data via Google; We also made various minor changes and improvements based on feedbacks from our customers

v3.5 [10-28-15]

Significantly improved mechanism of searching data through search engines (added a mechanism to work with Google captcha etc.); Added the ability to capture cookies (new button «Capture Cookie») and run a session with cookies (it is very useful in cases where the parameters of the search forms through cookies); Added ability to import a proxy servers from the service where laid out fresh proxies every 30 minutes. Imports about 100-140 proxies. Each new import changes the earlier downloaded list. During the session, the server which became 100% inoperative, will automatically become inactive so in the list remain only actual servers; Added a new parser to decrypt hidden by javascript email addresses; Revised and improved server errors handling, which has a positive impact on work through proxy servers; Fixed email/fax adresses parser; Various minor improvements

v3.4 [09-03-15]

Improved parser of javascript protected email adresses, added 2 new decoders; Improved algorithm for merge the data for export; Added checkbox "Add in results" in filter "URL Filter: Page must contain the following text to extract data". If you turn it on, then the results table will have with the keywords of this filter, that satisfy the search criteria when retrieving data; Improved parser of links, added case that cover not quoted links in the page sources; Software improved for work with large data; Improved export data mechanism; Improved filter mode "Url List" work; Added recognition of servers that do not support the issuance of uncompressed content and a form correct request to such servers; Added new search engine - IXQUICK; Various minor additions/fixes

v3.3 [05-05-15]

Improved parser of javascript protected email adresses; Improved handling of network errors. Now better recognized temporarily unavailable pages, for example due to high activity on the server; Added use of regular expressions in filters. To recognize a regular expression, please enclose it between the symbols "^" and "$"; Added detection of specific symbols of the German language in urls; Added "Recovery" button in the settings. It allows you to export all the collected data for the selected date range, even if the main database of the program has been damaged for some reason; Added the ability to export data to Excel file format; Greatly improved algorithm for traversing large sites containing millions and tens of millions links; Various fixes/additions based on your feedbacks

v3.2 [12-31-14]

Added an option “Remove duplicates” for phone and faxes; Fixed crashes in some cases when building in “Custom Data Editor”; Principe of extracting links and domains was changed (in case check-boxes “URLs” and “Domains” were marked in session's form) for “Search Engines” mode. Now these lists include urls on websites and their domains, which have sought-for keywords. Before this list consisted of all founded urls, which made it not useful in “Search Engines” mode; We have increased the maximum depth of search on websites from 10 till 100; SQLire library is upgraded to the last version; We have increased the stability of programs' work. Now in cases of abrupt computer reload or system's breaking, all datas collected during one session will be saved. Auto-saving works within 30 seconds gap; Added the possibility to search in local website copies on the disk. For example, using this way "c:inetpubwwwrootspadix. We can set it up as “Start URL” in “Site” mode as well as in the links file for “Url List” mode; We have improved parsing of emails, which are protected by java script, we have added algorithm for decryption of new kind of emails protection (the example was sent by one of our customers)

v3.1 [09-06-14]

Added the ability to edit url and email filters in stopped session, and then to continue with already edited filters. Added the ability to download the list of proxy serves from the text files (*txt). Also we added support of files with format like “host:port”. Added progress in per cents for requests. Now the list of requests updates very quickly. Added the name of proxy, through what the request to field “Title” is sending (for running requests only). Improved the dispatch on proxies – now with a big list of proxies it works much more efficiently.

v3.0 [06-24-14]

Added support of working with proxy servers' list, Reworked the algorithm for determining the depth of scan, Program sustainability to the physical damage of the database is added, Improved streams control, which has a positive impact on the overall performance, Improved work with a large list of keywords in "Search Engines" mode

v2.3 [01-08-14]

Reworked the algorithm for determining the depth of scan, Program sustainability to the physical damage of the database is added, Improved streams control, which has a positive impact on the overall performance, Improved work with a large list of keywords in "Search Engines" mode

v2.2 [05-15-13]

Reworked the algorithm for determining the depth of scan, Program sustainability to the physical damage of the database is added, Improved streams control, which has a positive impact on the overall performance, Improved work with a large list of keywords in "Search Engines" mode

v2.1 [01-09-13]

Reworked the algorithm for determining the depth of scan, Program sustainability to the physical damage of the database is added, Improved streams control, which has a positive impact on the overall performance, Improved work with a large list of keywords in "Search Engines" mode

Average review rating :

Useful independent reviews and opinions of the users

Review Web Data Extractor ProWrite a review « Be the first to post a review for Web Data Extractor Pro download!

Predicted future versions and notices:

The doDownload.com constantly monitors the update of all programs, including information from the Web Data Extractor Pro 3.8 changelog file, however sometimes it can happen that data are not complete or are outdated.We assume that author continue's to develop 3.9 version with further advanced features, and soon you will be informed. Equally important 4.0 upgrades of the program we will continue to monitor. Full Web Data Extractor Pro description has been compared with the overall software database and our algorithm has found the following applications (are showed below).