A text mining system for collecting business intelligence about a client,
as well as for identifying prospective customers of the client, for use
in a lead generation system accessible by the client via the Internet.
The text mining system has various components, including a data
acquisition process that extracts textual data from Internet web sites,
including their logs, content, processes, and transactions. The system
compares log data to content and process data, and relates the results of
the comparison to transaction data. This permits the system to provide
aggregate cluster data representing statistics useful for customer lead
generation.