A method is provided for segmenting scanned image data in accordance with
mixed raster content processing, further including windowing for purposes
of extracting particularly classifiable objects, i.e., text, graphics or
pictures. Application of predetermined parameter sets selected for more
precise identification of the extracted objects are applied to the data
for object extraction, object selection, color uniformity, and clustering
for coloring similarity. A page background detection feature uses dual
thresholding for segregating text (darker) areas from non-text (brighter)
areas. Two histograms are created for the respective areas. Thresholds
are generated from the two histograms and applied to the respective areas
separately.