Techniques are disclosed, for use in a computer system including a plurality of processing units coupled over a system fabric, to identify a lockstep error associated with a first packet to be transmitted over the system fabric; set a viral indicator in the first packet to indicate the lockstep error; and transmit the modified packet over the system fabric.

 
Web www.patentalert.com

> Scalable method of continuous monitoring the remotely accessible resources against the node failures for very large clusters

~ 00390