US 7,392,320 B2
Segmentation and processing of continuous data streams using transactional semantics
Lawrence A. Bookman, Weston, Mass. (US); David Albert Blair, Wayland, Mass. (US); Steven M. Rosenthal, Lexington, Mass. (US); Robert Louis Krawitz, Chestnut Hill, Mass. (US); Michael J. Beckerle, Needham, Mass. (US); Jerry Lee Callen, Cambridge, Mass. (US); Allen M. Razdow, Cambridge, Mass. (US); and Shyam R. Mudambi, Wayland, Mass. (US)
Assigned to International Business Machines Corporation, Armonk, N.Y. (US)
Filed on May 14, 2004, as Appl. No. 10/846,342.
Application 10/846342 is a continuation of application No. 09/596482, filed on Jun. 19, 2000, granted, now 6,801,938.
Claims priority of provisional application 60/140005, filed on Jun. 18, 1999.
Claims priority of provisional application 60/185665, filed on Feb. 29, 2000.
Prior Publication US 2005/0021745 A1, Jan. 27, 2005
This patent is subject to a terminal disclaimer.
Int. Cl. G06F 15/16 (2006.01)
U.S. Cl. 709—231  [709/203; 709/219; 709/223; 709/229; 714/15; 718/101] 16 Claims
1. A method for processing a continuous stream of transactional data, the method comprising:
applying transactional semantics to the continuous stream of transactional data to identify a plurality of segments of the continuous stream of transactional data, wherein the continuous stream of transactional data is from at least one transaction in progress, and wherein each of the plurality of segments contains transactional data,
processing the transactional data in each of the plurality of segments of the continuous stream of transactional data to produce results for each of the plurality of segments; and
in response to processing the transactional data in one of the plurality of segments to produce results for one of the plurality of segments, outputting the results produced for the one of the plurality of segments to a storage device,
wherein the continuous stream of transactional data is a log of information relating to a plurality of requests issued to a server, and wherein the applying step comprises reading information relating to one of the plurality of requests from the log of information; and applying the transactional semantics to the information,
wherein the information relating to the one of the plurality of reguests includes a plurality of fields, and wherein the transactional semantics are defined by a function of the plurality of fields, and
wherein the transactional data in the continuous stream of transactional data has a sequence, and wherein there are multiple sources of the continuous stream of transactional data,
determining whether a particular portion of transactional data in the continuous stream of transactional data is in sequence, and
if the particular portion of transactional data is determined to be out of sequence, interrupting the processing step, inserting the particular portion of transactional data in a particular segment in the plurality of segments according to the transactional semantics, reprocessing the particular segment, and continuing the processing step.
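The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. The record fields (timestamp, client, url), the hourly segmentation function, the per-client request count, and the in-memory dictionary standing in for the storage device are all assumptions made for illustration. The sketch assigns each log record to a segment by a function of its fields, outputs a segment's results when a record for a later segment arrives, and, when a record arrives out of sequence, inserts it into its segment, reprocesses that segment, and continues.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional


@dataclass(frozen=True)
class LogRecord:
    # Hypothetical fields for one server request; the claim only
    # requires "a plurality of fields".
    timestamp: int  # seconds since an arbitrary epoch
    client: str
    url: str


def hourly_segment(record: LogRecord) -> int:
    """Assumed transactional semantics: a function of the record's
    fields that maps the record to a segment id (the hour it falls in)."""
    return record.timestamp // 3600


def count_by_client(records: List[LogRecord]) -> Dict[str, int]:
    """Assumed per-segment processing: number of requests per client."""
    counts: Dict[str, int] = {}
    for r in records:
        counts[r.client] = counts.get(r.client, 0) + 1
    return counts


@dataclass
class SegmentProcessor:
    semantics: Callable[[LogRecord], int] = hourly_segment
    process: Callable[[List[LogRecord]], Dict[str, int]] = count_by_client
    segments: Dict[int, List[LogRecord]] = field(default_factory=dict)
    # In-memory stand-in for the storage device named in the claim.
    storage: Dict[int, Dict[str, int]] = field(default_factory=dict)
    current: Optional[int] = None

    def _output(self, seg_id: int) -> None:
        # Process (or reprocess) one segment and store its results,
        # overwriting any earlier results for the same segment.
        self.storage[seg_id] = self.process(self.segments[seg_id])

    def feed(self, record: LogRecord) -> None:
        seg_id = self.semantics(record)
        if self.current is not None and seg_id < self.current:
            # Out of sequence: insert into the segment the semantics
            # select, reprocess that segment, then continue as before.
            self.segments.setdefault(seg_id, []).append(record)
            self._output(seg_id)
            return
        if self.current is not None and seg_id > self.current:
            # A later segment has started, so the current one is complete.
            self._output(self.current)
        self.current = seg_id
        self.segments.setdefault(seg_id, []).append(record)

    def close(self) -> None:
        # Output the segment still open when the input ends.
        if self.current is not None:
            self._output(self.current)
```

In this sketch, records merged from multiple sources would be passed to feed() one at a time. A record whose segment id is lower than that of the segment currently being filled is treated as out of sequence, which is only one possible reading of "in sequence" in the claim.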