References on Reliability and Robustness in Grid Computing

This is the original list of references provided to the research group when it was first chartered.

 

L. Alvisi and et. al., "Wrapping Server-Side TCP to Mask Connection Failures," in INFOCOM 2001, 22-26 April 2001, vol. 1, pp. 329-337.

 

A. Avizienis, J. C. Laprie, B. Randell, and C. Landwehr, “Basic Concepts and Taxonomy of Dependable and Secure Computing”, IEEE Transactions on Dependable and Secure Computing, Vol.1, No. 1, January-March 2004.

 

N. Bartolini, F.L. Presti, and C. Petrioli, "Optimal Dynamic Replica Placement in Content Delivery Networks," in The 11th IEEE International Conference on Networks, ICON 2003, 2003, pp. 125-130.

 

S. Chandrasekaran and S. Madden and M. Ionescu, "Ninja Paths: An Architecture for Composing Services over Wide Area Networks", CS262 class project writeup, UC Berkeley (2000) .

 

M. Chen, E. Kiciman, E. Fratkin, A. Fox, and E. Brewer, “Pinpoint: Problem Determination in Large, Dynamic Internet Services”, Proceedings of 2002 International Conference on Dependable Systems and Networks (DSN), IPDS track, Washington, DC, June 23-26, 2002 .

 

Y. Chen, R. H. Katz, and J. Kubiatowicz, "Dynamic Replica Placement for Scalable Content Delivery", 1st International Workshop on Peer-to-Peer Systems (IPTPS'02).

 

Y.S. Dai, M. Xie, and K. L. Poh. “Reliability analysis of grid computing systems”, Proceedings of the 2002 Pacific Rim International Symposium on Dependable Computing (PRDC 2002), IEEE Computer Society Press, pp 97-103, 2002.

 

P. Felber, et al. Failure Detectors as First Class Objects, Proceedings of the International Symposium on Distributed Objects and Applications (DOA’99), IEEE Computer Society Press, September 5-7, 1999, p. 132.

 

A. Iamnitchi and I. Foster, "A problem specific fault tolerance mechanism for asynchronous, distributed systems," in Proceedings of 2000 International Conference on Parallel Processing (29th ICPP'00), Toronto, Canada, August 2000, IEEE.

 

S. Frolund et al. “Building Dependable Internet Services with E-speak”, Proceedings of the Workshop on Dependability of IP Applications, Platforms, and Networks, June 26, 2000, held in conjunction with the 2000 International Conference on Dependable Systems and Networks, IEEE Computer Society, and also available as Hewlett-Packard Labs Technical Report 2000-78.

 

Z. Juhasz, A. Andics, and S. Pota, “Towards A Robust And Fault-Tolerant Discovery Architecture For Global Computing Grids”, Scalable Computing: Practice and Experience, Volume 6, Number 2, pp. 22-33. 2003.

 

P. Keyani, B. Larson, and M. Senhil, “Peer Pressure: Distributed Recovery from Attacks in Peer-to-Peer Systems”, in Web Engineering and Peer-to-Peer Computing, Gregori, E. et al. (eds.), NETWORKING 2002 Workshops, Pisa, Italy, May 19-24, 2002, Revised Papers, Lecture Notes in Computer Science 2376 Springer 2002, ISBN 3-540-44177-8, pp. 306-320

 

J. Lan, Cache Consistency Techniques for Peer-to-Peer File Sharing Networks, Master’s Thesis, Department of Computer Science, University of Massachusetts Amherst, June 26, 2002.

 

B. Lee and J. B. Weissman, "An adaptive Service Grid Architecture Using Dynamic Replication," in IEEE 2nd International Workshop on Grid Computing, November, 2001.

 

H. M. Lee and et. al., "Grid Fault Tolerance Service for Quality of Service", The 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGrid 2003), 2003.

 

D. Liang et al., “A Fault-Tolerant Object Service on CORBA,” The Journal of Systems and Software, vol. 48, 1996

 

C. Marchetti, A. Virgillito, and R. Baldoni, “Design of an Interoperable FT-CORBA Compliant Infrastructure,” Proceedings of the European Research Seminar on Advances in Distributed Systems (ERSADS), 2001.

 

D. S. Milojicic, F. Douglis, Y. Paindaveine, R. Wheeker, and S. Zhou, "Process Migration Survey," ACM Computing Surveys, September, 2000.

 

L. Qiu, V.N. Padmanabhan, and G. M. Voelker, "On the Placement of Web Server Replicas", Twentieth Annual Joint Conference of the IEEE Computer and Communications Societies - INFOCOM 2001, pp. 1587-1596.

 

K.  Ranganathan  and  et.  al.,   "Improving  Data  Availability  through Dynamic  Model-Driven  Replication  in Large  Peer-to-Peer Communities,"  in Global and Peer-to-Peer Computing on Large Scale Distributed Systems Workshop, Berlin, May 2002, p. 376.

 

P. Radoslavov, R. Govindam, and D. Estrin, "Topology-Informed Internet Replica Placement," in Sixth International Workshop on Web Caching and Content Distribution, 2001.

 

P. Stelling, I. Foster, C. Kesselman, C. Lee, and G. von Laszewski, “A Fault Detection Service for Wide Area Distributed Computations”, Cluster Computing, Vol. 2, No. 2, 1999, pp. 117-128.

 

X. Tang and J. Xu, "On Replica Placement for QoS-Aware Content Distribution," in INFOCOM 2004, 2004.

 

B. Urganonkar, et al. “Maintaining Mutual Consistency for Cached Web Objects”, Proceedings of the 21st International Conference on Distributed Computing Systems (ICDCS-21), Phoenix, Arizona, April 2001

 

D. C. Verma, et al. “SRIRAM: A scalable resilient autonomic mesh”, IBM SYSTEMS JOURNAL, VOL 42, NO 1, 2003, pp. 19-28

 

L. Valcarenghi, L. and C. Piero.  QoS-Aware Connection Resilience for Network-Aware Grid Computing Fault Tolerance”, Proceedings of 2005 7th International Conference on Transparent Optical Networks, July 3-7 2005, Barcelona Spain, Volume 1, pp. 417-422.

 

J. B. Weissman, "Fault Tolerant Wide-Area Parallel Computing," in IEEE Workshop on Fault-Tolerant Parallel and Distributed Systems, International Parallel and Distributed Processing Symposium IPDPS, May, 2000.

 

R. Wu, A. Chien, M. Hiltunen, R. Schlichting, S. Sen, “A High Performance Configurable Transport Protocol for Grid Computing”, CCGrid 2005.

 

V.C. Zandy, B.P. Miller, and M. Livny, "Process Hijacking," in The Eighth International Symposium on High Performance Distributed  Computing, 3-6 Aug. 1999, pp. 177-184.

 

X. Zhang, D. Zagorodnov, M. Hiltunen, K. Marzullo, and R. D. Schlichting, “Fault–tolerant Grid Services Using Primary–Backup: Feasibility and Performance”, Cluster 2004, San Diego, California, September 20-23 2004.