DPSS Data Set Loading Instructions

Lawrence Berkeley National Laboratory, Data Intensive Distributed Computing Project


This page contains instructions for impage Pyramid data sets onto the DPSS.

The DPSS can be used for several types of data such as images, pyramids of image tiles, video frames, and so on. Since the DPSS just deals with "blocks" of data, every application needs it's own program to load data a way that it will be able to retrieve the desired data based on a "block ID".

The programs loaddata is for image data.


Loading Tile Data (for TerraVision or ibrowse)

loaddata requires a configuration file of the following format:

tile_set: data_set_url
servers: server_name1  server_name2 .... 

For example, a file containing the following will load set ntc-1-utm-2scene (both DEM and OI) from a local disk onto servers 1, 2, and 3:

#          Tile Set Name            TSM URL
tile_set: file://localhost/home/data3/DPSS/tile-data/Pyramids/ntc-1-utm-2scene.33.dem 
tile_set: file://localhost/home/data3/DPSS/tile-data/Pyramids/ntc-1-utm-2scene.128.oi 
servers:  lbl-server1 lbl-server2 lbl-server3

To load data from one DPSS to another, set your DPSSHOST to the destination system, and use the x-dpss:// URL for the source data set location. This example will load data from a DPSS system who's master is on host iss4.lbl.gov to servers 1 and 3.

#          Tile Set Name            TSM URL
tile_set: x-dpss://iss4.lbl.gov/ntc-1-utm-2scene.oi
tile_set: x-dpss://iss4.lbl.gov/ntc-1-utm-2scene.dem
servers:  lbl-server1 lbl-server3

To run loaddata, do:

	loaddata -i config_filename

This program loads tiles from the Tile Set Manager (TSM), from Yvan LeClerc at SRI. Data for use by TerraVision or ibrowse must be tiled and formatted in a very specific way, with a header format defined by SRI. The TSM loads data from a given URL. The following are all valid URL's for the TSM:

	x-dpss://iss4-atm.lbl.gov
	file://localhost/home/itg2/SRI-tiles/
	http://www.ai.sri.com/~magic/Tile_Sets

x-dpss means to load from another DPSS, file means load from local disk, and http is for loading from an http server.

The fastest way to load a set is using the "x-dpss" url type. Here is a list of currently available DPSS's to load from. Contact (BLTierney@lbl.gov) if you have trouble loading data from these servers. This method may not work unless you have a direct ATM connect to the MAGIC network.

If you can't connect to an DPSS, you can also load via http, but this is much slower. Here is list of http servers for loadding data.

If you have a tiled data set on your local disk, you can use the file:// URL. This will be much faster than http://, and about half the speed of x-dpss://. The file: type should point to a directory which contains the following subdirectories: GeoPyramids and Pyramids; and the GeoPyramids directories must contain a file GeoPyramidUrls which contains the names of the tile sets in the Pyramids directory. This is the structure required by the TSM library.

For a list of of currently available image sets, set your DPSSHOST to iss-1.edc.magic.net or iss-4.lbl.magic.net, and run dsm_list .

When running loaddata, be sure to set your DPSSHOST environment variables to the system you are sending data to, not getting data from.

TerraVision requires two types of data for each data set to be loaded. These are "OI" (ortho-image) sets, "DEM" (digital elevation map) sets. ibrowse only requires the "OI" data. For more info on the SRI data set format see the SRI Tile Set Information Page .


Loading Image Data

The loaddata program also works with raw image data. You just use the "data_set:" keyword instead of the "tile_set:" keyword in the configuration file. For example:
#   data_set:   location   filename   width height bytes/pixel
#for example:
#data_set: /home/itg11/data/image-data/Yosemite  Yosemite-1  6420 7710 1

For loading EDC DDR images, use the "edc_image_set" keyword. For example:

edc_image_set: /home/itg11/data/image-data/iowa iowa_small
For this type of image, the image size is read from the "filename.ddr" header file.

Checking if the load was successful

Run "dsm_list -D", and check that the set is "LOADED", and that the server names are as you expect.


If you are having trouble loading data, run loaddata with the options "-d -v" (debug and verbose), and send the output to BLTierney@lbl.gov.
COPYRIGHT NOTICE: Distributed Parallel Storage System and DPSS/ISS are Copyright © 1994 by Lawrence Berkeley Laboratory


This page is the resonsiblity of Brian L. Tierney. (BLTierney@lbl.gov). Support Credits are here. This document and its uses are subject to LBL's disclaimers and legal notices.

Credits: The research and development of the Distributed Systems Department i s funded by the U.S. Dept. of Energy, Office of Science, Office of Advanced Scie ntific Computing Research, Mathematical, Information, and Computational Sciences Division

If there is a problem with this page please, e-mail webmaster@www-itg.lbl.gov.


This page last modified: Sunday, 09-May-1999 18:48:33 PDT