Difference between revisions of "Bookkeeping"


Revision as of 11:19, 2 June 2008

Description of tables in BOOKKEEPING schema in the Flysafe database

FILES

The FILES table describes the status of all data files that have been processed or still need to be processed. This information is used by the bookkeeping service on the Flysafe website.

ID Unique file ID
filename Unique file name
file_type Type of file, e.g. MPR, ECMWF
file_size Size of file in bytes
status Status of file, e.g. new, processed, archived
filename_time Time associated with the filename, as a timestamp
time_in Timestamp when the file was first registered
time_changed Timestamp of the last status change
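
As an illustration of how these columns fit together, the following is a minimal sketch using Python and an in-memory SQLite database. The column types, the integer Unix timestamps, and the helper names register_file and set_status are assumptions made for this example; the actual Flysafe database definition is not shown on this page.

```python
import sqlite3
import time

# Hypothetical SQLite rendering of the FILES table. The column names follow
# the list above; the types and constraints are assumptions.
SCHEMA = """
CREATE TABLE files (
    id            INTEGER PRIMARY KEY,
    filename      TEXT NOT NULL UNIQUE,
    file_type     TEXT,
    file_size     INTEGER,
    status        TEXT NOT NULL DEFAULT 'new',
    filename_time INTEGER,
    time_in       INTEGER NOT NULL,
    time_changed  INTEGER NOT NULL
)
"""


def register_file(conn, filename, file_type, file_size, filename_time, now=None):
    """Insert a newly seen file with status 'new' and return its id."""
    now = int(time.time()) if now is None else now
    cur = conn.execute(
        "INSERT INTO files (filename, file_type, file_size, filename_time,"
        " time_in, time_changed) VALUES (?, ?, ?, ?, ?, ?)",
        (filename, file_type, file_size, filename_time, now, now),
    )
    return cur.lastrowid


def set_status(conn, file_id, status, now=None):
    """Change the status of a file and record when the change happened."""
    now = int(time.time()) if now is None else now
    conn.execute(
        "UPDATE files SET status = ?, time_changed = ? WHERE id = ?",
        (status, now, file_id),
    )


conn = sqlite3.connect(":memory:")
conn.execute(SCHEMA)
file_id = register_file(conn, "example.mpr", "MPR", 1024, 1212400000, now=1212400100)
set_status(conn, file_id, "processed", now=1212400200)
```

In this sketch time_in is written once at registration, while time_changed is updated with every status change, which matches the column descriptions above.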

Flysafe Grid-Store Catalog

The following three tables implement the Flysafe Grid-Store catalog. Whenever files are archived or copied to the Grid-Storage, this is recorded in these tables.

For Grid-Storage it is preferable to have a limited number of big files instead of many small files. Therefore files are first collected into bigger archives, which are then stored as archives on the Grid-Storage. This grouping is also recorded in the tables below.
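
The idea of collecting small files into one bigger archive can be sketched as follows. This is only an illustration using Python's standard tarfile module; the function name bundle_files is hypothetical, and the tool and archive format actually used by Flysafe are not stated on this page (the filetype examples below only suggest that tar archives occur).

```python
import os
import tarfile


def bundle_files(paths, archive_path):
    """Pack several small files into one tar archive.

    Returns the member names in the order they were added, so that the
    caller can record which file ended up in which archive.
    """
    members = []
    with tarfile.open(archive_path, "w") as tar:
        for path in paths:
            # Store each file under its base name only (an assumption;
            # the original directory is kept separately in the catalog).
            name = os.path.basename(path)
            tar.add(path, arcname=name)
            members.append(name)
    return members
```

The returned member list is the kind of information a catalog would need in order to find a single file again inside a stored archive.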

FILE_CATALOG

Every file to be stored gets an entry in the FILE_CATALOG table.

id ID of the file
filename Unique filename
filetype Type of file (MPR, tar, etc.)
path Original path of the file on the Flysafe server


ARCHIVE_FILE

FILE_REPLICA

The FILE_REPLICA table contains the actual storage location of each stored file. A file can have multiple replicas.

file_id Reference to the file's id in FILE_CATALOG
replica_id Replica number, e.g. 1 for the first replica, 2 for the second, etc.
surl SURL, the storage location of the replica
stored Timestamp of when the replica was stored
adler32 Adler-32 checksum of the file
verified Boolean indicator
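
The adler32 column could be filled and checked along the following lines. This is a minimal sketch using Python's standard zlib module; the function names are hypothetical, and storing the checksum as an unsigned 32-bit integer is an assumption, since the page does not say how the value is encoded or how the verified indicator is set.

```python
import zlib


def adler32_of_file(path, chunk_size=1 << 20):
    """Compute the Adler-32 checksum of a file without loading it whole."""
    checksum = 1  # Adler-32 starts from 1, which is also zlib's default
    with open(path, "rb") as fh:
        while True:
            chunk = fh.read(chunk_size)
            if not chunk:
                break
            # Feed the running value back in to continue the checksum.
            checksum = zlib.adler32(chunk, checksum)
    return checksum & 0xFFFFFFFF


def replica_matches(path, recorded_adler32):
    """Compare a local copy of a replica against the recorded checksum."""
    return adler32_of_file(path) == recorded_adler32
```

A check like replica_matches, run on a copy retrieved from the surl, is one plausible way to decide the value of the verified column.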