
File Processing and Quality Control

Title: U.S. crude oil, natural gas, and natural gas liquids reserves [electronic resource]

Original diskette files: All diskettes were mounted on the 5 1/4" drive of a Windows 98 PC, and the diskette files were copied to the PC hard drive. To preserve the original file content, all files were then transferred to a UNIX server in binary mode. The transferred files retain their original time and date stamps.
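The transfer tool used for this step is not named in this record. As a minimal sketch of the same idea, the Python below copies a file byte for byte, keeps its modification time, and then checks both. The function names copy_preserving and sha256_of are hypothetical and are not part of the original workflow.

```python
import hashlib
import shutil
from pathlib import Path


def sha256_of(path: Path) -> str:
    """Return the SHA-256 hex digest of a file, read as raw bytes."""
    return hashlib.sha256(path.read_bytes()).hexdigest()


def copy_preserving(src: Path, dst: Path) -> bool:
    """Copy src to dst without altering content, keeping the time stamp.

    Returns True when the copy has the same digest and the same
    whole-second modification time as the source.
    """
    # copy2 copies the data in binary and also tries to copy file metadata,
    # including the modification time.
    shutil.copy2(src, dst)
    same_bytes = sha256_of(src) == sha256_of(dst)
    same_mtime = int(src.stat().st_mtime) == int(dst.stat().st_mtime)
    return same_bytes and same_mtime
```

Comparing digests rather than file sizes catches any silent line-ending conversion, which is the usual risk when DOS files are moved to UNIX in text mode.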

Processed documentation file: The original DOS ASCII documentation FILES.DOC was examined with Perl "cio" (check-it-out) to identify extraneous control and high ASCII characters. Nulls (octal 0) and the substitute control code (octal 32) were replaced with a blank using Perl "fix". DOS carriage return characters (octal 15) were removed with UNIX "tr". The "fixed" file was saved with a .TXT file extension.
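The Perl "cio" and "fix" tools are not reproduced in this record. The sketch below is a Python approximation of the three steps described above, not the original tools: it counts suspect bytes, replaces NUL (octal 0) and SUB (octal 32) with a blank, and drops carriage returns (octal 15). The function names and the choice to treat tab, line feed, and carriage return as expected characters are assumptions.

```python
from collections import Counter


def scan_suspect_bytes(data: bytes) -> Counter:
    """Count control characters and high-ASCII bytes.

    Tab, line feed, and carriage return are treated as expected
    (an assumption); DEL (octal 177) and everything above it is flagged.
    """
    expected = {0o11, 0o12, 0o15}
    return Counter(
        b for b in data if (b < 0o40 and b not in expected) or b >= 0o177
    )


def clean_doc_bytes(data: bytes) -> bytes:
    """Replace NUL and SUB with a blank and remove carriage returns."""
    # Map octal 0 and octal 32 to a space; leave every other byte unchanged.
    table = bytes.maketrans(b"\x00\x1a", b"  ")
    # The second argument lists bytes to delete before the table is applied.
    return data.translate(table, b"\r")
```

Running the scan again on the cleaned bytes gives a quick check that no NUL or SUB characters remain before the file is saved with the .TXT extension.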

Processed data files: The original rectangular, non-delimited DOS ASCII data files were translated to ASCII comma-separated value (ASCII CSV) format with Windows DBMS/COPY, using the specifications in the original documentation. The DBMS/COPY logs were checked for translation errors. Variable labels were added to the ASCII CSV output files. No other checks were made on the translated output files. The input and output DBMS/COPY data dictionary statements follow:

INPUT DATA DICTIONARY:
dictionary
extension=DAT
missing=
numeric=n
fixed=y
dictionary=dct
date=mm/dd/yyyy
variables
     1     32 c AREA NAME
    33      4 c HYDROCARBON TYPE CODE
    37      5 R YEAR
    42      7 c PUBLICATION CODE
    49     11 c AREA CODE
    60      8 R BEGINNING OF YEAR RESERVES
    68      8 R NET ADJUSTMENTS
    76      8 R REVISION INCREASES
    84      8 R REVISION DECREASES
    92      8 R EXTENSIONS
   100      8 R NEW FIELD DISCOVERIES
   108      8 R NEW RESERVOIRS IN OLD FIELDS
   116      8 R PRODUCTION
   124      8 R TOTAL END OF YEAR RESERVES
endvars

OUTPUT DATA DICTIONARY:
dictionary
parmfile=ascii
variables
     1     32 c AREA NAME
    34      4 c HYDROCARBON TYPE CODE
    39      5 R YEAR
    45      7 c PUBLICATION CODE
    53     11 c AREA CODE
    65      8 R BEGINNING OF YEAR RESERVES
    74      8 R NET ADJUSTMENTS
    83      8 R REVISION INCREASES
    92      8 R REVISION DECREASES
   101      8 R EXTENSIONS
   110      8 R NEW FIELD DISCOVERIES
   119      8 R NEW RESERVOIRS IN OLD FIELDS
   128      8 R PRODUCTION
   137      8 R TOTAL END OF YEAR RESERVES
endvars
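To illustrate how the input dictionary maps onto the CSV output, the sketch below slices a fixed-width record at the start columns and widths listed above and writes the fields as CSV with a header row of variable labels. It is a Python approximation, not the DBMS/COPY translation itself. The function names are hypothetical, every field is handled as stripped text, and the c and R type codes are not interpreted.

```python
import csv
import io

# (start column, width, label) from the input data dictionary; columns are 1-based.
INPUT_FIELDS = [
    (1, 32, "AREA NAME"),
    (33, 4, "HYDROCARBON TYPE CODE"),
    (37, 5, "YEAR"),
    (42, 7, "PUBLICATION CODE"),
    (49, 11, "AREA CODE"),
    (60, 8, "BEGINNING OF YEAR RESERVES"),
    (68, 8, "NET ADJUSTMENTS"),
    (76, 8, "REVISION INCREASES"),
    (84, 8, "REVISION DECREASES"),
    (92, 8, "EXTENSIONS"),
    (100, 8, "NEW FIELD DISCOVERIES"),
    (108, 8, "NEW RESERVOIRS IN OLD FIELDS"),
    (116, 8, "PRODUCTION"),
    (124, 8, "TOTAL END OF YEAR RESERVES"),
]


def split_record(line: str) -> list[str]:
    """Slice one fixed-width record into stripped field values."""
    return [line[s - 1:s - 1 + w].strip() for s, w, _ in INPUT_FIELDS]


def fixed_to_csv(lines) -> str:
    """Return CSV text: a header row of labels, then one row per record."""
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow([label for _, _, label in INPUT_FIELDS])
    for line in lines:
        if line.strip():  # skip blank lines
            writer.writerow(split_record(line.rstrip("\r\n")))
    return out.getvalue()


def output_starts() -> list[int]:
    """Shift each input start column by one position per preceding field."""
    return [s + i for i, (s, _, _) in enumerate(INPUT_FIELDS)]
```

The output_starts helper reproduces the start columns in the output dictionary (1, 34, 39, 45, and so on through 137), which is consistent with one delimiter character being placed between adjacent fields.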

 


Official Web Page of the University of California, San Diego
© Copyright 2000, UCSD, All Rights Reserved. This site may not be reproduced.
Social Sciences & Humanities Library, 9500 Gilman Drive, La Jolla, CA 92093, 858-534-3336