Roger Sage CDL / MELVYL SSH Library Home
SSHL Home Data, Gov't & GIS Home
FAQ  Downloading Tips  Glossary  Processing and Quality Control 
Data Migration Project Home

File Processing and Quality Control

Title: Annual energy review database [electronic resource]

Original diskette files: All diskettes were mounted on a 5 1/4" drive of a Windows 98 PC and the diskette files were copied to the PC hard drive. To preserve the original file content, all files were transferred to a UNIX server in binary format and retain their original time and date stamps.

Processed documentation files: The original ASCII documentation was examined with Perl "cio" (check-it-out) to identify extraneous control and high ASCII characters. Form feeds (octal 14) and the substitute control code (octal 32) were replaced with a blank using Perl "fix". DOS ASCII carriage return characters (octal 15) were removed with UNIX "tr". The "fixed" documentation files were saved with .TXT file extensions.

Processed data files: The original ASCII data files were examined with Perl "cio" (check-it-out) to identify extraneous control and high ASCII characters. Form feeds (octal 14) and the substitute control code (octal 32) were replaced with a blank using Perl "fix". DOS ASCII carriage returns (octal 15) were removed with UNIX "tr". The "fixed" data files were saved with .CSV data file extensions.

You can view the "cio" output for the 1990 and 1991 original diskette files.


 

ROGER | Sage | CDL/MELVYL | UCSD Home | UCSD Libraries Home

Official Web Page of the University of California, San Diego
© Copyright 2000, UCSD, All Rights Reserved. This site may not be reproduced.
Social Sciences & Humanities Library, 9500 Gilman Drive, La Jolla, CA 92093, 858-534-3336
Email SSDC Webmaster