
File Processing and Quality Control

Title: Health data on older Americans, United States, 1992 [electronic resource] / National Center for Health Statistics

Original diskette files: All diskettes were mounted on a 5 1/4" drive of a Windows 98 PC and the diskette files were copied to the PC hard drive. To preserve the original file content, all files were transferred to a UNIX server in binary format and retain their original time and date stamps.
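
The idea of an unaltered copy that keeps its time stamp can be sketched in Python as below. This is only an illustration on a local file system, not the transfer procedure actually used between the PC and the UNIX server; the helper names `sha256_of` and `copy_preserving`, the file names, and the sample bytes are hypothetical.

```python
import hashlib
import os
import shutil
import tempfile


def sha256_of(path: str) -> str:
    """Return the SHA-256 digest of a file read in binary mode."""
    with open(path, "rb") as fh:
        return hashlib.sha256(fh.read()).hexdigest()


def copy_preserving(src: str, dst: str) -> bool:
    """Copy bytes unchanged, keep the modification time, and verify both."""
    shutil.copy2(src, dst)  # copy2 copies file data plus metadata such as mtime
    same_bytes = sha256_of(src) == sha256_of(dst)
    same_mtime = int(os.stat(src).st_mtime) == int(os.stat(dst).st_mtime)
    return same_bytes and same_mtime


# Demo on a throwaway file (hypothetical name and content).
with tempfile.TemporaryDirectory() as tmp:
    src = os.path.join(tmp, "ORIGINAL.TXT")
    dst = os.path.join(tmp, "COPY.TXT")
    with open(src, "wb") as fh:
        fh.write(b"DOS text\r\n\x1a")
    os.utime(src, (725846400, 725846400))  # arbitrary old time stamp for the demo
    ok = copy_preserving(src, dst)
    print(ok)
```

Comparing checksums is one common way to confirm that no byte, including DOS line endings and control characters, was changed in transit.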

Compressed files: The DOS self-extracting compressed files were extracted in Windows 98. Windows versions built on the NT kernel (rather than on DOS) fail to extract these files because of memory allocation errors. The decompressed DOS files were transferred to a UNIX server in binary format and retain their original time and date stamps.

Processed documentation files: The original DOS ASCII documentation file INSTRUCT.TXT was examined with Perl "cio" (check-it-out) to identify extraneous control and high ASCII characters. A substitute control code (octal 32) was replaced with a blank using Perl "fix". DOS carriage return characters (octal 15) were removed with UNIX "tr". The "fixed" file was saved with a .TXT file extension.
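
The cleanup steps described above can be sketched in Python as follows. This is an illustration only, not the Perl "cio" and "fix" scripts or the UNIX "tr" command actually used; the function names `report_odd_bytes` and `fix_dos_text` and the sample bytes are hypothetical.

```python
from collections import Counter

SUB = 0o32  # substitute control code (Ctrl-Z)
CR = 0o15   # DOS carriage return
ALLOWED_CONTROLS = {0o11, 0o12, 0o15}  # tab, line feed, carriage return


def report_odd_bytes(data: bytes) -> Counter:
    """Count control bytes (other than tab, LF, CR), DEL, and high-ASCII bytes."""
    return Counter(
        b for b in data
        if (b < 0o40 and b not in ALLOWED_CONTROLS) or b >= 0o177
    )


def fix_dos_text(data: bytes) -> bytes:
    """Replace each substitute byte with a blank, then drop carriage returns."""
    return data.replace(bytes([SUB]), b" ").replace(bytes([CR]), b"")


sample = b"Line one\r\nLine two\r\n\x1a"
print(report_odd_bytes(sample))  # which unexpected bytes occur, and how often
print(fix_dos_text(sample))
```

Working on bytes rather than decoded text avoids guessing a character encoding for the high-ASCII characters.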

The decompressed DOS Lotus 123 table-finding guide was converted with Microsoft Excel version 2002. The guide was reformatted, and the spreadsheet table finder was extracted and converted to ASCII text format. This file was saved with a .TXT file extension.

Processed data tables: The Lotus 123 spreadsheet tables are not in a commonly accepted format for data files, that is, one record of variable names or labels followed by n records of data values. The authors of these data used Lotus to display the data tables. The tables were translated to ASCII CSV format with Microsoft Excel version 2002 so that they can be opened in other spreadsheet packages. The translated cell values were not compared against the originals. These data tables are also available in print and PDF formats.
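
For readers who want to see how a CSV file compares with the conventional layout mentioned above (one record of names followed by n data records), a small check might look like the sketch below. It was not part of the processing described here; the function name `check_csv_shape` and the sample rows are hypothetical.

```python
import csv
import io


def check_csv_shape(text: str):
    """Return (header, number of data records, row numbers with a wrong field count)."""
    rows = list(csv.reader(io.StringIO(text)))
    if not rows:
        return [], 0, []
    header, records = rows[0], rows[1:]
    ragged = [
        row_no for row_no, rec in enumerate(records, start=2)  # header is row 1
        if len(rec) != len(header)
    ]
    return header, len(records), ragged


# Made-up sample: the second data record is missing a field.
sample = "age_group,males,females\n65-74,10,12\n75-84,7\n"
header, n, ragged = check_csv_shape(sample)
print(header, n, ragged)
```

Tables laid out for display, with titles, notes, and spanning headings, would typically show many rows whose field count differs from the first row.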


 


Official Web Page of the University of California, San Diego
© Copyright 2000, UCSD, All Rights Reserved. This site may not be reproduced.
Social Sciences & Humanities Library, 9500 Gilman Drive, La Jolla, CA 92093, 858-534-3336