Roger Sage CDL / MELVYL SSH Library Home
SSHL Home Data, Gov't & GIS Home
Social Science Data Collection

Tools

Selected tools for the data librarian. These tools were also used in the Data Migration Project.

    cio (check it out)
    A Perl script that reports information about plain text files. It notes the presence of non-ASCII characters, reports the minimum and maximum record lengths, the number of records of each record length, etc., and prints the first 70 columns of the first 5 and last 5 records.

    fix
    A Perl script that cleans up plain text files. It replaces bad characters (high and control characters) with spaces, strips leading and trailing blanks from lines, etc. Its behavior is configurable.
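
As a rough illustration of the checks and clean-up steps described for cio and fix, here is a minimal Python sketch. It is not the Perl source of either script, which is not reproduced on this page. The function names `report` and `clean`, their parameters, and the choice to treat everything outside printable ASCII (codes 32 to 126, so tabs count as control characters) as "bad" are assumptions made for the example.

```python
from collections import Counter


def report(lines, width=70, edge=5):
    """Summarize records along the lines of the cio description (hypothetical helper)."""
    records = [line.rstrip("\r\n") for line in lines]
    lengths = Counter(len(r) for r in records)
    # Count records containing at least one character above the ASCII range.
    non_ascii = sum(1 for r in records if any(ord(c) > 127 for c in r))
    return {
        "records": len(records),
        "min_length": min(lengths) if lengths else 0,
        "max_length": max(lengths) if lengths else 0,
        "length_counts": dict(sorted(lengths.items())),
        "records_with_non_ascii": non_ascii,
        # First `width` columns of the first and last `edge` records.
        "head": [r[:width] for r in records[:edge]],
        "tail": [r[:width] for r in records[-edge:]],
    }


def clean(line, strip_blanks=True):
    """Clean one record along the lines of the fix description (hypothetical helper)."""
    body = line.rstrip("\r\n")
    # Assumption: anything outside printable ASCII is replaced with a space.
    body = "".join(c if 32 <= ord(c) <= 126 else " " for c in body)
    return body.strip(" ") if strip_blanks else body


if __name__ == "__main__":
    sample = ["  AB\x01C  \n", "caf\xe9\n", "12345\n"]
    print(report(sample))
    print([clean(line) for line in sample])
```

When reading a real file for this kind of check, opening it with `encoding="latin-1"` maps each byte to exactly one character, so record lengths match byte counts and high bytes show up as characters above 127.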

Related Web Sites:

    IASSIST tools page
    This file archive for IASSIST members is intended to encourage the sharing of utilities for managing, manipulating and analyzing machine readable social sciences data files. This archive is intended for compiled programs, source code, and scripts, as well as other documentation about the management, manipulation, and use of machine readable social sciences data files. These files are provided as a courtesy of the authors, and no warranty is implied on behalf of IASSIST.


 


Official Web Page of the University of California, San Diego
© Copyright 2000, UCSD, All Rights Reserved. This site may not be reproduced.
Social Sciences & Humanities Library, 9500 Gilman Drive, La Jolla, CA 92093, 858-534-3336