Selected tools for the data librarian. These tools were also used in
the Data Migration Project.
cio (check it out)
A Perl script that reports
information about plain text files: notes presence of Non-ASCII characters.
Reports minimum and maximum record lengths,
number of records of each record length, etc.
Prints first 70 columns of first 5 and last 5 records.
fix
A Perl script that cleans up plain text
files. Replaces bad ASCII chars (high and control chars) with spaces,
strips leading and trailing blanks from lines, etc. Configurable.
Related Web Sites:
IASSIST tools page
This file archive for IASSIST members is intended to encourage
the sharing of utilities for managing, manipulating and analyzing
machine readable social sciences data files. This archive is intended
for compiled programs, source code, and scripts, as well as other
documentation about the management, manipulation, and use of machine
readable social sciences data files. These files are provided as a
courtesy of the authors, and no warranty is implied on behalf of
IASSIST.
|