inls 461
Information Tools

Professor: Serena Fenton
School of Information and Library Science at UNC-Chapel Hill

Unix, Linux, *nix
Week 1 - Fundamentals Week 2 - Sorting, Filtering and Wildcards Week 3 - Grep

Unix - Week 1 - Fundamentals

Due on Friday:

  • Reading and labs only this week
  • no assignment; no quiz

Required Readings:

Textbook: Unix: Visual QuickStart Guide, 2nd edition (2003)

  • Chapter 1 - Getting Started with Unix
  • Chapter 2 - Using Directories and Files
  • Chapter 4 - Creating and Editing Files
  • Chapter 6 - Manipulating Files
  • Skim Chapter 5 - Controlling Ownership & Permissions <- we'll use this for html

Recommended Resources and Readings:

Lab: SSH Install Note: this lab is essential prior to beginning the next labs

Download SSH Secure Shell and install it on you home computer
You will use this software for the next 5 weeks of class

Video presentation of install
Notes for Video Installation (pdf)
Lab: Command Line text editing Using Pico
Lab: Some basic UNIX moves Getting Comfortable with UNIX
  Quick troubleshooting

Unix - Week 2 - Sorting, Filtering and Wildcards

Due on Friday:

  • Assignment 6: UNIX-part 1

Required Readings and Labs:

textbook: Unix: Visual QuickStart Guide, 2nd edition (2003)

  • Chapter 7 - Getting Information About the System
  • Skim Chapter 8 - Configuring Your Unix Environment <- for more advanced users
  • Chapter 10 - Writing Basic Scripts
  • Filters, Piping and Wildcards from Sams Teach Yourself UNIX in 24 Hours Second Edition by Dave Taylor, James C. Armstrong Jr.
    note: this book is availalbe through UNC Library's Safari Bookshelf
  • File System Navigation Techniques from Sams Teach Yourself Unix in 10 Minutes by Robert Shimonski; Second Edition;2005;isbn 0672327643 (pdf)
UNIX – working with large files - tutorial and lab
  • Intro to UNIX tools - part 1 (17:49)
  • Intro to UNIX tools - part 2 (14:20)
  • Intro to UNIX tools - part 3 -wc, split, sort, diff (14:48)

powerpoint slides from the above tutorials (print them and follow along)

Lab- using find and wc Lab and reference on find and wc

Unix- Week 3 - Grep

Due on Friday:

  • Assignment 6: UNIX-part 2
  • Quiz 7: UNIX

Required Readings and Labs:

  • Grep Overview
  • re-read Chapter 8 - Manipulating Files from textbook: UNIX: Visual QuickStart Guide
  • Review the readings from previous weeks.
Using Grep - tutorial and lab

Grep - part 1 (22:37)

Grep - part 2 (8:49)

powerpoint slides for Grep tutorials (print them and follow along)

Resources:

Single sheet data sets
Download from the web and upload to your isis account to practice grep searches. These data sets are provided by the American Statistical Organization

  • Specifications are given for 93 new car models for the 1993 year.
  • Senate Vote on Corporate Average Fuel Economy (CAFE) Standard
    • cafe.txt - description and data information
    • cafe.dat - data file - save to isis and search
  • Information about each of the winners for each of the 77 annual Oscar awards.
  • more data sets

Large data sets
UCI KDD is an online repository of large data sets which encompasses a wide variety of data types, analysis tasks, and application areas. Summary of Data Sets by Data Type

  • entree_data.tar.gz (note: tar.gz is a UNIX compression) tar gunzip
    unzip on isis using the following commands:
    • gunzip entree_data.tar.gz
    • tar -xvf entree_data.tar

Gnu documentation for grep


 


revised June 8, 2006