Does sensors report the memory and the voltages/temperature correctly?
No alarms shown?
PASS
Are there any error or warning messages on bootup? Any errors
or warnings in /var/log/messages?
In /var/log/messages: There is a suggestion that etherwake might
have been updated kernel: ether-wake uses obsolete (PF_INET,SOCK_PACKET) This can be ignored for now.
There is no newer version of ether-wake that I know of.
Also, I found: modprobe: modprobe: Can't locate module
net-pf-10 I think this might be network packet filtering. It would be nice to know how to fix this.
The addition of the corresponding module in
the kernel should fix it. There was no clear indication of what it was in the
kernel config. I'll attempt enabling network packet filtering as a module next
time around.
The error messages about the missing random
number generator are OK.
Does the /data directory have reduced # of inodes and reserved space for
root?
PASS
Does X run properly using startx as root?
PASS
Do gcc and ddd work correctly?
PASS
Does networking run properly using dhcp? Is it running full duplex
100baseT?
PASS
Does the system shut down and power off with shutdown -h now?
PASS
Does the system reboot with shutdown -r now
PASS
After power-down, if AC power is cycled, does system remain off?
PASS
If a running system is unplugged, then plugged in, does it remain off?
PASS
Can that system now be powered on from another machine using etherwake?
Does it fsck and boot up correctly?
PASS
When plugged into the UPS with a serial line, does the UPS properly
shut down the machine when its battery gets low?
If power to the UPS is then restored, does the node remain turned
off? PASS
Is NTP running correctly?
PASS
Is the hardware clock synchronized with the software clock after the software
clock has had its time synched with an ntp server?
PASS
Is the hardware clock synchronized with the software clock by a crontab
file?
PASS
Is there a running script that calls vga_screenoff 10 minutes after keyboard
input stops, and then calls vga_screenon when input starts again?
PASS
Does running drag 1.2 show > 290 Mflops provided the screen is blanked
by the above?
PASS
Does hdparm -tT /dev/hda report good speeds (~120 and ~28 MB/s)?
PASS
Does /hdparm /dev/hdd report jazzed up parameters for CDROM?
PASS
Is automount on the slave configured so that cd /mnt/floppy and cd /mnt/cdrom
work correctly if a floppy or cd are present?
PASS. Does this work with both ext2 and msdos floppies?
PASS
Should we install fftw, lam, and mpich on each machine (just to have the
libraries local on each machine to cut down network use)?
PASS - in /ldcg
Does man -k work?
PASS
Is there a decent /root/.netscape file with some
sensible bookmarks?
FAIL
Does /proc/fans/ exist and contain sensible information?
PASS
Are big files properly supported? Does the cp /etc/termcap a; cat a a a
a a > b; cat b b b b b > c; etc allow creation of files > 2 GB?
PASS using bash2
Here are a set of tests that the cloned node should pass when plugged into
the master via a switch
Correctly gets its identity (eg, n012) using dhcp from the master on bootup.
PASS
Does automount work?
On the master, does cd /net/s012 work properly (with whatever the correct
slave node # is).
PASS Does cd /net/m001 on the slave properly mount the master?
PASS Does cp file /net/s012/ work correctly?
PASS
When connected to the master via a switch, does an mpi job run properly?
pending
Can root on the master log into a node with just rsh s012 (no password
needed)?
PASS
Does rsh work correctly, eg does rsh s012 uptime correctly return the uptime
on s012 from m001
PASS
Do the above automount and rsh tests work correctly between two slave nodes?
PASS
Are warning messages from the logging daemon correctly logged to the master?
pending
we are not taking this approach anymore.
rather, a script will be written to pick up logs on demand (and/or cron'd)
Any opinions, findings, and conclusions or recommendations
expressed in this material are those of the author(s) and do not necessarily
reflect the views of the National Science Foundation.