Here are a set of tests that the cloned node should pass:
THIS IS THE TEST REPORT FOR CLONE CD VERSION 1.2
Does sensors report the memory and the voltages/temperature correctly?
No alarms shown?
PASS
Are there any error or warning messages on bootup? Any errors or
warnings in /var/log/messages
In /var/log/messages: Jun 19 01:38:09 localhost xfs: ignoring font
path element /usr/share/fonts/default/TrueType (unreadable)
In /var/log/dmesg: CPU serial number disabled. This is enabled in bios,but probably disabled
in kernel config. It should be turned on in the kernel config: will
make it easier to keep track of CPU serial numbers etc.
i810_rng: RNG not detected (may be because
my machine doesn't have needed hardware; Kevin says absent on microATX
board)
Does the /data directory have reduced # of inodes and reserved space for
root?
PASS
Do /boot, /lib/modules contain vestiges of old kernels, etc?
In /boot, can't we eliminate the bootblocks
boot.b, chain.b,and os2_d.b? Could we rm -f /var/lib/games? in the kickstart
postinstall
Does X run properly using startx as root? This should work on a generic
monitor such as a Viewtronics 771.
Conditional pass. /etc/X11/ should contain a few config files
for standard monitors Flat screen monitors should be tested with
Intel MB from ACE I would prefer seeing windowmaker rather than
Gnome as the default window system for root. Current default for
first root login is Gnome. How abour running wmaker.inst in the post-install
script.
Do gcc and ddd work correctly?
PASS
Does networking run properly using dhcp? Is it running full duplex
100baseT?
PASS
Does the system shut down and power off with shutdown -h now?
PASS
Does the system reboot with shutdown -r now
PASS
After power-down, if AC power is cycled, does system remain off?
PASS
If a running system is unplugged, then plugged in, does it remain off?
PASS
Can that system now be powered on from another machine using etherwake?
Does it fsck and boot up correctly?
I can't test this. It should be tested
for the final cloning CD.
When plugged into the UPS with a serial line, does the UPS properly shut
down the machine when its battery gets low?
If power to the UPS is then restored, does the node remain turned off?
David, when the final cloning CD is ready,
you should test this again.
Are there any files with dates in the future?
I couldn't figure out a find script to look
for this. I did notice a large number of empty files with find / -empty
Is NTP running correctly?
PASS Is the hardware clock synchronized with the software clock after the
software clock has had its time synched with an ntp server?
PASS. We might also want to have this
in a cron job
Is there a running script that calls vga_screenoff 10 minutes after keyboard
input stops, and
Then calls vga_screenon when input starts again?
Not in version 1.2 of the clone cd.
Please add in 1.3.
Does running drag 1.2 show > 290 Mflops provided the screen is blanked
by the above?
FAIL: because of missing screenblank--
currently reports 242 Mflops not 292.
Does /hdparm -tT /dev/hda report good speeds (~120 and ~28 MB/s)?
PASS
Does /hdparm /dev/hdd report jazzed up parameters for CDROM
FAIL: These are set in the cloning scripts,
but not in rc.local. They should be set, then checked with ACE machine
& CDROM.
Is automount on the slave configured so that cd /mnt/floppy and cd /mnt/cdrom
work correctly if a floppy or cd are present?
PASS Does this work with both ext2 and msdos floppies?
Fails on clone CD version 1.2, will be fixed
in next version with alternate mount point for msdos
Should we install fftw, lam, and mpich on each machine (just to have the
libraries local on each machine to cut down network use)?
Not yet installed -- will be done on the next
version
Are big files properly supported? Does the cp /etc/termcap a; cat a a a
a a > b; cat b b b b b > c; etc allow creation of files > 2 GB?
Fails with bash, works with bash2. I wish
there was a better solution.
Does man -k work?
FAIL: run catman in postinstall script
Here are a set of tests that the cloned node should pass when plugged into
the master via a switch
Correctly gets its identity (eg, n012) using dhcp from the master on bootup.
Does automount work?
On the master, does cd /net/n012 work properly (with whatever the correct
node # is).
Does cd /net/n001 on the slave properly mount the master?
Does cp file /net/n012/ work correctly?
When connected to the master via a switch, does an mpi job run properly?
Can root on the master log into a node with just rsh n012 (no password
needed)?
Does rsh work correctly, eg does rsh n012 uptime correctly return the uptime
on n012 from n001
Do the above automount and rsh tests work correctly between two slave nodes?
Are warning messages from the logging daemon correctly logged to the master?
Any opinions, findings, and conclusions or recommendations
expressed in this material are those of the author(s) and do not necessarily
reflect the views of the National Science Foundation.