UWMLSC > Beowulf Systems > Medusa
   Slave tests CD 1.2

Slave tests CD 1.2

Here are a set of tests that the cloned node should pass:

THIS IS THE TEST REPORT FOR CLONE CD VERSION 1.2

  • Does sensors report the memory and the voltages/temperature correctly? No alarms shown?

  • PASS
  • Are there any error or warning messages on bootup? Any errors or warnings in /var/log/messages

  • In /var/log/messages:
    Jun 19 01:38:09 localhost xfs: ignoring font path element /usr/share/fonts/default/TrueType (unreadable)

    In /var/log/dmesg:
    CPU serial number disabled.
    This is enabled in bios,but probably disabled in kernel config. It should be turned on in the kernel config: will make it easier to keep track of CPU serial numbers etc.

    i810_rng: RNG not detected (may be because my machine doesn't have needed hardware; Kevin says absent on microATX board)

  • Does the /data directory have reduced # of inodes and reserved space for root?

  • PASS
  • Do /boot, /lib/modules contain vestiges of old kernels, etc?

  • In /boot, can't we eliminate the bootblocks boot.b, chain.b,and os2_d.b?
    Could we rm -f /var/lib/games? in the kickstart postinstall
  • Does X run properly using startx as root? This should work on a generic monitor such as a Viewtronics 771.

  • Conditional pass.
    /etc/X11/ should contain a few config files for standard monitors
    Flat screen monitors should be tested with Intel MB from ACE
    I would prefer seeing windowmaker rather than Gnome as the default window system for root. Current default for first root login is Gnome. How abour running wmaker.inst in the post-install script.
  • Do gcc and ddd work correctly?

  • PASS
  • Does networking run properly using dhcp? Is it running full duplex 100baseT?

  • PASS
  • Does the system shut down and power off with shutdown -h now?

  • PASS
  • Does the system reboot with shutdown -r now

  • PASS
  • After power-down, if AC power is cycled, does system remain off?

  • PASS
  • If a running system is unplugged, then plugged in, does it remain off?

  • PASS
  • Can that system now be powered on from another machine using etherwake? Does it fsck and boot up correctly?

  • I can't test this. It should be tested for the final cloning CD.
  • When plugged into the UPS with a serial line, does the UPS properly shut down the machine when its battery gets low?

  • If power to the UPS is then restored, does the node remain turned off?
    David, when the final cloning CD is ready, you should test this again.
  • Are there any files with dates in the future?

  • I couldn't figure out a find script to look for this. I did notice a large number of empty files with find / -empty
  • Is NTP running correctly?

  • PASS
    Is the hardware clock synchronized with the software clock after the software clock has had its time synched with an ntp server?
    PASS. We might also want to have this in a cron job
  • Is there a running script that calls vga_screenoff 10 minutes after keyboard input stops, and

  • Then calls vga_screenon when input starts again?
    Not in version 1.2 of the clone cd. Please add in 1.3.
  • Does running drag 1.2 show > 290 Mflops provided the screen is blanked by the above?

  • FAIL: because of missing screenblank-- currently reports 242 Mflops not 292.
  • Does /hdparm -tT /dev/hda report good speeds (~120 and ~28 MB/s)?

  • PASS
  • Does /hdparm /dev/hdd report jazzed up parameters for CDROM

  • FAIL: These are set in the cloning scripts, but not in rc.local. They should be set, then checked with ACE machine & CDROM.
  • Is automount on the slave configured so that cd /mnt/floppy and cd /mnt/cdrom work correctly if a floppy or cd are present?

  • PASS
    Does this work with both ext2 and msdos floppies?
    Fails on clone CD version 1.2, will be fixed in next version with alternate mount point for msdos
  • Should we install fftw, lam, and mpich on each machine (just to have the libraries local on each machine to cut down network use)?

  • Not yet installed -- will be done on the next version
  • Are big files properly supported? Does the cp /etc/termcap a; cat a a a a a > b; cat b b b b b > c; etc allow creation of files > 2 GB?

  • Fails with bash, works with bash2. I wish there was a better solution.
  • Does man -k work?

  • FAIL: run catman in postinstall script

Here are a set of tests that the cloned node should pass when plugged into the master via a switch

  • Correctly gets its identity (eg, n012) using dhcp from the master on bootup.
  • Does automount work?

  • On the master, does cd /net/n012 work properly (with whatever the correct node # is).
    Does cd /net/n001 on the slave properly mount the master?
    Does cp file /net/n012/ work correctly?
  • When connected to the master via a switch, does an mpi job run properly?
  • Can root on the master log into a node with just rsh n012 (no password needed)?
  • Does rsh work correctly, eg does rsh n012 uptime correctly return the uptime on n012 from n001
  • Do the above automount and rsh tests work correctly between two slave nodes?
  • Are warning messages from the logging daemon correctly logged to the master?
Check this page for dead links, sloppy HTML, or a bad style sheet; or strip it for printing.
Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.