NMI Release 2.2.1

Download

nmi-2.2.1.tar.gz

Release Date: 01/26/2007
MD5 checksum: ff63ee14c79d58de384fa8f3d9542c58 nmi-2.2.1.tar.gz
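
The downloaded tarball can be checked against the MD5 checksum above before it is unpacked. The following is a minimal Python sketch, assuming the file has been saved locally; the helper names `md5_of_file` and `verify_download` are illustrative and not part of NMI.

```python
import hashlib

# Checksum published above for nmi-2.2.1.tar.gz.
EXPECTED_MD5 = "ff63ee14c79d58de384fa8f3d9542c58"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected checksum."""
    return md5_of_file(path) == expected.lower()
```

Calling `verify_download("nmi-2.2.1.tar.gz")` returns `True` only when the local file matches the published checksum.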

Release Notes

NOTE: Be sure to read the NMI 2.2.0 Release Notes to understand the changes made since NMI 2.1.8. Most notable are the configuration parameter changes.

This release fixes two major bugs discovered in the past month at our production facility. The bugs were somewhat related and long-standing; they became more prevalent after exponential backoff polling was added to the database logfile monitor.

NMI 2.2.1 is marked as a STABLE release; all users of NMI 2.2.0 are strongly encouraged to upgrade to this latest release. The NMI team will soon be supporting concurrent stable and development releases. More information will be posted in the future.

New Features

  • Add support for Ubuntu in the nmi_platform Hawkeye module.

Bugs Fixed

  • Fixed the log file checksumming feature of the database update script. The bug prevented log files from being re-read properly when platform_jobs were evicted from a resource (either because the machine went down or because the job was put on hold). As a result, the updated status of remote tasks that re-ran after an eviction might not appear in the database. A symptom of this problem was tasks that completed successfully but still had a -9 status in the database. More information can be found in this bug report.
  • Fixed a race condition between the DB update script and the platform_job_prescript script. The race could prevent the platform_job task information from being stored in the database. This bug did not have an adverse effect on the ability of jobs to run, but it produced missing status information and SQL errors in the DB update script’s error file.
  • When a platform_job is evicted from a resource, any remote tasks that were running at the time are now marked with a special -1003 result code instead of -9 (SIGKILL).
  • Corrected a spelling mistake in the URL_PREFIX parameter of the email notification script. The mistake caused emails to contain incomplete URLs to build/test information.
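
Scripts that post-process task results may want to tell an eviction apart from an ordinary kill. The following is a minimal Python sketch that uses only the two result codes named above; the function name and the returned labels are hypothetical and are not taken from the NMI sources.

```python
# Result codes named in these release notes.
EVICTED_RESULT = -1003  # remote task was running when its platform_job was evicted
KILLED_RESULT = -9      # SIGKILL; before this release evicted tasks also got this code


def describe_result(code):
    """Map a task result code to a short label (labels are illustrative)."""
    if code == EVICTED_RESULT:
        return "evicted"
    if code == KILLED_RESULT:
        return "killed"
    # Any other code is outside the scope of this sketch.
    return "other"
```

With such a helper, a report can list evicted tasks separately from tasks that were actually killed.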

Known Bugs

  • None

Requirements

  • NMI Submit/Archive Host
    • Condor >= 6.8.0 or Condor >= 6.9.0
    • Perl >= 5.005 (including DBI and DBD::mysql modules)
    • Apache >= 2.0
    • PHP >= 4.2.3 (i.e., with Session & MySQL support)
  • NMI DB Host
    • MySQL 4.1.20
  • Condor Central Manager Host
    • Condor >= 6.8.0 or Condor >= 6.9.0
  • Build/Test Execution Hosts
    • Condor >= 6.8.0 or Condor >= 6.9.0
    • Perl >= 5.005

Special Feature Requirements

  • For Parallel jobs: Condor >= 6.9.0 on central manager, submit and execute hosts.
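
The version minimums above can be checked by comparing dotted version strings as integer tuples. The following is a minimal Python sketch; the helper names are hypothetical, and how the installed version string is obtained from each host is left out.

```python
def parse_version(text):
    """Turn a dotted version string such as '6.8.0' into a tuple of ints."""
    return tuple(int(part) for part in text.strip().split("."))


def meets_minimum(installed, minimum):
    """Return True if the installed version is at least the minimum.

    Shorter versions are padded with zeros so that '6.9' equals '6.9.0'.
    """
    a, b = parse_version(installed), parse_version(minimum)
    width = max(len(a), len(b))
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a >= b
```

For example, `meets_minimum("6.8.3", "6.9.0")` is false, so a host with that Condor version would not satisfy the parallel job requirement.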