controlMonitor API client

The controlMonitor API client (cmonClient) is a graphical user interface that supports monitoring and control of LDAS system via requests made to the control and Monitor API (cntlmonAPI) server in the LDAS system; request are of the types:

How to install cmonClient

General Operations

  1. Connection to cntlmonAPI server
  2. Site information maintanence
  3. Operation of special widgets
  4. Graph operations

Navigation through Pages

Specific functions of the client are accessed by hitting the Tab button on the top of each Page. Functions are described here in the order of left to right as they appear in cmonClient. It is highly recommended to bring up cmonClient for reference while going through this help session.
  1. LDAS Control
    1. LDAS API control - add, shutdown, restart individual APIs
    2. LDAS System control - start or stop LDAS
    3. LDAS Resource control - modify API resources
    4. LDAS Utilities control - restart log, database monitors, compare databases etc.
    5. LDAS User Administration - maintain user accounts
    6. Block DSO - block DSOs
    7. LDAS Core File Management - examine, delete core files
  2. Status
    1. API status - API run status e.g. memory, cpu, data objects
    2. NTP Server Diagnostics - monitor API network time synchronization
    3. API state summary - summary of running and queued jobs
  3. System Load
    1. LDAS Non beowulf servers - display unix ps information for non-beowulf servers
    2. LDAS beowulf nodes - display unix ps information for beowulf and nodes
  4. APIs
    1. API System Resources -
    2. - plot of API memory, cpu usage vs time

    3. Time Metrics
    4. - plot of time spent by jobs in API vs time

  5. Job Information and control
    1. Job Statistics - plot of jobs passed/failed vs time
    2. Database Statistics - plot of insertions/queries vs time
    3. Queue Statistics - plot of #jobs in queue vs time
    4. Active Job Control - view job details, abort jobs
    5. Job Output Directory - view output files in job directories
    6. Usage Details - breakdown of jobs counts by command and user
  6. LDAS Log Filter - filter LDAS logs by jobId or by time
  7. MPI Jobs
    1. MPI Jobs control - view MPI job details, abort MPI jobs
    2. Load Summary - plot of load summary on beowulf nodes
    3. Node Usage Statistics - plot of number of nodes used vs time
  8. Beowulf System
    1. Node Balance and Login - add or delete beowulf nodes, login to nodes
    2. MPI Users Queue - display the MPI users queue
    3. Verify Beowulf Users (wrapperAPI processes) - display list of wrapper jobs
    4. Verify all Beowulf Users and nodes are ready for ldas jobs - PASS/FAIL test on all nodes and beowulf users
    5. Monitor Node Usage by groups - display unix ps information for group of nodes
  9. LDAS Test - run sample LDAS user commands to validate system
  10. View Database - plots of database numerical column data vs time inserted
  11. Cache View - graph of frame times in diskcacheAPI
  12. Directories - display directory tree for diskcacheAPI mount points


Design

  1. Client and Server Model

  2. Client States